title: data processing
date: 2020-05-23

This post is a roundup of open source scientific data processing projects that I know of.
I've organized them alphabetically.
I have some minimal notes on a few projects here, but mostly this is just a "roundup" page.
Obviously this only scratches the surface, but I've tried to put in just the ones that I can imagine caring about.

# table of contents

[TOC]

# [GCPy](https://github.com/geoschem/gcpy)

GCPy is a Python-based toolkit containing useful functions for working specifically with the GEOS-Chem model of atmospheric chemistry and composition.

GCPy aims to build on the well-established scientific Python technical stack, leveraging tools like cartopy and xarray to simplify the task of working with model output and performing atmospheric chemistry analyses.

# KOALA

[website](http://www.bristoldynamics.com/resources/)

[publication](https://doi.org/10.1063/1.4884516)

Here, we present a new program for decomposing mixed transient spectra into their individual component spectra and extracting the corresponding kinetic traces: KOALA (Kinetics Observed After Light Absorption).
The software combines spectral target analysis with brute-force linear least squares fitting, which is computationally efficient because of the small nonlinear parameter space of most spectral features.

# pycroscopy

[website](https://pycroscopy.github.io/pycroscopy/about.html)

[source](https://github.com/pycroscopy/pycroscopy)

pycroscopy is a python package for image processing and scientific analysis of imaging modalities such as multi-frequency scanning probe microscopy, scanning tunneling spectroscopy, x-ray diffraction microscopy, and transmission electron microscopy.
pycroscopy uses a data-centric model wherein the raw data collected from the microscope, results from analysis and processing routines are all written to standardized hierarchical data format (HDF5) files for traceability, reproducibility, and provenance.

# PyTrA

[website](https://sourceforge.net/projects/pytra/)

[source](https://github.com/nzjakemartin/PyTrA)

Analysis of ultra-fast transient absorption spectra.

# pyUSID

[website](https://pycroscopy.github.io/pyUSID/about.html)

Python framework for storing, visualizing, and processing spectroscopy, imaging or any observational / experimental data.

The Universal Spectroscopic and Imaging Data (USID) model.

# [Skultrafast](https://github.com/Tillsten/skultrafast)

Skultrafast stands for scikit.ultrafast and is an python package which aims to include everything needed to analyze data from time-resolved spectroscopy experiments in the femtosecond domain.