Quickstart

TidyMS [1] is a Python package that provides tools to process and analyze Mass Spectrometry (MS) data. Although suited for general use, it was designed to be used with datasets from LC-HRMS metabolomics experiments. It uses Numpy, Pandas and scikit-learn for data processing and analysis. Some of the functionality that offers is:

  • read raw data in the mzML format using tidyms.MSData class, optimized for speed and low memory usage.

  • Creation of chromatograms and accumulated spectra from raw data.

  • Feature detection and feature correspondence in metabolomics datasets using the tidyms.Assay class.

  • Read processed data from other mass spectrometry processing software (XCMS, mzmine2, etc…).

  • A container object to manage metabolomics data.

  • Data curation of untargeted metabolomics data sets using widely accepted practices from the metabolomics community [2] [3]

  • Interactive data visualization using bokeh, or publication quality plots using seaborn.

In the rest of this guide, you can find links for different use cases for the TidyMS package. A basic knowledge of MS and metabolomics is assumed, but you can look up in the Definitions the concepts used in the guides. Installation instructions are available here.

You can refer to the following guides to learn about specific topics:

References