Abstract
Infrared ion spectroscopy (IRIS) continues to see increasing use as an analytical tool for small-molecule identification in conjunction with mass spectrometry (MS). The IR spectrum of an m/z selected population of ions constitutes a unique fingerprint that is specific to the molecular structure. However, direct translation of an IR spectrum to a molecular structure remains challenging, as reference libraries of IR spectra of molecular ions largely do not exist. Quantum-chemically computed spectra can reliably be used as reference, but the challenge of selecting the candidate structures remains. Here we introduce an in silico library of vibrational spectra of common MS adducts of over 4500 compounds found in the human metabolome database (HMDB). In total, the library currently contains more than 75 000 spectra computed at the DFT level that can be queried with an experimental IR spectrum. Moreover, we introduce a database of 189 experimental IRIS spectra, which is employed to validate the automated spectral matching routines. This demonstrates that 75% of metabolites in the experimental dataset is correctly identified, based solely on their exact m/z and IRIS spectrum. Additionally, we demonstrate an approach for specifically identifying substructures by performing a search without m/z constraints to find structural analogues. Such an unsupervised search paves the way towards the de novo identification of unknowns that are absent in spectral libraries. We apply the in silico spectral library to identify an unknown in a plasma sample as 3-hydroyxhexanoic acid, highlighting the potential of the method.
Supplementary materials
Title
Experimental spectra vs all computed spectra
Description
For each experimental spectrum the 9 best matching computed spectra in the library are shown, without selection on chemical formula.
Actions
Title
Experimental spectra vs all computed tautomers and conformers
Description
For each experimental spectrum the match with all computed conformers and tautomers are shown.
Actions
Title
Experimental spectra vs all computed isomers
Description
For each experimental spectrum the match with all computed spectra of isomers in the library are shown.
Actions
Title
Main supporting information to the manuscript
Description
LC-MS procedure, optimization of the spectral similarity scoring, additional sample information and tables/figures in support of the results.
Actions
Title
Supporting tables
Description
Excel versions of tables in the supporting information
Actions
Supplementary weblinks
Title
Complete dataset
Description
Link to Zenodo repository with all the experimental and computed IR spectra. Also available on the HMDB website
Actions
View