Mass spectra-based framework for automated structural elucidation of metabolome data to explore phytochemical diversity

Front Plant Sci. 2011 Aug 22:2:40. doi: 10.3389/fpls.2011.00040. eCollection 2011.

Abstract

A novel framework for automated elucidation of metabolite structures in liquid chromatography-mass spectrometer metabolome data was constructed by integrating databases. High-resolution tandem mass spectra data automatically acquired from each metabolite signal were used for database searches. Three distinct databases, KNApSAcK, ReSpect, and the PRIMe standard compound database, were employed for the structural elucidation. The outputs were retrieved using the CAS metabolite identifier for identification and putative annotation. A simple metabolite ontology system was also introduced to attain putative characterization of the metabolite signals. The automated method was applied for the metabolome data sets obtained from the rosette leaves of 20 Arabidopsis accessions. Phenotypic variations in novel Arabidopsis metabolites among these accessions could be investigated using this method.

Keywords: database searching; liquid chromatography-mass spectrometry; metabolome analysis; natural variations in secondary metabolite; structural elucidation.