Geschmack ist vorhersagbar: Mit FlavorMiner. FlavorMiner heißt das Tool, das IPB-Chemiker und Partner aus Kolumbien jüngst entwickelt haben. Das Programm kann, basierend auf maschinellem Lernen (KI), anhand der…
Seit Februar 2021 bietet Wolfgang Brandt, ehemaliger Leiter der Arbeitsgruppe Computerchemie am IPB, sein Citizen Science-Projekt zur Pilzbestimmung an. Dafür hat er in regelmäßigen Abständen öffentliche Vorträge zur Vielfalt…
Herrera-Rocha, F.; Fernández-Niño, M.; Duitama, J.; Cala, M. P.; Chica, M. J.; Wessjohann, L. A.; Davari, M. D.; Barrios, A. F. G.;FlavorMiner: a machine learning platform for extracting molecular flavor profiles from structural dataJ. Cheminform.16140(2024)DOI: 10.1186/s13321-024-00935-9
Flavor is the main factor driving consumers acceptance of food products. However, tracking the biochemistry of flavor is a formidable challenge due to the complexity of food composition. Current methodologies for linking individual molecules to flavor in foods and beverages are expensive and time-consuming. Predictive models based on machine learning (ML) are emerging as an alternative to speed up this process. Nonetheless, the optimal approach to predict flavor features of molecules remains elusive. In this work we present FlavorMiner, an ML-based multilabel flavor predictor. FlavorMiner seamlessly integrates different combinations of algorithms and mathematical representations, augmented with class balance strategies to address the inherent class of the input dataset. Notably, Random Forest and K-Nearest Neighbors combined with Extended Connectivity Fingerprint and RDKit molecular descriptors consistently outperform other combinations in most cases. Resampling strategies surpass weight balance methods in mitigating bias associated with class imbalance. FlavorMiner exhibits remarkable accuracy, with an average ROC AUC score of 0.88. This algorithm was used to analyze cocoa metabolomics data, unveiling its profound potential to help extract valuable insights from intricate food metabolomics data. FlavorMiner can be used for flavor mining in any food product, drawing from a diverse training dataset that spans over 934 distinct food products.Scientific Contribution FlavorMiner is an advanced machine learning (ML)-based tool designed to predict molecular flavor features with high accuracy and efficiency, addressing the complexity of food metabolomics. By leveraging robust algorithmic combinations paired with mathematical representations FlavorMiner achieves high predictive performance. Applied to cocoa metabolomics, FlavorMiner demonstrated its capacity to extract meaningful insights, showcasing its versatility for flavor analysis across diverse food products. This study underscores the transformative potential of ML in accelerating flavor biochemistry research, offering a scalable solution for the food and beverage industry.
Publikation
Baky, M. H.; Kamal, I. M.; Wessjohann, L. A.; Farag, M. A.;Assessment of metabolome diversity in black and white pepper in response to autoclaving using MS- and NMR-based metabolomics and in relation to its remote and direct antimicrobial effects against food-borne pathogensRSC Adv.1410799-10813(2024)DOI: 10.1039/d4ra00100a
Piper nigrum L. (black and white peppercorn) is one of the most common culinary spices used worldwide. The current study aims to dissect pepper metabolome using 1H-NMR targeting of its major primary and secondary metabolites. Eighteen metabolites were identified with piperine detected in black and white pepper at 20.2 and 23.9 mg mg−1, respectively. Aroma profiling using HS-SPME coupled to GC-MS analysis and in the context of autoclave treatment led to the detection of a total of 52 volatiles with an abundance of b-caryophyllene at 82% and 59% in black and white pepper, respectively. Autoclaving of black and white pepper revealed improvement of pepper aroma as manifested by an increase in oxygenated compounds\' level. In vitro remote antimicrobial activity against food-borne Gram-positive and Gram-negative bacteria revealed the highest activity against P. aeruginosa (VP-MIC 16.4 and 12.9 mg mL−1) and a direct effect against Enterobacter cloacae at ca. 11.6 mg mL−1 for both white and black pepper.
Publikation
Farag, M. A.; Baky, M. H.; Morgan, I.; Khalifa, M. R.; Rennert, R.; Mohamed, O. G.; El-Sayed, M. M.; Porzel, A.; Wessjohann, L. A.; Ramadan, N. S.;Comparison of Balanites aegyptiaca parts: metabolome providing insights into plant health benefits and valorization purposes as analyzed using multiplex GC-MS, LC-MS, NMR-based metabolomics, and molecular networkingRSC Adv.1321471-21493(2023)DOI: 10.1039/d3ra03141a
Balanites aegyptiaca (L.) Delile (Zygophyllaceae), also known as the desert date, is an edible fruit-producing tree popular for its nutritional and several health benefits. In this study, multi-targeted comparative metabolic profiling and fingerprinting approaches were conducted for the assessment of the nutrient primary and secondary metabolite heterogeneity in different parts, such as leaves, stems, seeds, unripe, and ripe fruits of B. aegyptiaca using nuclear magnetic resonance (NMR), ultra-performance liquid chromatography (UPLC-MS), and gas chromatography mass-spectrometry (GC-MS) based metabolomics coupled to multivariate analyses and in relation to its cytotoxic activities. NMR-based metabolomic study identified and quantified 15 major primary and secondary metabolites belonging to alkaloids, saponins, flavonoids, sugars, and amino and fatty acids. Principal component analysis (PCA) of the NMR dataset revealed α-glucose, sucrose, and isorhamnetin as markers for fruit and stem and unsaturated fatty acids for predominated seeds. Orthogonal projections to latent structure discriminant analysis (OPLS-DA) revealed trigonelline as a major distinctive metabolite in the immature fruit and isorhamnetin as a major distinct marker in the mature fruit. UPLC-MS/MS analysis using feature-based molecular networks revealed diverse chemical classes viz. steroidal saponins, N-containing metabolites, phenolics, fatty acids, and lipids as the constitutive metabolome in Balanites. Gas chromatography-mass spectroscopy (GC-MS) profiling of primary metabolites led to the detection of 135 peaks belonging to sugars, fatty acids/esters, amino acids, nitrogenous, and organic acids. Monosaccharides were detected at much higher levels in ripe fruit and disaccharides in predominate unripe fruits, whereas B. aegyptiaca vegetative parts (leaves and stem) were rich in amino acids and fatty acids. The antidiabetic compounds, viz, nicotinic acid, and trigonelline, were detected in all parts especially unripe fruit in addition to the sugar alcohol D-pinitol for the first time providing novel evidence for B. aegyptiaca use in diabetes. In vitro cytotoxic activity revealed the potential efficacy of immature fruit and seeds as cytotoxic agents against human prostate cancer (PC3) and human colorectal cancer (HCT-116) cell lines. Collectively, such detailed profiling of parts provides novel evidence for B. aegyptiaca medicinal uses.
Publikation
Zulfiqar, M.; Gadelha, L.; Steinbeck, C.; Sorokina, M.; Peters, K.;MAW: the reproducible Metabolome Annotation Workflow for untargeted tandem mass spectrometryJ. Cheminform.1532(2023)DOI: 10.1186/s13321-023-00695-y
Mapping the chemical space of compounds to chemical structures remains a challenge in metabolomics. Despite the advancements in untargeted liquid chromatography-mass spectrometry (LC–MS) to achieve a high-throughput profile of metabolites from complex biological resources, only a small fraction of these metabolites can be annotated with confidence. Many novel computational methods and tools have been developed to enable chemical structure annotation to known and unknown compounds such as in silico generated spectra and molecular networking. Here, we present an automated and reproducible Metabolome Annotation Workflow (MAW) for untargeted metabolomics data to further facilitate and automate the complex annotation by combining tandem mass spectrometry (MS2) input data pre-processing, spectral and compound database matching with computational classification, and in silico annotation. MAW takes the LC-MS2 spectra as input and generates a list of putative candidates from spectral and compound databases. The databases are integrated via the R package Spectra and the metabolite annotation tool SIRIUS as part of the R segment of the workflow (MAW-R). The final candidate selection is performed using the cheminformatics tool RDKit in the Python segment (MAW-Py). Furthermore, each feature is assigned a chemical structure and can be imported to a chemical structure similarity network. MAW is following the FAIR (Findable, Accessible, Interoperable, Reusable) principles and has been made available as the docker images, maw-r and maw-py. The source code and documentation are available on GitHub (https://github.com/zmahnoor14/MAW). The performance of MAW is evaluated on two case studies. MAW can improve candidate ranking by integrating spectral databases with annotation tools like SIRIUS which contributes to an efficient candidate selection procedure. The results from MAW are also reproducible and traceable, compliant with the FAIR guidelines. Taken together, MAW could greatly facilitate automated metabolite characterization in diverse fields such as clinical metabolomics and natural product discovery.