The IPB has once again been recognized for its exemplary actions in terms of equal opportunity-oriented personnel and organizational policies and has received the TOTAL E-QUALITY certification for the…
The Plant Science Student Conference (PSSC) has been organised by students from the two Leibniz institutes, IPK and IPB, every year for the last 20 years. In this interview, Christina Wäsch (IPK) and…
Herrera-Rocha, F.; Fernández-Niño, M.; Duitama, J.; P. Cala, M.; José Chica, M.; A. Wessjohann, L.; D. Davari, M.; Fernando González Barrios, A.;FlavorMiner: A Machine Learning Platform for Extracting Molecular Flavor Profiles from Structural DataChemRxiv(2024)DOI: 10.26434/chemrxiv-2024-821xm
Flavor is the main factor driving consumers acceptance of food products. However, tracking the biochemistry of flavor is a formidable challenge due to the complexity of food composition. Current methodologies for linking individual molecules to flavor in foods and beverages are expensive and time-consuming. Predictive models based on machine learning (ML) are emerging as an alternative to speed up this process. Nonetheless, the optimal approach to predict flavor features of molecules remains elusive. In this work we present FlavorMiner, an ML-based multilabel flavor predictor. FlavorMiner seamlessly integrates different combinations of algorithms and mathematical representations, augmented with class balance strategies to address the inherent class of the input dataset. Notably, Random Forest and K-Nearest Neighbors combined with Extended Connectivity Fingerprint and RDKit molecular descriptors consistently outperform other combinations in most cases. Resampling strategies surpass weight balance methods in mitigating bias associated with class imbalance. FlavorMiner exhibits remarkable accuracy, with an average ROC AUC score of 0.88. This algorithm was used to analyze cocoa metabolomics data, unveiling its profound potential to help extract valuable insights from intricate food metabolomics data. FlavorMiner can be used for flavor mining in any food product, drawing from a diverse training dataset that spans over 934 distinct food products.