Geschmack ist vorhersagbar: Mit FlavorMiner. FlavorMiner heißt das Tool, das IPB-Chemiker und Partner aus Kolumbien jüngst entwickelt haben. Das Programm kann, basierend auf maschinellem Lernen (KI), anhand der…
Seit Februar 2021 bietet Wolfgang Brandt, ehemaliger Leiter der Arbeitsgruppe Computerchemie am IPB, sein Citizen Science-Projekt zur Pilzbestimmung an. Dafür hat er in regelmäßigen Abständen öffentliche Vorträge zur Vielfalt…
Peters, K.; Bradbury, J.; Bergmann, S.; Capuccini, M.; Cascante, M.; de Atauri, P.; Ebbels, T. M. D.; Foguet, C.; Glen, R.; Gonzalez-Beltran, A.; Guenther, U.; Handakas, E.; Hankemeier, T.; Haug, K.; Herman, S.; Holub, P.; Izzo, M.; Jacob, D.; Johnson, D.; Jourdan, F.; Kale, N.; Karaman, I.; Khalili, B.; Emami Khoonsari, P.; Kultima, K.; Lampa, S.; Larsson, A.; Ludwig, C.; Moreno, P.; Neumann, S.; Novella, J. A.; O'Donovan, C.; Pearce, J. T. M.; Peluso, A.; Pireddu, L.; Piras, M. E.; Reed, M. A. C.; Rocca-Serra, P.; Roger, P.; Rosato, A.; Rueedi, R.; Ruttkies, C.; Sadawi, N.; Salek, R.; Sansone, S.-A.; Selivanov, V.; Spjuth, O.; Schober, D.; Thévenot, E. A.; Tomasoni, M.; van Rijswijk, M.; van Vliet, M.; Viant, M. R.; Weber, R. J. M.; Zanetti, G.; Steinbeck, C.;PhenoMeNal: Processing and analysis of Metabolomics data in the CloudbioRxiv(2018)DOI: 10.1101/409151
Background Metabolomics is the comprehensive study of a multitude of small molecules to gain insight into an organism’s metabolism. The research field is dynamic and expanding with applications across biomedical, biotechnological and many other applied biological domains. Its computationally-intensive nature has driven requirements for open data formats, data repositories and data analysis tools. However, the rapid progress has resulted in a mosaic of independent – and sometimes incompatible – analysis methods that are difficult to connect into a useful and complete data analysis solution.Findings The PhenoMeNal (Phenome and Metabolome aNalysis) e-infrastructure provides a complete, workflow-oriented, interoperable metabolomics data analysis solution for a modern infrastructure-as-a-service (IaaS) cloud platform. PhenoMeNal seamlessly integrates a wide array of existing open source tools which are tested and packaged as Docker containers through the project’s continuous integration process and deployed based on a kubernetes orchestration framework. It also provides a number of standardized, automated and published analysis workflows in the user interfaces Galaxy, Jupyter, Luigi and Pachyderm.Conclusions PhenoMeNal constitutes a keystone solution in cloud infrastructures available for metabolomics. It provides scientists with a ready-to-use, workflow-driven, reproducible and shareable data analysis platform harmonizing the software installation and configuration through user-friendly web interfaces. The deployed cloud environments can be dynamically scaled to enable large-scale analyses which are interfaced through standard data formats, versioned, and have been tested for reproducibility and interoperability. The flexible implementation of PhenoMeNal allows easy adaptation of the infrastructure to other application areas and ‘omics research domains.
Preprints
Emami Khoonsari, P.; Moreno, P.; Bergmann, S.; Burman, J.; Capuccini, M.; Carone, M.; Cascante, M.; de Atauri, P.; Foguet, C.; Gonzalez-Beltran, A.; Hankemeier, T.; Haug, K.; He, S.; Herman, S.; Johnson, D.; Kale, N.; Larsson, A.; Neumann, S.; Peters, K.; Pireddu, L.; Rocca-Serra, P.; Roger, P.; Rueedi, R.; Ruttkies, C.; Sadawi, N.; Salek, R.; Sansone, S.-A.; Schober, D.; Selivanov, V.; Thévenot, E. A.; van Vliet, M.; Zanetti, G.; Steinbeck, C.; Kultima, K.; Spjuth, O.;Interoperable and scalable data analysis with microservices: Applications in MetabolomicsbioRxiv(2017)DOI: 10.1101/213603
Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed in parallel using the Kubernetes container orchestrator. The access point is a virtual research environment which can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and established workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry studies, one nuclear magnetic resonance spectroscopy study and one fluxomics study, showing that the method scales dynamically with increasing availability of computational resources. We achieved a complete integration of the major software suites resulting in the first turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, multivariate statistics, and metabolite identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science.