- Results as:
- Print view
- Endnote (RIS)
- BibTeX
- Table: CSV | HTML
Preprints
Preprints
Preprints
Printed publications
Preprints
Preprints
Preprints
Preprints
This page was last modified on 27 Jan 2025 27 Jan 2025 .
Research Mission and Profile
Molecular Signal Processing
Bioorganic Chemistry
Biochemistry of Plant Interactions
Cell and Metabolic Biology
Independent Junior Research Groups
Program Center MetaCom
Publications
Good Scientific Practice
Research Funding
Networks and Collaborative Projects
Symposia and Colloquia
Alumni Research Groups
Preprints
Mapping the chemical space of compounds to chemical structures remains a challenge in metabolomics. Despite the advancements in untargeted liquid chromatography-mass spectrometry (LC-MS) to achieve a high-throughput profile of metabolites from complex biological resources, only a small fraction of these metabolites can be annotated with confidence. Many novel computational methods and tools have been developed to enable chemical structure annotation to known and unknown compounds such as in silico generated spectra and molecular networking. Here, we present an automated and reproducible Metabolome Annotation Workflow (MAW) for untargeted metabolomics data to further facilitate and automate the complex annotation by combining tandem mass spectrometry (MS2) input data pre-processing, spectral and compound database matching with computational classification, and in silico annotation. MAW takes the LC-MS2 spectra as input and generates a list of putative candidates from spectral and compound databases. The databases are integrated via the R package Spectra and the metabolite annotation tool SIRIUS as part of the R segment of the workflow (MAW-R). The final candidate selection is performed using the cheminformatics tool RDKit in the Python segment (MAW-Py). Furthermore, each feature is assigned a chemical structure and can be imported to a chemical structure similarity network. MAW is following the FAIR (Findable, Accessible, Interoperable, Reusable) principles and has been made available as the docker images, maw-r and maw-py. The source code and documentation are available on GitHub. The performance of MAW is evaluated on two case studies. We found that MAW can improve candidate ranking by integrating spectral databases with annotation tools like SIRIUS which contributes to an efficient candidate selection procedure. The results from MAW are also reproducible and traceable, compliant with the FAIR guidelines. Taken together, MAW could greatly facilitate automated metabolite characterization in diverse fields such as clinical metabolomics and natural product discovery.
Preprints
Proteome remodeling is a fundamental adaptive response and proteins in complex and functionally related proteins are often co-expressed. Using a deep sampling strategy we define Arabidopsis thaliana tissue core proteomes at around 10,000 proteins per tissue and absolutely quantify (copy numbers per cell) nearly 16,000 proteins throughout the plant lifecycle. A proteome wide survey of global post translational modification revealed amino acid exchanges pointing to potential conservation of translational infidelity in eukaryotes. Correlation analysis of protein abundance uncovered potentially new tissue and age specific roles of entire signaling modules regulating transcription in photosynthesis, seed development and senescence and abscission. Among others, the data suggest a potential function of RD26 and other NAC transcription factors in seed development related to desiccation tolerance as well as a possible function of Cysteine-rich Receptor-like Kinases (CRKs) as ROS sensors in senescence. All of the components of ribosome biogenesis factor (RBF) complexes were co-expressed tissue and age specifically indicating functional promiscuity in the assembly of these little described protein complexes in Arabidopsis. Treatment of seedlings with flg22 for 16 hours allowed us to characterize proteome architecture in basal immunity in detail. The results were complemented with parallel reaction monitoring (PRM) targeted proteomics, phytohormone, amino acid and transcript measurements. We obtained strong evidence of suppression of jasmonate (JA) and JA-Ile levels by deconjugation and hydroxylation via IAA-ALA RESISTANT3 (IAR3) and JASMONATE-INDUCED OXYGENASE 2 (JOX2) under the control of JASMONATE INSENSITIVE 1 (MYC2). This previously unknown regulatory switch is another part of the puzzle of the as yet understudied role of JA in pattern triggered immunity. The extensive coverage of the Arabidopsis proteome in various biological scenarios presents a rich resource to plant biologists that we make available to the community.
Preprints
The growth of online mass spectrometry metabolomics resources, including data repositories, spectral library databases, and online analysis platforms has created an environment of online/web accessibility. Here, we introduce the Metabolomics Spectrum Resolver (https://metabolomics-usi.ucsd.edu/), a tool that builds upon these exciting developments to allow for consistent data export (in human and machine-readable forms) and publication-ready visualisations for tandem mass spectrometry spectra. This tool supports the draft Human Proteome Organizations Proteomics Standards Initiative’s USI specification, which has been extended to deal with the metabolomics use cases. To date, this resource already supports data formats from GNPS, MassBank, MS2LDA, MassIVE, MetaboLights, and Metabolomics Workbench and is integrated into several of these resources, providing a valuable open source community contribution (https://github.com/mwang87/MetabolomicsSpectrumResolver).
Printed publications
Molecular networking has become a key method used to visualize and annotate the chemical space in non-targeted mass spectrometry-based experiments. However, distinguishing isomeric compounds and quantitative interpretation are currently limited. Therefore, we created Feature-based Molecular Networking (FBMN) as a new analysis method in the Global Natural Products Social Molecular Networking (GNPS) infrastructure. FBMN leverages feature detection and alignment tools to enhance quantitative analyses and isomer distinction, including from ion-mobility spectrometry experiments, in molecular networks.
Preprints
Background Metabolomics is the comprehensive study of a multitude of small molecules to gain insight into an organism’s metabolism. The research field is dynamic and expanding with applications across biomedical, biotechnological and many other applied biological domains. Its computationally-intensive nature has driven requirements for open data formats, data repositories and data analysis tools. However, the rapid progress has resulted in a mosaic of independent – and sometimes incompatible – analysis methods that are difficult to connect into a useful and complete data analysis solution.Findings The PhenoMeNal (Phenome and Metabolome aNalysis) e-infrastructure provides a complete, workflow-oriented, interoperable metabolomics data analysis solution for a modern infrastructure-as-a-service (IaaS) cloud platform. PhenoMeNal seamlessly integrates a wide array of existing open source tools which are tested and packaged as Docker containers through the project’s continuous integration process and deployed based on a kubernetes orchestration framework. It also provides a number of standardized, automated and published analysis workflows in the user interfaces Galaxy, Jupyter, Luigi and Pachyderm.Conclusions PhenoMeNal constitutes a keystone solution in cloud infrastructures available for metabolomics. It provides scientists with a ready-to-use, workflow-driven, reproducible and shareable data analysis platform harmonizing the software installation and configuration through user-friendly web interfaces. The deployed cloud environments can be dynamically scaled to enable large-scale analyses which are interfaced through standard data formats, versioned, and have been tested for reproducibility and interoperability. The flexible implementation of PhenoMeNal allows easy adaptation of the infrastructure to other application areas and ‘omics research domains.
Preprints
Making reproducible, auditable and scalable data-processing analysis workflows is an important challenge in the field of bioinformatics. Recently, software containers and cloud computing introduced a novel solution to address these challenges. They simplify software installation, management and reproducibility by packaging tools and their dependencies. In this work we implemented a cloud provider agnostic and scalable container orchestration setup for the popular Galaxy workflow environment. This solution enables Galaxy to run on and offload jobs to most cloud providers (e.g. Amazon Web Services, Google Cloud or OpenStack, among others) through the Kubernetes container orchestrator. Availability: All code has been contributed to the Galaxy Project and is available (since Galaxy 17.05) at https://github.com/galaxyproject/ in the galaxy and galaxy-kubernetes repositories. https://public.phenomenal-h2020.eu/ is an example deployment.
Preprints
Site-directed methods for the generation of genetic diversity are essential tools in the field of directed enzyme evolution. The Golden Gate cloning technique has been proven to be an efficient tool for a variety of cloning setups. The utilization of restriction enzymes which cut outside of their recognition domain allows the assembly of multiple gene fragments obtained by PCR amplification without altering the open reading frame of the reconstituted gene. We have developed a protocol, termed Golden Muta-genesis that allows the rapid, straightforward, reliable and inexpensive construction of mutagenesis libraries. One to five amino acid positions within a coding sequence could be altered simultaneously using a protocol which can be performed within one day. To facilitate the implementation of this technique, a software library and web application for automated primer design and for the graphical evaluation of the randomization success based on the sequencing results was developed. This allows facile primer design and application of Golden Mutagenesis also for laboratories, which are not specialized in molecular biology.
Preprints
Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed in parallel using the Kubernetes container orchestrator. The access point is a virtual research environment which can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and established workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry studies, one nuclear magnetic resonance spectroscopy study and one fluxomics study, showing that the method scales dynamically with increasing availability of computational resources. We achieved a complete integration of the major software suites resulting in the first turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, multivariate statistics, and metabolite identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science.
This page was last modified on 27 Jan 2025 27 Jan 2025 .