jump to searchjump to navigationjump to content

Sort by: Year Type of publication

Displaying results 1 to 10 of 65.

Publications

Schober, D., Jacob, D., Wilson, M., Cruz, J. A., Marcu, A., Grant, J. R., Moing, A., Deborde, C., de Figueiredo, L. F., Haug, K., Rocca-Serra, P., Easton, J., Ebbels, T. M. D., Hao, J., Ludwig, C., Günther, U. L., Rosato, A., Klein, M. S., Lewis, I. A., Luchinat, C., Jones, A. R., Grauslys, A., Larralde, M., Yokochi, M., Kobayashi, N., Porzel, A., Griffin, J. L., Viant, M. R., Wishart, D. S., Steinbeck, C., Salek, R. M. & Neumann, S. nmrML: A community supported open data standard for the description, storage, and exchange of NMR data. Anal Chem 90, 649–656, (2018) DOI: 10.1021/acs.analchem.7b02795

NMR is a widely used analytical technique with a growing number of repositories available. As a result, demands for a vendor-agnostic, open data format for long-term archiving of NMR data have emerged with the aim to ease and encourage sharing, comparison, and reuse of NMR data. Here we present nmrML, an open XML-based exchange and storage format for NMR spectral data. The nmrML format is intended to be fully compatible with existing NMR data for chemical, biochemical, and metabolomics experiments. nmrML can capture raw NMR data, spectral data acquisition parameters, and where available spectral metadata, such as chemical structures associated with spectral assignments. The nmrML format is compatible with pure-compound NMR data for reference spectral libraries as well as NMR data from complex biomixtures, i.e., metabolomics experiments. To facilitate format conversions, we provide nmrML converters for Bruker, JEOL and Agilent/Varian vendor formats. In addition, easy-to-use Web-based spectral viewing, processing, and spectral assignment tools that read and write nmrML have been developed. Software libraries and Web services for data validation are available for tool developers and end-users. The nmrML format has already been adopted for capturing and disseminating NMR data for small molecules by several open source data processing tools and metabolomics reference spectral libraries, e.g., serving as storage format for the MetaboLights data repository. The nmrML open access data standard has been endorsed by the Metabolomics Standards Initiative (MSI), and we here encourage user participation and feedback to increase usability and make it a successful standard.
Publications

Döll, S., Kuhlmann, M., Rutten, T., Mette, M. F., Scharfenberg, S., Petridis, A., Berreth, D.-C. & Mock, H.-P. Accumulation of the coumarin scopolin under abiotic stress conditions is mediated by the Arabidopsis thaliana THO/TREX complex. Plant J 93, 431-444, (2018) DOI: 10.1111/tpj.13797

Secondary metabolites are involved in the plant stress response. Among these are scopolin and its active form scopoletin, which are coumarin derivatives associated with reactive oxygen species scavenging and pathogen defence. Here we show that scopolin accumulation can be induced in the root by osmotic stress and in the leaf by low-temperature stress in Arabidopsis thaliana. A genetic screen for altered scopolin levels in A. thaliana revealed a mutant compromised in scopolin accumulation in response to stress; the lesion was present in a homologue of THO1 coding for a subunit of the THO/TREX complex. The THO/TREX complex contributes to RNA silencing, supposedly by trafficking precursors of small RNAs. Mutants defective in THO, AGO1, SDS3 and RDR6 were impaired with respect to scopolin accumulation in response to stress, suggesting a mechanism based on RNA silencing such as the trans-acting small interfering RNA pathway, which requires THO/TREX function.
Publications

Meier, R., Ruttkies, C., Treutler, H. & Neumann, S. Bioinformatics can boost metabolomics research.  J. Biotechnol. 261 , 137-141, (2017) DOI: 10.1016/j.jbiotec.2017.05.018

Metabolomics is the modern term for the field of small molecule research in biology and biochemistry. Currently, metabolomics is undergoing a transition where the classic analytical chemistry is combined with modern cheminformatics and bioinformatics methods, paving the way for large-scale data analysis. We give some background on past developments, highlight current state-of-the-art approaches, and give a perspective on future requirements.
Publications

Al Shweiki, M. H. D. R., Mönchgesang, S., Majovsky, P., Thieme, D., Trutschel, D. & Hoehenwarter, W. Assessment of Label-Free quantification in discovery proteomics and impact of technological factors and natural variability of protein abundance. J Proteome Res. 16 , 1410–1424, (2017) DOI: 10.1021/acs.jproteome.6b00645

We evaluated the state of label-free discovery proteomics focusing especially on technological contributions and contributions of naturally occurring differences in protein abundance to the intersample variability in protein abundance estimates in this highly peptide-centric technology. First, the performance of popular quantitative proteomics software, Proteome Discoverer, Scaffold, MaxQuant, and Progenesis QIP, was benchmarked using their default parameters and some modified settings. Beyond this, the intersample variability in protein abundance estimates was decomposed into variability introduced by the entire technology itself and variable protein amounts inherent to individual plants of the Arabidopsis thaliana Col-0 accession. The technical component was considerably higher than the biological intersample variability, suggesting an effect on the degree and validity of reported biological changes in protein abundance. Surprisingly, the biological variability, protein abundance estimates, and protein fold changes were recorded differently by the software used to quantify the proteins, warranting caution in the comparison of discovery proteomics results. As expected, ∼99% of the proteome was invariant in the isogenic plants in the absence of environmental factors; however, few proteins showed substantial quantitative variability. This naturally occurring variation between individual organisms can have an impact on the causality of reported protein fold changes.

Publications

Witting, M., Ruttkies, C., Neumann, S. & Schmitt-Kopplin, P. LipidFrag: Improving reliability of in silico fragmentation of lipids and application to the Caenorhabditis elegans lipidome.  PLoS ONE 12, e0172311, (2017) DOI: 10.1371/journal.pone.0172311

Lipid identification is a major bottleneck in high-throughput lipidomics studies. However, tools for the analysis of lipid tandem MS spectra are rather limited. While the comparison against spectra in reference libraries is one of the preferred methods, these libraries are far from being complete. In order to improve identification rates, the in silico fragmentation tool MetFrag was combined with Lipid Maps and lipid-class specific classifiers which calculate probabilities for lipid class assignments. The resulting LipidFrag workflow was trained and evaluated on different commercially available lipid standard materials, measured with data dependent UPLC-Q-ToF-MS/MS acquisition. The automatic analysis was compared against manual MS/MS spectra interpretation. With the lipid class specific models, identification of the true positives was improved especially for cases where candidate lipids from different lipid classes had similar MetFrag scores by removing up to 56% of false positive results. This LipidFrag approach was then applied to MS/MS spectra of lipid extracts of the nematode Caenorhabditis elegans. Fragments explained by LipidFrag match known fragmentation pathways, e.g., neutral losses of lipid headgroups and fatty acid side chain fragments. Based on prediction models trained on standard lipid materials, high probabilities for correct annotations were achieved, which makes LipidFrag a good choice for automated lipid data analysis and reliability testing of lipid identifications.
Publications

Schymanski, E. L., Ruttkies, C., Krauss, M., Brouard, C., Kind, T., Dührkop, K., Allen, F., Vaniya, A., Verdegem, D., Böcker, S., Rousu, J., Shen, H., Tsugawa, H., Sajed, T., Fiehn, O., Ghesquière, B. & Neumann, S. Critical assessment of small molecule identification 2016: automated methods. J. Cheminformatics 9, 22, (2017) DOI: 10.1186/s13321-017-0207-1

Background
The fourth round of the Critical Assessment of Small Molecule Identification (CASMI) Contest (www.casmi-contest.org) was held in 2016, with two new categories for automated methods. This article covers the 208 challenges in Categories 2 and 3, without and with metadata, from organization, participation, results and post-contest evaluation of CASMI 2016 through to perspectives for future contests and small molecule annotation/identification.

Results
The Input Output Kernel Regression (CSI:IOKR) machine learning approach performed best in “Category 2: Best Automatic Structural Identification—In Silico Fragmentation Only”, won by Team Brouard with 41% challenge wins. The winner of “Category 3: Best Automatic Structural Identification—Full Information” was Team Kind (MS-FINDER), with 76% challenge wins. The best methods were able to achieve over 30% Top 1 ranks in Category 2, with all methods ranking the correct candidate in the Top 10 in around 50% of challenges. This success rate rose to 70% Top 1 ranks in Category 3, with candidates in the Top 10 in over 80% of the challenges. The machine learning and chemistry-based approaches are shown to perform in complementary ways.

Conclusions
The improvement in (semi-)automated fragmentation methods for small molecule identification has been substantial. The achieved high rates of correct candidates in the Top 1 and Top 10, despite large candidate numbers, open up great possibilities for high-throughput annotation of untargeted analysis for “known unknowns”. As more high quality training data becomes available, the improvements in machine learning methods will likely continue, but the alternative approaches still provide valuable complementary information. Improved integration of experimental context will also improve identification success further for “real life” annotations. The true “unknown unknowns” remain to be evaluated in future CASMI contests.
Books and chapters

Mönchgesang, S., Strehmel, N., Schmidt, S., Westphal, L., Taruttis, F., Müller, E., Herklotz, S., Neumann, S. & Scheel, D. Natural variation of roots exudates in Arabidopsis thaliana - linking metabolomic and genomic data. Sci Rep 6, 29033 , (2016) DOI: 10.1038/srep29033

Many metabolomics studies focus on aboveground parts of the plant, while metabolism within roots and the chemical composition of the rhizosphere, as influenced by exudation, are not deeply investigated. In this study, we analysed exudate metabolic patterns of Arabidopsis thaliana and their variation in genetically diverse accessions. For this project, we used the 19 parental accessions of the Arabidopsis MAGIC collection. Plants were grown in a hydroponic system, their exudates were harvested before bolting and subjected to UPLC/ESI-QTOF-MS analysis. Metabolite profiles were analysed together with the genome sequence information. Our study uncovered distinct metabolite profiles for root exudates of the 19 accessions. Hierarchical clustering revealed similarities in the exudate metabolite profiles, which were partly reflected by the genetic distances. An association of metabolite absence with nonsense mutations was detected for the biosynthetic pathways of an indolic glucosinolate hydrolysis product, a hydroxycinnamic acid amine and a flavonoid triglycoside. Consequently, a direct link between metabolic phenotype and genotype was detected without using segregating populations. Moreover, genomics can help to identify biosynthetic enzymes in metabolomics experiments. Our study elucidates the chemical composition of the rhizosphere and its natural variation in A. thaliana, which is important for the attraction and shaping of microbial communities.

Publications

Hoehenwarter, W., Mönchgesang, S., Neumann, S., Majovsky, P., Abel, S. & Müller, J. Comparative expression profiling reveals a role of the root apoplast in local phosphate response BMC Plant Biol. 16 , 106, (2016) DOI: 10.1186/s12870-016-0790-8

Plant adaptation to limited phosphate availability comprises a wide range of responses to conserve and remobilize internal phosphate sources and to enhance phosphate acquisition. Vigorous restructuring of root system architecture provides a developmental strategy for topsoil exploration and phosphate scavenging. Changes in external phosphate availability are locally sensed at root tips and adjust root growth by modulating cell expansion and cell division. The functionally interacting Arabidopsis genes, LOW PHOSPHATE RESPONSE 1 and 2 (LPR1/LPR2) and PHOSPHATE DEFICIENCY RESPONSE 2 (PDR2), are key components of root phosphate sensing. We recently demonstrated that the LOW PHOSPHATE RESPONSE 1 - PHOSPHATE DEFICIENCY RESPONSE 2 (LPR1-PDR2) module mediates apoplastic deposition of ferric iron (Fe3+) in the growing root tip during phosphate limitation. Iron deposition coincides with sites of reactive oxygen species generation and triggers cell wall thickening and callose accumulation, which interfere with cell-to-cell communication and inhibit root growth.

Publications

Vinaixa, M., Schymanski, E. L., Neumann, S., Navarro, M., Salek, R. M. & Yanes, O. Mass spectral databases for LC/MS and GC/MS-based metabolomics: state of the field and future prospects. Trends Analyt. Chem. 78, 23-35, (2016) DOI: 10.1016/j.trac.2015.09.005

Mass spectrometry-based metabolomics is now widely used to obtain new insights into human, plant and microbial biochemistry, drug and biomarker discovery, nutrition research and food control. Despite this great shared interest, identifying and characterizing the structure of metabolites has become a major bottleneck for converting raw mass spectrometric data into biological knowledge. In this regard, comprehensive and well-annotated MS-based spectral databases play a key role towards converting raw spectral data into metabolite annotations and thus biological knowledge. The main characteristics of the mass spectral databases currently used in MS-based metabolomics, are reviewed in this paper, underlining the advantages and limitations of each. Extending this, the overlap of compounds with MSn (n≥2) spectra from authentic chemical standards in most public and commercial databases has been calculated for the first time. Finally, future prospects for mass spectral databases are discussed in terms of the needs posed by novel applications and instrumental advancements.

Publications

Nettling, M., Treutler, H., Cerquides, J. & Grosse, I. Detecting and correcting the binding-affinity bias in ChIP-seq data using inter-species information BMC Genomics 17:347, (2016) DOI: 10.1186/s12864-016-2682-6

Background

Transcriptional gene regulation is a fundamental process in nature, and the experimental and computational investigation of DNA binding motifs and their binding sites is a prerequisite for elucidating this process. ChIP-seq has become the major technology to uncover genomic regions containing those binding sites, but motifs predicted by traditional computational approaches using these data are distorted by a ubiquitous binding-affinity bias. Here, we present an approach for detecting and correcting this bias using inter-species information.

Results

We find that the binding-affinity bias caused by the ChIP-seq experiment in the reference species is stronger than the indirect binding-affinity bias in orthologous regions from phylogenetically related species. We use this difference to develop a phylogenetic footprinting model that is capable of detecting and correcting the binding-affinity bias. We find that this model improves motif prediction and that the corrected motifs are typically softer than those predicted by traditional approaches.

Conclusions

These findings indicate that motifs published in databases and in the literature are artificially sharpened compared to the native motifs. These findings also indicate that our current understanding of transcriptional gene regulation might be blurred, but that it is possible to advance this understanding by taking into account inter-species information available today and even more in the future.

IPB Mainnav Search