Publications: Leibniz Institute of Plant Biochemistry

Publications

Ruttkies, C.; Neumann, S.; Posch, S.; Improving MetFrag with statistical learning of fragment annotations BMC Bioinformatics 20 376 (2019) DOI: 10.1186/s12859-019-2954-7

Nettling, M.; Treutler, H.; Cerquides, J.; Grosse, I.; Combining phylogenetic footprinting with motif models incorporating intra-motif dependencies BMC Bioinformatics 18 141 (2017) DOI: 10.1186/s12859-017-1495-1

Nettling, M.; Treutler, H.; Grau, J.; Keilwagen, J.; Posch, S.; Grosse, I.; DiffLogo: a comparative visualization of sequence motifs BMC Bioinformatics 16 387 (2015) DOI: 10.1186/s12859-015-0767-x

Moreno, P.; Beisken, S.; Harsha, B.; Muthukrishnan, V.; Tudose, I.; Dekker, A.; Dornfeldt, S.; Taruttis, F.; Grosse, I.; Hastings, J.; Neumann, S.; Steinbeck, C.; BiNChE: A web tool and library for chemical enrichment analysis based on the ChEBI ontology BMC Bioinformatics 16 56 (2015) DOI: 10.1186/s12859-015-0486-3

Libiseller, G.; Dvorzak, M.; Kleb, U.; Gander, E.; Eisenberg, T.; Madeo, F.; Neumann, S.; Trausinger, G.; Sinner, F.; Pieber, T.; Magnes, C.; IPO: a tool for automated optimization of XCMS parameters BMC Bioinformatics 16 118 (2015) DOI: 10.1186/s12859-015-0562-8

Background: Untargeted metabolomics generates a huge amount of data. Software packages for automated data processing are crucial to successfully process these data. A variety of such software packages exist, but the outcome of data processing strongly depends on algorithm parameter settings. If they are not carefully chosen, suboptimal parameter settings can easily lead to biased results. Therefore, parameter settings also require optimization. Several parameter optimization approaches have already been proposed, but a software package for parameter optimization which is free of intricate experimental labeling steps, fast and widely applicable is still missing.

Results: We implemented the software package IPO (‘Isotopologue Parameter Optimization’) which is fast and free of labeling steps, and applicable to data from different kinds of samples and data from different methods of liquid chromatography - high resolution mass spectrometry and data from different instruments. IPO optimizes XCMS peak picking parameters by using natural, stable 13C isotopic peaks to calculate a peak picking score. Retention time correction is optimized by minimizing relative retention time differences within peak groups. Grouping parameters are optimized by maximizing the number of peak groups that show one peak from each injection of a pooled sample. The different parameter settings are achieved by design of experiments, and the resulting scores are evaluated using response surface models. IPO was tested on three different data sets, each consisting of a training set and test set. IPO resulted in an increase of reliable groups (146% - 361%), a decrease of non-reliable groups (3% - 8%) and a decrease of the retention time deviation to one third.

Conclusions: IPO was successfully applied to data derived from liquid chromatography coupled to high resolution mass spectrometry from three studies with different sample types and different chromatographic methods and devices. We were also able to show the potential of IPO to increase the reliability of metabolomics data. The source code is implemented in R, tested on Linux and Windows and it is freely available for download at https://github.com/glibiseller/IPO. The training sets and test sets can be downloaded from https://health.joanneum.at/IPO.
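The grouping criterion described in the abstract above (peak groups that show one peak from each injection of a pooled sample) can be illustrated with a minimal Python sketch. This is not IPO's code, which is written in R; the function name `count_reliable_groups` and the data layout (each group as a list of injection indices) are assumptions made only for this illustration.

```python
def count_reliable_groups(peak_groups, n_injections):
    """Count peak groups that hold exactly one peak from every injection.

    peak_groups: list of groups; each group is a list of injection indices
    (0 .. n_injections - 1), one entry per peak in the group.
    This layout is hypothetical and chosen for illustration only.
    """
    expected = set(range(n_injections))
    reliable = 0
    for group in peak_groups:
        # Same length as the injection count plus full coverage of all
        # injections means every injection contributes exactly one peak.
        if len(group) == n_injections and set(group) == expected:
            reliable += 1
    return reliable


# Made-up example with three injections of a pooled sample.
groups = [
    [0, 1, 2],     # one peak per injection
    [0, 0, 1, 2],  # injection 0 contributes two peaks
    [0, 2],        # injection 1 is missing
]
print(count_reliable_groups(groups, n_injections=3))
```

Under this reading, a parameter setting that yields a higher count would be preferred when grouping parameters are tuned; how IPO combines the count with other terms is not stated in the abstract and is not modeled here.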

Gonzalez-Beltran, A.; Neumann, S.; Maguire, E.; Sansone, S.-A.; Rocca-Serra, P.; The Risa R/Bioconductor package: integrative data analysis from experimental metadata and back again BMC Bioinformatics 15 (Suppl 1) S11 (2014) DOI: 10.1186/1471-2105-15-S1-S11

Background: The ISA-Tab format and software suite have been developed to break the silo effect induced by technology-specific formats for a variety of data types and to better support experimental metadata tracking. Experimentalists seldom use a single technique to monitor biological signals. Providing a multi-purpose, pragmatic and accessible format that abstracts away common constructs for describing Investigations, Studies and Assays, ISA is increasingly popular. To attract further interest towards the format and extend support to ensure reproducible research and reusable data, we present the Risa package, which delivers a central component to support the ISA format by enabling effortless integration with R, the popular, open source data crunching environment.

Results: The Risa package bridges the gap between metadata collection and curation in an ISA-compliant way and data analysis using the widely used statistical computing environment R. The package offers functionality for: i) parsing ISA-Tab datasets into R objects; ii) augmenting annotation with extra metadata not explicitly stated in the ISA syntax; iii) interfacing with domain-specific R packages; iv) suggesting potentially useful R packages available in Bioconductor for subsequent processing of the experimental data described in the ISA format; and finally v) saving back to ISA-Tab files augmented with analysis-specific metadata from R. We demonstrate these features by presenting use cases for mass spectrometry data and DNA microarray data.

Conclusions: The Risa package is open source (with LGPL license) and freely available through Bioconductor. By making Risa available, we aim to facilitate the task of processing experimental data, encouraging a uniform representation of experimental information and results while delivering tools for ensuring traceability and provenance tracking.

Software availability: The Risa package has been available since Bioconductor 2.11 (version 1.0.0), and version 1.2.1 appeared in Bioconductor 2.12, both along with documentation and examples. The latest version of the code is at the development branch in Bioconductor and can also be accessed from GitHub at https://github.com/ISA-tools/Risa, where the issue tracker allows users to report bugs or feature requests.

Harloff, H.-J.; Lemcke, S.; Mittasch, J.; Frolov, A.; Wu, J. G.; Dreyer, F.; Leckband, G.; Jung, C.; A mutation screening platform for rapeseed (Brassica napus L.) and the detection of sinapine biosynthesis mutants Theor. Appl. Genet. 124 957-969 (2012) DOI: 10.1007/s00122-011-1760-z

Wolf, S.; Schmidt, S.; Müller-Hannemann, M.; Neumann, S.; In silico fragmentation for computer assisted identification of metabolite mass spectra BMC Bioinformatics 11 148 (2010) DOI: 10.1186/1471-2105-11-148

Background: Mass spectrometry has become the analytical method of choice in metabolomics research. The identification of unknown compounds is the main bottleneck. In addition to the precursor mass, tandem MS spectra carry informative fragment peaks, but spectral libraries of measured reference compounds are far from covering the complete chemical space. Compound libraries such as PubChem or KEGG describe a larger number of compounds, which can be used to compare their in silico fragmentation with spectra of unknown metabolites.

Results: We created the MetFrag suite to obtain a candidate list from compound libraries based on the precursor mass, subsequently ranked by the agreement between measured and in silico fragments. In the evaluation MetFrag was able to rank most of the correct compounds within the top 3 candidates returned by an exact mass query in KEGG. Compared to a previously published study, MetFrag obtained better results than the commercial MassFrontier software. Especially for large compound libraries, the candidates with a good score show a high structural similarity or just different stereochemistry; a subsequent clustering based on chemical distances reduces this redundancy. The in silico fragmentation requires less than a second to process a molecule, and MetFrag performs a search in KEGG or PubChem on average within 30 to 300 seconds, respectively, on an average desktop PC.

Conclusions: We presented a method that is able to identify small molecules from tandem MS measurements, even without spectral reference data or a large set of fragmentation rules. With today's massive general purpose compound libraries we obtain dozens of very similar candidates, which still allows a confident estimate of the correct compound class. Our tool MetFrag improves the identification of unknown substances from tandem MS spectra and delivers better results than comparable commercial software. MetFrag is available through a web application, web services and as a Java library. The web frontend allows the end-user to analyse single spectra and browse the results, whereas the web service and console application are intended for batch searches and evaluation.
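The two-step idea in the abstract above (select candidates by precursor mass, then rank them by agreement between measured and in silico fragments) can be sketched in Python as follows. This is a simplified stand-in, not MetFrag's actual scoring function: the score here is just the fraction of measured peaks matched by a predicted fragment mass. The function names, the ppm tolerance, the candidate layout and all mass values are assumptions for illustration.

```python
def within_ppm(observed, reference, ppm):
    """True if observed lies within a relative tolerance (ppm) of reference."""
    return abs(observed - reference) <= reference * ppm / 1e6


def rank_candidates(precursor_mass, measured_peaks, candidates, ppm=10.0):
    """Rank candidates by the fraction of measured peaks they explain.

    candidates: dict mapping a name to (exact_mass, predicted_fragment_masses).
    The layout and the scoring rule are hypothetical simplifications.
    """
    ranked = []
    for name, (mass, fragments) in candidates.items():
        # Step 1: keep only candidates whose mass matches the precursor.
        if not within_ppm(mass, precursor_mass, ppm):
            continue
        # Step 2: count measured peaks matched by any predicted fragment.
        explained = sum(
            1
            for peak in measured_peaks
            if any(within_ppm(frag, peak, ppm) for frag in fragments)
        )
        score = explained / len(measured_peaks) if measured_peaks else 0.0
        ranked.append((name, score))
    # Best score first; ties broken by name for a stable order.
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked


# Made-up masses, for illustration only.
candidates = {
    "A": (180.0634, [163.0601, 145.0495, 85.0284]),
    "B": (180.0634, [163.0601]),
    "C": (250.0000, [163.0601]),
}
print(rank_candidates(180.0634, [163.0601, 145.0495], candidates))
```

In a real pipeline the predicted fragment masses would come from an in silico fragmentation of each candidate structure, which this sketch takes as given input.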

Mittasch, J.; Mikolajewski, S.; Breuer, F.; Strack, D.; Milkowski, C.; Genomic microstructure and differential expression of the genes encoding UDP-glucose:sinapate glucosyltransferase (UGT84A9) in oilseed rape (Brassica napus) Theor. Appl. Genet. 120 1485-1500 (2010) DOI: 10.1007/s00122-010-1270-4

Lange, E.; Tautenhahn, R.; Neumann, S.; Gröpl, C.; Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements BMC Bioinformatics 9 375 (2008) DOI: 10.1186/1471-2105-9-375

Background: Liquid chromatography coupled to mass spectrometry (LC-MS) has become a prominent tool for the analysis of complex proteomics and metabolomics samples. In many applications multiple LC-MS measurements need to be compared, e.g. to improve reliability or to combine results from different samples in a statistical comparative analysis. As in all physical experiments, LC-MS data are affected by uncertainties, and variability of retention time is encountered in all data sets. It is therefore necessary to estimate and correct the underlying distortions of the retention time axis to search for corresponding compounds in different samples. To this end, a variety of so-called LC-MS map alignment algorithms have been developed during the last four years. Most of these approaches are well documented, but they are usually evaluated on very specific samples only. So far, no publication has assessed different alignment algorithms using a standard LC-MS sample along with commonly used quality criteria.

Results: We propose two LC-MS proteomics as well as two LC-MS metabolomics data sets that represent typical alignment scenarios. Furthermore, we introduce a new quality measure for the evaluation of LC-MS alignment algorithms. Using the four data sets to compare six freely available alignment algorithms proposed for the alignment of metabolomics and proteomics LC-MS measurements, we found significant differences with respect to alignment quality, running time, and usability in general.

Conclusion: The multitude of available alignment methods necessitates the generation of standard data sets and quality measures that allow users as well as developers to benchmark and compare their map alignment tools on a fair basis. Our study represents a first step in this direction. Currently, the installation and evaluation of the "correct" parameter settings can be quite a time-consuming task, and the success of a particular method is still highly dependent on the experience of the user. Therefore, we propose to continue and extend this type of study to a community-wide competition. All data as well as our evaluation scripts are available at http://msbi.ipb-halle.de/msbi/caap.