Geschmack ist vorhersagbar: Mit FlavorMiner. FlavorMiner heißt das Tool, das IPB-Chemiker und Partner aus Kolumbien jüngst entwickelt haben. Das Programm kann, basierend auf maschinellem Lernen (KI), anhand der…
Seit Februar 2021 bietet Wolfgang Brandt, ehemaliger Leiter der Arbeitsgruppe Computerchemie am IPB, sein Citizen Science-Projekt zur Pilzbestimmung an. Dafür hat er in regelmäßigen Abständen öffentliche Vorträge zur Vielfalt…
Zulfiqar, M.; Gadelha, L.; Steinbeck, C.; Sorokina, M.; Peters, K.;MAW - The reproducible Metabolome Annotation Workflow for untargeted tandem mass SpectrometrybioRxiv(2022)DOI: 10.1101/2022.10.17.512224
Mapping the chemical space of compounds to chemical structures remains a challenge in metabolomics. Despite the advancements in untargeted liquid chromatography-mass spectrometry (LC-MS) to achieve a high-throughput profile of metabolites from complex biological resources, only a small fraction of these metabolites can be annotated with confidence. Many novel computational methods and tools have been developed to enable chemical structure annotation to known and unknown compounds such as in silico generated spectra and molecular networking. Here, we present an automated and reproducible Metabolome Annotation Workflow (MAW) for untargeted metabolomics data to further facilitate and automate the complex annotation by combining tandem mass spectrometry (MS2) input data pre-processing, spectral and compound database matching with computational classification, and in silico annotation. MAW takes the LC-MS2 spectra as input and generates a list of putative candidates from spectral and compound databases. The databases are integrated via the R package Spectra and the metabolite annotation tool SIRIUS as part of the R segment of the workflow (MAW-R). The final candidate selection is performed using the cheminformatics tool RDKit in the Python segment (MAW-Py). Furthermore, each feature is assigned a chemical structure and can be imported to a chemical structure similarity network. MAW is following the FAIR (Findable, Accessible, Interoperable, Reusable) principles and has been made available as the docker images, maw-r and maw-py. The source code and documentation are available on GitHub. The performance of MAW is evaluated on two case studies. We found that MAW can improve candidate ranking by integrating spectral databases with annotation tools like SIRIUS which contributes to an efficient candidate selection procedure. The results from MAW are also reproducible and traceable, compliant with the FAIR guidelines. Taken together, MAW could greatly facilitate automated metabolite characterization in diverse fields such as clinical metabolomics and natural product discovery.
Preprints
Schreiber, T.; Tripathee, S.; Iwen, T.; Prange, A.; Vahabi, K.; Grützner, R.; Horn, C.; Marillonnet, S.; Tissier, A.;DNA double strand breaks lead to de novo transcription and translation of damage-induced long RNAs in plantabioRxiv(2022)DOI: 10.1101/2022.05.11.491484
DNA double strand breaks (DSBs) are lethal threats that need to be repaired. Although many of the proteins involved in the early steps of DSB repair have been characterized, recent reports indicate that damage induced long and small RNAs also play an important role in DSB repair. Here, using a Nicotiana benthamiana transgenic line originally designed as a reporter for targeted knock-ins, we show that DSBs generated by Cas9 induce the transcription of long stable RNAs (damage-induced long RNAs - dilRNAs) that are translated into proteins. Using an array of single guide RNAs we show that the initiation of transcription takes place in the vicinity of the DSB. Single strand DNA nicks are not able to induce transcription, showing that cis DNA damage-induced transcription is specific for DSBs. Our results support a model in which a default and early event in the processing of DSBs is transcription into RNA which, depending on the genomic and genic context, can undergo distinct fates, including translation into protein, degradation or production of small RNAs. Our results have general implications for understanding the role of transcription in the repair of DSBs and, reciprocally, reveal DSBs as yet another way to regulate gene expression.
Preprints
Illig, A.-M.; Siedhoff, N. E.; Schwaneberg, U.; Davari, M. D.;A hybrid model combining evolutionary probability and machine learning leverages data-driven protein engineeringbioRxiv(2022)DOI: 10.1101/2022.06.07.495081
Protein engineering through directed evolution and (semi-)rational approaches has been applied successfully to optimize protein properties for broad applications in molecular biology, biotechnology, and biomedicine. The potential of protein engineering is not yet fully realized due to the limited screening throughput hampering the efficient exploration of the vast protein sequence space. Data-driven strategies have emerged as a powerful tool to leverage protein engineering by providing a model of the sequence-fitness landscape that can exhaustively be explored in silico and capitalize on the high diversity potential offered by nature However, as both the quality and quantity of the inputted data determine the success of such approaches, the applicability of data-driven strategies is often limited due to sparse data. Here, we present a hybrid model that combines direct coupling analysis and machine learning techniques to enable data-driven protein engineering when only few labeled sequences are available. Our method achieves high performance in predicting a protein’s fitness based on its sequence regardless of the number of sequences-fitness pairs in the training dataset. Besides reducing the computational effort compared to state-of-the-art methods, it outperforms them for sparse data situations, i.e., 50 − 250 labeled sequences available for training. In essence, the developed method is auspicious for data-driven protein engineering, especially for protein engineers who have only access to a limited amount of data for sequence-fitness landscape modeling.
Preprints
Bao, Z.; Guo, Y.; Deng, Y.; Zang, J.; Zhang, J.; Ouyang, B.; Qu, X.; Bürstenbinder, K.; Wang, P.;The microtubule-associated protein SlMAP70 interacts with SlIQD21 and regulates fruit shape formation in tomatobioRxiv(2022)DOI: 10.1101/2022.08.08.503161
The shape of tomato fruits is closely correlated to microtubule organization and the activity of microtubule associated proteins (MAP), but insights into the mechanism from a cell biology perspective are still largely elusive. Analysis of tissue expression profiles of different microtubule regulators revealed that functionally distinct classes of MAPs are highly expressed during fruit development. Among these, several members of the plant-specific MAP70 family are preferably expressed at the initiation stage of fruit development. Transgenic tomato lines overexpressing SlMAP70 produced elongated fruits that show reduced cell circularity and microtubule anisotropy, while SlMAP70 loss-of-function mutant showed an opposite effect with flatter fruits. Microtubule anisotropy of fruit endodermis cells exhibited dramatic rearrangement during tomato fruit development, and SlMAP70-1 is likely implicated in cortical microtubule organization and fruit elongation throughout this stage by interacting with SUN10/SlIQD21a. The expression of SlMAP70 (or co-expression of SlMAP70 and SUN10/SlIQD21a) induces microtubule stabilization and prevents its dynamic rearrangement, both activities are essential for fruit shape establishment after anthesis. Together, our results identify SlMAP70 as a novel regulator of fruit elongation, and demonstrate that manipulating microtubule stability and organization at the early fruit developmental stage has a strong impact on fruit shape.