LCSB R³
Responsible and Reproducible Research

Functional meta-omics provide critical insights into long and short read assemblies#

Authors#

Valentina Galata, Susheel Bhanu Busi, Benoit Josef Kunath, Laura de Nies, Magdalena Calusinska, Rashi Halder, Patrick May, Paul Wilmes, Cedric Christian Laczny

Abstract#

Real-world evaluations of metagenomic reconstructions are challenged by distinguishing reconstruction artifacts from genes and proteins present in situ. Here, we evaluate short-read-only, long-read-only and hybrid assembly approaches on four different metagenomic samples of varying complexity. We demonstrate how different assembly approaches affect gene and protein inference, which is particularly relevant for downstream functional analyses. For a human gut microbiome sample, we use complementary metatranscriptomic and metaproteomic data to assess the metagenomic data-based protein predictions. Our findings pave the way for critical assessments of metagenomic reconstructions. We propose a reference-independent solution, which exploits the synergistic effects of multi-omic data integration for the in situ study of microbiomes using long-read sequencing data.

Manuscript#

A preprint of the manuscript is available at The Preprint Server for Biology.

Data#

The newly generated data is available under NCBI BioProject accession PRJNA723028 (Biosamples: metag_sr: SAMN18797629, metat_sr: SAMN18797630, and metag_lr: SAMN18797631). Metaproteomics is available at ProteomeXchange under accession PXD025505.

The previously published data is available at ERR2984773 and ERR2887847, SRR7585899 and SRR7585900, as well as ERR2027899, ERR4334939, ERR4334940, and ERR4334941 from the European Nucleotide Archive.

Source code#

The source code is available from GitLab.