Krawczyk, Pawel S and Lipinski, Leszek and Dziembowski, Andrzej (2018) PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic acids research . ISSN 1362-4962
|
PDF
- Published Version
Available under License Creative Commons Attribution. 552kB | |
|
PDF (Supplementary data)
- Supplemental Material
Available under License Creative Commons Attribution. 21kB | |
Microsoft Excel (Supplementary Tables)
- Supplemental Material
Available under License Creative Commons Attribution. 2MB |
Official URL: https://academic.oup.com/nar/advance-article-abstr...
Abstract
Plasmids are mobile genetics elements that play an important role in the environmental adaptation of microorganisms. Although plasmids are usually analyzed in cultured microorganisms, there is a need for methods that allow for the analysis of pools of plasmids (plasmidomes) in environmental samples. To that end, several molecular biology and bioinformatics methods have been developed; however, they are limited to environments with low diversity and cannot recover large plasmids. Here, we present PlasFlow, a novel tool based on genomic signatures that employs a neural network approach for identification of bacterial plasmid sequences in environmental samples. PlasFlow can recover plasmid sequences from assembled metagenomes without any prior knowledge of the taxonomical or functional composition of samples with an accuracy up to 96%. It can also recover sequences of both circular and linear plasmids and can perform initial taxonomical classification of sequences. Compared to other currently available tools, PlasFlow demonstrated significantly better performance on test datasets. Analysis of two samples from heavy metal-contaminated microbial mats revealed that plasmids may constitute an important fraction of their metagenomes and carry genes involved in heavy-metal homeostasis, proving the pivotal role of plasmids in microorganism adaptation to environmental conditions.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | metagenomics, plasmids, plasmid, metagenome, machine learning, neural network, bioinformatics |
Subjects: | Q Science > QA Mathematics > QA76 Computer software Q Science > QH Natural history > QH301 Biology Q Science > QR Microbiology |
Divisions: | Laboratory of RNA Biology and Functional Genomics |
ID Code: | 1485 |
Deposited By: | Dr Pawel S Krawczyk |
Deposited On: | 29 Jan 2018 10:02 |
Last Modified: | 29 Jan 2018 10:02 |
Repository Staff Only: item control page