Abstract
Motivation
Protein orthologous group databases are powerful tools for evolutionary analysis, functional annotation or metabolic pathway modeling across lineages. Sequences are typically assigned to orthologous groups with alignment-based methods, such as profile hidden Markov models, which have become a computational bottleneck.
Results
We present DeepNOG, an extremely fast and accurate, alignment-free orthology assignment method based on deep convolutional networks. We compare DeepNOG against state-of-the-art alignment-based (HMMER, DIAMOND) and alignment-free methods (DeepFam) on two orthology databases (COG, eggNOG 5). DeepNOG can be scaled to large orthology databases like eggNOG, for which it outperforms DeepFam in terms of precision and recall by large margins. While alignment-based methods still provide the most accurate assignments among the investigated methods, computing time of DeepNOG is an order of magnitude lower on CPUs. Optional GPU usage further increases throughput massively. A command-line tool enables rapid adoption by users.
| Originalsprache | Englisch |
|---|---|
| Seiten (von - bis) | 5304–5312 |
| Seitenumfang | 9 |
| Fachzeitschrift | Bioinformatics |
| Volume | 36 |
| Ausgabenummer | 22-23 |
| DOIs | |
| Publikationsstatus | Veröffentlicht - 01 Dez. 2020 |
Wissenschaftszweige
- 202002 Audiovisuelle Medien
- 102 Informatik
- 102001 Artificial Intelligence
- 102003 Bildverarbeitung
- 102015 Informationssysteme
JKU-Schwerpunkte
- Digital Transformation
Projekte
- 2 Abgeschlossen
-
Dust and Data - The Art of Curating in the Age of Artificial Intelligence
Flexer, A. (Projektleiter*in)
01.07.2019 → 31.12.2021
Projekt: Geförderte Forschung › FWF - Österreichischer Wissenschaftsfonds
-
Über valide und zuverlässige Experimente im Musik IR
Flexer, A. (Projektleiter*in)
01.05.2019 → 30.04.2023
Projekt: Geförderte Forschung › FWF - Österreichischer Wissenschaftsfonds
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver