Name Uploaded Size
gtdb.tar.gz Mon, 01 Apr 2024 03:37:39 GMT 68.8 GB
refseq_prokaryote_virus.tar.gz Mon, 01 Apr 2024 09:32:29 GMT 88.7 GB
refseq_release217+human.tar.gz Tue, 04 Jul 2023 13:48:16 GMT 306.4 GB
refseq_virus.tar.gz Mon, 01 Apr 2024 02:42:58 GMT 4.0 GB
Metabuli classifies metagenomic reads by comparing them to reference genomes. You can use Metabuli to profile the taxonomic composition of your samples or to detect specific (pathogenic) species.
Sensitive and Specific. Metabuli uses a novel k-mer structure, called metamer, to analyze both amino acid (AA) and DNA sequences. It leverages AA conservation for sensitive homology detection and DNA mutations for specific differentiation between closely related taxa.
A laptop is enough. Metabuli operates within user-specified RAM limits, allowing it to search any database that fits in storage. A PC with 8 GiB of RAM is sufficient for most analyses.
A few clicks are enough. A GUI is available here. You can run Metabuli and browse the results with just a few clicks on your PC.
Short reads, long reads, and contigs. Metabuli can classify all types of sequences.

Data description

gtdb.tar.gz (101 GiB):
- GTDB 214.1 (Complete Genome/Chromosome, CheckM completeness > 90 and contamination < 5) + A human genome (T2T-CHM13v2.0)
refseq_virus.tar.gz (8.1 GiB):
- NCBI RefSeq release 223 virus genomes + A human genome (T2T-CHM13v2.0)
refseq_prokaryote_virus.tar.gz (115.6 GiB):
- RefSeq prokaryote genomes (Complete Genome/Chromosome, 2024-03-26) + RefSeq Virus above + A human genome (T2T-CHM13v2.0)
refseq_release217+human.tar.gz (480.5 GiB):
- Viral and prokaryotic genomes of RefSeq release 217 and human genome (GRCh38.p14)

License

All files are available under a Creative Commons Attribution 4.0 International License.