Difference between revisions of "Biocluster Mirrors"

From Carl R. Woese Institute for Genomic Biology - University of Illinois Urbana-Champaign
Line 50:
  |-
  |[https://gtdb.ecogenomic.org/ gtdb]
- |207
+ |207<br>214
  |GENOME TAXONOMY DATABASE
  |-

Revision as of 04:00, 6 June 2023

{|
! Application !! Installed Versions !! Description
|-
|alphafold-db
|20210917<br>20220118<br>20230405
|AlphaFold Databases
|-
|BUSCO-db
|4
|Based on evolutionarily-informed expectations of gene content of near-universal single-copy orthologs, the BUSCO metric is complementary to technical metrics like N50.
|-
|card-prevalence
|3.0.6
|November 2019 release: 85 pathogens, 116914 resistomes, and 182532 AMR allele sequences based on sequence data acquired from NCBI on July 31, 2019, analyzed using RGI 5.0.0 (DIAMOND homolog detection) and CARD 3.0.7. Includes pre-compiled k-mer classifier data for pathogen-of-origin prediction.
|-
|checkm-db
|20150116
|CheckM Database
|-
|checkm2-db
|20230511
|CheckM2 Database
|-
|chocophlan
|0.1.1
|
|-
|clusterblast
|20170105
|
|-
|ena
|20230511
|The European Nucleotide Archive (ENA) captures and presents information relating to experimental workflows that are based around nucleotide sequencing.
|-
|funannotate-db
|20220428
|funannotate is a pipeline for genome annotation (built specifically for fungi, but will also work with higher eukaryotes). Installation, usage, and more information can be found at http://funannotate.readthedocs.io
|-
|gatkbundle
|20191118
|The GATK resource bundle is a collection of standard files for working with human resequencing data with the GATK. We provide several versions of the bundle corresponding to the various reference builds.
|-
|genbank
|250
|GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences.
|-
|[https://gtdb.ecogenomic.org/ gtdb]
|207<br>214
|GENOME TAXONOMY DATABASE
|-
|humann-db
|201901b
|Databases for HUMAnN
|-
|interpro
|68.0
|InterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites.
|-
|KneadData-db
|20230405
|Databases for KneadData
|-
|kraken2-db
|20220327<br>20230314
|Kraken 2 is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies.
|-
|ncbi-blastdb
|20190808<br>20201212<br>20220318<br>20230124
|BLAST search pages under the Basic BLAST section of the NCBI BLAST home page (http://blast.ncbi.nlm.nih.gov/) use a standard set of BLAST databases for nucleotide, protein, and translated BLAST searches.
|-
|pfam
|32.0<br>35.0
|The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs).
|-
|pgap-db
|2021-07-01.build5508
|
|-
|refseq-db
|211
|The Reference Sequence (RefSeq) collection provides a comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins.
|-
|silva
|138.1
|SILVA provides comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya).
|-
|uniprot
|2018_04<br>2020_06<br>2021_02<br>2022_05
|The mission of UniProt is to provide the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.
|}
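
In the wiki source, a row with several installed versions separates them with `<br>`, as in the gtdb cell `|207<br>214` changed by this revision. The following is a minimal Python sketch of how such a cell could be split and the newest version picked. The function names `split_versions`, `version_key`, and `newest` are hypothetical, and the sketch assumes that comparing the numeric fields of a label left to right reflects release order, which holds for the labels listed here but is not guaranteed in general.

```python
import re


def split_versions(cell: str) -> list[str]:
    """Split an 'Installed Versions' wiki cell such as '|207<br>214' into labels."""
    # Drop surrounding whitespace and the leading cell marker, if present.
    text = cell.strip().lstrip("|")
    # Accept <br>, <br/>, and <br /> in any letter case.
    parts = re.split(r"<br\s*/?>", text, flags=re.IGNORECASE)
    return [p.strip() for p in parts if p.strip()]


def version_key(version: str) -> tuple[int, ...]:
    """Sort key: the integer fields of the label, in order of appearance.

    Assumption: numeric fields order the releases (e.g. 2021_02 < 2022_05).
    """
    return tuple(int(n) for n in re.findall(r"\d+", version))


def newest(cell: str) -> str:
    """Return the label with the largest version key in the cell."""
    return max(split_versions(cell), key=version_key)
```

For example, `newest("|207<br>214")` selects `214` from the gtdb cell. An empty cell contains no labels, so `newest` raises `ValueError` in that case.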