Biocluster Mirrors

From Carl R. Woese Institute for Genomic Biology - University of Illinois Urbana-Champaign
Revision as of 03:00, 1 October 2022 by Mediawikiapi (talk | contribs)
Jump to navigation Jump to search
Application Installed Versions Description
alphafold-db 20210917
20220118
Alphafold Databases
card-prevalence 3.0.6 November 2019 release - 85 pathogens, 116914 resistomes, and 182532 AMR allele sequences based on sequence data acquired from NCBI on July 31, 2019, analyzed using RGI 5.0.0 (DIAMOND homolog detection) and CARD 3.0.7. Includes pre-compiled k-mer classifier data for pathogen-of-origin prediction.
checkm-db 20150116 CheckM Database
chocophlan 0.1.1
clusterblast 20170105
ena Release_130 The European Nucleotide Archive (ENA) captures and presents information relating to experimental workflows that are based around nucleotide sequencing.
funannotate-db 20220428 funannotate is a pipeline for genome annotation (built specifically for fungi, but will also work with higher eukaryotes). Installation, usage, and more information can be found at http://funannotate.readthedocs.io
gatkbundle 20191118 he GATK resource bundle is a collection of standard files for working with human resequencing data with the GATK. We provide several versions of the bundle corresponding to the various reference builds,
genbank 250 GenBank ® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences
gtdb 207 GENOME TAXONOMY DATABASE
interpro 68.0 InterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites.
kraken2-db 20220327 Kraken is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies.
ncbi-blastdb 20170702
20180404
20190320
20190808
20201212
20220318
BLAST search pages under the Basic BLAST section of the NCBI BLAST home page(http://blast.ncbi.nlm.nih.gov/) use a standard set of BLAST databases for nucleotide, protein, and translated BLAST searches.
pfam 32.0
35.0
The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs)
pgap-db 2021-07-01.build5508
refseq-db 211 The Reference Sequence (RefSeq) collection provides a comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins.
silva 138.1 SILVA provides comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya).
uniprot 2018_04
2020_06
The mission of UniProt is to provide the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information.