Biocluster Mirrors
Revision as of 03:00, 11 April 2023 by Mediawikiapi (talk | contribs)
Application | Installed Versions | Description |
---|---|---|
alphafold-db | 20210917 20220118 20230405 |
Alphafold Databases |
BUSCO-db | 4 | Based on evolutionarily-informed expectations of gene content of near-universal single-copy orthologs, BUSCO metric is complementary to technical metrics like N50. |
card-prevalence | 3.0.6 | November 2019 release - 85 pathogens, 116914 resistomes, and 182532 AMR allele sequences based on sequence data acquired from NCBI on July 31, 2019, analyzed using RGI 5.0.0 (DIAMOND homolog detection) and CARD 3.0.7. Includes pre-compiled k-mer classifier data for pathogen-of-origin prediction. |
checkm-db | 20150116 | CheckM Database |
chocophlan | 0.1.1 | |
clusterblast | 20170105 | |
ena | 20230208 | The European Nucleotide Archive (ENA) captures and presents information relating to experimental workflows that are based around nucleotide sequencing. |
funannotate-db | 20220428 | funannotate is a pipeline for genome annotation (built specifically for fungi, but will also work with higher eukaryotes). Installation, usage, and more information can be found at http://funannotate.readthedocs.io |
gatkbundle | 20191118 | he GATK resource bundle is a collection of standard files for working with human resequencing data with the GATK. We provide several versions of the bundle corresponding to the various reference builds, |
genbank | 250 | GenBank ® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences |
gtdb | 207 | GENOME TAXONOMY DATABASE |
humann-db | 201901b | Databases for HUMAnN |
interpro | 68.0 | InterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites. |
KneadData-db | 20230405 | Databases for KneadData |
kraken2-db | 20220327 | Kraken 2 is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies. |
ncbi-blastdb | 20170702 20180404 20190320 20190808 20201212 20220318 20230124 |
BLAST search pages under the Basic BLAST section of the NCBI BLAST home page(http://blast.ncbi.nlm.nih.gov/) use a standard set of BLAST databases for nucleotide, protein, and translated BLAST searches. |
pfam | 32.0 35.0 |
The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs) |
pgap-db | 2021-07-01.build5508 | |
refseq-db | 211 | The Reference Sequence (RefSeq) collection provides a comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins. |
silva | 138.1 | SILVA provides comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya). |
uniprot | 2018_04 2020_06 2021_02 2022_05 |
The mission of UniProt is to provide the scientific community with a comprehensive, high-quality and freely accessible resource of protein sequence and functional information. |