Difference between revisions of "Biocluster Alphafold"
Jump to navigation
Jump to search
(Created page with "__TOC__ * Load alphafold module <pre> module load alphafold/2.1.1 </pre>") |
|||
Line 4: | Line 4: | ||
<pre> | <pre> | ||
module load alphafold/2.1.1 | module load alphafold/2.1.1 | ||
+ | </pre> | ||
+ | |||
+ | = Parameters = | ||
+ | <pre> | ||
+ | --fasta-paths FASTA_PATHS [FASTA_PATHS ...], -f FASTA_PATHS [FASTA_PATHS ...] | ||
+ | Paths to FASTA files, each containing one sequence. | ||
+ | All FASTA paths must have a unique basename as the | ||
+ | basename is used to name the output directories for | ||
+ | each prediction. | ||
+ | --is-prokaryote-list IS_PROKARYOTE_LIST [IS_PROKARYOTE_LIST ...] | ||
+ | Optional for multimer system, not used by the single | ||
+ | chain system. This list should contain a boolean for | ||
+ | each fasta specifying true where the target complex is | ||
+ | from a prokaryote, and false where it is not, or where | ||
+ | the origin is unknown. These values determine the | ||
+ | pairing method for the MSA. | ||
+ | --max-template-date MAX_TEMPLATE_DATE, -t MAX_TEMPLATE_DATE | ||
+ | Maximum template release date to consider (ISO-8601 | ||
+ | format - i.e. YYYY-MM-DD). Important if folding | ||
+ | historical test sets. | ||
+ | --db-preset {reduced_dbs,full_dbs} | ||
+ | Choose preset model configuration - no ensembling with | ||
+ | uniref90 + bfd + uniclust30 (full_dbs), or 8 model | ||
+ | ensemblings with uniref90 + bfd + uniclust30 (casp14). | ||
+ | --model-preset {monomer,monomer_casp14,monomer_ptm,multimer} | ||
+ | Choose preset model configuration - the monomer model, | ||
+ | the monomer model with extra ensembling, monomer model | ||
+ | with pTM head, or multimer model | ||
+ | --benchmark, -b Run multiple JAX model evaluations to obtain a timing | ||
+ | that excludes the compilation time, which should be | ||
+ | more indicative of the time required for inferencing | ||
+ | many proteins. | ||
+ | --use-precomputed-msas | ||
+ | Whether to read MSAs that have been written to disk. | ||
+ | WARNING: This will not check if the sequence, database | ||
+ | or configuration have changed. | ||
+ | --data-dir DATA_DIR, -d DATA_DIR | ||
+ | Path to directory with supporting data: AlphaFold | ||
+ | parameters and genetic and template databases. Set to | ||
+ | the target of download_all_databases.sh. | ||
+ | --docker-image DOCKER_IMAGE | ||
+ | Alphafold docker image. | ||
+ | --output-dir OUTPUT_DIR, -o OUTPUT_DIR | ||
+ | Output directory for results. | ||
+ | --use-gpu Enable NVIDIA runtime to run with GPUs. | ||
+ | --gpu-devices GPU_DEVICES | ||
+ | Comma separated list of devices to pass to | ||
+ | NVIDIA_VISIBLE_DEVICES. | ||
+ | --cpus CPUS, -c CPUS Number of CPUs to use. | ||
</pre> | </pre> |
Revision as of 18:05, 14 February 2022
Contents
- Load alphafold module
module load alphafold/2.1.1
Parameters[edit]
--fasta-paths FASTA_PATHS [FASTA_PATHS ...], -f FASTA_PATHS [FASTA_PATHS ...] Paths to FASTA files, each containing one sequence. All FASTA paths must have a unique basename as the basename is used to name the output directories for each prediction. --is-prokaryote-list IS_PROKARYOTE_LIST [IS_PROKARYOTE_LIST ...] Optional for multimer system, not used by the single chain system. This list should contain a boolean for each fasta specifying true where the target complex is from a prokaryote, and false where it is not, or where the origin is unknown. These values determine the pairing method for the MSA. --max-template-date MAX_TEMPLATE_DATE, -t MAX_TEMPLATE_DATE Maximum template release date to consider (ISO-8601 format - i.e. YYYY-MM-DD). Important if folding historical test sets. --db-preset {reduced_dbs,full_dbs} Choose preset model configuration - no ensembling with uniref90 + bfd + uniclust30 (full_dbs), or 8 model ensemblings with uniref90 + bfd + uniclust30 (casp14). --model-preset {monomer,monomer_casp14,monomer_ptm,multimer} Choose preset model configuration - the monomer model, the monomer model with extra ensembling, monomer model with pTM head, or multimer model --benchmark, -b Run multiple JAX model evaluations to obtain a timing that excludes the compilation time, which should be more indicative of the time required for inferencing many proteins. --use-precomputed-msas Whether to read MSAs that have been written to disk. WARNING: This will not check if the sequence, database or configuration have changed. --data-dir DATA_DIR, -d DATA_DIR Path to directory with supporting data: AlphaFold parameters and genetic and template databases. Set to the target of download_all_databases.sh. --docker-image DOCKER_IMAGE Alphafold docker image. --output-dir OUTPUT_DIR, -o OUTPUT_DIR Output directory for results. --use-gpu Enable NVIDIA runtime to run with GPUs. --gpu-devices GPU_DEVICES Comma separated list of devices to pass to NVIDIA_VISIBLE_DEVICES. --cpus CPUS, -c CPUS Number of CPUs to use.