Biocluster Alphafold: Difference between revisions
Jump to navigation
Jump to search
Created page with "__TOC__ * Load alphafold module <pre> module load alphafold/2.1.1 </pre>" |
No edit summary |
||
| Line 4: | Line 4: | ||
<pre> | <pre> | ||
module load alphafold/2.1.1 | module load alphafold/2.1.1 | ||
</pre> | |||
= Parameters = | |||
<pre> | |||
--fasta-paths FASTA_PATHS [FASTA_PATHS ...], -f FASTA_PATHS [FASTA_PATHS ...] | |||
Paths to FASTA files, each containing one sequence. | |||
All FASTA paths must have a unique basename as the | |||
basename is used to name the output directories for | |||
each prediction. | |||
--is-prokaryote-list IS_PROKARYOTE_LIST [IS_PROKARYOTE_LIST ...] | |||
Optional for multimer system, not used by the single | |||
chain system. This list should contain a boolean for | |||
each fasta specifying true where the target complex is | |||
from a prokaryote, and false where it is not, or where | |||
the origin is unknown. These values determine the | |||
pairing method for the MSA. | |||
--max-template-date MAX_TEMPLATE_DATE, -t MAX_TEMPLATE_DATE | |||
Maximum template release date to consider (ISO-8601 | |||
format - i.e. YYYY-MM-DD). Important if folding | |||
historical test sets. | |||
--db-preset {reduced_dbs,full_dbs} | |||
Choose preset model configuration - no ensembling with | |||
uniref90 + bfd + uniclust30 (full_dbs), or 8 model | |||
ensemblings with uniref90 + bfd + uniclust30 (casp14). | |||
--model-preset {monomer,monomer_casp14,monomer_ptm,multimer} | |||
Choose preset model configuration - the monomer model, | |||
the monomer model with extra ensembling, monomer model | |||
with pTM head, or multimer model | |||
--benchmark, -b Run multiple JAX model evaluations to obtain a timing | |||
that excludes the compilation time, which should be | |||
more indicative of the time required for inferencing | |||
many proteins. | |||
--use-precomputed-msas | |||
Whether to read MSAs that have been written to disk. | |||
WARNING: This will not check if the sequence, database | |||
or configuration have changed. | |||
--data-dir DATA_DIR, -d DATA_DIR | |||
Path to directory with supporting data: AlphaFold | |||
parameters and genetic and template databases. Set to | |||
the target of download_all_databases.sh. | |||
--docker-image DOCKER_IMAGE | |||
Alphafold docker image. | |||
--output-dir OUTPUT_DIR, -o OUTPUT_DIR | |||
Output directory for results. | |||
--use-gpu Enable NVIDIA runtime to run with GPUs. | |||
--gpu-devices GPU_DEVICES | |||
Comma separated list of devices to pass to | |||
NVIDIA_VISIBLE_DEVICES. | |||
--cpus CPUS, -c CPUS Number of CPUs to use. | |||
</pre> | </pre> | ||
Revision as of 18:05, 14 February 2022
- Load alphafold module
module load alphafold/2.1.1
Parameters
--fasta-paths FASTA_PATHS [FASTA_PATHS ...], -f FASTA_PATHS [FASTA_PATHS ...]
Paths to FASTA files, each containing one sequence.
All FASTA paths must have a unique basename as the
basename is used to name the output directories for
each prediction.
--is-prokaryote-list IS_PROKARYOTE_LIST [IS_PROKARYOTE_LIST ...]
Optional for multimer system, not used by the single
chain system. This list should contain a boolean for
each fasta specifying true where the target complex is
from a prokaryote, and false where it is not, or where
the origin is unknown. These values determine the
pairing method for the MSA.
--max-template-date MAX_TEMPLATE_DATE, -t MAX_TEMPLATE_DATE
Maximum template release date to consider (ISO-8601
format - i.e. YYYY-MM-DD). Important if folding
historical test sets.
--db-preset {reduced_dbs,full_dbs}
Choose preset model configuration - no ensembling with
uniref90 + bfd + uniclust30 (full_dbs), or 8 model
ensemblings with uniref90 + bfd + uniclust30 (casp14).
--model-preset {monomer,monomer_casp14,monomer_ptm,multimer}
Choose preset model configuration - the monomer model,
the monomer model with extra ensembling, monomer model
with pTM head, or multimer model
--benchmark, -b Run multiple JAX model evaluations to obtain a timing
that excludes the compilation time, which should be
more indicative of the time required for inferencing
many proteins.
--use-precomputed-msas
Whether to read MSAs that have been written to disk.
WARNING: This will not check if the sequence, database
or configuration have changed.
--data-dir DATA_DIR, -d DATA_DIR
Path to directory with supporting data: AlphaFold
parameters and genetic and template databases. Set to
the target of download_all_databases.sh.
--docker-image DOCKER_IMAGE
Alphafold docker image.
--output-dir OUTPUT_DIR, -o OUTPUT_DIR
Output directory for results.
--use-gpu Enable NVIDIA runtime to run with GPUs.
--gpu-devices GPU_DEVICES
Comma separated list of devices to pass to
NVIDIA_VISIBLE_DEVICES.
--cpus CPUS, -c CPUS Number of CPUs to use.