Skip to content

TravisWheelerLab/nail-benchmarks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

36 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

nail-benchmarks

A collection of reproducible benchmarks for nail

Requirements

You'll need the following available on your system path:

  • nail
  • MMseqs2
  • HMMER3
  • easel (comes with HMMER3 distributions)
  • the create-profmark binary (comes with HMMER3 distributions)

Download sequence data

This benchmark was originally run using Pfam version 36.0 and Swissprot release-2023_05

To download the data, you can run

$ ./scripts/download-data.sh

which will place Pfam seed alignments & Swissprot sequences in the data/ directory:

$ tree data/
data
β”œβ”€β”€ long-seq
β”‚Β Β  β”œβ”€β”€ query
β”‚Β Β  β”‚Β Β  β”œβ”€β”€ 1.query.fa
β”‚Β Β  β”‚Β Β  β”œβ”€β”€ 2.query.fa
β”‚Β Β  β”‚Β Β  β”œβ”€β”€ 3.query.fa
β”‚Β Β  β”‚Β Β  β”œβ”€β”€ 4.query.fa
β”‚Β Β  β”‚Β Β  β”œβ”€β”€ 5.query.fa
β”‚Β Β  β”‚Β Β  └── 6.query.fa
β”‚Β Β  └── target
β”‚Β Β      β”œβ”€β”€ 1.target.fa
β”‚Β Β      β”œβ”€β”€ 2.target.fa
β”‚Β Β      β”œβ”€β”€ 3.target.fa
β”‚Β Β      β”œβ”€β”€ 4.target.fa
β”‚Β Β      β”œβ”€β”€ 5.target.fa
β”‚Β Β      └── 6.target.fa
β”œβ”€β”€ pfam.sto
β”œβ”€β”€ uniprot.tar.gz
β”œβ”€β”€ uniprot_sprot.dat.gz
β”œβ”€β”€ uniprot_sprot.fasta
β”œβ”€β”€ uniprot_sprot.fasta.ssi
β”œβ”€β”€ uniprot_sprot.xml.gz
└── uniprot_sprot_varsplic.fasta.gz

Build the benchmark

To build the benchmark, run

$ ./scripts/build-benchmark.sh

Run the benchmark

To run the benchmark, run

$ ./scripts/run-all.sh

Produce plots

To produce the plots, run

$ python ./scripts/plots.py ./benchmark/

About

A collection of reproducible benchmarks for nail

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors