
[DRAFT - do not review] llamacpppython/run GPU + huggingface/download preview#11255

Closed
pinin4fjords wants to merge 41 commits into nf-core:master from pinin4fjords:pinin4fjords/llamacpp-gpu-fix

Conversation

@pinin4fjords
Member

Not for review / will be closed

This draft is open only to preview the combined diff of an existing PR (#11053) plus some proposed follow-up changes on top. It will be closed without merging. Any actual changes will land via that existing PR and/or a PR back into its source branch.

No action needed on this one; leaving it as draft to avoid reviewer attention.

What's being previewed

On top of #11053, this branch:

  • Drops the hand-rolled Dockerfile in favour of an environment.gpu.yml (direct cu124 wheel URL, conda-forge::cuda-runtime pinned to 12.4, python=3.11). The image is built via a Wave freeze with LD_LIBRARY_PATH=/opt/conda/lib baked in through wave --config-env, so conda-installed CUDA libraries are on the loader path without a script-side export.
  • Rewrites the container directive as a four-URL dual-container ternary mirroring ribodetector's shape (singularity https blob URLs + Wave docker URLs, branched on task.accelerator).
  • Drops label 'process_gpu' — accelerator allocation is pipeline-controlled per the draft GPU module spec (docs: add GPU module guidelines website#4142).
  • Converts the CPU singularity URL from oras:// to https:// blob form so both CPU and GPU paths use the same URL scheme.
  • Adds tests/main.gpu.nf.test, tests/nextflow.gpu.config, and a pre-generated main.gpu.nf.test.snap so the existing GPU CI workflow (nf-test-gpu.yml) picks the tests up via the "gpu" tag.
  • Removes nextflow.enable.moduleBinaries from tests/nextflow.config (no longer needed once fully template-based).
  • Fixes a stray - prompt_file: fragment in both meta.yml input blocks.
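
The environment.gpu.yml described above might look roughly like the following. This is an assumed sketch reconstructed from the bullet points: the channel layout and the wheel URL path are placeholders, and only the cuda-runtime/python pins come from the PR text.

```yaml
# Hypothetical sketch of environment.gpu.yml -- not the file from the branch.
channels:
  - conda-forge
dependencies:
  - conda-forge::python=3.11
  - conda-forge::cuda-runtime=12.4
  - pip
  - pip:
      # direct cu124 wheel URL for llama-cpp-python (placeholder path)
      - https://github.com/abetlen/llama-cpp-python/releases/download/<release>/llama_cpp_python-<version>-cp311-cp311-linux_x86_64.whl
```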
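
The four-URL dual-container ternary could take a shape like this. The blob digests and Wave tags below are placeholders, not the real URLs; the branching on task.accelerator and the singularity/docker split follow ribodetector's pattern as described above.

```nextflow
// Sketch only: <sha256-*> and <tag-*> stand in for real blob digests / Wave tags.
container "${ workflow.containerEngine == 'singularity' ?
    ( task.accelerator ?
        'https://community-cr-prod.seqera.io/docker/registry/v2/blobs/sha256/<sha256-gpu>/data' :
        'https://community-cr-prod.seqera.io/docker/registry/v2/blobs/sha256/<sha256-cpu>/data' ) :
    ( task.accelerator ?
        'community.wave.seqera.io/library/llama-cpp-python_cuda-runtime:<tag-gpu>' :
        'community.wave.seqera.io/library/llama-cpp-python:<tag-cpu>' ) }"
```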
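
With the 'process_gpu' label dropped, tests/nextflow.gpu.config presumably requests the accelerator at the test level instead. This is an assumed sketch of what such a config could contain, not the file itself:

```nextflow
// Assumed contents of tests/nextflow.gpu.config: allocate one GPU to the
// process under test and expose it to the container engine.
process {
    accelerator = 1
}
docker.runOptions = '--gpus all'
```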

Validation

  • Wave: CPU container rebuild from environment.yml reproduces the existing hash byte-for-byte. GPU container built from environment.gpu.yml.
  • CPU nf-test passes locally under both -profile docker and -profile singularity.
  • GPU nf-test passes on a Tesla T4 (g4dn.xlarge) under -profile docker,gpu — end-to-end Gemma-3-1B inference ran at ~143 tokens/sec, and both real and stub tests produce stable snapshots.
  • nf-core modules lint matches ribodetector's warning set (a known Wave-tag / version-heuristic limitation on GPU containers).

toniher and others added 30 commits March 25, 2026 18:55
Not so many assertions for stub test

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>
output to ${prefix}

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>
output to ${prefix}

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>
output to ${prefix}

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>
toniher and others added 11 commits April 9, 2026 16:55
…ec compliant

Replace the hand-rolled Dockerfile with a Wave-built GPU container sourced
from environment.gpu.yml (pinned CUDA 12.4 runtime + abetlen cu124 wheel),
restructure the container directive to follow the dual-container pattern
used by ribodetector, drop 'process_gpu' label (accelerator is pipeline
controlled per the GPU module spec), and add a main.gpu.nf.test plus
nextflow.gpu.config so the GPU CI workflow picks up the tests via the
'gpu' tag.

Also:
- Convert CPU singularity URL from oras:// to https:// blob form for
  consistency with GPU URL (matches ribodetector convention).
- Drop nextflow.enable.moduleBinaries from tests/nextflow.config (now
  template-based, not binary-based).
- Fix stray '- prompt_file:' text in both meta.yml input blocks.

GPU container was validated end-to-end on a g4dn.xlarge (Tesla T4):
library loads, supports_gpu_offload=True, real gemma-3-1b inference
runs at ~143 tokens/sec. Snapshot file generated on the same host.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@pinin4fjords
Member Author

Preview closed. Follow-up work routes through the existing PR #11053 source branch.
