Modules
Edit

Test data guidelines specific to nf-core modules

Modules

The key words “MUST”, “MUST NOT”, “SHOULD”, etc. are to be interpreted as described in RFC 2119.

Reuse of existing test data

Pre-existing files from nf-core/test-datasets MUST be reused if at all possible to keep the size of the test data repository minimal.

If appropriate test data does not exist in the modules branch of nf-core/test-datasets, contact the nf-core community on the nf-core Slack #modules channel to discuss possible options.

Test data alternatives for large datasets

Adding test data may not be possible for some modules if the input data is too large or requires a local database.

In these scenarios, use the Nextflow stub feature to test the module.

Refer to the gtdbtk/classify module and its corresponding test script for an example of how to use this feature for module development.

Module test data organisation

Files SHOULD be organised based on the existing structure.

For bioinformatics pipelines, files are typically organised by discipline, organism, platform, or format.

Relatedness of module test data

Downstream or related test data files SHOULD be named based on the upstream file name.

For example, if genome.fasta is used as the upstream file, the output file should be named genome.<new_extension>.

Module test data documentation

Test data files MUST have an entry in the nf-core/test-datasets repository README.

Get started
- What is nf-core?
- Environment setup
- Run your first pipeline
Running
- Overview
- Advanced topics
  - Google Colab
  - Managing work directory growth
- Configuration
  - Overview
  - System requirements
- Reference genomes
- Running pipelines offline
Developing
- Overview
- Components
- Containers
  - ARM64 on Bioconda
  - Seqera Containers
- Documentation
- Institutional profiles
- Migration guides
  - Migrating to topic channels
- Pipelines
- Template syncs
- Testing
Contributing
- Overview
- Components
- Contribution types
- Contributor's list
- Deprecating modules
- Documentation
- Existing pipelines
- New pipelines
- Project proposals
- Reviewing pull requests
  - nf-core bot
  - Review checklists
    
    Components
    nf-core/tools
    Pipelines
  - Reviewing pipeline releases
Specifications
- Overview
- Components
  - Overview
  - Modules
    
    General
    Naming conventions
    Input/output Options
    Documentation
    Parameters
    Resource requirements
    Software
    Testing
    Misc
  - Subworkflows
    
    General
    Naming conventions
    Input output options
    Subworkflow parameters
    Documentation
    Testing
    Misc
- Pipelines
- Reviews
- Test data
Community
- Advisories
- Brand
- Governance
  - Core team tasks
  - Maintainers tasks
- Regulatory
  - Overview
  - Checklist
- Terminology
nf-core/tools
- API
- CLI
  - Installation
  - Pipelines
    
    create
    list
    launch
    create params file
    download
    licences
    rocrate
    lint
    schema
    bump version
    sync
    create logo
  - Modules
    
    list
    info
    install
    update
    remove
    patch
    create
    lint
    test
    bump versions
  - Subworkflows
    
    list
    info
    install
    update
    remove
    create
    lint
    test
  - Test datasets
    
    list
    list branches
    search
  - TUI

Modules Edit