Running the Nextflow Pipeline¶
Overview¶
The main pipeline (main.nf) executes all three workflows in sequence:
- Short-read processing for Illumina data
- Long-read processing for PacBio/Oxford Nanopore data
- Reference vs modified genome comparison using SyRI
Running Main Workflow¶
Running the pipeline is a two-step process: validate inputs first, then run Nextflow.
Step 1 — Validate inputs¶
This writes validated files to data/valid/run_YYYYMMDD_HHMMSS/ and produces data/valid/validated_params.json (at the top level, always at a fixed path). See the Validation Overview for details on what this file contains.
Step 2 — Run the pipeline¶
The -params-file data/valid/validated_params.json flag loads the parameters generated by the validation step. It overrides the defaults in nextflow.config and automatically sets which workflows to run (run_illumina, run_nanopore, run_pacbio) and the validated file paths — so no manual parameter flags are needed for file inputs.
Nextflow Options¶
| Option | Description |
|---|---|
-resume |
Resume a pipeline run from the point where it previously stopped or failed. |
-with-report |
Generate a visual HTML report of the workflow execution, including task durations, resource usage, and statuses. The report is saved by default to data/outputs/logs/report.html. |
-with-timeline |
Generate a timeline visualization showing when each pipeline process started and finished. The timeline is saved by default to data/outputs/logs/timeline.html. |
-with-dag |
Generate a directed acyclic graph (DAG) illustrating task dependencies in the workflow. |
Pipeline Options¶
| Option | Description | Default |
|---|---|---|
--out_dir |
Output directory | data/outputs |
--max_cpu |
Maximum CPUs per process | 1 |
--clean_work |
Remove work directory after successful run | true |
--help |
Display help message | – |
Next Steps¶
After running the pipeline:
- Review the Tool Parameter Reference for details on analysis thresholds and their rationale
- Review Runtime Messages to understand pipeline progress
- Explore the Output Directory Structure
- Check Pipeline Visualization for workflow diagrams