Overview

This pipeline is a Nextflow-based workflow designed for the analysis of pan serotype Dengue virus (DENV1-4) genomic data. It processes raw sequencing data (Illumina or Nanopore) to identify Dengue virus serotypes, perform genotyping/lineage classification, generate consensus sequences, detect gene mutation, and generate a FHIR-compliant genomics bundle.

Key Features

  • Multi-platform Support: Processes raw read data from diverse platforms.

  • Comprehensive Typing: Serotyping (DENV-1 to DENV-4), and genotyping classification.

  • Host DNA Removal: Removes human host contamination from clinical samples.

  • Gene mutation: Gene mutation detection by Nextstrain.

  • Clinical Integration: Merges genomic data with clinical metadata.

  • Quality Control: QC reporting with MultiQC.

Key Outputs

  • Genome assembly

  • Gene mutation list

  • Serotype and Genotype classification

  • FHIR-compliant genomic reports

  • Quality control metrics

Directory Structure

denv-to-fhir-full
├── main.nf                             # Main workflow
├── nextflow.config                     # Configuration and parameters
├── workflows/
│   ├── illumina.nf                     # Illumina sub-workflow
│   ├── nanopore.nf                     # Nanopore sub-workflow
│   ├── trimming.nf                     # Read trimming
│   ├── host_removal.nf                 # Host read removal
│   ├── serotyping.nf                   # Serotype classification
│   ├── genotyping.nf                   # Genotype classification
│   ├── fhir.nf                         # FHIR Bundle generation
│   ├── validate_fhir.nf                # FHIR validation
│   ├── merge_clinical_data.nf          # Clinical metadata merge
│   ├── upload_fhir.nf                  # FHIR server upload
│   ├── report.nf                       # QC and sample report generation
│   └── utils.nf                        # Utility functions
├── scripts/
│   ├── annotated_to_fhir.py            # Consensus-to-FHIR converter
│   ├── clinical_metadata_parser.py     # Patient/org/practitioner parser
│   ├── serotype_classification.py      # Serotype classifier
│   ├── generate_dengue_report.py       # Per-sample report generator
│   ├── get_access_token.py             # FHIR access token retriever
│   ├── upload_fhir.py                  # FHIR upload script
│   ├── merge_clinical_fhir.py          # FHIR genomics + clinical data merger
│   └── get_versions.py                 # Software version collector
├── data/
│   ├── NGS/                            # Input FASTQ files
│   ├── references/                     # Reference genomes (DENV-1–4 & Sylvatic)
│   ├── patient_clinical_metadata.csv   # Patient metadata
│   ├── organization_metadata.csv       # Organization metadata
│   └── practitioner_metadata.csv       # Practitioner metadata
└── tools/
    └── fhir-validator.jar              # HL7 FHIR validator