Workflow for calling HPV integration sites and events in Illumina short-read sequencing data.
This will clone the repository. You can run workflow within this directory.
git clone https://github.com/vanessa-porter/illuminaCallHPVInt
To run this workflow, you must have snakemake (v6.12.3) and conda. You can install snakemake using this guide. The remaining dependencies will be downloaded automatically within the snakemake workflow.
- WGS alignment (bam file)
- RNA alignment (bam file)
- Use RNA branch for RNA-seq for now
Config files to specify parameters and paths needed for the workflow. The main parameter to include is the genome path.
genome_path: /path/to/genome/fasta
Main config file to specify input files.
bams:
sampleName_1: /path/to/bam/file
sampleName_2: /path/to/bam/file
A text file can be converted to the samples.yaml file using the scripts/sampletsvtoyaml.py script. The tsv file should have the sample name in one column and the path in another and be tab delimited (no header).
scripts/sampletsvtoyaml.py -t samples.txt -o config/samples.yaml
This is the command to run the workflow with snakemake using conda. The -c parameter can be used to specify maximum number of threads.
snakemake -c 30 --use-conda
The contributors of this project are Vanessa Porter