small_RNA_seq

Scripts for analysis of small RNA-seq data with a focus on tRNA-derived fragments, particularly CCA-terminating 3' tRFs. These scripts contain absolute file paths from my computing environment at CSHL, provided for reproducibility. Users will need to adjust the paths to match their own system.

Requirements

Use index_generation/build_bowtie2_index.sh to build an index from the mature tRNA annotation, minus CCA, downloaded from GtRNAdb (https://gtrnadb.org).
Use index_generation/build_bowtie2_index.sh to build an index from the genome, optionally with tRNA sequences masked. See index_generation/tRNA_mask_readme.txt for details.
miRNA annotation. See index_generation/small_RNA_annotation_readme.txt for details. Can be modified to include piRNAs if necessary.
Install dependencies with conda env create -f small_RNA_seq.

Work flow

Scripts should be executed in order using the corresponding wrapper script, which will loop through all samples in a directory and submit jobs to run in parallel, except for steps 1 and 9, which are executed directly.

0_download_sra.sh : if necessary, download fastq files using the metadata file downloaded from SRA explorer (https://sra-explorer.info) in tsv format.
1_adapter_trim.sh : trim adapters and filter by size and quality. Requires one or more adapter sequences in the fastq directory under "adapter.fa".
1.5_remove-spike.sh : if necessary, remove spike-in reads for later quantification.
2_align_tRF.sh : align reads to the tRNA annotation.
3_align_remaining.sh : align unmapped reads from the previous step to the genome.
4_total_mapped.sh : count the total number of aligned reads for later normalization.
5_bed_intersect.sh : intersect reads with the tRNA, miRNA, and piRNA annotations.
6_counts_from_bed.sh : classify and count reads. The normalization and count files should be moved to a local folder for further analysis (step 9).
7_annnotate_counts.R : pool reads from across samples, normalize to total mapped or spike-in reads, and export for differential expression analysis.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

small_RNA_seq

Requirements

Work flow

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
index_generation		index_generation
wrappers		wrappers
0_download_SRA.sh		0_download_SRA.sh
1.5_remove-spike.sh		1.5_remove-spike.sh
1_adapter_trim.sh		1_adapter_trim.sh
2_align_tRF.sh		2_align_tRF.sh
3_align_remaining.sh		3_align_remaining.sh
4_total_mapped.sh		4_total_mapped.sh
5_bed_intersect.sh		5_bed_intersect.sh
6_counts_from_bed.sh		6_counts_from_bed.sh
7_annnotate_counts.R		7_annnotate_counts.R
README.md		README.md
counts_from_bed.R		counts_from_bed.R
small_RNA_seq.yml		small_RNA_seq.yml

Folders and files

Latest commit

History

Repository files navigation

small_RNA_seq

Requirements

Work flow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages