Home

Awesome

TCGA_germline-nf

extraction of germline variants from TCGA data

Dependencies

Install nextflow.

curl -fsSL get.nextflow.io | bash

And move it to a location in your $PATH (/usr/local/bin for example here):

sudo mv nextflow /usr/local/bin

Description

This program takes in input a TCGA data-like folder of VCF files, e.g. myfolder containing all /ids_sample1/sample1.vcf, /ids_sample2/sample2.vcf, /ids_sample3/sample3.vcf, etc.
Then it filters out germline variants from these VCF files, with a minimum coverage --min_DP and variant allelic fraction --min_af.
After that it reformats these files into annovar inputs (by cancer type) and output them in a result folder (--out_folder).

Execution

Nextflow seamlessly integrates with GitHub hosted code repositories:

nextflow run iarcbioinfo/TCGA_germline-nf --TCGA_folder myfolder

Help and options

You can print the help manual by providing --help in the execution command line:

nextflow run iarcbioinfo/TCGA_germline-nf --help