Skip to content

rki-mf1/crymco

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Static Badge

crymco

Detection of cryptic mutation profiles in SARS-CoV-2 metagenomes through variant co-occurrence

System setup and call

  1. Install Nextflow (if not yet in working environment) via
conda create -n nextflow -c bioconda nextflow==25.10.2
conda activate nextflow
  1. Download pipeline
git clone https://github.com/Krannich479/crymco.git
cd crymco
  1. Fill I/O section (files) in nextflow.config
  2. Run pipeline via
nextflow run main.nf -profile local,mamba

Principal pipeline steps

  • Alignment + Variant Calling + Phasing + Haplotagging
  • Reformatting haplotype table into Nextstrain variant nomenclature
    • bin/devider2hdm.py
  • Normalize mutation frequency table + join with haplotype table + compute distance matrix + compute single-linkage hierarchical clustering
    • bin/haploDistance.py

🚧 Open TODOs and improvements

  • The current PNG of the last pipeline step is hardly readable as static image. Needs adjustment; likely in python script
  • Numeric output would be usefull I guess. Printing/using the distance matrix is likely a good starting point
  • There might be a more sophisticated clustering method for this task than currently implemented (e.g. skbio has a module for UPGMA)
  • Generating the mutation frequency table needs documentation for third-party users

About

Detection of cryptic mutation profiles in SARS-CoV-2 metagenomes through variant co-occurrence

Resources

License

Stars

Watchers

Forks

Contributors