You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First issue:
I am working with fungal genomes and wanted to use vg call to genotype some samples on a .gbz graph I built with MC. Everything ran as it should; however, I sometimes got negative quality values, some even labelled "PASS" in the filter field.
I am running VG on version 1.60.0.
Here is an example of this (I filtered to keep only negative values):
#vg pack
singularity run docker://quay.io/vgteam/vg:v1.60.0 vg pack \
-x $FULL_REF/20241017_26pg.pg.full.gbz \
-g $FOLDER/03_align/$SAMPLE\_pg_full_20241021.gam \
-o $FOLDER/07_vgCall/00_pack/$SAMPLE.pack \
-Q 5
#vg snarl
singularity run --bind docker://quay.io/vgteam/vg:v1.60.0 vg snarls \
$FULL_REF/20241017_26pg.pg.full.gbz > $FULL_REF/20241017_26pg.pg.full.snarls
#vg call
singularity run --bind docker://quay.io/vgteam/vg:v1.60.0 vg call \
$FULL_REF/20241017_26pg.pg.full.gbz \
-d 1 \
-a \
-r $FULL_REF/20241017_26pg.pg.full.snarls \
-k $FOLDER/07_vgCall/00_pack/$SAMPLE.pack \
-s $SAMPLE \
-z > $FOLDER/07_vgCall/01_genotypes/$SAMPLE.vcf
#combine output with bcftools
bgzip -c $FOLDER/07_vgCall/01_genotypes/$SAMPLE.vcf > $FOLDER/07_vgCall/01_genotypes/$SAMPLE.vcf.gz
tabix -f -p vcf $FOLDER/07_vgCall/01_genotypes/$SAMPLE.vcf.gz
ls $FOLDER/07_vgCall/01_genotypes/*.vcf | xargs -n1 -P0 bgzip
ls $FOLDER/07_vgCall/01_genotypes/*.vcf.gz | xargs -n1 -P0 tabix -f -p vcf
cat $FOLDER/07_vgCall/01_genotypes/*vcf.gz > to_merge.tsv
bcftools merge --file-list to_merge.tsv -o $FOLDER/07_vgCall/02_merge/300_pgVC.vcf.gz
#filter to keep only QUAL < 0
bcftools filter -i 'QUAL < 0' -o $FOLDER/07_vgCall/02_merge/300_pgVC_NegQual.vcf.gz $FOLDER/07_vgCall/02_merge/300_pgVC.vcf.gz
#subset to only the 4 first samples:
bcftools view -s Sample1,Sample2,Sample3,Sample4 -o $FOLDER/07_vgCall/02_merge/subset_NegQual.vcf.gz -O z $FOLDER/07_vgCall/02_merge/300_pgVC_NegQual.vcf.gz
Second issue:
Unrelated to the quality, I have overlapping structural variants on my result files that give me a hard time interpreting. This is an example:
We can see that just two bases are added at the end of the REF sequence, and it is (I guess!) just a two nucleotides insertion rather than a structural variant event. The paths in the "AT" field are also extremely similar, and there is just an additional "<4044" in the ALT part.
How can I interpret all of these? Are these expected?
I wish you a nice day,
Regards,
Marion
The text was updated successfully, but these errors were encountered:
Dear developers,
I have two questions about
vg call
's results:First issue:
I am working with fungal genomes and wanted to use
vg call
to genotype some samples on a .gbz graph I built with MC. Everything ran as it should; however, I sometimes got negative quality values, some even labelled "PASS" in the filter field.I am running VG on version 1.60.0.
Here is an example of this (I filtered to keep only negative values):
subset_300_pgVC_negQual.vcf.gz
Here are my commands to get the result:
Second issue:
Unrelated to the quality, I have overlapping structural variants on my result files that give me a hard time interpreting. This is an example:
We can see that just two bases are added at the end of the REF sequence, and it is (I guess!) just a two nucleotides insertion rather than a structural variant event. The paths in the "AT" field are also extremely similar, and there is just an additional "<4044" in the ALT part.
How can I interpret all of these? Are these expected?
I wish you a nice day,
Regards,
Marion
The text was updated successfully, but these errors were encountered: