-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to build a manifest.tsv file. Can you provide a sample file? #116
Comments
It's a tab-delimited file, as described here: |
thanks for the links! Command error: name reads vcf |
Is it a lot of samples? If possible could you share the VCF files with me please and I can debug |
it's about 1000 vcf files. the following text is the head of vcf files (all like this). i'm reading this function (find_parallel_cluster_split_points) , what information are we processing in this function. thanks! |
I've just tested and having There's also two nextflow parameters that may be causing your VCFs to be excluded. Their defaults made sense for TB, but maybe not for your data.
Sorry, these should be documented - is on my to do list. |
thank you very much! what should i do with "<NON_REF>", can you add a if "sentence" to process this? |
I'll add in a step that removes all non-ACGT alleles. You could wait for that fix, or remove the |
can i reset these two params? or just add these two params in the nextflow command lines?
|
thanks, i'll wait for the update.
|
Add them to the |
ok.
|
hi. i used 1 vcf file as input, replace("<NON_REF>",".") in the vcf and add "--max_variants_per_sample 100000000 --max_genome_proportion_per_sample 1" to the nextflow run command, got the same error too. |
can you share the vcf file? |
#description ############################################ there is only 1 difference: the character "<NON_REF>" was transformed to "." |
I think the cause is that all of the records are getting ignored either because there's no I've also made a toy data set so you can test the installation. Instructions are here: You'll need to update to the new release 0.12.4 to get the toy data, and that version will also handle the |
error messages (test data): i have figured it out. I'm sure that vt is ok. what should I do? |
another question: |
The For ref calls - you'll need to change the GT to be allele that you want considered for genotyping. But also in your VCF the ALT column has |
comma or tab separated? how about pairwise sequencing?
can I use compressed files, like gz or tar.gz?
here is an example of my manifest.tsv, is this ok?
#########################################################################
name vcf reads reads
201550 NGS_Mapping/SNP/gvcf/201550.gvcf.gz 01.Cleandata/201550/201550.1.fq.clean.gz 01.Cleandata/201550/201550.2.fq.clean.gz
183207 NGS_Mapping/SNP/gvcf/183207.gvcf.gz 01.Cleandata/183207/183207.1.fq.clean.gz 01.Cleandata/183207/183207.2.fq.clean.gz
#########################################################################
thanks.
The text was updated successfully, but these errors were encountered: