Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

empty output files in dataprep stage #214

Open
villegmb opened this issue Mar 22, 2024 · 3 comments
Open

empty output files in dataprep stage #214

villegmb opened this issue Mar 22, 2024 · 3 comments

Comments

@villegmb
Copy link

Hello. I am having problems with the output file from the dataprep stage. This is the code I am running:

xpore dataprep \
--eventalign /ibex/tmp/c2078/Heat_stress_analysis/xpore/data/25rep1/eventalign.txt \
--out_dir /ibex/tmp/c2078/Heat_stress_analysis/xpore/data/25rep1/dataprep \
--gtf_or_gff /ibex/tmp/c2078/Heat_stress_analysis/xpore/Ahem_transcriptome_with_reference.gtf \
--transcript_fasta /ibex/tmp/c2078/Heat_stress_analysis/scripts/transcripts_util_names_edited.fasta \
--genome \
--n_processes 24

This is my eventalign.txt file:

image

This is my gtf:
image

My Fasta:
image

And my dataprep directory looks like this:
image

Empty:
image

data.json: empty

image

image

image

This is from the running process:

image

image

Can you please tell me what am I doing wrong?

@yuukiiwa
Copy link
Collaborator

Hi @villegmb,

Can you try removing the sample column in your eventalign.txt file and run xpore dataprep again, please?

Thanks!

Best wishes,
Yuk Kei

@villegmb
Copy link
Author

Hi @villegmb,

Can you try removing the sample column in your eventalign.txt file and run xpore dataprep again, please?

Thanks!

Best wishes, Yuk Kei

Hello, I did it, but the outcome did not change. I got exactly the same.

@yuukiiwa
Copy link
Collaborator

yuukiiwa commented Apr 1, 2024

Hi @villegmb,

I suspect that this has to do with how the gene_id (STRG.8) and transcript_id (STRG.446.1) look like in your fasta and gtf files. Can you try changing the gene_id and transcript_id to be separated with _ instead of . in your fasta, gtf, and eventalign.txt files as a . is possibly recognized as transcript_version by xpore dataprep.

Thanks!

Best wishes,
Yuk Kei

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants