Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

After "WaitingForReturnCode ", pipeline stuck #105

Open
jaewon-cho opened this issue Jan 11, 2022 · 4 comments
Open

After "WaitingForReturnCode ", pipeline stuck #105

jaewon-cho opened this issue Jan 11, 2022 · 4 comments

Comments

@jaewon-cho
Copy link

I was running the ctat-mutation pipeline with singularity.
I have done the "test" data before

Here is my script:
singularity exec -e -B ${CTAT_GENOME_LIB}:/data/hemberg/jaewon/ctat/new5/GRCh37_gencode_v19_CTAT_lib_Mar012021.plug-n-play/ctat_genome_lib_build_dir/ ~/ctat_mutations.v3.2.0.simg ~/ctat-mutations-CTAT-Mutations-v3.2.0/ctat_mutations --left test_data/SRR11619681_1.fastq --right test_data/SRR11619681_2.fastq --genome_lib_dir ~/sfa_jaewon/ctat/new5/GRCh37_gencode_v19_CTAT_lib_Mar012021.plug-n-play/ctat_genome_lib_build_dir/ -O peto --cpu 6 --sample_id peto --boosting_method=none

The problem is

[2022-01-11 20:28:44,46] [info] BackgroundConfigAsyncJobExecutionActor [2aat_mutations.SplitIntervals:NA:1]: executing: /usr/bin/env bash /PHShome/jst_data/peto/cromwell-executions/ctat_mutations/2a83efc2-5d77-4cde-a0fd-38bb/call-SplitIntervals/execution/script
[2022-01-11 20:28:47,62] [info] BackgroundConfigAsyncJobExecutionActor [2aat_mutations.StarAlign:NA:1]: job id: 16935
[2022-01-11 20:28:47,63] [info] BackgroundConfigAsyncJobExecutionActor [2aat_mutations.SplitIntervals:NA:1]: job id: 16952
[2022-01-11 20:28:47,65] [info] BackgroundConfigAsyncJobExecutionActor [2aat_mutations.SplitIntervals:NA:1]: Status change from - to WaitingForRetur
[2022-01-11 20:28:47,65] [info] BackgroundConfigAsyncJobExecutionActor [2aat_mutations.StarAlign:NA:1]: Status change from - to WaitingForReturnCode

after showing this status, then the pipeline was stuck
I found that from "stdout" in "call-StarAlign/execution"

Jan 11 15:28:44 ..... started STAR run
Jan 11 15:28:44 ..... loading genome
Jan 11 15:32:26 ..... started 1st pass mapping
Jan 11 15:35:28 ..... finished 1st pass mapping
Jan 11 15:35:32 ..... inserting junctions into the genome indices
Jan 11 15:44:03 ..... started mapping
Jan 11 15:47:17 ..... finished mapping
Jan 11 15:47:47 ..... started sorting BAM
Jan 11 15:47:49 ..... finished successfully

I found from my previous result that after Star Alignment, there should be "call-MarkDuplicates", but currently, there is no "call-MarkDuplicates".

Thank you

@brianjohnhaas
Copy link
Collaborator

brianjohnhaas commented Jan 12, 2022 via email

@jaewon-cho
Copy link
Author

Thank you for your comment
But, actually, I have to run more than 10K samples. It is impossible to kill and rerun everything.
Is there any way not to stick the process? (current job has been stopped the whole night yesterday)

@brianjohnhaas
Copy link
Collaborator

brianjohnhaas commented Jan 12, 2022 via email

@jaewon-cho
Copy link
Author

Thank you for your kind and rapid response.
I really appreciate that.
Actually, I am planning to use the LSF system not running in the local environment.
Before sending the queue, I was trying with just one sample if it still runs appropriately.
Thank you again

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants