-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to load annotation for custom #29
Comments
Hi @LPerlaza When using For the format of this file, please refer to this example (for human): https://resources.aertslab.org/cistarget/motif2tf/motifs-v10nr_clust-nr.hgnc-m0.001-o0.0.tbl. I hope this helps, All the best, Seppe |
Hi Seppe, Thanks for the prompt response. I have generated the motif-to-TF annotation file which I have added in the command as This still doesn't work. Do you have any additional recomendation? Thanks! |
Hi @LPerlaza Can you show the output of from pycistarget.utils import load_motif_annotations
load_motif_annotations(specie = None, fname = "motifs-v10-zfish.tbl") All the best, Seppe |
Hi Seppe, thanks for the tip. Here is what the outp
For your reference, the header of the file motifs-v10-zfish.tbl looks like this:
|
Hi Seppe, do you have any advice? I can see that the column names are as expected. Any tip to track back the problem? |
Hi @LPerlaza I see that your motif annotation table is comma separated, the function expects it to be tab seperated. All the best, Seppe |
Hi Seppe, Thanks! That fixed the problem. I managed to run
which generated the following error:
When I checked the code I realized that the condition of specifying assembly only includes human, mouse and fly assembly versions. I try to pass the error by adding the assembly version for zebra fish using this code:
but I get this error:
I can see the options Thanks! and Happy New Year btw! |
Hi Laura Happy new year to you as well. This is indeed a limitation on how SCENIC+ was written in this main branch, different species are better handled in the code of the development branch (https://github.com/aertslab/scenicplus/tree/development). It is still possible to use the main branch, for this please read these instructions: https://scenicplus.readthedocs.io/en/development/pbmc_multiome_tutorial.html#Notes-on-the-individual-function-calls-in-the-wrapper-function. (under the section: "Notes on the individual function calls in the wrapper function."). I hope this helps. All the best, Seppe |
Hi Seppe, Thanks so much for your suggestions. Looking at the notes is very helpful. I was following the other tutorial and I missed that section. I am running into this error and I can't see why. Do You have any advice of how to troubleshoot it? I fail to see where the problem is. I am trying to run this:
get this error:
Both of The
Thanks! |
Hi Laura The issue is caused because the strands should be formatted as "+", "-" instead of 1 and -1. This is where thing starts going wrong: https://github.com/aertslab/scenicplus/blob/e5ba6fcf42459b6e6b70e27359ddd11289d70cc5/src/scenicplus/utils.py#L40 All the best, Seppe |
Hi Seppe, I managed to run the whole pipeline, thanks so much for all your help. I have now a dataframe with the eRegulons which is great! My conundrum now is that some of my genes of interest don't show up on my eRegulons list. I figured that the main problem due to unknown motifs for genes in Zebrafish. I am working with a motif file that I generated by finding the orthologous genes in human and assuming that if the gene is orthologous between human and zebra fish then the motif that connects a gene to the TF is also the same. I have tried including chicken, mouse and fly annotations to check for more orthologies but this is not improving the results. Do you have any advice on how to include more gene-TF interactions. Also any advice on how to explore the complete list of gene-TF interactions vs the gene-TF on the eRegulon results (gene-TF present in the experiment vs gene-TF of importance) |
Hi @LPerlaza Great to hear the workflow is working. To note, the motif enrichment step does not determine which target genes are found (it only determines what TFs are found). So the gene-TF interactions are not determined by the motifs you include in your analysis. The complete list of TF-gene interactions can be found in the All the best, Seppe |
Hi Seppe, I am working with a bigger dataset (including more cells) now and I had to move my project to a different location (cluster). I was running out of RAM in when running
This was working:
This is not working:
I tried to seearch for the notes you directed me last time but the section is gone. So I am out of ideas to solve this. Could you please indicate me to docs for customized references for the new release please, or tell me how I can build the last version on the new cluster. Thanks! |
Hi @LPerlaza Did you install the development version by any chance? Indeed, these functions have changed a lot. You can still download the none-development version of the code if you want, that one is still the default branch on GitHub. All the best, Seppe |
Hi, I am running the Scenic+plus pipeline with Zebra fish and I am getting
Unable to load annotation for custom
warnings when I runrun_pycistarget
All databases created specifically for Zebra fish have been generated:
I have custom Annotation
and this is how I am running
run_pycistarget
warning
Unable to load annotation for custom
commenting the line:
species='custom',
generates the following error.The text was updated successfully, but these errors were encountered: