Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Large genome files #29

Open
elissaralam opened this issue May 25, 2021 · 4 comments
Open

Large genome files #29

elissaralam opened this issue May 25, 2021 · 4 comments

Comments

@elissaralam
Copy link

Hello,

I have two large genome files and would like to use SibeliaZ. I recently read this on a biostars thread "Yes, by SibeliaZ you can get .gfa file out of a fasta file. They recently updated there old version and now large genome file can also be used for the same."
But I cannot find any confirmation that SibeliaZ supports large genome files in your version history. Can you confirm that?

Thanks a lot!
Elissa

@iminkin
Copy link
Collaborator

iminkin commented May 26, 2021

Hi @elissaralam , yes SibeliaZ was developed for handling larger datasets.

@elissaralam
Copy link
Author

Hello,

Thank you for your quick response! SibeliaZ worked perfectly well in generating a global alignment for my two genomes. However, i am trying to generate a gfa file from the maf file using the function "python maf_to_gfa1.py" but i am constantly getting an IndexError "IndexError: list index out of range". Have you encountered this before?

Thank you for your help,
Elissar

@iminkin
Copy link
Collaborator

iminkin commented May 31, 2021

Hi Elissar,

Sorry, but it seems to be a bug in my code. Is it possible for you to share the input and output files you have so that I can try to debug the script?

Ilia

@markopetek
Copy link

I'm also trying to align two large genome files (~840Mbp) of the same species and got the same error with maf_to_gfa1.py.

this is the full error message:

Traceback (most recent call last):
  File "../scripts/maf_to_gfa1.py", line 202, in <module>
    blocks, sequence = split_maf_blocks(args.maf)
  File "../scripts/maf_to_gfa1.py", line 112, in split_maf_blocks
    while prev_column < len(maf[0].body):
IndexError: list index out of range

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants