Pandora mapping is slow #347

iqbal-lab · 2023-10-18T10:37:14Z

Placeholder, as @Danderson123 keeps saying this and I don't want to lose it. Mapping reads from one sample to a
PRG (E. coli + a few thousand AMR genes) is taking 30 mins with 64 cores. I realise our performance stats in the pandora paper are
a) way out of date
b) end-to-end performance of doing pandora compare on 20 samples.
I don't think mapping can ever be as fast as the subsequent steps @Danderson123 does with amira, so it's not realistic to expect that, but I do want to understand whether this is true (and set my expectations on how fast mapping is), or whether it is something weird about Dan's setup.

iqbal-lab · 2023-10-18T10:41:41Z

Actually, this might be a RAM/speed tradeoff, if we lazy-load genes/PRGs into RAM only when we see them?
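
To make the tradeoff concrete, here is a minimal sketch of lazy loading with a bounded cache. It is not pandora's actual implementation: `LazyPRGStore`, `loader`, and `capacity` are hypothetical names, and the loader stands in for whatever reads one gene's PRG from disk. A small `capacity` keeps RAM low but means more reloads (slower), a large one does the opposite.

```python
from collections import OrderedDict
from typing import Callable


class LazyPRGStore:
    """Load per-gene PRG records on first access; keep at most `capacity` in RAM."""

    def __init__(self, loader: Callable[[str], str], capacity: int = 1000):
        self._loader = loader      # hypothetical: reads a single PRG from disk
        self._capacity = capacity  # upper bound on records held in memory
        self._cache: "OrderedDict[str, str]" = OrderedDict()
        self.loads = 0             # how many times we had to go to disk

    def get(self, gene_id: str) -> str:
        if gene_id in self._cache:
            # Cache hit: mark as most recently used and return it.
            self._cache.move_to_end(gene_id)
            return self._cache[gene_id]
        # Cache miss: pay the load cost now.
        record = self._loader(gene_id)
        self.loads += 1
        self._cache[gene_id] = record
        if len(self._cache) > self._capacity:
            # Evict the least recently used record to bound RAM.
            self._cache.popitem(last=False)
        return record
```

Counting `loads` for a given `capacity` on a real read set would show how much of the runtime is repeated loading rather than mapping itself.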

iqbal-lab · 2023-10-18T10:44:29Z

(Hope I'm not misrepresenting you @Danderson123, I can delete this or update it)

Danderson123 · 2023-10-18T10:48:25Z

This is correct. I will try to make plots of runtime vs number of reads after I have looked at the gene calling in more detail. Without lazy loading, the RAM usage was far higher than is reasonable for a laptop.

iqbal-lab · 2023-10-18T10:54:01Z

I have to say @Danderson123, first mapping to an AMR-only PRG and keeping only the reads that map, and then mapping just those to the big PRG, would probably make a significant difference
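
A rough sketch of the filtering step between the two passes, assuming we already have the set of read IDs that hit the AMR-only PRG. How those IDs are pulled out of the first pass is left open here, and `parse_fastq` and `keep_reads` are hypothetical helpers, not part of pandora. It also assumes unwrapped, well-formed FASTQ (exactly 4 lines per read).

```python
from typing import Iterable, Iterator, Set, Tuple

# One FASTQ record: header, sequence, separator, quality.
FastqRecord = Tuple[str, str, str, str]


def parse_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield 4-line FASTQ records; assumes well-formed, unwrapped input."""
    it = iter(lines)
    for header in it:
        header = header.rstrip("\n")
        if not header:
            continue  # skip blank lines between records
        seq = next(it).rstrip("\n")
        sep = next(it).rstrip("\n")
        qual = next(it).rstrip("\n")
        yield header, seq, sep, qual


def keep_reads(records: Iterable[FastqRecord], hit_ids: Set[str]) -> Iterator[FastqRecord]:
    """Keep only the reads whose ID is in hit_ids (reads that hit the small PRG)."""
    for rec in records:
        # Drop the leading '@' and take the ID up to the first whitespace.
        read_id = rec[0][1:].split()[0]
        if read_id in hit_ids:
            yield rec
```

The kept records would then be written out and used as the input for the second mapping against the big PRG, so the expensive pass only sees the reads that matter.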
