Some questions about data preprocessing. #7

Huahuatii · 2023-09-20T06:54:29Z

Your work is very interesting, and I would like to use portal-sc to run some tests on our dataset. It's great to see the work you've done in preprocess_memory_efficient. However, I've noticed that the preprocessing order seems to differ from the standard Scanpy workflow. Is there a specific reason for this difference?

jiazhao97 · 2023-09-21T09:41:23Z

Hi there,

Thank you for your interest in our Portal method! In Portal, we select highly variable genes with flavor 'seurat_v3'. This flavor expects count data, while the other flavors expect logarithmized data (https://scanpy.readthedocs.io/en/stable/generated/scanpy.pp.highly_variable_genes.html). Therefore, Portal selects genes before log-transforming the data, whereas the standard scanpy pipeline log-transforms the data first and then selects genes with another flavor.

Best,
Jia
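
For readers comparing the two orders, here is a minimal sketch of the difference described in the reply above. It is not Portal's actual code: the function names, `n_top_genes=2000`, and `target_sum=1e4` are illustrative assumptions, and the exact steps inside preprocess_memory_efficient may differ. Only the relative position of gene selection and log transformation is taken from the reply. Note that `flavor="seurat_v3"` additionally requires the scikit-misc package.

```python
def select_then_log(adata, n_top_genes=2000):
    """Order described in the reply: pick genes on raw counts, then log-transform.

    Hypothetical helper; modifies `adata` in place and returns the HVG subset.
    """
    # Imported lazily so the module loads even where scanpy is not installed.
    import scanpy as sc

    # 'seurat_v3' expects raw counts, so it has to run before any transformation.
    sc.pp.highly_variable_genes(adata, flavor="seurat_v3", n_top_genes=n_top_genes)
    sc.pp.normalize_total(adata, target_sum=1e4)  # assumed scaling factor
    sc.pp.log1p(adata)
    return adata[:, adata.var["highly_variable"]].copy()


def log_then_select(adata, n_top_genes=2000):
    """Common scanpy order: log-transform first, then pick genes.

    Hypothetical helper; modifies `adata` in place and returns the HVG subset.
    """
    import scanpy as sc

    sc.pp.normalize_total(adata, target_sum=1e4)  # assumed scaling factor
    sc.pp.log1p(adata)
    # 'seurat' (the default flavor) expects logarithmized data.
    sc.pp.highly_variable_genes(adata, flavor="seurat", n_top_genes=n_top_genes)
    return adata[:, adata.var["highly_variable"]].copy()
```

Both helpers expect an AnnData object whose `X` holds raw counts when they are called. Passing already log-transformed data to `select_then_log` would violate the input assumption of 'seurat_v3'.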
