Should we force the ENSG ID to be the unique identifier? #135

hsun3163 · 2022-02-09T20:06:27Z

hsun3163
Feb 9, 2022
Collaborator

After the current annotate_coord step, no records remain, because the log2cpm file has the gene name in its first column instead of the gene ENSG ID.

This issue can easily be fixed by changing an option of the gtf_to_tss_bed function:
bed_template_df = qtl.io.gtf_to_tss_bed(${_input[1]:ar}, feature='transcript', phenotype_id=${phenotype_id_type})

However, the resulting BED output will retain the gene names instead of the ENSG IDs. This leads to the question in the title: should we find a way to force all identifiers to be ENSG IDs?

I lean toward using the ENSG ID as the unique identifier to avoid ambiguity.
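To illustrate the ambiguity concern, here is a minimal sketch of converting gene names to ENSG IDs with a lookup built from GTF attribute strings. It is not part of the pipeline: the attribute lines, gene names, IDs, and the helper names (`parse_attributes`, `build_name_to_ids`, `to_ensg`) are all made up for illustration. Names that map to more than one ENSG ID, or to none, are reported rather than silently assigned.

```python
import re

# Hypothetical GTF attribute strings (column 9 of a GTF file), for illustration only.
GTF_ATTRIBUTES = [
    'gene_id "ENSG00000000001"; gene_name "GENEA";',
    'gene_id "ENSG00000000002"; gene_name "GENEB";',
    'gene_id "ENSG00000000003"; gene_name "GENEB";',  # same symbol, different gene
]


def parse_attributes(attr):
    """Return the key/value pairs of one GTF attribute string as a dict."""
    return dict(re.findall(r'(\S+) "([^"]*)"', attr))


def build_name_to_ids(attribute_lines):
    """Map each gene name to the sorted list of ENSG IDs that carry it."""
    mapping = {}
    for line in attribute_lines:
        fields = parse_attributes(line)
        mapping.setdefault(fields["gene_name"], set()).add(fields["gene_id"])
    return {name: sorted(ids) for name, ids in mapping.items()}


def to_ensg(gene_names, name_to_ids):
    """Split names into uniquely resolved ones and ambiguous or unknown ones."""
    resolved, unresolved = {}, []
    for name in gene_names:
        ids = name_to_ids.get(name, [])
        if len(ids) == 1:
            resolved[name] = ids[0]
        else:
            unresolved.append(name)
    return resolved, unresolved


name_to_ids = build_name_to_ids(GTF_ATTRIBUTES)
resolved, unresolved = to_ensg(["GENEA", "GENEB", "GENEC"], name_to_ids)
```

In this toy input, GENEB cannot be resolved because two ENSG IDs share the symbol, which is the kind of ambiguity that keying on the ENSG ID would avoid.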

marcora · 2022-02-09T21:34:24Z

I often use an ENSGENEID:GENESYM "merge" to avoid ambiguity while keeping some level of legibility.
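A minimal sketch of what such a merged identifier could look like. The helper names `merge_id` and `split_id` and the example values are hypothetical, and the `:` separator is taken from the pattern above.

```python
def merge_id(ensg_id, symbol, sep=":"):
    """Build a combined identifier such as 'ENSG00000000001:GENEA'."""
    return f"{ensg_id}{sep}{symbol}"


def split_id(merged, sep=":"):
    """Recover (ensg_id, symbol); splits on the first separator only."""
    ensg_id, _, symbol = merged.partition(sep)
    return ensg_id, symbol


merged = merge_id("ENSG00000000001", "GENEA")  # made-up ID and symbol
ensg_id, symbol = split_id(merged)
```

Putting the ENSG ID first keeps the identifier unique even when two genes share a symbol, and splitting on the first separator recovers the ID without depending on what the symbol contains.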

