Migration: clearly distinguish which disease ID a specific Mondo ID _was sourced_ from #627

Open
matentzn opened this issue Aug 5, 2024 · 6 comments


@matentzn
Member

matentzn commented Aug 5, 2024

We should make a note when a disease is sourced from, say, an OMIM ID. Right now, if we have two "equivalentTo" xrefs, we don't know which one is the "original one". They look the same. I think it's good to know the "original one".

Represent as: metadata tag on xref, e.g. MONDO:originalSource.

@joeflack4 changed the title from "Migration: clearly distinction which disease ID a specific Mondo ID _was sourced_ from" to "Migration: clearly distinguish which disease ID a specific Mondo ID _was sourced_ from" on Aug 5, 2024
@joeflack4
Contributor

Hmm, interesting.

I don't know the details of this, but my immediate thought is: Is it really the case that 100% of the time, the first source that we add to a mondo disease is really that special / guaranteed to be a "central authority"? What if we later find that a disease we get from a source was not created by that source, and that the primary origin or authority of that disease actually came from another source?

Also these alternative solutions popped into my head: (i) MONDO:propertyAdded DATETIME (this would introduce a lot of axioms / text), (ii) MONDO:primarySource.

@matentzn
Member Author

matentzn commented Aug 6, 2024

Is it really the case that 100% of the time, the first source that we add to a mondo disease is really that special / guaranteed to be a "central authority"?

It is a good question, not easily answered, but I think regardless of the answer it is good to know the first source, and give it some precedence over subsequently mapped terms when it comes to determining the identity of a term!

@twhetzel
Contributor

twhetzel commented Aug 9, 2024

I had the same question, since I believe the original source will be influenced by the order in which the lex files are reviewed, as things currently stand.

As far as the representation in Mondo goes, @matentzn did you have more thoughts on this? It's already confusing to know which ID equivalentTo refers to in an xref when there is more than one CURIE present. For example, for MONDO:0000179 there is xref: Orphanet:2671 {source="GARD:0000102", source="MONDO:equivalentTo", source="OMIM:256520"}.

@matentzn
Member Author

xref: Orphanet:2671 {source="GARD:0000102", source="MONDO:equivalentTo", source="OMIM:256520"}

This means that Orphanet:2671 is equivalent, and GARD:0000102 and OMIM:256520 are giving evidence to that equivalence. If you add:

xref: Orphanet:2671 {source="GARD:0000102", source="MONDO:equivalentTo", source="OMIM:256520", source="MONDO:originalSource"}

It will state, in addition to the above, that Orphanet:2671 was the original term that gave rise to the existence of the term. So I would suggest we go in this direction?
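To make that reading concrete, here is a minimal Python sketch. It is not part of any Mondo tooling: `parse_xref` and `interpret` are hypothetical helpers, `MONDO:originalSource` is only the tag proposed in this thread, and treating every `source` value that starts with `MONDO:` as a flag (and everything else as evidence) is an assumption made for illustration.

```python
import re

# Hypothetical helpers for illustration only; not an existing Mondo API.
XREF_RE = re.compile(r'^xref:\s*(?P<curie>\S+)\s*(?:\{(?P<quals>.*)\})?\s*$')
SOURCE_RE = re.compile(r'source="([^"]*)"')


def parse_xref(line):
    """Split an OBO xref line into its target CURIE and its source values."""
    match = XREF_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not an xref line: {line!r}")
    sources = SOURCE_RE.findall(match.group("quals") or "")
    return match.group("curie"), sources


def interpret(line):
    """Read the source tags the way the comment above describes them."""
    curie, sources = parse_xref(line)
    # Assumption: MONDO:* values are flags about the xref itself,
    # all other values are external terms giving evidence for it.
    flags = {s for s in sources if s.startswith("MONDO:")}
    evidence = [s for s in sources if not s.startswith("MONDO:")]
    return {
        "xref": curie,
        "equivalent": "MONDO:equivalentTo" in flags,
        "original_source": "MONDO:originalSource" in flags,
        "evidence": evidence,
    }


line = ('xref: Orphanet:2671 {source="GARD:0000102", '
        'source="MONDO:equivalentTo", source="OMIM:256520", '
        'source="MONDO:originalSource"}')
print(interpret(line))
```

Under this reading, the proposed tag only adds one more flag to the xref; the evidence list is unchanged.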

@joeflack4
Contributor

Hmm... but if you say that:

(1) xref: Orphanet:2671 {source="GARD:0000102", source="MONDO:equivalentTo", source="OMIM:256520"}
means that these 2 terms (GARD:0000102 and OMIM:256520) are what "give evidence for equivalence",

then it would seem to follow that:
(2) xref: Orphanet:2671 {source="GARD:0000102", source="MONDO:equivalentTo", source="OMIM:256520", source="MONDO:originalSource"}
means that these 2 terms both "give evidence for equivalence" and "give evidence for this being the MONDO:originalSource"...

@matentzn
Member Author

For me, original source has nothing to do with the other provenance. It means Orphanet:2671 was the original source for the Mondo term. The other provenance tags just mean "these are also mapped to Orphanet:2671 and also happened to be equivalent to Mondo". But yeah, when you have "original source", the other evidence really does not matter that much. But it does not hurt either!
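As a rough sketch of how that could be used, the snippet below picks out the original source among several equivalent xrefs of one term. The helper names are hypothetical, the second xref line is made up for illustration (it is not real Mondo content), and the rule that a term carries at most one `MONDO:originalSource` tag is an assumption, not something decided in this thread.

```python
import re

SOURCE_RE = re.compile(r'source="([^"]*)"')


def original_sources(xref_lines):
    """Return the xref CURIEs that carry the proposed MONDO:originalSource tag."""
    found = []
    for line in xref_lines:
        # "xref: CURIE {qualifiers}" -> the CURIE is the second token.
        curie = line.split()[1]
        if "MONDO:originalSource" in SOURCE_RE.findall(line):
            found.append(curie)
    return found


def single_original_source(xref_lines):
    """Return the one original source, or None if no xref is tagged.

    Assumption: more than one tagged xref per term is treated as an error.
    """
    found = original_sources(xref_lines)
    if len(found) > 1:
        raise ValueError(f"multiple original sources: {found}")
    return found[0] if found else None


xrefs = [
    # From this thread, with the proposed tag added.
    'xref: Orphanet:2671 {source="GARD:0000102", source="MONDO:equivalentTo", '
    'source="OMIM:256520", source="MONDO:originalSource"}',
    # Made-up second equivalent xref, for illustration only.
    'xref: OMIM:256520 {source="MONDO:equivalentTo"}',
]
print(single_original_source(xrefs))
```

Both xrefs are equivalent, but only the tagged one is reported, independently of which evidence sources are attached to it.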

3 participants