Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use of 'Activities' or 'Contributions' to describe and organize Entity provenance #10

Open
mbrush opened this issue Mar 21, 2017 · 2 comments
Labels

Comments

@mbrush
Copy link
Member

mbrush commented Mar 21, 2017

Many efforts like ClinGen will have use cases that would benefit from explicitly describing the activities through which Entities in our model (Assertions, Supporting Information, Evidence Lines) are created.
ClinGen's main use case here is to organize provenance information into discrete, traceable objects for a variety of reasons related to the applications they must support, as detailed here.

I imagine other, similar use cases coming form other groups wanting to model evidence and provenance information in fine detail. With this in mind, we should consider possible approaches for allowing for richer description of provenance of Entities in a way that is compatible with and easily harmonized with more compact representations that will support most use cases (where Entities are directly linked to agents who contributed to them and dates of these contributions, using PAV-like relations).

We are currently exploring the creation and use of 'Contribution' objects - essentially representing reified contribution relationships between an agent and an Entity. This is similar in principle to the PROV notion of an 'Attribution' - but extended to allow time stamps and roles to be added in this context.

Alternatively, we could allow for a minimal representation of Activities, whose use is limited to describing provenance of entities, but which would be used to create activity-based paths through the data in the style of PROV (which would describe a VarianntInterpretation as a series of Activities with inputs and outputs and agents).

Diagrams of proposed patterns based on these approaches can be found in the cmap here.

@mbrush mbrush added the clingen label Mar 21, 2017
@mbrush
Copy link
Member Author

mbrush commented Mar 21, 2017

If alternate patterns for describing the same basic content are defined by SEPIO and allowed in practice, we will have to provide some means for post-hoc harmonization (e.g. defining property chains to materialize shortcut relations across the richer model, or pre-defined scripts for programmatic post-processing).

@mbrush
Copy link
Member Author

mbrush commented Mar 21, 2017

On 3-21-17 call, we decided that the v1 ClinGen model would implement the reified Contribution based approach.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant