Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can we now normalize all JSON+LD that goes into MongoDB? #7

Open
justinccdev opened this issue May 24, 2018 · 0 comments
Open

Can we now normalize all JSON+LD that goes into MongoDB? #7

justinccdev opened this issue May 24, 2018 · 0 comments

Comments

@justinccdev
Copy link
Member

So, now that the JSON is going into MongoDB, we can do some nice stuff like find the total count of entries which are about samples

db.samples.find({ 'mainEntity.@type': 'Sample'}).count()

However, for this to work, the JSON+LD going in needs to be normalized (and we need to transform RDFa and Microdata to this normalized form as much as possible when we handle those).

@innovationchef I think you looked at this before in the older scraper, using the pyld library. I remember it wasn't possible to use where it wasn't completely happy about the jsonld format. But perhaps we could use it in the short term with biosamples at least. What do you think?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant