-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
A more robust multistep approach #4
Comments
http://link.springer.com/article/10.1385/MO:23:4:443 This currently fails because there are multiple valid DOIs present and the shortest one is not the right one. Is there some way other aside from shortness one can grade plausibility by? |
@Deleetdk Maybe we should check if the DOI is visible. The other DOIs are in the URLs of the cited papers. |
Can check if it's in the visible part of the code. Like not in href or .On Nov 3, 2016 19:49, "Oliver" [email protected] wrote:
|
Actually, maybe we should check the meta tags as well. The site you linked to has a meta tag for the DOI. |
Also, the DOI handbook states that one should strive to display it as As such, we should probably rank them in order of "correct" format:
|
A better idea is to use a multi-step approach:
This should cover pretty much everything, but could be slow depending on how long it takes to look up DOIs. Unless a page really has multiple valid DOIs (if it has a reference list!), then this should find all the valid ones fairly quickly.
The text was updated successfully, but these errors were encountered: