-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data from Deutsche Nationalbibliothek #29
Comments
link to the MARC21 format dumps: https://www.dnb.de/DE/Professionell/Metadatendienste/Datenbezug/Gesamtabzuege/gesamtabzuege_node.html |
I have created an archive.org item to hold the MARC21 records for import by ia-bulkmarc-bot. It would be better to extract the DNB id and include it in identifiers, and possibly leverage any DNB author authority control ids, if we can extract them. The MARC record for the above example is: https://openlibrary.org/show-records/marc_dnb_202006/dnb_all_dnbmarc_20200615-2.mrc:0:953 |
The first non-serial (i.e. 'book' / monograph item) in the first DNB MARC file is: Test imported as https://openlibrary.org/books/OL30608448M |
What problems were there with data quality? @GLBW and I can probably help to find heuristics to exclude unwanted material that is part of the DNBs collection, but not part of OpenLibrary's scope, if that is necessary. We definitely should import data from the DNB, as they usually offer rather high quality data about books. |
The Deutsche Nationalbibliothek (DNB) offers its catalogue data under CC0. See: Datendienst "Bibliografische Dienstleistungen" (in german)
The text was updated successfully, but these errors were encountered: