Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consolidation scripts: improvements #10

Open
MonikaBarget opened this issue Sep 15, 2023 · 3 comments
Open

Consolidation scripts: improvements #10

MonikaBarget opened this issue Sep 15, 2023 · 3 comments

Comments

@MonikaBarget
Copy link
Contributor

  • to avoid data loss, use dropna=False when data are merged!
  • strip excessive white space?
  • Anreicherung von Ortsdaten mit Territorien des Reiches: Wikidata, Herder, Niklas Alt!? Also integrate place data collected for the Mainz Topographia.
  • "df. merge": be sure to use (how="left")
@MonikaBarget
Copy link
Contributor Author

Transfer list of "Reichsstände" (see Regensburg meeting!) to CSV on Github: how many direct matches with places / cities are possible?

@MonikaBarget
Copy link
Contributor Author

Personen: persons_ProfAPI.csv -> Factoid_PersonNames_merged.csv -> neue IDs für Profs und Studies!

@MonikaBarget
Copy link
Contributor Author

MonikaBarget commented Sep 15, 2023

Orte: Ortsontologie_Geocoded_geprüft.xlsx: Spaltennamen anpassen!!

Verbinden mit 2023_01_Places_AP3_Topographia.xlsx?

Places_Geonames_Wikidata.csv (utf-8): geonames_id and wikidata_id

place_modern = place_name

ID in Topographia = topographia_id?
Place_HRE = place_Topographia

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant