Skip to content

An experimental project on a framework for acquisition of data from open data sources, and supporting several technologies and protocols. The Data Aggregation Lab is a work in progress. It aims to gather information from data providers and aggregators, share experimental results, apply prototypes, and provide demonstrators.

License

Notifications You must be signed in to change notification settings

aisaac/data-aggregation-lab

 
 

Repository files navigation



Open-Data-Acquisition-Lab

Experimental project on a framework for acquisition of data from open data sources, and supporting several technologies and protocols. The Data Aggregation Lab is a work in progress. It aims to gather information from data providers and aggregators, share experimental results, apply prototypes, and provide demonstrators.

Current R&D activities address:

  • Aggregation mechanisms for linked data
  • Aggregation mechanisms for IIIF based on Activity Streams 2.0
  • Aggregation mechanisms for IIIF based on IIIF Collections
  • Aggregation mechanisms for IIIF based on Sitemaps
  • Aggregation mechanisms specific for Schema.org metadata (e.g. HTML Crawlers)
  • Aggregation mechanisms for the WWW (HTML meta, RDFa, RDFaLite, Microdata, etc)
  • Conversion of Schema.org metadata to EDM
  • Metadata profiling

Current operations available in the workbench (for data providers):

  • Register a LOD dataset for aggregation by Europeana - Allows data providers and aggregators to send us the description of a dataset where we will experiment the LOD aggregation mechanisms.
    The guidelines for describing a LOD dataset for Europeana are available in this document.
  • Register a IIIF dataset for aggregation by Europeana - Allows data providers and aggregators to send us the description of a dataset where we will experiment the WWW crawling aggregation mechanisms for extracting structured data within HTML pages.
    IIIF datasets can be used for aggregation via several harvesting methods: IIIF Change Discovery API v0.1, IIIF Collections, and Sitemaps.
  • Register a WWW dataset for aggregation by Europeana - Allows data providers and aggregators to send us the description of a dataset where we will experiment the HTML Crawling and aggregation of micro formats (RDFa, Microdata, etc) and JSON-LD (for example for Schema.org data).
    (a sitemaps.xml file is required)

Current operations available in the workbench (for Europeana aggregation):

  • Manage and process datasets - Allows the Europeana Aggregation team to manage the registered datasets, execute harvests, metadata analysis and conversion.

About

An experimental project on a framework for acquisition of data from open data sources, and supporting several technologies and protocols. The Data Aggregation Lab is a work in progress. It aims to gather information from data providers and aggregators, share experimental results, apply prototypes, and provide demonstrators.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 84.1%
  • XSLT 11.8%
  • HTML 2.7%
  • Other 1.4%