An application that will consume all triples that are being harvested by the "besluiten" harvesters. A configuration is in place to process a set of interesting types, these triples will than be set on the /public
LDES stream.
As we are using services that need a specific configuration, we suggest you read the instructions of the specific services when something is unclear.
Below we will add the most important environment variables so you are up to date with adjustments you can make.
- DCR_CRON_PATTERN_DELTA_SYNC:
0 * * * *
Every hour new triples will be consumed from the harvester. - DCR_START_FROM_DELTA_TIMESTAMP:
2024-10-01
The date from where we want to start digesting the delta's.
- BASE_URL:
https://mandaten-besluiten.lblod.info/streams/ldes/
The LDES stream where the interesting types will be added to.
- LDES_BASE:
https://mandaten-besluiten.lblod.info/streams/ldes
The LDES stream where the interesting types will be added to. - WRITE_INITIAL_STATE:
false
We want to harvested the "besluiten" starting from a specific date. - CRON_PATTERN_CLEANUP:
0 */2 * * *
Every 2 hours a cleanup is done of the consumed triples (from the harvester) graph. This is done as we will put all triples in our store, even the non-interesting types.
- Development
- https://dev.harvesting-self-service.lblod.info (harvester url)
- https://dev.harvesting-self-service.lblod.info/login (login to manage the harvesting jobs, create/monitor)
- Testing/production
- All base mu.semte.ch services to setup an application
- Delta-consumer
- Fragmentation producer
- Ldes-delta-pusher
You can make use of the LDES-client service to start using the published ldes-stream.
To get started this you can add this service to the application where you need the ldes-stream data that is published on the ldes-stream used in this app.
ldes-client:
image: lblod/ldes-client:0.0.3
links:
- database:database
- virtuoso:virtuoso
restart: always
environment:
LDES_BASE: https://mandaten-besluiten.lblod.info/streams/ldes/public/
FIRST_PAGE: https://mandaten-besluiten.lblod.info/streams/ldes/public/1
TARGET_GRAPH: http://mu.semte.ch/graphs/besluiten-consumed
WORKING_GRAPH: http://mu.semte.ch/graphs/besluiten-consumed-tmp
DIRECT_DATABASE_CONNECTION: "http://virtuoso:8890/sparql"
RANDOMIZE_GRAPHS: true
BATCH_SIZE: 100
BYPASS_MU_AUTH: false
labels:
- "logging=true"
logging: *default-logging