SemPub2015 Tools and Extensions

This project implements FitLayout-based applications and tools for automatic information extraction from the CEUR-WS.org workshop proceedings pages. The tools were created as a proposed solution of the Task 1 of the Semantic Publishing Challenge 2015 colocated with the Extended Semantic Web Conference 2015.

How to Build

The whole package is build using maven. Use mvn package for creating the runnable SemPub2015Extractor.jar.

Running the Extraction Task

Run the extraction tool using

java -jar SemPub2015Extractor.jar

This will start a FitLayout JavaScript console. Use help() command for obtaining more info.

For accomplishing the SemPub2015 Task1 the following commands should be used:

processEvaluationSet();
transformToDomain();

This assumes the Blazegraph storage to be running at http://localhost:8080/bigdata. Use storage.connect() to connect another repository.

After this, the storage should contain the complete extracted data.

SPARQL Queries

The SPARQL queries corresponding to the individual SemPub2015 queries are located in sparql/ESWC2015-queries.txt.

The transformation query from the domain-independent logical model to the domain-dependent CEUR workshop ontology is located in logicalTree2domain.sparql. The transformation itself is included in the transformToDomain() call so it's not necessary to execute this query manually.

Publication

The related publication is the following:

MILIČKA Martin and BURGET Radek. Information Extraction from Web Sources based on Multi-aspect Content Analysis. In: Semantic Web Evaluation Challenges, SemWebEval 2015 at ESWC 2015. Portorož: Springer International Publishing, 2015, pp. 81-92. ISBN 978-3-319-25517-0. ISSN 1865-0929.

Acknowledgements

This work was supported by the BUT FIT grant FIT-S-14-2299 and the IT4Innovations Centre of Excellence CZ.1.05/1.1.00/02.0070.

Name		Name	Last commit message	Last commit date
Latest commit History 150 Commits
.settings		.settings
awk		awk
sparql		sparql
src/main		src/main
test		test
.classpath		.classpath
.gitignore		.gitignore
.project		.project
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SemPub2015 Tools and Extensions

How to Build

Running the Extraction Task

SPARQL Queries

Publication

Acknowledgements

About

Releases

Packages

Languages

License

FitLayout/ToolsEswc

Folders and files

Latest commit

History

Repository files navigation

SemPub2015 Tools and Extensions

How to Build

Running the Extraction Task

SPARQL Queries

Publication

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages