Skip to content

Latest commit

 

History

History
31 lines (18 loc) · 2.23 KB

README.md

File metadata and controls

31 lines (18 loc) · 2.23 KB

Linking LASLA corpus - LiLa LemmaBank

This repository contains the triples linking the LASLA corpus to the LiLa LemmaBank.

The LASLA Latin corpus consists of a large amount of texts of Classical Latin, semi-automatically lemmatized and annotated with morphosyntactical information, with full manual verification. The Laboratory is part of the Université de Liège and is currently composed by an international team.

The linking to the LiLa lemmabank was carried out semi-automatically by a joint team of LiLa and LASLA members (Marco Passarotti, Paolo Ruffolo, Eleonora Litta, Giovanni Moretti for LiLa, Margherita Fantoli for LASLA). Every linking has been manually verified.

Source: LASLA texts are available through data-sharing agreements. Editions are based on the available critical editions at the time of the lemmatization but the annotators have modified the texts according to their own editorial decisions.

Repository description:

  • complete_lasla.ttl--> provides the turtle file containing all the texts of LASLA corpus
  • texts (folder)--> provides a single turtle file for each text of the LASLA corpus
  • conllup (folder)--> The conllup format is a standard 10 columns conllu format with additional five columns that specify:
    • flectional category in LiLa
    • an id that represents the work,author id,work code,sentence number
    • the import line number in lasla_raw db
    • the link to LiLa kwonledge base
    • the token uri in the LiLa kwonledge base

Funding

The LiLa: Linking Latin project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme – Grant Agreement No. 769994.

Copyright

Creative Commons Licence
These resources are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.