ITAINNOVA at SocialDisNER

Detection of disease mentions in tweets (in Spanish, SMM4H 2022 – Task 10)

Task info

The main source code is organized as follows:

/nb: Notebooks used for data checking

/python: Python source code

/data: Corpus transformations: converting character offsets to the BILOU and BIO tagging schemes
/disease_classifier: Functionality specific to the prediction task: Transformers-based model and predictor, gazetteers, filters and normalization
/token_classifier: General token classifier implemented on top of Transformers. It also implements optimized hyperparameter fine-tuning connected to Wandb

/script: Command line scripts to launch training and predictions

/test: Unit tests
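
As an illustration of the corpus transformation handled in /data, the following is a minimal sketch of converting character-offset annotations into BIO and BILOU tags. It is not the repository's implementation: the regex tokenizer, the function names (`tokenize`, `offsets_to_bio`, `bio_to_bilou`) and the `DISEASE` label are assumptions made for this example only.

```python
import re
from typing import List, Optional, Tuple

Span = Tuple[int, int]  # (start, end) character offsets, end exclusive


def tokenize(text: str) -> List[Tuple[str, int, int]]:
    """Split text into word and punctuation tokens, keeping character offsets."""
    return [(m.group(), m.start(), m.end())
            for m in re.finditer(r"\w+|[^\w\s]", text)]


def offsets_to_bio(text: str, spans: List[Span], label: str = "DISEASE") -> List[str]:
    """Assign one BIO tag per token from character-offset annotations.

    `label` is a placeholder entity name chosen for this sketch.
    """
    tags: List[str] = []
    previous: Optional[Span] = None
    for _, start, end in tokenize(text):
        # A token belongs to a mention if the mention fully contains it.
        current = next((s for s in spans if s[0] <= start and end <= s[1]), None)
        if current is None:
            tags.append("O")
        elif current == previous:
            tags.append(f"I-{label}")
        else:
            tags.append(f"B-{label}")
        previous = current
    return tags


def bio_to_bilou(tags: List[str]) -> List[str]:
    """Rewrite BIO tags as BILOU: the last token of a mention becomes L, single tokens U."""
    out: List[str] = []
    for i, tag in enumerate(tags):
        if tag == "O":
            out.append(tag)
            continue
        prefix, label = tag.split("-", 1)
        following = tags[i + 1] if i + 1 < len(tags) else "O"
        continues = following == f"I-{label}"
        if prefix == "B":
            out.append(f"B-{label}" if continues else f"U-{label}")
        else:
            out.append(f"I-{label}" if continues else f"L-{label}")
    return out


text = "Tengo diabetes tipo 2 y asma."
spans = [(6, 21), (24, 28)]  # "diabetes tipo 2" and "asma"
bio = offsets_to_bio(text, spans)
print(bio_to_bilou(bio))
```

BILOU marks the last token of a multi-token mention (L) and single-token mentions (U) explicitly, whereas BIO only distinguishes the beginning (B) from the inside (I) of a mention.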

Model and file paths must be set in python/config.py (full paths recommended). Wandb functionality has been deactivated.
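
A minimal sketch of what such a path configuration could look like, assuming a layout with one model directory and one corpus directory. All variable names and paths below are hypothetical placeholders and do not reflect the actual contents of python/config.py.

```python
from pathlib import Path


def require_absolute(path: Path) -> Path:
    """Return the path unchanged, or fail early if it is not a full path."""
    if not path.is_absolute():
        raise ValueError(f"Expected a full path, got: {path}")
    return path


# Hypothetical locations: replace with the full paths on your machine.
MODEL_DIR = require_absolute(Path("/path/to/models"))
DATA_DIR = require_absolute(Path("/path/to/corpus"))
OUTPUT_DIR = require_absolute(Path("/path/to/predictions"))
```

Checking the paths when the configuration is loaded surfaces a wrong setting immediately, rather than partway through training or prediction.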

Paper

ITAINNOVA at SocialDisNER: A Transformers cocktail for disease identification in social media in Spanish

More info at:

Biomedical Text Mining YouTube channel

Contributors

Rosa Montañés - @erremesse

Luis García Garcés - @luisgg98

Irene López Bosque - @irenebosque

Rafael del Hoyo Alonso - @neuralconcept
