Wav2Vec2 Spanish

language

Wav2Vec2 Spanish

Spanish Wav2Vec2 model pre-trained using the Spanish portion of the Common Voice dataset.

Part of the Flax x Hugging Face community event.

Team: @mariagrandury, @mrm8488, @edugp and @pcuenq.

Model description

The model used for training is [Wav2Vec2] by FacebookAI. It was introduced in the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" by Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, and Michael Auli (https://arxiv.org/abs/2006.11477).

This model is available in the 🤗 Model Hub.

Intended uses & limitations

How to use (TODO)

Limitations and bias (TODO)

Training data

Spanish portion of Common Voice. Common Voice is an open source, multi-language dataset of voices part of Mozilla's initiative to help teach machines how real people speak.

The dataset is also available in the 🤗 Datasets library.

Training procedure (TODO: update)

The script used for training (train.sh) is based on this training script and was modified as explained in setup_modifications.md.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
config.json		config.json
config.py		config.py
preprocess_dataset.py		preprocess_dataset.py
preprocessor_config.json		preprocessor_config.json
requirements.txt		requirements.txt
run_wav2vec2_pretrain_flax.py		run_wav2vec2_pretrain_flax.py
setup_modifications.md		setup_modifications.md
test_setup.py		test_setup.py
train.sh		train.sh
train_dummy.sh		train_dummy.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wav2Vec2 Spanish

Model description

Intended uses & limitations

How to use (TODO)

Limitations and bias (TODO)

Training data

Training procedure (TODO: update)

Eval results (TODO)

About

Languages

somosnlp/wav2vec2-spanish

Folders and files

Latest commit

History

Repository files navigation

Wav2Vec2 Spanish

Model description

Intended uses & limitations

How to use (TODO)

Limitations and bias (TODO)

Training data

Training procedure (TODO: update)

Eval results (TODO)

About

Topics

Resources

Code of conduct

Stars

Watchers

Forks

Languages