Wav2vec-U GAN training: WER, UER, and loss are not satisfactory #5572

XR1988 opened this issue Dec 11, 2024 · 0 comments

XR1988 commented Dec 11, 2024

I’m also trying to reproduce the results, but the WER and UER are not satisfactory (I’ve been stuck on this for two weeks now), and I would like to ask for your guidance.

Modifications: The setup, environment, and models were built following the original instructions, with no significant changes. Everything runs smoothly and essentially without errors.
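For reference, this is roughly how I launch the GAN step (a minimal sketch for wav2vec-U 1.0; paths are placeholders and the hydra overrides are my reading of examples/wav2vec/unsupervised/README.md, so please point out anything that looks off):

```python
# Minimal sketch of my GAN training launch (wav2vec-U 1.0 shown; for 2.0 I
# switch to the w2vu2 config and its own feature directory).
# All paths below are placeholders.
import os
import subprocess

FAIRSEQ_ROOT = "/path/to/fairseq"
TASK_DATA = "/path/to/features/precompute_unfiltered_pca512_cls128_mean_pooled"
TEXT_DATA = "/path/to/data/phones"                    # fairseq-preprocessed phone data
KENLM_PATH = "/path/to/data/phones/kenlm.phn.o4.bin"  # 4-gram phone KenLM

env = dict(os.environ, PYTHONPATH=FAIRSEQ_ROOT, PREFIX="w2v_unsup_gan_xp")
subprocess.run(
    [
        "fairseq-hydra-train", "-m",
        "--config-dir", f"{FAIRSEQ_ROOT}/examples/wav2vec/unsupervised/config/gan",
        "--config-name", "w2vu",
        f"task.data={TASK_DATA}",
        f"task.text_data={TEXT_DATA}",
        f"task.kenlm_path={KENLM_PATH}",
        f"common.user_dir={FAIRSEQ_ROOT}/examples/wav2vec/unsupervised",
        "model.code_penalty=2,4",
        "model.gradient_penalty=1.5,2.0",
        "model.smoothness_weight=0.5,0.75,1.0",
        "common.seed=range(0,5)",
    ],
    env=env,
    check=True,
)
```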

Data:

- LibriSpeech-100h
- librispeech-lm-norm.txt.gz
- The text was converted to phonemes with G2P (roughly as sketched below); preprocessing completed without errors.
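For context, the word-to-phone conversion of the unpaired text looked roughly like this (a simplified sketch based on g2p_en, which the fairseq preprocessing scripts appear to use; my actual run went through the provided scripts, and the punctuation/stress handling here is simplified):

```python
# Rough sketch of the G2P phonemization applied to the LM text.
from g2p_en import G2p

g2p = G2p()

def words_to_phones(line: str) -> str:
    tokens = g2p(line.strip())
    # g2p_en returns phone symbols plus " " word separators and punctuation
    # tokens; keep only the phone symbols.
    phones = [t for t in tokens if t and t != " " and t[0].isalpha()]
    return " ".join(phones)

print(words_to_phones("the quick brown fox"))
# prints a CMU-style phone sequence, e.g. "DH AH0 K W IH1 K B R AW1 N F AA1 K S"
```
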
Current Status:
For both Wav2vec-U 1.0 and 2.0, the metrics (e.g., WER and UER) drop slightly in the first few minutes of training but then stagnate.

Attempts:

- Tried different hyperparameter configurations.
- Tested training on both CPU and GPU.
- Used larger datasets.
- Extended training durations.
Unfortunately, the performance remains stagnant regardless of these efforts.
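In case the problem is with how I measure the numbers: this is essentially how I sanity-check UER on the dev set (a simplified sketch, not the fairseq scoring code; file names are placeholders, and it assumes one space-separated phone sequence per line in both files):

```python
# Plain edit distance over phone sequences, one utterance per line.
import editdistance

def uer(hyp_path: str, ref_path: str) -> float:
    errors, total = 0, 0
    with open(hyp_path) as hyp_f, open(ref_path) as ref_f:
        for hyp_line, ref_line in zip(hyp_f, ref_f):
            hyp, ref = hyp_line.split(), ref_line.split()
            errors += editdistance.eval(hyp, ref)
            total += len(ref)
    return 100.0 * errors / max(total, 1)

print(f"UER: {uer('dev_other.hyp.phn', 'dev_other.ref.phn'):.2f}%")
```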

1.0: [training screenshot]

2.0: [training screenshot]

data: [screenshot]

test result: [screenshot]

Let me know if additional details are needed!
Looking forward to your advice.
