Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do I need to crop long audio for inference based on pretrained models? #5560

Open
xuduo18311199384 opened this issue Oct 29, 2024 · 1 comment

Comments

@xuduo18311199384
Copy link

I have a 5-minute audio file, and the wav2vec features obtained by direct inference and the wav2vec features obtained by cropping into a 10s segment are inconsistent. Is it possible that the accuracy of the results obtained by direct inference of long audio is low? So, how long audio should I crop to get the best result?

@xuduo18311199384
Copy link
Author

@alexeib

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant