Do I need to crop long audio for inference based on pretrained models？ #5560

xuduo18311199384 · 2024-10-29T07:55:45Z

I have a 5-minute audio file, and the wav2vec features obtained by direct inference and the wav2vec features obtained by cropping into a 10s segment are inconsistent. Is it possible that the accuracy of the results obtained by direct inference of long audio is low? So, how long audio should I crop to get the best result?

xuduo18311199384 · 2024-10-29T07:56:54Z

@alexeib

xuduo18311199384 added needs triage question labels Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do I need to crop long audio for inference based on pretrained models？ #5560

Do I need to crop long audio for inference based on pretrained models？ #5560

xuduo18311199384 commented Oct 29, 2024

xuduo18311199384 commented Oct 29, 2024

Do I need to crop long audio for inference based on pretrained models？ #5560

Do I need to crop long audio for inference based on pretrained models？ #5560

Comments

xuduo18311199384 commented Oct 29, 2024

xuduo18311199384 commented Oct 29, 2024