I have a 5-minute audio file. The wav2vec features I get by running inference on the whole file are inconsistent with the features I get when I first crop the audio into 10-second segments. Is inference on long audio less accurate? If so, what segment length should I crop to for the best results?
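For reference, the cropping I describe can be sketched roughly as follows. This is only an illustration: the helper name `split_into_segments` is hypothetical, and the 16 kHz sample rate is an assumption about the input audio, not something taken from the wav2vec code.

```python
def split_into_segments(num_samples, sample_rate=16000, segment_seconds=10.0):
    """Return (start, end) sample-index pairs that cover the waveform.

    Hypothetical helper for illustration; sample_rate=16000 is an assumption.
    The last segment may be shorter than the others.
    """
    segment_len = int(round(segment_seconds * sample_rate))
    if segment_len <= 0:
        raise ValueError("segment_seconds must be positive")
    return [
        (start, min(start + segment_len, num_samples))
        for start in range(0, num_samples, segment_len)
    ]


# A 5-minute file at the assumed 16 kHz has 4,800,000 samples.
segments = split_into_segments(5 * 60 * 16000)
```

Each `(start, end)` pair would then be used to slice the waveform before passing that slice to the model on its own, which is the case I am comparing against whole-file inference.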