Adding duration for each AudioSource #1069
popcornell
started this conversation in
Ideas
-
I usually deal with recording source issues at the time of manifest creation. For example, you can check |
-
Hi,
Would it be a sensible choice to add an optional duration field to each AudioSource?
I know that by design you avoided doing so: the duration lives on the recording only, and the tolerance field handles differences in length between AudioSources.
But in some instances you may also want to check the duration of each AudioSource.
E.g.:
You are training an ASR model (single-channel, suffice to say), you are "unlucky", and you sample the "wrong" AudioSource, one that ends before the current utterance does. With tolerance it will be padded, as I understand it, so now the input is all zeros and the model still has to learn to predict the utterance.
There are workarounds right now, such as discarding the truncated supervision. I just want to bring this up.
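To make the scenario concrete, here is a minimal sketch of the kind of check a per-source duration would allow. It does not use the library's actual classes: `SourceInfo`, `Utterance`, `covers`, `usable_sources`, and the 0.025 s default tolerance are all hypothetical names and values chosen for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SourceInfo:
    """Hypothetical stand-in for an audio source with an optional duration (seconds)."""
    name: str
    duration: Optional[float] = None


@dataclass
class Utterance:
    """Hypothetical stand-in for a supervision segment (seconds)."""
    start: float
    duration: float

    @property
    def end(self) -> float:
        return self.start + self.duration


def covers(source: SourceInfo, utt: Utterance, tolerance: float = 0.025) -> bool:
    """Return True if the utterance ends within the source, up to `tolerance` seconds."""
    if source.duration is None:
        # Duration unknown for this source: nothing to check, so accept it.
        return True
    return utt.end <= source.duration + tolerance


def usable_sources(sources: List[SourceInfo], utt: Utterance) -> List[str]:
    """Names of the sources that fully contain the utterance."""
    return [s.name for s in sources if covers(s, utt)]


sources = [SourceInfo("mic0", 10.0), SourceInfo("mic1", 7.5)]
utt = Utterance(start=7.0, duration=2.0)  # ends at 9.0 s
print(usable_sources(sources, utt))  # ['mic0']
```

With something like this, sampling could be restricted to the sources that actually contain the utterance, instead of padding the shorter one with zeros or discarding the supervision entirely.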