-
Thank you, this is a great point that I missed before. I'll look into adding proportional sampling to the dynamic bucketing sampler.
-
Hi Piotr, thank you for your kind response. For my question, I resort to the … About proportional sampling for the dynamic sampler: it's true that there's no way to know the remaining duration in the dynamic scenario. This isn't a well-thought-out idea, but I wonder whether we could provide some kind of duration histogram before training starts?
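To make the histogram idea a bit more concrete, here is a minimal sketch in plain Python. It does not use any Lhotse API: `duration_histogram` and `bucket_totals` are hypothetical helper names, and the sketch assumes the cut durations can be streamed once (for example while the manifest is being written) and that the bucket boundaries fall on histogram bin edges.

```python
import bisect
from collections import Counter
from typing import Iterable, List, Sequence


def duration_histogram(durations: Iterable[float], bin_width: float = 1.0) -> Counter:
    """Accumulate the total duration per bin in a single streaming pass.

    Memory use depends on the number of bins, not on the number of cuts.
    """
    hist: Counter = Counter()
    for d in durations:
        hist[int(d // bin_width)] += d
    return hist


def bucket_totals(
    hist: Counter, bin_width: float, boundaries: Sequence[float]
) -> List[float]:
    """Sum histogram mass into buckets.

    `boundaries` are the sorted upper duration edges (in seconds) between
    buckets; they are assumed to be aligned with the bin edges.
    """
    totals = [0.0] * (len(boundaries) + 1)
    for bin_idx, total in hist.items():
        lower_edge = bin_idx * bin_width
        totals[bisect.bisect_right(boundaries, lower_edge)] += total
    return totals


# Hypothetical durations in seconds, split into two buckets at 5 s.
hist = duration_histogram([1.5, 2.5, 2.7, 10.2, 11.0], bin_width=1.0)
totals = bucket_totals(hist, bin_width=1.0, boundaries=[5.0])
```

With the per-bucket totals known up front, a sampler could in principle estimate the duration left in each bucket as the total minus what it has already yielded, without loading the whole manifest into memory.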
-
Hello! In my experiments, I observe a jump in the loss at the end of every epoch. After some investigation, I found that this happens because, in the `DynamicBucketingSampler`, the buckets for short utterances are used up more quickly than the buckets for longer ones. This seems related to #364 and #372. Using `BucketingSampler` (not the dynamic version) would apparently solve my problem, but converting the lazy manifest with `to_eager()` ran out of memory on my server.

My question: is there any way to use `DynamicBucketingSampler` with approximately proportional sampling? Or is there any way to mitigate the memory issue when using `BucketingSampler`? Any help would be much appreciated!
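For readers unfamiliar with the term, the sketch below illustrates what "approximately proportional sampling" means here: each batch is drawn from a bucket chosen with probability proportional to the duration still left in it, so that all buckets tend to run out at roughly the same time. This is a toy simulation in plain Python, not Lhotse's implementation; `pick_bucket`, `simulate`, and the bucket totals are hypothetical.

```python
import random
from typing import List, Sequence


def pick_bucket(remaining: Sequence[float], rng: random.Random) -> int:
    """Pick a bucket index with probability proportional to its remaining duration."""
    return rng.choices(range(len(remaining)), weights=remaining, k=1)[0]


def simulate(totals: Sequence[float], batch_duration: float, seed: int = 0) -> List[int]:
    """Drain all buckets batch by batch and return the order of chosen buckets."""
    rng = random.Random(seed)
    remaining = list(totals)
    order: List[int] = []
    while any(r > 0 for r in remaining):
        b = pick_bucket(remaining, rng)  # empty buckets have weight 0
        remaining[b] = max(0.0, remaining[b] - batch_duration)
        order.append(b)
    return order


# Hypothetical setup: a small short-utterance bucket and a larger long-utterance one.
order = simulate(totals=[100.0, 300.0], batch_duration=10.0)
```

The catch, as discussed in the replies, is that this needs the remaining duration per bucket, which a lazily read manifest does not provide by itself.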