Simple alignments breakdown near end of audio: please help. #295
Comments
I encountered the same issue. Did you figure out the cause and a solution?
Same problem here. It seems odd to me that the errors accumulate, rather than each overlong fragment simply cutting into the start of the next one. After all, the start and end times are what matter most and what the tool should be analyzing, not the duration.
@Oleg-A-LLIto @changyr66 Can you post your code and data file examples?
Sure, here's an example. Unfortunately, I had to change the json to txt and the mp3 to mp4 (GitHub likes it that way). TextInitial.txt
I am a beginner user of aeneas (MacBook 2021, Ventura 13.0.1) with a large amount of experience in natural language processing, audio, algorithms, and software. I understand the basic principles of aeneas and forced alignment algorithms.
I recently noticed that my configuration 'runs out of room' and the alignment begins to produce errors of the same type.
Can someone familiar with the aeneas package help me debug this? I will provide clearer code as we discuss.
Here is the basic outline of my usage:
Nothing particularly unique in the above: I have a collection of phrases, each about one sentence long, and I have the associated audio. I write the audio to a temporary file, and inside the forced_alignment function I write the phrases to disk.
Here I execute the aeneas package using the configuration shown above. Typical results are shown below. I have also tried varying the length of the phrases, and the same problem persists.
You can see that the alignment for the first three phrases is roughly correct, and the fourth phrase is given essentially zero length. This is wrong. It almost appears as though the tempo of the alignment is wrong: in other words, the proportions of the first three phrases are correct, but each is 'too long,' and then aeneas simply runs out of audio.
This package is very important, and its algorithm and implementation are very streamlined and an excellent baseline for many more sophisticated audio applications.
Can we debug?
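To make the symptom easier to compare across runs, here is a minimal sketch that lists fragment durations from a sync map and flags any that have collapsed to near zero. It assumes the sync map was written in the aeneas JSON output format (a top-level "fragments" list whose items carry "id", "begin", and "end", with the times stored as strings). The helper names, the 0.25 s threshold, and the sample values are hypothetical and for illustration only; they are not taken from the run described in this issue.

```python
def fragment_durations(sync_map):
    """Return (id, begin, end, duration) tuples for each fragment.

    Assumes the aeneas JSON sync map layout: begin/end are strings in seconds.
    """
    rows = []
    for frag in sync_map["fragments"]:
        begin = float(frag["begin"])
        end = float(frag["end"])
        rows.append((frag["id"], begin, end, end - begin))
    return rows


def collapsed_fragments(sync_map, min_seconds=0.25):
    """Return ids of fragments shorter than min_seconds (an arbitrary threshold)."""
    return [fid for fid, _begin, _end, dur in fragment_durations(sync_map)
            if dur < min_seconds]


# Hypothetical sync map that mimics the reported pattern: three plausible
# fragments, then a final one squeezed against the end of the audio.
example_sync_map = {
    "fragments": [
        {"id": "f000001", "begin": "0.000", "end": "3.400", "lines": ["First phrase."]},
        {"id": "f000002", "begin": "3.400", "end": "6.900", "lines": ["Second phrase."]},
        {"id": "f000003", "begin": "6.900", "end": "9.960", "lines": ["Third phrase."]},
        {"id": "f000004", "begin": "9.960", "end": "10.000", "lines": ["Fourth phrase."]},
    ]
}

if __name__ == "__main__":
    for fid, begin, end, dur in fragment_durations(example_sync_map):
        print(f"{fid}: {begin:.3f} -> {end:.3f} ({dur:.3f} s)")
    print("collapsed:", collapsed_fragments(example_sync_map))
```

For a real run, load the output file with `json.load` and pass the resulting dict to `collapsed_fragments`. Posting the resulting table alongside the true phrase boundaries would show whether the earlier fragments are each stretched or only the last one is squeezed.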