The limitation of target length in _break_query #72

Open
loelee307 opened this issue Dec 5, 2024 · 0 comments

Hi,

I'm trying to use text search to generate segments from my audio-text resources, following the libriheavy recipe. Occasionally, though, I find segments whose text is incomplete and does not cover the corresponding audio content.

I believe the issue is in the matching._break_query function. From the code, it appears that when the query is not split, the target length is restricted to at most query_end - query_start. Is this designed to lower the complexity of the Levenshtein alignment? What would you advise for handling these cases?
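
To make the symptom concrete, here is a minimal sketch of what I think is happening. It is not the actual code in matching._break_query: the helper `target_window` and its `slack_ratio` parameter are hypothetical names I made up for illustration, and the toy word lists stand in for real transcripts. It only shows that when the query (e.g. the recognized text) has dropped words, a target window capped at the query length cannot reach the end of the matching reference text, while a window with some slack can.

```python
def target_window(target_start, query_len, target_total, slack_ratio=0.0):
    """Return the [start, end) span of the target to align against.

    Hypothetical helper, not part of the library. With slack_ratio=0.0 the
    span is capped at query_len tokens, which mirrors the restriction
    described above.
    """
    slack = int(query_len * slack_ratio)
    end = min(target_total, target_start + query_len + slack)
    return target_start, end


# Toy data: the query is missing "brown", "the" and "lazy".
query = "the quick fox jumps over dog".split()
target = "the quick brown fox jumps over the lazy dog and runs away".split()

# Window capped at the query length: stops before "dog".
start, end = target_window(0, len(query), len(target))
capped = target[start:end]

# Window with 50% slack (an arbitrary value): reaches "dog".
start, end = target_window(0, len(query), len(target), slack_ratio=0.5)
padded = target[start:end]

print(" ".join(capped))
print(" ".join(padded))
```

If the cap is intentional, would allowing some slack like this (with the alignment then trimming the unused tail) be a reasonable workaround, or is there a better approach?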

Thanks so much for your time.
