The limitation of target length in _break_query #72

loelee307 · 2024-12-05T07:31:33Z

Hi,

I'm trying to use text search to generate segments from my audio-text resources, following the libriheavy recipe. Occasionally, though, I find cases where the resulting text is incomplete and does not cover the corresponding audio content.

I believe the issue is in the matching._break_query function. From the code, it appears that when the query is not split, the target length is restricted to at most query_end - query_start. Is this designed to lower the complexity of the Levenshtein alignment? How would you advise handling these cases?
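To make the concern concrete, here is a toy sketch of the effect I suspect. It is not the actual code of matching._break_query: the helper `cap_target_window` and the example sentences are hypothetical, and it only assumes that the target span is clipped to the query span's length. When the query (e.g. the ASR output) is shorter than the reference text it should align to, a cap like this drops the tail of the reference.

```python
def cap_target_window(target_start, target_end, query_start, query_end):
    """Toy model (hypothetical): clip the target span so that it is
    no longer than the query span, i.e. query_end - query_start."""
    max_len = query_end - query_start
    return target_start, min(target_end, target_start + max_len)


# Reference text that fully covers the audio (9 tokens).
reference = "the quick brown fox jumps over the lazy dog".split()
# Hypothetical query that missed two words, "brown" and "over" (7 tokens).
query = "the quick fox jumps the lazy dog".split()

start, end = cap_target_window(0, len(reference), 0, len(query))
covered = reference[start:end]   # tokens still available for alignment
dropped = reference[end:]        # tokens cut off at the end

print("covered:", " ".join(covered))
print("dropped:", " ".join(dropped))
```

In this toy case the last two reference tokens fall outside the capped window, which resembles the incomplete text I see at the end of some segments.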

Thanks so much for your time.

loelee307 mentioned this issue Dec 12, 2024

Fix end droping problem #73

Open
