Does preprocess.sh have an effect if using datasets other than phoenix? #47

Open
yulrio opened this issue Jun 2, 2024 · 1 comment

Comments

@yulrio

yulrio commented Jun 2, 2024

Does preprocess.sh have any effect when using a dataset other than phoenix? I have successfully generated the *-groundtruth-[dev,test,train].stm files. Is that alone enough to train on another dataset?

I look forward to your response.
Thank you.

@ycmin95
Collaborator

ycmin95 commented Jul 18, 2024

Dear @yulrio,

As indicated in Line 14, the simplifications pertain specifically to the phoenix14/14T datasets. Please adjust the preprocessing step according to the evaluation requirements of any other dataset.
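For readers adapting the step, one possible way to keep dataset-specific cleanup separate is sketched below. It is only an illustration and not the project's code: the function `normalize_glosses`, the `EXAMPLE_RULES` table, the dataset names, and the regex rules are all hypothetical and are not taken from preprocess.sh. The idea is simply that each dataset gets its own rule list, and a dataset that needs no simplification gets an empty one.

```python
import re

# Hypothetical per-dataset rules: (regex, replacement) pairs applied in order.
# These are illustrative assumptions, not the rules used in preprocess.sh.
EXAMPLE_RULES = {
    "my_dataset": [
        (r"<[^>]*>", " "),  # drop markup-style tokens such as <unk>
        (r"\s+", " "),      # collapse repeated whitespace
    ],
    "no_cleanup": [],       # a dataset whose evaluation needs no simplification
}


def normalize_glosses(line, rules):
    """Apply each (pattern, replacement) rule to one gloss sequence."""
    for pattern, replacement in rules:
        line = re.sub(pattern, replacement, line)
    return line.strip()


if __name__ == "__main__":
    print(normalize_glosses("HELLO <unk>  WORLD ", EXAMPLE_RULES["my_dataset"]))
```

Whatever rules are chosen should match what the target dataset's evaluation protocol expects, so that the recognition output and the ground truth are compared on the same terms.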

I hope this resolves your question.
