Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

训练细节 #9

Open
whyandbecause opened this issue Mar 30, 2023 · 1 comment
Open

训练细节 #9

whyandbecause opened this issue Mar 30, 2023 · 1 comment

Comments

@whyandbecause
Copy link

您好,这个工作特别出色,很有启发意义,我有一点小疑问,论文中提到训练经过了100个epoch,但是我实际训练的时候,大概25个epoch后就收敛了,后面的训练loss和验证mae几乎没变化了,这是什么原因?期待您的解答,谢谢!

@WateverOk
Copy link

我也是,大概20多轮就收敛了,另外论文提到每30个epoch学习率衰减一次,但是我看代码里一直是恒定学习率

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants