When I perform adversarial training, I find that the output of BERT is always NAN. #13546
Answered
by
akihironitta
Struggle-Forever
asked this question in
code help: NLP / ASR / TTS
Answered by
akihironitta
Jul 6, 2022
Answer selected by
Struggle-Forever
As far as I know, training with AMP (`precision=16`) can sometimes be unstable and lead to NaN, as you report.
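
To illustrate one common way half precision ends up at NaN, here is a minimal sketch using only the Python standard library. It is not taken from the question's code, and it may not be the actual cause in this case. The helper `to_fp16` is a hypothetical name introduced here; it emulates rounding to IEEE 754 half precision, whose largest finite value is 65504.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def to_fp16(x: float) -> float:
    """Hypothetical helper: round x to half precision, saturating to +/-inf."""
    try:
        # The "e" struct format is IEEE 754 binary16 (half precision).
        return struct.unpack("e", struct.pack("e", x))[0]
    except OverflowError:
        # struct refuses out-of-range values; fp16 arithmetic would give infinity.
        return math.copysign(math.inf, x)


logit = 12.0                  # a moderately large activation
exp_full = math.exp(logit)    # about 162754.8, no problem at full precision
exp_fp16 = to_fp16(exp_full)  # exceeds FP16_MAX, so it becomes inf
diff = exp_fp16 - exp_fp16    # inf - inf is NaN

print(exp_fp16, diff)
```

If overflow like this is what happens in your run, training once with full precision (`precision=32` on the Lightning `Trainer`) is a simple way to check whether the NaN goes away.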