You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Seems there is some variation in the numbers of few datasets/models after re-running the benchmark, might be due to AMP. It happens on few datasets on zero-shot classification, diabetic retinopathy being the worse. Retrieval is fine.
It could be that linear probing actually trains a classification head and then evaluates. If the training is on GPU and with pytorch, it will be non deterministic due to cudnn which is a well known issue
Seems there is some variation in the numbers of few datasets/models after re-running the benchmark, might be due to AMP. It happens on few datasets on zero-shot classification, diabetic retinopathy being the worse. Retrieval is fine.
See #56, where it was first detected.
The text was updated successfully, but these errors were encountered: