-
Notifications
You must be signed in to change notification settings - Fork 707
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The regularization of depthwise convolution #56
Comments
Isn't this line taken from the MobilenetV1 paper? I couldn't find any such statement in the MobilenetV2 paper. I wonder if all parameters are to be decayed in MobileNetV2 training - at-least that's the understanding that I get by looking at the repository's (very few) that provide a training script: |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The author wrote following words in paper:
Additionally, we found that it was important to put very little or no weight decay (l2 regularization) on the depthwise filters since their are so few parameters in them.
Therefore, i think that we should set decay_mult: 0.0 in the moblienet prototxt
The text was updated successfully, but these errors were encountered: