Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The regularization of depthwise convolution #56

Open
BOBrown opened this issue Mar 4, 2018 · 1 comment
Open

The regularization of depthwise convolution #56

BOBrown opened this issue Mar 4, 2018 · 1 comment

Comments

@BOBrown
Copy link

BOBrown commented Mar 4, 2018

The author wrote following words in paper:
Additionally, we found that it was important to put very little or no weight decay (l2 regularization) on the depthwise filters since their are so few parameters in them.

Therefore, i think that we should set decay_mult: 0.0 in the moblienet prototxt

@mathmanu
Copy link

mathmanu commented Jun 5, 2018

Isn't this line taken from the MobilenetV1 paper? I couldn't find any such statement in the MobilenetV2 paper.

I wonder if all parameters are to be decayed in MobileNetV2 training - at-least that's the understanding that I get by looking at the repository's (very few) that provide a training script:
eg: https://github.com/Randl/MobileNetV2-pytorch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants