In calculating gradients, the gradient of the softmax function is not computed using the formula derived in the lecture notes. It seems this step is skipped in the code, and only the gradient of the cost function with respect to yhat (the 'd3' variable) is used. Am I missing something here?
I found nothing wrong with the code in Backprop. When the cross-entropy loss is combined with softmax, the gradient of the loss with respect to the pre-softmax scores simplifies to yhat - labels, so there is no need to apply the softmax Jacobian separately. You can find more details at https://deepnotes.io/softmax-crossentropy
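A quick way to convince yourself is to compare the simplified gradient against a numerical gradient check. The sketch below is illustrative only and does not use the repo's actual variable names ('d3', etc.); `z` stands for the pre-softmax scores and `labels` for one-hot targets:

```python
import numpy as np

def softmax(z):
    # Subtract the row max for numerical stability before exponentiating.
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(yhat, labels):
    # Mean negative log-likelihood of the correct classes (labels are one-hot).
    return -np.mean(np.sum(labels * np.log(yhat), axis=1))

rng = np.random.default_rng(0)
z = rng.normal(size=(4, 3))                     # hypothetical pre-softmax scores
labels = np.eye(3)[rng.integers(0, 3, size=4)]  # hypothetical one-hot targets

# Analytic gradient of the mean loss w.r.t. z: (yhat - labels) / batch_size
yhat = softmax(z)
analytic = (yhat - labels) / z.shape[0]

# Numerical gradient via central differences, for comparison
eps = 1e-6
numeric = np.zeros_like(z)
for i in range(z.shape[0]):
    for j in range(z.shape[1]):
        zp, zm = z.copy(), z.copy()
        zp[i, j] += eps
        zm[i, j] -= eps
        numeric[i, j] = (cross_entropy(softmax(zp), labels) -
                         cross_entropy(softmax(zm), labels)) / (2 * eps)

print(np.allclose(analytic, numeric, atol=1e-6))  # True
```

The two gradients agree, which is why the backprop code can use yhat - labels directly instead of multiplying the cost gradient by the softmax Jacobian.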