Learning rate
0.12
Momentum beta
0.85
Condition number
18
step 0
Play
Reset
The learning rate is too high for this ravine. Plain gradient descent becomes unstable before momentum has a chance to help.
Gradient descent
Momentum
Nesterov