GD learning rate: 0.0015
Damping lambda: 3.0
Steps/sec: 14
step 0
The Hessian is nearly singular or indefinite here. The pure Newton step can be very long or point uphill, toward a saddle or maximum; damping limits the step, acting like a trust region.
Gradient descent
Newton
Damped Newton
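
To make the comparison between the three methods concrete, here is a minimal sketch of one step of each at a point where the Hessian is indefinite. It assumes the damped step solves (H + lambda*I) d = -g, which is one common form of damping; the demo's exact formula is not stated. The gradient g and Hessian h are hypothetical values chosen for illustration, and the helper names (solve2, gd_step, newton_step, damped_newton_step, slope) are made up for this example. The learning rate and lambda are the settings shown above.

```python
def solve2(a, b):
    """Solve the 2x2 linear system a x = b by Cramer's rule."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    x0 = (b[0] * a[1][1] - a[0][1] * b[1]) / det
    x1 = (a[0][0] * b[1] - b[0] * a[1][0]) / det
    return [x0, x1]


def gd_step(g, lr):
    """Gradient descent: move against the gradient, scaled by lr."""
    return [-lr * g[0], -lr * g[1]]


def newton_step(g, h):
    """Pure Newton: solve H d = -g."""
    return solve2(h, [-g[0], -g[1]])


def damped_newton_step(g, h, lam):
    """Damped Newton (assumed form): solve (H + lam * I) d = -g."""
    hd = [[h[0][0] + lam, h[0][1]],
          [h[1][0], h[1][1] + lam]]
    return solve2(hd, [-g[0], -g[1]])


def slope(g, d):
    """Directional derivative g . d; negative means d is a descent direction."""
    return g[0] * d[0] + g[1] * d[1]


# Hypothetical point with an indefinite Hessian (eigenvalues 2 and -1).
g = [1.0, 1.0]
h = [[2.0, 0.0],
     [0.0, -1.0]]

d_gd = gd_step(g, lr=0.0015)
d_newton = newton_step(g, h)
d_damped = damped_newton_step(g, h, lam=3.0)

for name, d in [("gd", d_gd), ("newton", d_newton), ("damped", d_damped)]:
    print(name, d, "slope:", slope(g, d))
```

In this example the pure Newton step has a positive slope, so it heads uphill along the negative-curvature direction, while the damped step is a descent direction because lambda = 3.0 exceeds the magnitude of the negative eigenvalue and makes H + lambda*I positive definite. As lambda grows, the damped step shrinks toward -g / lambda, a short gradient descent step, which is the sense in which damping behaves like a trust region.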