Base learning rate
0.20
Gradient noise
0.08
Steps/sec
22
step 0 / 160
Play
Reset
Constant
Step decay
Cosine
Warmup + cosine