New paper and codes available!
Boolstering Stochastic Gradient Descent with Model Building: SMB⌗
(joint with Birbil, Martin, Öztoprak) This work provides a new optimizer taking care of second order approximations, challenging ADAM, SLS and SGD. If you make benchmark tests, feedbacks will be very much appreciated. See more at SMB github page