[Jul 23th 2018] On stochastic gradient descent, flatness and generalization – Seminar by Prof. Yoshua Bengio

Prof. Yoshua Bengio (University of Montreal, Department of Computer Science and Operations Research (DIRO) ) Jul 23, 2018 – 11:00 AM DIISM, Artificial Intelligence laboratory (room 201), Siena SI Description The traditional Machine Learning picture is that optimization and generalization are neatly separated aspects. That makes theory easier to handle, separately, but unfortunately this is not the case. Stochastic […]

Read More »