22 Results
Type | Time | Title | Authors
Poster | Tue 18:30 | Strength of Minibatch Noise in SGD | Liu Ziyin · Kangqiao Liu · Takashi Mori · Masahito Ueda
Spotlight | Tue 18:30 | Strength of Minibatch Noise in SGD | Liu Ziyin · Kangqiao Liu · Takashi Mori · Masahito Ueda
Poster | Thu 10:30 | On the Convergence of mSGD and AdaGrad for Stochastic Optimization | Ruinan Jin · Yu XING · Xingkang He
Poster | Wed 18:30 | SGD Can Converge to Local Maxima | Liu Ziyin · Botao Li · James Simon · Masahito Ueda
Spotlight | Wed 18:30 | SGD Can Converge to Local Maxima | Liu Ziyin · Botao Li · James Simon · Masahito Ueda
Poster | Mon 10:30 | Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise | Xingyu Wang · Sewoong Oh · Chang-Han Rhee
Poster | Mon 10:30 | Learning by Directional Gradient Descent | David Silver · Anirudh Goyal · Ivo Danihelka · Matteo Hessel · Hado van Hasselt
Poster | Tue 18:30 | Sampling with Mirrored Stein Operators | Jiaxin Shi · Chang Liu · Lester Mackey
Spotlight | Tue 18:30 | Sampling with Mirrored Stein Operators | Jiaxin Shi · Chang Liu · Lester Mackey
Poster | Thu 10:30 | Learning Curves for SGD on Structured Features | Blake Bordelon · Cengiz Pehlevan
Poster | Tue 10:30 | A global convergence theory for deep ReLU implicit networks via over-parameterization | Tianxiang Gao · Hailiang Liu · Jia Liu · Hridesh Rajan · Hongyang Gao
Spotlight | Wed 10:30 | Assessing Generalization of SGD via Disagreement | Yiding Jiang · Vaishnavh Nagarajan · Christina Baek · Zico Kolter