Search Results - Sievert, Scott
-
1
-
2
-
3
-
4
Improving the convergence of SGD through adaptive batch sizes
Published in arXiv.orgGet full text
Article -
5
-
6
ATOMO: Communication-efficient Learning via Atomic Sparsification
Published in arXiv.orgGet full text
Article -
7
-
8
-
9
-
10
-
11
-
12
-
13
-
14
-
15
-
16
-
17
-
18
-
19
-
20