Showing
1 - 9
results of
9
Skip to content
VuFind
Log in
Library Catalogue Plus
Library
Subject guides
Databases
Referencing
Catalogue
Articles Plus
Keyword
Title
Author
Subject
Find
Advanced Search
Search Results - Kunstner, Frederik
Search Results - Kunstner, Frederik
Showing
1 - 9
results of
9
Refine Results
Sort
Relevance
Date Descending
Author
Title
1
Loading…
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models
by
Kunstner, Frederik
,
Yadav, Robin
,
Milligan, Alan
,
Schmidt, Mark
,
Bietti, Alberto
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
2
Loading…
Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent
by
Kunstner, Frederik
,
Kumar, Raunak
,
Schmidt, Mark
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
3
Loading…
Searching for Optimal Per-Coordinate Step-sizes with Multidimensional Backtracking
by
Kunstner, Frederik
,
Portella, Victor S
,
Schmidt, Mark
,
Harvey, Nick
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
4
Loading…
Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
by
Kunstner, Frederik
,
Chen, Jacques
,
Jonathan Wilder Lavington
,
Schmidt, Mark
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
5
Loading…
Limitations of the Empirical Fisher Approximation for Natural Gradient Descent
by
Kunstner, Frederik
,
Balles, Lukas
,
Hennig, Philipp
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
6
Loading…
Convergence Rates for the MAP of an Exponential Family and Stochastic Mirror Descent -- an Open Problem
by
Rémi Le Priol
,
Kunstner, Frederik
,
Scieur, Damien
,
Lacoste-Julien, Simon
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
7
Loading…
BackPACK: Packing more into backprop
by
Dangel, Felix
,
Kunstner, Frederik
,
Hennig, Philipp
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
8
Loading…
Adaptive Gradient Methods Converge Faster with Over-Parameterization (but you should do a line-search)
by
Vaswani, Sharan
,
Laradji, Issam
,
Kunstner, Frederik
,
Si Yi Meng
,
Schmidt, Mark
,
Lacoste-Julien, Simon
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
9
Loading…
SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient
by
Mishkin, Aaron
,
Kunstner, Frederik
,
Nielsen, Didrik
,
Schmidt, Mark
,
Khan, Mohammad Emtiyaz
Published in
arXiv.org
Get full text
Article
Save to List
Saved in:
Search Tools:
RSS Feed
Email Search
Save Search
Back
Refine Results
Page will reload when a filter is selected or excluded.
Limit To
Full Text
9 results
9
Format
Articles
9 results
9
Subjects
Algorithms
4 results
4
Optimization
4 results
4
Approximation
3 results
3
Convergence
3 results
3
Mathematical Analysis
2 results
2
Parameterization
2 results
2
Artificial Neural Networks
1 results
1
Asymptotic Properties
1 results
1
Back Propagation
1 results
1
Back Propagation Networks
1 results
1
Bayesian Analysis
1 results
1
Classification
1 results
1
Computation
1 results
1
Convex Analysis
1 results
1
Covariance Matrix
1 results
1
Cutting
1 results
1
Deep Learning
1 results
1
Divergence
1 results
1
Heuristic Methods
1 results
1
Hyperplanes
1 results
1
Year of Publication
From:
To:
Source
Free E- Journals
9 results
9
Road: Directory Of Open Access Scholarly Resources
9 results
9
Publicly Available Content (Proquest)
9 results
9