Search Results - Chiang, Micah
-
1
-
2
-
3
-
4
-
5
-
6
-
7
-
8
K-SAM: Sharpness-Aware Minimization at the Speed of SGD
Published in arXiv.orgGet full text
Article -
9
-
10
-
11
-
12
-
13
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Published in arXiv.orgGet full text
Article -
14
-
15
WITCHcraft: Efficient PGD attacks with random step size
Published in arXiv.orgGet full text
Article