Search Results - Cody Hao Yu
5
TGPA: Tile-Grained Pipeline Architecture for Low Latency CNN Inference
Conference Proceeding
7
Latte: Locality Aware Transformation for High-Level Synthesis
Conference Proceeding
12
Efficiently Programming Large Language Models using SGLang
Published in arXiv.org
Article
15
RAF: Holistic Compilation for Deep Learning Model Training
Published in arXiv.org
Article
16
Tensor Program Optimization with Probabilistic Programs
Published in arXiv.org
Article
18
Best-Effort FPGA Programming: A Few Steps Can Go a Long Way
Published in arXiv.org
Article