Search Results - Pang, Yatian
2
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Published in arXiv.org
Article
5
Envision3D: One Image to 3D with Anchor Views Interpolation
Published in arXiv.org
Article
6
Next Patch Prediction for Autoregressive Visual Generation
Published in arXiv.org
Article
9
Masked Autoencoders for Point Cloud Self-supervised Learning
Published in arXiv.org
Article
10
Open-Sora Plan: Open-Source Large Video Generation Model
Published in arXiv.org
Article