-
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
Kairun Wen, Yuzhi Huang, Runyu Chen, Hui Zheng, Yunlong Lin, Panwang Pan, Chenxin Li, Wenyan Cong, Jian Zhang, Junbin Lu, Chenguo Lin, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Yue Huang, Xinghao Ding, Rakesh Ranjan, Zhiwen Fan · arXiv preprint arXiv:2512.03000 , 2025
-
Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models
Q Ren, Y Wang, L Guo, W Zhang, Z Fan, C You · arXiv preprint arXiv:2511.19917 , 2025
-
AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback
Y Yang, J Hong, GJ Perin, Z Fan, L Yin, Z Wang, U Topcu · arXiv preprint arXiv:2509.18384 , 2025
-
Mmhu: A massive-scale multimodal benchmark for human behavior understanding
R Li, R Ye, M Wu, HF Yang, Z Fan, H Hu, Z Tu · arXiv preprint arXiv:2507.12463 , 2025
-
Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Longfei Li, Zhiwen Fan, Wenyan Cong, Xinhang Liu, Yuyang Yin, Matt Foutter, Panwang Pan, Chenyu You, Yue Wang, Zhangyang Wang, Yao Zhao, Marco Pavone, Yunchao Wei · NeurIPS 2025 Datasets & Benchmarks , 2025
-
CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy
J Zhang, S Zhou, H Dai, X Liu, P Wang, Z Fan, Y Pei, J Yu · ICCV , 2025
-
E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models
Wenyan Cong, Yiqing Liang, Yancheng Zhang, Ziyi Yang, Yan Wang, Boris Ivanovic, Marco Pavone, Chen Chen, Zhangyang Wang, Zhiwen Fan · arXiv preprint arXiv:2506.01933 , 2025
-
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
Zhiwen Fan, Jian Zhang, Renjie Li, Junge Zhang, Runjin Chen, Hezhen Hu, Kevin Wang, Huaizhi Qu, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Tianlong Chen, Jiachen Li, Zhengzhong Tu, Zhangyang Wang, Rakesh Ranjan · arXiv preprint arXiv:2505.20279 , 2025
-
Generative ai for autonomous driving: Frontiers and opportunities
Yuping Wang, Shuo Xing, Cui Can, Renjie Li, Hongyuan Hua, Kexin Tian, Zhaobin Mo, Xiangbo Gao, Keshu Wu, Sulong Zhou, Hengxu You, Juntong Peng, Junge Zhang, Zehao Wang, Rui Song, Mingxuan Yan, Walter Zimmer, Xingcheng Zhou, Peiran Li, Zhaohan Lu, Chia-Ju Chen, Yue Huang, Ryan A Rossi, Lichao Sun, Hongkai Yu, Zhiwen Fan, Frank Hao Yang, Yuhao Kang, Ross Greer, Chenxi Liu, Eun Hak Lee, Xuan Di, Xinyue Ye, Liu Ren, Alois Knoll, Xiaopeng Li, Shuiwang Ji, Masayoshi Tomizuka, Marco Pavone, Tianbao Yang, Jing Du, Ming-Hsuan Yang, Hua Wei, Ziran Wang, Yang Zhou, Jiachen Li, Zhengzhong Tu · arXiv preprint arXiv:2505.08854 , 2025
-
Can Test-Time Scaling Improve World Foundation Model?
Wenyan Cong, Hanqing Zhu, Peihao Wang, Bangya Liu, Dejia Xu, Kevin Wang, David Z Pan, Yan Wang, Zhiwen Fan, Zhangyang Wang · COLM , 2025
-
X-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
W Yu, Y Cai, R Zha, Z Fan, C Li, Y Yuan · ICCV , 2025
-
STAMP: Scalable Task And Model-agnostic Collaborative Perception
X Gao, R Xu, J Li, Z Wang, Z Fan, Z Tu · ICLR , 2025
-
Videolifter: Lifting videos to 3d with fast hierarchical stereo alignment
Wenyan Cong, Hanqing Zhu, Kevin Wang, Jiahui Lei, Colton Stearns, Yuanhao Cai, Dilin Wang, Rakesh Ranjan, Matt Feiszli, Leonidas Guibas, Zhangyang Wang, Weiyao Wang, Zhiwen Fan · CVPR Workshop , 2025
-
Steepest Descent Density Control for Compact 3D Gaussian Splatting
Peihao Wang, Yuehao Wang, Dilin Wang, Sreyas Mohan, Zhiwen Fan, Lemeng Wu, Ruisi Cai, Yu-Ying Yeh, Zhangyang Wang, Qiang Liu, Rakesh Ranjan · CVPR , 2025
-
Feature4x: Bridging any monocular video to 4d agentic ai with versatile gaussian feature fields
Shijie Zhou, Hui Ren, Yijia Weng, Shuwang Zhang, Zhen Wang, Dejia Xu, Zhiwen Fan, Suya You, Zhangyang Wang, Leonidas Guibas, Achuta Kadambi · CVPR , 2025
-
Instantsplamp: Fast and generalizable stenography framework for generative gaussian splatting
C Li, H Liu, Z Fan, W Li, Y Liu, P Pan, Y Yuan · ICLR , 2025
-
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
Zhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang · NeurIPS , 2024
-
Symbolic visual reinforcement learning: A scalable framework with object-level abstraction and differentiable expression search
W Zheng, SP Sharan, Z Fan, K Wang, Y Xi, Z Wang · IEEE Transactions on Pattern Analysis and Machine Intelligence , 2024
-
Expressive Gaussian Human Avatars from Monocular RGB Video
H Hu, Z Fan, T Wu, Y Xi, S Lee, G Pavlakos, Z Wang · NeurIPS , 2024
-
4k4dgen: Panoramic 4d generation at 4k resolution
Renjie Li, Panwang Pan, Bangbang Yang, Dejia Xu, Shijie Zhou, Xuanyang Zhang, Zeming Li, Achuta Kadambi, Zhangyang Wang, Zhengzhong Tu, Zhiwen Fan · ICLR , 2024
-
Learning traffic crashes as language: Datasets, benchmarks, and what-if causal analyses
Z Fan, P Wang, Y Zhao, Y Zhao, B Ivanovic, Z Wang, M Pavone, HF Yang · arXiv preprint arXiv:2406.10789 , 2024
-
Llmgeo: Benchmarking large language models on image geolocation in-the-wild
Z Wang, D Xu, RMS Khan, Y Lin, Z Fan, X Zhu · arXiv preprint arXiv:2405.20363 , 2024
-
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
Shijie* Zhou, Zhiwen* Fan, Dejia* Xu, Haoran Chang, Pradyumna Chari, Tejas Bharadwaj, Suya You, Zhangyang Wang, Achuta Kadambi · ECCV , 2024
-
MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements
LC Sun, NP Bhatt, JC Liu, Z Fan, Z Wang, TE Humphreys, U Topcu · IROS , 2024
-
InstantSplat: Sparse-view Gaussian Splatting in Seconds
Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang · arXiv preprint arXiv:2403.20309 , 2024
-
Learning to estimate 6dof pose from limited data: A few-shot, generalizable approach using rgb images
P Pan, Z Fan✉, BY Feng, P Wang, C Li, Z Wang · 3DV , 2024
-
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
M Varma T, P Wang, Z Fan, Z Wang, H Su, R Ramamoorthi · CVPR , 2024
-
NeRF as Pretraining at Scale: Generalizable 3D-Aware Semantic Representation Learning from View Prediction
W Cong, H Liang, Z Fan, P Wang, Y Jiang, D Xu, AC Oztireli, Z Wang · ICCV , 2024
-
Taming mode collapse in score distillation for text-to-3d generation
Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra · CVPR , 2024
-
Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields
Shijie Zhou, Haoran Chang, Sicheng Jiang, Zhiwen Fan, Zehao Zhu, Dejia Xu, Pradyumna Chari, Suya You, Zhangyang Wang, Achuta Kadambi · CVPR , 2024
-
Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps
Z Fan, K Wang, K Wen, Z Zhu, D Xu, Z Wang · NeurIPS , 2024
-
Pope: 6-dof promptable pose estimation of any object in any scene with one reference
Z Fan, P Pan, P Wang, Y Jiang, D Xu, Z Wang · CVPR Workshop , 2024
-
Steindreamer: Variance reduction for text-to-3d score distillation via stein identity
Peihao Wang, Zhiwen Fan, Dejia Xu, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra · AISTATS , 2023
-
Fsgs: Real-time few-shot view synthesis using gaussian splatting
Z Zhu*, Z Fan*, Y Jiang, Z Wang · ECCV , 2023
-
INR-Arch: A dataflow architecture and compiler for arbitrary-order gradient computations in implicit neural representation processing
S Abi-Karam, R Sarkar, D Xu, Z Fan, Z Wang, C Hao · ICCAD , 2023
-
Edge-moe: Memory-efficient multi-task vision transformer architecture with task-level sparsity via mixture-of-experts
R Sarkar, H Liang, Z Fan, Z Wang, C Hao · ICCAD , 2023
-
Pose-free generalizable rendering transformer
Z Fan, P Pan, P Wang, Y Jiang, H Jiang, D Xu, Z Zhu, D Wang, Z Wang · arXiv preprint arXiv:2310.03704 , 2023
-
Data-model-circuit tri-design for ultra-light video intelligence on edge devices
Yimeng Zhang, Akshay Karkal Kamath, Qiucheng Wu, Zhiwen Fan, Wuyang Chen, Zhangyang Wang, Shiyu Chang, Sijia Liu, Cong Hao · Proceedings of the 28th Asia and South Pacific Design Automation Conference … , 2023
-
Enhancing nerf akin to enhancing llms: Generalizable nerf transformer with mixture-of-view-experts
W Cong, H Liang, P Wang, Z Fan, T Chen, M Varma, Y Wang, Z Wang · ICCV , 2023
-
Steganerf: Embedding invisible information within neural radiance fields
C Li*, BY Feng*, Z Fan*, P Pan, Z Wang · ICCV , 2023
-
Neurallift-360: Lifting an in-the-wild 2d photo to a 3d object with 360deg views
D Xu, Y Jiang, P Wang, Z Fan, Y Wang, Z Wang · CVPR , 2023
-
NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes
Z Fan, P Wang, Y Jiang, X Gong, D Xu, Z Wang · ICLR , 2023