Publications (since 2020)

Generated from Google Scholar and sorted by publication date.

Source: Google Scholar

2025

  1. DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
    Kairun Wen, Yuzhi Huang, Runyu Chen, Hui Zheng, Yunlong Lin, Panwang Pan, Chenxin Li, Wenyan Cong, Jian Zhang, Junbin Lu, Chenguo Lin, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Yue Huang, Xinghao Ding, Rakesh Ranjan, Zhiwen Fan · arXiv preprint arXiv:2512.03000 , 2025
  2. Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models
    Q Ren, Y Wang, L Guo, W Zhang, Z Fan, C You · arXiv preprint arXiv:2511.19917 , 2025
  3. AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback
    Y Yang, J Hong, GJ Perin, Z Fan, L Yin, Z Wang, U Topcu · arXiv preprint arXiv:2509.18384 , 2025
  4. Mmhu: A massive-scale multimodal benchmark for human behavior understanding
    R Li, R Ye, M Wu, HF Yang, Z Fan, H Hu, Z Tu · arXiv preprint arXiv:2507.12463 , 2025
  5. Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
    Longfei Li, Zhiwen Fan, Wenyan Cong, Xinhang Liu, Yuyang Yin, Matt Foutter, Panwang Pan, Chenyu You, Yue Wang, Zhangyang Wang, Yao Zhao, Marco Pavone, Yunchao Wei · NeurIPS 2025 Datasets & Benchmarks , 2025
  6. CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy
    J Zhang, S Zhou, H Dai, X Liu, P Wang, Z Fan, Y Pei, J Yu · ICCV , 2025
  7. E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models
    Wenyan Cong, Yiqing Liang, Yancheng Zhang, Ziyi Yang, Yan Wang, Boris Ivanovic, Marco Pavone, Chen Chen, Zhangyang Wang, Zhiwen Fan · arXiv preprint arXiv:2506.01933 , 2025
  8. VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
    Zhiwen Fan, Jian Zhang, Renjie Li, Junge Zhang, Runjin Chen, Hezhen Hu, Kevin Wang, Huaizhi Qu, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Tianlong Chen, Jiachen Li, Zhengzhong Tu, Zhangyang Wang, Rakesh Ranjan · arXiv preprint arXiv:2505.20279 , 2025
  9. Generative ai for autonomous driving: Frontiers and opportunities
    Yuping Wang, Shuo Xing, Cui Can, Renjie Li, Hongyuan Hua, Kexin Tian, Zhaobin Mo, Xiangbo Gao, Keshu Wu, Sulong Zhou, Hengxu You, Juntong Peng, Junge Zhang, Zehao Wang, Rui Song, Mingxuan Yan, Walter Zimmer, Xingcheng Zhou, Peiran Li, Zhaohan Lu, Chia-Ju Chen, Yue Huang, Ryan A Rossi, Lichao Sun, Hongkai Yu, Zhiwen Fan, Frank Hao Yang, Yuhao Kang, Ross Greer, Chenxi Liu, Eun Hak Lee, Xuan Di, Xinyue Ye, Liu Ren, Alois Knoll, Xiaopeng Li, Shuiwang Ji, Masayoshi Tomizuka, Marco Pavone, Tianbao Yang, Jing Du, Ming-Hsuan Yang, Hua Wei, Ziran Wang, Yang Zhou, Jiachen Li, Zhengzhong Tu · arXiv preprint arXiv:2505.08854 , 2025
  10. Can Test-Time Scaling Improve World Foundation Model?
    Wenyan Cong, Hanqing Zhu, Peihao Wang, Bangya Liu, Dejia Xu, Kevin Wang, David Z Pan, Yan Wang, Zhiwen Fan, Zhangyang Wang · COLM , 2025
  11. X-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
    W Yu, Y Cai, R Zha, Z Fan, C Li, Y Yuan · ICCV , 2025
  12. STAMP: Scalable Task And Model-agnostic Collaborative Perception
    X Gao, R Xu, J Li, Z Wang, Z Fan, Z Tu · ICLR , 2025
  13. Videolifter: Lifting videos to 3d with fast hierarchical stereo alignment
    Wenyan Cong, Hanqing Zhu, Kevin Wang, Jiahui Lei, Colton Stearns, Yuanhao Cai, Dilin Wang, Rakesh Ranjan, Matt Feiszli, Leonidas Guibas, Zhangyang Wang, Weiyao Wang, Zhiwen Fan · CVPR Workshop , 2025
  14. Steepest Descent Density Control for Compact 3D Gaussian Splatting
    Peihao Wang, Yuehao Wang, Dilin Wang, Sreyas Mohan, Zhiwen Fan, Lemeng Wu, Ruisi Cai, Yu-Ying Yeh, Zhangyang Wang, Qiang Liu, Rakesh Ranjan · CVPR , 2025
  15. Feature4x: Bridging any monocular video to 4d agentic ai with versatile gaussian feature fields
    Shijie Zhou, Hui Ren, Yijia Weng, Shuwang Zhang, Zhen Wang, Dejia Xu, Zhiwen Fan, Suya You, Zhangyang Wang, Leonidas Guibas, Achuta Kadambi · CVPR , 2025
  16. Instantsplamp: Fast and generalizable stenography framework for generative gaussian splatting
    C Li, H Liu, Z Fan, W Li, Y Liu, P Pan, Y Yuan · ICLR , 2025

2024

  1. Large Spatial Model: End-to-end Unposed Images to Semantic 3D
    Zhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang · NeurIPS , 2024
  2. Symbolic visual reinforcement learning: A scalable framework with object-level abstraction and differentiable expression search
    W Zheng, SP Sharan, Z Fan, K Wang, Y Xi, Z Wang · IEEE Transactions on Pattern Analysis and Machine Intelligence , 2024
  3. Expressive Gaussian Human Avatars from Monocular RGB Video
    H Hu, Z Fan, T Wu, Y Xi, S Lee, G Pavlakos, Z Wang · NeurIPS , 2024
  4. 4k4dgen: Panoramic 4d generation at 4k resolution
    Renjie Li, Panwang Pan, Bangbang Yang, Dejia Xu, Shijie Zhou, Xuanyang Zhang, Zeming Li, Achuta Kadambi, Zhangyang Wang, Zhengzhong Tu, Zhiwen Fan · ICLR , 2024
  5. Learning traffic crashes as language: Datasets, benchmarks, and what-if causal analyses
    Z Fan, P Wang, Y Zhao, Y Zhao, B Ivanovic, Z Wang, M Pavone, HF Yang · arXiv preprint arXiv:2406.10789 , 2024
  6. Llmgeo: Benchmarking large language models on image geolocation in-the-wild
    Z Wang, D Xu, RMS Khan, Y Lin, Z Fan, X Zhu · arXiv preprint arXiv:2405.20363 , 2024
  7. DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
    Shijie* Zhou, Zhiwen* Fan, Dejia* Xu, Haoran Chang, Pradyumna Chari, Tejas Bharadwaj, Suya You, Zhangyang Wang, Achuta Kadambi · ECCV , 2024
  8. MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements
    LC Sun, NP Bhatt, JC Liu, Z Fan, Z Wang, TE Humphreys, U Topcu · IROS , 2024
  9. InstantSplat: Sparse-view Gaussian Splatting in Seconds
    Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang · arXiv preprint arXiv:2403.20309 , 2024
  10. Learning to estimate 6dof pose from limited data: A few-shot, generalizable approach using rgb images
    P Pan, Z Fan✉, BY Feng, P Wang, C Li, Z Wang · 3DV , 2024
  11. Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
    M Varma T, P Wang, Z Fan, Z Wang, H Su, R Ramamoorthi · CVPR , 2024
  12. NeRF as Pretraining at Scale: Generalizable 3D-Aware Semantic Representation Learning from View Prediction
    W Cong, H Liang, Z Fan, P Wang, Y Jiang, D Xu, AC Oztireli, Z Wang · ICCV , 2024
  13. Taming mode collapse in score distillation for text-to-3d generation
    Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra · CVPR , 2024
  14. Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields
    Shijie Zhou, Haoran Chang, Sicheng Jiang, Zhiwen Fan, Zehao Zhu, Dejia Xu, Pradyumna Chari, Suya You, Zhangyang Wang, Achuta Kadambi · CVPR , 2024
  15. Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps
    Z Fan, K Wang, K Wen, Z Zhu, D Xu, Z Wang · NeurIPS , 2024
  16. Pope: 6-dof promptable pose estimation of any object in any scene with one reference
    Z Fan, P Pan, P Wang, Y Jiang, D Xu, Z Wang · CVPR Workshop , 2024

2023

  1. Steindreamer: Variance reduction for text-to-3d score distillation via stein identity
    Peihao Wang, Zhiwen Fan, Dejia Xu, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra · AISTATS , 2023
  2. Fsgs: Real-time few-shot view synthesis using gaussian splatting
    Z Zhu*, Z Fan*, Y Jiang, Z Wang · ECCV , 2023
  3. INR-Arch: A dataflow architecture and compiler for arbitrary-order gradient computations in implicit neural representation processing
    S Abi-Karam, R Sarkar, D Xu, Z Fan, Z Wang, C Hao · ICCAD , 2023
  4. Edge-moe: Memory-efficient multi-task vision transformer architecture with task-level sparsity via mixture-of-experts
    R Sarkar, H Liang, Z Fan, Z Wang, C Hao · ICCAD , 2023
  5. Pose-free generalizable rendering transformer
    Z Fan, P Pan, P Wang, Y Jiang, H Jiang, D Xu, Z Zhu, D Wang, Z Wang · arXiv preprint arXiv:2310.03704 , 2023
  6. Data-model-circuit tri-design for ultra-light video intelligence on edge devices
    Yimeng Zhang, Akshay Karkal Kamath, Qiucheng Wu, Zhiwen Fan, Wuyang Chen, Zhangyang Wang, Shiyu Chang, Sijia Liu, Cong Hao · Proceedings of the 28th Asia and South Pacific Design Automation Conference … , 2023
  7. Enhancing nerf akin to enhancing llms: Generalizable nerf transformer with mixture-of-view-experts
    W Cong, H Liang, P Wang, Z Fan, T Chen, M Varma, Y Wang, Z Wang · ICCV , 2023
  8. Steganerf: Embedding invisible information within neural radiance fields
    C Li*, BY Feng*, Z Fan*, P Pan, Z Wang · ICCV , 2023
  9. Neurallift-360: Lifting an in-the-wild 2d photo to a 3d object with 360deg views
    D Xu, Y Jiang, P Wang, Z Fan, Y Wang, Z Wang · CVPR , 2023
  10. NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes
    Z Fan, P Wang, Y Jiang, X Gong, D Xu, Z Wang · ICLR , 2023

2022

  1. Signal processing for implicit neural representations
    D Xu, P Wang, Y Jiang, Z Fan, Z Wang · NeurIPS , 2022
  2. Point cloud domain adaptation via masked local 3d structure prediction
    H Liang, H Fan, Z Fan, Y Wang, T Chen, Y Cheng, Z Wang · ECCV , 2022
  3. Unified implicit neural stylization
    Z Fan, Y Jiang, P Wang, X Gong, D Xu, Z Wang · ECCV , 2022
  4. Sinnerf: Training neural radiance fields on complex scenes from a single image
    D Xu, Y Jiang, P Wang, Z Fan, H Shi, Z Wang · ECCV , 2022
  5. Can we solve 3D vision tasks starting from a 2D vision transformer?
    Y Wang, Z Fan, T Chen, H Fan, Z Wang · arXiv preprint arXiv:2209.07026 , 2022
  6. M³vit: Mixture-of-experts vision transformer for efficient multi-task learning with model-accelerator co-design
    Z Fan, R Sarkar, Z Jiang, T Chen, K Zou, Y Cheng, C Hao, Z Wang · NeurIPS , 2022
  7. Aug-nerf: Training stronger neural radiance fields with triple-level physically-grounded augmentations
    T Chen, P Wang, Z Fan, Z Wang · CVPR , 2022
  8. Cadtransformer: Panoptic symbol spotting transformer for cad drawings
    Z Fan, T Chen, P Wang, Z Wang · CVPR , 2022

2021

  1. Meshmvs: multi-view stereo guided mesh reconstruction
    R Shrestha, Z Fan, Q Su, Z Dai, S Zhu, P Tan · 2021 International Conference on 3D Vision (3DV), 1290-1300 , 2021
  2. Floorplancad: A large-scale cad drawing dataset for panoptic symbol spotting
    Z Fan, L Zhu, H Li, X Chen, S Zhu, P Tan · ICCV , 2021

2020

  1. A deep error correction network for compressed sensing MRI
    L Sun, Y Wu, Z Fan, X Ding, Y Huang, J Paisley · BMC Biomedical Engineering 2 (1), 4 , 2020
  2. Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching
    X Gu*, Z Fan*, S Zhu, Z Dai, F Tan, P Tan · CVPR , 2020