Hao Chen

陈豪 haochen.umd@gmail.com

I am currently a Research Scientist at Meta. Before this, I worked as a Research Scientist at ByteDance. I earned my Ph.D. in Computer Science from the University of Maryland, College Park, under the guidance of Prof. Abhinav Shrivastava. Prior to this, I completed my Master's degree in Pattern Recognition & Intelligent Systems at Huazhong University of Science & Technology (HUST), supervised by Prof. Guoyou Wang. I also hold a Bachelor's degree from the School of Optical and Electronic Information at HUST.

Projects by areas: Generative Models Neural Representation Others

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Project Page Paper Code Models Demo

Yuang Ai, Jiaming Han, Shaobin Zhuang, Weijia Mao, Xuefeng Hu, Ziyan Yang, Zhenheng Yang, Yali Wang, Huaibo Huang, Xiangyu Yue, Hao Chen^†

UniWeTok: An Unified Binary Tokenizer for Unified Multimodal Large Language Model

Paper

Shaobin Zhuang, Yuang Ai, Jiaming Han, Weijia Mao, Xiaohui Li, Fangyikang Wang, Xiao Wang, Yan Li, Shanchuan Lin, Kun Xu, Zhenheng Yang, Huaibo Huang, Xiangyu Yue, Hao Chen^†, Yali Wang

Growing Visual Generative Capacity for Pre-Trained MLLMs

Project Page Paper Code

Hanyu Wang, Jiaming Han, Ziyan Yang, Qi Zhao, Shanchuan Lin, Xiangyu Yue, Abhinav Shrivastava, Zhenheng Yang, Hao Chen^†

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

NeurIPS 2025

Project Page Paper Code

Jiamin Han, Hao Chen^†, Yang Zhao, Hanyu Wang, Qi Zhao, Zhiyan Yang, Hao He, Xiangyu Yue, Lu Jiang

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Project Page Paper

ByteDance Seed VideoGen Team

LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior

ICLR 2025 (Oral)

Project Page Paper Code

Hanyu Wang, Saurabh Suri, Yixuan Ren, Hao Chen^†, Abhinav Shrivastava

NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis

Project Page Paper

Yixuan Ren, Hanyu Wang, Hao Chen^†, Bo He, Abhinav Shrivastava

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Project Page Paper

ByteDance Seed VideoGen Team

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Project Page Paper Code

Jianyi Wang, Shanchuan Lin, Zhijie Lin, Yuxi Ren, Meng Wei, Zongsheng Yue, Shangchen Zhou, Hao Chen, Yang Zhao, Ceyuan Yang, Xuefeng Xiao, Chen Change Loy, Lu Jiang

SkipSR: Faster Super-Resolution with Token Skipping

Project Page Paper

Rohan Choudhury, Shanchuan Lin, Jianyi Wang, Hao Chen, Qi Zhao, Feng Cheng, Lu Jiang, Kris Kitani, Laszlo A. Jeni

Fast Encoding and Decoding for Implicit Video Representation

ECCV 2024

[Project Page][Preprint Paper][Code]

Hao Chen, Saining Xie, Ser-Nam Lim, Abhinav Shrivastava

We propose NeRV-Enc, which encodes videos 10⁴ times faster than its predecessor NeRV, utilizing hyper-networks. Additionally, we introduce NeRV-Dec, which decodes video 8.9 times faster than NeRV via parallel decoding, and is 11 times faster compared to the H.264 codec.

HNeRV: A Hybrid Neural Representation for Videos

CVPR 2023

[Project Page][Paper][Code]

Hao Chen, Matt Gwilliam, Ser-Nam Lim, Abhinav Shrivastava

We propose a hybrid video neural representation and a evenly distributed neural network to improve modeling capacity and introduce internal generalization.

Towards Scalable Neural Representation for Diverse Videos

CVPR 2023

[Project Page][Paper][Code]

Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava

We propose D-NeRV, a novel neural representation framework designed to encode large-scale and diverse videos.

CNeRV: Content-adaptive Neural Representation for Visual Data

BMVC 2022 (Oral)

[Project Page][Paper]

Hao Chen, Matt Gwilliam, Bo He, Ser-Nam Lim, Abhinav Shrivastava

We propose a hybrid video neural representation with content-adaptive embedding to introduce internal generalization.

NeRV: Neural Representations for Videos

NeurIPS 2021

[Project Page][Paper][Code]

Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava

We propose an image-wise neural representation for videos, which achieves good compression results and fast decoding speed.

The Lottery Ticket Hypothesis for Object Recognition

CVPR 2021

[PDF]

Sharath Girish, Shishira R. Maiya, Kamal Gupta, Hao Chen, Larry Davis, Abhinav Shrivastava

Group Ensemble: Learning an Ensemble of ConvNets in a single ConvNet

[PDF][Code]

Hao Chen, Abhinav Shrivastava

HR-RCNN: Hierarchical Relational Reasoning for Object Detection

BMVC 2021

[PDF]

Hao Chen, Abhinav Shrivastava

Progressive Object Transfer Detection

TIP

[PDF]

Hao Chen, Yali Wang, Guoyou Wang, Xiang Bai, Yu Qiao

LSTD: A Low-Shot Transfer Detector for Object Detection

AAAI 2018 (Spotlight)

[PDF]

Hao Chen, Yali Wang, Guoyou Wang, Yu Qiao

Acknowledgements

I appreciate everyone who helped me or encouraged me throughout my life, especially Prof. Abhinav Shrivastava, Prof. Guoyou Wang, Prof. Yu Qiao, Prof. Yali Wang, Prof. Xiang Bai and all good friends I met in China and the US.