Chengjian Feng

I am currently a researcher at Meituan Inc. My primary research interests encompass a broad range of topics, including object detection, autonomous driving, embodied AI, large multimodal models, diffusion models, domain adaptation, and more. Specifically, I am particularly interested in the application of object detection and large multimodal models to engineer reliable and robust autonomous driving systems and agents that improve our daily lives.

Home  /  Email  /  Github  /  Google Scholar

profile photo
Selected Papers
clean-usnob InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
Chenjian Feng, Yujie Zhong, Zequn Jie, Weidi Xie, Lin Ma
CVPR, 2024
project page / arXiv

We introduce a novel paradigm to enhance the ability of object detector by training on synthetic dataset generated from diffusion models.

clean-usnob AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chenjian Feng, Zequn Jie, Yujie Zhong, Xiangxiang Chu, Lin Ma
CVPR, 2023
project page / arXiv

We propose an Azimuth-equivariant Detector (AeDet) that is able to perform azimuth-invariant multi-view 3D object detection.


clean-usnob PromptDet: Towards Open-vocabulary Detection using Uncurated Images
Chenjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma
ECCV, 2022
project page / arXiv

We propose an open-vocabulary object detector PromptDet, which is able to detect novel categories without any manual annotations.

clean-usnob TOOD: Task-aligned One-stage Object Detection
Chenjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang
ICCV, 2021 (Oral)
project page / arXiv

We propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the classification and localization tasks in a learning-based manner.

clean-usnob Exploring Classification Equilibrium in Long-Tailed Object Detection
Chenjian Feng, Yujie Zhong, Weilin Huang
ICCV, 2021
project page / arXiv

We balance the classification of the long-tailed detector via an Equilibrium Loss (EBL) and a Memory-augmented Feature Sampling (MFS) method.

All Research
clean-usnob DriveMM: All-in-One Large Multimodal Model for Autonomous Driving
Zhijian Huang*, Chengjian Feng*, Feng Yan, Baihui Xiao, Zequn Jie, Yujie Zhong, Xiaodan Liang, Lin Ma
(*Equal contribution)
Preprint, 2024
project page / arXiv

We propose a novel all-in-one large multimodal model, DriveMM, robustly equipped with the general capabilities and the generalization ability.

clean-usnob RoboMM: All-in-One Multimodal Large Model for Robotic Manipulation
Feng Yan, Fanfan Liu, Liming Zheng, Yufeng Zhong, Yiyang Huang, Zechao Guan, Chengjian Feng, Lin Ma
Preprint, 2024
project page / arXiv

We propose a multimodal robotic manipulation model, RoboMM, along with a comprehensive dataset, RoboData.

clean-usnob InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
Chenjian Feng, Yujie Zhong, Zequn Jie, Weidi Xie, Lin Ma
CVPR, 2024
project page / arXiv

We introduce a novel paradigm to enhance the ability of object detector by training on synthetic dataset generated from diffusion models.

clean-usnob UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
Yingsen Zeng, Yujie Zhong, Chenjian Feng, Lin Ma
ECCV, 2024
project page / arXiv

We propose a unified architecture, UniMD, for both TAD and MR, and explore a task fusion learning scheme to enhance the mutual benefits between the two tasks.

clean-usnob AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models
Daqin Luo, Chenjian Feng, Yuxuan Nong, Yiqing Shen
ACMMM, 2024
project page / arXiv

We introduce AutoM3L, an innovative Automated Multimodal Machine Learning framework that leverages LLMs as controllers to automatically construct multimodal training pipelines.

clean-usnob RFSR: Improving ISR Diffusion Models via Reward Feedback Learning
Xiaopeng Sun, Qinwei Lin, Yu Gao, Yujie Zhong, Chengjian Feng, Dengjie Li, Zheng Zhao, Jie Hu, Lin Ma
Preprint, 2024
project page / arXiv

We propose a multimodal robotic manipulation model, RoboMM, along with a comprehensive dataset, RoboData.

clean-usnob OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chenjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang,, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang
Preprint, 2024
project page / arXiv

We propose a novel unified open-vocabulary detection method called OV-DINO, which is pre-trained on diverse large-scale datasets with language-aware selective fusion in a unified framework.

clean-usnob RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios
Liming Zheng, Feng Yan, Fanfan Liu, Chenjian Feng, Zhuoliang Kang, Lin Ma,
Preprint, 2024
project page / arXiv

We introduces RoboCAS, the first benchmark specifically designed for complex object arrangement scenarios in robotic manipulation.

clean-usnob RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
Fanfan Liu, Feng Yan, Liming Zheng, Chenjian Feng, Yiyang Huang, Lin Ma,
Preprint, 2024
project page / arXiv

We propose RoboUniView, an innovative approach that decouples visual feature extraction from action learning by a unified view representation.

clean-usnob AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chenjian Feng, Zequn Jie, Yujie Zhong, Xiangxiang Chu, Lin Ma
CVPR, 2023
project page / arXiv

We propose an Azimuth-equivariant Detector (AeDet) that is able to perform azimuth-invariant multi-view 3D object detection.


clean-usnob FastPillars: A Deployment-friendly Pillar-based 3D Detector
Sifan Zhou, Zhi Tian, Xiangxiang Chu, Xinyu Zhang, Bo Zhang, Xiaobo Lu, Chenjian Feng, Zequn Jie, Patrick Yin Chiang, Lin Ma
Preprint, 2023
project page / arXiv

We devise a deployment-friendly pillar-based 3D detector, termed FastPillars, to tackle the challenge of efficient 3D object detection from an industry perspective.

clean-usnob PromptDet: Towards Open-vocabulary Detection using Uncurated Images
Chenjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma
ECCV, 2022
project page / arXiv

We propose an open-vocabulary object detector PromptDet, which is able to detect novel categories without any manual annotations.

clean-usnob TOOD: Task-aligned One-stage Object Detection
Chenjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang
ICCV, 2021 (Oral)
project page / arXiv

We propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the classification and localization tasks in a learning-based manner.

clean-usnob Exploring Classification Equilibrium in Long-Tailed Object Detection
Chenjian Feng, Yujie Zhong, Weilin Huang
ICCV, 2021
project page / arXiv

We balance the classification of the long-tailed detector via an Equilibrium Loss (EBL) and a Memory-augmented Feature Sampling (MFS) method.

clean-usnob Domain adaptation with SBADA-GAN and Mean Teacher
Chenjian Feng, Zhaoshui He, Jiawei Wang, Qinzhuang Lin, Zhouping Zhu, Jun Lv, Shengli Xie
Neurocomputing, 2020

We propose a powerful model for unsupervised domain adaptation by introducing Mean Teacher as a target classifier of SBADA-GAN.


Template gratefully stolen from here.