Chengjian Feng
I am currently a researcher at Meituan Inc.
My primary research interests encompass a broad range of topics, including object detection, autonomous driving, embodied AI, large multimodal models, diffusion models, domain adaptation, and more.
Specifically, I am particularly interested in the application of object detection and large multimodal models to engineer reliable and robust autonomous driving systems and agents that improve our daily lives.
Home  / 
Email  / 
Github  / 
Google Scholar
|
|
|
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
Chenjian Feng, Yujie Zhong,
Zequn Jie,
Weidi Xie,
Lin Ma
CVPR, 2024
project page /
arXiv
We introduce a novel paradigm to enhance the ability of object detector by training on synthetic dataset generated from diffusion models.
|
|
AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chenjian Feng,
Zequn Jie,
Yujie Zhong,
Xiangxiang Chu,
Lin Ma
CVPR, 2023
project page /
arXiv
We propose an Azimuth-equivariant Detector (AeDet) that is able to perform azimuth-invariant multi-view 3D object detection.
|
|
PromptDet: Towards Open-vocabulary Detection using Uncurated Images
Chenjian Feng, Yujie Zhong,
Zequn Jie,
Xiangxiang Chu,
Haibing Ren,
Xiaolin Wei,
Weidi Xie,
Lin Ma
ECCV, 2022
project page /
arXiv
We propose an open-vocabulary object detector PromptDet, which is able to detect novel categories without any manual annotations.
|
|
TOOD: Task-aligned One-stage Object Detection
Chenjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang
ICCV, 2021
(Oral)
project page /
arXiv
We propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the classification and localization tasks in a learning-based manner.
|
|
Exploring Classification Equilibrium in Long-Tailed Object Detection
Chenjian Feng, Yujie Zhong, Weilin Huang
ICCV, 2021
project page /
arXiv
We balance the classification of the long-tailed detector via an Equilibrium Loss (EBL) and a Memory-augmented Feature Sampling (MFS) method.
|
|
DriveMM: All-in-One Large Multimodal Model for Autonomous Driving
Zhijian Huang*,
Chengjian Feng*,
Feng Yan,
Baihui Xiao,
Zequn Jie,
Yujie Zhong,
Xiaodan Liang,
Lin Ma
(*Equal contribution)
Preprint, 2024
project page /
arXiv
We propose a novel all-in-one large multimodal model, DriveMM, robustly equipped with the general capabilities and the generalization ability.
|
|
RoboMM: All-in-One Multimodal Large Model for Robotic Manipulation
Feng Yan,
Fanfan Liu,
Liming Zheng,
Yufeng Zhong,
Yiyang Huang,
Zechao Guan,
Chengjian Feng,
Lin Ma
Preprint, 2024
project page /
arXiv
We propose a multimodal robotic manipulation model, RoboMM, along with a comprehensive dataset, RoboData.
|
|
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
Chenjian Feng, Yujie Zhong,
Zequn Jie,
Weidi Xie,
Lin Ma
CVPR, 2024
project page /
arXiv
We introduce a novel paradigm to enhance the ability of object detector by training on synthetic dataset generated from diffusion models.
|
|
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
Yingsen Zeng,
Yujie Zhong,
Chenjian Feng,
Lin Ma
ECCV, 2024
project page /
arXiv
We propose a unified architecture, UniMD, for both TAD and MR, and explore a task fusion learning scheme to enhance the mutual benefits between the two tasks.
|
|
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models
Daqin Luo,
Chenjian Feng,
Yuxuan Nong,
Yiqing Shen
ACMMM, 2024
project page /
arXiv
We introduce AutoM3L, an innovative Automated Multimodal Machine Learning framework that leverages LLMs as controllers to automatically construct multimodal training pipelines.
|
|
RFSR: Improving ISR Diffusion Models via Reward Feedback Learning
Xiaopeng Sun,
Qinwei Lin,
Yu Gao,
Yujie Zhong,
Chengjian Feng,
Dengjie Li,
Zheng Zhao,
Jie Hu,
Lin Ma
Preprint, 2024
project page /
arXiv
We propose a multimodal robotic manipulation model, RoboMM, along with a comprehensive dataset, RoboData.
|
|
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Hao Wang,
Pengzhen Ren,
Zequn Jie,
Xiao Dong,
Chenjian Feng,
Yinlong Qian,
Lin Ma,
Dongmei Jiang,,
Yaowei Wang,
Xiangyuan Lan,
Xiaodan Liang
Preprint, 2024
project page /
arXiv
We propose a novel unified open-vocabulary detection method called OV-DINO, which is pre-trained on diverse large-scale datasets with language-aware selective fusion in a unified framework.
|
|
RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios
Liming Zheng,
Feng Yan,
Fanfan Liu,
Chenjian Feng,
Zhuoliang Kang,
Lin Ma,
Preprint, 2024
project page /
arXiv
We introduces RoboCAS, the first benchmark specifically designed for complex object arrangement scenarios in robotic manipulation.
|
|
RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
Fanfan Liu,
Feng Yan,
Liming Zheng,
Chenjian Feng,
Yiyang Huang,
Lin Ma,
Preprint, 2024
project page /
arXiv
We propose RoboUniView, an innovative approach that decouples visual feature extraction from action learning by a unified view representation.
|
|
AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chenjian Feng,
Zequn Jie,
Yujie Zhong,
Xiangxiang Chu,
Lin Ma
CVPR, 2023
project page /
arXiv
We propose an Azimuth-equivariant Detector (AeDet) that is able to perform azimuth-invariant multi-view 3D object detection.
|
|
FastPillars: A Deployment-friendly Pillar-based 3D Detector
Sifan Zhou,
Zhi Tian,
Xiangxiang Chu,
Xinyu Zhang,
Bo Zhang,
Xiaobo Lu,
Chenjian Feng,
Zequn Jie,
Patrick Yin Chiang,
Lin Ma
Preprint, 2023
project page /
arXiv
We devise a deployment-friendly pillar-based 3D detector, termed FastPillars, to tackle the challenge of efficient 3D object detection from an industry perspective.
|
|
PromptDet: Towards Open-vocabulary Detection using Uncurated Images
Chenjian Feng, Yujie Zhong,
Zequn Jie,
Xiangxiang Chu,
Haibing Ren,
Xiaolin Wei,
Weidi Xie,
Lin Ma
ECCV, 2022
project page /
arXiv
We propose an open-vocabulary object detector PromptDet, which is able to detect novel categories without any manual annotations.
|
|
TOOD: Task-aligned One-stage Object Detection
Chenjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang
ICCV, 2021
(Oral)
project page /
arXiv
We propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the classification and localization tasks in a learning-based manner.
|
|
Exploring Classification Equilibrium in Long-Tailed Object Detection
Chenjian Feng, Yujie Zhong, Weilin Huang
ICCV, 2021
project page /
arXiv
We balance the classification of the long-tailed detector via an Equilibrium Loss (EBL) and a Memory-augmented Feature Sampling (MFS) method.
|
|
Domain adaptation with SBADA-GAN and Mean Teacher
Chenjian Feng, Zhaoshui He, Jiawei Wang, Qinzhuang Lin, Zhouping Zhu, Jun Lv, Shengli Xie
Neurocomputing, 2020
We propose a powerful model for unsupervised domain adaptation by introducing Mean Teacher as a target classifier of SBADA-GAN.
|
Template gratefully stolen from here.
|
|