From 2022 to January 2025, I pursued my Master’s degree at the School of Electrical and Information Engineering, Tianjin University, under the supervision of Associate Prof. Jiale Cao.
Since January 2025, I have been working as a Researcher in the Foundation Model Group at MEGVII Technology. We are hiring research interns year-round; please feel free to send your resume to bin_xie@tju.edu.cn.
My research interests primarily focus on three areas: (1) Embodied AI, (2) Video Generation, and (3) Autonomous Driving.
🔥 News
- 2025.01: One paper (Glad) was accepted to ICLR 2025.
- 2024.11: I was awarded the National Scholarship!
- 2024.07: One paper (QLSeg) was accepted by Pattern Recognition.
- 2024.02: One paper (SED) was accepted to CVPR 2024.
Publications

Glad: A Streaming Scene Generator for Autonomous Driving |ICLR2025|Paper|
Bin Xie, Yingfei Liu, Tiancai Wang, Jiale Cao, Xiangyu Zhang
- Glad is an efficient framework for generating video data in autonomous driving scenarios. It produces temporally coherent videos frame by frame, serving as a robust baseline for data synthesis and simulation in autonomous driving.

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation |CVPR2024|Paper|Code|
Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang
- By using the simple yet efficient CER module, SED achieves a better trade-off between accuracy and inference speed for open-vocabulary semantic segmentation.

Multi-Query and Multi-Level Enhanced Network for Semantic Segmentation |Pattern Recognition|Paper|Code|
Bin Xie, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Jing Nie, Aiping Yang, Yanwei Pang
- Current single-query designs fail to fully exploit the diverse, multi-level information available in plain Vision Transformers (ViTs). To address this limitation, we propose a multi-query and multi-level enhanced network for semantic segmentation.
Service
Invited Reviewer for conferences:
- ICLR 2025, ICCV 2025
Invited Reviewer for journals:
- Pattern Recognition (PR)