From 2022 to January 2025, I pursued my Master’s degree at the School of Electrical and Information Engineering, Tianjin University, under the supervision of Associate Prof. Jiale Cao.

Since January 2025, I have been working as a Researcher in the Foundation Model Group at MEGVII Technology. We are hiring research interns all year round, please feel free to drop me your resume at bin_xie@tju.edu.cn.

My research interests primarily focus on three areas: (1) Embodied AI, (2) Video Generation, and (3) Autonomous Driving.

🔥 News

  • 2025.01: One paper(Glad) is accepted by ICLR2025.

  • 2024.11: I’m awarded National Scholarship!

  • 2024.07: One paper(QLSeg) is accepted by Pattern Recognition.
  • 2024.02: One paper(SED) is accepted by CVPR2024.

Publications

ICLR 2025
sym

Glad: A Streaming Scene Generator for Autonomous Driving |ICLR2025|Paper|

Bin Xie, Yingfei Liu, Tiancai Wang, Jiale Cao, Xiangyu Zhang

  • Glad is an efficient framework for generating video data in autonomous driving scenarios. It produces temporally coherent videos frame-by-frame, serving as a robust baseline for data synthesis and simulation in autonomous driving.
CVPR 2024
sym

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation |CVPR2024|Paper|Code|

Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang

  • By utilizing the simple yet efficient CER module, SED achieves a better trade-off between accuracy and performance for open-vocabulary semantic segmentation.
Pattern Recognition
sym

Multi-Query and Multi-Level Enhanced Network for Semantic Segmentation |Pattern Recognition|Paper|Code|

Bin Xie, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Jing Nie, Aiping Yang, Yanwei Pang

  • To address the limitations of current single-query designs, which fail to fully exploit the diverse, multi-level information available in plain Vision Transformers (ViT), we propose a multi-query and multi-level enhanced network for semantic segmentation.

Service

Invited Reviewer for conferences:

  • ICLR 2025, ICCV 2025

Invited Reviewer for journals:

  • Pattern Recognition (PR)