⚙️ Experience

Research Intern @ Qwen Team, Alibaba Group   |   Mar 2025 – Oct 2025   |   Beijing, China

  • Built an evaluation framework for VLM4VLA, validating vision-language models on robotic manipulation tasks.
  • Qwen3-VL Technical Report: contributed to the Qwen3-VL project, helping enhance embodied understanding in VLMs through data collection and processing with spatial-position annotations from embodied tasks.

Research Intern @ Seed Robotics, ByteDance Group   |   Oct 2025 – Present   |   Beijing, China

  • BagelVLA: a large unified MLLM-based VLA built with Bagel for long-horizon manipulation, covering understanding, world modeling, and robotic control.
  • (Current work): Exploring how unified models, world models, and action generation combine within VLA architectures, toward next-generation VLA design.