Yiwei Liu  |  刘益伟

I am an independent scholar working at ByteDance as a Multimodal LLMs Researcher. Before this, I received my B.E. degree from the College of Software Engineering, Sichuan University, in 2025, supervised by Prof. Lin Shao (邵林) at the National University of Singapore.

Email  /  GitHub  /  WeChat


Research

My research interests lie in 🎞️ Computer Vision, 🤖 Embodied AI, and 🏦 Fintech.
Lately, I've been immersed in the engineering side of agent benchmarks (e.g., for GUI agents), with a particular focus on making automated evaluation more efficient.
I'm open to collaborations on agent-related projects and always welcome an online coffee chat!

Publications (* denotes equal contribution)

Uncertainty-based Ensemble Learning in CMR Semantic Segmentation
Yiwei Liu, Ziyi Wu, Liang Zhong, Lingyi Wen, Yuankai Wu
arXiv  /  Code  /  Bibtex
ICASSP 2026  International Conference on Acoustics, Speech, and Signal Processing
Poster Presentation, ICASSP 2026
TL;DR: We propose an ensemble learning paradigm, Streaming, to enhance end-slice accuracy while maintaining near-SOTA DSC performance on cardiac magnetic resonance (CMR) datasets.
Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models
Chenrui Tie*, Shengxiang Sun*, Jinxuan Zhu, Yiwei Liu, Jingxiang Guo, Yue Hu, Haonan Chen, Junting Chen, Ruihai Wu, Lin Shao
arXiv  /  Website  /  Bibtex  /  Media (机器之心)
RSS 2025  Robotics: Science and Systems
Oral Presentation, CVPR 2025 @ 3D Vision Language Models for Robotic Manipulation
TL;DR: Manual2Skill enables robots to autonomously perform complex real-world assembly tasks guided by high-level manual instructions.
MetaFold: Language-Guided Multi-Category Garment Folding Framework via Trajectory Generation and Foundation Model
Haonan Chen*, Junxiao Li*, Ruihai Wu, Yiwei Liu, Yiwen Hou, Zhixuan Xu, Jingxiang Guo, Chongkai Gao, Zhenyu Wei, Shensi Xu, Jiaqi Huang, Lin Shao
arXiv  /  Website  /  Bibtex  /  Media (机器之心)
IROS 2025  Intelligent Robots and Systems
Oral Presentation, IROS 2025
TL;DR: MetaFold employs language-guided point cloud trajectories for task planning and a foundation model for action prediction, enabling better generalization across garments and instructions.
MulFSA: Multi-level Financial Sentiment Analysis Framework for Bond Market
Yiwei Liu, Junbo Wang, Lei Long, Xin Li, Ruiting Ma, Yuankai Wu, Xuebin Chen
arXiv  /  Website  /  Bibtex
Poster Presentation, IJCAI 2025 @ FinLLM
Poster Presentation, ICML 2025 @ NewInML
TL;DR: We propose a novel framework based on PLMs and LLMs, which systematically integrates firm-specific micro-level sentiment, industry-specific meso-level sentiment, and duration-aware smoothing to model the latency and persistence of textual impact.

Education

National University of Singapore 2024.08 - 2025.05

NGNE Program Exchange Student, School of Computing.
Summer Workshop First Prize
An Interesting Project: Motion Sensing FPS Game (CS3247 Game Development)

Sichuan University 2021.09 - 2025.06

B.E. in Software Engineering
Mentors: Yuankai Wu (伍元凯), Computer Science
Xuebin Chen (陈学彬), Economics

Employment

ByteDance, Chengdu, China 2025.07 - Present

Multimodal LLMs Researcher

Siemens Industrial Automation Products (SEWC), Chengdu, China 2024.01 - 2024.06

Embodied AI Intern
An Interesting Demo

Miscellanea

Awards
  • Meritorious Winner in Mathematical Contest in Modeling (MCM) by COMAP 2024
  • National Second Prize of "Citi Cup" Financial Innovation and Application Competition by CSTC 2024

Visitors


Thanks for visiting 😊! Feel free to contact me if you have any questions.
The design of this website is based on the websites of Jon Barron and Jingxiang Guo. Last Update: