Generative Vision · Multimodal Intelligence

Hanlin Shang

尚翰林

Master's student in Electronic Information at Fudan University, working on generative vision, face video restoration and generation, and vision-language-action (VLA) systems for autonomous driving.

I am interested in building visual generation systems that are robust, controllable, and useful in real-world settings. My recent work spans face video restoration, audio-driven talking-head generation, and VLA models for autonomous driving.

Selected Work

Research Projects

Autonomous driving · Co-first author

WAM-Diff

A vision-language-action research project for autonomous driving, exploring generative modeling for action-aware driving intelligence.

Face video generation · Contributor

Hallo Series

Contributions to a series of face video generation projects, including Hallo, Hallo2, and Hallo3.

Experience

Research & Industry

Aug 2025 - Jun 2026

Yinwang Intelligent Technology Co., Ltd. (引望智能技术有限公司)

Internship focused on intelligent systems and applied AI.

Jan 2025 - Aug 2025

Shanghai Institute of Intelligent Science (上海智能科学研究院)

Research internship in intelligent science and visual AI.

Education

Academic Background

Fudan University

School of Computing and Intelligence Innovation

Master's student in Electronic Information

Sep 2024 - Jun 2027

Contact

Get in Touch