Generative Vision · Multimodal Intelligence

Hanlin Shang

尚翰林

Master's student in Electronic Information at Fudan University, working on generative vision, face video restoration, face video generation, and vision-language-action systems for autonomous driving.

Email GitHub

I am interested in building visual generation systems that are robust, controllable, and useful in real-world settings. My recent work spans face video restoration, audio-driven talking-head generation, and VLA models for autonomous driving.

Selected Work

Research Projects

ICCV 2025 Highlight Co-first author

DicFace

A face video restoration project focused on improving degraded facial videos. I led the project and contributed as a co-first author.

View project

Autonomous driving Co-first author

WAM-Diff

A vision-language-action research project for autonomous driving, exploring generative modeling for action-aware driving intelligence.

View project

Face video generation Contributor

Hallo Series

Contributions to a series of face video generation projects, including Hallo, Hallo2, and Hallo3.

Hallo Hallo2 Hallo3

Experience

Research & Industry

Aug 2025 - Jun 2026

引望智能技术有限公司

Internship focused on intelligent systems and applied AI.

Jan 2025 - Aug 2025

上海智能科学研究院

Research internship in intelligent science and visual AI.

Education

Academic Background

Fudan University

School of Computing and Intelligence Innovation

Master's student in Electronic Information

Sep 2024 - Jun 2027

Contact

Get in Touch

Email 3302023514@qq.com GitHub @NinoNeumann