Yanhong Zeng

Yanhong Zeng is currently a researcher at Shanghai AI Laboratory. Before that, she obtained her Ph.D. degree in computer science in 2022 through the joint doctoral program between Sun Yat-sen University and Microsoft Research Asia (MSRA), supervised by Prof. Hongyang Chao and Dr. Baining Guo. Her research interests lie in generative AI (AIGC), specifically controllable multimodal (e.g., text, pixels, audio) generation and image/video editing. She is keenly interested in real-world applications and studies of generative AI, and is open to all kinds of chats, communication, and collaboration.
WeChat/Discord/Gmail/X: zengyh1900

CV / Email / Google Scholar / Twitter / GitHub / LinkedIn

🎉 HumanVid is accepted by NeurIPS 2024 (D&B Track).

[2024.09] 🎉 MotionBooth is accepted by NeurIPS 2024 (Spotlight).

[2024.07] 🎉 PowerPaint is accepted by ECCV 2024.

[2024.03] 🎉 PIA and Make-it-Vivid are accepted by CVPR 2024.

[2024.02] 🔥 Our technology has been shipped in the animation series "Poems of Timeless Acclaim", which is broadcast in over 10 languages and on more than 70 mainstream media platforms overseas. It reached nearly 100 million viewers worldwide within two weeks.

[2024.01] 🔥 We release MagicMaker, an AI platform that supports image generation, editing, and animation!

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models
Zhening Xing, Gereon Fox, Yanhong Zeng, Xingang Pan, Mohamed Elgharib, Christian Theobalt, Kai Chen
arXiv, 2024
project page / video / arXiv / demo
Live2Diff is the first attempt to bring uni-directional attention modeling to video diffusion models for live video stream processing, and it achieves 16 FPS on an RTX 4090 GPU.

A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting
Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen
ECCV, 2024
project page / video / arXiv / demo
PowerPaint is the first versatile inpainting model that achieves SOTA in text-guided and shape-guided object inpainting, object removal, outpainting, etc.

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
Yanhong Zeng*, Hongwei Xue*, Tiankai Hang*, Yuchong Sun*, Bei Liu, Huan Yang, Jianlong Fu, Baining Guo
CVPR, 2022
arXiv / video
We collect a large-scale dataset that is both the first high-resolution dataset of its kind, with 371.5k hours of 720p video, and the most diverse, covering 15 popular YouTube categories.

Tutorial Talk (ICCV 2023): MMagic: Multimodal Advanced, Generative and Intelligent Creation

Tutorial Talk (CVPR 2023): Learning to Generate, Edit, and Enhance Images and Videos with MMagic

Invited Talk: Towards High-Quality Image Inpainting ( Microsoft China Video Center on Bilibili Live 2019 )

Award: ICML 2022 Outstanding Reviewer.

Award: National Scholarship in 2021 (Top 1% in SYSU).

Award: Outstanding Undergraduate Thesis in 2017.

Award: Outstanding Undergraduate in 2017.

Award: National Scholarship in 2016 (Top 1% in SYSU).

Award: First Prize Excellence Scholarship in 2013, 2014, 2015.