BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Asia/Seoul
X-LIC-LOCATION:Asia/Seoul
BEGIN:STANDARD
TZOFFSETFROM:+0900
TZOFFSETTO:+0900
TZNAME:KST
DTSTART:18871231T000000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20230103T035307Z
LOCATION:Auditorium\, Level 5\, West Wing
DTSTART;TZID=Asia/Seoul:20221206T100000
DTEND;TZID=Asia/Seoul:20221206T120000
UID:siggraphasia_SIGGRAPH Asia 2022_sess153_papers_191@linklings.com
SUMMARY:Video-driven Neural Physically-based Facial Asset for Production
DESCRIPTION:Technical Papers\n\nVideo-driven Neural Physically-based Facial Asset for Production\n\nZhang, Zeng, Zhang, Lin, Cao...\n\nProduction-level workflows for producing convincing 3D dynamic human faces have long relied on an assortment of labor-intensive tools for geometry and texture generation, motion capture and rigging, and expression synthesis. Recent neural approaches automate individual components, but the corresponding latent representations do not give artists the explicit controls of conventional tools. In this paper, we present a new learning-based, video-driven approach for generating dynamic facial geometries with high-quality physically-based assets. Two key components are well-structured latent spaces, obtained through dense temporal sampling from videos, and explicit facial expression controls that regulate those latent spaces. For data collection, we construct a hybrid multiview-photometric capture stage, coupled with ultra-fast video cameras, to obtain raw 3D facial assets. We then model facial expression, geometry, and physically-based textures using separate VAEs, imposing a global multi-layer perceptron (MLP) based expression mapping across the latent spaces of the respective networks to preserve characteristics across attributes while maintaining explicit control over facial geometry and texture generation. We also introduce the idea of modeling the delta information as wrinkle maps for the physically-based textures in our texture VAE, achieving high-quality 4K rendering of dynamic textures. We demonstrate our approach on high-fidelity performer-specific facial capture and on cross-identity facial motion transfer and retargeting. In addition, our multi-VAE-based neural asset, along with fast adaptation schemes, can be deployed to handle in-the-wild videos. Furthermore, we demonstrate the utility of our explicit facial disentangling strategy through promising physically-based editing results, such as geometry and material editing and wrinkle transfer, with high realism. Comprehensive experiments show that our technique provides higher accuracy and visual fidelity than previous video-driven facial reconstruction and animation methods.\n\nRegistration Category: FULL ACCESS, EXPERIENCE PLUS ACCESS, EXPERIENCE ACCESS, TRADE EXHIBITOR\n\nLanguage: ENGLISH\n\nFormat: IN-PERSON
URL:https://sa2022.siggraph.org/en/full-program/?id=papers_191&sess=sess153
END:VEVENT
END:VCALENDAR