BEGIN:VCALENDAR VERSION:2.0 PRODID:Linklings LLC BEGIN:VTIMEZONE TZID:Asia/Seoul X-LIC-LOCATION:Asia/Seoul BEGIN:STANDARD TZOFFSETFROM:+0900 TZOFFSETTO:+0900 TZNAME:KST DTSTART:18871231T000000 DTSTART:19881009T020000 END:STANDARD END:VTIMEZONE BEGIN:VEVENT DTSTAMP:20230103T035307Z LOCATION:Auditorium\, Level 5\, West Wing DTSTART;TZID=Asia/Seoul:20221206T100000 DTEND;TZID=Asia/Seoul:20221206T120000 UID:siggraphasia_SIGGRAPH Asia 2022_sess153_papers_448@linklings.com SUMMARY:ICARUS: A Specialized Architecture for Neural Radiance Fields Rend ering DESCRIPTION:Technical Papers\n\nICARUS: A Specialized Architecture for Neu ral Radiance Fields Rendering\n\nRao, Yu, Wan, Zhou, Zheng...\n\nThe pract ical deployment of Neural Radiance Fields (NeRF) in rendering applications faces several challenges, with the most critical one being low rendering speed on even high-end graphic processing units (GPUs). In this paper, we present ICARUS, a specialized accelerator architecture tailored for NeRF r endering. Unlike GPUs using general purpose computing and memory architect ures for NeRF, ICARUS executes the complete NeRF pipeline using dedicated plenoptic cores (PLCore) consisting of a positional encoding unit (PEU), a multi-layer perceptron (MLP) engine, and a volume rendering unit (VRU). A PLCore takes in positions \& directions and renders the corresponding pix el colors without any intermediate data going off-chip for temporary stora ge and exchange, which can be time and power consuming. To implement the m ost expensive component of NeRF, i.e., the MLP, we transform the fully con nected operations to approximated reconfigurable multiple constant multipl ications (MCMs), where common subexpressions are shared across different m ultiplications to improve the computation efficiency. We build a prototype ICARUS using Synopsys HAPS-80 S104, a field programmable gate array (FPGA )-based prototyping system for large-scale integrated circuits and systems design. We evaluate the power-performance-area (PPA) of a PLCore using 40 nm LP CMOS technology. Working at 400 MHz, a single PLCore occupies 16.5 $ mm^2$ and consumes 282.8 mW, translating to 0.105 uJ/sample. The results a re compared with those of GPU and tensor processing unit (TPU) implementat ions.\n\nRegistration Category: FULL ACCESS, EXPERIENCE PLUS ACCESS, EXPER IENCE ACCESS, TRADE EXHIBITOR\n\nLanguage: ENGLISH\n\nFormat: IN-PERSON URL:https://sa2022.siggraph.org/en/full-program/?id=papers_448&sess=sess15 3 END:VEVENT END:VCALENDAR