{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T01:39:12Z","timestamp":1774402752761,"version":"3.50.1"},"reference-count":32,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2013,7,21]],"date-time":"2013-07-21T00:00:00Z","timestamp":1374364800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2013,7,21]]},"abstract":"<jats:p>Stochastic sampling in time and over the lens is essential to produce photo-realistic images, and it has the potential to revolutionize real-time graphics. In this paper, we take an architectural view of the problem and propose a novel hardware architecture for efficient shading in the context of stochastic rendering. We replace previous caching mechanisms by a sorting step to extract coherence, thereby ensuring that only non-occluded samples are shaded. The memory bandwidth is kept at a minimum by operating on tiles and using new buffer compression methods. Our architecture has several unique benefits not traditionally associated with deferred shading. First, shading is performed in primitive order, which enables late shading of vertex attributes and avoids the need to generate a G-buffer of pre-interpolated vertex attributes. Second, we support state changes, e.g., change of shaders and resources in the deferred shading pass, avoiding the need for a single \u00fcber-shader. We perform an extensive architectural simulation to quantify the benefits of our algorithm on real workloads.<\/jats:p>","DOI":"10.1145\/2461912.2462022","type":"journal-article","created":{"date-parts":[[2013,7,16]],"date-time":"2013-07-16T18:06:45Z","timestamp":1373998005000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":28,"title":["A sort-based deferred shading architecture for decoupled sampling"],"prefix":"10.1145","volume":"32","author":[{"given":"Petrik","family":"Clarberg","sequence":"first","affiliation":[{"name":"Intel Corporation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Robert","family":"Toth","sequence":"additional","affiliation":[{"name":"Intel Corporation"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jacob","family":"Munkberg","sequence":"additional","affiliation":[{"name":"Intel Corporation"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2013,7,21]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/166117.166131"},{"key":"e_1_2_2_2_1","unstructured":"Akenine-M\u00f6ller T. Munkberg J. and Hasselgren J. 2007. Stochastic Rasterization using Time-Continuous Triangles. In Graphics Hardware 7--16.   Akenine-M\u00f6ller T. Munkberg J. and Hasselgren J. 2007. Stochastic Rasterization using Time-Continuous Triangles. In Graphics Hardware 7--16."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2018323.2018343"},{"key":"e_1_2_2_4_1","unstructured":"Boulos S. Luong E. Fatahalian K. Moreton H. and Hanrahan P. 2010. Space-Time Hierarchical Occlusion Culling for Micropolygon Rendering with Motion Blur. In High Performance Graphics 11--18.   Boulos S. Luong E. Fatahalian K. Moreton H. and Hanrahan P. 2010. Space-Time Hierarchical Occlusion Culling for Micropolygon Rendering with Motion Blur. In High Performance Graphics 11--18."},{"key":"e_1_2_2_5_1","unstructured":"Burns C. A. Fatahalian K. and Mark W. R. 2010. A Lazy Object-Space Shading Architecture with Decoupled Sampling. In High Performance Graphics 19--28.   Burns C. A. Fatahalian K. and Mark W. R. 2010. A Lazy Object-Space Shading Architecture with Decoupled Sampling. In High Performance Graphics 19--28."},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/37401.37414"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/54852.378468"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/74333.74341"},{"key":"e_1_2_2_9_1","unstructured":"Harada T. McKee J. and Yang J. C. 2012. Forward+: Bringing Deferred Lighting to the Next Level. In Eurographics 2012 -- Short Papers 5--8.  Harada T. McKee J. and Yang J. C. 2012. Forward+: Bringing Deferred Lighting to the Next Level. In Eurographics 2012 -- Short Papers 5--8."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1283900.1283917"},{"key":"e_1_2_2_11_1","unstructured":"Imagination Technologies Ltd. 2011. POWERVR Series5 Graphics -- SGX architecture guide for developers.  Imagination Technologies Ltd. 2011. POWERVR Series5 Graphics -- SGX architecture guide for developers."},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1137\/070709359"},{"key":"e_1_2_2_13_1","volume-title":"Tech. Rep. NVR-2011-003","author":"Laine S.","year":"2011"},{"key":"e_1_2_2_14_1","volume-title":"Tech. Rep. NVR-2011-004","author":"Laine S.","year":"2011"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1965001"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964950"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2159616.2159640"},{"key":"e_1_2_2_18_1","volume-title":"Real-Time Stochastic Rasterization on Conventional GPU Architectures. In High Performance Graphics, 173--182","author":"McGuire M."},{"key":"e_1_2_2_19_1","unstructured":"Morein S. 2000. ATI Radeon HyperZ Technology. In Graphics Hardware Hot3D Proceedings.  Morein S. 2000. ATI Radeon HyperZ Technology. In Graphics Hardware Hot3D Proceedings ."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1080\/2151237X.2011.563695"},{"key":"e_1_2_2_21_1","unstructured":"Munkberg J. and Akenine-M\u00f6ller T. 2012. Hyperplane Culling for Stochastic Rasterization. In Eurographics 2012 -- Short Papers 105--108.  Munkberg J. and Akenine-M\u00f6ller T. 2012. Hyperplane Culling for Stochastic Rasterization. In Eurographics 2012 -- Short Papers 105--108."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2018323.2018341"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/2383795.2383797"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1080\/2151237X.2011.621761"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/2383795.2383809"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1966394.1966396"},{"key":"e_1_2_2_27_1","unstructured":"Rasmusson J. Hasselgren J. and Akenine-M\u00f6ller T. 2007. Exact and Error-Bounded Approximate Color Buffer Compression and Decompression. In Graphics Hardware 41--48.   Rasmusson J. Hasselgren J. and Akenine-M\u00f6ller T. 2007. Exact and Error-Bounded Approximate Color Buffer Compression and Decompression. In Graphics Hardware 41--48."},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/97879.97901"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1360612.1360617"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1944745.1944747"},{"key":"e_1_2_2_31_1","unstructured":"Str\u00f6m J. Wennersten P. Rasmusson J. Hasselgren J. Munkberg J. Clarberg P. and Akenine-M\u00f6ller T. 2008. Floating-Point Buffer Compression in a Unified Codec Architecture. In Graphics Hardware 75--84.   Str\u00f6m J. Wennersten P. Rasmusson J. Hasselgren J. Munkberg J. Clarberg P. and Akenine-M\u00f6ller T. 2008. Floating-Point Buffer Compression in a Unified Codec Architecture. In Graphics Hardware 75--84."},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/2383795.2383798"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2461912.2462022","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2461912.2462022","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:48:39Z","timestamp":1750236519000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2461912.2462022"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,7,21]]},"references-count":32,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2013,7,21]]}},"alternative-id":["10.1145\/2461912.2462022"],"URL":"https:\/\/doi.org\/10.1145\/2461912.2462022","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,7,21]]},"assertion":[{"value":"2013-07-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}