FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion

Zhihua Zhong1,4, Jingsen Zhu1 (co-authors), Yuxin Dai3, Chuankun Zheng1, Yuchi Huo2,1, Guanlin Chen4, Hujun Bao1, Rui Wang1
1State Key Lab of CAD&CG, Zhejiang University; 2Zhejiang Lab; 3Zhejiang A&F University; 4Zhejiang University City College

SIGGRAPH Asia 2023

[Teaser figure — panels: Input / Ours / GT / NSRR] 8×8 (64×) super-resolution result in KITE.


The workload of real-time rendering is steeply increasing as demand for high resolution, high refresh rates, and high realism rises, overwhelming most graphics cards. To mitigate this problem, one of the most popular solutions is to render images at a low resolution to reduce rendering overhead, and then accurately upsample the low-resolution rendered image to the target resolution, a.k.a. super-resolution. Most existing methods focus on exploiting information from low-resolution inputs, such as historical frames; the absence of high-frequency details in those LR inputs makes it hard for them to recover fine details in their high-resolution predictions. We instead take LR images together with easily obtainable HR G-buffers as input, which requires the network to align and fuse features across multiple resolution levels. We introduce an efficient and effective H-Net architecture to solve this problem and significantly reduce rendering overhead without noticeable quality deterioration. Experiments show that our method produces temporally consistent reconstructions in 4×4 and even challenging 8×8 upsampling cases at 4K resolution with real-time performance, with substantially improved quality and a significant performance boost compared to existing works.

Multi-resolution Alignment

To get the utmost out of HR auxiliary features, we propose H-Net, which aligns multi-resolution data in a homologous low-resolution screen space. Fusing multi-resolution features into a low-resolution feature map not only aggregates all the data that share the same screen-space coordinates, but also compresses them into a compact low-resolution form, so H-Net gains advantages in both quality and speed.
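The alignment step above can be sketched with a space-to-depth fold: each r×r block of HR G-buffer samples that projects to one LR pixel is stacked into channels, then concatenated with the LR image features. This is a minimal NumPy illustration of the idea, not the paper's actual H-Net implementation; the function names and shapes are our own assumptions.

```python
import numpy as np

def space_to_depth(x, r):
    # Fold an HR feature map (C, H*r, W*r) into LR screen space
    # (C*r*r, H, W): every LR pixel now holds all r*r HR samples
    # that land at the same screen-space location.
    C, Hr, Wr = x.shape
    H, W = Hr // r, Wr // r
    x = x.reshape(C, H, r, W, r)
    x = x.transpose(0, 2, 4, 1, 3)        # (C, r, r, H, W)
    return x.reshape(C * r * r, H, W)

def fuse(lr_feat, hr_gbuf, r):
    # Concatenate LR image features with the folded HR G-buffer
    # features along the channel axis, all at LR spatial size.
    return np.concatenate([lr_feat, space_to_depth(hr_gbuf, r)], axis=0)
```

Operating on this fused LR tensor lets the network see HR G-buffer detail while paying only LR-resolution compute, which is where the speed advantage comes from.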

BRDF Pre-integrated Demodulation

[Figure — panels: LR RGB space / LR demodulated irradiance / HR pre-integrated BRDF]

We further improve the quality using demodulation. With demodulation, the inference target of the neural network changes from the high-frequency RGB color to a low-frequency irradiance term, which is easier to upsample; the final HR color is then recovered by modulating with the HR pre-integrated BRDF.
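The demodulate/remodulate round trip can be sketched as follows. This is an illustrative NumPy sketch under our own assumptions (the epsilon value and function names are not from the paper):

```python
import numpy as np

EPS = 1e-4  # assumed clamp to avoid division by zero in dark BRDF regions

def demodulate(lr_color, lr_brdf):
    # Factor the pre-integrated BRDF out of the rendered color so the
    # network only has to upsample the smoother irradiance signal.
    return lr_color / np.maximum(lr_brdf, EPS)

def remodulate(hr_irradiance, hr_brdf):
    # Multiply the upsampled irradiance back with the pre-integrated
    # BRDF rendered directly at the target (HR) resolution.
    return hr_irradiance * hr_brdf
```

Because the HR pre-integrated BRDF is rendered exactly at target resolution, its high-frequency material detail re-enters the output without the network ever having to predict it.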




PSNR (dB, higher is better):

Kite      32.33  31.22  27.74  28.00  29.12  28.30  30.21  25.00  25.72
Showdown  36.32  31.42  30.27  29.17  26.29  29.31  33.61  29.17  25.62
Slay      37.02  34.41  35.42  35.39  32.39  34.94  34.26  32.12  33.47
City      28.94  28.66  27.65  28.23  26.56  27.15  27.20  25.95  26.46


SSIM (higher is better):

Kite      0.933  0.900  0.832  0.829  0.887  0.893  0.899  0.765  0.770
Showdown  0.976  0.949  0.945  0.914  0.866  0.917  0.955  0.914  0.813
Slay      0.972  0.958  0.962  0.963  0.928  0.944  0.957  0.939  0.943
City      0.921  0.901  0.899  0.896  0.836  0.888  0.916  0.873  0.873

FuseSR outperforms state-of-the-art methods in both quality and speed.

Temporal Results


Citation

    @inproceedings{zhong2023fusesr,
        title={FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion},
        author={Zhong, Zhihua and Zhu, Jingsen and Dai, Yuxin and Zheng, Chuankun and Chen, Guanlin and Huo, Yuchi and Bao, Hujun and Wang, Rui},
        booktitle={SIGGRAPH Asia 2023 Conference Papers},
        year={2023}
    }