Technical paper: This paper introduces a workflow to down-convert existing UHD HDR videos to their HD SDR versions and proposes a joint super-resolution, gamut extension, and inverse tone-mapping network.
With the rapid development of display technology in recent years, ultra-high definition (UHD) high dynamic range (HDR) displays have emerged in consumer markets. However, due to the lack of UHD HDR video content, it is necessary to up-convert legacy high definition (HD) standard dynamic range (SDR) videos to their UHD HDR versions.
In this paper, we first introduce a workflow to down-convert existing UHD HDR videos to their HD SDR versions and then propose a joint super-resolution, gamut extension, and inverse tone-mapping network (JSGIN), which directly learns the up-conversion from HD SDR videos to their UHD HDR versions. JSGIN enhances the viewing experience by reconstructing lost information and achieves better subjective visual quality with fewer artifacts than recent state-of-the-art methods.
Display technology has developed rapidly in recent years, and ultra-high definition (UHD) high dynamic range (HDR) displays have become available to consumers. Nevertheless, because of the shortage of UHD HDR video content, legacy high definition (HD) standard dynamic range (SDR) videos need to be up-converted to their UHD HDR versions. Compared with current HD SDR television systems (1), UHD television systems (2) provide higher spatial resolution and a wider colour gamut, and HDR television systems (3) provide a higher dynamic range.
Super-resolution (SR) methods up-scale low-resolution images to high-resolution images. Recent convolutional neural network (CNN) based methods have achieved considerable improvements over conventional SR methods. SRCNN (Dong et al. (4)) was the first CNN-based SR method. The CNN architecture was subsequently improved in various ways, such as sub-pixel convolution (Shi et al. (5)) and modified residual blocks (Lim et al. (6)).
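The core of sub-pixel convolution (5) is a rearrangement step, often called pixel shuffle: a convolution first produces C·r² feature channels at low resolution, which are then reordered into an image that is r× larger in each spatial dimension. A minimal NumPy sketch of that rearrangement (the function name and array layout are illustrative, not from the cited work):

```python
import numpy as np

def pixel_shuffle(x, r):
    """Rearrange a (C*r^2, H, W) array into (C, H*r, W*r),
    as done by the sub-pixel convolution layer's final step."""
    c_r2, h, w = x.shape
    c = c_r2 // (r * r)
    x = x.reshape(c, r, r, h, w)       # split channels into (C, r, r)
    x = x.transpose(0, 3, 1, 4, 2)     # interleave: (C, H, r, W, r)
    return x.reshape(c, h * r, w * r)  # merge into (C, H*r, W*r)
```

Because the up-scaling happens only at the very end, all preceding convolutions run at low resolution, which is what makes this layer efficient.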
Gamut extension (GE) algorithms extend colours from a source gamut to a wider destination gamut. Linear colour space conversion cannot restore colour information outside the source gamut, so conventional GE algorithms attempt to make full use of the wider destination gamut. Recently, Takeuchi et al. (7) proposed a CNN-based GE algorithm that achieves significant gains over conventional GE algorithms.
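Why linear conversion alone cannot widen the gamut can be seen from the colorimetric BT.709-to-BT.2020 matrix specified in ITU-R BT.2087: it reproduces each colour exactly, so every converted colour stays inside the BT.709 sub-gamut of BT.2020, and a fully saturated BT.709 primary becomes a visibly unsaturated BT.2020 code value. A small NumPy illustration:

```python
import numpy as np

# Linear-light RGB conversion matrix from BT.709 to BT.2020 primaries
# (ITU-R BT.2087). Purely colorimetric: same colours, new encoding.
M_709_TO_2020 = np.array([
    [0.6274, 0.3293, 0.0433],
    [0.0691, 0.9195, 0.0114],
    [0.0164, 0.0880, 0.8956],
])

def convert_709_to_2020(rgb_linear):
    """Map linear BT.709 RGB to linear BT.2020 RGB."""
    return M_709_TO_2020 @ np.asarray(rgb_linear, dtype=float)

# Saturated BT.709 red lands well inside the BT.2020 gamut,
# leaving the extra colour volume unused:
red_2020 = convert_709_to_2020([1.0, 0.0, 0.0])
```

GE algorithms, CNN-based or conventional, exist precisely to populate that otherwise unused region of the destination gamut in a perceptually plausible way.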
Inverse tone-mapping (ITM) methods expand SDR images to HDR images. Whereas conventional ITM methods only map the dynamic range, CNN-based ITM methods can also restore lost details in highlights and shadows. Eilertsen et al. (8) introduced a deep learning system that reconstructs an HDR image from a single-exposure SDR image.
UHD HDR videos can be reconstructed from HD SDR videos by cascading SR, GE, and ITM methods. However, errors from each stage may accumulate through the cascade, leading to less accurate results and higher overall complexity than joint learning of SR, GE, and ITM. A multi-purpose CNN structure (Kim and Kim (9)) was first proposed to jointly learn SR, GE, and ITM and directly up-convert HD SDR videos to UHD HDR videos. Deep SR-ITM (Kim et al. (10)) then improved on (9) by introducing input decomposition methods and modulation blocks.
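The error-accumulation argument can be made concrete with placeholder stages: each stage consumes only the previous stage's output, so an artefact introduced early (here, blocking from nearest-neighbour upscaling) is carried into GE and then amplified by the range expansion of ITM. The stage functions below are deliberately simplistic stand-ins for illustration, not the networks discussed above:

```python
import numpy as np

def sr_stage(x):
    # Placeholder 2x upscaling: nearest-neighbour introduces blocking.
    return np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)

def ge_stage(x):
    # Placeholder gamut extension: mild channel-independent boost.
    return np.clip(1.05 * x, 0.0, None)

def itm_stage(x):
    # Placeholder range expansion: amplifies any earlier errors.
    return x ** 2.0

def cascaded_upconvert(hd_sdr):
    # Each stage sees only the previous stage's (possibly flawed)
    # output; a jointly trained network instead optimises one model
    # end-to-end against the final UHD HDR target.
    return itm_stage(ge_stage(sr_stage(hd_sdr)))
```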
ResNet (He et al. (11)) introduced local residual learning to ease the training of deep CNNs. Global residual learning for SR was first adopted by VDSR (Kim et al. (12)) to facilitate training convergence of a deep CNN. Our method adopts both local and global residual learning.
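The two skip-connection patterns can be sketched side by side. To keep the example self-contained, the "convolution" below is an elementwise scaling; the names, weights, and block count are illustrative, not from our network:

```python
import numpy as np

def conv_like(x, w):
    """Stand-in for a learned convolution: elementwise scaling."""
    return w * x

def residual_block(x, w1=0.1, w2=0.1):
    # Local residual learning (ResNet (11)): each block adds a learned
    # correction to its own input through a short skip connection.
    return x + conv_like(conv_like(x, w1), w2)

def network(x, n_blocks=4, w_out=0.05):
    # Global residual learning (VDSR (12)): the whole network predicts
    # a residual that is added to the input via a long skip connection,
    # so it only has to learn the difference from the input.
    y = x
    for _ in range(n_blocks):
        y = residual_block(y)
    return x + conv_like(y, w_out)
```

Both skips give gradients a direct path to earlier layers, which is why combining them eases training of deep reconstruction networks.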
In this paper, we first introduce a workflow to down-convert existing UHD HDR videos to their HD SDR versions. Then, we propose a single CNN that jointly learns SR, GE, and ITM and directly up-converts HD SDR videos to their UHD HDR versions. Compared with recent state-of-the-art methods (9), (10), the UHD HDR videos generated by our method provide a better visual experience.