The subject of advances in audio is explored by Rob Oldfield in his paper 'Cloud-based AI for automatic audio production' and by Matteo Torcoli in the paper 'Dialog+ in Broadcasting: First Field Tests using Deep-Learning-based Dialogue Enhancement.'

Every broadcaster knows that the most common complaint from viewers is that programme dialogue is hard to discern against a background of atmospheric sounds, mood music and competing voices. It is especially a problem for older viewers: 90% of people over 60 report difficulties.

Decades of research have sought a way of enhancing the intelligibility of TV dialogue, without success, until now! We present the results of trials by a collaboration of researchers using their deep-neural-network-based technology across a wide range of TV content and age groups. The results show startling performance; join us to judge the benefits for yourself!

Advances in audio

We shall also hear about exciting research using cloud-based AI and 5G connectivity to deliver live immersive experiences to a variety of consumer devices. Key to the experience is the viewers' ability to change their viewpoint on the content, with live rendering taking place in the cloud. The presentation focuses on the audio, which is object-based and AI-driven and carries with it the metadata necessary for personalised rendering of the scene.

The capture of the background is also critical to recreating the audio scene; for this, the team chose second-order ambisonics accompanied by Serialised Audio Definition Model descriptive metadata. The presentation will explore detailed aspects of the audio processing and production. Altogether, a fascinating glimpse of the technology required to convey 360° audio for free-viewpoint XR!
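As background to that choice: a full-sphere ambisonic representation of order N uses (N+1)² channels, so the second-order capture mentioned above carries nine channels of scene audio. A minimal sketch of that relationship (the function name is illustrative, not from the paper):

```python
def ambisonic_channel_count(order: int) -> int:
    """Channels in a full-sphere ambisonic scene of the given order: (N + 1)^2."""
    return (order + 1) ** 2

# First order (classic B-format) uses 4 channels; second order, as chosen
# by the team for the background capture, uses 9.
print(ambisonic_channel_count(1))  # → 4
print(ambisonic_channel_count(2))  # → 9
```

Higher orders sharpen spatial resolution at the cost of a quadratically growing channel count, which is one reason second order is a common compromise for broadcast capture.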

