MEMAD PROJECT: END USER FEEDBACK ON AI IN THE MEDIA PRODUCTION WORKFLOWS

By L. Saarikoski, D. Van Rijsselbergen, M. Hirvonen, M. Koponen, U. Sulubacak, K. Vitikainen 2020-09-15T10:33:00+01:00

No comments

Technical paper: This paper discusses the prototypes built and end-user trials run in the European H2020 project MeMAD (Methods for Managing Audiovisual Data) for implementing more efficient media production based on semiautomated media enrichment tools.

Abstract

This paper discusses the prototypes built and end-user trials run in the European H2020 project MeMAD (Methods for Managing Audiovisual Data) for implementing more efficient media production based on semiautomated media enrichment tools.

The prototypes offer automated content annotation supported by machine translation, cross-language search and retrieval of material and automated multi-lingual video subtitling. Alternative evaluation approaches are described for experimental and close-to-production stage use cases, with the focus alternatively on refining the use cases with qualitative methods or measuring productivity with quantitative methods.

Main findings indicate curious user attitudes towards these types of technologies, with current working practices and individual preferences affecting the results quite strongly. Productivity of subtitling and translation work can be improved by incorporating automated speech recognition (ASR), natural language processing (NLP) and machine translation into the workflows. Using large quantities of metadata raises tool UX design questions and is not fully supported by existing tools. For most purposes tested, the users preferred having the additional metadata available, even in lower quality, instead of hiding or discarding low-quality data.

Introduction

Demonstrations of potential automated metadata extraction services (AME) such as face recognition, automated speech recognition, machine translation and even object detection and scene classification have in the past few years focused on early technical tests or stand-alone user interfaces built to demonstrate the concept. In order to properly evaluate the potential of these deep-learning-based technologies, for which the short-hand term “A.I.” is commonly conveniently used, in media production, the next larger step is to fit these services into existing ecosystems, architectures and workflows in a local context of a media company. This shift from proof-of-concepts (PoC) into production tests marks several important changes, challenges and practical considerations, most notably:

• Envisioned services are for the first time tested in end-to-end workflows instead of isolated sub-processes. Also, the evaluated user experience expands to include all the parts of the user work process and how different parts of the work tie into each other. • On top of the technical performance metrics, a layer of more business-oriented success criteria is introduced, such as productivity and user satisfaction.

Typically, also the amount of data increases between iterations in the evolution from proof-of-concept to in-production use, as amounts of content and number of services involved in a workflow increase. Furthermore, at the stage of production tests, the element of optimizing dataflows is present: Out of the large number of AI services, which ones should be combined and what parts of their data output should be used to create an optimal work process?

The European Horizon2020 project MeMAD attempts to research the challenges mentioned above, with research groups developing the algorithms and other core elements of machine learning technologies such as automated speech recognition (ASR), computer vision and machine translation (MT) for audiovisual media data. Building on these, the project pilots the use of these technologies as iterations of a project prototype, and the most promising elements are further evaluated in a close-to-production use by the Finnish Broadcasting Company Yle, the French National Audiovisual Institute INA, and other interested parties.

This paper focuses on the evaluation of the MeMAD technologies with focus on the stakeholder point of view. The project evaluation activities are referred to as a case study, demonstrating the methods and issues that are relevant in the stage of fitting the project technologies into existing professional production workflows. The full project evaluation reports can be found at https://memad.eu and they are summarized in this paper when needed. New evaluation results will be reported throughout the project and this paper describes the findings as of April 2020, shortly after the second of the three project evaluation rounds has finished.

Download the paper below

Downloads

MEMAD PROJECT END USER FEEDBACK ON AI IN THE MEDIA PRODUCTION WORKFLOWS
PDF, Size 0.53 mb

Topics

No comments

No comments yet

You're not signed in.

Only registered users can comment on this article.

Industry Trends
IBC2024 Accelerator Project: ECOFLOW

2024-07-24T09:38:00Z By John Maxwell Hobbs

How can we make processing, streaming and the consumption of media more sustainable? That’s the question posed by the ECOFLOW: Energy-Conserving Optimization for Future-ready, Low-impact Online Workflows project, proposed by Humans Not Robots and Accedo.tv, with support from Champions ITV and BBC.
News
Warner Bros Discovery reportedly mulls break up

2024-07-24T09:00:00Z By Staff writer

Warner Bros. Discovery (WBD) is reportedly considering a plan to split its digital streaming and studio businesses from its legacy linear television networks.
News
Vivendi plans London listing for Canal+, Amsterdam float for Havas

2024-07-24T09:00:00Z By Staff writer

French media conglomerate Vivendi has revealed more details of its plan to split up its business, which includes listing Canal+ in London and the Havas advertising business on the Euronext Amsterdam stock exchange.

More from Technical papers

Technical Papers
IBC2023 Tech Papers: Technical Overview of Recent AI/DL Model Trends for Super-Resolution Video Enhancement

2023-09-17T13:10:00Z By Nelson Francisco, Julien Le Tanou

IBC2023: This Technical Paper provides a comprehensive overview of state-of-the-art deep learning-based super-resolution methods and their respective advantages and drawbacks, focusing on how they can be tailored for practical deployments in the cloud to mitigate their typical limitations.
Technical Papers
IBC2023 Tech Papers: Daily Context-Adaptive Presentation driven by Personal Data Store

2023-09-17T13:09:00Z By Hiromu Ogawa, Kinji Matsumura, Hiroshi Fujisawa

IBC2023: This Technical Paper demonstrates a system architecture that realizes content presentation based on the user’s moment-to-moment situation by utilizing context recognition and a personal data store.
Technical Papers
IBC2023 Tech Papers: Recommendations for improving on-demand content, post-broadcast derived from an analysis of minute by minute consumption patterns

2023-09-17T13:09:00Z By Michael Armstrong, Iain D. Gilchrist

IBC2023: This Technical Paper outlines a survey of minute-by-minute audience consumption of radio and television programmes, describes the four main patterns of consumption and the way in which these can be characterised through mathematical modelling.