May 29, 2020

How to develop accurate and efficient methods for audiovisual content management

Audiovisual media content is not only an essential tool for communication and entertainment, it is also seen as a useful source of modern history. To enable everyone to benefit from such informative documents, it's crucial to translate moving images and sounds into words in an efficient and cost-effective way. Enter the EU-funded MeMAD project that is developing automatic language-based methods for managing, accessing and publishing pre-existing and originally produced digital content within the creative industries. Focusing on TV broadcasting and on-demand media services, the MeMAD project also aims to enhance digital storytelling.

Over two years into the project, MeMAD partners have developed a prototype platform to help audiovisual content professionals. They have also evaluated various aspects of the platform, as noted in a blog post on the project website. "There were four tracks of evaluation: video editing assistance, searching, intralingual subtitling with the help of automatic speech recognition (ASR) and interlingual subtitling with the help of machine translation (MT)."

The same blog post also states: "In all evaluations, participants filled out User Experience Questionnaire type forms (UEQ) after each task, adapted to focus on the task itself rather than the user interface. After each evaluation session there was a brief semi-structured interview. Additionally, in the video editing assistance and searching evaluations think-aloud data was collected." The blog post adds that ASR transcripts and MT were particularly found "to be useful in both video editing assistance and in archive searching, though there is still room for improvement."

In the same blog post, project partners emphasise that presentation and searchability of the metadata could also be improved. "It would be beneficial to be able to search for segments where a certain person speaks about a topic, combining face recognition and speech recognition data. Segment lengths also need to be examined, though in longer videos shorter segments may result in too many markers for a video editing tool to handle." The blog post continues: "Future evaluations will incorporate face recognition and visual object detection. The same participants should be used in future evaluations, where possible, since they are already familiar with the platform and can compare results."

Novel tools

The MeMAD (Methods for Managing Audiovisual Data: Combining Automatic Efficiency with Human Accuracy) project uses machine learning and processing to enable easier retrieval of data from large volumes of content and from across several languages. It also aims to provide content creators with novel tools to better structure content and automate the delivery of content derivatives to various platforms like social media. Thanks to MT that is used for speech transcripts and subtitles, content will be available for new audiences in foreign languages and also be more accessible to people with hearing and/or vision impairment. One example for various use cases of MeMAD involves automated transcription, translation and subtitling. As a result, video editors editing interviews in a foreign language can work without the need for interpreters. In another use case, visually impaired consumers can follow current affairs shows with the help of auto-generated content descriptions that have been made available.

A periodic project report on CORDIS notes: "The key to the innovation is to provide Creative Industries with a common representation for the master data during the production processes, so that the current document-oriented editorial processes can be substituted with a more structured approach."

More information: MeMAD project website: memad.eu/

Provided by CORDIS

Citation: How to develop accurate and efficient methods for audiovisual content management (2020, May 29) retrieved 30 June 2024 from https://techxplore.com/news/2020-05-accurate-efficient-methods-audiovisual-content.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Computer scientists develop new tool that generates videos from themed text

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

How to develop accurate and efficient methods for audiovisual content management

Novel tools

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Computer scientists develop new tool that generates videos from themed text

YouTube offers small businesses a free tool to create videos

Facebook to offer interactive game shows on video platform

Using artificial intelligence to investigate illegal wildlife trade on social media

Twitter buys SnappyTV video sharing service

MixedEmotions: open-source toolbox for emotion analysis

Underwater signals generated by open sea airplane crashes could be key to detecting final resting place of MH370

Best of Last Year: The top Tech Xplore articles of 2023

Researchers test 3D printing materials used for art projects to see if they will stand the test of time

Best of Last Year: The top Tech Xplore articles of 2022

Are we there yet? Time slows down on crowded train

Trust in algorithmic advice from computers can blind us to mistakes, says study

Phys.org

Medical Xpress

Science X

How to develop accurate and efficient methods for audiovisual content management

Novel tools

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Computer scientists develop new tool that generates videos from themed text

YouTube offers small businesses a free tool to create videos

Facebook to offer interactive game shows on video platform

Using artificial intelligence to investigate illegal wildlife trade on social media

Twitter buys SnappyTV video sharing service

MixedEmotions: open-source toolbox for emotion analysis

Recommended for you

Underwater signals generated by open sea airplane crashes could be key to detecting final resting place of MH370

Best of Last Year: The top Tech Xplore articles of 2023

Researchers test 3D printing materials used for art projects to see if they will stand the test of time

Best of Last Year: The top Tech Xplore articles of 2022

Are we there yet? Time slows down on crowded train

Trust in algorithmic advice from computers can blind us to mistakes, says study

Your Privacy