December 20, 2021

One-size fits all image descriptions on the web don't meet the needs of blind people

by Josh Rhoten, University of Colorado at Boulder

visual information — Credit: CC0 Public Domain

Online image descriptions—or "alt-text"—help people who are blind or have low vision easily access information by providing the context and detail needed to interact with websites meaningfully, securely and efficiently.

However, researchers at the University of Colorado Boulder recently published findings that suggest there is still a lot of work to be done to first generate and then improve these descriptions by creators across numerous platforms. The work, published in ACM SIGACCESS Conference on Computers and Accessibility, aims to fill that gap by exploring ways to create training materials that humans and artificial intelligence can use to author more useful image descriptions.

The research was led by CU Boulder alumnus Abigale Stangl along with Assistant Professor Danna Gurari—who recently joined the College of Engineering and Applied Science. Stangl earned her Ph.D. in Technology, Media and Society from the ATLAS Institute in 2019. She currently works remotely for the National Science Foundation as a Computing Research Association Computing Innovation Fellow (CI-Fellow) at the University of Washington.

She said the goal of the work was to investigate how to quickly create image descriptions that are responsive to the context in which they are found—no matter the platform or situation.

"We presented 28 people who are blind with as much information as possible about five images and then asked them to specify what information they would like about the image for the different scenarios," Stangl said. "Each scenario contained a media source in which an image is found and a predetermined information goal. For instance, we considered a person visiting a shopping website to find a gift for a friend as a potential scenario."

Stangl said the work provided several key findings. One was that the information blind people want in an image description changes based on the scenario in which they are encountering the image.

"For alt-text to be accurate, both human and AI systems will need training to author image descriptions that are responsive or context-aware to the user's information goal along with where the image is found," she said.

Other findings suggest that there are some types of information that blind people want for an image across all scenarios, and thus it may be possible to determine what image content should always be included in those descriptions.

During her Ph.D. studies, Stangl volunteered with the Anchor Center for the Blind, the Colorado Center for the Blind and the National Federation for the Blind to better understand the barriers blind people face in gaining access to information and becoming artists and designers themselves. She said she has always been motivated to make sure that end-users and stakeholders are involved in the design process.

"My research with Professor Gurari was essentially a proof of concept that one-size-fits-all image descriptions do not meet the access needs of blind people. In it, we provide reflections and guidance for how our experimental approach may be used and scaled by others interested in creating user-centered training materials for context-aware image descriptions—or at least minimum viable image descriptions," she said. "I am looking forward to continuing it and exploring new approaches and problems in the near future."

Co-authors of the new study include Nitin Verma and Kenneth Fleischmann of The University of Texas at Austin and Meredith Ringel Morris of Microsoft Research.

More information: Abigale Stangl et al, Going Beyond One-Size-Fits-All Image Descriptions to Satisfy the Information Wants of People Who are Blind or Have Low Vision, The 23rd International ACM SIGACCESS Conference on Computers and Accessibility (2021). DOI: 10.1145/3441852.3471233

Provided by University of Colorado at Boulder

Citation: One-size fits all image descriptions on the web don't meet the needs of blind people (2021, December 20) retrieved 16 August 2024 from https://techxplore.com/news/2021-12-one-size-image-descriptions-web-dont.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Keeping the unseen safe: Improving digital privacy for blind people

24 shares

Feedback to editors

China's growing 'robotaxi' fleet sparks concern, wonder on streets

9 minutes ago

Engineers design tiny batteries for powering cell-sized robots

12 hours ago

Leaf-like solar concentrators promise major boost in solar efficiency

12 hours ago

Why does AI beat humans at the strategy game Diplomacy?

13 hours ago

New technique prints metal oxide thin film circuits at room temperature

14 hours ago

Studies highlight challenges and solutions in making large language models trustworthy

15 hours ago

Finding security flaws in Android ahead of malicious hackers

16 hours ago

Robot planning tool accounts for human carelessness

16 hours ago

From shrimp to steel: Introducing nature-inspired metalworking

17 hours ago

'AI Scientist' model designed to conduct scientific research autonomously

17 hours ago

Load comments (0)

One-size fits all image descriptions on the web don't meet the needs of blind people

China's growing 'robotaxi' fleet sparks concern, wonder on streets

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Keeping the unseen safe: Improving digital privacy for blind people

Chrome descriptions of images will clue in blind and low vision users

Blind and sighted readers have sharply different takes on what content is most useful to include in a chart caption

Artificial intelligence that understands object relationships

Machine-learning model could enable robots to understand interactions in the way humans do

Browser extension helps the visually impaired interpret online images

A two-stage framework to improve LLM-based anomaly detection and reactive planning

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Why does AI beat humans at the strategy game Diplomacy?

Studies highlight challenges and solutions in making large language models trustworthy

Phys.org

Medical Xpress

Science X

One-size fits all image descriptions on the web don't meet the needs of blind people

China's growing 'robotaxi' fleet sparks concern, wonder on streets

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Related Stories

Keeping the unseen safe: Improving digital privacy for blind people

Chrome descriptions of images will clue in blind and low vision users

Blind and sighted readers have sharply different takes on what content is most useful to include in a chart caption

Artificial intelligence that understands object relationships

Machine-learning model could enable robots to understand interactions in the way humans do

Browser extension helps the visually impaired interpret online images

Recommended for you

A two-stage framework to improve LLM-based anomaly detection and reactive planning

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Why does AI beat humans at the strategy game Diplomacy?

Studies highlight challenges and solutions in making large language models trustworthy

Your Privacy