User-Driven Automated Audio Description to Enhance Video Accessibility for Blind and Low Vision Users
Description
Audio descriptions (AD) make videos accessible for blind and low vision (BLV) users by describing visual elements that cannot be understood from the main audio track. AD created by professionals or novice describers is time-consuming and lacks scalability while offering little control to BLV viewers on description length and content and when they receive it. To address this gap, this work explores user-driven AI-generated descriptions, where the BLV viewer controls when they receive descriptions. In a study, 20 BLV participants activated audio descriptions for seven different video genres with two levels of detail: concise and detailed. Results show differences in AD frequency and level of detail BLV users wanted for different videos, their sense of control with this style of AD delivery, its limitations, and variations among BLV users in their AD needs and perception of AI-generated descriptions. The implications of these findings for future AI-based AD tools are discussed.
Details
Contributors
- Cheema, Maryam Saadat (Author)
- Seifi, Hasti (Thesis advisor)
- Fazli, Pooyan (Committee member)
- Kurniawan, Sri (Committee member)
- Arizona State University (Publisher)
Date Created
The date the item was original created (prior to any relationship with the ASU Digital Repositories.)
2024
Topical Subject
Resource Type
Language
- eng
Note
- Partial requirement for: M.S., Arizona State University, 2024
- Field of study: Computer Science
Additional Information
English
Extent
- 57 pages