Audio Description
Audio Descriptions
Under WCAG 2.1, prerecorded synchronized media must provide access to important visual information. At Level AA, audio description is required when essential visual content is not already communicated in the existing audio or narration.
Accessibility Check
For prerecorded video, provide audio description whenever essential visual details are needed for understanding and those details are not already spoken in the video.
WCAG 2.1 Level AA: Audio Description
Success Criterion 1.2.5 (Audio Description—Prerecorded): Audio description must be provided for all prerecorded video content in synchronized media unless all important visual information is already conveyed in the existing audio.
What to describe: Include important visual details that are not communicated in the main audio, such as actions, characters, scene changes, facial expressions, body language, scenery, and on-screen text when they are necessary for understanding the content.
Ways to implement audio description:
- Separate audio track: A secondary, user-selectable track that contains audio description.
- Integrated audio description: Descriptions are built into the main soundtrack during natural pauses in dialogue.
- Extended audio description: The video is paused to allow time for additional description when natural pauses are too short.
Note: If the original audio already communicates all important visual information, additional audio description is not required.
Learning Objectives
By the end of this chapter, you should be able to:
- Explain when audio description is required under WCAG 2.1 Level AA.
- Differentiate between integrated descriptions and post-production audio descriptions.
- Identify when audio descriptions should be included for a video.
- Write concise, relevant audio descriptions for instructional media.
- Identify tools and resources that support accessible video production.
Key Terms
- Integrated description: The practice of including descriptions of important visual content as part of the original video narration.
- Post-production audio description: The process of adding audio descriptions to a video after the original has been created, often through separate recorded narration.
- Standard audio description: Audio descriptions that fit into the natural pauses of the original content.
- Extended audio description: Descriptions that require pausing the original video and audio to provide extra time for essential visual details.
Chapter Overview
This chapter explains when audio description is required, how integrated description differs from post-production audio description, and how to plan, write, and add descriptions for instructional videos.
Integrated Description
Integrated description builds important visual details into the original narration. This approach often creates a smoother viewing experience because the description is planned during production rather than added later.
Types of Audio Description
- Standard Audio Description
- Fits narration into natural pauses in the original audio.
- Does not interrupt dialogue.
- Works best when there is enough space between spoken lines.
- Extended Audio Description
- Pauses the video to allow for longer, more detailed descriptions.
- Used when there is not enough natural pause time.
Example of Standard Audio Description
The video below demonstrates standard audio description. Notice how the narrator describes visual actions during pauses in dialogue without interrupting the original audio.
Video Example
Transcript
When to Include Audio Descriptions
Step 1: Determine Whether the Video Has Essential Visual Content
Consider this question: If a viewer could only listen to the video without seeing it, would they miss key details?
- If no, audio description may not be needed.
- If yes, continue to the next step.
Step 2: Check Whether the Visual Information Is Already Explained
Audio description may not be necessary if the spoken content already describes what appears on screen.
- The presenter reads what is on the slides.
- Data, charts, or statistics are explained verbally.
- On-screen text is spoken aloud.
- Demonstrations are clearly described during the video.
Step 3: Identify Missing Descriptions
If any of the following elements appear without being explained out loud, additional audio description or narration may be needed:
- Text on screen that is not read aloud
- Charts or graphs shown without explanation
- Demonstrations that lack verbal guidance
- Important actions happening with no narration
- Text overlays showing key details such as dates or instructions
- Visual graphics that carry meaning without explanation
Prioritizing Videos for Audio Description
When planning remediation work, prioritize videos based on audience need, usage, and publication timeline.
- Audience demographics: Videos designed for audiences that may include individuals who are blind or have low vision should be addressed first.
- Traffic: Videos that receive high traffic or are among your most-used materials should also be prioritized.
- Publication date: New videos should include audio description as part of the production process whenever possible.
Does This Video Need Descriptions?
Videos can be prioritized based on the level of need for audio description.
- High: The video cannot be fully understood through audio alone.
- Medium: The main ideas are understandable, but some essential details are lost.
- Low: Some information is lost, but it is non-essential.
Writing Audio Descriptions
- Describe relevant visual details.
- Use concise, plain language.
- Focus on meaning, not decoration.
Instructor Tip
When possible, script audio description during video planning. Integrated description is often easier to maintain than adding separate narration later.
Video Production and Audio Description Tools
Free Tools
The following tools are free to get started.
AI Text-to-Speech Tools
Each of the following tools can help you create voice recordings:
- Voice Generator – Streamlit (created by ASU; does not require an account or sign-in)
- ElevenLabs (free account includes up to 10K credits per month)
- Luvvoice (free account allows up to 10,000 characters per month)
Tools for Writing Descriptions of Visual Content
- ASU Image Accessibility – generates a description of an uploaded image
- ChatGPT Alt Text Generator – a ChatGPT tool created by CITT, based on the ASU tool
- Alt Text – Gemini Gem version created by Eric Curts
Able Player
Able Player is a free, open-source media player developed in part by the AccessComputing project at the University of Washington, with financial support from the National Science Foundation. Built with accessibility in mind, it supports multiple timed-text purposes documented in the HTML5 specification.
Paid or Subscriber-Based Tools
Swank
Starting in March 2026, Swank Cloud Streaming will begin updating its platform to include audio description tracks, with full support for WCAG 2.1 AA compliance expected by April 2027. These features will be available on supported titles, with an initial rollout that includes a limited selection of content.
Films On Demand
Films On Demand includes audio description for videos in its platform. Audio descriptive titles are identified with an AD icon on the search results page. You can also view AD titles on the Advanced Search page by selecting the option to show only videos with audio description before viewing results.
Commercial Description Services
The American Council of the Blind has compiled a comprehensive list of commercial services for producing audio description. The following subset of organizations may be a good match for higher education institutions:
- 3PlayMedia (output uses synthesized speech)
- Automatic Sync (output uses synthesized speech)
- Audio Eyes
- Center for Inclusive Design and Innovation (CIDI, at Georgia Tech)
- WGBH Media Access Group
Suggested Video Editing and Audio Recording Software
Video
Audio
Video Editing Labs
Several campuses may have computer labs dedicated to video and audio editing. These labs may include Camtasia and professional microphones that can support audio description projects.
Key Takeaways
- Audio description is required when essential visual information is not included in narration.
- Integrated descriptions reduce the need for post-production fixes.
- Not all videos need description; evaluate based on missing visual meaning.
- Prioritize high-impact and high-need content.
Chapter Summary
Audio description is a critical component of accessible video design under WCAG 2.1 Level AA. It ensures that learners who cannot see visual content still receive essential information. This chapter explained when audio description is required and how to evaluate whether key visual details are already communicated through narration or need to be added.
Review Questions
- When is audio description required under WCAG 2.1 Level AA?
- What is the difference between integrated and post-production description?
- How can you determine whether a video needs audio description?
- What types of visual elements typically require additional narration?
Practice Activity
Select a short instructional video you currently use or plan to create. Then complete the following steps:
- Watch or listen to the video once without looking at the screen.
- Identify any visual information that is essential for understanding but is not spoken aloud.
- Decide whether the video needs no audio description, standard audio description, or extended audio description.
- Write 2 to 4 sample audio description lines for one important segment of the video.
- Revise your description so it is concise, accurate, and focused only on meaningful visual details.
Further Reading and Resources
Licenses and Attribution
CC Licensed Content, Original
This educational material includes AI-generated content from ChatGPT by OpenAI. The original content created by Josh Hill, Neida Abraham, and Emiliana Olavarrieta from Hillsborough College is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
All images in this textbook generated with DALL·E are licensed under the terms provided by OpenAI, allowing their use, modification, and distribution with appropriate attribution.
Other Licensed Content
Show Me Your Art (Integrated Description Example)
AMI: Accessible Media Inc.
License: Standard YouTube License.
Frozen Trailer with Audio Description
IMSTVUK
License: Standard YouTube License.