What is Digital Accessibility in Multimedia?

Producing videos and hosting live streaming events with digital accessibility best practices in mind

Originally published on Envato Tuts+

December 3, 2019

Accessibility Talks (Backstory)

In an effort to spread awareness about digital accessibility and to highlight some awesome speakers, I founded a virtual meet-up group called Accessibility Talks about three years ago. In the beginning, the group met informally and the sessions were not recorded, but I soon realized we needed to record the talks and post them on a media platform so we could share the knowledge with others.

I chose YouTube for its flexibility and its accessibility options, including auto-captioning. Little did I know at the time how much complexity comes with producing videos and hosting live streaming events with digital accessibility best practices in mind!

Below are some tips, tricks, and information about alternative media types (such as captions and transcripts) to help make your next video or live stream event more successful!

Accessibility in the Modern Digital Agency - Sara Tabor.


Alternative Multimedia Types and Requirements

When it comes to producing accessible digital content of any kind, the best resource is the Web Content Accessibility Guidelines (WCAG) from the W3C. The goal of the guidelines is to provide a single shared standard for web content accessibility that meets the needs of individuals, organizations, and governments internationally, and to make digital content more accessible to people with disabilities.

For this article, we will focus on multimedia best practices, since the videos I produced also included audio. For more information on the WCAG requirements for other types of media (ex. audio-only files like a podcast), please consult the audio-only and video-only sections of the guidelines.

For accessible multimedia on the web, we need to be concerned with four alternative media types:

  1. captions
  2. transcripts
  3. audio descriptions
  4. sign language interpretation

Below are some definitions of each, some information on which disabilities they target, plus the requirements for each level of WCAG conformance (A, AA, or AAA).


1. What Are Captions?

Captions are text synchronized with the multimedia for people who cannot hear the spoken words.

People often confuse “captions” and “subtitles,” but they are not the same thing. Both are text synchronized with the words in a video, and both often appear in the same location in the media (usually the bottom of the screen). However, captions can be thought of as a transcription of dialogue for people who are Deaf or hard of hearing, while subtitles are essentially helper text for people who can hear the audio but may not understand what was said (ex. garbled speech or words spoken in a language you don’t understand).

Note: captions and subtitles are defined differently in some parts of the world, so double-check the terminology in your location.

Captions come in two forms — open or closed. Closed captioning (CC) can be turned off by the viewer with the click of a button, while open captions are essentially burned into the video and cannot be turned off. Depending on the situation or how the multimedia is going to be consumed, one method might be preferable to the other.
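Closed captions are typically delivered as a separate timed-text file that the player shows or hides on request. WebVTT is one common format for such files. As a rough illustration of what "text synchronized with the multimedia" looks like in practice, here is a minimal Python sketch that builds the text of a WebVTT file. The `Cue` class, the function names, and the sample cues are hypothetical and only serve the example.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str     # what is said (or heard) during that span


def format_timestamp(seconds: float) -> str:
    """Format a number of seconds as a WebVTT timestamp (hh:mm:ss.ttt)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def to_webvtt(cues: list[Cue]) -> str:
    """Build the text of a WebVTT caption file from a list of cues."""
    blocks = ["WEBVTT"]  # every WebVTT file starts with this header
    for cue in cues:
        timing = f"{format_timestamp(cue.start)} --> {format_timestamp(cue.end)}"
        blocks.append(f"{timing}\n{cue.text}")
    # A blank line separates the header and each cue.
    return "\n\n".join(blocks) + "\n"


# Hypothetical sample cues, not taken from a real talk.
sample = [
    Cue(0.0, 2.5, "Welcome to Accessibility Talks."),
    Cue(2.5, 6.0, "[applause]"),
]
print(to_webvtt(sample))
```

Open captions, by contrast, have no separate file at playback time because the text is rendered into the video frames themselves.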

Auto-Craptions

Another aspect of captions is auto-captioning. This is when a media platform like YouTube uses speech recognition software to work out the words being spoken and adds them to the multimedia as captions. As exciting as this technology is, it is not yet 100% reliable. In fact, many people dub these “auto-craptions” because they are so awful that they are almost funny.

Your best bet at this point in time is to use the auto-captioning feature as a “first step” in your captioning process. For example, on the Accessibility Talks videos, I upload the media, let YouTube auto-caption it, then I go back and edit the file with the proper captions.

Captions are beneficial to a lot of people, including people who are Deaf or hard of hearing, people who are not fluent in the language used in the audio content, and people with cognitive disabilities who may need to see the words, not just hear them.

deafpeopleforberniesanders.wordpress.com

WCAG Requirements for Captions

Pre-recorded Multimedia:

  • A: REQUIRED
  • AA: REQUIRED
  • AAA: REQUIRED

Live Multimedia:

  • A: ENCOURAGED
  • AA: REQUIRED
  • AAA: REQUIRED

2. What Are Transcripts?

Transcripts are the full text of the spoken words and important visual information in the media file, to read as an alternative to watching or listening to the media file.

Transcripts are text-based documents that serve as an alternative to information presented in an audible and visual format. They are similar to captions, but they take the experience to the next level by also describing important sound effects and significant visual information (ex. describing eerie sounds in the background).

Transcripts help people who are hard of hearing, Deaf, or Deafblind. Transcripts are also great for people with cognitive disabilities or people who want to browse through audio and video information at their own speed. As an added bonus, Search Engine Optimization (SEO) gets a boost when your multimedia includes transcripts, since search bots cannot crawl your multimedia but can crawl your text transcripts.

Screenshot of a11yrules podcast transcript

WCAG Requirements for Transcripts

Pre-recorded Multimedia:

  • A: ENCOURAGED
  • AA: ENCOURAGED
  • AAA: REQUIRED

Live Multimedia:

  • n/a

3. What Are Audio Descriptions and Extended Audio Descriptions?

Audio descriptions are a version of the multimedia file that includes a narrator explaining important visual information (such as unspoken actions and events) for the benefit of people who cannot see what’s happening on the screen.

Audio descriptions, unlike captions and transcripts, are a recording of a person explaining the visual aspects of the video that aren’t in the video’s original dialog or narration (ex. describing facial expressions or scenery). Audio descriptions should convey, in spoken words, the visual information that dialogue and other sounds cannot.

Sometimes audio descriptions need to be very detailed because there is a lot of visual information, but the video does not have enough pauses to fit the narration in. Enter extended audio descriptions. In an extended audio description, the video pauses to give the narrator enough time to convey the information on screen.

Audio descriptions and extended audio descriptions primarily help people who are blind or have low vision, but they can also help people with some cognitive disabilities.

Audio description options on Netflix

WCAG Requirements for Audio Descriptions

Pre-recorded Multimedia:

  • A: ENCOURAGED
  • AA: REQUIRED
  • AAA: REQUIRED

Live Multimedia:

  • A: OPTIONAL
  • AA: OPTIONAL
  • AAA: OPTIONAL

WCAG Requirements for Extended Audio Descriptions

Pre-recorded Multimedia:

  • A: OPTIONAL
  • AA: OPTIONAL
  • AAA: REQUIRED

Live Multimedia:

  • n/a

4. What Is Sign Language Interpretation?

Sign language interpretation for multimedia is when you add a video of an interpreter, usually shown in a box to the side of the video, who conveys the audio portion through sign language. If you are live streaming your event, the sign language interpreter typically stands to one side of the speaker in the same room.

Sign language interpretation is important for multimedia since, for many people who are Deaf, sign language is their first and most fluent language. Sign language interpretation is often more expressive than written text, so it can provide a much richer experience than captions or transcripts alone.

However, sign language interpretation can be cost-prohibitive to many organizations. And even if you do add sign language interpretation to your multimedia, you need to understand that it has regional limitations as there are over 300 different sign languages throughout the world. So adding one sign language interpretation to your multimedia would not be enough if you are targeting a global audience.

RTBF Les Niouzz via tv.signlangtv.org

WCAG Requirements for Sign Language Interpretation

Pre-recorded Multimedia:

  • A: OPTIONAL
  • AA: OPTIONAL
  • AAA: REQUIRED

Live Multimedia:

  • A: OPTIONAL
  • AA: OPTIONAL
  • AAA: OPTIONAL
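If you want to keep these requirement lists handy in a script or checklist tool, they fit naturally into a small lookup table. The Python sketch below is one possible way to do that. The names `REQUIREMENTS` and `required_for` are made up for this example, and the values mirror the lists in this article, with the pre-recorded audio description row taken from WCAG success criteria 1.2.3 (Level A) and 1.2.5 (Level AA). Always check the current WCAG text before relying on it.

```python
# Requirement levels per media alternative, keyed by (media type, delivery).
# Rows marked "n/a" in the article are simply left out.
REQUIREMENTS = {
    ("captions", "pre-recorded"): {"A": "required", "AA": "required", "AAA": "required"},
    ("captions", "live"): {"A": "encouraged", "AA": "required", "AAA": "required"},
    ("transcripts", "pre-recorded"): {"A": "encouraged", "AA": "encouraged", "AAA": "required"},
    # Level A accepts either an audio description or a full text alternative
    # (WCAG 1.2.3), so it is marked "encouraged" here; AA requires it (WCAG 1.2.5).
    ("audio descriptions", "pre-recorded"): {"A": "encouraged", "AA": "required", "AAA": "required"},
    ("audio descriptions", "live"): {"A": "optional", "AA": "optional", "AAA": "optional"},
    ("extended audio descriptions", "pre-recorded"): {"A": "optional", "AA": "optional", "AAA": "required"},
    ("sign language interpretation", "pre-recorded"): {"A": "optional", "AA": "optional", "AAA": "required"},
    ("sign language interpretation", "live"): {"A": "optional", "AA": "optional", "AAA": "optional"},
}


def required_for(level: str, delivery: str) -> list[str]:
    """Return the media alternatives marked as required at a conformance level."""
    return sorted(
        media
        for (media, kind), levels in REQUIREMENTS.items()
        if kind == delivery and levels[level] == "required"
    )


print(required_for("AA", "pre-recorded"))
print(required_for("AA", "live"))
```

Calling `required_for` with your target level and delivery type gives you a quick to-do list for a given video or live stream.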

Steps for Making Your Multimedia Accessible

As you can tell from this list, there are a lot of factors to think about when working with accessibility and multimedia. I encourage you to work your way backwards, from the most recent media to the oldest. 

  1. Focus first on getting your captions in place and accurate. They can be time-consuming to add, but they are also a fairly straightforward task. You can also pay for a captioning service if you have the money but not the time to do the work yourself.
  2. Next, work on your transcripts or audio descriptions. Often you can get a good baseline script from your captions. 
  3. If you need to add sign language interpretation, leave that to the pros. There are often local companies and organizations who can point you in the right direction for this task.
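To illustrate step 2, here is a minimal Python sketch of pulling a baseline transcript out of a caption file. It assumes your captions are in WebVTT format (other formats, such as SRT, would need a slightly different pattern), and the function name `captions_to_transcript` is made up for this example.

```python
import re

# Matches a WebVTT timing line such as "00:00:02.500 --> 00:00:06.000".
# The hours part is optional in WebVTT, so "00:02.500" is accepted too.
TIMING = re.compile(
    r"^(?:\d+:)?\d{2}:\d{2}\.\d{3}\s+-->\s+(?:\d+:)?\d{2}:\d{2}\.\d{3}"
)


def captions_to_transcript(vtt_text: str) -> str:
    """Collect the caption text from a WebVTT file into one paragraph."""
    pieces = []
    # Blocks (the header, notes, and cues) are separated by blank lines.
    for block in re.split(r"\n\s*\n", vtt_text.strip()):
        block_lines = block.splitlines()
        for index, line in enumerate(block_lines):
            if TIMING.match(line):
                # Everything after the timing line is the caption text.
                pieces.extend(
                    text.strip() for text in block_lines[index + 1:] if text.strip()
                )
                break
        # Blocks without a timing line (header, notes) are skipped.
    return " ".join(pieces)


# Hypothetical caption file contents, not taken from a real talk.
sample_vtt = """WEBVTT

1
00:00:00.000 --> 00:00:02.500
Welcome to Accessibility Talks.

2
00:00:02.500 --> 00:00:06.000
[applause]
Thank you all
for coming.
"""
print(captions_to_transcript(sample_vtt))
```

The result is only a starting point. A full transcript still needs speaker names, paragraph breaks, and descriptions of important visual information added by hand.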

Conclusion

Depending on your level of WCAG conformance and how much effort you’ve already put into your multimedia, you may have to rethink your workflow a bit. But don’t be discouraged! As with all things accessibility, if you can bake it into your process, you will save time, money, and overall effort.