Text (digitised) and videotape: video transcripts help everyone, not just people with access needs

By providing separate transcripts alongside video content on the departmental intranet, the MOD Internal Communications team has moved to ensure that the widest possible audience can access and understand critical information. While these features were originally implemented to meet the rigorous Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, the data suggests that "accessibility" has effectively become a proxy for "enhanced usability" for all users.

The Shift Toward Multi-Modal Content Consumption

The investigation into user behavior began when internal communications specialists noticed a recurring sentiment on online forums: many employees preferred scanning a transcript to watching a full-length video. To quantify this, the MOD conducted a survey of 87 personnel, asking them to identify their preferences for consuming video-based information. The results were telling. Only 21% of respondents reported a preference for watching videos exclusively. In contrast, 55% stated they preferred reading or scanning the transcript, while 24% utilized both mediums simultaneously.

This data underscores a broader trend in digital media consumption where text remains the most efficient medium for information retrieval. For the 55% who preferred reading, the primary motivations included the ability to scan for specific keywords, the speed of consumption compared to the linear nature of video, and the ease of referencing facts without having to scrub through a timeline.

Technical and Environmental Factors Driving Transcript Use

The research identified several non-accessibility reasons why employees opt for transcripts. One of the most prevalent factors is the physical environment of the modern office. Despite the provision of headsets for online meetings and video viewing, many employees expressed a distaste for wearing them for extended periods. "I don’t like wearing a headset" was a common refrain among the free-text responses, suggesting that text-based content allows for a less intrusive integration of information into the workday.

Technical limitations also play a critical role. In environments where network bandwidth may be constrained—a common issue in large-scale government infrastructures or remote deployments—loading high-definition video can be a bottleneck. Transcripts, being lightweight text files, load instantaneously and consume negligible data, ensuring that information remains accessible even when the network underperforms.

Furthermore, the topic of the video often dictates the medium of choice. Respondents indicated that for complex technical briefings or policy updates, having a written record is indispensable. The transcript serves as a "standalone piece of content" that can be easily copied, pasted into notes, or archived for future reference, tasks that are significantly more cumbersome when dealing with raw video files.

Compliance with WCAG 2.2 AA Standards

The MOD’s commitment to providing captions and transcripts is rooted in the legal and ethical necessity of digital inclusion. The UK’s Public Sector Bodies (Websites and Mobile Applications) (No. 2) Accessibility Regulations 2018 require public sector websites to be perceivable, operable, understandable, and robust. The transition to WCAG 2.2 AA standards represents the latest evolution in these requirements.

Under these guidelines, synchronised captions (Success Criterion 1.2.2) are required for all pre-recorded audio content in synchronized media. However, the MOD has gone a step further by providing full transcripts (Success Criterion 1.2.1). These tools are vital for:

  • Hearing Impairments: Captions allow users who are deaf or hard of hearing to follow dialogue in real-time.
  • Cognitive Disabilities: Transcripts allow individuals with processing disorders or neurodivergent traits to consume information at their own pace, reducing the "cognitive load" associated with fast-paced visual and auditory stimuli.
  • Visual Impairments: While video is a visual medium, transcripts allow users of screen readers and web narrators to access the information through synthesized speech or Braille displays.
  • Motor Disabilities: Navigating a video player’s interface can be challenging for those with limited fine motor control; a text-based transcript offers a simpler, more accessible navigation path.

The Curb-Cut Effect in Digital Design

The findings of the MOD study are a classic example of the "Curb-Cut Effect." Originally a term from urban planning, it refers to how the indentations in sidewalks—designed for wheelchair users—ended up benefiting parents with strollers, travelers with luggage, and delivery workers. In the digital realm, designing for the most vulnerable users often results in a better experience for the entire population.

Text (digitised) and videotape: video transcripts help everyone, not just people with access needs

For the 24% of respondents who used both the video and the transcript, the dual-mode approach served as a reinforcement tool. Educational psychology suggests that multi-modal learning—combining visual, auditory, and text-based inputs—can significantly improve retention and understanding. By seeing the speaker’s body language and hearing their tone while simultaneously reading the text, users can grasp nuances that might be lost in a single-medium format.

Organizational Reactions and Implications

While the MOD study focused on internal personnel, digital accessibility experts suggest these behaviors are mirrored in the private sector and general public. Industry data from social media platforms has long shown that a vast majority of videos (up to 80% on some platforms) are watched on mute, making captions and transcripts essential for engagement.

Internal communications departments across various government sectors are now viewing transcripts not as an "add-on" for a minority group, but as a core requirement for effective information dissemination. "Accessibility is usability," the report concludes, suggesting that the divide between "standard" users and "users with access needs" is increasingly artificial.

From a management perspective, the efficiency gains of transcripts are quantifiable. If an employee can find a specific piece of information in a 500-word transcript in 30 seconds, whereas finding it in a 10-minute video would take several minutes of searching, the cumulative time saved across a workforce of thousands is substantial.

Methodology and Future Research

The MOD’s research utilized a mixed-methods approach, combining quantitative data from the survey with qualitative insights from free-text options. This allowed researchers to uncover motivations that were not initially predicted, such as the bandwidth issues and the physical discomfort of headsets.

The department has expressed interest in further exploring how AI-driven transcription services might streamline this process. While current WCAG guidelines emphasize the need for accuracy—which often requires human oversight—the integration of automated speech-to-text tools could make it more cost-effective for departments to provide transcripts for every piece of video content, regardless of its length or perceived importance.

To be considered "good," the MOD notes that transcripts must be:

  • Accurate: Reflecting the exact spoken word and identifying speakers.
  • Descriptive: Including relevant non-speech sounds (e.g., [applause] or [door slams]) that are vital to understanding the context.
  • Accessible: Formatted correctly for screen readers with proper headings and spacing.

Conclusion

The Ministry of Defence’s findings serve as a powerful reminder that inclusive design is simply good design. By catering to the needs of colleagues with disabilities, the MOD has inadvertently created a more flexible, efficient, and resilient communication ecosystem for its entire staff. As digital content continues to dominate the workplace, the transcript is no longer just a tool for the few, but a vital resource for the many, proving that when we design for accessibility, we improve the experience for everyone.

The move toward digitised text and comprehensive video transcripts represents a maturation of digital strategy—one where the focus shifts from the medium itself to the ease with which the user can extract the message. As the MOD continues to refine its internal communications, the lessons learned from this research are expected to influence wider government standards, ensuring that "access for all" becomes a functional reality rather than just a compliance checkbox.

Leave a Reply

Your email address will not be published. Required fields are marked *