Content uploaded by Kristen Shinohara
Author content
All content in this area was uploaded by Kristen Shinohara on Sep 10, 2021
Content may be subject to copyright.
Designing a Podcast Platform for Deaf and Hard of Hearing
Users
Becca Dingman
bad6955@g.rit.edu
School of Information
Rochester Institute of Technology
Rochester, NY, USA
Garreth W. Tigwell
garreth.w.tigwell@rit.edu
School of Information
Rochester Institute of Technology
Rochester, NY, USA
Kristen Shinohara
kristen.shinohara@rit.edu
School of Information
Rochester Institute of Technology
Rochester, NY, USA
ABSTRACT
Listening to podcasts is a popular way for people to spend their
time. However, little focus has been given to how accessible pod-
cast platforms are for Deaf and Hard-of-Hearing (DHH) people.
We present a DHH-centered accessible podcast platform prototype
developed with user-centered design. Our proposed design was
constructed through semi-structured interviews (n=7) and proto-
type design feedback sessions (n=8) with DHH users. We encourage
podcast platform designers to adopt our design recommendations
to make podcasts more inclusive for DHH people and recommend
how podcast hosts can make their shows more accessible.
CCS CONCEPTS
•Human-centered computing →Accessibility.
KEYWORDS
Deaf and Hard-of-Hearing, Podcasts, Design
ACM Reference Format:
Becca Dingman, Garreth W. Tigwell, and Kristen Shinohara. 2021. Design-
ing a Podcast Platform for Deaf and Hard of Hearing Users. In The 23rd
International ACM SIGACCESS Conference on Computers and Accessibility
(ASSETS ’21), October 18–22, 2021, Virtual Event, USA. ACM, New York, NY,
USA, 4 pages. https://doi.org/10.1145/3441852.3476523
1 INTRODUCTION & RELATED WORK
Engaging with podcasts has become an extremely popular way for
people to spend their time. However, little work has investigated
the attitudes and concerns that Deaf and Hard-of-Hearing (DHH)
individuals have toward podcasts.
Podcasts typically take an audio-based format [
18
], in which a
host (or several) discuss a particular topic from a range of dier-
ent subject matter [
15
]. Podcast engagement is on the rise [
19
],
with listeners in the US averaging about 8 podcasts per week [
19
],
demonstrating distinct engagement habits compared to music con-
sumption [
15
]. Podcasts not only provide a source of entertainment,
but they can be educational and support learning/teaching [
6
,
7
,
11
].
Despite the benets of podcasts, written transcripts of the con-
tent are often not provided because they cost money and it takes
time to create them [
5
,
17
], resulting in the worldwide exclusion
Permission to make digital or hard copies of part or all of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for prot or commercial advantage and that copies bear this notice and the full citation
on the rst page. Copyrights for third-party components of this work must be honored.
For all other uses, contact the owner/author(s).
ASSETS ’21, October 18–22, 2021, Virtual Event, USA
©2021 Copyright held by the owner/author(s).
ACM ISBN 978-1-4503-8306-6/21/10.
https://doi.org/10.1145/3441852.3476523
of 450 million+ people who are Deaf and Hard-of-Hearing [
20
].
Transcripts can improve podcast accessibility [
5
], support read-
ing comprehension in students [
6
,
8
], and DHH individuals nd
transcripts to be the best format to access long and dense audio
content [
10
], which is typical of podcast audio content. However,
delivering those transcripts must be achieved in an appropriate
way. Podcast platforms such as Spotify and Apple Podcast do not
provide followers with a way to view the transcripts directly within
the platform, which means DHH users have to go to an external
source. This approach draws parallels with the problematic stance
of “separate-but-equal” [
12
,
13
]; instead, we should build an acces-
sible experience within the platform.
There are two requirements to address the current issues of
inaccessible podcast platforms: 1) we need an adaptable interface
for DHH users to choose their preferred transcript style [
10
], and
2) a fast and low cost means of transcribing the audio. Our work
is focused on the rst problem because although ASR technology
still has translation errors, eorts to improve ASR are ongoing [
9
],
yet, there has been no work to achieve the rst requirement.
We present a DHH-centered accessible podcast platform, which
was design through semi-structured interviews (7 DHH users)
and prototype design feedback sessions (8 DHH users). Our user-
centered design approach allowed us to understand how to improve
the current podcast platform design so that interaction with—and
presentation of—transcripts oers an enhanced positive experience
for DHH users.
2 PROTOTYPE DEVELOPMENT
Our work was guided by three research questions:
(RQ1)
What
are the experiences of DHH individuals when following a podcast
episode?
(RQ2)
What kind of features are necessary for a podcast
platform to be accessible by DHH users?
(RQ3)
How do DHH users
prefer to receive podcast transcripts? See [
3
] for a reection on using
interview and think-aloud methods with DHH participants.
2.1 Semi-Structured Interviews with DHH
Participants
We conducted semi-structured interviews to understand DHH users’
experiences with, and desires for, podcast features.
Procedure
: The remote interviews ranged from 20 to 45 minutes
and were led by the rst author who identies as a DHH researcher.
The interviews were in the preferred communication medium for
each of the DHH participants (e.g., American Sign Language (ASL),
captioning, simultaneous signing and spoken English (SimCom)).
All the video recordings were transcribed for analysis systemati-
cally [2].
ASSETS ’21, October 18–22, 2021, Virtual Event, USA Dingman et al.
Participants
: We interviewed seven participants who identied
as DHH, ages ranged between 21-27 years-old. Four participants
identied as male, one as female, and two as non-binary. All except
for one participant had some experience with listening to a podcast.
Results
: In general, we found that DHH users experimented and
wanted to listen to podcasts for various reasons. Two participants
mentioned they like listening to podcasts as background noise while
they did household chores or completed homework assignments.
Meanwhile, one participant never played a podcast before because
they knew they would struggle to hear the spoken content. The
other participants followed podcasts for a specic purpose.
Our rst main theme was: multiple factors need addressing to
make podcasts accessible for DHH users. Specically, participants
discussed challenges with podcast audio quality and voices. Partic-
ipants explained they would ask their hearing siblings, who also
listened to podcasts, for recommendations of podcasts with clear
voices. Whenever they missed information or struggled to under-
stand, participants would go back into the podcast and replay the
section to hopefully understand it the second or third time. Partic-
ipants reported this experience as frustrating. Some participants
shared strategies they used to enhance their listening experience,
such as using an app to automatically transcribe the audio of the
podcast. Participants also expected the timing should be seamless
between the audio and text appearing for captions. There was less
tolerance for prerecorded material to have a lag in the captions ver-
sus if the transcript was being generated in real-time during a live
session. While some participants were still avid podcast followers,
others no longer followed podcasts due to feeling exhausted, when
workarounds required too much attention or mental energy.
Our second main theme was: multiple podcast platform features
desired by DHH users. Specically, participants desired transcripts
or captioning provided on the podcast platforms. All participants
mentioned the desire for a summary of the episode to provide
some context of what will be discussed. One participant suggested
podcast hosts provide an article explaining the story of what will
be discussed, then the hosts share their opinions on the show. This
format would allow podcast users to have some prior understanding
of the story before the hosts’ commentary. Finally, participants
wanted the option to modify the transcript font, text size and color,
and color of the background. Participants suggested allowing users
to select caption or transcript view. Two participants did not want
to see the text ahead of where they were in the podcast, however
they liked the idea of being able to scroll back and see previous text.
Two other participants wanted to see the future text.
2.2 Prototyping cycles
Our interviews informed the initial low-delity prototype design
created in Figma. We ran an initial feedback session, rened the de-
sign to a high-delity prototype, and ran a second feedback session.
Acknowledging that prototyping can focus on dierent dimensions
of delity [
16
], we focused on understanding functionality and
interaction experiences rather than aesthetic and detailed visual re-
nement. Our low-delity prototype had fewer functional features
compared to the later rened high-delity prototype.
Prototype Evaluation Procedure
: The procedure was the same
for the low and high-delity evaluations. Participants spent 20-30
minutes interacting with the prototypes to complete seven scenar-
ios. The tasks were (1) search for a podcast, (2) read the podcast
summary, (3) read the episode summary, (4) turn on the captions, (5)
nd the keyword additional information, (6) review missed parts of
the podcast and (7) change the captioning style. We also asked ques-
tions regarding their preference in the various captioning styles.
Participants were entered in a $25 rae as compensation.
Prototype Evaluation Participants
: For the low-delity ses-
sions, we recruited four participants from the interviews who identi-
ed as DHH, ages ranged between 21-27 years-old. Two participants
identied as male, one as female, and one as non-binary. For the
high-delity sessions, we recruited a new set of four participants
who did not participate in the interviews or low-delity prototype
feedback session. Participants in the high-delity feedback session
identied as DHH, with ages ranging from 25-27 years-old. Three
participants identied as female and one as male.
Prototype Evaluation Results
: Figma is a software that allows
designers to create mock-ups or prototypes with some interaction
such as tapping, timed events/triggers, etc. These interactions are
intended to mimic an experience such as using a mobile application,
however some of the interactions still may not feel like the true
experience. All participants in both low and high-delity sessions
successfully completed all tasks with the exception of reviewing the
missed parts of the episode, but this could be due to the limitation of
Figma such as the software’s inability to allow for true scrolling on
the design screen. Despite this, we found that overall participants
in the nal session were satised with the features of the prototype
(P9: I would prefer hover because I would listen in real-time, and it
would be going at the same time as the speaker is speaking), even
mentioning they wanted to use this platform now to access podcasts
(P10: I would use this app like a few times a week).
3 A DHH-CENTERED PODCAST PLATFORM
Overall, our prototype allowed participants to rst log in to an
account, view a list of un-listened episodes from shows they follow,
the ability to search for podcasts, and read descriptions of the pod-
cast and episodes. Due to space limitations, we share screenshots
in Figure 1 that highlight only the key DHH features. The DHH
user can play an episode and select a CC (Closed Caption) button
(see top corner of Fig 1a), which then reveals a transcript with
interactive
bold & underlined
keywords (lower half of Fig 1a). A
description of the keyword appears in a popup (Fig 1b). The DHH
user can scroll through the transcript if they want to see previ-
ous text (Fig 1c). After selecting the gear icon, a settings menu
appears, which provides the DHH user with various content adjust-
ments including transcript style, text size, style, and color, and also
background color to improve contrast (Fig 1d). We designed the
transcript styles based on interviewee suggestions. The DHH user
has set the transcript text color to white with a black background
and a closed caption style format which only displays three lines
of text (Fig 1e). In the nal screenshot, the DHH user has set the
transcript with low contrasting text in a hover style that can be
repositioned on the screen to wherever is comfortable (Fig 1f).
Designing a Podcast Platform for Deaf and Hard of Hearing Users ASSETS ’21, October 18–22, 2021, Virtual Event, USA
Figure 1: Left-Right,Top-Bottom: A series of screenshots demonstrating the prototype in use.
4 DESIGN RECOMMENDATIONS AND
REFLECTIONS
We recommend:
(1)
accessible podcast platforms include transcripts
in various styles such as transcription, closed captioning and cap-
tioning overlaying the application so users can set their preferred
style;
(2)
adding customization options for the transcript such as
font, font size, color, and background—similar work demonstrates
adapting live television captioning for user preferences can enhance
the user experience [
1
,
4
]; and
(3)
providing DHH users with access
to additional information about the podcast, episode and keywords
in the transcript.
4.1 Recommendations for Podcast Hosts to
Consider
Based on the ndings from this work, it is clear that beyond app
design, audio quality of the podcast can impact user experience.
We recommend hosts:
(1)
begin each episode with a summary,
and then provide additional context of what may be referenced
during the podcast;
(2)
avoid additional sounds when each person
talks (e.g., avoid talking over one another when there are multiple
speakers, and refrain from laughing over another speaker who may
be talking); and
(3)
utilize post-hoc audio adjustments to amplify
speakers, which can be facilitated with a multi-mic setup.
5 CONCLUSION & FUTURE WORK
We conducted semi-structured interviews and prototype design
feedback sessions with DHH participant to develop an accessible
podcast platform. We identied current challenges DHH users face
with podcasts and through the development of a prototype con-
tribute design recommendations. Interesting future work could take
the concept of incorporating visual aids within transcripts [
14
], and
investigate how this could enhance the experience for DHH users.
Furthermore, since podcasts are typically a form of entertainment
people engage with while also completing other tasks (e.g., running),
there are opportunities to investigate how we might maintain multi-
tasking while reading the transcript. Finally, there is an opportunity
for future work to investigate what concerns podcast platforms and
podcast hosts may have with our current recommendations and
whether there needs to be further renement.
REFERENCES
[1]
Akhter Al Amin and Matt Huenerfauth. 2021. Perspectives of Deaf and Hard-
of-Hearing Viewers on Live-TV Caption Quality. In iConference. iSchools. http:
//hdl.handle.net/2142/109692
[2]
Yanto Chandra and Liang Shang. 2019. Inductive Coding. In Qualitative Research
Using R: A Systematic Approach. Springer Singapore, Singapore, 91–106. https:
//doi.org/10.1007/978-981- 13-3170- 1_8
[3]
Becca Dingman, Garreth W. Tigwell, and Kristen Shinohara. 2021. Interview and
Think Aloud Accessibility for Deaf and Hard of Hearing Participants in Design
Research. In The 23rd International ACM SIGACCESS Conference on Computers
and Accessibility (Virtual Event, USA) (ASSETS ’21). Association for Computing
Machinery, New York, NY, USA. https://doi.org/10.1145/3441852.3476526
[4]
Benjamin M. Gorman, Michael Crabb, and Michael Armstrong. 2021. Adaptive
Subtitles: Preferences and Trade-Os in Real-TimeMedia Adaption. In Proceedings
of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama,
Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA,
Article 733, 11 pages. https://doi.org/10.1145/3411764.3445509
[5]
Nicole Hennig. 2017. Podcast literacy: Educational, accessible, and diverse pod-
casts for library users. Library Technology Reports 53, 2 (2017), 1–42.
[6]
Yasamin Heshmat, Lillian Yang, and Carman Neustaedter. 2018. Quality ’Alone’
Time through Conversations and Storytelling: Podcast Listening Behaviors and
Routines. In Proceedings of the 44th Graphics Interface Conference (Toronto,
Canada) (GI ’18). Canadian Human-Computer Communications Society, Wa-
terloo, CAN, 76–83. https://doi.org/10.20380/GI2018.11
[7]
Khe Foon Hew. 2009. Use of audio podcast in K-12 and higher education: A
review of research topics and methodologies. Educational Technology Research
and Development 57, 3 (2009), 333–357.
[8]
James Janossy. 2007. Student reaction to podcast learning material: Preliminary
results. In Engaging the Learner: 12th Annual Instructional Technology Conference.
ERIC, 98–107.
[9]
Sushant Kae and Matt Huenerfauth. 2016. Eect of speech recognition errors on
text understandability for people who are deaf or hard of hearing. In Proceedings
of the 7th Workshop on Speech and Language Processing for Assistive Technologies
(SLPAT), Interspeech’16 (San Francisco, CA). 20–25.
[10]
Raja S. Kushalnagar, Walter S. Lasecki, and Jerey P. Bigham. 2013. Captions
versus Transcripts for Online Video Content. In Proceedings of the 10th Interna-
tional Cross-Disciplinary Conference on Web Accessibility (Rio de Janeiro, Brazil)
(W4A ’13). Association for Computing Machinery, New York, NY, USA, Article
32, 4 pages. https://doi.org/10.1145/2461121.2461142
[11]
Sawsen Lakhal, Hager Khechine, and Daniel Pascot. 2007. Evaluation of the
eectiveness of podcasting in teaching and learning. In E-Learn: World Confer-
ence on E-Learning in Corporate, Government, Healthcare, and Higher Education.
Association for the Advancement of Computing in Education (AACE),6181–6188.
[12]
Jonathan Lazar, Daniel F Goldstein, and Anne Taylor. 2015. Ensuring digital
accessibility through process and policy. Morgan kaufmann.
ASSETS ’21, October 18–22, 2021, Virtual Event, USA Dingman et al.
[13]
J Lazar and B Wentz. 2011. Separate but unequal: Web interfaces for people with
disabilities. User Experience 10, 3 (2011), 2011–3.
[14]
Jihyeon Janel Lee, Mitchell Gordon, and Maneesh Agrawala. 2017. Automatically
Visualizing Audio Travel Podcasts. In Adjunct Publication of the 30th Annual ACM
Symposium on User Interface Software and Technology (Québec City, QC, Canada)
(UIST ’17). Association for Computing Machinery, New York, NY, USA, 165–167.
https://doi.org/10.1145/3131785.3131818
[15]
Ang Li, Alice Wang, Zahra Nazari, Praveen Chandar, and Benjamin Carterette.
2020. Do Podcasts and Music Compete with One Another? Understanding Users’
Audio Streaming Habits. In Proceedings of The Web Conference 2020 (Taipei,
Taiwan) (WWW ’20). Association for Computing Machinery, New York, NY, USA,
1920–1931. https://doi.org/10.1145/3366423.3380260
[16]
Michael McCurdy, Christopher Connors, Guy Pyrzak, Bob Kanefsky, and Alonso
Vera. 2006. Breaking the Fidelity Barrier: An Examination of Our Current Char-
acterization of Prototypes and an Example of a Mixed-Fidelity Success. Asso-
ciation for Computing Machinery, New York, NY, USA, 1233–1242. https:
//doi.org/10.1145/1124772.1124959
[17]
Take Note. [n.d.]. Human Transcription Services vs. Automatic Speech Recog-
nition (ASR). https://takenote.co/human-transcription-services- vs-automatic-
speech-recognition- asr/. Accessed: 2021-4-4.
[18]
Deborah Potter. 2006. iPod, you pod, we all pod: eager to lure news consumers,
media outlets are experimenting with news-on-demand podcasts. They’re fun,
fresh–and often unpolished. American Journalism Review 28, 1 (2006), 64–65.
[19]
Nicholas Quah. 2021. Yes, Podcast Listenership Is Still on the Rise. https://www.
vulture.com/2021/03/podcast-listenership- download-data- on-the-rise.html. Ac-
cessed: 2021-4-15.
[20]
WHO. 2021. Deafness and hearing loss. https://www.who.int/news-room/fact-
sheets/detail/deafness-and- hearing-loss. Accessed: 2021-4-4.