Beware Using Telemedicine for Voice and Speech Therapy

As a result of the coronavirus pandemic, people across the world have experienced how teleconferencing platforms like Zoom help folks stay connected - playing games with friends, hosting virtual weddings, and even visiting a doctor. But when it comes to telemedicine, not all medical care is easily translated to a remote format.

In a virtual world, voice therapy presents a unique challenge because clinicians must rely on acoustic recordings of voice to evaluate the effectiveness of their treatments. But many teleconferencing platforms distort sounds in their efforts to eliminate background noise.

Boston University graduate researcher Hasini Weerathunge wanted to find out if popular teleconferencing platforms used for telemedicine could capture sounds accurately enough for clinicians to successfully treat and evaluate patients with voice and speech disorders. Weerathunge, a graduate student fellow at BU's Rafik B. Hariri Institute for Computing and Computational Science & Engineering and a Ph.D. candidate in biomedical engineering, does research in the lab of Cara Stepp, a College of Health & Rehabilitation Sciences: Sargent College associate professor of speech, language, and hearing sciences.

"Although the COVID-19 crisis appears to be waning, telepractice popularity is here to stay," Stepp says.

Weerathunge and Stepp teamed up with other BU researchers to put five different HIPAA-compliant teleconferencing platforms to the test: Cisco Webex, Microsoft Teams, Doxy.me, VSee Messenger, and Zoom.

As the pandemic unfolded and lockdowns moved much voice and speech therapy online, "there was no consensus among [voice and speech] clinicians [who were] trying to convert to telepractice therapy, and we wanted to determine the accuracy of the acoustic measures they can get through telepractice," Weerathunge says.

Although voice therapists had sometimes conducted telepractice sessions with patients before the pandemic, evaluations of the effectiveness of treatment were always carried out in person. During that process, a patient goes into the clinic and sits in a soundproof booth outfitted for speech recordings. The patient repeats sustained vowel sounds, like "aaa" or "ooo," or reads a short passage that reflects a wide variety of sounds and mouth movements in the English language. The recordings of the patient's voice are then evaluated by algorithms that measure acoustic properties, including the acoustic correlates of perceived pitch and loudness of the voice.

In-person voice evaluations came to halt, however, at the height of the COVID-19 pandemic. Voice evaluations moved to a virtual format, but until now, the accuracy of those evaluation procedures done online has never been examined.

In a soundproof room, the team recorded voice samples from 29 patients, aged 18 to 82, that had a variety of speech or voice diagnoses. These recordings were then played back to researchers through an external speaker over the teleconferencing platforms, simulating telepractice conversations.

The team quickly learned that each platform has its own audio enhancement algorithm that affects the quality of the sound. Zoom was the only platform that enabled users to turn off these audio enhancement features, allowing the researchers to test the platform's original audio.

Despite the enhancements, the team predicted the ability to measure vocal fundamental frequency (pitch) and vocal intensity (loudness) through teleconferencing platforms is not significantly affected.

But the researchers discovered that all the teleconferencing platforms did a poor job at capturing many measurements needed for accurate and clinically meaningful voice evaluations. Pitch varied significantly on all the virtual platforms compared to the real-life recordings. This might be due to internet connection or bandwidth issues that affect how and when sounds get transmitted through the platforms, the researchers say.

They also found the dynamic range of the vocal loudness measured over telepractice was very different from live recordings. "This was the biggest surprise for us," Weerathunge says. The effect was even true for Zoom, where the researchers could turn off the audio enhancements.

Overall, "Microsoft Teams performed the best, in that all our voice measures were the least affected in that platform," Weerathunge says.

Because many of the voice metrics collected from virtual platforms had clinically significant differences from those collected in person, Weerathunge and the team urge caution for voice and speech therapists using telepractice.

"This work is likely to have substantial impact on clinical practice, providing crucial information about the effects of these telepractice platforms on clinical voice evaluations," Stepp says.

Hasini R Weerathunge et al.
Accuracy of Acoustic Measures of Voice via Telepractice Videoconferencing Platforms.
Journal of Speech, Language, and Hearing Research, 2021. doi: 10.1044/2021_JSLHR-20-00625

Most Popular Now

Research Shows AI Technology Improves Pa…

Existing research indicates that the accuracy of a Parkinson's disease diagnosis hovers between 55% and 78% in the first five years of assessment. That's partly because Parkinson's sibling movement disorders...

Who's to Blame When AI Makes a Medi…

Assistive artificial intelligence technologies hold significant promise for transforming health care by aiding physicians in diagnosing, managing, and treating patients. However, the current trend of assistive AI implementation could actually...

First Therapy Chatbot Trial Shows AI can…

Dartmouth researchers conducted the first clinical trial of a therapy chatbot powered by generative AI and found that the software resulted in significant improvements in participants' symptoms, according to results...

DMEA sparks: The Future of Digital Healt…

8 - 10 April 2025, Berlin, Germany. Digitalization is considered one of the key strategies for addressing the shortage of skilled workers - but the digital health sector also needs qualified...

DeepSeek: The "Watson" to Doct…

DeepSeek is an artificial intelligence (AI) platform built on deep learning and natural language processing (NLP) technologies. Its core products include the DeepSeek-R1 and DeepSeek-V3 models. Leveraging an efficient Mixture...

Stepping Hill Hospital Announced as SPAR…

Stepping Hill Hospital, part of Stockport NHS Foundation Trust, has replaced its bedside units with state-of-the art devices running a full range of information, engagement, communications and productivity apps, to...

DMEA 2025: Digital Health Worldwide in B…

8 - 10 April 2025, Berlin, Germany. From the AI Act, to the potential of the European Health Data Space, to the power of patient data in Scandinavia - DMEA 2025...