Skip to the main content

Original scientific paper

https://doi.org/10.32985/ijeces.15.7.6

TelMedAI: A Framework for Patient Speech Recognition and Conversion into Desired Language Towards Telemedicine System

Mrudula Owk orcid id orcid.org/0000-0002-6154-8761 ; GITAM University, Department of CSE, GITAM School of Technology Rushikonda, Visakhapatnam -530045 *
Deepthi Godavarthi ; VIT-AP University, School of Computer Science and Engineering(SCOPE) Amaravati, Andhra Pradesh, India- 522237
Pusarla Sindhu ; GITAM University, Department of CSE, GITAM School of Technology Rushikonda, Visakhapatnam -530045
T. Krishna Mohana ; Department of ECE Aditya College of Engineering

* Corresponding author.


Full text: english pdf 1.612 Kb

page 603-610

downloads: 78

cite


Abstract

Telemedicine is the practice of technology-enabled remote communication between patient and doctor. This phenomenon in healthcare has the potential to make services affordable and save time and money. Besides telemedicine allows care givers and family members to join conversations with doctors. Indian government initiated the National Telemedicine Network (NTN) to serve remote areas in healthcare by integrating existing healthcare facilities.Literature has revealed that existing works lack in an integrated approach for patient speech translation in language-independent fashion and automatic detection of disease and symptoms based on speech.There is a need for an automated system using Artificial Intelligence (AI) to recognize patient's speech and identify symptoms based on given audio description. We proposed a framework known as TelMedAI which is designed to recognize patient speech to comprehend disease symptoms besides converting the speech text into desired language. The framework is useful for realizing a telemedicine system. Speech to Speech (STS) module takes the patient's audio content into English audio. STS module exploits the Bi-LSTM model with an encoder, decoder and attention mechanism for translation. Then Google Speech API is used to convert English audio into English text. Then the framework exploits Natural Language Processing (NLP) to improve the quality of text. Afterwards, the disease and symptoms miner module eventually recognizes a list of diseases and corresponding symptoms. We proposed an algorithm known as Learning based Disease and Symptom Recognition from Patient Speech (LbDSRPS). This algorithm has the functionality to develop TelMedAI which helps doctors in telemedicine. Our empirical study has revealed that TelMedAI takes technology-driven telemedicine research forward significantly. The highest accuracy achieved by the proposed framework is 68.13% which is much better than the baseline LSTM model used for voice translation.

Keywords

Telemedicine System; Patient Speech Recognition; Deep Learning; Artificial Intelligence; Multi-Lingual Text Conversion;

Hrčak ID:

319164

URI

https://hrcak.srce.hr/319164

Publication date:

12.7.2024.

Visits: 222 *