Development Of a Speech-To-Text System for Sanskrit: Leveraging Linguistic and Computing Techniques

Authors

  • Ms. Manisha D Mistry Author
  • Dr. Dikshan N Shah Author

DOI:

https://doi.org/10.53555/AJBR.v27i4S.6670

Keywords:

Indian languages, Automatic Speech Recognition, Speech-to-Text, Sanskrit parser, Constitutional Languages,

Abstract

Objectives: Speech-to-text (STT) is a technology for converting spoken words into printed text. It's also called Automatic Speech Recognition (ASR). STT systems analyze audio data and detect spoken words using complex algorithms and machine learning models. The natural-language idea has been dubbed an "AI-complete problem" since it appears to necessitate both the ability to use language and a thorough grasp of the surrounding world. In the case of Indian languages, there are 1652 dialects and 22 official languages, with more than 30 languages spoken, including six Indian languages on Google Assistant.
Methods: Sanskrit is highly valued in literature, culture, and traditions. Languages such as Assamese, Dogri, Punjabi, Kashmiri, Sanskrit, and others remain understudied. Only a fraction of India's 22 constitutional languages have had NER work done. Some obstacles include a scarcity of resources, verbal ambiguity, and morphological diversity. It is challenging to create a speech recognition system for a language with low resources. This paper presents a lightweight audio recognition method and a text generation technique for the Sanskrit language. A unique algorithm for recognizing Sanskrit speech-to-text is built using a Sanskrit parser. For input purposes, male and female voices are used for selected Sanskrit words. After taking the voice, the Sanskrit Parser is utilized to generate the relevant text.
Findings: 
This study used an innovative approach to convert Indian language speech to text. Speech-to-text for Indian languages is difficult due to their rich morphology. The pronunciation of one language cannot be matched with another, resulting in a pure case of gender-based language voice diversity. By implementing gender-specific voice input, the system produced a variety of performance results. 
Novelty: The Eighth Schedule lists 22 Indian constitutional languages. Few languages are spoken in many states with various dialects, and the number of speakers is very small or none. Sanskrit is one of the languages with the fewest speakers, yet due to its sophisticated grammar and syntax, it plays an important part in contemporary technical advances with AI. We propose a novel technique for Sanskrit speech-to-text. Because there are so few Sanskrit language speakers, we created our own dataset by selecting 100 words from the Sanskrit lexicon dictionary and utilizing them as voice input, gender-specific.

Author Biographies

  • Ms. Manisha D Mistry

    Teaching Assistant, Vanita Vishram Women’s University, Surat, Gujarat, India, 

  • Dr. Dikshan N Shah

    Assistant Professor, Vanita Vishram Women’s University, Surat, Gujarat, India. 

Downloads

Published

2024-11-24

Issue

Section

Research Article

How to Cite

Development Of a Speech-To-Text System for Sanskrit: Leveraging Linguistic and Computing Techniques. (2024). African Journal of Biomedical Research, 27(4S), 13156-13163. https://doi.org/10.53555/AJBR.v27i4S.6670