Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.
At the India AI Impact Summit 2026, the conversation surrounding digital inclusivity reached a new milestone with the formal unveiling of Vachana TTS. Developed by Gnani.ai as a pivotal element of the ...
This study suggests that the analysis of speech data recorded while reading text-dependent sentences could help predict depression status automatically by capturing the characteristics of depression.
Union finance minister Nirmala Sitharaman on Sunday delivered her ninth consecutive Union Budget speech in the Lok Sabha, outlining the government’s fiscal roadmap, policy priorities, and key reforms ...
Abstract: Although recent neural text-to-speech (TTS) systems have achieved high-quality speech synthesis, there are cases where a TTS system generates low-quality speech, mainly caused by limited ...
WhisperS2T is an optimized lightning-fast open-sourced Speech-to-Text (ASR) pipeline. It is tailored for the whisper model to provide faster whisper transcription. It's designed to be exceptionally ...
Abstract: Speech is one of the most important types of communication among the human beings. Speech recognition is one of the most widely used applications of speech processing. Developing a automatic ...