Speech processing Speech Recognition

IoT: GenAI voice helps generate speech recognition models

A new generative AI feature brings voice recognition to tiny devices with a text-to-speech (TTS) synthetic dataset generation capability. It enables developers to generate synthetic speech data with ...

Nature

Speech Perception and Recognition in Auditory Processing

Speech perception and recognition constitute essential cognitive functions that enable individuals to transform acoustic signals into meaningful linguistic constructs. This process involves the ...

Slator

Google Launches MedASR, an Open Medical Speech-to-Text Model

Google introduces MedASR, an open-weight medical speech-to-text model positioned as a foundational layer for healthcare AI ...

Geeky Gadgets

NVIDIA Parakeet 2 vs OpenAI Whisper: Which AI Speech Recognition Model Wins?

What if the race to perfect AI speech recognition wasn’t just about accuracy but also speed and usability? In a world where audio-to-text transcription powers everything from virtual meetings to ...

CU Boulder News & Events

Fine-tuning a Strong Language Model to Enable Classroom Speech Recognition

Postdoctoral researcher Viet Anh Trinh led a project within Strand 1 to develop a novel neural network architecture that can both recognize and generate speech. He has since moved on from iSAT to a role at ...

Analytics Insight

Best Voice AI Frameworks to Use in 2026

Overview: Leading voice AI frameworks power realistic, fast, and scalable conversational agents across enterprise, consumer, ...

Voice Modality: The Next Frontier In AI And Workflow

The productivity upside is straightforward. Research, like the Stanford report linked above, has repeatedly shown that ...

techtimes

Top 5 Best Speech Recognition Tools in 2024

Choosing a transcription service requires many considerations. In today's fast-paced world, speed is vital, as is accuracy. You want to spend as little time as possible correcting the errors in ...

Electronic Design

FPGAs "Accel" at Automatic Speech Recognition

Discover how FPGA-accelerated automatic speech recognition (ASR) models are reshaping the landscape of speech-to-text applications by enabling faster inference, significantly reduced latency, and ...

CU Boulder News & Events

Tackling Bias in Automatic Speech Recognition - Two Examples From Our Ongoing Work

Rosy Southwell is a postdoc research scientist at CU Boulder who holds a PhD in Cognitive Neuroscience from University College London, UK and an MS in Natural Sciences from University of Cambridge, UK ...

HR News

Audio to Text Conversion: Benefits and Methods

Today, a wide range of technologies enable the efficient conversion of audio into written text. This capability plays a ...
