Speech to Text Using Java

Why Developers Are Dropping Cloud APIs for This Tiny 82M Speech Model

Kokoro 82M is an 82-million-parameter text-to-speech model that beats many TTS APIs while running locally on CPUs, including ...

Top Text-to-Speech Models of 2026: Proprietary vs Open Source Compared

Open source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody with CER and MOS results.

Say goodbye to language barriers as Google Meet speech translation rolls out on Android

Timi is the news and deals reporter for Android Police and has been reporting on technology since 2008. He has worked in ...

Slator

Mistral Completes Voxtral Speech Stack With Launch of Text-to-Speech Model

Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.

Reuters

Trump says 'Cuba is next' in speech touting US military successes

MIAMI, March 27 (Reuters) - U.S. President Donald Trump on Friday said "Cuba is next" during a speech at an investment forum in Miami during which he touted the successes of U.S. military action in ...

IEEE

Real-Time Sign Language Interpretation and Translation to Speech Using Cuda and Machine Learning

Abstract: This paper introduces a novel deep learning-based system for real-time American Sign Language (ASL) interpretation and translation into speech, aimed at improving communication for ...

TechCrunch

Mistral releases a new open source model for speech generation

French AI company Mistral released a new open source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets ...

GitHub

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multilingual: supports Chinese and English.

IEEE

Advancing Text-to-Speech Systems for Low-Resource Languages: Challenges, Innovations, and Future Directions

Abstract: Speech synthesis, the technology that converts text into spoken words, has advanced significantly for high-resource languages like English, Spanish, and Mandarin. However, many languages ...
