Talk to Qwen2Audio with Gradio and WebRTC ⚡️
Simple Space for the Kokoro Model
Whisper model to transcript japanese audio to katakana.
Request evaluation of a speech recognition model
A demo of Indic Parler-TTS
Generate audio from text or modify voice pitch
Voice Clone Multilingual TTS
Generate audio from text with customizable voice
Belarusian TTS
V1.0Convert any Ebook to AudioBook with Xtts + VoiceCloning!
Moonshine ASR models running on-device, in your web browser.
Generate audio from text or file
Generate text and audio responses to user queries
Talk To Qwen Webrtc is a Speech Synthesis application designed to convert spoken words into text and generate responses using Gradio and WebRTC. It allows users to engage in real-time communication where spoken inputs are transcribed and processed to provide meaningful outputs. Ideal for users looking for an interactive and intuitive way to convert speech to text and beyond.
• Real-Time Speech-to-Text Conversion: Transcribes spoken words into text with low latency for immediate feedback. • WebRTC Integration: Leverages WebRTC technology for secure and reliable peer-to-peer communication. • AI-Powered Responses: Generates responses based on the transcribed text, enabling natural-sounding interactions. • Cross-Platform Compatibility: Works seamlessly across browsers and devices, ensuring accessibility. • User-Friendly Interface: Intuitive design for easy navigation and interaction.
1. What browsers are supported by Talk To Qwen Webrtc?
The application supports modern browsers like Google Chrome, Mozilla Firefox, and Microsoft Edge that are compatible with WebRTC.
2. Why am I not able to use the microphone?
Ensure your browser has permission to access your microphone. Check your browser settings or refresh the page and try again.
3. Is there any latency in the transcription process?
While the app strives for real-time transcription, slight delays may occur based on your internet connection speed and device performance.