Talk to Qwen2Audio with Gradio and WebRTC ⚡️
ヘスティアのAI音声合成モデルを作りました。
V1.0Convert any Ebook to AudioBook with Xtts + VoiceCloning!
Sound effect from description
Generate audio from text or modify voice pitch
Convertir texto a audio
Whisper model to transcript japanese audio to katakana.
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
Convert text to speech effortlessly
Generate speech from text with custom voice
Transcribe or translate audio and YouTube videos
Generate speech from text with adjustable rate and pitch
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Talk To Qwen Webrtc is a Speech Synthesis application designed to convert spoken words into text and generate responses using Gradio and WebRTC. It allows users to engage in real-time communication where spoken inputs are transcribed and processed to provide meaningful outputs. Ideal for users looking for an interactive and intuitive way to convert speech to text and beyond.
• Real-Time Speech-to-Text Conversion: Transcribes spoken words into text with low latency for immediate feedback. • WebRTC Integration: Leverages WebRTC technology for secure and reliable peer-to-peer communication. • AI-Powered Responses: Generates responses based on the transcribed text, enabling natural-sounding interactions. • Cross-Platform Compatibility: Works seamlessly across browsers and devices, ensuring accessibility. • User-Friendly Interface: Intuitive design for easy navigation and interaction.
1. What browsers are supported by Talk To Qwen Webrtc?
The application supports modern browsers like Google Chrome, Mozilla Firefox, and Microsoft Edge that are compatible with WebRTC.
2. Why am I not able to use the microphone?
Ensure your browser has permission to access your microphone. Check your browser settings or refresh the page and try again.
3. Is there any latency in the transcription process?
While the app strives for real-time transcription, slight delays may occur based on your internet connection speed and device performance.