Request evaluation of a speech recognition model
Sound effect from description
Generate audio from text in multiple languages
MaskGCT TTS Demo
Convert text into speech in Japanese
Convert text to speech with voice customization
Convert audio to text and summarize highlights
Generate speech from text with adjustable rate and pitch
Generate text from audio input
Generate high-quality speech from text with specified emotion and voice
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
CPU powered, low RTF, emotional, multilingual TTS
Generate speech from text with custom voice
Open ASR Leaderboard is a tool designed to evaluate and benchmark speech recognition models. It provides a platform to assess the performance of automatic speech recognition (ASR) systems by comparing them against standard datasets and metrics. The leaderboard allows users to submit their models for evaluation and view how they perform relative to other models.
• Automatic Evaluation: Easily test and evaluate speech recognition models with minimal setup.
• Benchmarking: Compare your model's performance against industry-standard models and datasets.
• Customizable Testing: Choose from multiple datasets and metrics to tailor your evaluation.
• Real-Time Tracking: Monitor your model's performance in real-time as it processes the test data.
• Results Visualization: Access detailed reports and visualizations of your model's strengths and weaknesses.
What types of speech recognition models are supported?
Open ASR Leaderboard supports a wide range of speech recognition models, including deep learning-based models like CNNs, RNNs, and Transformers.
How long does the evaluation process typically take?
The evaluation time depends on the size of the dataset and the complexity of your model. It can range from a few minutes to several hours.
Can I use custom datasets for evaluation?
Yes, Open ASR Leaderboard allows users to upload custom datasets for evaluation, providing flexibility in testing specific scenarios or languages.