Request evaluation of a speech recognition model
Kokoro is an open-weight TTS model with 82 million parameters.
Convert text to speech with customizable settings
Belarusian TTS
Transcribe Persian audio to text
Generate high-quality speech from text with specified emotion and voice
Convert text to speech effortlessly
"Designed for all users, including those with disabilities."
Transcribe spoken Russian into text
Generate speech from text with customizable options
Convert text into speech in Japanese
Generate speech from text or files
Open ASR Leaderboard is a tool designed to evaluate and benchmark speech recognition models. It provides a platform to assess the performance of automatic speech recognition (ASR) systems by comparing them against standard datasets and metrics. The leaderboard allows users to submit their models for evaluation and view how they perform relative to other models.
• Automatic Evaluation: Easily test and evaluate speech recognition models with minimal setup.
• Benchmarking: Compare your model's performance against industry-standard models and datasets.
• Customizable Testing: Choose from multiple datasets and metrics to tailor your evaluation.
• Real-Time Tracking: Monitor your model's performance in real-time as it processes the test data.
• Results Visualization: Access detailed reports and visualizations of your model's strengths and weaknesses.
What types of speech recognition models are supported?
Open ASR Leaderboard supports a wide range of speech recognition models, including deep learning-based models like CNNs, RNNs, and Transformers.
How long does the evaluation process typically take?
The evaluation time depends on the size of the dataset and the complexity of your model. It can range from a few minutes to several hours.
Can I use custom datasets for evaluation?
Yes, Open ASR Leaderboard allows users to upload custom datasets for evaluation, providing flexibility in testing specific scenarios or languages.