Speech Corpus Creation Tool
Support by Parquet, CSV, Jsonl, XLS
Upload files to a Hugging Face repository
Browse and extract data from Hugging Face datasets
Search and find similar datasets
Build datasets and workflows using AI models
Manage and analyze datasets with AI tools
Perform OSINT analysis, fetch URL titles, fine-tune models
Manage and annotate datasets
Browse a list of machine learning datasets
sign in to receive news on the iPhone app
List of French datasets not referenced on the Hub
Convert PDFs to a dataset and upload to Hugging Face
Dhravani is a Speech Corpus Creation Tool designed to help users efficiently create and manage speech datasets. It enables the recording of voices and the transcription of speech into text, making it an essential tool for building high-quality speech corpora. The tool is user-friendly, with AI-powered transcription capabilities that ensure accuracy and speed in processing speech data.
• Voice Recording: Capture high-quality audio directly through the platform. • Automatic Transcription: AI-driven transcription converts speech to text in real-time. • Noise Reduction: Advanced filters to minimize background noise and improve audio clarity. • Multi-Language Support: Create speech corpora in multiple languages. • Customizable Formats: Export data in various formats suitable for different applications. • Collaboration Tools: Work with teams to annotate and validate transcriptions. • Scalability: Handle large-scale projects with ease.
What languages does Dhravani support?
Dhravani supports a wide range of languages, including English, Spanish, Mandarin, French, and many others. Check the official documentation for the full list of supported languages.
Can I use Dhravani for commercial projects?
Yes, Dhravani is suitable for both academic and commercial projects. Ensure compliance with licensing terms when using the tool for commercial purposes.
Is my data secure on Dhravani?
Dhravani prioritizes data security. All recordings and transcriptions are encrypted and stored securely on the platform.