Create and manage datasets for training ML models
List of French datasets not referenced on the Hub
Manage and annotate datasets
Browse and extract data from Hugging Face datasets
Upload files to a Hugging Face repository
Manage and label datasets for your projects
Save user inputs to datasets on Hugging Face
Create datasets with FAQs and SFT prompts
Create a large, deduplicated dataset for LLM pre-training
Convert a model to Safetensors and open a PR
Manage and analyze labeled datasets
Find and view synthetic data pipelines on Hugging Face
My Public Argilla is a tool for creating and managing datasets used to train machine learning models. It simplifies dataset creation and management so you can focus on building high-quality training data for your ML projects.
• Dataset Creation: Easily create and structure datasets for various machine learning tasks.
• Data Management: Organize and manage datasets efficiently with intuitive tools.
• Collaboration: Share datasets with team members or the public for collaborative work.
• Version Control: Track changes and maintain different versions of your datasets.
• Integration: Compatible with popular machine learning frameworks and tools.
What types of data can I upload to My Public Argilla?
You can upload data in various formats, including CSV, JSON, Excel, and more.
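Before uploading, it can help to normalize files in different formats into one common record shape. The sketch below is only an illustration using Python's standard library: the `load_records` helper and the `text`/`label` field names are hypothetical, and the upload step itself is not shown because it depends on how your instance is configured.

```python
import csv
import io
import json


def load_records(text: str, fmt: str) -> list[dict]:
    """Parse CSV or JSON text into a list of dict records (hypothetical helper)."""
    if fmt == "csv":
        # DictReader uses the first row as the field names.
        return [dict(row) for row in csv.DictReader(io.StringIO(text))]
    if fmt == "json":
        data = json.loads(text)
        # Accept either a single object or an array of objects.
        return data if isinstance(data, list) else [data]
    raise ValueError(f"unsupported format: {fmt}")


# Sample inputs with assumed field names, for illustration only.
csv_text = "text,label\nGreat product,positive\nToo slow,negative\n"
json_text = '[{"text": "Great product", "label": "positive"}]'

csv_records = load_records(csv_text, "csv")
json_records = load_records(json_text, "json")

# Both formats end up as the same list-of-dicts shape.
print(len(csv_records), len(json_records))
```

Keeping every source in the same list-of-dicts shape means later steps such as validation or labeling do not need to know which file format a record came from.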
Can I share datasets privately?
Yes, My Public Argilla allows you to share datasets publicly or restrict access to specific team members.
How does version control work in My Public Argilla?
Version control lets you track changes to your datasets over time. You can revert to previous versions if needed.