Evaluate adversarial robustness using generative models
Measure over-refusal in LLMs using OR-Bench
Demo of the new, massively multilingual leaderboard
Merge machine learning models using a YAML configuration file
Load AI models and prepare your space
Teach, test, evaluate language models with MTEB Arena
View RL Benchmark Reports
Browse and evaluate ML tasks in MLIP Arena
Explore and benchmark visual document retrieval models
Browse and filter ML model leaderboard data
Upload a machine learning model to Hugging Face Hub
Display leaderboard for earthquake intent classification models
Browse and submit LLM evaluations
GREAT Score is a tool designed to evaluate adversarial robustness using generative models. It provides a framework to assess how well machine learning models, particularly those in the realm of generative AI, can withstand adversarial attacks. By leveraging generative models, GREAT Score offers insights into the vulnerabilities of AI systems, helping developers and researchers improve model resilience and security.
• Adversarial Attack Generation: Utilizes advanced techniques to craft sophisticated adversarial examples to test model robustness.
• Multiple Threat Models: Supports various threat models, including L2, L∞, and other common adversarial attack constraints.
• Model Agnostic: Works with diverse types of machine learning models, including neural networks and traditional models.
• Flexible Evaluation Metrics: Provides detailed metrics to quantify model performance under adversarial conditions.
• Integration with Popular Libraries: Compatible with leading machine learning frameworks such as TensorFlow and PyTorch.
• Customizable Parameters: Allows users to tweak attack strength, step sizes, and other parameters for tailored evaluations.
• Comprehensive Reporting: Delivers detailed reports on model vulnerabilities and suggested improvements.
pip install great-score
to install the package.import great_score
in your Python script or notebook.What does GREAT Score stand for?
GREAT Score is an acronym that reflects its purpose of evaluating adversarial robustness using generative models.
Can GREAT Score be used with any type of machine learning model?
Yes, GREAT Score is designed to be model-agnostic and can work with various types of machine learning models, including neural networks and traditional models.
How do I interpret the results from GREAT Score?
The results provide metrics on model performance under adversarial conditions, highlighting vulnerabilities and suggesting potential improvements to enhance robustness.