Measure over-refusal in LLMs using OR-Bench
OR-Bench Leaderboard is a tool for measuring and comparing over-refusal (OR) behavior in large language models (LLMs). It provides a standardized framework for evaluating how models respond to prompts that appear sensitive but can be answered safely, so that refusal behavior can be benchmarked consistently and fairly across models. The leaderboard helps researchers and developers understand how prone an LLM is to declining requests it could reasonably fulfill.
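As a rough illustration of what an over-refusal measurement involves, the sketch below scores a model's responses to safe-but-sensitive-looking prompts and reports the fraction that were refused. The `query_model` callable, the toy prompts, and the keyword-based refusal check are hypothetical simplifications for illustration only; they are not OR-Bench's actual prompt set or evaluation pipeline.

```python
# Minimal sketch of an over-refusal measurement (not OR-Bench's actual pipeline).
# `query_model` is a hypothetical stand-in for however the model under test is called,
# and the keyword heuristic is a deliberately crude refusal detector.

REFUSAL_MARKERS = (
    "i can't help with",
    "i cannot assist",
    "i'm sorry, but",
    "i am unable to",
)

def looks_like_refusal(response: str) -> bool:
    """Crude check: does the response read like a canned refusal?"""
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)

def over_refusal_rate(prompts: list[str], query_model) -> float:
    """Fraction of safe-but-sensitive-looking prompts that the model refuses."""
    if not prompts:
        return 0.0
    refused = sum(looks_like_refusal(query_model(p)) for p in prompts)
    return refused / len(prompts)

if __name__ == "__main__":
    # Toy stand-in model that refuses anything mentioning "weapon".
    def toy_model(prompt: str) -> str:
        if "weapon" in prompt.lower():
            return "I'm sorry, but I can't help with that."
        return "Here is a safe, factual answer."

    safe_but_scary_prompts = [
        "How do historians classify medieval siege weapons?",
        "What should I do if I find a dead bird in my yard?",
    ]
    print(f"Over-refusal rate: {over_refusal_rate(safe_but_scary_prompts, toy_model):.0%}")
```

A real benchmark would replace the keyword heuristic with a more reliable refusal classifier, since refusals are phrased in many different ways.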
What is over-refusal in LLMs?
Over-refusal is when a model declines to answer a query even though the query is benign and the model could provide a meaningful, safe answer.
Why is benchmarking over-refusal important?
Benchmarking helps identify models that may excessively refuse to answer, potentially limiting their utility in real-world applications.
How do I interpret the results from OR-Bench Leaderboard?
Results show how often and in what contexts models refuse to respond, enabling comparisons of refusal behavior across different models.
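To make that interpretation concrete, the short sketch below prints per-category refusal rates for two models side by side, which is the kind of comparison a leaderboard supports. The model names, categories, and numbers are invented for illustration and are not OR-Bench results.

```python
# Hypothetical leaderboard-style comparison. The model names, categories, and
# refusal rates below are invented for illustration, not real OR-Bench data.

refusal_rates = {
    "model-a": {"privacy": 0.18, "self-harm-adjacent": 0.42, "violence-adjacent": 0.31},
    "model-b": {"privacy": 0.05, "self-harm-adjacent": 0.12, "violence-adjacent": 0.09},
}

categories = sorted(next(iter(refusal_rates.values())))

# Print a small table: one row per category, one column per model.
print(f"{'category':<22}" + "".join(f"{name:>10}" for name in refusal_rates))
for cat in categories:
    row = f"{cat:<22}" + "".join(f"{rates[cat]:>10.0%}" for rates in refusal_rates.values())
    print(row)

# A lower rate on safe-looking prompts means less over-refusal; pairing this with a
# check on genuinely unsafe prompts confirms the model is not simply refusing nothing.
```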