Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
MTEB Leaderboard
Embedding Leaderboard
Open Ko-LLM Leaderboard
Explore and filter language model benchmark results
The Tokenizer Playground
Experiment with and compare different tokenizers
AI2 WildBench Leaderboard (V2)
Display and explore model leaderboards and chat history
Open Arabic LLM Leaderboard
Track, rank and evaluate open Arabic LLMs and chatbots
Exbert
Explore BERT model interactions
Open Chinese LLM Leaderboard
Display and filter LLM benchmark results
Judge Arena
Compare AI models by voting on responses
openai-detector
Detect if text was generated by GPT-2
GLiNER-Multiv2.1
Identify named entities in text
RAGOndevice AI
Open LLM(CohereForAI/c4ai-command-r7b-12-2024) and RAG
NuExtract 1.5
Playground for NuExtract-v1.5
Tokenizer Arena
Compare different tokenizers in char-level and byte-level.
Prime Number Finder
"One-minute creation by AI Coding Autonomous Agent MOUSE"
Grobid
Extract bibliographical metadata from PDFs
Stick To Your Role! Leaderboard
Compare LLMs by role stability
Arabic NLP Demo
Explore Arabic NLP tools
AraGen Leaderboard
Generative Tasks Evaluation of Arabic LLMs
RADAR AI Text Detector
Identify AI-generated text