Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Explore and filter language model benchmark results
Experiment with and compare different tokenizers
Display and explore model leaderboards and chat history
Track, rank and evaluate open Arabic LLMs and chatbots
Explore BERT model interactions
Display and filter LLM benchmark results
Compare AI models by voting on responses
Detect if text was generated by GPT-2
Identify named entities in text
Open LLM(CohereForAI/c4ai-command-r7b-12-2024) and RAG
Playground for NuExtract-v1.5
Compare different tokenizers in char-level and byte-level.
"One-minute creation by AI Coding Autonomous Agent MOUSE"
Extract bibliographical metadata from PDFs
Compare LLMs by role stability
Explore Arabic NLP tools
Generative Tasks Evaluation of Arabic LLMs
Identify AI-generated text