Generative Tasks Evaluation of Arabic LLMs
AraGen Leaderboard is a platform for evaluating and comparing Arabic large language models (LLMs) on generative tasks. It serves as a benchmark for assessing how well different models understand and generate Arabic text. AraGen Leaderboard is free to use and is aimed at researchers, developers, and anyone else working on Arabic NLP.
What is AraGen Leaderboard used for?
AraGen Leaderboard is used to evaluate and compare the performance of Arabic language models on generative tasks, helping users identify the best models for their needs.
How are models evaluated on AraGen Leaderboard?
Models are evaluated against predefined metrics as well as custom tasks set by the user, giving a broad assessment of their capabilities.
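To illustrate how per-task metric scores can be turned into a ranking, here is a minimal sketch. It assumes each model has one score per task and that the leaderboard sorts by the plain average. The model names, task names, numbers, and the `rank_models` helper are all hypothetical placeholders, and this is not AraGen's actual scoring method.

```python
from statistics import mean

# Hypothetical per-task scores in [0, 1]. Model names, task names, and
# numbers are placeholders for illustration, not AraGen results.
scores = {
    "model-a": {"question_answering": 0.5, "summarization": 0.75},
    "model-b": {"question_answering": 0.75, "summarization": 1.0},
    "model-c": {"question_answering": 0.25, "summarization": 0.5},
}


def rank_models(per_task_scores):
    """Return (model, average score) pairs sorted from best to worst."""
    averages = {
        model: mean(task_scores.values())
        for model, task_scores in per_task_scores.items()
    }
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


leaderboard = rank_models(scores)
for position, (model, average) in enumerate(leaderboard, start=1):
    print(f"{position}. {model}: {average:.3f}")
```

A real leaderboard may weight tasks differently or combine several metrics per task; the unweighted average is used here only because it is the simplest aggregation to read.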
Can I use AraGen Leaderboard for languages other than Arabic?
No, AraGen Leaderboard is specifically designed for evaluating Arabic language models and does not support other languages.