VMLU

Free

About

VMLU is a human-centric benchmark suite designed to assess the overall capabilities of foundation models, with a particular focus on the Vietnamese language. It comprises four datasets: Vi-MQA for multiple-choice question answering, Vi-SQuAD for reading comprehension, Vi-DROP for logical reasoning, and Vi-Dialog for conversational ability. Together they cover general knowledge, reading comprehension, logical reasoning, and conversation, broadening the set of evaluation benchmarks available for Vietnamese NLP. The project aims to drive the development of more robust foundation models and to encourage further research on LLMs. Dataset sources include examinations from various educational institutions and from the Ministry of Education and Training. VMLU also provides a GitHub repository with detailed information, benchmarking results for publicly available models, and benchmarking code for replicating those results.

Platform: Web
Task: Language benchmarking

Features

extensive dataset sourced from educational institutions

offers benchmarking code for reproducing results

provides benchmarking results for publicly available models

evaluates general knowledge, reading comprehension, logical reasoning, and conversational ability

comprises four distinct datasets: Vi-MQA, Vi-SQuAD, Vi-DROP, Vi-Dialog

specialized for Vietnamese language models

human-centric benchmark suite for LLMs

Pricing Plans

Free
Free Plan

Access to Vi-MQA dataset

Access to Vi-SQuAD dataset

Access to Vi-DROP dataset

Access to Vi-Dialog dataset

Access to GitHub repository with benchmarking code
