Benchmarking 24 Large Language Models for Automated Multiple-Choice Question Generation in Latvian

Anna Daupare; Gints Jēkabsons

doi:10.2478/acss-2025-0010

ScienceGate Book Chapters

JOURNAL ARTICLE

Benchmarking 24 Large Language Models for Automated Multiple-Choice Question Generation in Latvian

Anna Daupare Gints Jēkabsons

Year: 2025 Journal: Applied Computer Systems Vol: 30 (1)Pages: 85-90 Publisher: Polish Association for Knowledge Promotion

DOI: 10.2478/acss-2025-0010

Get Full-Text PDF Get Analytical Report

Abstract

Abstract Large Language Models (LLMs) are increasingly being used for a wide range of text generation tasks. This paper investigates the generation of Multiple-Choice Questions in Latvian to assess both the ability of LLMs to generate high-quality questions and answers and, more broadly, their capability to process Latvian, a lower-resourced language that has received relatively little attention in LLM research. This study benchmarks 24 different LLMs, specifically those developed by Anthropic, DeepSeek, OpenAI, Google, Meta, Mistral, and Microsoft. The findings highlight the varying capabilities of these models in handling Latvian, producing grammatically correct, coherent, and meaningful text. The best-performing closed-weights model is claude-3.5-sonnet (by Anthropic), the best-performing open-weights model is deepseek-v3 (by DeepSeek), and the best-performing small open-weights model is open-mistral-nemo (by Mistral).

Keywords:

Latvian Benchmarking Computer science Process (computing) Comprehension Language model Natural language processing Best practice Artificial intelligence Software engineering Linguistics Programming language Political science Marketing Business

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.07

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Expert finding and Q&A systems

Physical Sciences → Computer Science → Information Systems

Benchmarking 24 Large Language Models for Automated Multiple-Choice Question Generation in Latvian

Abstract

Metrics

Topics

Related Documents

Automated multiple-choice question generation in Spanish using neural language models

Benchmarking Large Language Models for Automated Verilog RTL Code Generation

Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models

Automated Multiple-Choice Question Generation and Analysis for Language Learning Assessment

An Automated Multiple-Choice Question Generation using Natural Language Processing Techniques