The First Automatic C1 Evaluator for Basque (Euskarazko lehen C1 ebaluatzaile automatikoa)
DOI:
https://doi.org/10.26876/ikergazte.vi.03.15
Keywords:
artificial intelligence, machine learning, language models, automatic essay evaluation
Abstract
In this article, we present an automatic evaluator that determines whether a text written in Basque meets the C1 level. The system was trained on 10,000 transcribed essays obtained through an agreement between HABE and HiTZ. To analyze the potential impact of essay topics, we set up training in two ways: with texts from a single exam period and with texts from two exam periods. As baselines, we trained two language models for Basque, RoBERTa and Latxa. We then applied several techniques to address data scarcity, prevent overfitting, and improve performance: EDA, SCL, and regularization. Finally, we analyzed the system's behavior to measure model calibration and the impact of artifacts.
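The abstract does not say which calibration measure was used. As an illustration only, the sketch below computes expected calibration error (ECE), one common way to quantify calibration for a classifier such as a C1 / not-C1 evaluator. The function name, the equal-width binning, and the default of 10 bins are assumptions made for this example and are not taken from the article.

```python
from typing import Sequence


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Illustrative ECE with equal-width confidence bins (assumed setup).

    confidences: the model's probability for its predicted label, in [0, 1].
    correct: whether each prediction matched the gold label.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    total = len(confidences)
    if total == 0:
        raise ValueError("at least one prediction is required")

    # Group predictions by confidence; a confidence of 1.0 goes in the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        index = min(int(conf * n_bins), n_bins - 1)
        bins[index].append((conf, ok))

    # Sum the |accuracy - mean confidence| gaps, weighted by bin size.
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(conf for conf, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / total) * abs(accuracy - mean_conf)
    return ece
```

For instance, four predictions each made with confidence 0.9, of which three are correct, give a gap of about 0.15 between confidence and accuracy. A value near zero would indicate that the reported confidences track the observed accuracy.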
License
Copyright (c) 2025 IkerGazte. Nazioarteko ikerketa euskaraz

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
