MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish

Garrido-Muñoz, Ismael; Martínez-Santiago, Fernando; Montejo-Ráez, Arturo

MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish

Archivos

2023_MarIAandBETO.pdf (1.78 MB)

Fecha

2023-07-23

Autores

Garrido-Muñoz, Ismael

Martínez-Santiago, Fernando

Montejo-Ráez, Arturo

Editor

Springer

Resumen

The study of bias in language models is a growing area of work, however, both research and resources are focused on English. In this paper, we make a first approach focusing on gender bias in some freely available Spanish language models trained using popular deep neural networks, like BERT or RoBERTa. Some of these models are known for achieving state-of-the-art results on downstream tasks. These promising results have promoted such models’ integration in many real-world applications and production environments, which could be detrimental to people affected for those systems. This work proposes an evaluation framework to identify gender bias in masked language models, with explainability in mind to ease the interpretation of the evaluation results. We have evaluated 20 different models for Spanish, including some of the most popular pretrained ones in the research community. Our findings state that varying levels of gender bias are present across these models.This approach compares the adjectives proposed by the model for a set of templates. We classify the given adjectives into understandable categories and compute two new metrics from model predictions, one based on the internal state (probability) and the other one on the external state (rank). Those metrics are used to reveal biased models according to the given categories and quantify the degree of bias of the models under study.

Palabras clave

BERT, Bias evaluation, Deep learning, Gender bias, Language model, RoBERTa

Citación

Garrido-Muñoz, I., Martínez-Santiago, F. & Montejo-Ráez, A. MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish. Lang Resources & Evaluation (2023). https://doi.org/10.1007/s10579-023-09670-3

URI

https://link.springer.com/article/10.1007/s10579-023-09670-3
https://hdl.handle.net/10953/1911

Colecciones

DI-Artículos

Página completa del ítem

RUJA: Repositorio Institucional de Producción Científica

MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish

Archivos

Fecha

Autores

Título de la revista

ISSN de la revista

Título del volumen

Editor

Resumen

Descripción

Palabras clave

Citación

URI

Colecciones