Open Innovation Campus

Digital Life

Develop a Spanish Benchmark for LALMs Assessment

Available

Resources

The project will mainly use public tools and datasets.

Are you interested?

If you are a professor or university student and you are interested in participating in the TUTORING program, register your information so that we can start the program.

Student registration
Academic registration

Context

Challenge proposed for students with linguistic knowledge, given that the focus is on speech.

Basic technical knowledge of Large Language Models, prompting, and Python is also recommended.

Intro

Understanding auditory information is essential for fostering natural human-machine interactions.

Large Audio Language Models (LALMs) have rapidly advanced, with models like LTU, SALMONN, GAMA, Audio Flamingo 2, Qwen2.5-Omni, Audio Resoner, Kimi-Audio, and Audio Flamingo 3 showing major progress in processing audio inputs.

Evaluating these models is key to understanding their performance and limitations. 

Existing benchmarks such as MMAU, MMAR, MMSU, and MMAU-Pro focus mainly on English. For Spanish, a clear gap remains, highlighting the need for a dedicated benchmark.

Challenge Description

This challenge aims to create a Multiple-Choice Question Answering benchmark to assess LALMs’ speech capabilities in Spanish.

It will use real-world Spanish audio data and linguistic expertise to build questions and answer choices in Spanish.

The process includes:
- Defining target capabilities (comprehension, reasoning, context).
- Collecting diverse Spanish audio from various dialects and contexts.
- Creating balanced multiple-choice questions and plausible answers.
- Validating quality through human review.
- Evaluating selected models and analyzing results.

A scientific report analyzing the benchmark, plus the benchmark itself with the audio and corresponding questions and answers.

Who is challenging you?

Telefónica's Industrial Tutors accompany you in the development of the TFG/TFM, providing their real vision of the industry. They will share their knowledge and experience, offering you feedback so that you can develop a project with an innovative impact.
Fernando López Telefónica

Fernando López Gavilánez

Product Exploration and Prototyping - Digital Home / Telefónica Innovación Digital