Health

The UNED promotes an AI that converts medical reports into standardized clinical codes

The UNED participates in an AI that codes medical reports into ICD, improves F1 by 3.42%, and explains which text fragments justify each code.

3 minutes

fotonoticia 20260521181102 1920

By Agencias

Published

21 MAY, 2026

3 minutes

Elecciones al Parlamento de Andalucía de 17 de mayo de 2026

🔗 Ver todos los resultados

Datos actualizados: 18/05/2026 00:34 🔄

Próxima actualización en 60s

Escrutado: 99.90% Votantes: 4.218.032 Participación: 64.85%

Votos

Partido	Escaños	Votos	Porcentaje
PP	53 -5	1.735.819	41.60%
PSOE-A	28 -2	947.713	22.71%
VOX	15 +1	576.635	13.82%
ADELANTE ANDALUCÍA	8 +6	401.732	9.62%
PorA	5 =	263.615	6.31%
SALF	0 =	105.761	2.53%
PACMA	0 =	25.056	0.60%
100x100	0 =	14.753	0.35%
ANDALUCISTAS-PA	0 =	12.319	0.29%
ESCAÑOS EN BLANCO	0 =	9.281	0.22%
JM+	0 =	7.961	0.19%
PCPA	0 =	5.849	0.14%
FE de las JONS	0 =	4.962	0.11%
MUNDO+JUSTO	0 =	4.696	0.11%
PARTIDO AUTÓNOMOS	0 =	3.693	0.08%
NA	0 =	3.012	0.07%
HE>	0 =	2.134	0.05%
PCTE	0 =	1.777	0.04%
PODER ANDALUZ	0 =	1.076	0.02%
29	0 =	741	0.01%
ALM	0 =	646	0.01%
ANDALUSÍ	0 =	532	0.01%
IZAR	0 =	502	0.01%
JUFUDI	0 =	396	0.01%
IPAL	0 =	360	0.01%
CONECTA	0 =	329	0.01%
SOCIEDAD UNIDA	0 =	237	0.01%

Escaños (109)

Mayoría: 55

PP 53 escaños

PSOE-A 28 escaños

VOX 15 escaños

ADELANTE ANDALUCÍA 8 escaños

PorA 5 escaños

Mapa

Ganador por provincia

Cargando mapa…

Selecciona un municipio para ver el detalle.

Resultados electorales por provincia
Provincia	Partido ganador	% Escrutado	Escaños
Almería	PP	100.0%	12
Cádiz	PP	100.0%	15
Córdoba	PP	100.0%	12
Granada	PP	100.0%	13
Huelva	PP	100.0%	11
Jaén	PP	100.0%	11
Málaga	PP	99.8%	17
Sevilla	PP	99.8%	18

As detailed by the university, the project proposes an architecture that converts medical texts into codes of the International Classification of Diseases (CIE) through a three-stage workflow: entity recognition, supervised classification, and semantic similarity analysis. Tested on Spanish and English corpora, the system has obtained competitive results and an average improvement of 3.42% in the F1 metric compared to previous approaches.

Clinical coding makes it possible to translate the content of reports -diagnoses, symptoms, procedures or history- into a common language based on standardized codes. In this way, professionals from different centers can record and consult data homogeneously, which simplifies health management and the subsequent analysis of large volumes of information.

Despite this, specialists in the sector recall that this task continues to be carried out largely manually, with a high consumption of time and human resources. "Systems that automate the process provide a lot of value, as they convert a very tedious and time-consuming task into a fast and efficient process, which in turn allows more time to be dedicated to investigating and analyzing information," explains Alicia Ramírez, researcher of the NLP&IR group of the Department of Languages and Computer Systems of UNED and participant in the development of the system.

The proposal also integrates two relevant contributions. On the one hand, it incorporates an unsupervised phase that allows locating codes that the model had not found during training, expanding its adaptability to real scenarios. On the other hand, it can interpret complex references present in the clinical text, such as fragmented or superimposed mentions that must be understood jointly to assign the appropriate code.

An AI capable of justifying its decisions

One of the most novel elements of the work is that the system does not act as a black box. In addition to generating a proposed coding, it highlights the specific fragments of the medical report that support each prediction. "The presented system, in addition to predicting ICD-10 codes, returns the parts of the text that justify these predictions," indicates the researcher. This allows healthcare personnel to understand why a certain code has been chosen and to validate the result more quickly and transparently.

To validate its behavior, the system underwent tests with corpora in Spanish and English. Although the models were trained separately for each dataset, the objective was to demonstrate that the methodology maintains its performance in diverse contexts and is not limited to a single type of clinical documentation.

According to Ramírez, the improvement achieved - 3.42% in F1 - is especially significant in a task that is very demanding from a computational point of view. The expert recalls that automatic clinical coding operates with highly specialized medical language and with more than 100,000 possible codes, so that apparently modest advances have a notable effect on the system's accuracy and coverage.

The next step in the research will be to build a functional demo that allows the introduction of clinical texts and automatically displays both the suggested codes and the fragments that justify each decision. If the results are satisfactory, this line of work could open the door to future applications in care or research settings.

The development has been carried out thanks to the UNED infrastructure and the specialized knowledge of the NLP&IR group in natural language processing applied to the biomedical field, a consolidated line of work within the Department of Computer Languages and Systems.

The UNED promotes an AI that converts medical reports into standardized clinical codes

Elecciones al Parlamento de Andalucía de 17 de mayo de 2026

Votos

Escaños (109)

Mapa

Most read

Congress will act against massive IP blockages by LaLiga

The judge rejects Gotham's journalistic privilege after its report against Grifols

Tubos Reunidos: the indefinite strike ends, but the battle for the ERE continues

The PSOE-A of María Jesús Montero signs its worst historical result in Andalusia and falls to 28 seats

Diving accident in Maldives: the clue of the empty tank and the keys to the death of five Italians in a cave

An AI capable of justifying its decisions