Enhancing Machine Learning Predictions Through Knowledge Graph Embeddings

Majlinda Llugiqi*, Fajar J. Ekaputra, Marta Sabou

*Korrespondierende*r Autor*in für diese Arbeit

Publikation: Beitrag in Buch/KonferenzbandBeitrag in Konferenzband

Abstract

Despite their widespread use, machine learning (ML) methods often exhibit sub-optimal performance. The accuracy of these models is primarily hindered by insufficient training data and poor data quality, with particularly severe consequences in critical areas such as medical diagnosis prediction. Our hypothesis is that enhancing ML pipelines with semantic information such as those available in knowledge graphs (KG) can address these challenges and improve ML prediction accuracy. To that end, we extend the state of the art through a novel approach that uses KG embeddings to augment tabular data in various innovative ways within ML pipelines. Concretely, we introduce and examine several integration techniques of KG embeddings and the influence of KG characteristics on model performance, specifically accuracy and F2 scores. We evaluate our approach with four ML algorithms and two embedding techniques, applied to heart and chronic kidney disease prediction. Our results indicate consistent improvements in model performance across various ML models and tasks, thus confirming our hypothesis, e.g. we increased the F2 score for the KNN from 70% to 82.22%, and the F2 score for SVM from 74.53% to 81.71%, for heart disease prediction.

OriginalspracheEnglisch
Titel des SammelwerksNeural-Symbolic Learning and Reasoning - 18th International Conference, NeSy 2024, Proceedings
Untertitel des SammelwerksBarcelona, Spain, September 9-12, 2024
Herausgeber*innenTarek R. Besold, Artur d’Avila Garcez, Ernesto Jimenez-Ruiz, Roberto Confalonieri, Pranava Madhyastha, Benedikt Wagner
ErscheinungsortCham
VerlagSpringer Nature Switzerland AG
Seiten279-295
Seitenumfang17
Band1
ISBN (elektronisch)9783031711671
ISBN (Print)9783031711664
DOIs
PublikationsstatusVeröffentlicht - 2024
Veranstaltung18th International Conference on Neural-Symbolic Learning and Reasoning, NeSy 2024 - Barcelona, Spanien
Dauer: 9 Sept. 202412 Sept. 2024

Publikationsreihe

ReiheLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band14979
ISSN0302-9743

Konferenz

Konferenz18th International Conference on Neural-Symbolic Learning and Reasoning, NeSy 2024
Land/GebietSpanien
OrtBarcelona
Zeitraum9/09/2412/09/24

Bibliographische Notiz

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

Zitat