Abstract
Despite their widespread use, machine learning (ML) methods often exhibit sub-optimal performance. The accuracy of these models is primarily hindered by insufficient training data and poor data quality, with particularly severe consequences in critical areas such as medical diagnosis prediction. Our hypothesis is that enhancing ML pipelines with semantic information such as those available in knowledge graphs (KG) can address these challenges and improve ML prediction accuracy. To that end, we extend the state of the art through a novel approach that uses KG embeddings to augment tabular data in various innovative ways within ML pipelines. Concretely, we introduce and examine several integration techniques of KG embeddings and the influence of KG characteristics on model performance, specifically accuracy and F2 scores. We evaluate our approach with four ML algorithms and two embedding techniques, applied to heart and chronic kidney disease prediction. Our results indicate consistent improvements in model performance across various ML models and tasks, thus confirming our hypothesis, e.g. we increased the F2 score for the KNN from 70% to 82.22%, and the F2 score for SVM from 74.53% to 81.71%, for heart disease prediction.
Originalsprache | Englisch |
---|---|
Titel des Sammelwerks | Neural-Symbolic Learning and Reasoning - 18th International Conference, NeSy 2024, Proceedings |
Untertitel des Sammelwerks | Barcelona, Spain, September 9-12, 2024 |
Herausgeber*innen | Tarek R. Besold, Artur d’Avila Garcez, Ernesto Jimenez-Ruiz, Roberto Confalonieri, Pranava Madhyastha, Benedikt Wagner |
Erscheinungsort | Cham |
Verlag | Springer Nature Switzerland AG |
Seiten | 279-295 |
Seitenumfang | 17 |
Band | 1 |
ISBN (elektronisch) | 9783031711671 |
ISBN (Print) | 9783031711664 |
DOIs | |
Publikationsstatus | Veröffentlicht - 2024 |
Veranstaltung | 18th International Conference on Neural-Symbolic Learning and Reasoning, NeSy 2024 - Barcelona, Spanien Dauer: 9 Sept. 2024 → 12 Sept. 2024 |
Publikationsreihe
Reihe | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Band | 14979 |
ISSN | 0302-9743 |
Konferenz
Konferenz | 18th International Conference on Neural-Symbolic Learning and Reasoning, NeSy 2024 |
---|---|
Land/Gebiet | Spanien |
Ort | Barcelona |
Zeitraum | 9/09/24 → 12/09/24 |
Bibliographische Notiz
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.