Abstract
The rise of large language models (LLMs) has advanced information retrieval, yet issues like limited knowledge updating, lack of transparency and interpretability, as well as hallucinations persist. Retrieval-augmented generation (RAG) addresses these problems, though it still lacks interpretability due to reliance on opaque vector-based representations. Our work presents a RAG framework using a knowledge graph (KG) as the primary knowledge base to address this problem, relying solely on open-source components to enable user customization. Our pipeline comprises multiple stages: (i) a translation module for multilingual support, (ii) entity linking, (iii) knowledge retrieval through verbalized triples or SPARQL query generation, and (iv) answer generation, which incorporates ontology (properties and classes) retrieval. We evaluate our system on Wikidata, DBpedia, and a domain-specific KG. With the optimal configuration determined through an ablation study, the system achieves Jaccard similarity scores of 0.458, 0.517, and 0.976 for each respective KG. The ablation study further reveals that ontology retrieval is the most crucial component in providing context to the LLM in generating SPARQL queries.
| Originalsprache | Englisch |
|---|---|
| Titel des Sammelwerks | 4th International Workshop on LLM-Integrated Knowledge Graph Generation from Text (Text2KG) |
| Untertitel des Sammelwerks | co-located with the Extended Semantic Web Conference (ESWC 2025) |
| Herausgeber*innen | Sanju Tiwari, Nandana Mihindukulasooriya, Jennifer D'Souza, Francesco Osborne |
| Verlag | CEUR Workshop Proceedings |
| Seiten | 116-134 |
| Seitenumfang | 19 |
| Band | 4020 |
| Publikationsstatus | Veröffentlicht - 2025 |
| Veranstaltung | Joint of the 4th International Workshop on LLM-Integrated Knowledge Graph Generation from Text and the 2nd International BiKE Challenge, TEXT2KG 2025 and BIKE 2025 - Portoroz, Slowenien Dauer: 1 Juni 2025 → 5 Juni 2025 |
Publikationsreihe
| Reihe | CEUR Workshop Proceedings |
|---|---|
| ISSN | 1613-0073 |
Konferenz
| Konferenz | Joint of the 4th International Workshop on LLM-Integrated Knowledge Graph Generation from Text and the 2nd International BiKE Challenge, TEXT2KG 2025 and BIKE 2025 |
|---|---|
| Land/Gebiet | Slowenien |
| Ort | Portoroz |
| Zeitraum | 1/06/25 → 5/06/25 |
Bibliographische Notiz
Publisher Copyright:© 2025 Copyright for this paper by its authors.