Abstract
Ontologies often contain multilingual textual information in
annotation properties, such as rdfs:label and rdfs:comment. While
the motivation for using such annotation properties is to provide a
human-readable description of the abstract conceptualization of the domain,
we notice that the importance of appropriate natural language use and
representation is often neglected. The same can be observed with resources
on the Web, such as multilingual taxonomies. Terms often lack
consistency and completeness, which also hampers accurate automated
natural language processing of such text. We propose a pattern-based
transformation of terms in labels, thereby also supporting a multilingual
alignment of (sub)components of labels. The source data for our
approach is an ontology that we derived from an industry classification taxonomy,
which we improve with regard to consistency and completeness and
then apply to the process of lexicalization.
Original language | English |
---|---|
Title of host publication | Proceedings of the 3rd Workshop on the Multilingual Semantic Web (MSW3) |
Editors | Paul Buitelaar; Philipp Cimiano; David Lewis; James Pustejovsky; Felix Sasaki |
Place of publication | Boston |
Publisher | CEUR Workshop Proceedings |
Pages | 16-27 |
Publication status | Published - 1 Nov 2012 |