Open City Data Pipeline: Collecting, Integrating, and Predicting Open City Data

  • Stefan Bischof
  • , Christoph Martin
  • , Axel Polleres
  • , Patrik Schneider

Publikation: Beitrag in Buch/KonferenzbandBeitrag in Konferenzband

Abstract

Having access to high quality and recent data is crucial both for decision makers in cities as well as for informing the public, likewise, infrastructure
providers could offer more tailored solutions to cities based on such data. However, even though there are many data sets containing relevant indicators about
cities available as open data, it is cumbersome to integrate and analyze them,
since the collection is still a manual process and the sources are not connected
to each other upfront. Further, disjoint indicators and cities across the available
data sources lead to a large proportion of missing values when integrating these
sources. In this paper we present a platform for collecting, integrating, and enriching open data about cities in a re-usable and comparable manner: we have integrated various open data sources and present approaches for predicting missing
values, where we use standard regression methods in combination with principal
component analysis to improve quality and amount of predicted values. Further,
we re-publish the integrated and predicted values as linked open data.
OriginalspracheEnglisch
Titel des SammelwerksProceedings of the 4th Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data co-located with 12th Extended Semantic Web Conference (ESWC'15)
Herausgeber*innen Johanna Völker, Heiko Paulheim, Jens Lehmann and Vojtech Svatek
ErscheinungsortPortoroz, Slovenia
VerlagCEUR Workshop Proceedings
PublikationsstatusVeröffentlicht - 1 Juni 2015

Zitat