Characteristics of Open Data CSV Files

Publication: Chapter in book/Conference proceedingContribution to conference proceedings


The following work presents a data corpus consisting of tabular Open Data sources and studies the characteristics and properties of the files from a consumers point of view. Earlier reports showed that the CSV (comma-separated values) format is the predominant format in the Open Data landscape [8]. The main reason is the simplicity and independence of this format: it stores tabular data in plain text where each line of the file is a data record. Each record consists of one or more fields which are separated by a delimiter, typically a comma.
Original languageEnglish
Title of host publication2nd International Conference on Open and Big Data (OBD)
Editors IEEE
Place of PublicationVienna, Austria
Pages72 - 79
Publication statusPublished - 2016

Cite this