Text Mining Infrastructure in R

Ingo Feinerer, Kurt Hornik, David Meyer

Publikation: Wissenschaftliche FachzeitschriftOriginalbeitrag in FachzeitschriftBegutachtung

691 Downloads (Pure)

Abstract

During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R. We give a survey on text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis methods, text clustering, text classification and string kernels.
OriginalspracheEnglisch
Seiten (von - bis)1 - 54
FachzeitschriftJournal of Statistical Software
Jahrgang25
Ausgabenummer5
DOIs
PublikationsstatusVeröffentlicht - 2008

Zitat