Multivariate Weibull mixtures with proportional hazard restrictions for dwell time based session clustering with incomplete data

Patrick Mair, Marcus Hudec

Publication: Scientific journal › Original article in journal › Peer-reviewed

Abstract

Emanating from classical Weibull mixture models, we propose a framework for clustering survival data with various more parsimonious models by imposing restrictions on the distributional parameters. We show that these restrictions on the Weibull mixtures correspond to different proportional hazard restrictions across mixture components and Web page areas. A parametric cluster approach based on the EM algorithm is carried out on a multivariate data set. Our model set-up encompasses incomplete-data structures as well as censored observations. We apply the methodology to retail data stemming from a global e-commerce company. Sessions are clustered with respect to the dwell times that a user spends on certain page areas. The cluster solution that is found allows for a detailed examination of the navigation behaviour in terms of the hazard and survivor functions within each component.
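
Illustrative note (not part of the published abstract): the link between parameter restrictions and proportional hazards can be sketched with the standard two-parameter Weibull distribution. The shape-scale parameterization and the symbols \gamma, \lambda, k, j below are assumptions made for this sketch and may differ from the notation used in the paper.

\[
S(t) = \exp\!\left[-\left(\frac{t}{\lambda}\right)^{\gamma}\right],
\qquad
h(t) = \frac{\gamma}{\lambda}\left(\frac{t}{\lambda}\right)^{\gamma-1},
\qquad t > 0,\ \gamma > 0,\ \lambda > 0 .
\]

If two mixture components k and j are restricted to share the same shape \gamma and differ only in scale, their hazard ratio no longer depends on t:

\[
\frac{h_k(t)}{h_j(t)}
= \frac{\gamma\,\lambda_k^{-\gamma}\,t^{\gamma-1}}{\gamma\,\lambda_j^{-\gamma}\,t^{\gamma-1}}
= \left(\frac{\lambda_j}{\lambda_k}\right)^{\gamma} .
\]

Under this assumed restriction the hazards are proportional. With unrestricted shapes the ratio varies with t, so proportionality does not hold in general.
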
DOI: 10.1111/j.1467-9876.2009.00665.x
Original language: English
Pages (from - to): 619 - 639
Journal: Journal of the Royal Statistical Society: Series C (Applied Statistics)
Volume: 58
Issue number: 5
Publication status: Published - 1 Oct 2009
