Open Access Te Herenga Waka-Victoria University of Wellington
Browse

Model-Based Clustering using Adjacent-Categories Logit Models via Finite Mixtures

Download (9.89 MB)
thesis
posted on 2023-11-29, 04:05 authored by Lingyu Li

Many researchers treat ordinal variables as continuous or nominal. Losing ordering information makes statistical analysis and inference weaker, which is not ideal when we try to get as much information as possible from our data. Clustering for ordinal data faces many challenges as well. Traditional clustering methods such as hierarchical clustering (Murtagh, 1983; Murtagh and Contreras, 2012), centroid-based clustering, and association analysis (Strehl et al., 1999) are not likelihood-based, and statistical inference is not available. In this dissertation, we apply clustering via finite mixtures to the adjacent-categories logit model (Agresti, 1999) for ordinal data, an extension of the likelihood-based clustering methods in Pledger and Arnold (2014); Fernandez et al. (2016).

Our data form a matrix where the rows are subjects, and the columns are a set of ordinal responses by those subjects to, say, the questions in a questionnaire. By implementing model-based clustering via a finite mixture model, the subjects (the rows of the matrix) and/or the questions (the columns of the matrix) are grouped into a finite number of clusters. The Expectation-Maximisation (EM) algorithm is utilised to estimate the model parameters. Several different stopping criteria for the EM algorithm are proposed and studied for our model. Additionally, asymptotic standard errors are provided by the standard information-based method. For model selection, six commonly used information criteria are evaluated through an empirical study. After information criteria are used to select the best model, the randomised quantile residual is proposed for model diagnostic and model checking. About the applications, we illustrate the models through two real-life data examples.

History

Copyright Date

2023-11-29

Date of Award

2023-11-29

Publisher

Te Herenga Waka—Victoria University of Wellington

Rights License

Author Retains Copyright

Degree Discipline

Applied Statistics

Degree Grantor

Te Herenga Waka—Victoria University of Wellington

Degree Level

Doctoral

Degree Name

Doctor of Philosophy

ANZSRC Socio-Economic Outcome code

280118 Expanding knowledge in the mathematical sciences

ANZSRC Type Of Activity code

3 Applied research

Victoria University of Wellington Item Type

Awarded Doctoral Thesis

Language

en_NZ

Victoria University of Wellington School

School of Mathematics and Statistics

Advisors

Liu, Ivy; Arnold, Richard