Fitting a distance model to homogeneous subsets of variables: Points of view analysis of categorical data |
| |
Authors: | Jacqueline J. Meulman |
| |
Affiliation: | (1) University of Illinois at Urbana-Champaign, Urbana, USA;(2) Present address: Department of Data Theory, Faculty of Social Sciences, Leiden University, P.O. Box 9555, 2300 RB Leiden, The Netherlands |
| |
Abstract: | An approach is presented for analyzing a heterogeneous set of categorical variables assumed to form a limited number of homogeneous subsets. The variables generate a particular set of proximities between the objects in the data matrix, and the objective of the analysis is to represent the objects in lowdimensional Euclidean spaces, where the distances approximate these proximities. A least squares loss function is minimized that involves three major components: a) the partitioning of the heterogeneous variables into homogeneous subsets; b) the optimal quantification of the categories of the variables, and c) the representation of the objects through multiple multidimensional scaling tasks performed simultaneously. An important aspect from an algorithmic point of view is in the use of majorization. The use of the procedure is demonstrated by a typical example of possible application, i.e., the analysis of categorical data obtained in a free-sort task. The results of points of view analysis are contrasted with a standard homogeneity analysis, and the stability is studied through a Jackknife analysis. |
| |
Keywords: | Categorical data Multidimensional scaling Homogeneity analysis Optimal scaling Majorization Clustering of variables |
本文献已被 SpringerLink 等数据库收录! |
|