Events and Meetings of Italian Statistical Society, Statistics and Demography: the Legacy of Corrado Gini

Recipes for sparse LDA of horizontal data
Nickolay Trendafilov

Last modified: 2015-09-05

Abstract


Many important modern applications require analyzing data with more variables than observations, called horizontal data for short. In this situation, the classical Fisher's linear discriminant analysis (LDA) has no solution because the within-group scatter matrix is singular. Moreover, the number of variables is usually huge, and the classical solutions (discriminant functions) are difficult to interpret because they involve all available variables. The current aim is to develop fast and reliable algorithms for sparse LDA of horizontal data. The resulting discriminant functions depend on very few of the original variables, which facilitates their interpretation. The main theoretical and numerical challenge is how to cope with the singularity of the within-group scatter matrix. This work aims to classify the existing approaches according to the way they tackle this singularity issue, and to suggest new ones.
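The singularity mentioned above can be checked numerically. The following is a minimal sketch, not taken from the paper: it assumes simulated Gaussian data, and the helper name `within_group_scatter`, the sizes `n`, `p`, `n_groups`, and the equal group sizes are all illustrative choices. It builds the pooled within-group scatter matrix for data with more variables than observations and confirms that its rank cannot exceed the number of observations minus the number of groups, so the matrix cannot be inverted as classical LDA requires.

```python
import numpy as np


def within_group_scatter(X, labels):
    """Pooled within-group scatter: sum over groups of centered cross-products."""
    p = X.shape[1]
    W = np.zeros((p, p))
    for g in np.unique(labels):
        Xg = X[labels == g]
        centered = Xg - Xg.mean(axis=0)  # subtract the group mean
        W += centered.T @ centered
    return W


# Hypothetical "horizontal" data set: 12 observations, 50 variables, 3 groups.
rng = np.random.default_rng(0)
n, p, n_groups = 12, 50, 3
X = rng.standard_normal((n, p))
labels = np.repeat(np.arange(n_groups), n // n_groups)

W = within_group_scatter(X, labels)
rank_W = np.linalg.matrix_rank(W)

# W is p x p, but its rank is bounded by n - n_groups, which is below p here.
print("shape of W:", W.shape)
print("rank of W:", rank_W, "upper bound:", n - n_groups)
```

Because each group contributes at most (group size minus one) linearly independent centered rows, the bound holds for any data of these dimensions, not only for this simulated example.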
