Logistic Discriminant Analysis
Feature extraction is one of the most important problems in pattern recognition. Linear discriminant analysis (LDA) is one of the best-known methods for extracting features for multi-class discrimination. LDA is formulated as the problem of finding an optimal linear mapping under which the within-class scatter in the mapped feature space is made as small as possible relative to the between-class scatter. LDA is useful for linearly separable cases, but for more complicated cases it is necessary to extend it to a non-linear method.
----------------------------------------------------------------------------------------------------------------------------
Prerequisites:
· Bayesian Decision Theory
· Multi-Variate Linear Algebra
· Numerical Computations
-----------------------------------------------------------------------------------------------------------------------------
(Methods Used Before Logistic Discriminant Analysis)
Linear Discriminant Analysis
Linear discriminant analysis is a well-known linear method for extracting features for multi-class discrimination/classification.
The main objective of linear discriminant analysis is to maximize the discriminant criterion, which evaluates the performance of the new feature vector y generated by a dimension-reducing linear mapping of the original input feature vector x:

y = A^T x

Discriminant criterion:

J = tr(Σ_T⁻¹ Σ_B)

where Σ_T and Σ_B are the total covariance matrix and the between-class covariance matrix of y. We maximize the discriminant criterion and obtain the optimal coefficient matrix A by solving the eigen equation

Σ_B a_j = λ_j Σ_T a_j

The j-th column of A is the eigenvector corresponding to the j-th largest eigenvalue λ_j. Therefore, the importance of each element of the new feature vector y is evaluated by the corresponding eigenvalue. The dimension of the new feature vector y is bounded by min(K − 1, N), where K is the number of classes and N is the dimension of x.
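To make the procedure concrete, here is a minimal sketch of this eigen-decomposition approach in Python (assuming NumPy/SciPy; lda_fit and its interface are illustrative names, not from the original article):

import numpy as np
from scipy.linalg import eigh

def lda_fit(X, labels, dim):
    # Total covariance Sigma_T of the input features
    mean = X.mean(axis=0)
    Xc = X - mean
    S_T = Xc.T @ Xc / len(X)
    # Between-class covariance Sigma_B (class means around the total mean)
    S_B = np.zeros_like(S_T)
    for c in np.unique(labels):
        Xk = X[labels == c]
        d = (Xk.mean(axis=0) - mean)[:, None]
        S_B += (len(Xk) / len(X)) * (d @ d.T)
    # Generalized eigen equation Sigma_B a = lambda Sigma_T a
    # (small ridge keeps Sigma_T invertible); eigh returns ascending eigenvalues
    evals, evecs = eigh(S_B, S_T + 1e-8 * np.eye(S_T.shape[0]))
    A = evecs[:, np.argsort(evals)[::-1][:dim]]  # columns: top eigenvectors
    return A  # new features: Y = X @ A

The columns of A then play the role of the coefficient matrix above, and dim should not exceed min(K − 1, N).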
-----------------------------------------------------------------------------------------------------------------------------
(Foundation for Logistic Discriminant Analysis)
Optimal Non-Linear Discriminant Analysis
In ONDA, a dimension-reducing non-linear mapping is constructed so as to maximize the same discriminant criterion J = tr(Σ_T⁻¹ Σ_B). The optimal non-linear discriminant mapping can be written directly in terms of the Bayesian posterior probabilities:

y = Σ_{k=1}^{K} P(C_k|x) u_k

where the vectors u_k (k = 1, …, K) are class representative vectors determined by an eigen equation defined from the posterior probabilities. It is important to remember that the optimal non-linear mapping is therefore closely related to Bayesian decision theory, namely to the posterior probabilities P(C_k|x).
Thus, we can construct optimal non-linear discriminant features by ONDA from given input features if we know, or can correctly estimate, all the Bayesian posterior probabilities.
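Since the eigen equation depends on the (usually unknown) true posteriors, the mapping itself is easiest to illustrate assuming the posteriors and representative vectors are already in hand (a sketch on NumPy arrays; onda_map and its arguments are illustrative names):

def onda_map(posteriors, U):
    # posteriors: (n_samples, K) matrix whose i-th row is (P(C_1|x_i), ..., P(C_K|x_i))
    # U: (K, dim) matrix whose k-th row is the class representative vector u_k
    # Returns y_i = sum_k P(C_k|x_i) * u_k for every sample
    return posteriors @ U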
-----------------------------------------------------------------------------------------------------------------------------
Multinomial Logistic Regression
Logistic regression (LR) is one of the simplest models for binary classification and can directly estimate the posterior probabilities. Multinomial logistic regression (MLR) is a natural extension of LR to multi-class classification problems. By passing the outputs of the linear predictor through the soft-max link function, MLR naturally estimates the posterior probabilities.
For the K-class classification problem, the outputs of MLR estimate the posterior probabilities P(C_k|x_i). They are defined as follows:

y_ki = P(C_k|x_i) = exp(w_k^T x_i) / Σ_{j=1}^{K} exp(w_j^T x_i)

The optimal parameters of MLR are obtained by minimizing the negative log-likelihood

E(W) = − Σ_i Σ_k t_ki ln y_ki

where t_ki is the one-of-K coded target of sample i. This is a convex optimization problem, so it has only a single, global minimum.
The optimal parameter matrix W can be found efficiently with the Newton-Raphson method, i.e. an iteratively re-weighted least squares (IRLS) procedure. In each iteration step, W is updated as

W_new = W_old − H⁻¹ ∇E(W_old)

where H is the block Hessian matrix of E. The parameter W is updated in this way until it converges.
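A compact Newton-Raphson/IRLS sketch of this fitting procedure in Python (assuming NumPy and the standard block Hessian H_kj = Σ_i y_ki(δ_kj − y_ji) x_i x_i^T; mlr_fit is an illustrative name):

import numpy as np

def softmax(Z):
    Z = Z - Z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    E = np.exp(Z)
    return E / E.sum(axis=1, keepdims=True)

def mlr_fit(X, T, n_iter=20):
    # X: (M, D) inputs; T: (M, K) one-of-K targets
    M, D = X.shape
    K = T.shape[1]
    W = np.zeros((K, D))
    for _ in range(n_iter):
        Y = softmax(X @ W.T)                    # y_ki = P(C_k|x_i)
        grad = ((Y - T).T @ X).reshape(-1)      # stacked gradient of E(W)
        H = np.zeros((K * D, K * D))
        for k in range(K):
            for j in range(K):
                R = Y[:, k] * ((k == j) - Y[:, j])      # per-sample weights
                H[k*D:(k+1)*D, j*D:(j+1)*D] = (X * R[:, None]).T @ X
        # Newton step: W_new = W_old - H^{-1} grad (tiny ridge for invertibility)
        W -= np.linalg.solve(H + 1e-6 * np.eye(K * D), grad).reshape(K, D)
    return W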
Regularization of MLR
In general, a regularization term is introduced to control over-fitting. Regularization methods for MLR have been proposed, such as the shrinkage method (regularized MLR) and locality-preserving multinomial logistic regression (LPMLR).
In the shrinkage method, unnecessary growth of the parameters is penalized by introducing the regularization term E_W, defined as follows:

E_W = (1/2) Σ_{k=1}^{K} ||w_k||²

The optimal parameters of the regularized MLR are determined by minimizing the penalized negative log-likelihood E(W) + λ_W E_W, which is again a convex optimization problem; λ_W is the pre-specified regularization parameter for E_W.
The Newton-Raphson update rule for the regularized MLR is the same as the one given above for plain MLR. However, the elements of the block Hessian matrix H differ from those of MLR [10]: with the squared-norm penalty above, each diagonal block of H gains an additional λ_W I term, and the gradient gains the corresponding λ_W w_k term.
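Relative to the plain MLR sketch above, the change in code is small (again a sketch, assuming the squared-norm form of E_W and reusing softmax from the earlier snippet):

def regularized_mlr_fit(X, T, lam=1e-2, n_iter=20):
    # Minimizes E(W) + lam * E_W with E_W = (1/2) * sum_k ||w_k||^2
    M, D = X.shape
    K = T.shape[1]
    W = np.zeros((K, D))
    for _ in range(n_iter):
        Y = softmax(X @ W.T)
        grad = ((Y - T).T @ X + lam * W).reshape(-1)   # extra lam * w_k term
        H = np.zeros((K * D, K * D))
        for k in range(K):
            for j in range(K):
                R = Y[:, k] * ((k == j) - Y[:, j])
                H[k*D:(k+1)*D, j*D:(j+1)*D] = (X * R[:, None]).T @ X
            H[k*D:(k+1)*D, k*D:(k+1)*D] += lam * np.eye(D)  # extra lam * I block
        W -= np.linalg.solve(H, grad).reshape(K, D)
    return W

The λ_W I blocks also make H non-singular, so no extra ridge is needed here.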
Logistic Discriminant Analysis
The outputs of the ordinary MLR or the regularized MLR can be interpreted as estimates of the Bayesian posterior probabilities (P(C_1|x), …, P(C_K|x))^T.
Logistic discriminant analysis (LgDA) is obtained by substituting the outputs of the (regularized) MLR for the Bayesian posterior probabilities in ONDA.
The discriminant space constructed by LgDA is expected to be better than the one constructed by LDA, because the MLR outputs are more natural estimates of the posterior probabilities than the linear approximation of them that LDA implicitly uses.
The representative vectors ũ_k of each class are determined by the same eigen equation as in ONDA, with the estimated posteriors substituted for the true ones.
The resulting non-linear discriminant mapping is a good approximation, in terms of the discriminant criterion, of the ultimate non-linear discriminant mapping given by ONDA.
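An end-to-end sketch of the LgDA pipeline, with two assumptions of mine: scikit-learn's LogisticRegression stands in for the regularized MLR (its C is roughly 1/λ_W), and the representative vectors ũ_k are obtained by solving the LDA-style generalized eigen equation on the estimated posterior vectors, which plays the role of the ONDA eigen equation here:

import numpy as np
from scipy.linalg import eigh
from sklearn.linear_model import LogisticRegression

def lgda_fit(X, labels, dim):
    # Step 1: estimate posteriors P-hat(C_k|x) with regularized MLR
    mlr = LogisticRegression(C=1.0, max_iter=1000).fit(X, labels)
    P = mlr.predict_proba(X)                      # (M, K) estimated posteriors
    # Step 2: discriminant eigenproblem on the posterior vectors
    mean = P.mean(axis=0)
    Pc = P - mean
    S_T = Pc.T @ Pc / len(P)
    S_B = np.zeros_like(S_T)
    for c in np.unique(labels):
        d = (P[labels == c].mean(axis=0) - mean)[:, None]
        S_B += np.mean(labels == c) * (d @ d.T)
    evals, evecs = eigh(S_B, S_T + 1e-8 * np.eye(S_T.shape[0]))
    U = evecs[:, np.argsort(evals)[::-1][:dim]]   # (K, dim) representative directions
    return mlr, U

def lgda_transform(mlr, U, X):
    # Discriminant features: y = sum_k P-hat(C_k|x) u_k
    return mlr.predict_proba(X) @ U

For example, lgda_fit(X_train, y_train, dim=2) followed by lgda_transform(mlr, U, X_test) yields a two-dimensional non-linear discriminant space for held-out samples.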
-------------------------------------------------------------------------------------------------------------------
Examples of Discrimination