Two-dimensional approximately harmonic projection for gait recognition

Wang, Ziqiang; Sun, Xia; Sun, Lijun

Journal of Vibroengineering

Browse Journal

Submit article

Published: 30 June 2013

Check for updates

Two-dimensional approximately harmonic projection for gait recognition

Ziqiang Wang¹

Xia Sun²

Lijun Sun³

^{1, 2, 3}School of Information Science and Engineering, Henan University of Technology, Zhengzhou 450001, China

Cite the article Download PDF

Downloads 1354

Abstract

This paper presents a two-dimensional approximately harmonic projection (2DAHP) algorithm for gait recognition. 2DAHP is originated from the approximately harmonic projection (AHP), while 2DAHP offers some advantages over AHP. 1) 2DAHP can preserve the local geometrical structure and cluster structure of image data as AHP. 2) 2DAHP encodes images as matrices or second-order tensors rather than one-dimensional vectors, so 2DAHP can keep the correlation among different coordinates of image data. 3) 2DAHP avoids the singularity problem suffered by AHP. 4) 2DAHP runs faster than AHP. Extensive experiments on gait recognition show the effectiveness and efficiency of the proposed method.

1. Introduction

Recently, the average silhouettes-based human gait recognition has received extensive attention due to its potential applications in many fields [1-4], such as identity authentication and video surveillance. In general, a binary silhouette image of size 128×88 in the USF HumanID gait database is represented as a vector in the image space $R^{128×88}$ . Consequently, a major challenge of gait recognition is that the captured gait image often lies in a high-dimensional image space. Due to the consideration of the curse of dimensionality, a common way to resolve this problem is to use dimensionality reduction techniques. Once we obtain lower-dimensional representations of the original gait images, the traditional classification methods can be applied in the reduced feature space. Therefore, the main objective of this paper is to find techniques that can introduce lower-dimensional feature representations of gait images with enhanced discriminatory power.

The most representative algorithms for dimensionality reduction are principal component analysis (PCA) and linear discriminant analysis (LDA) [5]. Although PCA and LDA have been successfully applied to face recognition, image retrieval, and gait recognition, they are designed for discovering only the global Euclidean structure, whereas the local manifold structure is ignored. In fact, the global statistics such as variance is often difficult to compute when there are no sufficient samples. In addition, a number of research efforts have shown that the images possibly reside on a nonlinear submanifold and the representation of image is fundamentally related to the problem of manifold learning [6-9]. Given a set of high-dimensional data points, manifold learning techniques aim to discover the geometric properties of the data space. In the past years, a number of manifold learning algorithms have been developed, representative algorithms include locally linear embedding (LLE) [10], ISOMAP [11], and Laplacian eigenmaps (LE) [12]. LLE is designed to maintain the local linear reconstruction relationship among neighboring points in the lower-dimensional space. ISOMAP aims to preserve global geodesic distances of all pairs of samples. LE aims to preserve proximity relationships by manipulations on an undirected weighted graph, which indicates neighbor relations of pairwise samples. These nonlinear methods do yield impressive results on some artificial benchmarks and several real applications. However, they suffer from the out of sample problem, i.e., they can only obtain mappings that are defined on the training data points and how to explicitly calculate the mappings on novel testing data points remains unclear. Therefore, these nonlinear manifold learning algorithms might not be suitable for gait recognition. To cope with the out of sample problem, locality preserving projection (LPP) [13] applies a linearization procedure to construct explicit mappings over new samples. In the recent research, Lin et al. [14] point that utilizing the affine hulls of the manifold and the connected components is more effective for preserving the local geometrical structure and cluster structure of original data, and propose a new algorithm termed approximately harmonic projection (AHP) for dimensionality reduction. AHP is a linear manifold learning method based on the harmonic framework, and the optimal transformation can be obtained by approximating the Dirichlet integral. It is worth noting that all these methods unfold input data into vectors before dimensionality reduction. But images are naturally in the form of second-order or higher-order tensors [15-17]. For example, gray-level images represent second-order (matrix), and Gabor-filtered image represents third-order tensors. Consequently, such kind of vectorization largely increases the computational costs and seriously destroys the intrinsic tensor structure of images. To cope with these issues, multilinear extensions of PCA, LDA, and LPP, namely 2DPCA [18], 2DLDA [19], and 2DLPP [20] are proposed, respectively. These methods aim to conduct subspace analysis by directly encoding images as two-dimensional image matrices rather than one-dimensional vectors. The advantages of using image-as-matrix representation have been indeed consistently pointed out in a number of recent research efforts [15-20], especially when the number of training samples is small. Nevertheless, the multilinear (tensor) extension of AHP and its application to gait recognition are still a research area where few people have tried to explore.

This paper represents a gray-level average silhouette image of size $n_{1} \times n_{2}$ as the matrix (or second-order tensor) in the tensor space $R^{n_{1}} \times R^{n_{2}}$ . Then a two-dimensional approximately harmonic projection (2DAHP) is proposed by tensorizing AHP. Compared with the original AHP, 2DAHP can directly process gait images in their original matrix form and utilize correlations among pixels within different dimensions (i.e., rows and columns). Moreover, the smaller number of data entries along each data dimension facilitates subspace learning with limited training data. 2DAHP is much more computational efficient since the decomposed matrices are of size $n_{1} \times n_{1}$ or $n_{2} \times n_{2}$ , which is much smaller than that of size $n \times n (n = n_{1} \times n_{2})$ in AHP. 2DAHP can avoid the singularity problem. In addition, the trace ratio optimization technique is also applied to efficiently solve 2DAHP.

The remainder of this paper is organized as follows. Section 2 briefly reviews AHP. Section 3 introduces our proposed 2DAHP algorithm. Experimental results on gait recognition are presented in Section 4. The concluding remarks are provided in Section 5.

2. Brief review of approximately harmonic projection (AHP)

AHP is a recently proposed linear manifold learning method for dimensionality reduction [14]. It is based on the approximate affine hull and explicitly utilizes the edge length to reflect the geometrical structure of the manifold structure of the data space.

Given a set of data points ${x_{1}, \dots, x_{n}} \subset R^{m}$ , let $X = [x_{1}, \dots, x_{n}]$ . Let $W^{c}$ and $W^{b}$ be two weight matrices defined on the data points. The optimal projection of AHP can be obtained by solving the following minimization problem:

1

a_{o p t} = a r g \underset{a}{m i n} \frac{1}{2} \sum_{i \sim j} \int_{e_{i j}} {‖ \nabla f_{e_{i j}} ‖}^{2} d t = a r g \underset{a}{m i n} \frac{1}{2} \sum_{i \sim j} \int_{0}^{d_{i j}} {(\frac{a^{T} x_{i} - a^{T} x_{j}}{d_{i j}})}^{2} d t = a r g \underset{a}{m i n} a^{T} X (D^{c} - W^{c}) X^{T} a,

with the constraint:

2

3 \sum_{i \sim j} \int_{0}^{d_{i j}} {(a^{T} x_{i} + \frac{t}{d_{i j}} (a^{T} x_{i} - a^{T} x_{j}))}^{2} d t = a^{T} X (2 D^{b} + W^{b}) X^{T} a = 1,

where $e_{i j} = x_{j} - x_{i}$ represents an edge vector that has an orientation from $x_{i}$ to $x_{j},$ $d_{i j} = ‖ x_{j} - x_{i} ‖$ denotes the length of the edge between $x_{i}$ and $x_{j}$ , $t$ is the arc length of $e_{i j}$ . $W^{c}$ and $W^{b}$ are two matrices defined as follows: if $x_{i}$ and $x_{j}$ are connected, then $W_{i j}^{c} = 1 / d_{i j}$ and $W_{i j}^{b} = d_{i j};$ otherwise, $W_{i j}^{c} = W_{i j}^{b} = 0 .$ $D^{c}$ and $D^{b}$ are two diagonal matrices defined as $D_{i i}^{c} = \sum_{j} W_{i j}^{c}$ , $D_{i i}^{b} = \sum_{j} W_{i j}^{b}$ . $\nabla f_{e_{i j}}$ denotes the gradient on each edge, its definition is as follows:

3

\nabla f_{e_{i j}} = \frac{a^{T} x_{j} - a^{T} x_{i}}{d_{i j}} .

Unlike the standard spectral graph methods which mainly consider the connectivity of graph, AHP explicitly makes use of the edge length and edge orientation which reflect the geometrical structure of the manifold. Therefore, AHP can precisely model multiple connected components of the data manifold, which is especially important for discriminating data with different submanifold (cluster) structure.

The objective function in AHP aims to use the approximate affine hull of the graph to separate data points sampled from different components. Therefore, minimizing it is to ensure that if $x_{i}$ and $x_{j}$ lie in the multiple connected components, then $y_{i} (= a^{T} x_{i})$ and $y_{j} (= a^{T} x_{j})$ are made close by the optimal projection. Finally, the projection vector $a$ that minimizes (1) is given by the minimum eigenvalue solution to the generalized eigenvalue problem:

4

X (D^{c} - W^{c}) X^{T} a = λ X (2 D^{b} + W^{b}) X^{T} a .

Note that, in the appearance-based image analysis, one is often confronted with the fact the dimension of image vector is much smaller than the number of images. Thus, the matrix $X (2 D^{b} + W^{b}) X^{T}$ is singular. To avoid the singularity problem, one may first apply PCA to remove the components corresponding to zero eigenvalues. Thus, the projection vector of AHP can be considered as the eigenvectors of the matrix ${(X (2 D^{b} + W^{b}) X^{T})}^{- 1} X (D^{c} - W^{c}) X^{T}$ associated with the smallest eigenvalues. In addition, since ${(X (2 D^{b} + W^{b}) X^{T})}^{- 1} X (D^{c} - W^{c}) X^{T}$ is not usually symmetric, the AHP projection axes are not orthogonal.

Let the column vector of $a_{1}, a_{2}, \dots, a_{d}$ be the solution of (4) ordered according to their eigenvalues $λ_{1} < λ_{2} < \dots < λ_{d}$ . Thus, the embedding is given by $x_{i} \to y_{i} = A^{T} x_{i}$ , where $y_{i}$ is a $d$ -dimensional vector and $A = (a_{1}, a_{2}, \dots, a_{d})$ is an $n \times d$ matrix.

3. Two-dimensional approximately harmonic projection (2DAHP)

Given a set of data points ${X_{i}}_{i = 1}^{n}$ in the second-order tensor (or matrix) space $R^{n_{1}} \otimes R^{n_{2}}$ , let ${u_{i}}_{i = 1}^{n_{1}}$ be an orthonormal basis of $R^{n_{1}}$ and ${v_{j}}_{j = 1}^{n_{2}}$ be an orthonormal basis of $R^{n_{2}}$ , it has been shown that ${u_{i} \otimes v_{j}}$ forms a basis of the tensor space $R^{n_{1}} \otimes R^{n_{2}}$ [20]. Thus, a second-order tensor $X$ can be uniquely defined as $X = \sum_{i, j} (u_{i}^{T} X v_{j}) u_{i} v_{j}^{T}$ .

Given a set of data points ${\{X_{i}\}}_{i = 1}^{n}$ in $R^{n_{1}} \otimes R^{n_{2}}$ , two-dimensional approximately harmonic projection (2DAHP) aims to find two projection matrices $U \in R^{n_{1} \times l_{1}}$ and $V \in R^{n_{2} \times l_{2}}$ that maps each data point $X_{i} (i = 1, \dots, n)$ to a lower-dimensional matrix representation $Y_{i} \in R^{l_{1}} \times R^{l_{2}} (i = 1, \dots n, l_{1} < n_{1}, l_{2} < n_{2})$ by $Y_{i} = U^{T} X_{i} V$ such that $Y_{i}$ represents $X_{i}$ .

Let $U$ and $V$ be the projection matrices, according to (1) and (2), the optimal objective function of 2DAHP with the matrix representation can be rewritten as follows:

5

(U^{*}, V^{*}) = a r g \underset{U, V}{m i n} \frac{1}{2} \sum_{i \sim j} \frac{1}{d_{i j}} {(U^{T} X_{i} V - U^{T} X_{j} V)}^{2},

with the constraint:

6

3 \sum_{i \sim j} \int_{0}^{d_{i j}} {(U^{T} X_{i} V + \frac{t}{d_{i j}} (U^{T} X_{i} V - U^{T} X_{j} V))}^{2} d t = 1,

where $d_{i j}$ is similarly defined as AHP.

Let $Y_{i} = U^{T} X_{i} V$ and $D^{c}$ be a diagonal matrix, $D_{i i}^{c} = \sum_{j} W_{i j}^{c}$ . Since ${‖ A ‖}^{2} = Tr (A A^{T})$ , we have:

7

\frac{1}{2} \sum_{i \sim j} \frac{1}{d_{i j}} {(U^{T} X_{i} V - U^{T} X_{j} V)}^{2} = \frac{1}{2} \sum_{i \sim j} Tr ((Y_{i} - Y_{j}) {(Y_{i} - Y_{j})}^{T}) W_{i j}^{c} = \frac{1}{2} \sum_{i \sim j} Tr (Y_{i} Y_{i}^{T} + Y_{j} Y_{j}^{T} - Y_{i} Y_{j}^{T} - Y_{j} Y_{i}^{T}) W_{i j}^{c} = Tr (\sum_{i} D_{i i}^{c} Y_{i} Y_{i}^{T} - \sum_{i, j} W_{i j}^{c} Y_{i} Y_{j}^{T}) = Tr (U^{T} (\sum_{i} D_{i i}^{c} X_{i} V V^{T} X_{i}^{T} - \sum_{i, j} W_{i j}^{c} X_{i} V V^{T} X_{j}^{T}) U) = Tr (U^{T} (P_{V}^{c} - Q_{V}^{c}) U),

where $P_{V}^{c} = \sum_{i} D_{i i}^{c} X_{i} V V^{T} X_{i}^{T}$ and $Q_{V}^{c} = \sum_{i, j} W_{i j}^{c} X_{i} V V^{T} X_{j}^{T}$ . Similarly, ${‖ A ‖}^{2} = Tr (A^{T} A)$ , so we can also obtain:

8

\frac{1}{2} \sum_{i \sim j} \frac{1}{d_{i j}} {(U^{T} X_{i} V - U^{T} X_{j} V)}^{2} = \frac{1}{2} \sum_{i \sim j} Tr ({(Y_{i} - Y_{j})}^{T} (Y_{i} - Y_{j})) W_{i j}^{c} = \frac{1}{2} \sum_{i \sim j} Tr (Y_{i}^{T} Y_{i} + Y_{j}^{T} Y_{j} - Y_{i}^{T} Y_{j} - Y_{j}^{T} Y_{i}) W_{i j}^{c} = Tr (\sum_{i} D_{i i}^{c} Y_{i}^{T} Y_{i} - \sum_{i, j} W_{i j}^{c} Y_{i}^{T} Y_{j}) = Tr (V^{T} (\sum_{i} D_{i i}^{c} X_{i}^{T} U U^{T} X_{i} - \sum_{i, j} W_{i j}^{c} X_{i}^{T} U U^{T} X_{j}) V) = Tr (V^{T} (P_{U}^{c} - Q_{U}^{c}) V),

where $P_{U}^{c} = \sum_{i} D_{i i}^{c} X_{i}^{T} U U^{T} X_{i}$ and $Q_{U}^{c} = \sum_{i, j} W_{i j}^{c} X_{i}^{T} U U^{T} X_{j} .$ Consequently, we should simultaneously minimize $Tr (U^{T} (P_{V}^{c} - Q_{V}^{c}) U)$ and $Tr (V^{T} (P_{U}^{c} - Q_{U}^{c}) V)$ .

In addition, similar to the above derivation process, the left side of constraint function equation (6) can be converted to:

9

3 \sum_{i \sim j} \int_{0}^{d_{i j}} {(U^{T} X_{i} V + \frac{t}{d_{i j}} (U^{T} X_{i} V - U^{T} X_{j} V))}^{2} d t = Tr (U^{T} (2 P_{V}^{b} + Q_{V}^{b}) U) = Tr (V^{T} (2 P_{U}^{b} + Q_{U}^{b}) V),

where $P_{V}^{b} = \sum_{i} D_{i j}^{b} X_{i} V V^{T} X_{i}^{T}$ , $Q_{V}^{b} = \sum_{i} W_{i j}^{b} X_{i} V V^{T} X_{j}^{T}$ , $P_{U}^{b} = \sum_{i} D_{i j}^{b} X_{i}^{T} U U^{T} X_{i}$ , $Q_{U}^{b} = \sum_{i} W_{i j}^{b} X_{i}^{T} U U^{T} X_{j},$ and $D^{b}$ is a diagonal matrix, $D_{i i}^{b} = \sum_{j} W_{i j}^{b}$ .

Finally, the optimal objective function (5) subject to (6) can be transformed as:

10

\underset{U, V}{m i n} \frac{Tr (U^{T} (P_{V}^{c} - Q_{V}^{c}) U)}{Tr (U^{T} (2 P_{V}^{b} + Q_{V}^{b}) U)},

11

\underset{U, V}{m i n} \frac{Tr (V^{T} (P_{U}^{c} - Q_{U}^{c}) V)}{Tr (V^{T} (2 P_{U}^{b} + Q_{U}^{b}) V)} .

Because of difficulty in solving the optimal $U$ and $V$ simultaneously, we follows the similar computational methods as [20] to compute $U$ and $V$ iteratively. We first initialize $U$ with an identity matrix, then $V$ can be approximately computed with generalized eigenvalue decomposition (GED) by transforming the optimal objective function (11) into the tractable ratio trace form $Tr ({(V^{T} (2 P_{U}^{b} + Q_{U}^{b}) V)}^{- 1} (V^{T} (P_{U}^{c} - Q_{U}^{c}) V))$ . That is, $V$ can be regarded as the eigenvectors associated with the minimum eigenvalues of the following generalized eigenvector problem:

12

(P_{U}^{c} - Q_{U}^{c}) V = λ (2 P_{U}^{b} + Q_{U}^{b}) V .

Once $V$ is obtained, similarly, we can update $U$ by solving the following generalized eigenvector problem:

13

U^{T} (P_{V}^{c} - Q_{V}^{c}) U = λ (2 P_{V}^{b} + Q_{V}^{b}) U

Therefore, we can obtain the final optimal $U$ and $V$ by iteratively solving the generalized eigenvector problems (12) and (13).

In the preceding section, we approximately computed the the optimal objective functions of (10) and (11) by converting them into ratio trace problems, which are solved by GED. However, the obtained solutions may deviate from the original objectives, which may lead to uncertainty in subsequent classification [21]. To address these problems, we describes how to directly solve (10) and (11) with the Iterative algorithm for the Trace Ratio (ITR) optimization problem introduced in [21]. To compute $U$ , we first fix $V$ and initialize $U^{0}$ as an arbitrary columnly orthogonal matrix. In each iterative step, we solve a trace difference problem $U^{t} = a r g \underset{U^{T} U = I}{m i n} Tr (U^{T} ((P_{V}^{c} - Q_{V}^{c}) - λ^{t} (2 P_{V}^{b} + Q_{V}^{b})) U)$ , where $λ^{t}$ is the trace ratio value calculated from the projection matrix $U^{t - 1}$ of the previous step, i.e., $λ^{t} = Tr (U^{t - 1^{T}} (P_{V}^{c} - Q_{V}^{c}) U^{t - 1}) / Tr (U^{t - 1^{T}} (2 P_{V}^{b} + Q_{V}^{b}) U^{t - 1})$ . Once $U$ is obtained, similarly, we can update $V$ by solving $V^{t} = a r g \underset{V^{T} V = I}{m i n} Tr (V^{T} ((P_{U}^{c} - Q_{U}^{c}) - λ^{t} (2 P_{U}^{b} + Q_{U}^{b})) V)$ where $λ^{t} = Tr (V^{t - 1^{T}} (P_{U}^{c} - Q_{U}^{c}) V^{t - 1}) / Tr (V^{t - 1^{T}} (2 P_{U}^{b} + Q_{U}^{b}) V^{t - 1})$ . Finally, output the final $U$ and $V$ when the iterative algorithm converges to optimal solutions. The detailed iteration algorithm for solving (10) and (11) can be presented as follows:

Algorithm1: The iteration algorithm for directly solving the optimal problem (10) and (11) in 2DAHP.

Step 1: Initialize $U^{0}$ and $V^{0}$ as two arbitrary column-wise orthogonal matrices.

Step 2: For $t = 1,2, \dots T_{m a x}$ , do

Step 2.1: Calculate the trace ratio value $λ^{t}$ according to the projection matrix $U^{t - 1}$ :

14

λ^{t} = \frac{Tr ({(U^{t - 1})}^{T} (P_{V}^{c} - Q_{V}^{c}) U^{t - 1})}{Tr ({(U^{t - 1})}^{T} (2 P_{V}^{b} + Q_{V}^{b}) U^{t - 1})} .

Step 2.2: Obtain the new $U^{t}$ by solving the following eigen-decomposition problem:

15

((P_{V}^{c} - Q_{V}^{c}) - λ^{t} (2 P_{V}^{b} + Q_{V}^{b})) U_{i}^{t} = τ_{i}^{t} U_{i}^{t} .

Step 2.3: For the given $U^{t}$ , calculate the trace ratio value $λ^{t}$ according to the projection matrix $V^{t - 1}$ :

16

λ^{t} = \frac{Tr ({(V^{t - 1})}^{T} (P_{U}^{c} - Q_{U}^{c}) V^{t - 1})}{Tr ({(V^{t - 1})}^{T} (2 P_{U}^{b} + Q_{U}^{b}) V^{t - 1})} .

Step 2.4: Obtain the new $V^{t}$ by solving the following eigen-decomposition problem:

17

((P_{U}^{c} - Q_{U}^{c}) - λ^{t} (2 P_{U}^{b} + Q_{U}^{b})) V_{i}^{t} = τ_{i}^{t} V_{i}^{t},

where $τ_{0}^{t} \leq τ_{1}^{t} \leq \dots \leq τ_{d - 1}^{t}$ are the $d$ smallest eigenvalues, and $V_{i}^{t}$ is the eigenvector associated with eigenvalue $τ_{i}^{t}$ , which constitutes the ith column vector of the matrix $V^{t}$ .

Step 2.5: If $‖ U^{t} - U^{t - 1} ‖ < \sqrt{d} ε$ and $‖ V^{t} - V^{t - 1} ‖ < \sqrt{d} ε$ , then break.

Step 3: Output the projection matrices $U = U^{t}$ and $V = V^{t}$ .

From the above algorithemic procedure, it can be easily observe that the obtained projection matrices $U$ and $V$ are orthogonal. In addition, following the conclusions in [21], the above iteration algorithm will converge to an optimal value. For more details about proof of convergence, please refer to [21].

4. Experimental results

In this section, to investigate the performance of our proposed 2DAHP algorithm for gait recognition, we compare the 2DAHP algorithm with 2DPCA [18], 2DLDA [19], 2DLPP [20], and the original AHP [14] algorithms for gait recognition on the well-known USF HumanID gait database, where 2DPCA, 2DLDA, and 2DLPP are three popular tensor methods in face recognition and gait recognition, and the original AHP algorithm is a vector-based algorithm. The settings of these compared algorithms are identical to the description in the corresponding papers. In addition, to cope with the singular problem existed in the original AHP, we apply PCA to remove the components corresponding to zero eigenvalues before carrying out AHP. For 2DAHP, we empirically set the optimal iteration number $T_{m a x}$ as 5 for each probe set, since the latter experimental results show that the 2DAHP algorithm converges quite quickly.

The USF HumanID gait database is constructed by Sarkar et al. [1], it contains 1870 sequences from 122 individuals walking on an elliptical path in front of two cameras. This database provided one gallery set containing the sequences from 122 individuals and 12 probe sets containing different numbers of individuals varying from 33 to 122 for algorithm training and testing, respectively. More details information about the USF HumanID gait database can be found in [1]. In this gait database, we consider sequences of binary silhouette images. As in [22] and [23], we construct the average silhouette-based gait image representation: First, a complete sequence is partitioned into several subsequences according to the gait period length $N_{g a i t}$ provided by Sarkar et al. [1]. Then, the binary silhouette images within each gait cycle of a sequence are averaged to acquire several gray-level average silhouette images according to:

18

A T_{i} = \frac{1}{N_{g a i t}} \sum_{k = (i - 1) N_{g a i t} + 1}^{k = i N_{g a i t}} T (k), i = 1, \dots, ⌊F / N_{g a i t}⌋,

where ${T (1), \dots, T (F)}$ represents the binary images for one sequence with $F$ frames, $⌊ F / N_{g a i t} ⌋$ denotes the largest integer less than or equal to $F / N_{g a i t}$ . Since numerous researches have experimentally shown that the average silhouette image is more effective and efficient than the original binary silhouette image for human gait recognition, we also utilize the average silhouette image for gait recognition. Fig. 1 shows some original binary images and the average silhouette images of two different individuals, where the first seven images and the last image in each row denote the binary silhouette images and the average silhouette images, respectively. As can be seen, different individuals have different average silhouette images.

To perform gait recognition, we first obtain the average silhouette image subspaces by dimensionality reduction algorithms. Then, all the averaged images from both the gallery set and probe sets are projected into the image subspaces. Finally, the nearest-neighbor classifier is adopted to identify new average silhouette images, where the distance measure uses the median operator for its robust to noise [1, 23]:

19

Dist (L S_{P}, L S_{G}) = {Median}_{i = 1}^{N_{p}} ({Min}_{j = 1}^{N_{g}} {‖L S_{P} (i) - L S_{G} (j)‖}^{2}),

where $L S_{P} (i)$ , $i = 1, \dots, N_{p}$ and $L S_{G} (j)$ , $j = 1, \dots, N_{g}$ are the lower representations from one probe sequence and one gallery sequence, respectively, $N_{p}$ and $N_{g}$ denote the total number of average silhouette images. For each dimensionality reduction algorithm, we only show its performance in the $l -$ or $(l \times l) -$ dimensional subspace. For each case, we average the results over 20 random splits of training and testing sets.

Fig. 1Some original binary images and the average silhouette images of two different individuals in the USF HumanID gait database

The recognition accuracies are shown in Table 1 and Table 2, where Rank-1 indicates that the correct subject is ranked as the top candidate, Rank-5 means that the correct subject is ranked among the top five candidates, and Average denotes the recognition rate among all the probe sets. Moreover, we also plot the recognition rate variance with different numbers of iterations for probe sets A, B, C, D, E, F, G, H, I, J, K and L in Fig. 2. Finally, we report the running times of 2DAHP and AHP in Table 3.

From the experimental results listed in Table 1-3 and Fig. 2, we can have the following observations:

1) Our proposed 2DAHP consistently outperforms the 2DPCA, 2DLDA, 2DLPP, and AHP algorithms, which demonstrates that it is beneficial to use simultaneously two-dimensional matrix representation as well as the local geometrical structure and cluster structure for gait recognition.

2) 2DPCA performs the worst among the compared algorithms. A possible explanation is as follows: similar to the traditional PCA, the 2DPCA is simply achieves object reconstruction and it is not necessarily useful for discriminating gait images with different subjects which is the ultimate goal of gait recognition.

3) The 2DLDA performs comparatively to 2DLPP. This demonstrates that it is hard to evaluate whether local manifold structure or class label information is more important, which is consistent with existing studies.

4) The 2DAHP algorithm converges quite quickly for probe sets A, B, C, D, E, F, G, H, I, J, K and L, and recognition rates changes slightly with different iteration numbers. After about 5 iterations, 2DAHP can converge to the optimal solution in all 12 probe sets.

5) 2DAHP achieves significant speed up comparing to AHP. Theses results are consistent with the theoretical analysis of the efficiency, i.e., 2DAHP can utilize the intrinsic tensor structure of gait images to improve running efficiency.

Table 1Performance comparison in terms of Rank-1 recognition results (%)

Probe	A	B	C	D	E	F	G	H	I	J	K	L	Average
2DPCA	86	87	75	26	27	18	19	56	61	53	10	11	44.1
2DLDA	89	91	82	33	33	23	25	67	78	67	19	19	52.2
2DLPP	90	92	81	34	36	22	24	69	82	65	18	20	52.8
AHP	88	90	78	32	40	21	20	64	79	61	13	15	50.1
2DAHP	93	94	85	45	48	26	33	84	84	69	27	22	59.2

Fig. 2Some original binary images and the average silhouette images of two different individuals in the USF HumanID gait database

a)

b)

c)

d)

e)

f)

g)

h)

i)

j)

k)

l)

Table 2Performance comparison in terms of Rank-5 recognition results (%)

Probe	A	B	C	D	E	F	G	H	I	J	K	L	Average
2DPCA	89	94	88	51	53	44	42	80	79	64	22	17	60.3
2DLDA	97	99	95	58	57	50	50	86	93	77	43	40	70.5
2DLPP	95	98	93	59	60	48	51	88	91	75	45	41	70.3
AHP	91	95	89	60	59	49	48	85	82	69	29	26	65.2
2DAHP	99	100	96	75	77	56	60	94	94	83	46	38	76.4

Table 3Running time(s) comparison on the USF HumanID gait database

Probe	A	B	C	D	E	F	G	H	I	J	K	L
AHP	2.51	1.17	1.18	2.64	1.29	2.48	1.32	2.41	1.31	2.40	1.06	1.08
2DAHP	0.32	0.15	0.15	0.34	0.21	0.30	0.19	0.30	0.18	0.28	0.10	0.11

5. Conclusions

This paper introduces a tensor dimensionality reduction algorithm called two-dimensional approximately harmonic projection (2DAHP). Compared with the original AHP, 2DAHP can directly conducts subspace analysis by encoding an image as a two-dimensional matrix and has higher computational efficiency. Experimental results on gait recognition have demonstrated the effectiveness and efficiency of our proposed approach.

References

Sarkar S., Phillips P. J., Liu Z., Vega I. R., Grother P., Bowyer K. W. The HumanID gait challenge problem: data sets, performance, and analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 27, Issue 2, 2005, p. 162-177.

Search CrossRef
Han J., Bhanu B. Individual recognition using gait energy image. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 28, Issue 2, 2006, p. 316-322.

Search CrossRef
Wang L., Tan T., Ning H., Hu W. Silhouette analysis-based gait recognition for human identification. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 25, Issue 12, 2003, p. 1505-1518.

Search CrossRef
Xu D., Yan S., Tao D., Lin S., Zhang H.-J. Marginal fisher analysis and its variants for human gait recognition and content based image retrieval. IEEE Transactions on Image Processing, Vol. 16, Issue 11, 2007, p. 2811-2821.

Search CrossRef
Duda R. O., Hart P. E., Stork D. G. Pattern Classification. Second Edition, Wiley-Interscience, Hoboken, 2000.

Search CrossRef
He X., Yan S., Hu Y., Niyogi P., Zhang H.-J. Face recognition using laplacianfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 27, Issue 3, 2005, p. 328-340.

Search CrossRef
Cai D., He X., Han J., Zhang H.-J. Orthogonal laplacianfaces for face recognition. IEEE Transactions on Image Processing, Vol. 15, Issue 11, 2006, p. 3608-3614.

Search CrossRef
Yan S., Xu D., Zhang B., Zhang H.-J., Yang Q., Lin S. Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, Issue 1, 2007, p. 40-51.

Search CrossRef
Mu Y., Tao D. Biologically inspired feature manifold for gait recognition. Neurocomputing, Vol. 73, Issue 4-6, 2010, p. 895-902.

Search CrossRef
Roweis S. T., Saul L. K. Nonlinear dimensionality reduction by locally linear embedding. Science, Vol. 290, Issue 5500, 2000, p. 2323-2326.

Search CrossRef
Tenenbaum J. B., Silva V., Langford J. C. A global geometric framework for nonlinear dimensionality reduction. Science, Vol. 290, Issue 5500, 2000, p. 2319-2323.

Search CrossRef
Belkin M., Niyogi P. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation, Vol. 15, Issue 6, 2003, p. 1373-1396.

Search CrossRef
He X., Niyogi P. Locality preserving projections. Advances in Neural Information Processing Systems, 2003, p. 585-591.

Search CrossRef
Lin B., He X., Zhou Y., Liu L., Lu K. Approximately harmonic projection: theoretical analysis and an algorithm. Pattern Recognition, Vol. 43, Issue 10, 2010, p. 3307-3313.

Search CrossRef
Tao D., Li X., Wu X., Maybank S. General tensor discriminant analysis and Gabor features for gait recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, Issue 10, 2007, p. 1700-1715.

Search CrossRef
Xu D., Yan S., Tao D., Zhang L., Li X., Zhang H.-J. Human gait recognition with matrix representation. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 16, Issue 7, 2006, p. 896-903.

Search CrossRef
Yan S., Xu D., Yang Q., Zhang L., Tang X., Zhang H. Multilinear discriminant analysis for face recognition. IEEE Transactions on Image Processing, Vol. 16, Issue 1, 2007, p. 212-220.

Search CrossRef
Yang J., Zhang D., Frangi A. F., Yang J. Y. Two-dimensional PCA: a new approach to appearance-based face representation and recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 26, Issue 1, 2004, p. 131-137.

Search CrossRef
Ye J., Janardan R., Li Q. Two-dimensional linear discriminant analysis. Neural Information Processing Systems, 2005, p. 1569-1576.

Search CrossRef
He X., Cai D., Niyogi P. Tensor subspace analysis. Advances in Neural Information Processing Systems, 2005, p. 499-506.

Search CrossRef
Wang H., Yan S., Xu D., Tang X., Huang T. Trace ratio vs. ratio trace for dimensionality reduction. Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2007, p. 1-8.

Search CrossRef
Han J., Bhanu B. Statistical feature fusion for gait-based human recognition. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004, p. 842-847.

Search CrossRef
Liu Z., Sarkar S. Simplest representation yet for gait recognition: averaged silhouette. Proceedings of the 17th International Conference on Pattern Recognition, 2004, p. 211-214.

Search CrossRef

About this article

Received

02 February 2013

Accepted

03 June 2013

Published

30 June 2013

Keywords

dimensionality reduction

approximately harmonic projection

matrix representation

gait recognition

Acknowledgements

This work is supported by NSFC (Grant No. 70701013), the National Science Foundation for Post-Doctoral Scientists of China (Grant No. 2011M500035), and the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant No.20110023110002).

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.