Simultaneous Maximization of the Trace of a Set of Matrices

(1)

Simultaneous Maximization of the Trace of a Set of

Matrices

Lieven De Lathauwer

Katholieke Universiteit Leuven

Group Science, Engineering and Technology, 8500 Kortrijk, Belgium and E.E. Dept. (ESAT) - SCD - SISTA, B-3001 Leuven, Belgium

Email: Lieven.DeLathauwer@kuleuven-kortrijk.be and Lieven.DeLathauwer@esat.kuleuven.be

Nov. 14, 2009

Problem description

Given M1, . . . , MK ∈ RN×N, and R < N, we wish to maximize

f(a, Q) = K X k=1 R X r=1 ak( ˜Mk)rr, (1) in which Q ∈ RN_×N is orthogonal, a ∈ RK is unit-norm and ˜Mk = Q ·

Mk · QT, k = 1, . . . , K. We will also consider the case where a is either

unit-norm or zero, and nonnegative. The matrices Mk are not necessarily

positive (semi)definite. Eq. (1) will be written as f(a, Q) =

K

X

k₌₁

aktraceR( ˜Mk). (2)

Note that traceR(·) is a linear function, i.e.,

traceR(αA + βB) = α traceR(A) + β traceR(B). (3)

(2)

Alternating algorithm

We can compute the solution by means of an alternating algorithm:

1. Update a for given estimate of Q. 2. Update Q for given estimate of a. 3. Go back to 1 until convergence.

Convergence to a local optimum is guaranteed, because in each step we will maximally increase the objective function f . It may be necessary to reini-tialize a number of times in order to find the global optimum.

Updating a.

Given Q, f in (1) can be written as f(a) = K X k=1 ak R X r=1 ( ˜Mk)rr = aT · R X r₌₁ ( ˜M₁)rr, . . . , R X r₌₁ ( ˜MK)rr !T def = aT · x. The optimal vector a is given by x

kxk and the optimal value of f is given by

kxk.

Special case: a is either unit-norm or zero, and nonnegative. Denote by ˜

x _{∈ R}K the vector obtained from x by setting the negative components equal to zero. If ˜x 6= 0, then the optimal vector a is given by x˜

k˜xk and the

optimal value of f is given by k˜xk. If ˜x = 0, then we restart the iteration from a new initial point. After a number of restarts, we decide that the overall optimal vector a and the overall optimal value of f are zero.

(3)

Updating Q.

Given a, f can be written as

f(Q) = traceR(Q · ( K X k=1 akMk) · Q T ). (4)

Hence, we have to determine the optimal Q for only one matrix, namely PK

k=1akMk, which we write as Y ∈ RN×N. Only the first R rows of Q are

of importance. If Y has not less than R positive eigenvalues, then the first R rows of Q are taken equal to the R eigenvectors that correspond to the largest eigenvalues of Y. If Y has only ˜R < R positive eigenvalues, then the first R rows of Q are taken equal to the ˜R eigenvectors that correspond to the positive eigenvalues of Y, complemented with the R − ˜R eigenvectors that correspond to the least negative eigenvalues of Y.

Initialization.

The (first R rows of the) matrix Q may be initialized as (part of) the identity matrix, or the matrix obtained from the truncated MLSVD, or the matrix obtained from HOOI, or a set of random mutually orthonormal vectors.

Preprocessing: dimensionality reduction.

Stack the matrices Mk in a tensor M ∈ R

N_{×N ×K}

. Let the MLSVD of M be given by

M = S •1U•2U•3V. (5)

Assume that for some R′ _> _{R, the mode-1 singular values σ}(1)

R′₊₁, . . . , σ

(1) N

are very small, in the sense that the corresponding part of S does not really contribute to M. Then we may do the analysis for S(1 : R′_,_{1 : R}′_,_{1), . . . ,}

S(1 : R′_,_{1 : R}′_{, K) and afterwards multiply by U(:, 1 : R}′_{). (This is a bit}

delicate when, for instance, all the Mk are negative semidefinite, but that is

a detail; beyond R′_{, we should actually have a margin of at least R, to avoid}

(4)

problems with negativity; to be discussed.) One may wish to conclude with a few fine-tuning steps of the algorithm in the original space.

Acknowledgments

Research supported by: (1) Research Council K.U.Leuven: GOA-MaNet, CoE EF/05/006 Optimization in Engineering (OPTEC), CIF1, STRT1/08/023, (2) F.W.O.: (a) project G.0427.10N, (b) Research Communities ICCoS, AN-MMM and MLDM, (3) the Belgian Federal Science Policy Office: IUAP P6/04 (DYSCO, “Dynamical systems, control and optimization”, 2007–2011), (4) EU: ERNSI.