Copyright © by SIAM. Unauthorized reproduction of this article is prohibited.

(1)

!

Vol. 36, No. 2, pp. 496–522

COUPLED CANONICAL POLYADIC DECOMPOSITIONS AND (COUPLED) DECOMPOSITIONS IN MULTILINEAR

RANK-(Lr,n, Lr,n, 1) TERMS—PART I: UNIQUENESS∗

MIKAEL SØRENSEN† _AND _{LIEVEN DE LATHAUWER}†

Abstract. Coupled tensor decompositions are becoming increasingly important in signal pro-cessing and data analysis. However, the uniqueness properties of coupled tensor decompositions have not yet been studied. In this paper, we first provide new uniqueness conditions for one factor matrix of the coupled canonical polyadic decomposition (CPD) of third-order tensors. Then, we present necessary and sufficient overall uniqueness conditions for the coupled CPD of third-order tensors. The results demonstrate that improved uniqueness conditions can indeed be obtained by taking into account the coupling between several tensor decompositions. We extend the results to higher-order tensors and explain that the higher-order structure can further improve the uniqueness results. We discuss the special case of coupled matrix-tensor factorizations. We also present a new variant of the coupled CPD model called the coupled block term decomposition (BTD). On one hand, the cou-pled BTD can be seen as a variant of coucou-pled CPD for the case where the common factor contains collinear columns. On the other hand, it can also be seen as an extension of the decomposition into multilinear rank-(Lr, Lr, 1) terms to coupled factorizations.

Key words. coupled decompositions, higher-order tensor, parallel factor (PARAFAC), canonical decomposition (CANDECOMP), canonical polyadic decomposition, coupled matrix-tensor factoriza-tion

AMS subject classifications. 15A22, 15A23, 15A69 DOI. 10.1137/140956853

1. Introduction. The coupled canonical polyadic decomposition (CPD) model

(formally defined in subsection 4.1) seems to have been first used in psychometrics

[21,22] as a way of integrating several three-way studies that involve the same stimuli and as a means of coping with missing data in coupled data sets. The technique was also later considered in chemometrics [36]. In recent years coupled canonical polyadic decompositions have had a resurgence in several engineering disciplines. We mention data mining, where they are used as an explorative tool for finding structure in coupled

data sets [3,1], and bioinformatics, where they are used as a tool for fusion of data

obtained by different analytical methods such as nuclear magnetic resonance and

fluorescence spectroscopy [32,48]. In chemometrics it has been suggested that coupled

matrix-tensor factorizations can be used to fuse data obtained by different analytic methods [2]. We also mention that in biomedical engineering several multisubject or data fusion methods that combine different modalities (fMRI, EEG, MEG, etc.) ∗_{Received by the editors February 13, 2014; accepted for publication (in revised form) by}

D. P. O’Leary February 17, 2015; published electronically May 7, 2015. This research was supported by Research Council KU Leuven under GOA/10/09 MaNet and CoE PFV/10/002 (OPTEC); F.W.O. under project G.0427.10, G.0830.14N, G.0881.14N; the Belgian Federal Science Policy Office under IUAP P7 (DYSCO II, Dynamical Systems, Control and Optimization, 2012–2017); and the Euro-pean Research Council under the EuroEuro-pean Union’s Seventh Framework Programme (FP7/2007– 2013)/ERC Advanced Grant BIOTENSORS (339804). This paper reflects only the authors’ views and the Union is not liable for any use that may be made of the contained information.

http://www.siam.org/journals/simax/36-2/95685.html

†_{Group Science, Engineering and Technology, KU Leuven - Kulak, 8500 Kortrijk,}

Bel-gium, and STADIUS Center for Dynamical Systems, Signal Processing and Data Analyt-ics, and iMinds Department Medical Information Technologies, Departement Elektrotechniek (ESAT), KU Leuven, B-3001 Leuven-Heverlee, Belgium (Mikael.Sorensen@kuleuven-kulak.be,

Lieven.DeLathauwer@kuleuven-kulak.be).

496

(2)

can be interpreted as coupled CPD problems [19, 28, 9, 20, 29, 4]. Despite their importance, to the best of our knowledge, no algebraic studies of coupled tensor decompositions have been provided so far. In particular, no dedicated uniqueness conditions for coupled CPD problems are available.

Several problems in signal processing involve polyadic decompositions that have factor matrices with collinear columns. A particular case is of block term decompo-sitions, which are decompositions of a tensor in terms of low multilinear rank [13].

We mention applications in array processing [34, 38, 39], wireless communication

[35,10,12,31,37], and blind separation of signals that can be modeled as exponen-tial polynomials [14]. There are also applications in chemometrics [6]. Hence, in the study of the coupled CPD model we should pay special attention to collinearity.

The rest of the introduction presents our notation. Sections2and3briefly review

the CPD and the decomposition into multilinear rank-(Lr, Lr, 1) terms. In section 4

we introduce the coupled CPD and study its uniqueness properties. The results are (i) necessary coupled CPD uniqueness conditions, (ii) sufficient uniqueness conditions for the common factor matrix of the coupled CPD, (iii) sufficient overall uniqueness conditions for the coupled CPD, (iv) extensions to tensors of arbitrary order, and (v) a discussion of the uniqueness properties of the coupled matrix-tensor

factoriza-tion. Section 5 discusses a new coupled CPD model in which the common factor

matrix contains collinear components. The paper is concluded in section6.

1.1. Notation. Vectors, matrices, and tensors are denoted by lowercase bold-face, uppercase boldbold-face, and uppercase calligraphic letters, respectively. The rth

column vector of A is denoted by ar. The symbols⊗ and " denote the Kronecker

and Khatri–Rao product, defined as

A⊗ B :=    a11B a12B . . . a21B a22B . . . .. . ... . ..    , A " B :=' a1⊗ b1 a2⊗ b2 . . . (,

in which (A)_mn = amn. The outer product of N vectors a(n) ∈ CIn is denoted by

a(1)_{◦ a}(2)_{◦ · · · ◦ a}(N )_{∈ C}I1×I2×···×IN_{, such that}

)

a(1)◦ a(2)_{◦ · · · ◦ a}(N )*

i1,i2,...,iN

= a(1)_i₁ a(2)_i₂ · · · a(N )iN .

The identity matrix, all-zero matrix, and all-zero vector are denoted by IM∈ CM×M,

0M,N ∈ CM×N, and 0M ∈ CM, respectively. The all-ones vector is denoted by

1R= [1, . . . , 1]T∈ CR.

The transpose, Moore–Penrose pseudo-inverse, Frobenius norm, determinant,

range, and kernel of a matrix are denoted by (·)T, (·)†, % · %F, |·|, range (·), and

ker (·), respectively. The cardinality of a set S is denoted by card (S).

MATLAB index notation will be used for submatrices of a given matrix. For example, A(1 : k, :) represents the submatrix of A consisting of the rows from 1 to k

of A. Dk(A)∈ CJ×J denotes the diagonal matrix holding row k of A∈ CI×J on its

diagonal. Given A∈ CI×J_{, Vec (A)}_{∈ C}IJ _{will denote the column vector defined by}

(Vec (A))_i+(j_−1)I= (A)_ij.

The matrix that orthogonally projects onto the orthogonal complement of the

column space of A∈ CI×J _{is denoted by}

PA= II− FFH∈ CI×I,

(3)

498 MIKAEL SØRENSEN AND LIEVEN DE LATHAUWER

where the column vectors of F constitute an orthonormal basis for range (A).

The Heaviside step function H :Z → {0, 1} is defined as

H[n] = +

0 , n < 0 ,

1 , n≥ 0 .

The rank of a matrix A is denoted by r (A) or rA. The k-rank of a matrix A

is denoted by k (A) or kA. It is equal to the largest integer k (A) such that every

subset of k (A) columns of A is linearly independent. More generally, the k$_{-rank of}

a partitioned matrix A is denoted by k$_{(A). It is equal to the largest integer k}$_(A)

such that any set of k$(A) submatrices of A yields a set of linearly independent

columns. The number of nonzero entries of a vector x is denoted by ω (x) in the tensor decomposition literature, dating back to the work of Kruskal [26].

Let Ck

n= _k!(n−k)!n! denote the binomial coefficient. The kth compound matrix of

A∈ Cm×n_{is denoted by C}

k(A)∈ CC

k

m×Cnk _{and its entries correspond to the k-by-k}

minors of A, ordered lexicographically. As an example, let A∈ C4×3_{; then}

C2(A) =         |A ([1, 2], [1, 2])| |A ([1, 2], [1, 3])| |A ([1, 2], [2, 3])| |A ([1, 3], [1, 2])| |A ([1, 3], [1, 3])| |A ([1, 3], [2, 3])| |A ([1, 4], [1, 2])| |A ([1, 4], [1, 3])| |A ([1, 4], [2, 3])| |A ([2, 3], [1, 2])| |A ([2, 3], [1, 3])| |A ([2, 3], [2, 3])| |A ([2, 4], [1, 2])| |A ([2, 4], [1, 3])| |A ([2, 4], [2, 3])| |A ([3, 4], [1, 2])| |A ([3, 4], [1, 3])| |A ([3, 4], [2, 3])|         .

See [23,15] for discussion of compound matrices.

2. Canonical polyadic decomposition. Consider the third-order tensorX ∈

CI×J×K_{. We say that}_{X is a rank-1 tensor if it is equal to the outer product of some}

nonzero vectors a∈ CI_{, b}_{∈ C}J_{, and c}_{∈ C}K_{such that x}

ijk= aibjck. Decompositions

into a sum of rank-1 terms are called polyadic decompositions (PDs): X = R , r=1 ar◦ br◦ cr. (2.1)

The rank of a tensorX is equal to the minimal number of rank-1 tensors that yield

X in a linear combination. Assume that the rank of X is R; then (2.1) is called

the canonical PD (CPD) of X . The CPD is also known as the PARAllel FACtor

(PARAFAC) [22] and the CANonical DECOMPosition (CANDECOMP) [7]. Let us

stack the vectors{ar}, {br}, and {cr} into the matrices

A = [a1, . . . , aR]∈ CI×R, B = [b1, . . . , bR]∈ CJ×R, C = [c1, . . . , cR]∈ CK×R.

The matrices A, B, and C will be referred to as the factor matrices of the CPD in (2.1). The following subsection presents matrix representations of (2.1) that will be used throughout the paper.

2.1. Matrix representations. Let X(i··)∈ CJ×K _{denote the matrix such that}

(X(i··))jk= xijk; then X(i··)= BDi(A) CT and

CIJ×K _{) X}

(1):=

-X(1··)T, . . . , X(I··)T.T = (A" B) CT.

(2.2)

(4)

More generally, the PD or CPD of the higher-order tensor X ∈ CI1×···×IM _has

the matrix representations

(2.3) X(w)=  1 p∈Γw A(p)" 1 q∈Υw A(q)   4 1 r∈Ψw A(r) 5T ,

where A(m)∈ CIm×R_{and the sets Γ}

w, Υw, and Ψwhave properties Γw6Υw6Ψw=

{1, 2, . . . , M}, Γw7Υw=∅, Γw7Ψw=∅, and Υw7Ψw=∅.

2.2. Uniqueness conditions for one factor matrix of a CPD. A factor

matrix, say C, of the CPD ofX ∈ CI×J×K _{is said to be unique if it can be}

deter-mined up to the inherent column scaling and permutation ambiguities fromX . More

formally, the factor matrix C is unique if all the triplets ( 8A, 8B, 8C) satisfying (2.1)

also satisfy the condition

8

C = CP∆ ,

where P is a permutation matrix and ∆ is a diagonal matrix. One of the first unique-ness conditions for one factor matrix of a CPD was obtained by Kruskal in [26]. In this paper we will make use of the following result.

Theorem 2.1. Consider the PD ofX ∈ CI×J×K _{in (2.1). If}

     k (C)≥ 1, min (I, J)≥ R − r (C) + 2,

C_R−r(C)+2(A)" CR−r(C)+2(B) has full column rank, (2.4)

then the rank ofX is R and the factor matrix C is unique [15].

Condition (2.4) is more relaxed than Kruskal’s, and the proof of the theorem admits a constructive interpretation [17].

2.3. Overall uniqueness conditions for CPD. The rank-1 tensors in (2.1) can be arbitrarily permuted without changing the decomposition. The vectors within the same 1 tensor can also be arbitrarily scaled provided that the overall rank-1 term remains the same. We say that the CPD is unique when it is only subject to the mentioned indeterminacies. One of the first deep CPD uniqueness results was obtained by Kruskal [26]. For a recent comprehensive study of CPD uniqueness in the

third-order case we refer the reader to [15,16]. Below we state some uniqueness results

for CPD that we will extend to the coupled CPD case. The results are summarized

in Table1.

Table 1

Full column rank (f.c.r.) requirements for different CPD uniqueness conditions. In the case where C has f.c.r., we further distinguish between a sufficient (S) and a necessary and sufficient (N and S) condition.

Thm. 2.2 Thm.2.3 Thm. 2.4 Thm.2.5

Matrices required to have f.c.r. None C C C and A

Condition S N and S S N and S

Together with related results in [16], the following is one of the most relaxed deterministic conditions for CPD uniqueness. It does not require any of the factor matrices to have full column rank.

(5)

Theorem 2.2. Consider the PD ofX ∈ CI×J×K_{in (2.1). Let S denote a subset}

of {1, . . . , R} and let Sc ₌_{{1, . . . , R} \ S denote the complementary set. Stack the}

columns of C with index in S in C(S) ∈ CK×card(S) _{and stack the columns of C}

with index in Sc _{in C}(Sc₎

∈ CK×(R−card(S))_{. Stack the columns of A (resp., B) in}

the same order such that A(S) ∈ CI×card(S) _{(resp., B}(S)

∈ CJ×card(S)_{) and A}(Sc₎

∈ CI×(R−card(S))_{(resp., B}(S)

∈ CJ×(R−card(S))_{) are obtained. If} =

k (C)≥ 2,

r (CR−rC+2(A)" CR−rC+2(B)) = C

R−rC+2

R ,

and if there exists a subset S of{1, . . . , R} with 0 ≤ card (S) ≤ rC such that1, 2

      

C(S) has full column rank (r_C(S)= card (S)) ,

B(Sc) _{has full column rank (r}

B(Sc )= R− card (S)) , r)-P_C(S)C(S c )_{" A}(Sc)_{, P} C(S)c(S c₎ r ⊗ II .* = I + R− card (S) − 1 ∀r ∈ Sc_,

then the rank ofX is R and the CPD of X is unique [40].

If one factor matrix has full column rank, say C, then the following condition is not only sufficient but also necessary.

Theorem 2.3. Consider the PD of X ∈ CI×J×K _{in (2.1). Define E(w) =}

>R

r=1wrarb T

r. Assume that C has full column rank. The rank of X is R and the

CPD of X is unique if and only if [42,25,46,14]

r (E(w))≥ 2 ∀w ∈?x∈ CR@@ ω(x) ≥ 2A.

(2.5)

Generically,3 _{condition (2.5) is satisfied and C has full column rank if R}_{≤ K and}

R≤ (I − 1)(J − 1) [42].

In practice, condition (2.5) may not be easy to check. Instead we may resort to the following more convenient result in the case where one factor matrix has full column rank.

=

C has full column rank,

C2(A)" C2(B) has full column rank,

(2.6)

then the rank of X is R and the CPD of X is unique [25, 11,46, 15]. Generically,

condition (2.6) is satisfied if R≤ K and 2R(R − 1) ≤ I(I − 1)J(J − 1) [11,43].

In the case where two factor matrices, say A and C, have full column rank,

Theorems2.3and2.4simplify to the following.

Theorem 2.5. Consider the PD of X in (2.1). Assume that A and C have full

column rank. The rank of X is R and the CPD of X is unique if and only if kB≥ 2

(see, e.g., [27]). Generically, this is satisfied if R≤ min(I, K) and 2 ≤ J.

1_{Note that the set S in Theorem}_2.2_{may be empty, i.e., card (S) = 0 such that S =}_{∅. This}

corresponds to the case where P_C(S)= IK. 2_{The last condition states that M}

r = [P_C(S)C(S c₎ " A(Sc)_{, P} C(S)c(S c₎ r ⊗ II] has a

one-dimensional kernel for every r ∈ Sc_{, which is minimal since [n}T r, a

(Sc)T

r ]T ∈ ker (Mr) for some

nr∈ Ccard(Sc).

3_{A tensor decomposition property is called generic if it holds with probability one when the}

entries of the factor matrices are drawn from absolutely continuous probability density measures.

(6)

3. CPD with collinearity in a factor matrix. We consider PDs of X ∈

CI×J×K _{that involve collinearities in the factor matrix C of the type}

X = R , r=1 Lr , l=1 a(r)_l ◦ b(r)l ◦ c(r)= R , r=1 ) A(r)B(r)T*◦ c(r), (3.1) where A(r) = [a(r)₁ , . . . , a(r)_L_r]∈ CI×Lr_{, B}(r) _{= [b}(r) 1 , . . . , b (r) Lr]∈ C J×Lr_{. Similarly to}

A(r) and B(r), we may define C(r) = 1T

Lr⊗ c

(r) _{∈ C}K×Lr_{, i.e., column vector c}(r)

is repeated Lr times. Note that, if Lr ≥ 2 for some r ∈ {1, . . . , R}, then the PD of

X cannot be unique (see, e.g., [44]). In cases like this, it is impossible to recover the

individual columns of the factors A(r)and B(r). If the matrices A(r)B(r)T have rank

Lr, then the decomposition (3.1) is also known as the decomposition into multilinear

rank-(Lr, Lr, 1) terms [13].

3.1. Matrix representation. Let us stack the above matrices and vectors into the matrices A =-A(1), . . . , A(R).∈ CI×(!Rr=1Lr)_, _B ₌ -B(1), . . . , B(R).∈ CJ×(!Rr=1Lr)_, C =-C(1)_{, . . . , C}(R)._{∈ C}K×(!R r=1Lr)_, _C(red) ₌-_c(1)_{, . . . , c}(R)._{∈ C}K×R_,

where “red” stands for reduced. The PD or CPD of the tensorX in (3.1) with collinear

columns in C admits the following matrix representation:

CIJ×K_{) X} (1)= -X(1··)T, . . . , X(I··)T.T = (A" B) CT (3.2)

=-Vec)B(1)A(1)T*, . . . , Vec)B(R)A(R)T*.C(red)T.

(3.3)

3.2. Overall uniqueness conditions for decomposition into multilinear

rank-(Lr, Lr, 1) terms. Let {{ 8A

(n)

}, { 8B(n)}, 8C} yield an alternative

decomposi-tion of X into multilinear rank-(Lr, Lr, 1) terms. The multilinear rank-(Lr, Lr, 1)

tensors in (3.1) can be arbitrarily permuted, and the vectors within the same

cou-pled multilinear rank-(Lr, Lr, 1) tensor can be arbitrarily scaled provided the overall

coupled multilinear rank-(Lr, Lr, 1) term remains the same. We say that the

decom-position into multilinear rank-(Lr, Lr, 1) terms is unique when it is only subject to

the mentioned indeterminacies.

The following uniqueness condition for decomposition ofX into multilinear

rank-(Lr, Lr, 1) terms has been obtained in [13].

k$(A) = R and k$(B) + k (C)≥ R + 2 ,

(3.4)

then the minimal number of multilinear rank-(Lr, Lr, 1) terms is R and the

decompo-sition of X into multilinear rank-(Lr, Lr, 1) terms is unique.

Other related uniqueness results can be found in [13]. For the case where C has full column rank, the following necessary and sufficient uniqueness condition for

decomposition ofX into multilinear rank-(Lr, Lr, 1) terms has been obtained in [14].

Theorem 3.2. Consider the PD of X ∈ CI×J×K _{in (3.1). Define E(w) =}

>R

r=1wrA

(r)_B(r)T_{. Assume that C has full column rank. A necessary and sufficient}

(7)

X

(1)

=

c1 a(1)1 b(1)1 +· · · + cR a(1)R b(1)R

..

.

X

(N )

=

c1 a(N )1 b(N )1 +· · · + cR a(N )R b(N )_R

Fig. 1. Coupled PD of the third-order tensorsX(1)_{, . . . ,}_X(N)_.

condition for uniqueness of the decomposition of X into multilinear rank-(Lr, Lr, 1)

terms is that r (E(w)) > max r|wr&=0 Lr ∀w ∈ ? x∈ CR@@ ω(x) ≥ 2A. (3.5)

Generalizing CPD results in [8], generic uniqueness bounds for the BTD have been obtained in [50].

4. New results for coupled CPD. In subsection4.1we introduce some

defini-tions and notation associated with the coupled CPD. Subsection4.2presents necessary

conditions for coupled CPD uniqueness. Subsection4.3presents uniqueness conditions

for the common factor matrix. In subsection4.4we develop sufficient uniqueness

con-ditions for the coupled CPD. Subsection 4.5briefly explains that the results can be

extended to tensors of order greater than three. Subsection 4.6 comments on the

coupled matrix-tensor factorization problem.

4.1. Definitions and notation. We say that a collection of tensors X(n) _∈

CIn×Jn×K_{, n}_{∈ {1, . . . , N}, admits an R-term coupled polyadic decomposition if each}

tensorX(n)_{can be written as}

X(n)= R , r=1 a(n)_r ◦ b(n)r ◦ cr, n∈ {1, . . . , N}, (4.1)

with factor matrices

A(n)=- a(n)₁ , . . . , a(n)_R . ∈ CIn×R_, _n_{∈ {1, . . . , N},} B(n)=- b(n)₁ , . . . , b(n)_R . ∈ CJn×R_, _n_{∈ {1, . . . , N},} C =' c1, . . . , cR ( ∈ CK×R_.

The coupled PD of the third-order tensors{X(n)_{} is visualized in Figure}_1.

(8)

We define the coupled rank of{X(n)_{} as the minimal number of coupled rank-1}

tensors that yield {X(n)_{} in a linear combination. Assume that the coupled rank of}

{X(n)_{} is R; then (}_{4.1) will be called the coupled CPD of}_{X(n)_}.

It is clear that the coupled rank-1 tensors in (4.1) can be arbitrarily permuted and that the vectors within the same coupled rank-1 tensor can be arbitrarily scaled provided the overall coupled rank-1 term remains the same. We say that the coupled CPD is unique when it is only subject to these trivial indeterminacies.

In this paper we will make use of the matrix representation of{X(n)_},

X =     X(1)₍₁₎ .. . X(N )₍₁₎     =    A(1)" B(1) .. . A(N )" B(N )    CT= FCT∈ C(!N n=1InJn)×K_, (4.2) where F =    A(1)_{" B}(1) .. . A(N )" B(N )    ∈ C(!Nn=1InJn)×R_. (4.3)

4.2. Necessary conditions for coupled CPD uniqueness. Propositions4.1

and4.2following generalize well-known necessary uniqueness conditions for CPD (see,

e.g., [30,44]) to the coupled CPD case.

Proposition 4.1. If the coupled CPD of{X(n)_{} in (4.1) is unique, then k}

C≥ 2.

Proof. Assume that k (C) = 1, say c1and c2are collinear; then linear

combina-tions of c1and c2will yield an alternative coupled CPD of{X(n)} that is not related

via trivial column scaling and permutation ambiguities.

Note that in contrast to ordinary CPD, Proposition 4.1 does not prevent that

k_A(n)= 1 and/or k_B(n) = 1 for some n∈ {1, . . . , N}. Indeed, the coupled CPD may

be unique in such cases, as will be explained in subsection4.4.

Proposition 4.2. If the coupled CPD of {X(n)_{} in (4.1) is unique, then F has}

full column rank.

Proof. The result follows directly from relation (4.2). Indeed, if F does not have

full column rank, then for any x ∈ ker (F) we obtain X = FCT = F(CT + xyT_),

where y∈ CK_.

Again, in contrast to ordinary CPD, Proposition 4.2 does not prevent that for

some n ∈ {1, . . . , N} the individual Khatri–Rao product matrices A(n)" B(n) are

rank deficient. This will be further discussed in subsection4.4.

It is well known that the condition kC≥ 2 is generically satisfied if K ≥ 2. Based

on Lemma4.3we explain in Proposition4.4that F generically has full column rank

if >N_n=1InJn ≥ R. Hence, the necessary conditions stated in Propositions 4.1and

4.2are expected to be satisfied under mild conditions.

Lemma 4.3. Given an analytic function f : Cn_{→ C, if there exists an element}

x∈ Cn _{such that f (x)}_{-= 0, then the set { x | f (x) = 0 } is of Lebesgue measure zero}

(see, e.g., [24]).

Proposition 4.4. Consider F ∈ C(!N

n=1InJn)×R _{given by (4.3). For generic}

matrices{A(n)} and {B(n)}, the matrix F has rank min(>Nn=1InJn, R).

Proof. Due to Lemma4.3we just need to find one example where the statement

made in this lemma holds. We give an example in the supplementary material. Another necessary condition for CPD uniqueness is that none of the column

vectors of A" B (similarly for A " C and B " C) in (2.2) can be written as linear

(9)

combinations of its remaining column vectors [15,14]. Proposition4.5extends the

result to coupled CPD.

Proposition 4.5. Consider the coupled PD ofX(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N},}

in (4.1). Define E(n)(w) = R , r=1 wra(n)r b(n)Tr and Ω = ? x∈ CR@_{@ ω(x) ≥ 2}A_. (4.4)

If the coupled CPD of{X(n)_{} in (4.1) is unique, then}

∀w ∈ Ω ∃ n ∈ {1, . . . , N} : r)E(n)(w)*≥ 2 .

(4.5)

Proof. The necessity of r (F) = R has already been mentioned in Proposition4.2.

Assume now that there exists a vector w(r) _{∈ C}R _{with ω(w}(r)₎ _{≥ 2 such that for}

some r∈ {1, . . . , R} we have (4.6) Ba(n)r ⊗ Bb (n) r = R , s=1 w(r) s ) a(n) s ⊗ b(n)s * ∀n ∈ {1, . . . , N} .

Since F has full column rank, its column vectors are linearly independent, that is, > s&=rw (r) s (a(n)s ⊗ b(n)s ) cannot be proportional to a (n) r ⊗ b(n)r for all n∈ {1, . . . , N}, and consequentlyBa(n)r ⊗ Bb (n) r is not proportional to a (n) r ⊗ b(n)r for all n∈ {1, . . . , N}.

This means that factor matrices {{ BA(n)}, { BB(n)}, BC} with property (4.6) yield an

alternative coupled CPD of {X(n)_{} which is not related to {{A}(n)

}, {B(n)}, C} via

the intrinsic column scaling and permutation ambiguities.

In contrast to ordinary CPD, Proposition 4.5 does not prevent that for some

n∈ {1, . . . , N} the individual columns of the matrices A(n)" B(n)may be written as

linear combinations of its remaining column vectors.

4.3. Uniqueness conditions for common factor matrix. This subsection presents conditions that guarantee the uniqueness of the common factor C of the

coupled CPD of {X(n)_{} in (}_{4.1), even in cases where some of the remaining factor}

matrices{A(n)_{} and {B}(n)_{} contain all-zero column vectors. This is in contrast with}

ordinary CPD where kA(n)≥ 2 and k_B(n)≥ 2 are necessary conditions.

Proposition4.6is a variant of Theorem2.1for coupled CPD.

in (4.1). W.l.o.g. we assume that min(I1, J1)≥ min(I2, J2)≥ · · · ≥ min(IN, JN).

De-note Q =>N_n=1H [min (In, Jn)− R + rC− 2], where H [·] denotes the Heaviside step

function. Define G(m)=      Cm ) A(1)*_{" C} m ) B(1)* .. . Cm ) A(Q)*_{" C} m ) B(Q)*     ∈ C( !Q n=1CmInCmJn)×CmR_, (4.7) where m = R− rC+ 2. If = k (C)≥ 1, r(G(m)) = Cm R, (4.8)

(10)

then the coupled rank of{X(n)_{} is R and the factor matrix C is unique.}

Proof. The result is a technical variant of [15, Proposition 4.3]. It is provided in the supplementary material.

In the case that the common factor matrix C has full column rank, Proposition

4.6directly reduces to the following result. (Compare to Theorem2.4.)

Corollary 4.7. Consider the coupled PD ofX(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N},}

in (4.1). Let G(2) be defined as in (4.7). If

=

G(2) has full column rank,

(4.9)

If additionally some of the factor matrices in the set{A(n)_{} also have full column}

rank, then Corollary4.7further reduces to the following result. (Compare to Theorem

2.5.)

Corollary 4.8. Consider the coupled PD of {X(n)_{} in (}_{4.1). Consider also a}

subset S of{1, . . . , N} with card (S) = Q. W.l.o.g., we assume that S = {1, . . . , Q}.

If for some Q∈ {1, . . . , N}, we have

(4.10)        rC= R , r_A(n)= R ∀n ∈ {1, . . . , Q}, ∀r ∈ {1, . . . , R}, ∀s ∈ {1, . . . , R} \ r, ∃ n ∈ {1, . . . , Q} : k)-b(n)_r , b(n)_s .*= 2 ,

Proof. Due to Corollary4.7we know that the coupled rank of{X(n)_{} is R and}

the factor matrix C is unique. We assume that for some Q∈ {1, . . . , N} the matrix

(4.11) G(2)=C)C2 ) A(1)*" C2 ) B(1)**T, . . . ,)C2 ) A(Q)*" C2 ) B(Q)**T DT

has full column rank. As in ordinary CPD [47], we can premultiply each A(n) by a

nonsingular matrix without affecting the rank or the uniqueness of the coupled CPD

of{X(n)_{}. Hence, w.l.o.g. we can set A}(n)₌'_I

R, 0TIn−R,R

(T

. Likewise, as in ordinary

CPD [45], the premultiplication of A(n)by a nonsingular matrix does not affect the

rank of G(2). The problem of determining the rank of G(2) reduces to finding the

rank of H =        C2 EC IR 0I1−R,R DF " C2 ) B(2)* .. . C2 EC IR 0IQ−R,R DF " C2 ) B(Q)*        .

After removing the all-zero row-vectors of H we need to find the rank of B H =C)IR(R−1) 2 " C2 ) B(1)**T, . . . ,)IR(R−1) 2 " C2 ) B(Q)**T DT .

(11)

506 MIKAEL SØRENSEN AND LIEVEN DE LATHAUWER Note that C2(B(n)) = [d(n)1,2, . . . , d(n)1,R, d (n) 2,3, . . . , d(n)2,R, . . . , d (n) R−2,R−1, d(n)R−2,R, d(n)R−1,R],

where d(n)p,q = C2([b(n)p , bq(n)]) ∈ CJn(Jn−1)/2. Note also that BH corresponds to a

row-permuted version of a block-diagonal matrix holding the column vectors {Bdp,q}

defined as Bdp,q = [d(1)Tp,q , . . . , d(N )Tp,q ]T ∈ C(

!N

n=1Jn(Jn−1)/2) _{on its block-diagonal. It}

is now clear that BH has full column rank if for every pair (r, s)∈ {1, . . . , R}2 _with

r -= s there exists an n ∈ {1, . . . , Q} such that ω(d(n)r,s) ≥ 1. Equivalently, for every

pair (r, s) ∈ {1, . . . , R}2 _{with r}_{-= s there should exist an n ∈ {1, . . . , Q} such that}

k([b(n)_r , b(n)_s ]) = 2.

In the case where C has full column rank we have the following necessary and

sufficient uniqueness condition for the common factor. (Compare to Theorem2.3.)

in (4.1). Define E(n)(w) and Ω as in (4.4). Assume that C has full column rank. The

coupled rank of{X(n)_{} is R and the factor matrix C is unique if and only if condition}

(4.5) is satisfied.

Proof. The necessity of condition (4.5) has already been demonstrated in

Propo-sition4.5. Let us now prove the sufficiency of condition (4.5) in the case where C has

full column rank. Note that (4.5) implies that F has full column rank. Indeed, if F

is rank deficient, then there exists a vector x∈ CR _{with property ω(x)}_{≥ 2 such that}

>R

r=1xrfr= 0. This will contradict (4.5). Let{{ BA

(n)

}, { BB(n)}, BC} denote the factor

matrices of an alternative coupled CPD ofX(n)_{, n}_{∈ {1, . . . , N}, where B}_A(n)_{∈ C}In× "R_,

B

B(n)∈ CJn× "R_{, and B}_C_{∈ C}K× "R _{with B}_R_{≤ R. Further, let B}_F_{∈ C}(!Nn=1InJn)× "R_denote

the alternative F constructed from{ BA(n)} and { BB(n)} such that

(4.12) X = FCT= BF BCT.

Since F has full column rank and C has full column rank by assumption, we know

from (4.12) that R = BR, that BC and BF have full column rank, and that range (C) =

range)CB*.

We obtain from (4.12) the relation

(4.13) FH = BF,

where H = CT( BCT)†_{∈ C}R×R_{is nonsingular. This may be expressed in a columnwise}

manner as (4.14) Ba(n)r ⊗ Bb (n) r = R , s=1 hsr ) a(n) s ⊗ b(n)s * , r∈ {1, . . . , R}, n ∈ {1, . . . , N} .

Combination of (4.5) and (4.14) now yields that the nonsingular matrix H has exactly

one nonzero entry in every column. This implies that H = PD, where P∈ CR×R_{is a}

permutation matrix and D∈ CR×R_{is a nonsingular diagonal matrix. From (4.12) we}

obtain that BC = CPD−1. We conclude that the common factor C is unique.

While Proposition 4.9provides a necessary and sufficient condition for the case

where C has full column rank, Corollaries 4.7 and 4.8 may be easier to check in

practice.

4.4. Sufficient uniqueness conditions for coupled CPD. We first present a

condition in Proposition4.10and Theorem4.11for the case where at least one of the

(12)

involved CPDs is unique. Next, in Theorem4.12we extend Theorem2.2to coupled

CPD. It is a more relaxed condition than Proposition4.10and Theorem 4.11since

it requires only that the overall coupled CPD be unique, i.e., none of the individual

CPDs are required to be unique. In Corollary 4.13 and Theorem 4.15 we extend

Theorems2.3and2.4to the coupled CPD case in which the common factor matrix

has full column rank. Finally, in Corollary 4.14we extend Theorem2.5to coupled

CPD. Table2summarizes the organization and structure.

Table 2

Relations between uniqueness conditions for the single CPD and coupled CPD for different rank properties of the common factor matrix C. The coupled CPD, case 1, corresponds to the cases where one of the individual CPDs is unique, while the coupled CPD, case 2, corresponds to the cases where none of the individual CPDs are required to be unique. In the case where C has full column rank, we further distinguish between sufficient (S) conditions and necessary and sufficient (N and S) conditions.

(S) (N and S) (S) (S)

k (C)≥ 2 r (C) = R r (C) = R r (C) = R

Single CPD Thm. 2.2 Thm.2.3 Thm. 2.4 Thm. 2.5

Coupled CPD, case 1 Thm. 4.11 Prop.4.10 Prop.4.10 Cor. 4.14

Coupled CPD, case 2 Thm. 4.12 Thm.4.15 Cor. 4.13 Cor. 4.14

in (4.1). If4

∃ n ∈ {1, . . . , N} : the rank of X(n)_{is R and the CPD of} _X(n)_{is unique,}

and if C has full column rank, then the coupled rank of {X(n)_{} is R and the coupled}

CPD of {X(n)_{} is unique.}

Proof. If there exists an integer n∈ {1, . . . , N} such that the rank of X(n)_{is R and}

the CPD of X(n) _{is unique, then obviously the common factor matrix C is unique.}

Compute Y(n) = X(n)(CT)†; then the remaining factor matrices are obtained by

recognizing that the columns of Y(n)are vectorized rank-1 matrices:

min a(n)r ,b(n)r G G Gy(n)r − a(n)r ⊗ b(n)r G G G2 F, r∈ {1, . . . , R}, n ∈ {1, . . . , N}.

Hence, the coupled CPD of {X(n)_{} is unique and the coupled rank of {X}(n)_}

is R.

Proposition4.10tells us that a coupled CPD in which the common factor matrix

has full column rank is unique if one of the involved CPDs is unique. This simple observation already demonstrates that a coupled CPD can be unique even if some of

the involved CPDs are individually nonunique. For instance, Proposition 4.10does

not prevent in the coupled CPD that some of the Khatri–Rao products are rank deficient, which is not allowed in the ordinary CPD. As an example, we consider

X = H X(1)₍₁₎ X(2)₍₁₎ I = C A(1)" B(1) A(2)" B(2) D CT_,

4_{As an example, Theorem}_2.4_{states that if r(C}

2(A(n))" C2(B(n))) =R(R−1)₂ , then the rank of

X(n) _{is R and the CPD of}_X(n)_{is unique. Alternatively, Theorem}_2.3_{states that if r(E}(n)_{(w)) =}

!R

r=1wra(n)r b(n)Tr ≥ 2 for all w ∈ Ω = {x ∈ CR

"

" ω(x) ≥ 2}, then the rank of X(n) _{is R and the}

CPD ofX(n)_{is unique.}

(13)

where A(1)∈ C3×4_{, A}(2)

∈ C3×4_{, B}(1)

∈ C3×4_{, B}(2)

∈ C3×4_{, and C}_{∈ C}4×4_{. Further,}

let a(2)1 ⊗ b(2)1 = a2(2)⊗ b(2)2 ; then generically r(A(2)" B(2)) = 3, and consequently the

CPD ofX(2) _{is not unique [30]. However, Proposition}_4.10_{tells us that the coupled}

CPD ofX(1) _and_X(2)_{is generically unique.}

Theorem4.11considers the more general case where C does not necessarily have

full column rank.

Theorem 4.11. Consider the coupled PD of X(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N},}

in (4.1). Let Sndenote a subset of{1, . . . , R}, and let Snc={1, . . . , R}\Sndenote the

complementary set. Stack the columns of C with index in Snin C(Sn)∈ CK×card(Sn)

and stack the columns of C with index in Sc

n in C(S

c

n)_{∈ C}K×(R−card(Sn))_{. Stack the}

columns of A(n) (resp., B(n)) in the same order such that A(n,Sn) _{∈ C}In×card(Sn)

(resp., B(n,Sn) _{∈ C}Jn×card(Sn)_{) and A}(n,Snc) ∈ CIn×(R−card(Sn)) _{(resp., B}(n,Sn) _∈

CJn×(R−card(Sn))_{) are obtained. If}5

∃ n ∈ {1, . . . , N} : the rank of X(n)_{is R and CPD of}_X(n) _{is unique,}

(4.15a)

and for all n∈ {1, . . . , N} there exist an index set Sn with 0≤ card (Sn)≤ rC such

that C(Sn) _{has full column rank and}

=

B(n,Snc) _{has full column rank,}

r)-P_C(Sn)C(S c n)_{" A}(n,Snc)_{, P} C(Sn)c(S c n) r ⊗ IIn .* = αn ∀r ∈ Snc, (4.15b) where αn= In+ R− card (Sn)− 1, or =

A(n,Snc) _{has full column rank,}

r)-P_C(Sn)C(S c n)_{" B}(n,Snc)_{, P} C(Sn)c(S c n) r ⊗ IJn .* = βn ∀r ∈ Snc, (4.15c)

where βn= Jn+R−card (Sn)−1, then the coupled rank of {X(n)} is R and the coupled

CPD of{X(n)_{} is unique. Generically, condition (4.15b) or (4.15c) is satisfied if for}

all n∈ {1, . . . , N} we have = R≤ min)Vn+ min (K, R) ,Vn(Wn−1)+W_W_nn(K−1)+1 * when Vn< R , R≤ (K − 1)Wn+ 1 when Vn≥ R , (4.16)

where Vn= max(In, Jn) and Wn= min(In, Jn).

Proof. We assume that the rank ofX(p) _{is R and the CPD of}_X(p) _{is unique for}

some p ∈ {1, . . . , N}. The overall uniqueness of the CPD of X(p) _{implies that the}

common factor matrix C is unique with property k (C) ≥ 2. We now consider the

individual CPDs of the tensors{X(n)_{} with matrix representations}

X(n)₍₁₎ =)A(n)" B(n)*CT, n∈ {1, . . . , N},

as CPDs with a known factor matrix. We know from [40, Theorem 4.8] that the CPD

of the tensor X(n) _{with known factor C is unique if condition (4.15b) or (4.15c) is}

5_{As an example, if the conditions stated in Theorem}_2.2_{are satisfied for some p}_{∈ {1, . . . , N} in}

which the roles of A(p)_{, B}(p)_{, and C may be interchanged, then the rank of}_X(p)_{is R and the CPD}

ofX(p)_{is unique.}

(14)

satisfied. We also know from [40, Theorem 4.8] that the CPD of the tensorX(n)_with known factor C is generically unique if conditions (4.16) are satisfied. We conclude

that the coupled CPD of {X(n)_{} linked via the matrix C is unique and the coupled}

rank of{X(n)_{} is R.}

Theorem4.11tells us that a coupled CPD is unique under more relaxed conditions

than the individually involved ordinary CPDs even in cases where C does not have full column rank. This also means that some of the involved CPDs are allowed to be individually nonunique. As an example, we consider

X = H X(1)₍₁₎ X(2)₍₁₎ I = C A(1)" B(1) A(2)" B(2) D CT, where A(1) ∈ C4×5_{, A}(2) ∈ C4×5_{, B}(1) ∈ C4×5_{, B}(2) ∈ C4×5_{, and C} _{∈ C}4×5_.

Furthermore, let b(2)₁ = b(2)₂ ; then generically k(B(2)) = 1 and consequently the

CPD of X(2) _{is not unique (see, e.g., [44]). Since C does not have full column rank,}

Proposition 4.10does not apply. However, Theorem4.11 tells us that the coupled

CPD ofX(1) _and_X(2) _{is generically unique. Note that this result is not obtained by}

inverting C as in the proof of Proposition4.10.

Theorem 4.12. Consider the coupled PD of X(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N}}

in (4.1). Let Sndenote a subset of{1, . . . , R}, and let Snc={1, . . . , R}\Sndenote the

complementary set. Stack the columns of C with index in Snin C(Sn)∈ CK×card(Sn)

and stack the columns of C with index in Sc

n in C(S

c

n)∈ CK×(R−card(Sn))_{. Stack the}

columns of A(n) (resp., B(n)) in the same order such that A(n,Sn) _{∈ C}In×card(Sn)

(resp., B(n,Sn) _{∈ C}Jn×card(Sn)_{) and A}(n,Snc) _{∈ C}In×(R−card(Sn)) _{(resp., B}(n,Sn) _∈

CJn×(R−card(Sn))_{) are obtained.}

If C is unique6_{with property k (C)}_{≥ 2, and if for all n ∈ {1, . . . , N} there exists}

an index set Sn with 0 ≤ card (Sn) ≤ rC such that C(Sn) has full column rank and

condition (4.15b) or (4.15c) is satisfied, then the coupled rank of{X(n)_{} is R and the}

coupled CPD of{X(n)_{} is unique.}

Proof. The necessity of k (C)≥ 2 has already been mentioned in Proposition4.1.

Assuming that the common factor matrix C is unique with k (C)≥ 2, we can consider

the individual CPDs of the tensors{X(n)_{} as CPDs with a known factor matrix C.}

We know from [40, Theorem 4.8] that the CPD of the tensorX(n)_{with known factor}

C is unique if condition (4.15b) or (4.15c) is satisfied. We can now conclude that the

coupled CPD of {X(n)_{} linked via the matrix C is unique and the coupled rank of}

{X(n)_{} is R.}

Note that Theorem 4.12, unlike Proposition 4.10 and Theorem 4.11, does not

assume that the CPD of one of the individual tensorsX(n)_{is unique. As an example,}

we consider X = H X(1)₍₁₎ X(2)₍₁₎ I = C A(1)" B(1) A(2)" B(2) D CT, where A(1)∈ C4×5_{, A}(2) ∈ C4×5_{, B}(1) ∈ C4×5_{, B}(2) ∈ C4×5_{, and C}_{∈ C}4×5_{. Further,}

let b(1)₁ = b(1)₂ and b₃(2) = b(2)₄ ; then generically k(B(1)) = 1 and k(B(2)) = 1.

Consequently the individual CPDs of X(1) _and _X(2) _{are not unique, which means}

6_{As an example, if the conditions (}_4.8_{) stated in Proposition}_4.6_{are satisfied, then the coupled}

rank of{X(n)_{} is R and the common factor matrix C is unique.}

(15)

that neither Proposition 4.10 nor Theorem 4.11 can be used to establish coupled

CPD uniqueness. However, Proposition4.6, together with Theorem4.12, tells us that

the coupled CPD ofX(1) _and_X(2)_{is generically unique.}

The above example explains that in some cases it is better to first establish

uniqueness of the common factor matrix C via, for instance, Proposition 4.6, and

thereafter establish coupled CPD uniqueness of {X(n)_{} by treating the individual}

CPDs of{X(n)_{} as CPDs with a known factor C. However, in other cases it is better}

to first establish CPD uniqueness of one of the individual tensors, sayX(p)_{, via, for}

instance, Theorem 2.2, and thereafter establish coupled CPD uniqueness of {X(n)_}

by treating the individual CPDs of{X(n)_{} as CPDs with a known factor C. As an}

example, we consider X = H X(1)₍₁₎ X(2)₍₁₎ I = C A(1)" B(1) A(2)" B(2) D CT, where A(1)∈ C4×6_{, A}(2) ∈ C4×6_{, B}(1) ∈ C6×6_{, B}(2)

∈ C5×6_{, and C}_{∈ C}3×6_{. For this}

problem Proposition4.6cannot be used since the matrix G(5)is not defined. On the

other hand, Theorem2.2, together with Theorem4.11, tells us that the coupled CPD

ofX(1) _and_X(2)_{is generically unique.}

Let us now assume that the common factor matrix C has full column rank. In

that case, Theorem4.12reduces to Corollary4.13, which in turn can be understood

as an extension of Theorem 2.4to coupled CPD. Corollary4.13can also be seen as

a generalization of Proposition4.10to the case where none of the involved CPDs are

required to be unique.

Corollary 4.13. Consider the coupled PD ofX(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N},}

in (4.1). Let G(2) be defined as in (4.7). If

=

G(2) has full column rank,

(4.17)

then the coupled rank of{X(n)_{} is R and the coupled CPD of {X}(n)_{} is unique.}

Proof. Due to Corollary4.7we know that the coupled rank of{X(n)_{} is R and}

the common factor matrix C is unique when condition (4.17) is satisfied. Assuming that C has full column rank, the remaining factors follow from rank-1 approximations

as explained in the proof of Proposition4.10.

If additionally some of the factor matrices in the set{A(n)} also have full column

rank, then we may use the Corollary4.14following, which can be understood as an

extension of Theorem2.5to coupled CPD.

Corollary 4.14. Consider the coupled PD of{X(n)_{} in (}_{4.1). Consider also a}

subset S of{1, . . . , N} with card (S) = Q. W.l.o.g., we assume that S = {1, . . . , Q}.

If for some Q∈ {1, . . . , N} we have

(4.18)        rC= R , rA(n)= R ∀n ∈ {1, . . . , Q}, ∀r ∈ {1, . . . , R}, ∀s ∈ {1, . . . , R} \ r, ∃ n ∈ {1, . . . , Q} : k)-b(n)_r , b(n)s .* = 2 ,

then the coupled rank of{X(n)_{} is R and the coupled CPD of {X}(n)_{} is unique.}

(16)

Proof. Due to Corollary4.8we know that the coupled rank of{X(n)_{} is R and} the common factor C is unique. Since C is unique and has full column rank, the remaining factors follow from rank-1 approximations as explained in the proof of

Proposition4.10.

Comparison of Theorem 2.5 with condition (4.19) shows that the coupling has

relaxed the uniqueness condition.

Finally, we generalize the necessary and sufficient uniqueness condition (2.5)

stated in Theorem2.3to the coupled CPD problem.

Theorem 4.15. Consider the coupled PD ofX(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N},}

in (4.1). Assume that C has full column rank. The coupled rank of{X(n)_{} is R and}

the coupled CPD of {X(n)_{} is unique if and only if}

∀w ∈ Ω , ∃ n ∈ {1, . . . , N} : r)E(n)(w)*≥ 2 ,

(4.19)

where E(n)and Ω are defined as in (4.4).

Proof. Due to Proposition4.9we know that the coupled rank of{X(n)_{} is R and}

the common factor C is unique if and only if the condition (4.19) is satisfied. Since the common factor C is unique and has full column rank, the remaining factors follow

from rank-1 approximations as explained in the proof of Proposition4.10.

As in the case of ordinary CPD, the conditions in Theorem4.15may be harder

to check than those in Corollary4.13or Corollary4.14.

4.5. Extension to tensors of arbitrary order. The uniqueness properties of the CPD of higher-order tensors are not just a straightforward generalization of those for third-order tensors. As a matter of fact, they are conceptually quite different. We note that the idea of simultaneously considering different matrix representations of the CPD of a single higher-order tensor for the case where one factor matrix has full column rank was first considered in [46]. As our contribution, first we generalize the idea to cases where none of the involved factor matrices are required to have full column rank. In fact, based on the connection between coupled CPD and higher-order tensors, we even demonstrate that the (coupled) CPD of a higher-order tensor(s) can be unique despite collinearities in all factor matrices. Second, we extend the coupled CPD framework in subsections 4.3 and 4.4 to tensors of arbitrary order. More specifically, we demonstrate that by taking into account both the coupled and higher-order structures, improved uniqueness conditions are obtained.

We consider coupled PDs of X(n) _{∈ C}I1,n×···×IMn,n×K_{, n} _{∈ {1, . . . , N}, of the}

form X(n)= R , r=1 a(1,n)r ◦ · · · ◦ a(Mr n,n)◦ cr, n∈ {1, . . . , N}. (4.20)

The factor matrices are

A(m,n)=- a(m,n)1 , . . . , a(m,n)R . ∈ CIm,n×R_, _m_{∈ {1, . . . , M} n}, n ∈ {1, . . . , N}, C =' c1, . . . , cR ( ∈ CK×R.

The coupled PD of tensors{X(n)_{} of arbitrary order is visualized in Figure}_2.

Note that the tensorsX(n)_{may have different orders M}

nand different sizes Im,n.

As a special case, we have the case of a single tensor (N = 1) of order M ≥ 4. Our

key idea is that, if one or more tensors have order Mn ≥ 4, then we may combine

(17)

512 MIKAEL SØRENSEN AND LIEVEN DE LATHAUWER X(1) = ⊗ · · · ⊗ c1 b(M1,1) 1 a(1,1)1 +· · · + ⊗ · · · ⊗ cR b(M1,1) R a(1,1)R .. . X(N ) = ⊗ · · · ⊗ c1 b(MN,N ) 1 a(1,N )1 +· · · + ⊗ · · · ⊗ cR b(MN,N ) R a(1,N )R

Fig. 2. Coupled PD of tensorsX(1)_{, . . . ,}_X(N)_{in which b}(Mn,n)

r = a(2,n)r ⊗ · · · ⊗ a(Mr n,n).

the coupled third-order CPD results discussed in subsections4.2–4.4with results for

higher-order tensors [46]. More precisely, uniqueness results may be obtained by reducing the associated higher-order PDs to coupled third-order PDs. Namely, we simultaneously consider several matrix representations of the form

(4.21) X(w,n)=   1 p∈Γw,n A(p,n)" 1 q∈Υw,n A(q,n)   CT ₌)_A[w,n] " B[w,n]*CT, where A[w,n] ₌ J p∈Γw,nA (p,n) _{∈ C}_Iˆ_w,n_×R with ˆIw,n = K_p∈Γw,nIp,n, B [w,n] ₌ J q∈Υw,nA (q,n) ∈ CJˆw,n×R _{with ˆ}_J

w,n =Kq∈Υw,nIq,n, and the sets Γw,n and Υw,n

have properties Γw,n6Υw,n={1, 2, . . . , Mn} and Γw,n7Υw,n =∅. Let us assume

that there are Wn sets{Γw,n} and {Υw,n} for each n ∈ {1, . . . , N}. We collect the

matrices{X(w,n)} into the matrix

X =-X(1)T, X(2)T, . . . , X(N )T.T, X(n)=-X(1,n)T, X(2,n)T, . . . , X(Wn,n)T.T_, such that (4.22) X = FCT, F =      F(1) F(2) .. . F(N )     , F (n)₌      A[1,n]" B[1,n] A[2,n]" B[2,n] .. . A[Wn,n] " B[Wn,n]     .

We now ignore the Khatri–Rao structure of A[w,n] and B[w,n] and treat (4.22) as a

matrix representation of a set of coupled third-order CPDs.

For establishing uniqueness, we may resort to the different results in subsection 4.4. For the results that make use of G(R−rC+2)_{, we may work with the}

follow-ing generalization. We limit ourselves to the LWn sets {Γw,n} and {Υw,n} for each

n ∈ {1, . . . , N} in which min(K_p∈Γw,nIp,n,

K

q∈Υw,nIq,n) ≥ R − rC+ 2. Define

(18)

G(R−rC+2,#Wn)_{∈ C}( !Wn# w=1C R_−rC+2 $ p∈Γw,n Ip,nC R_−rC+2 $ q∈Υw,n Iq,n)×C R_−rC+2 R as follows: G(R−rC+2,#Wn)₌      CR−rC+2 ) A[1,n]*" CR−rC+2 ) B[1,n]* .. . CR−rC+2 ) A[#Wn,n] * " CR−rC+2 ) B[#Wn,n] *     , n∈ {1, . . . , N}.

The following matrix generalizes G(R−rC+2) _{in (4.7):}

G(m)=     G(m,#W1) .. . G(m,#WN)     ∈ C !N n=1( !Wn# w=1C$mp∈Γw,n Ip,nC m $ q∈Υw,n Iq,n)×C m R , (4.23)

where m = R− rC+ 2. In the extensions of Theorems4.11and4.12, it suffices to

check condition (4.15b) or (4.15c) for one of the LWn matrix representations.

As an example, consider the fourth-order tensorsX(n)_{∈ C}I×J×K×L_{, n}_{∈ {1, 2},}

with PDs, (4.24) X(n)₌ R , r=1 a(n) r ◦ b(n)r ◦ c(n)r ◦ dr, n∈ {1, 2}, in which I = 4, J = 5, K = 4, L = 3, R = 4, a(1)₂ = a(1)₃ , b(1)₁ = b(1)₃ , c(1)₁ = c(1)₂ ,

c(1)₃ = c(1)₄ , a(2)₁ = a(2)₄ , b₁(2) = b(2)₂ = b(2)₃ , and c(2)₃ = c(2)₄ . Note that generically

k_A(1) = k_B(1) = k_C(1) = k_A(2) = k_B(2) = k_C(2) = 1 and kD ≥ 2. The existing

CPD uniqueness conditions for higher-order tensors stated in [33,46,5] do not apply.

Similarly, the uniqueness conditions for coupled CPD based on third-order tensors (i.e., if we ignore the fourth-order structure by combining two modes) discussed in

subsection4.4do not apply either. We now explain that by simultaneously exploiting

both the higher-order and coupled structures of the PDs in (4.24), coupled CPD uniqueness can be established. Generically D has rank 3. Denote

G(n)=      C3 ) A(n)*" C3 ) B(n)" C(n)* C3 ) B(n)*" C3 ) C(n)" A(n)* C3 ) C(n)*" C3 ) A(n)" B(n)*     .

Using Lemma 4.3 it can be verified that, although the matrices G(1) and G(2) are

rank deficient, the matrix G = [G(1)T, G(2)T]T _{generically has full column rank.}

Thus, Proposition4.6tells us that by taking into account the higher-order structure

and the coupling betweenX(1)_and_X(2)_{, uniqueness of D can be established. Using}

Lemma 4.3it also can be verified that E(1) = B(1)" C(1) and E(2) = B(2)" C(2)

generically have full column rank and that the matrix [D" A(n), dr⊗ II] generically

has a one-dimensional kernel for every r ∈ {1, 2, 3, 4} and n ∈ {1, 2}. Invoking

Theorem4.12we can conclude that the coupled CPD ofX(1) _and_X(2)_{(in which the}

Khatri–Rao structure of E(1) and E(2) has been ignored) is unique. Consequently,

the factors {A(n)}, {E(n)}, and D are unique despite the collinearities in the factor

matrices. Finally, the rank-1 structure of the columns of E(n)= B(n)" C(n)_implies

that{B(n)} and {C(n)} are also unique.

(19)

We now demonstrate that (coupled) CPD of higher-order tensors can even be unique despite collinearities in all factor matrices; i.e., the factor matrices of the PDs

of{X(n)_{} in (}_{4.20) may satisfy k}

C = 1 and kA(m,n) = 1 for all m∈ {1, . . . , M_n}, for

all n ∈ {1, . . . , N}. For this reason Proposition4.1does not extend to higher-order

tensors in an obvious manner. Note that the existing CPD uniqueness conditions for

higher-order tensors stated in [33,46,5] do not apply in this case. As an example,

consider N = 1 and the PD ofX ∈ CI×J×K×L_{given by}

(4.25) X =

R , r=1

ar◦ br◦ cr◦ dr,

in which a1= a2, b1= b3, c3= c4, d2= d3, and I = J = K = L = R = 4. Since

kA = kB= kC= kD= 1 the results discussed in subsection4.4cannot be applied in

a direct manner. We will establish uniqueness by reducing the fourth-order PD to a

coupled third-order PD and by following a deflation argument. Generically rA = 3.

Using Lemma4.3it can be verified that

  CCRR−r−rAA+2+2(B)(C)" C" CRR−r−rAA+2+2(C(B" D)" D) CR−rA+2(D)" CR−rA+2(B" C)   =   CC33(B)(C)" C" C33(C(B" D)" D) C3(D)" C3(B" C)  

generically has full column rank. Proposition 4.6implies that the factor matrix A

is unique. The next step is to demonstrate that the rank-1 term a4◦ b4◦ c4◦ d4is

unique. The PD ofX in (4.25) has matrix representation

X = (A" B) (C " D)T = (A" B) ET, E = C" D.

Lemma4.3can also tell us that E generically has full column rank and that generically

r ([A" B, ar⊗ IJ]) = R + J− Γr(A) ,

where Γ1(A) = Γ2(A) = 2 and Γ3(A) = Γ4(A) = 1. Since Γ4(A) = 1, [41,

Proposition 5.2] tells us that the vectors b4 and e4 are unique. As a consequence of

the rank-1 structure of e4we also know that c4and d4 are unique. We subtract the

unique rank-1 term,

Y = X − a4◦ b4◦ c4◦ d4=

3 , r=1

ar◦ br◦ cr◦ dr.

The PD ofY has the factor matrices A(2) _{= [a}

1, a2, a3], B(2) = [b1, b2, b3], C(2) =

[c1, c2, c3], and D(2)= [d1, d2, d3]. The matrix C(2)generically has full column rank.

Using Lemma4.3it can be verified that

     C2 ) A(2)*" C2 ) B(2)" D(2)* C2 ) B(2)*" C2 ) D(2)" A(2)* C2 ) D(2)*" C2 ) A(2)" B(2)*     

generically has full column rank. Due to Corollary4.13 we now know that the

re-maining factors A(2), B(2), C(2), and D(2) are unique.

More generally, for the case of coupled CPD of higher-order tensors it is possible in some cases to establish coupled CPD uniqueness via a sequence of deflation steps. See the supplementary material for a brief discussion.

(20)

4.6. Coupled matrix-tensor factorization. A simple case of coupled decom-positions is the coupled matrix-tensor factorization, admitting a matrix representation of the form X = H X(1)₍₁₎ X(2)₍₁₎ I = C A(1)" B(1) A(2) D CT. (4.26)

Because of its simplicity, (4.26) is common in the analysis of multiview data [49,1,18,

2]. Note that coupled matrix-tensor factorization is a special case of coupled CPD.

Indeed, define B(2)= [1, 1, . . . , 1]∈ C1×R_{; then (4.26) can also be written as}

X = H X(1)₍₁₎ X(2)₍₁₎ I = C A(1)" B(1) A(2)" B(2) D CT_,

which is of form (4.2), so that several of the results presented in this paper can be applied.

A notable limitation of the coupled matrix-tensor factorization (4.26) is that

in order to guarantee the uniqueness of A(2), the common factor C must have full

column rank. More precisely, if C has full column rank, then A(2)follows from A(2)=

X(2)₍₁₎(CT)†_{. On the other hand, if C does not have full column rank, then there will}

be an intrinsic indeterminacy between A(2)and C. Indeed, when C does not have full

column rank, the null space of C is not empty. Any vector y∈ ker (C) will generate an

alternative coupled matrix-tensor factorization X in which X(2)₍₁₎= (A(2)+ xyT_)CT_,

where x∈ CI2_.

5. Coupled CPD with collinearity in common factor. We consider coupled

PDs ofX(n)_{∈ C}In×Jn×K_{, n}_{∈ {1, . . . , N}, of the following form:}

X(n)= R , r=1 L_,r,n l=1 a(r,n)_l ◦ b(r,n)l ◦ c(r)= R , r=1 ) A(r,n)B(r,n)T*◦ c(r). (5.1)

On one hand, this is an extension of (3.1) to the coupled case. On the other hand, it is a variant of the coupled PD in (4.1) for collinearity constrained C. If the matrices A(r,n)B(r,n)T have rank Lr,n, then (5.1) is a coupled decomposition into multilinear

rank-(Lr,n, Lr,n, 1) terms. We will briefly call this a coupled block term decomposition

(BTD). The coupled BTD of the third-order tensors{X(n)_{} is visualized in Figure}_3.

The coupled multilinear rank-(Lr,n, Lr,n, 1) tensors in (5.1) can be arbitrarily

per-muted, and the vectors/matrices within the same coupled multilinear

rank-(Lr,n, Lr,n, 1) tensor can be arbitrarily scaled provided the overall coupled multilinear

rank-(Lr,n, Lr,n, 1) term remains the same. We say that the coupled BTD is unique

when it is only subject to the mentioned indeterminacies.

In this section we limit the exposition to third-order tensors. Analogous to the

coupled CPD in subsection4.5, the coupled BTD and its associated properties can

be extended to tensors of arbitrary order. In the supplementary material we briefly explain that it can be reduced to a set of coupled BTDs of third-order tensors.

(21)

516 MIKAEL SØRENSEN AND LIEVEN DE LATHAUWER X(1) = c(1) A(1,1) L1,1 B(1,1)T +· · · + c(R) A(R,1) LR,1 B(R,1)T .. . X(N ) = c(1) A(1,N ) L1,N B(1,N )T +· · · + c(R) A(R,N ) LR,N B(R,N )T

Fig. 3. Coupled BTD of the third-order tensorsX(1)_{, . . . ,}_X(N)_.

5.1. Matrix representation. Denote Rtot,n=>Rr=1Lr,n. The coupled PD of

the tensors{X(n)_{} of the form (5.1) has the factor matrices}

A(r,n)=- a(r,n)₁ , . . . , a_L(r,n)_r,n .∈ CIn×Lr,n_, A(n)=' A(1,n), . . . , A(R,n) (∈ CIn×Rtot,n_, _n_{∈ {1, . . . , N},} B(r,n)=- b(r,n)₁ , . . . , b(r,n)_L_r,n .∈ CJn×Lr,n_, B(n)=' B(1,n), . . . , B(R,n) (∈ CJn×Rtot,n_, _n_{∈ {1, . . . , N},} C(red)=' c(1)_{, . . . , c}(R) (_{∈ C}K×R_, (5.2) C(n)=-1T Lr,n⊗ c (1)_{, . . . , 1}T LR,n⊗ c (R)._{∈ C}K×Rtot,n_, (5.3)

and matrix representation

X =-X(1)T₍₁₎ , . . . , X(N )T₍₁₎ .T= F(red)C(red)T∈ C(!N n=1InJn)×K_, (5.4) where F(red)∈ C(!N n=1InJn)×R_{is given by} F(red)=      Vec)B(1,1)A(1,1)T* · · · Vec)B(R,1)A(R,1)T* .. . . .. ... Vec)B(1,N )A(1,N )T* · · · Vec)B(R,N )A(R,N )T*     . (5.5)

Denote Lr,max= maxn∈{1,...,N}Lr,nand Rext=>Rr=1Lr,max, where “ext” stands

for extended. By appending all-zero column vectors to A(r,n) and B(r,n), (5.4) may

also be expressed as

X = F(ext)C(ext)T ∈ C(!N

n=1InJn)×K_,

(5.6)

(22)

where F(ext)= HE B A(1)" BB(1) FT , . . . , E B A(N )" BB(N ) FTIT ∈ C(!N n=1InJn)×Rext_, (5.7) C(ext)=-1T L1,max⊗ c (1)_{, . . . , 1}T LR,max⊗ c (R)._{∈ C}K×Rext_, (5.8) in which B A(r,n)=-A(r,n), 0In,(Lr,max−Lr,n) . ∈ CIn×Lr,max_, B A(n)= C B A(1,n), . . . , BA(R,n) D ∈ CIn×Rext_, _n_{∈ {1, . . . , N},} B B(r,n)=-B(r,n), 0Jn,(Lr,max−Lr,n) . ∈ CJn×Lr,max_, B B(n)= C B B(1,n), . . . , BB(R,n) D ∈ CJn×Rext_, _n_{∈ {1, . . . , N}.}

5.2. Uniqueness conditions for coupled CPD with collinearity in

com-mon factor. Let {{ 8A(r,n)}, { 8B(r,n)}, {8c(r)}} yield an alternative coupled BTD of

the tensors {X(n)_{} in (}_{5.1). We say that the coupled BTD of}_{X(n)_{} is unique if it}

is unique up to a permutation of the coupled multilinear rank-(Lr,n, Lr,n, 1) terms

{( 8A(r,n)B8(r,n)T)◦ 8c(r)} and up to the following indeterminacies within each term: 8

A(r,n)= α(r,n)A(r,n)Hr,n, 8B (r,n)

= β(r,n)B(r,n)H−1r,n, 8c(r)= γ(r)c(r),

where Hr,n ∈ CLr,n×Lr,n are nonsingular matrices and α(r,n), β(r,n), γ(r) ∈ C are

scalars satisfying α(r,n)_β(r,n)_γ(r)_{= 1, r}_{∈ {1, . . . , R}, n ∈ {1, . . . , N}. From (}_{5.4) it is}

clear that uniqueness requires k_C(red)≥ 2. From (5.4) it is also clear that F(red)must

have full column rank in order to guarantee the uniqueness of the coupled BTD of

{X(n)_{}. Proposition}_5.1_{below extends the necessary conditions stated in Propositions}

4.1,4.2, and4.5to coupled BTD.

Proposition 5.1. Consider the coupled PD of X(n)_{, n} _{∈ {1, . . . , N}, in (}_5.1).

Define E(n)(w) =>R_r=1wrA(r,n)B(r,n)T and Ω =

?

x∈ CR@_{@ ω(x) ≥ 2}A_{. If the coupled}

BTD of{X(n)_{} in (5.1) is unique, then}

(i) k_C(red)≥ 2,

(ii) F(red) has full column rank,

(iii) for all w∈ Ω , ∃ n ∈ {1, . . . , N} : r(E(n)(w)) > max_r|wr&=0Lr,n.

Proof. The proof is analogous to that of Propositions4.1,4.2, and4.5.

Proposition5.2tells us that this is generically true if F(red)has at least as many

rows as columns.

Proposition 5.2. Consider F(red)∈ C(!N

n=1InJn)×R_{given by (5.5). For generic}

matrices{A(r,n)} and {B(r,n)}, the matrix F(red) has rank min(>N_n=1InJn, R).

Proof. Due to Lemma4.3we just need to find one example for which the

propo-sition holds. Since the coupled CPD (4.1) is a particular case of (5.4), a

particu-lar example is the matrix F(red) in (4.3). (Formally, we take a(r,n)_l = 0In for all

l∈ {2, . . . , Lr,n}, for all r ∈ {1, . . . , R}, for all n ∈ {1, . . . , N}.) The proposition now

follows directly from Proposition4.4.

We will now discuss extensions of Theorems 4.11, 4.12, and 4.15 to the case

where the common factor matrix contains collinear components. The generalizations

(23)

of Proposition4.10and Corollary4.13follow immediately and are therefore not

con-sidered in this section.

Theorem 5.3 can be seen as a version of Theorem4.11 for the case where the

common factor matrix contains collinear columns.

Theorem 5.3. Consider the coupled PD ofX(n)_{, n}_{∈ {1, . . . , N}, in (}_{5.1). Let S}

n

denote a subset of{1, . . . , R}, and let Sc

n={1, . . . , R} \ Sndenote the complementary

set. Stack the columns of C(red)with index in Snin C(red,Sn)∈ CK×card(Sn)and stack

the columns of C(red) with index in Sc

n in C(red,S

c

n)∈ CK×(R−card(Sn))_{. Stack A}(r,n)

(resp., B(r,n) and C(r,n)) in the same order such that A(n,Sn)

∈ CIn×(

!

p∈SnLp,n)

(resp., B(n,Sn) _{∈ C}Jn×(!_p∈SnLp,n) _{and C}(n,Sn) _{∈ C}K×(!_p∈SnLp,n)_{) and A}(n,Snc) _∈

CIn×(!_p∈Sc nLp,nt)(resp., B(n,S c n)_{∈ C}Jn×( ! p∈ScnLp,n) and C(n,S c n)_{∈ C}K×( ! p∈ScnLp,n))

are obtained. Denote D(n,Scn)_{= P}

C(red,Sn)C(n,S c n)_{. If} 7     

∃ p ∈ {1, . . . , N} : the minimal number of rank-(Lr,p, Lr,p, 1)

terms inX(p) _{is R and the decomposition of}_X(p) _into

rank-(Lr,p, Lr,p, 1) terms is unique,

(5.9a)

and for every n∈ {1, . . . , N} there exists an index set Sn⊆ {1, . . . , R} with property

0≤ card (Sn)≤ rC(red), such that

  

B(n,Sc

n) _{has full column rank}

) r_B(n,Sc_n) =>_p ∈Sc nLp,n * , r)-D(n,Snc)" A(n,Scn)_{, d}(n,Snc) r ⊗ IIn .* = αr,n ∀r ∈ Snc, (5.9b) where αr,n= In+>_p∈Sc nLp,n− Lr,n, or   

A(n,Snc) _{has full column rank}

) r_A(n,Sc_{n )}=>_p ∈Sc nLp,n * , r)-D(n,Snc)_{" B}(n,Snc)_{, d}(n,Scn) r ⊗ IJn .* = βr,n ∀r ∈ Snc, (5.9c) where βr,n= Jn+>_p∈Sc

nLp,n− Lr,n, then the minimal number of coupled multilinear

rank-(Lr,n, Lr,n, 1) terms is R and the coupled BTD of{X(n)} is unique.

Proof. Assume that there exists an integer p∈ {1, . . . , N} such that the minimal

number of rank-(Lr,p, Lr,p, 1) terms inX(p) is R and the decomposition ofX(p) into

rank-(Lr,p, Lr,p, 1) terms is unique. Since C(red)is unique and the condition (5.9b) or

(5.9c) is satisfied for every n∈ {1, . . . , N}, we know from [41, Proposition 5.2] that

the coupled BTD of{X(n)_{} is unique and the minimal number of coupled multilinear}

rank-(Lr,n, Lr,n, 1) terms is R.

Theorem 5.4below can be interpreted as a version of Theorem4.12for the case

where the common factor matrix contains collinear columns.

Theorem 5.4. Consider the coupled PD ofX(n)_{, n}_{∈ {1, . . . , N}, in (5.1). Let S}

n

denote a subset of{1, . . . , R} and let Sc

n={1, . . . , R} \ Sndenote the complementary

7_{As an example, if r(C}

Rtot,p−r_C(red)+2(A(p))" CRtot,p−r_C(red)+2(B(p))) = C

Rtot,p−r_C(red)+2

Rtot,p ,

then Theorem 2.1 tells us that the rank ofX(p) _is!R

r=1Lr,p and the factor C(p) is unique up

to a column permutation and scaling. The uniqueness of the decomposition of X(p) _into

rank-(Lr,p, Lr,p, 1) terms now follows from condition (5.9b) or (5.9c), as explained by Proposition 5.2 in

[41].