DECOMPOSITION OF THIRD-ORDER TENSORS — PART II:
UNIQUENESS OF THE OVERALL DECOMPOSITION ∗
IGNAT DOMANOV† ‡ AND LIEVEN DE LATHAUWER† ‡
Abstract. Canonical Polyadic (also known as Candecomp/Parafac) Decomposition (CPD) of a higher-order tensor is a decomposition into a minimal number of rank-1 tensors. In Part I, we gave an overview of existing results concerning uniqueness and presented new, relaxed conditions that guarantee uniqueness of one factor matrix. In Part II we use these results to establish overall CPD uniqueness in cases where none of the factor matrices has full column rank. We obtain uniqueness conditions involving Khatri-Rao products of compound matrices and Kruskal-type conditions. We consider both deterministic and generic uniqueness. We also discuss uniqueness of INDSCAL and other constrained polyadic decompositions.
Key words. Canonical Polyadic Decomposition, Candecomp, Parafac, three-way array, tensor, multilinear algebra, Khatri-Rao product, compound matrix
AMS subject classifications. 15A69, 15A23
1. Introduction.
1.1. Problem statement. Throughout the paper F denotes the field of real or complex numbers; (·) ∗ , (·) T , and (·) H denote the conjugate, transpose, and conjugate transpose, respectively; r A , range(A), and ker(A) denote the rank, the range, and the null space of a matrix A, respectively; Diag(d) denotes a square diagonal matrix with the elements of a vector d on the main diagonal; span{f 1 , . . . , f k } denotes the linear span of the vectors f 1 , . . . , f k ; e R r denotes the r-th vector of the canonical basis of F R ; C n k denotes the binomial coefficient, C n k = n!/(k!(n − k)!); O m×n , 0 m , and I n are the zero m × n matrix, the zero m × 1 vector, and the n × n identity matrix, respectively.
We have the following basic definitions. A third-order tensor T = (t ijk ) ∈ F I×J ×K is rank-1 if there exist three nonzero vectors a ∈ F I , b ∈ F J and c ∈ F K such that T = a ◦ b ◦ c, in which “◦” denotes the outer product. That is, t ijk = a i b j c k for all values of the indices.
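As a numerical illustration (not part of the paper), the entry-wise definition t ijk = a i b j c k can be checked directly; the vectors below are arbitrary examples of ours.

```python
import numpy as np

# Sketch: a rank-1 tensor a ∘ b ∘ c has entries t[i, j, k] = a[i] * b[j] * c[k].
# np.einsum builds this outer product directly.
a = np.array([1.0, -2.0])          # a ∈ F^I, I = 2
b = np.array([3.0, 0.5, 1.0])      # b ∈ F^J, J = 3
c = np.array([2.0, 1.0])           # c ∈ F^K, K = 2

T = np.einsum('i,j,k->ijk', a, b, c)   # T = a ∘ b ∘ c, shape (2, 3, 2)

# Entry-wise definition agrees with the outer product:
assert T[1, 2, 0] == a[1] * b[2] * c[0]
```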
A Polyadic Decomposition (PD) of a third-order tensor T ∈ F I×J ×K expresses T as a sum of rank-1 terms:
T = Σ_{r=1}^R a r ◦ b r ◦ c r , (1.1)
where a r ∈ F I , b r ∈ F J , c r ∈ F K are nonzero vectors.
We call the matrices A = [a 1 · · · a R ] ∈ F I×R , B = [b 1 · · · b R ] ∈ F J×R and C = [c 1 · · · c R ] ∈ F K×R the first, second and third factor matrix of T , respectively. We also write (1.1) as T = [A, B, C] R .
∗ Research supported by: (1) Research Council KU Leuven: GOA-Ambiorics, GOA-MaNet, CoE EF/05/006 Optimization in Engineering (OPTEC), CIF1, STRT 1/08/23, (2) F.W.O.: (a) project G.0427.10N, (b) Research Communities ICCoS, ANMMM and MLDM, (3) the Belgian Federal Science Policy Office: IUAP P6/04 (DYSCO, “Dynamical systems, control and optimization”, 2007–2011), (4) EU: ERNSI.
† Group Science, Engineering and Technology, KU Leuven Campus Kortrijk, Etienne Sabbelaan 53, 8500 Kortrijk, Belgium (ignat.domanov, lieven.delathauwer@kuleuven-kulak.be).
‡ Department of Electrical Engineering (ESAT), SCD, KU Leuven, Kasteelpark Arenberg 10, postbus 2440, B-3001 Heverlee (Leuven), Belgium.
Definition 1.1. The rank of a tensor T ∈ F I×J ×K is defined as the minimum number of rank-1 tensors in a PD of T and is denoted by r T .
Definition 1.2. A Canonical Polyadic Decomposition (CPD) of a third-order tensor T expresses T as a minimal sum of rank-1 terms.
Note that T = [A, B, C] R is a CPD of T if and only if R = r T .
Let us reshape T into a matrix T ∈ F IJ ×K as follows: the (i, j, k)-th entry of T corresponds to the ((i − 1)J + j, k)-th entry of T. In particular, the rank-1 tensor a ◦ b ◦ c corresponds to the rank-1 matrix (a ⊗ b)c T , in which “⊗” denotes the Kronecker product. Thus, (1.1) can be identified with
T (1) := T = Σ_{r=1}^R (a r ⊗ b r )c r T = [a 1 ⊗ b 1 · · · a R ⊗ b R ]C T = (A ⊙ B)C T , (1.2)
in which “⊙” denotes the Khatri-Rao product, i.e., the column-wise Kronecker product.
Similarly, one can reshape a ◦ b ◦ c into any of the matrices
(b ⊗ c)a T , (c ⊗ a)b T , (a ⊗ c)b T , (b ⊗ a)c T , (c ⊗ b)a T and obtain the factorizations

T (2) = (B ⊙ C)A T , T (3) = (C ⊙ A)B T , T (4) = (A ⊙ C)B T , etc. (1.3)

The matrices T (1) , T (2) , . . . are called the matrix representations or matrix unfoldings of the tensor T .
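The unfolding identities can be verified numerically; the sketch below is our own (the helper `khatri_rao` is not from the paper) and uses the row-index convention (i − 1)J + j of the text, which matches a C-order reshape.

```python
import numpy as np

def khatri_rao(A, B):
    # A ⊙ B: column r is the Kronecker product of the r-th columns of A and B
    I, R = A.shape
    J, _ = B.shape
    return (A[:, None, :] * B[None, :, :]).reshape(I * J, R)

rng = np.random.default_rng(1)
I, J, K, R = 3, 4, 5, 2
A = rng.standard_normal((I, R))
B = rng.standard_normal((J, R))
C = rng.standard_normal((K, R))

T = np.einsum('ir,jr,kr->ijk', A, B, C)   # T = sum_r a_r ∘ b_r ∘ c_r

# Unfoldings as in (1.2)-(1.3):
assert np.allclose(T.reshape(I * J, K), khatri_rao(A, B) @ C.T)          # T(1)
assert np.allclose(np.transpose(T, (1, 2, 0)).reshape(J * K, I),
                   khatri_rao(B, C) @ A.T)                               # T(2)
```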
It is clear that in (1.1)–(1.2) the rank-1 terms can be arbitrarily permuted and that vectors within the same rank-1 term can be arbitrarily scaled provided the overall rank-1 term remains the same. The CPD of a tensor is unique when it is only subject to these trivial indeterminacies. Formally, we have the following definition.
Definition 1.3. Let T be a tensor of rank R. The CPD of T is essentially unique if T = [A, B, C] R = [ ¯ A, ¯ B, ¯ C] R implies that there exist an R × R permutation matrix Π and R × R nonsingular diagonal matrices Λ A , Λ B , and Λ C such that
¯A = AΠΛ A , ¯B = BΠΛ B , ¯C = CΠΛ C , Λ A Λ B Λ C = I R .
PDs can also be partially unique. That is, a factor matrix may be essentially unique without the overall PD being essentially unique. We will resort to the following definition.
Definition 1.4. Let T be a tensor of rank R. The first (resp. second or third) factor matrix of T is essentially unique if T = [A, B, C] R = [ ¯ A, ¯ B, ¯ C] R implies that there exist an R × R permutation matrix Π and an R × R nonsingular diagonal matrix Λ A (resp. Λ B or Λ C ) such that
¯A = AΠΛ A (resp. ¯B = BΠΛ B or ¯C = CΠΛ C ).
For brevity, in the sequel we drop the term “essential”, both when it concerns the uniqueness of the overall CPD and when it concerns the uniqueness of one factor matrix.
In this paper we present both deterministic and generic uniqueness results. Deterministic conditions concern one particular PD T = [A, B, C] R . For generic uniqueness we resort to the following definitions.
Definition 1.5. Let µ be the Lebesgue measure on F (I+J +K)R . The CPD of an I × J × K tensor of rank R is generically unique if
µ{(A, B, C) : the CPD of the tensor [A, B, C] R is not unique } = 0.
Definition 1.6. Let µ be the Lebesgue measure on F (I+J+K)R . The first (resp. second or third) factor matrix of an I × J × K tensor of rank R is generically unique if

µ {(A, B, C) : the first (resp. second or third) factor matrix of the tensor [A, B, C] R is not unique} = 0.
Let the matrices A ∈ F I×R , B ∈ F J ×R and C ∈ F K×R be randomly sampled from a continuous distribution. Generic uniqueness then means uniqueness that holds with probability one.
1.2. Literature overview. We refer to the overview papers [3, 6, 12] and the references therein for background, applications and algorithms for CPD. Here, we focus on results concerning uniqueness of the CPD.
1.2.1. Deterministic conditions. We refer to [7, Subsection 1.2] for a detailed overview of deterministic conditions. Here we just recall three Kruskal theorems and new results from [7] that concern the uniqueness of one factor matrix. To present Kruskal’s theorem we recall the definition of k-rank.
Definition 1.7. The k-rank of a matrix A is the largest number k A such that every subset of k A columns of the matrix A is linearly independent.
Kruskal’s theorem states the following.
Theorem 1.8. [14, Theorem 4a, p. 123] Let T = [A, B, C] R and let
k A + k B + k C ≥ 2R + 2. (1.4)
Then r T = R and the CPD of T = [A, B, C] R is unique.
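For small matrices, the k-rank of Definition 1.7 and Kruskal's condition (1.4) can be checked by brute force. The sketch below is ours (the function name and the example matrices are not from the paper) and simply enumerates all column subsets.

```python
import numpy as np
from itertools import combinations

def k_rank(A, tol=1e-9):
    # k-rank (Definition 1.7): largest k such that every set of k columns
    # of A is linearly independent
    R = A.shape[1]
    k = 0
    for size in range(1, R + 1):
        if all(np.linalg.matrix_rank(A[:, list(S)], tol=tol) == size
               for S in combinations(range(R), size)):
            k = size
        else:
            break
    return k

A = np.array([[1., 0., 1.],
              [0., 1., 1.]])
B = np.array([[1., 1., 0.],
              [1., 0., 1.],
              [0., 1., 1.]])
C = np.eye(3)
R = 3
# Kruskal's condition (1.4): k_A + k_B + k_C >= 2R + 2
print(k_rank(A) + k_rank(B) + k_rank(C) >= 2 * R + 2)   # 2 + 3 + 3 >= 8 -> True
```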
Kruskal also obtained the following more general results, which are less well known.
Theorem 1.9. [14, Theorem 4b, p. 123] (see also Corollary 1.29 below) Let T = [A, B, C] R and let
min(k A , k C ) + r B ≥ R + 2, min(k A , k B ) + r C ≥ R + 2,
r A + r B + r C ≥ 2R + 2 + min(r A − k A , r B − k B ), r A + r B + r C ≥ 2R + 2 + min(r A − k A , r C − k C ).
Then r T = R and the CPD of T = [A, B, C] R is unique.
Let the matrices A and B have R columns. Let ˜A be any subset of the columns of A, let ˜B be the corresponding subset of the columns of B, and define

H AB (δ) := min_{card( ˜A)=δ} [r ˜A + r ˜B − δ] for δ = 1, 2, . . . , R.
We will say that condition (H m ) holds for the matrices A and B if
H AB (δ) ≥ min(δ, m) for δ = 1, 2, . . . , R. (H m )
The following theorem is the strongest result about uniqueness from [14].
Theorem 1.10. [14, Theorem 4e, p. 125](see also Corollary 1.27 below) Let T = [A, B, C] R and let m B := R − r B + 2, m C := R − r C + 2. Assume that
(i) (H 1 ) holds for B and C;
(ii) (H mB ) holds for C and A;
(iii) (H mC ) holds for A and B.
Then r T = R and the CPD of T = [A, B, C] R is unique.
For the formulation of other results we recall the definition of compound matrix.
Definition 1.11. [7, Definition 2.1 and Example 2.2] The k-th compound matrix of an I × R matrix A (denoted by C k (A)) is the C I k × C R k matrix containing the determinants of all k × k submatrices of A, arranged with the submatrix index sets in lexicographic order.
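A direct numerical rendering of Definition 1.11 (a sketch of ours, not from the paper) lists all k × k minors in lexicographic order; it also sanity-checks the classical Cauchy-Binet multiplicativity C k (XY) = C k (X) C k (Y) that underlies the compound-matrix properties used below.

```python
import numpy as np
from itertools import combinations

def compound(A, k):
    # k-th compound matrix C_k(A): determinants of all k x k submatrices,
    # with row and column index sets in lexicographic order (Definition 1.11)
    rows = list(combinations(range(A.shape[0]), k))
    cols = list(combinations(range(A.shape[1]), k))
    return np.array([[np.linalg.det(A[np.ix_(r, c)]) for c in cols] for r in rows])

A = np.array([[1., 2., 3.],
              [4., 5., 6.]])
print(compound(A, 2))        # the three 2 x 2 minors: [[-3., -6., -3.]]

# Sanity check of the multiplicative (Cauchy-Binet) property C_k(XY) = C_k(X) C_k(Y):
rng = np.random.default_rng(0)
X, Y = rng.standard_normal((3, 4)), rng.standard_normal((4, 5))
assert np.allclose(compound(X @ Y, 2), compound(X, 2) @ compound(Y, 2))
```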
With a vector d = [d 1 . . . d R ] T we associate the vector

d̂ m := [d 1 · · · d m , d 1 · · · d m−1 d m+1 , . . . , d R−m+1 · · · d R ] T ∈ F C R m , (1.5)

whose entries are all products d i1 · · · d im with 1 ≤ i 1 < · · · < i m ≤ R. Let us define conditions (K m ), (C m ), (U m ) and (W m ), which depend on matrices A ∈ F I×R , B ∈ F J×R , C ∈ F K×R and an integer parameter m:
{ r A + k B ≥ R + m and k A ≥ m }  or  { r B + k A ≥ R + m and k B ≥ m }; (K m )

C m (A) ⊙ C m (B) has full column rank; (C m )

(C m (A) ⊙ C m (B)) d̂ m = 0, d ∈ F R  ⇒  d̂ m = 0; (U m )

(C m (A) ⊙ C m (B)) d̂ m = 0, d ∈ range(C T )  ⇒  d̂ m = 0. (W m )
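Condition (C m ) is directly checkable numerically. The sketch below is ours (helper names are not from the paper) and tests whether C m (A) ⊙ C m (B) has full column rank for sample matrices.

```python
import numpy as np
from itertools import combinations

def compound(A, m):
    # m-th compound matrix C_m(A) (Definition 1.11)
    rows = list(combinations(range(A.shape[0]), m))
    cols = list(combinations(range(A.shape[1]), m))
    return np.array([[np.linalg.det(A[np.ix_(r, c)]) for c in cols] for r in rows])

def khatri_rao(A, B):
    # column-wise Kronecker product A ⊙ B
    I, R = A.shape
    J, _ = B.shape
    return (A[:, None, :] * B[None, :, :]).reshape(I * J, R)

def condition_Cm(A, B, m):
    # (C_m): C_m(A) ⊙ C_m(B) has full column rank
    U = khatri_rao(compound(A, m), compound(B, m))
    return np.linalg.matrix_rank(U) == U.shape[1]

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 5))
B = rng.standard_normal((4, 5))
print(condition_Cm(A, B, 2))    # generically True: 36 rows vs C(5,2) = 10 columns
```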
In the sequel, we will for instance say that “condition (U m ) holds for the matrices X and Y” if condition (U m ) holds for the matrices A and B replaced by the matrices X and Y, respectively. We will simply write (U m ) (resp. (K m ),(H m ),(C m ) or (W m )) when no confusion is possible.
It is known that conditions (K 2 ), (C 2 ), (U 2 ) guarantee uniqueness of the CPD when the third factor matrix has full column rank (see Proposition 1.15 below), and that condition (K m ) guarantees the uniqueness of the third factor matrix [8], [7, Theorem 1.12].
In the following proposition we gather, for later reference, properties of conditions (K m ), (C m ), (U m ) and (W m ) that were established in [7, §2–§3]. The proofs follow from properties of compound matrices [7, Subsection 2.1].
Proposition 1.12.
(1) If (K m ) holds, then (C m ) and (H m ) hold [7, Lemmas 3.8, 3.9];
(2) if (C m ) or (H m ) holds, then (U m ) holds [7, Lemmas 3.1, 3.10];
(3) if (U m ) holds, then (W m ) holds [7, Lemma 3.3];
(4) if (K m ) holds, then (K k ) holds for k ≤ m [7, Lemma 3.4];
(5) if (H m ) holds, then (H k ) holds for k ≤ m [7, Lemma 3.5];
(6) if (C m ) holds, then (C k ) holds for k ≤ m [7, Lemma 3.6];
(7) if (U m ) holds, then (U k ) holds for k ≤ m [7, Lemma 3.7];
(8) if (W m ) holds and min(k A , k B ) ≥ m − 1, then (W k ) holds for k ≤ m [7, Lemma 3.12];
(9) if (U m ) holds, then min(k A , k B ) ≥ m [7, Lemma 2.8 ].
The following schemes illustrate Proposition 1.12:
                      (W m )    (W m−1 )   . . .   (W 2 )    (W 1 )
                        ⇑          ⇑                 ⇑         ⇑
k A ≥ m, k B ≥ m  ⇐  (U m ) ⇒ (U m−1 ) ⇒ . . . ⇒ (U 2 ) ⇒ (U 1 )
                        ⇑          ⇑                 ⇑         ⇕
                      (C m ) ⇒ (C m−1 ) ⇒ . . . ⇒ (C 2 ) ⇒ (C 1 )
                        ⇑          ⇑                 ⇑         ⇑
                      (K m ) ⇒ (K m−1 ) ⇒ . . . ⇒ (K 2 ) ⇒ (K 1 )   (1.6)
and
if min(k A , k B ) ≥ m − 1, then (W m ) ⇒ (W m−1 ) ⇒ . . . ⇒ (W 2 ) ⇒ (W 1 ). (1.7)

Scheme (1.6) also remains valid after replacing conditions (C m ), . . . , (C 1 ) and equivalence (C 1 ) ⇔ (U 1 ) by conditions (H m ), . . . , (H 1 ) and implication (H 1 ) ⇒ (U 1 ), respectively. One can easily construct examples where (C m ) holds but (H m ) does not hold. We do not know examples where (H m ) is more relaxed than (C m ).
Deterministic results concerning the uniqueness of one particular factor matrix were presented in [7, §4]. We first have the following proposition.
Proposition 1.13. [7, Proposition 4.9] Let A ∈ F I×R , B ∈ F J ×R , C ∈ F K×R , and let T = [A, B, C] R . Assume that
(i) k C ≥ 1;
(ii) m = R − r C + 2 ≤ min(I, J );
(iii) A ⊙ B has full column rank;
(iv) the triplet of matrices (A, B, C) satisfies conditions (W m ), . . . , (W 1 ).
Then r T = R and the third factor matrix of T is unique.
Combining Propositions 1.12 and 1.13 we obtained the following result.
Proposition 1.14. [7, Proposition 4.3, Corollaries 4.4 and 4.5] Let A, B, C, and T be as in Proposition 1.13. Assume that k C ≥ 1 and m = m C := R − r C + 2.
Then

(1.4) ==trivial==⇒ (K m ) ==(1.6)==⇒ (C m ), (H m ) ==(1.6)==⇒ (U m ) ==(1.6)==⇒ { (C 1 ), min(k A , k B ) ≥ m − 1, (W m ) } ==(1.7)==⇒ { (C 1 ), (W 1 ), . . . , (W m ) } ⇒ { r T = R, the third factor matrix of T is unique }. (1.8)
Note that for r C = R, we have m = 2 and (U 2 ) is equivalent to (W 2 ). Moreover, in this case (U 2 ) is necessary for uniqueness. We obtain the following counterpart of Proposition 1.14.
Proposition 1.15. [4, 10, 15] Let A, B, C, and T be as in Proposition 1.13.
Assume that r C = R. Then

(1.4) ⇒ (K 2 ) ⇒ (C 2 ), (H 2 ) ⇒ (U 2 ) ⇔ { r T = R, the CPD of T is unique }. (1.9)
1.2.2. Generic conditions. Let the matrices A ∈ F I×R , B ∈ F J×R and C ∈ F K×R be randomly sampled from a continuous distribution. It can be easily checked that the equations

k A = r A = min(I, R), k B = r B = min(J, R), k C = r C = min(K, R)

hold generically. Thus, by (1.4), the CPD of an I × J × K tensor of rank R is generically unique if
min(I, R) + min(J, R) + min(K, R) ≥ 2R + 2. (1.10)

The generic uniqueness of one factor matrix has not yet been studied as such. It can be easily seen that in (1.8) the generic version of (K m ) for m = R − K + 2 is also given by (1.10).
Let us additionally assume that K ≥ R. Under this assumption, (1.10) reduces to
min(I, R) + min(J, R) ≥ R + 2.
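As a quick numerical illustration (not part of the paper; the function name is ours), the generic Kruskal bound (1.10) and its reduction for K ≥ R can be evaluated directly.

```python
def generic_kruskal_unique(I, J, K, R):
    # Generic version of Kruskal's condition (1.4), i.e. condition (1.10):
    # min(I, R) + min(J, R) + min(K, R) >= 2R + 2
    return min(I, R) + min(J, R) + min(K, R) >= 2 * R + 2

# For K >= R the bound reduces to min(I, R) + min(J, R) >= R + 2:
assert generic_kruskal_unique(4, 5, 7, 7) == (min(4, 7) + min(5, 7) >= 7 + 2)
print(generic_kruskal_unique(4, 5, 7, 7))   # 4 + 5 + 7 >= 16 -> True
```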
The generic version of condition (C 2 ) was given in [4, 16]. It was indicated that the C I 2 C J 2 × C R 2 matrix U = C 2 (A) ⊙ C 2 (B) generically has full column rank whenever the number of columns of U does not exceed the number of rows. By Proposition 1.15 the CPD of an I × J × K tensor of rank R is then generically unique if

K ≥ R and I(I − 1)J(J − 1)/4 = C I 2 C J 2 ≥ C R 2 = R(R − 1)/2. (1.11)

The following four results have been obtained in algebraic geometry.
Theorem 1.16. [18, Corollary 3.7] Let 3 ≤ I ≤ J ≤ K, K − 1 ≤ (I − 1)(J − 1), and let K be odd. Then the CPD of an I × J × K tensor of rank R is generically unique if R ≤ IJ K/(I + J + K − 2) − K.
Theorem 1.17. [2, Theorem 1.1] Let I ≤ J ≤ K. Let α, β be maximal such that 2 α ≤ I and 2 β ≤ J . Then the CPD of an I ×J ×K tensor of rank R is generically unique if R ≤ 2 α+β−2 .
Theorem 1.18. [2, Proposition 5.2], [18, Theorem 2.7] Let R ≤ (I − 1)(J − 1) ≤ K. Then the CPD of an I × J × K tensor of rank R is generically unique.
Theorem 1.19. [2, Theorem 1.2] The CPD of an I × I × I tensor of rank R is generically unique if R ≤ k(I), where k(I) is given in Table 1.1.
Finally, for a number of specific cases of dimensions and rank, generic uniqueness results have been obtained in [19].
Table 1.1
Upper bound k(I) on R under which generic uniqueness of the CPD of an I × I × I tensor is guaranteed by Theorem 1.19.

I    | 2  3  4  5  6   7   8   9   10
k(I) | 2  3  5  9  13  18  22  27  32
1.3. Results and organization. In this paper we use the conditions in (1.8) to establish CPD uniqueness in cases where r C < R.
In §2 we assume that a tensor admits two PDs that have one or two factor matrices in common. We establish conditions under which both decompositions are the same.
We obtain the following results.
Proposition 1.20. Let T = [A, B, C] R = [ ¯ A, ¯ B, CΠΛ C ] R , where Π is an R×R permutation matrix and Λ C is a nonsingular diagonal matrix. Let the matrices A, B and C satisfy the following condition
max(min(k A , k B − 1), min(k A − 1, k B )) + k C ≥ R + 1. (1.12)

Then there exist nonsingular diagonal matrices Λ A and Λ B such that

¯A = AΠΛ A , ¯B = BΠΛ B , Λ A Λ B Λ C = I R .
Proposition 1.21. Let T = [A, B, C] R = [AΠ A Λ A , ¯ B, CΠ C Λ C ] R , where Π A
and Π C are R × R permutation matrices and where Λ A and Λ C are nonsingular diagonal matrices. Let the matrices A, B and C satisfy at least one of the following conditions
k C ≥ 2 and max(min(k A , k B − 1), min(k A − 1, k B )) + r C ≥ R + 1,
k A ≥ 2 and max(min(k B , k C − 1), min(k B − 1, k C )) + r A ≥ R + 1. (1.13)

Then Π A = Π C and ¯B = BΠ A Λ −1 A Λ −1 C .
Note that in Propositions 1.20 and 1.21 we do not assume that R is minimal.
Neither do we assume in Proposition 1.21 that Π A and Π C are the same.
In §3 we obtain new results concerning the uniqueness of the overall CPD by combining (1.8) with results from §2.
Combining (1.8) with Proposition 1.20 we prove the following statements.
Proposition 1.22. Let T = [A, B, C] R and m C := R − r C + 2. Assume that (i) condition (1.12) holds;
(ii) condition (W mC ) holds for A, B, and C;
(iii) A ⊙ B has full column rank. (C 1 )
Then r T = R and the CPD of tensor T is unique.
Corollary 1.23. Let T = [A, B, C] R and m C := R − r C + 2. Assume that (i) condition (1.12) holds;
(ii) condition (U mC ) holds for A and B.
Then r T = R and the CPD of tensor T is unique.
Corollary 1.24. Let T = [A, B, C] R and m C := R − r C + 2. Assume that (i) condition (1.12) holds;
(ii) condition (H mC ) holds for A and B.
Then r T = R and the CPD of tensor T is unique.
Corollary 1.25. Let T = [A, B, C] R and m C := R − r C + 2. Assume that
(i) condition (1.12) holds;
(ii) C mC (A) ⊙ C mC (B) has full column rank.
Then r T = R and the CPD of tensor T is unique.
Note that Proposition 1.15 is a special case of the results in Proposition 1.22, Corollaries 1.23–1.25 and Kruskal’s Theorem 1.8. In the former, one factor matrix is assumed to have full column rank (r C = R) while in the latter this is not necessary (r C = R − m C + 2 with m C ≥ 2). The condition on C is relaxed by tightening the conditions on A and B. For instance, Corollary 1.23 allows r C = R − m C + 2 with m := m C ≥ 2 by imposing (1.12) and (C m ). From scheme (1.6) we have that (C m ) implies (C 2 ), and hence (C m ) is more restrictive than (C 2 ). Scheme (1.6) further shows that Corollary 1.23 is more general than Corollaries 1.24 and 1.25. In turn, Proposition 1.22 is more general than Corollary 1.23. Note that we did not formulate a combination of implication (K m ) ⇒ (C m ) (or (H m )) from scheme (1.8) with Proposition 1.20. Such a combination leads to a result that is equivalent to Corollary 1.29 below.
Combining (1.8) with Proposition 1.21 we prove the following results.
Proposition 1.26. Let T = [A, B, C] R and let
m A := R − r A + 2, m B := R − r B + 2, m C := R − r C + 2. (1.14) Assume that at least two of the following conditions hold
(i) condition (U mA ) holds for B and C;
(ii) condition (U mB ) holds for C and A;
(iii) condition (U mC ) holds for A and B.
Then r T = R and the CPD of tensor T is unique.
Corollary 1.27. Let T = [A, B, C] R and consider m A , m B , and m C defined in (1.14). Assume that at least two of the following conditions hold
(i) condition (H mA ) holds for B and C;
(ii) condition (H mB ) holds for C and A;
(iii) condition (H mC ) holds for A and B.
Then r T = R and the CPD of tensor T is unique.
Corollary 1.28. Let T = [A, B, C] R and consider m A , m B , and m C defined in (1.14). Let at least two of the matrices
C mA (B) ⊙ C mA (C), C mB (C) ⊙ C mB (A), C mC (A) ⊙ C mC (B) (1.15)

have full column rank. Then r T = R and the CPD of tensor T is unique.
Corollary 1.29. Let T = [A, B, C] R and let (X, Y, Z) coincide with (A, B, C), (B, C, A), or (C, A, B). If
k X + r Y + r Z ≥ 2R + 2,
min(r Z + k Y , k Z + r Y ) ≥ R + 2, (1.16)

then r T = R and the CPD of tensor T is unique.
Corollary 1.30. Let T = [A, B, C] R and let the following conditions hold
k A + r B + r C ≥ 2R + 2, r A + k B + r C ≥ 2R + 2, r A + r B + k C ≥ 2R + 2.
(1.17)
Then r T = R and the CPD of tensor T is unique.
Let us compare Kruskal’s Theorems 1.8–1.10 with Corollaries 1.24, 1.27, 1.29, and 1.30. Elementary algebra yields that Theorem 1.9 is equivalent to Corollary 1.29.
From Corollary 1.27 it follows that assumption (i) of Theorem 1.10 is redundant. We will demonstrate in Examples 3.2 and 3.3 that it is not possible to state in general which of the Corollaries 1.24 or 1.27 is more relaxed. Thus, Corollary 1.24 (obtained by combining implication (H m ) ⇒ (U m ) from scheme (1.8) with Proposition 1.21) is an (H m )–type result on uniqueness that was not in [14]. Corollary 1.30 is a special case of Corollary 1.29, which is obviously more relaxed than Kruskal’s well-known Theorem 1.8. Finally we note that if condition (H m ) holds, then r A + r B + r C ≥ 2R + 2. Thus, neither Kruskal’s Theorems 1.8–1.10 nor Corollaries 1.24, 1.27, 1.29, 1.30 can be used for demonstrating the uniqueness of a PD [A, B, C] R when r A + r B + r C < 2R + 2.
We did not present a result based on a combination of (W m )-type implications from scheme (1.8) with Proposition 1.21 because we do not have examples of cases where such conditions are more relaxed than those in Proposition 1.26.
In §4 we indicate how our results can be adapted in the case of PD symmetries.
Well-known necessary conditions for the uniqueness of the CPD are [21, p. 2079, Theorem 2], [13, p. 28], [18, p. 651]
min(k A , k B , k C ) ≥ 2, (1.18)
A ⊙ B, B ⊙ C, C ⊙ A have full column rank. (1.19)

Further, the following necessary condition was obtained in [5, Theorem 2.3]:

(U 2 ) holds for the pairs (A, B), (B, C), and (C, A). (1.20)

It follows from scheme (1.6) that (1.20) is more restrictive than (1.18) and (1.19).
Our most general condition concerning uniqueness of one factor matrix is given in Proposition 1.13. Note that in Proposition 1.13, condition (i) is more relaxed than (1.18) and condition (iii) coincides with (1.19). One may wonder whether condition (iv) in Proposition 1.13 is necessary for the uniqueness of at least one factor matrix.
In §5 we show that this is not the case. We actually study an example in which CPD uniqueness can be established without (W m ) being satisfied.
In §6 we study generic uniqueness of one factor matrix and generic CPD unique- ness. Our result on overall CPD uniqueness is the following.
Proposition 1.31. The CPD of an I × J × K tensor of rank R is generically unique if there exist matrices A 0 ∈ F I×R , B 0 ∈ F J ×R , and C 0 ∈ F K×R such that at least one of the following conditions holds:
(i) C mC (A 0 ) ⊙ C mC (B 0 ) has full column rank, where m C = R − min(K, R) + 2;
(ii) C mA (B 0 ) ⊙ C mA (C 0 ) has full column rank, where m A = R − min(I, R) + 2;
(iii) C mB (C 0 ) ⊙ C mB (A 0 ) has full column rank, where m B = R − min(J, R) + 2.
We give several examples that illustrate the uniqueness results in the generic case.
2. Equality of PDs with common factor matrices. In this section we assume that a tensor admits two not necessarily canonical PDs that have one or two factor matrices in common. In the latter case, the two PDs may have the columns of the common factor matrices permuted differently. We establish conditions that guarantee that the two PDs are the same.
2.1. One factor matrix in common. In this subsection we assume that two PDs have the factor matrix C in common. The result that we are concerned with is Proposition 1.20. The proof is based on the following three lemmas.
Lemma 2.1. For matrices A, ¯A ∈ F I×R and indices r 1 , . . . , r n ∈ {1, . . . , R} define the subspaces E r1 ...rn and ¯E r1 ...rn as follows:

E r1 ...rn := span{a r1 , . . . , a rn }, ¯E r1 ...rn := span{ā r1 , . . . , ā rn }.

Assume that k A ≥ 2 and that there exists m ∈ {2, . . . , k A } such that

E r1 ...r m−1 ⊆ ¯E r1 ...r m−1 for all 1 ≤ r 1 < r 2 < · · · < r m−1 ≤ R. (2.1)

Then there exists a nonsingular diagonal matrix Λ such that A = ¯AΛ.
Proof. For m = 2 we have

span{a r1 } = E r1 ⊆ ¯E r1 = span{ā r1 } for all 1 ≤ r 1 ≤ R, (2.2)

such that the lemma trivially holds. For m ≥ 3 we arrive at (2.2) by downward induction on l = m, m − 1, . . . , 3. Assuming that

E r1 ...r l−1 ⊆ ¯E r1 ...r l−1 for all 1 ≤ r 1 < r 2 < · · · < r l−1 ≤ R, (2.3)

we show that

E r1 ...r l−2 ⊆ ¯E r1 ...r l−2 for all 1 ≤ r 1 < r 2 < · · · < r l−2 ≤ R.
Assume r 1 , r 2 , . . . , r l−2 fixed and let i, j ∈ {1, . . . , R} \ {r 1 , . . . , r l−2 }, with i ≠ j. Since l ≤ m ≤ k A , we have that dim E r1 ,...,r l−2 ,i,j = l. Because

l = dim E r1 ,...,r l−2 ,i,j ≤ dim span{E r1 ,...,r l−2 ,i , E r1 ,...,r l−2 ,j } ≤ dim span{ ¯E r1 ,...,r l−2 ,i , ¯E r1 ,...,r l−2 ,j },

where the last inequality follows from (2.3), we have

¯E r1 ,...,r l−2 ,i ≠ ¯E r1 ,...,r l−2 ,j . (2.4)
Therefore,

E r1 ,...,r l−2 ⊆ E r1 ,...,r l−2 ,i ∩ E r1 ,...,r l−2 ,j ⊆ ¯E r1 ,...,r l−2 ,i ∩ ¯E r1 ,...,r l−2 ,j = ¯E r1 ,...,r l−2 ,

where the second inclusion follows from (2.3) and the equality follows from (2.4).
The induction follows. To conclude the proof, we note that Λ is nonsingular since k A ≥ 2.
Lemma 2.2. Let C ∈ F K×R and consider m such that m ≤ k C . Then for any set of distinct indices I = {i 1 , . . . , i m−1 } ⊆ {1, . . . , R} there exists a vector x ∈ F K such that
x T c i = 0 for i ∈ I and x T c i 6= 0 for i ∈ I c := {1, . . . , R} \ I. (2.5)
Proof. Let C I ∈ F K×(m−1) and C Ic ∈ F K×(R−m+1) contain the columns of C indexed by I and I c , respectively, and let the columns of C ⊥ I ∈ F K×(K−m+1) form a basis for the orthogonal complement of range(C I ). The matrix (C ⊥ I ) H C Ic cannot have a zero column, since otherwise the corresponding column of C Ic would be in range(C I ), which would contradict k C ≥ m. We conclude that (2.5) holds for x = (C ⊥ I y) ∗ , with y ∈ F K−m+1 generic.
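The construction in the proof of Lemma 2.2 can be mirrored numerically. The sketch below is ours (real case only; the function name is an assumption, not from the paper) and builds x from an orthonormal basis of the orthogonal complement of range(C I ).

```python
import numpy as np

def annihilating_vector(C, I_set, rng=np.random.default_rng(0)):
    # Following the proof of Lemma 2.2: x = (C_I^perp y)^* for generic y,
    # so x^T c_i = 0 for i in I_set and (with probability 1) x^T c_i != 0 otherwise.
    CI = C[:, sorted(I_set)]
    U, s, _ = np.linalg.svd(CI, full_matrices=True)
    rank = int(np.sum(s > 1e-12))
    Q = U[:, rank:]                  # orthonormal basis of the orthogonal complement
    y = rng.standard_normal(Q.shape[1])
    return np.conj(Q @ y)

C = np.array([[1., 0., 1.],
              [0., 1., 1.],
              [0., 0., 1.]])         # k_C = 3, so any index set I with card(I) <= 2 works
x = annihilating_vector(C, {0})
v = np.abs(x @ C)                    # first entry ~ 0, the remaining entries nonzero
```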
Lemma 2.3. Let Π be an R × R permutation matrix. Then for any vector λ ∈ F R ,
Diag(Πλ)Π = ΠDiag(λ). (2.6)
Proof. The lemma follows directly from the definition of permutation matrix.
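Identity (2.6) is easy to confirm numerically (the example permutation and vector below are our own choices).

```python
import numpy as np

# Verify Diag(Πλ)Π = ΠDiag(λ) for a sample permutation matrix Π and vector λ.
P = np.eye(4)[[2, 0, 3, 1]]             # rows of I_4 permuted: a permutation matrix
lam = np.array([1.5, -2.0, 3.0, 0.5])
lhs = np.diag(P @ lam) @ P
rhs = P @ np.diag(lam)
assert np.allclose(lhs, rhs)
```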
We are now ready to prove Proposition 1.20.
Proof. Let Â := ¯AΠ T and B̂ := ¯BΛ −1 C Π T . Then

T = [A, B, C] R = [ ¯A, ¯B, CΠΛ C ] R = [ Â, B̂, C] R . (2.7)

We show that the columns of A and B coincide up to scaling with the corresponding columns of Â and B̂, respectively. Consider indices i 1 , . . . , i R−kC +1 such that 1 ≤ i 1 < · · · < i R−kC +1 ≤ R. Let m := k C and let I := {1, . . . , R} \ {i 1 , . . . , i R−kC +1 }.
From Lemma 2.2 it follows that there exists a vector x ∈ F K such that x T c i = 0 for i ∈ I and x T c i ≠ 0 for i ∈ I c = {i 1 , . . . , i R−kC +1 }.
Let d = [x T c i1 . . . x T c iR−kC +1 ] T . Then (A ⊙ B)C T x = ( Â ⊙ B̂ )C T x is equivalent to

([a i1 · · · a iR−kC +1 ] ⊙ [b i1 · · · b iR−kC +1 ]) d = ([ â i1 · · · â iR−kC +1 ] ⊙ [ b̂ i1 · · · b̂ iR−kC +1 ]) d,

which may be expressed as

[a i1 · · · a iR−kC +1 ] Diag(d) [b i1 · · · b iR−kC +1 ] T = [ â i1 · · · â iR−kC +1 ] Diag(d) [ b̂ i1 · · · b̂ iR−kC +1 ] T .

By (1.12), min(k A , k B ) ≥ R − k C + 1. Hence, the matrices [a i1 · · · a iR−kC +1 ] and [b i1 · · · b iR−kC +1 ] have full column rank. Since by construction the vector d has only nonzero components, it follows that

a i1 , . . . , a iR−kC +1 ∈ span{ â i1 , . . . , â iR−kC +1 }, b i1 , . . . , b iR−kC +1 ∈ span{ b̂ i1 , . . . , b̂ iR−kC +1 }.
By (1.12), max(k A , k B ) ≥ m := R − k C + 2 ≥ 2. Without loss of generality we confine ourselves to the case k A ≥ m. Then, by Lemma 2.1, there exists a nonsingular diagonal matrix Λ such that A = ÂΛ. Denoting λ A := Π T diag(Λ −1 ) and Λ A := Diag(λ A ) and applying Lemma 2.3, we have

¯A = ÂΠ = AΛ −1 Π = ADiag(Πλ A )Π = AΠDiag(λ A ) = AΠΛ A .

It follows from (2.7) and (1.2) that

(C ⊙ A)B T = (CΠΛ C ⊙ ¯A) ¯B T = (CΠΛ C ⊙ AΠΛ A ) ¯B T = (C ⊙ A)ΠΛ C Λ A ¯B T .

Since k A ≥ R − k C + 2, it follows that condition (K 1 ) holds for the matrices A and C. From Proposition 1.12 (1) it follows that the matrix C ⊙ A has full column rank. Hence, B T = ΠΛ C Λ A ¯B T , i.e., ¯B = BΠΛ −1 A Λ −1 C =: BΠΛ B .
Example 2.4. Consider the 2 × 3 × 3 tensor given by T = [ Â, B̂, Ĉ ] 3 , where

Â = [  1   1   1
      −1  −2   3 ],

B̂ = [  6  12   2
       3   4  −1
       4   6  −4 ],

Ĉ = [ 1 0 0
      0 1 0
      0 0 1 ].

Since k Â + k B̂ + k Ĉ = 2 + 3 + 3 ≥ 2 · 3 + 2, it follows from Theorem 1.8 that r T = 3 and that the CPD of T is unique.
Increasing the number of terms, we also have T = [A, B, C] 4 for

A = [ 1 0 1 1
      0 1 1 2 ],

B = [ 1 1 0 0
      1 0 1 0
      1 0 0 1 ],

C = [  6   −6  −3  −2
      12  −24  −8  −6
       2    6  −3  −6 ].

Since k A = 2 and k B = k C = 3, condition (1.12) holds. Hence, by Proposition 1.20, if T = [ ¯A, ¯B, ¯C] 4 and ¯C = C, then there exists a nonsingular diagonal matrix Λ such that ¯A = AΛ and ¯B = BΛ −1 .
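That the rank-3 CPD and the 4-term PD of Example 2.4 represent the same 2 × 3 × 3 tensor can be verified numerically; the sketch below (helper name ours) restates both decompositions and compares the resulting tensors.

```python
import numpy as np

def pd(A, B, C):
    # Evaluate a polyadic decomposition [A, B, C]_R as a full tensor
    return np.einsum('ir,jr,kr->ijk', A, B, C)

A3 = np.array([[1., 1., 1.], [-1., -2., 3.]])
B3 = np.array([[6., 12., 2.], [3., 4., -1.], [4., 6., -4.]])
C3 = np.eye(3)

A4 = np.array([[1., 0., 1., 1.], [0., 1., 1., 2.]])
B4 = np.array([[1., 1., 0., 0.], [1., 0., 1., 0.], [1., 0., 0., 1.]])
C4 = np.array([[6., -6., -3., -2.], [12., -24., -8., -6.], [2., 6., -3., -6.]])

# The two PDs of Example 2.4 give the same tensor:
assert np.allclose(pd(A3, B3, C3), pd(A4, B4, C4))
```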
The following condition is also satisfied:
max(min(k A , k C − 1), min(k A − 1, k C )) + k B ≥ R + 1.
By symmetry, we have from Proposition 1.20 that, if T = [ ¯ A, ¯ B, ¯ C] 4 and ¯ B = B, then there exists a nonsingular diagonal matrix Λ such that ¯ A = AΛ and ¯ C = CΛ −1 .
Finally, we show that the inequality of condition (1.12) is sharp. We have max(min(k B , k C − 1), min(k B − 1, k C )) + k A = R < R + 1.
One can verify that T = [ ¯A, ¯B, ¯C] 4 with ¯A = A and with ¯B and ¯C given by

¯B = [ 6 12  2
       3  4 −1
       4  6 −4 ] [ 1 0 0
                   0 α 0
                   0 0 β ] [ 1  1   1    1
                             1  2  4/3  3/2
                             1 −3   3    9 ],

¯C = [ 1   0    0
       0  1/α   0
       0   0   1/β ] [   6     −6    −3    −2
                      −24/5  48/5  16/5  12/5
                       2/15   2/5  −1/5  −2/5 ],
for arbitrary nonzero α and β. Hence, there exist infinitely many PDs T = [ ¯ A, ¯ B, ¯ C] 4
with ¯ A = A; the columns of ¯ B and ¯ C are only proportional to the columns of B and C, respectively, for α = −2/5 and β = 1/15. We conclude that the inequality of condition (1.12) is sharp.
2.2. Two factor matrices in common. In this subsection we assume that two PDs have the factor matrices A and C in common. We do not assume however that in the two PDs the columns of these matrices are permuted in the same manner. The result that we are concerned with, is Proposition 1.21.
Proof. Without loss of generality, we confine ourselves to the case
k C ≥ 2 and min(k A − 1, k B ) + r C ≥ R + 1. (2.8)
We set for brevity r := r C . Denoting Π = Π A Π T C and B̂ = ¯BΛ A Λ C Π T C , we have

[AΠ A Λ A , ¯B, CΠ C Λ C ] R = [AΠ A Π T C , ¯BΛ A Λ C Π T C , C] R = [AΠ, B̂, C] R .

We will show that, under (2.8), [A, B, C] R = [AΠ, B̂, C] R implies that Π = I R . This, in turn, immediately implies that Π A = Π C and ¯B = BΠ A Λ −1 A Λ −1 C .
(i) Let us fix integers i_1, ..., i_r such that the columns c_{i_1}, ..., c_{i_r} form a basis of range(C) and let us set {j_1, ..., j_{R−r}} := {1, ..., R} \ {i_1, ..., i_r}. Let X ∈ F^{K×r} denote a right inverse of [c_{i_1} ... c_{i_r}]^T, i.e., [c_{i_1} ... c_{i_r}]^T X = I_r. Define the subspaces E, E_{i_k} ⊆ F^R as follows:

E = span{e^R_{j_1}, ..., e^R_{j_{R−r}}},
E_{i_k} = span{e^R_l : c_l^T x_k ≠ 0, l ∈ {j_1, ..., j_{R−r}}},  k ∈ {1, ..., r}.

By construction, E_{i_k} ⊆ E and e^R_{i_l} ∉ E_{i_k} for k, l ∈ {1, ..., r}.
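These definitions can be made concrete in a small numerical sketch. The matrix C below is hypothetical (K = 3, R = 4, r = r_C = 3, with c_4 = c_1 + c_2 + c_3; NumPy assumed):

```python
import numpy as np

# Hypothetical C in F^{K x R}: columns c_1, c_2, c_3 form a basis of range(C).
C = np.array([[1., 0., 0., 1.],
              [0., 1., 0., 1.],
              [0., 0., 1., 1.]])
i_idx = [0, 1, 2]            # i_1, ..., i_r (0-based)
j_idx = [3]                  # {j_1, ..., j_{R-r}} = {1, ..., R} \ {i_1, ..., i_r}

# Right inverse X of [c_{i_1} ... c_{i_r}]^T, i.e. [c_{i_1} ... c_{i_r}]^T X = I_r.
Ci = C[:, i_idx]             # K x r
X = np.linalg.pinv(Ci.T)     # K x r
assert np.allclose(Ci.T @ X, np.eye(3))

# E_{i_k} is spanned by the e^R_l with l among the j-indices and c_l^T x_k != 0.
E_ik = [[l for l in j_idx if abs(C[:, l] @ X[:, k]) > 1e-12] for k in range(3)]
print(E_ik)                  # -> [[3], [3], [3]]: each E_{i_k} = span{e^R_4}

# Used later in the proof: X^T c_j has at least two nonzero entries (here three).
print(X.T @ C[:, 3])         # -> [1. 1. 1.]
```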
(ii) Let us show that Π span{E_{i_k}, e^R_{i_k}} = span{E_{i_k}, e^R_{i_k}} for all k ∈ {1, ..., r}.
Let us fix k ∈ {1, ..., r}. Assume that C^T x_k has nonzero entries at positions k_1, ..., k_L. Denote these entries by α_1, ..., α_L. From the definition of X and E_{i_k} it follows that L ≤ R − r + 1 and span{e^R_{k_1}, ..., e^R_{k_L}} = span{E_{i_k}, e^R_{i_k}}. Define P_k = [e^R_{k_1} ... e^R_{k_L}]. Then we have

P_k P_k^T Diag(C^T x_k) P_k P_k^T = Diag(C^T x_k), (2.9)
P_k^T Diag(C^T x_k) P_k = Diag([α_1 ... α_L]). (2.10)

Further, [A, B, C]_R = [AΠ, B̂, C]_R implies that
A Diag(C^T x_k) B^T = AΠ Diag(C^T x_k) B̂^T. (2.11)

Using (2.9)–(2.11), we obtain

A P_k Diag([α_1 ... α_L]) P_k^T B^T = A P_k P_k^T Diag(C^T x_k) P_k P_k^T B^T
  = A Diag(C^T x_k) B^T
  = AΠ Diag(C^T x_k) B̂^T
  = AΠ P_k P_k^T Diag(C^T x_k) P_k P_k^T B̂^T
  = AΠ P_k Diag([α_1 ... α_L]) P_k^T B̂^T. (2.12)

Note that B P_k = [b_{k_1} ... b_{k_L}]. Since, by (2.8), k_B ≥ R − r + 1 ≥ L, it follows that the matrix P_k^T B^T has full row rank. Further noting that A P_k = [a_{k_1} ... a_{k_L}] and AΠ P_k = [(AΠ)_{k_1} ... (AΠ)_{k_L}], we obtain from (2.12) that

span{a_{k_1}, ..., a_{k_L}} ⊆ span{(AΠ)_{k_1}, ..., (AΠ)_{k_L}}. (2.13)

Since, by (2.8), k_A ≥ R − r + 2 ≥ L + 1, (2.13) is only possible if Π span{E_{i_k}, e^R_{i_k}} = span{E_{i_k}, e^R_{i_k}}.
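The selection-matrix identities (2.9)–(2.10) used in this step are elementary and can be verified directly. A minimal sketch, with a hypothetical vector v in the role of C^T x_k (R = 5, L = 3; NumPy assumed):

```python
import numpy as np

# v plays the role of C^T x_k: nonzero entries alpha_1, ..., alpha_L
# at positions k_1, ..., k_L (here hypothetical: R = 5, L = 3).
R = 5
v = np.array([2., 0., -1., 0., 3.])
k_pos = np.flatnonzero(v)                # 0-based positions k_1, ..., k_L
P = np.eye(R)[:, k_pos]                  # P_k = [e^R_{k_1} ... e^R_{k_L}]

D = np.diag(v)                           # Diag(C^T x_k)
# (2.9): P_k P_k^T Diag(C^T x_k) P_k P_k^T = Diag(C^T x_k),
# because P_k P_k^T zeroes out exactly the rows/columns where v is already zero.
assert np.allclose(P @ P.T @ D @ P @ P.T, D)
# (2.10): P_k^T Diag(C^T x_k) P_k = Diag(alpha_1, ..., alpha_L).
assert np.allclose(P.T @ D @ P, np.diag(v[k_pos]))
```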
(iii) Let us show that ΠE = E. Let us fix j ∈ {j_1, ..., j_{R−r}}. From X^T c_{i_k} = e^r_k for k ∈ {1, ..., r}, the fact that the vectors c_{i_1}, ..., c_{i_r} form a basis of range(C), and k_C ≥ 2, it follows that the vector X^T c_j has at least two nonzero components, say, the m-th and the n-th. Since c_j^T x_m ≠ 0 and c_j^T x_n ≠ 0, we have e^R_j ∈ E_{i_m} ∩ E_{i_n}. From the preceding steps we have

Π e^R_j ∈ Π(E_{i_m} ∩ E_{i_n}) \overset{(i)}{=} Π( span{E_{i_m}, e^R_{i_m}} ∩ span{E_{i_n}, e^R_{i_n}} ) \overset{(ii)}{⊆} span{E_{i_m}, e^R_{i_m}} ∩ span{E_{i_n}, e^R_{i_n}} \overset{(i)}{=} E_{i_m} ∩ E_{i_n} ⊆ E.

Since this holds true for any index j ∈ {j_1, ..., j_{R−r}}, it follows that ΠE = E.
(iv) Let us show that Π e^R_{i_k} = e^R_{i_k} for all k ∈ {1, ..., r}. From the preceding steps we have

Π E_{i_k} \overset{(i)}{=} Π( span{E_{i_k}, e^R_{i_k}} ∩ E ) \overset{(ii),(iii)}{⊆} span{E_{i_k}, e^R_{i_k}} ∩ E \overset{(i)}{=} E_{i_k}.

On the other hand, we have from step (ii) that Π span{E_{i_k}, e^R_{i_k}} = span{E_{i_k}, e^R_{i_k}}, with, as shown in step (i), e^R_{i_k} ∉ E_{i_k}. It follows that Π e^R_{i_k} = e^R_{i_k} for all k ∈ {1, ..., r}.
(v) We have so far shown that, if the columns c_{i_1}, ..., c_{i_r} form a basis of range(C), then Π [e^R_{i_1} ... e^R_{i_r}] = [e^R_{i_1} ... e^R_{i_r}]