• No results found

Perceptual mapping based on three-way binary data

N/A
N/A
Protected

Academic year: 2021

Share "Perceptual mapping based on three-way binary data"

Copied!
62
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Perceptual mapping based on three-way binary dataBüyükkurt, K.B.; Kroonenberg, P.M.

CitationBüyükkurt, K. B., & Kroonenberg, P. M. (2008). Perceptual mapping based on three-way binary data. Kayseri,Turkey. Retrieved from https://hdl.handle.net/1887/13528

Version:Not Applicable (or Unknown)License:Leiden University Non-exclusive licenseDownloaded from:https://hdl.handle.net/1887/13528

Note: To cite this publication please use the final published version (if applicable).

(2)

Perceptual Mapping

Based on Three-Way Binary Data

B. Kemal Büyükkurt, Concordia University, Montreal, Canada

&

Pieter M. Kroonenberg, Leiden University, The Netherlands

01100100

10010101

11100011

(3)

Perceptual Mapping - 1

– Perceptual mapping: Graphical display summarizing consumers’ perceptions of multi-attribute objects.

– Example: Displaying brands in a product class together with their attributes

e.g. brands for treating stomach problems.

– Brunswik’s (1955) Lens Model:

Theoretical foundation for understanding the importance of perceptions in consumer purchases

Perceptions Preferences Choice

(4)

Perceptual Mapping - 2

Goals perceptual mapping

– Aid for strategic marketing decisions

– Summarizing nature and degree of competition among a set brands via key product attributes.

Common application areas – product positioning

– identification of market gaps for new product development.

(5)

• Basic data

– Brands are scored on a number of attributes by several individuals

– Scores averaged over individuals – Result: Brand by Attribute matrix

• Common data analysis techniques – correspondence analysis

– principal component analysis, – multidimensional analysis

– discriminant analysis – factor analysis

Perceptual Mapping - 3

I nd iv id ua ls Attributes

Brands

Average Brands

Attributes

(6)

Perceptual Mapping - 4

• Basic data

– Doctor thinks a brand posesses an attribute =>

score = 1 , if not: score = 0

– Three-way binary data: Brands ×××× Attributes ×××× Doctors – Why average over doctors?

– Different doctors may be sensitive to different attributes

• Three-way data analysis techniques

– Three-mode binary hierarchical cluster analysis

– Three-mode principal component analysis (numerical)

(7)

The Binary Data Cube

i=1,....,I

Objects (Brands)

MODE A k=1,...,K

Subjects (Doctors) MODE C

Fibers Slices

MODE B j=1,...,J

Variables (Attributes)

011001

100101

111000

(8)

Stacked Two-Way Data

Columns: Attributes 1 through J

Doctor 1 (k=1) Doctor 2 (k=2)

Doctor K (k=K)

Rows: Brands 1 through I Rows: Brands 1 through I

Rows: Brands 1 through I

011001

100101

111000

010001

101101

111010

010001

010101

110001

(9)

HICLAS3: Algebraic Representation

(Tucker3-HICLAS)

• Hiclas3 model (uses Boolean algebra)

• m

ijk

=1 iff ã

ip

= 1 and b

jq

= 1 and c

kr

= 1 and g

pqr

= 1 for at least one combination of p, q, and r;

• ã

ip

, b

jq

, c

kr

: elements binary component matrices A, B, and C, respectively (brands, attributes, doctors).

• g

pqr

: element of the P ×××× Q ×××× R three-way binary core array G G G G ,

indicates links between binary components of the three modes

pqr kr

jq ip

R r Q q P ijk p

ijk m a b c g

x ˆ ~ ~ ~ ~

1 1

1 = =

= ⊕ ⊕

=

=

(10)

HICLAS3 – Pictorial Representation

0 1

1 1

0 0

1 0

0 1

1 0

1 0

1 1

1 0

0 1

1 0

1 1

0 1

0 1

0 1

1 0

0 1

1 0

1 0

1 1

A

1 0

1 0

1 0

0 1 B

G

1

C

G

2

1 2 3 4 5 6 7 8

m

211

= 1 as a

22

b

12

c

11

g

221

= 1 ×××× 1 ×××× 1 ×××× 1 (all other 7 combinations contain a zero)

brands attributes doctors core array

c2 b1 b2

a1 a2 a1 a2

b1

b2

c1

(11)

Three-Mode Component Analysis

• Tucker3 model (numerical)

– i=1,...,I (brands); j=1,...,J (attributes); k=1,...,K (doctors);

– m

ijk

is the model matrix or structural image

– a

ip

, b

jq

, c

kr

: elements loading matrices A, B, and C, respectively (brands, attributes, doctors).

– g

pqr

: element of the P ×××× Q ×××× R three-way core array G G G G ;

indicates strength of the link between the components of the three modes

∑ ∑ ∑

= = =

=

= P

p

Q

q

R

r

pqr kr

jq ip

ijk

ijk m a b c g

x

1 1 1

ˆ

(12)

Three-Mode Binary Analysis in Action

Perceptions of Medical Doctors w.r.t.

Gastro-Intestinal Drugs

(13)

Perceptions of Medical Doctors

Gastro-Intestinal Drugs

• Tagamet

• Zantac

• Pepcid

• Axid

• Sulcrate

• Cytotec

• Losec

(14)

Attributes

Adjectives [Binary answers– no (0) or yes (1)]

• Relieves Pain RelPain

• Does not have serious side effects NoSideEf

• Relatively safe w.r.t.

potential interactions with other drugs Safe

• Flexible in terms of dosage FlexDose

• Not too costly for the patient LowCost

• Relieves symptoms RelSymptoms

• Promotes healing Heals

• Prophylactic Prophylactic

(15)

Data: Brands ×××× Attributes ×××× Doctors

(7 ×××× 8 ×××× 283)

i=1,....,7

Objects (Brands)

MODE A

j=1,...,8

Variables (Attributes)

MODE B

k=1,...,283

Subjects (Doctors)

MODE C

(16)

Perceptions of Medical Doctors

Central questions

• What is the position of brands w.r.t. each other?

• Which attributes are related to this positioning?

• Do doctors differ in their perceptions in which brands

have which attributes?

(17)

HiClas3 Model

Tucker3 hierarchical classes model Basic elements

• Binary components for all three modes (doctors, brands and attributes)

• Plus linkage information about the components Basic literature

• Papers by Ceulemans, Van Mechelen in Psychometrika

(Catholic University Leuven, Belgium)

(18)

1,3,3

HiClas3 – Choosing a Model

Brands ×××× Attributes ×××× Doctors

3600 3800 4000 4200 4400 4600 4800

300 400 500 600 700 800 900

N u m b er o f d is cr ep a n ci es

Degrees of freedom 1,1,1

2,2,1 3,3,1

2,1,2

1,2,2

2,2,2 3,2,2 2,3,2

3,3,2

3,1,3

2,2,3 3,2,3

2,3,3 3,3,3

Model complexity: (3,3,2) = (Brands = 3 components ; Attr = 3; Docs = 2)

Discrepancy : Data have a 1, model matrix a 0 and vice versa

(19)

Binary Component Matrices

(brands; attributes)

Brand Brand Brand

Brand DiscreDiscreDiscreDiscre---- pancies pancies pancies

pancies Fit B1 B2 B3Fit B1 B2 B3Fit B1 B2 B3Fit B1 B2 B3 --- --- --- --- Sulcrate

Sulcrate Sulcrate

Sulcrate 659 0.626 1 0 1659 0.626 1 0 1659 0.626 1 0 1659 0.626 1 0 1 Cytotec

Cytotec Cytotec

Cytotec 645 0.564 1 0 0645 0.564 1 0 0645 0.564 1 0 0645 0.564 1 0 0 Zantac 488 0.709 0 0 1 Zantac 488 0.709 0 0 1 Zantac 488 0.709 0 0 1 Zantac 488 0.709 0 0 1 Pepcid

Pepcid Pepcid

Pepcid 388 0.743 0 0 1388 0.743 0 0 1388 0.743 0 0 1388 0.743 0 0 1 Axid

Axid Axid

Axid 467 0.691 0 0 1467 0.691 0 0 1467 0.691 0 0 1467 0.691 0 0 1 Losec

Losec Losec

Losec 499 0.665 0 0 1499 0.665 0 0 1499 0.665 0 0 1499 0.665 0 0 1 Tagamet

Tagamet Tagamet

Tagamet 627 0.589 0 1 0627 0.589 0 1 0627 0.589 0 1 0627 0.589 0 1 0

--- --- --- ---

B1 = Cytoprotective agent B2 = Tagamet (Oldest)

B3 = Histamines; H-2 blocker

Attribute Attribute Attribute

Attribute DiscreDiscreDiscreDiscre---- pancies pancies pancies

pancies Fit A1 A2 A3Fit A1 A2 A3Fit A1 A2 A3Fit A1 A2 A3 --- --- --- Relieves Pain 369 0.79 1 1 1 Relieves Pain 369 0.79 1 1 1Relieves Pain 369 0.79 1 1 1 Relieves Pain 369 0.79 1 1 1 Relieves

Relieves Relieves

Relieves SymptomsSymptomsSymptomsSymptoms 330330330330 0.82 1 1 10.82 1 1 10.82 1 1 10.82 1 1 1 Promotes Health 406 0.77 1 1 1 Promotes Health 406 0.77 1 1 1Promotes Health 406 0.77 1 1 1 Promotes Health 406 0.77 1 1 1 No Side Effects 517 0.60 0 0 1 No Side Effects 517 0.60 0 0 1No Side Effects 517 0.60 0 0 1 No Side Effects 517 0.60 0 0 1 Relatively Safe 542 0.57 0 0 1 Relatively Safe 542 0.57 0 0 1Relatively Safe 542 0.57 0 0 1 Relatively Safe 542 0.57 0 0 1 Flexible Dose 632 0.52 0 1 0 Flexible Dose 632 0.52 0 1 0Flexible Dose 632 0.52 0 1 0 Flexible Dose 632 0.52 0 1 0 Prophylatic

ProphylaticProphylatic

Prophylatic 500 0.61 1 0 0500 0.61 1 0 0500 0.61 1 0 0500 0.61 1 0 0 Low Cost 477 0.00 0 0 0 Low Cost 477 0.00 0 0 0Low Cost 477 0.00 0 0 0 Low Cost 477 0.00 0 0 0 --- --- ---

A1 = Primary medical A2 = Use in practice A3 = Secondary medical

Low Cost had no

relations with

other attributes

(20)

Binary Component Matrices

(doctors)

Doctors Doctors Doctors

Doctors MD1 MD1 MD1 MD1 MD2 MD2 MD2 MD2 f Prop. 1s f Prop. 1s f Prop. 1s f Prop. 1s ---

--- --- --- Doctor Type 1

Doctor Type 1 Doctor Type 1

Doctor Type 1 1 1 1 1 1 1 1 1 70 .73 70 .73 70 .73 70 .73 Doctor Type 2

Doctor Type 2 Doctor Type 2

Doctor Type 2 1 0 1 0 1 0 1 0 69 .50 69 .50 69 .50 69 .50 Doctor Type 3

Doctor Type 3 Doctor Type 3

Doctor Type 3 0 1 0 1 0 1 0 1 98 .61 98 .61 98 .61 98 .61 Doctor Type 4

Doctor Type 4 Doctor Type 4

Doctor Type 4 0 0 0 0 0 0 0 0 46 46 46 46 .28 .28 .28 .28 --- --- --- --- Average

Average Average

Average sd sd sd = .09 sd = .09 = .09 = .09

Doctor Type 4 (0,0) has no links with other doctors

(21)

Binary Core Array

Dr2 (1 0) Dr2 (1 0) Dr2 (1 0)

Dr2 (1 0) A1 A1 A1 A1 A2 A3 A2 A3 A2 A3 A2 A3 Prim.

Prim.

Prim.

Prim. Prac Prac Prac Prac Secon Secon Secon Secon. . . . Med.

Med.

Med.

Med. tice tice tice tice Med. Med. Med. Med.

--- --- --- --- B1 ( B1 ( B1 (

B1 (Cytoprotective Cytoprotective Cytoprotective) 1 0 0 Cytoprotective ) 1 0 0 ) 1 0 0 ) 1 0 0 B2 (

B2 ( B2 (

B2 (Tagamet Tagamet Tagamet Tagamet ) 1 1 0 ) 1 1 0 ) 1 1 0 ) 1 1 0 B3 (Histamines ) 1 0 0 B3 (Histamines ) 1 0 0 B3 (Histamines ) 1 0 0 B3 (Histamines ) 1 0 0 --- --- --- ---

Dr3 Dr3 Dr3

Dr3 (0 1) (0 1) (0 1) (0 1)

--- --- --- --- B1 (

B1 ( B1 (

B1 (Cytoprotective Cytoprotective Cytoprotective) 1 0 1 Cytoprotective ) 1 0 1 ) 1 0 1 ) 1 0 1 B2 (

B2 ( B2 (

B2 (Tagamet Tagamet Tagamet Tagamet ) 0 1 0 ) 0 1 0 ) 0 1 0 ) 0 1 0 B3 (Histamines ) 0 1 1 B3 (Histamines ) 0 1 1 B3 (Histamines ) 0 1 1 B3 (Histamines ) 0 1 1 --- --- --- ---

1 : a link exists between components of the three modes

Dr1 =

Dr2 + Dr3

(22)

No side effects Safe

Doctor Type 2

Sulcrate

Zantac, Axid, Pepcid, Losec Cytotec Tagamet

Relieves pain and symptoms, Promotes health

Low cost Flexible dose

Prophylactic

A

n = 69

(23)

Doctor Type 3

Sulcrate

Zantac, Axid, Pepcid, Losec Cytotec Tagamet

Relieves pain and symptoms, Promotes health

Low cost Flexible dose

Prophylactic No side effects

Safe

n = 98

(24)

Doctor Type 1

Sulcrate

Zantac, Axid, Pepcid, Losec Cytotec Tagamet

Relieves pain and symptoms, Promotes health

Low cost Flexible dose

Prophylactic No side effects

Safe

A

n = 70

(25)

Doctor Types

Sulcrate

Zantac, Axid, Pepcid, Losec Cytotec Tagamet

Relieves pain and symptoms, Promotes health Low cost

Flexible dose Prophylactic

No side effects Safe

A

Dr3 Dr2

Dr1

(26)

Characterisation of Doctor Types

a0, a1, a2,XX a0, ,a2

a0, a1, a2 Tagamet

a0, a1, a2, a3 a0, a1, a2, a3

a0, a1 Sulcrate

a0, a1, XX,a3 a0, a1, ,a3

a0, a1 Cytotec

a0, a1, a2, a3 a0, ,a2, a3

a0, a1 Losec

a0, a1, a2, a3 a0, ,a2, a3

a0, a1 Pepcid

a0, a1, a2, a3 a0, ,a2, a3

a0, a1 Axid

a0, a1, a2, a3 a0, ,a2, a3

a0, a1 Zantac

Dr 1 (n=70) Dr 3 (n=98)

Dr 2 (n=69) Brand Name

a0={Relieves Pain, Relieves Symptoms, Promotes Healing}

• a1={Prophylactic}, a2={Flexible Dosage}, a3={No Side Effects, Safe}

•Doctor Type 4 has no links; Low Cost has no links

(27)

Further Considerations

• No information on Low Cost (more complex HiClas models can model a separate component for Low Cost)

• Tagamet is relatively inexpensive, while the others are not

• Don’t the doctors see this?

• HiClas3 suggest they do not.

(28)

Proportions of Ones across Doctors

Tagamet Tagamet Tagamet

Tagamet Zantac Zantac Zantac PepCid Zantac PepCid PepCid Axid PepCid Axid Losec Axid Axid Losec Losec Sulcrate Losec Sulcrate Sulcrate Sulcrate Cytotec Cytotec Cytotec Cytotec RelievePain

RelievePain RelievePain

RelievePain .8 .8 .8 .8 .9 .8 .7 .9 .8 .7 .9 .8 .7 .9 .8 .7 .8 .7 .8 .7 .8 .7 .8 .7 .5 .5 .5 .5 RelieveSymptoms

RelieveSymptoms RelieveSymptoms

RelieveSymptoms .9 .9 .9 .9 .9 .9 .9 .9 .8 .8 .8 .8 .8 .8 .8 .8 .9 .7 .9 .7 .9 .7 .9 .7 .6 .6 .6 .6 PromotesHealth

PromotesHealth PromotesHealth

PromotesHealth .7 .7 .7 .7 .8 .7 .8 .7 .8 .7 .8 .7 .7 .7 .7 .7 .8 .7 .8 .7 .8 .7 .8 .7 .6 .6 .6 .6 NoSideEffect

NoSideEffect NoSideEffect

NoSideEffect .3 .3 .3 .3 .7 .7 .7 .7 .6 .6 .6 .6 .6 .6 .6 .6 .4 .4 .4 .4 .8 .8 .8 .8 .4 .4 .4 .4 RelativeSafe

RelativeSafe RelativeSafe

RelativeSafe . .2 . . 2 2 2 .6 .6 .6 .6 .5 .5 .5 .5 .5 .5 .5 .5 .4 .4 .7 .4 .4 .7 .7 .7 .4 .4 .4 .4 FlexbileDose

FlexbileDose FlexbileDose

FlexbileDose .7 .7 .7 .7 .7 .7 .7 .7 .5 .4 .3 .4 .3 .5 .4 .3 .4 .3 .5 .4 .3 .4 .3 .5 .4 .3 .4 .3 Prophylactic

Prophylactic Prophylactic

Prophylactic . . .4 . 4 4 4 .5 .4 .3 .2 .6 .5 .4 .3 .2 .5 .4 .3 .2 .5 .4 .3 .2 .6 .6 .6 .7 .7 .7 .7 LowCost

LowCost LowCost

LowCost .7 .7 .7 .7 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .2 .0 .3 .1 .0 .3 .1 .0 .3 .1 .0 .3 .1

(29)

Further Considerations

• Surprise

Tagamet is the only relatively inexpensive brand

• Possible reason:

Doctors from all groups say Tagamet is not expensive.

Thus unrelated to the present groups.

• Possible solution:

More groups for attributes (we are working on this)

• Question

Other variability not present in HiClas solution?

(30)

Further Analyses

• Treat the binary data as numerical and analyse with Tucker3.

• Handle the data such that emphasis is on:

– relative differences between brands

– relative differences between attributes.

(31)

Three-Mode Component Analysis

•Concentrate on consensus and individual

differences between doctors in the relationships between brands and attributes.

• Absolute differences between brand and between attributes are ignored.

M

A B C D E

0 5

M

A B C D E

5 10 M

A B C D E

-2 +2

M

A B C D E

-2 +2

(32)

Component Scores

1.2 0.9

0.6 0.3

0.0 -0.3

Tucker3_332 Subject Component 1 0.50

0.25

0.00

-0.25

-0.50

-0.75

Tucker3_332 Component 2

Consensus among doctors

In d iv id u a l d if fe re n ce s b et w ee n d o ct o rs

*

*

*

(33)

Joint Biplot

(Consensus among doctors - Mean)

First Component

-3

-2

-1 0 1 2

-3 -2 -1 0 1 2

S ec o n d C o m p o n en t

Tagamet Zantac

Pepcid

Sulcrate Cytotec

Axid Losec RelPain

NoSideEff Safe FlexDose

LowCost

RelSymptoms Heals

Prophylactic Mean of each brand

and each attribute

(34)

Joint Biplot

(Individual differences between doctors - Deviations from mean)

-2 -1.5

-1 -0.5

0

0.5 1 1.5

-2 -1.5 -1 -0.5 0 0.5 1 1.5

S e c o n d C o m p o n e n

t

First Component Tagamet

Zantac Pepcid Sulcrate

Cytotec

Axid Losec

Relieves Pain

NoSideEff Safe

FlexDose

LowCost Relieves

Symptoms Heals

Prophylatic

(35)

Conclusions - 1

HiClas model

• Given the data are binary, the binary hierarchical classes model is an obvious analysis method and has a relatively straightforward interpretation.

• Effective graphics to display results

• Many components might be necessary to model all

systematic variability present.

(36)

Conclusions - 2

Tucker3 model

• By using a numerical model variance can be portrayed in a different and also insightful manner

• Differential weighting may simplify model description

• Enlightning graphics are available (joint biplots), but it

requires some training to understand them

(37)

Conclusions - 3

Substantive conclusions concern the perceptual mappings of the brands with respect to the attributes as seen by the doctors.

The main patterns have been discussed during the

presentation and will not be repeated.

(38)

Thank You.

01100100 10010101 11100011

kemalbk@jmsb.concordia.ca

kroonenb@fsw.leidenuniv.nl

(39)

Tucker3 Model in Matrix Notation

A, (I × P) loadings matrix for brands

B, (J × Q) loadings matrix for attributes C, (K × R) loadings matrix for subjects

G, (P × Q × R) core array with links between the components

εεεε

+

= AG ( B ' C ' )

X

(40)

PARAFAC/CANDECOMP Model :

• (Harshman 1970, 1976; Harshman and Lundy 1984, 1994; Carroll and Chang 1970)

• Based on the principle of Parallel Proportional Profiles (Cattell 1944).

=

=

= S

s

ks js

is sss

ijk

ijk m g a b c

x

1

ˆ

m

ijk

is the model matrix or structural image A is the (I × S) loadings matrix for brands B is the (J × S) loadings matrix for attributes C is the (K × S) loadings matrix for subjects G is the (S × S × S) superdiagonal core array

exclusive links between the components s of the three modes

(41)

MODELS NUMBER OF COMPONENTS STANDARDIZED Number St.Fit/#Param A B C SS of (x1000) ________ _____ _____ _______ ____________ Param. ____________

TUCKALS2 3 3 --- .49 2754 .19 TUCKALS2 2 3 --- .40 1723 .23 TUCKALS3 3 3 5 .40 1462 .27

TUCKALS2 2 2 --- .31 1154 .27 TUCKALS3 2 2 4 .31 1154 .27 TUCKALS3 2 3 4 .35 1165 .30 TUCKALS3 3 3 4 .37 1179 .37

TUCKALS3 3 3 3

TRILIN 3 3 3 .32 888 .36 TUCKALS3 2 3 3 .32 883 .36

TUCKALS3 2 2 2 .27 592 .46 TRILIN 2 2 2 .27 592 .46 TUCKALS3 2 3 2 .28 599 .47 COMPUTATION OF NUMBER OF PARAMETERS:

A + B + C + core - transformational freedom

--- TUCKALS2: I*P + J*Q + + P*Q*K - P**2 - Q**2

TUCKALS3: I*P + J*Q + K*R + P*Q*R - P**2 - Q**2 - R**2 PARAFAC : I*S + J*S + K*S + S - S - S - S

Three-Mode Components Analysis: Model

Comparison

(42)

Varimax Rotation: Deciding On Weights

Relative Weights Varimax Value

A B C Core A B C

unrotated 2.136 1.130 1.099 1.503

0 0 0 2.782 1.600 1.404 1.488

0.5 0.5 0.5 2.665 2.635 2.336 1.490 1.0 1.0 1.0 2.603 2.643 2.416 1.491 1.5 1.5 1.5 2.561 2.644 2.446 1.494 2.0 2.0 2.0 2.519 2.644 2.463 1.501 2.5 2.5 2.5 2.443 2.644 2.475 1.523 3.0 3.0 3.0 2.180 2.635 2.454 1.678 3.5 3.5 3.5 2.134 2.637 2.465 1.679 4.0 4.0 4.0 2.099 2.638 2.473 1.679 4.5 4.5 4.5 2.071 2.640 2.478 1.679 5.0 5.0 5.0 2.049 2.640 2.482 1.680 5.5 5.5 5.5 2.030 2.641 2.484 1.680 100 100 100 1.895 2.644 2.470 1.681 1000 1000 1000 1.843 2.644 2.498 1.681 10000 10000 10000 1.842 2.644 2.498 1.681 0.5 0.5 3.0 2.629 2.641 2.274 1.520 0.5 1.0 3.0 2.522 2.644 2.424 1.522 1.0 1.0 3.0 2.522 2.644 2.424 1.522 1.0 1.0 3.5 2.582 2.596 2.250 1.678

(43)

-2 -1.5 -1 -0.5 0 0.5 1 1.5

-2 -1.5 -1 -0.5 0 0.5 1 1.5

Third Component

First Component

Joint biplot for Brands and Attrubutes First versus Third Component for Second Component of Doctors

16/05/08 10:48:17

Tagamt Zantac PepCid Sulcrt

Cytotc

Axid

Losec RlvPn

NoSiEf RelSaf

FlxDoz NotCst RlvSym PromHl

Prophy

(44)

Components for Brands and Attributes

Mode

Unrotated Components (Orthonormal)

Components After Varimax Rotation of

the Core Matrix

Components After Joint Varimax Rotation of Components and the

Core

1 2 3 1 2 3 1 2 3

Brands:

A .595 -.701 .836 -.385 .919 .034

G .098 .213 -.001 .234 -.175 .371

D -.609 -.385 -.390 -.606 -.220 -.414

E -.462 -.083 -.384 -.270 -.074 -.716

B .089 .304 -.048 .031 -.159 .259

C .105 .285 -.025 .302 -.184 .258

F .183 -.367 .011 .410 -.107 .209

Attributes:

Inexp .481 .536 -.012 .573 -.436 -.001 .709 .100 .080

NoSiEf -.350 -.222 .127 -.361 .188 -.149 -.419 -.005 .111

Safe -.467 -.251 .229 -.461 .225 -.266 -.535 -.001 .218

Prophy -.490 .575 -.421 -.424 -.736 .160 -.006 .863 -.064

RelPain .291 -.302 -.122 .194 .301 .249 .050 -.327 -.284

FlexDo .130 .195 .762 .302 .036 -.737 .177 -.261 .732

RelSym .245 -.278 -.267 .130 .234 .371 .037 -.224 -.397

Heals .160 -.253 -.296 .048 .188 .373 -.013 -.145 -.395

(45)

Core Array

Unrotated Varimax Rotation of the Core Only

Joint Varimax Rotation of the Components and the

Core Components for

Brands:

Components for Attributes

Frontal Slice 1:

1 2 3 1 2 3 1 2 3

1 13.777 -2.269 2.164 14.441 .045 .550 14. 105 -1.235 .610

2 -3.628 -10.714 .950 .202 10.959 .562 .592 -9.748 .563

Frontal Slice 2:

1 2 3 1 2 3 1 2 3

1 -2.081 5.647 3.469 -.672 4.805 2.312 -.048 2.372 -.804

2 -5.500 -.754 4.409 4.306 1.911 7.113 -2.241 7.680 7.997

(46)

Assessment of Goodness of Model Fit:

• Kroonenberg and De Leeuw (1980), and Kroonenberg (1983) show that

SS(Residual) = SS (Total) - SS(Fit)

SS Accounted For = SS(Fit) / SS (Total)

• Also, as it has been shown by Ten Berge, De Leeuw, and Kroonenberg (1987), when the ALS algorithm has converged,

SS (Residual

m

) = SS (Total

m

) - SS(Fit

m

)

where m stands for any level of any mode of the data matrix.

• Using the last relationship, the relative fit of individual levels of a mode can be

established. Also, whether a given level fits the model well or badly can be

determined .

(47)

Model selection Tucker3 model

1000 1100 1200 1300 1400

2 3 4 5 6 7 8 9 10

Deviance (SS(Residual))

Sum of Numbers of Components (S = P + Q + R) Deviance versus Sum of Numbers of Components

(Three-Mode Scree Plot)

16/05/08 09:58:28

1x1x1

1x2x2

1x3x3 2x1x2

2x2x1

2x2x2

2x2x3 2x3x2

2x3x3 3x1x3

3x2x2

3x2x3 3x3x1

3x3x2

3x3x3 Deviance versus Sum of Numbers of Components

(48)

Tucker 3 Solutions

Raw SS Standardized SS

SS(Total) 1564.857 1.0000

A.EST.SS(Fit) 885.938 .5661

B.EST.SS(Fit) 1008.107 .6442

C.EST.SS(Fit) 465.270 .2973

SS(Fit) 430.970 .2754

SS(Residual) 1133.887 .7246

DF = Number of data points (minus loss of information

due to preprocessing or missing data) minus the number of independent parameters

Number of independent parameters =

(I*P) + (J*Q) + (K*R) + (P*Q*R) - P**2 - Q**2 - R**2

with I, J, K the numbers of levels of 1st, 2nd, and 3rd modes, respectively,

and P, Q, R the numbers of components of 1st, 2nd, and 3rd modes, respectively.

(49)

Relating Subject Components to External Variables

Y: Number of years of experience as a medical doctor (standardized)

X

1

: First component score for “subjects” mode,

X

2

: Second component score for “subjects” mode,

Linear Model: Y = X

1

B

1

+ X

2

B

2

Estimates: B

1

= -1.054 , std. error = 0.997, t= 1.058, p-value=0.29

B

2

= 1.350, std. error = 0.997, t =1.345, p-value=0.18

R

2

=0.01, F-value=1.477, df=(2, 282), p-value=0.23.

• Conclusion: Subject components are not related to number of years

of experience as a medical doctor.

(50)

Relating Residuals to External Variables:

• Y: Number of years of experience as a medical doctor (standardized)

• X: Sum of squares of residuals for each subject

• Linear Regression: Y = B X

• Estimated B = - 0.006, R

2

= 0.0007, F-value = 0.212, d.f. = (1, 282)

• p-value = 0.646

• Conclusion : Residuals are not related to number of years of

experience.

(51)

Result HiClas3-model (2D ×××× 3A ×××× 3B)

Sulcrate

Zantac, Axid, Pepcid, Losec Cytotec Tagamet

Relieves pain and symptoms, Promotes health

Low cost

Flexible dose

Prophylactic No side effects

Safe

D1,D3 D1,D3 D1,D3

D1,D2 D1,D2

D4

D1 (1 1) = 70 D2 (1 0) = 69 D3 (0 1) = 98 D4 (0 0) = 46

(52)

HICLAS3: Example

(Tucker3-HICLAS)

Accused by instructor;

People tell lies about you;

Persistently contradicted;

Unfairly blamed for error Ignored in restaurant;

Disconnecting operator;

Closing store;

Missing page in book;

Grimace

Turn away; Lose patience;

Feel irritated; Curse

Become enraged;

Become tense;

Heart beats faster

P1 P3

P1 P3 P2

P3

Hands tremble;

Perspire; Want to strike

Stimuli

Subjects

Response

Based on example Leuven group

(11)

(10)

(010) (100)

(011) (111)

(53)

HiClas3 – Three-mode scree plot

Doctors ×××× Attributes ×××× Brands

3600 3800 4000 4200 4400 4600 4800

1 2 3 4 5 6 7 8 9

Number of discrepancies

Sum of Numbers of Components (S = P + Q + R) 1,1,1

1,2,2

1,3,3 2,1,2

2,2,1

2,2,2 2,2,3 2,3,2

2,3,3 3,1,3

3,2,2

3,2,3 3,3,1

3,3,2

3,3,3

**

(54)

Preprocessing: Double Centring

(in three-mode component analysis)

• Double centring:

doctors may not use the

attrubutes uniformly across the brands and across the attributes.

• Double centring:

Scores in deviations from brand means

attribute means.

Origin = zero point for both brands and attributes of each subject’s scores

k k

i jk

ijk

ijk x x x x

x * = − .. + ..

1...j...J

1 … .i.I B ra n d s

Attributes 1…

k…

.K D oc to rs

J x K matrix of means removed

jk

ik

K I x

m at ri x of m

ea n s re m ov ed

x . jk k

x i .

(55)

ijk ij

jk ik

k j

i

ijk

m a b c ac bc ab abc

x = + + + + + + +

Preprocessing: Double centring

ijk ij

jk ik

k j

i

ijk

m a b c ac bc ab abc

x = + + + + + + +

Three-way factorial design without replacement (1 observation per cell):

Dependent variable: Brand possesses attribute (score = 1)

After centring:

Analysed with Three-mode PCA

ab

ij

= consensus of doctors about attributes of brands

abc

ijk

= differences between doctors about attributes of brands

(56)

Model selection Tucker3 model

1000 1100 1200 1300 1400

-1040 -1020 -1000 -980 -960 -940 -920 -900

Deviance (SS(Residual))

Degrees-of-Freedom

Deviance versus Degrees-of-Freedom

1x1x1 2x2x1

3x3x1

2x1x2

1x2x2 2x2x2

3x2x2

2x3x2

3x3x2

3x1x3

2x2x3

3x2x3

1x3x3

2x3x3 3x3x3

Model complexity: (Docs = 2; Attr = 3; Brands = 3) or (Docs = 3; Attr = 3; Brands = 3)

(57)

Joint biplot

(Consensus)

First Component

-3 -2 -1 0 1 2

-3 -2 -1 0 1 2

T h ir d C o m p o n e n t

Tagamet

Zantac Pepcid

Sulcrate Cytotec

Axid Losec

RelPain

NoSideEff Safe

FlexDose LowCost

RelSymptoms Heals

Prophylactic

(58)

Joint biplot (Consensus)

-3 -2 -1 0 1 2

-3 -2 -1 0 1 2

Second Component

Tagamet Zantac

PepCid

Sulcrate Cytotec

Axid Losec RelPain

NoSideEff Safe FlexDose

LowCost

RelSymptoms Heals

Prophylactic

-3 -2 -1 0 1 2

-3 -2 -1 0 1 2

Third Component

Tagamet

Zantac Pepcid

Sulcrate Cytotec

Axid Losec

RelPain

NoSideEff Safe

FlexDose LowCost

RelSymptoms Heals

Prophylactic

(59)

Conclusions

Where to go from here

• Irregular patterns in some doctors combined with low number of ones were excluded from the HiClas analysis while these doctors were

scattered all over the plot of the doctors’ components.

Thus Tucker analysis picked up some information which was not available to the HiClas analysis. Similarly for the LowCost attribute.

• Is the numerical information such as the variance somewhere to be found in the HiClas results and if so can it be used?

• Construct exactly fitting hierarchical classes models and run a Tucker3 analysis on them.

• Construct doctors/attributes/brands artificially according to a specific pattern and include them in the analysis to facilitate interpretation.

• Sort out the mathematics of the comparison between models.

(60)

HiClas3 – Three-Mode Deviance Plot

Doctors ×××× Attributes ×××× Brands

3600 3800 4000 4200 4400 4600 4800

300 400 500 600 700 800 900

Number of discrepancies

Degrees of freedom 1,1,1

1,2,2 1,3,3

2,1,2

2,2,1 2,2,2 2,2,3

2,3,2

2,3,3

3,1,3

3,2,2

3,2,3

3,3,1 3,3,2

3,3,3

Model complexity: (Docs = 2; Attr = 3; Brands = 3) or

(Docs = 3; Attr = 3; Brands = 3)

**

(61)

Component Scores

1.2 0.9

0.6 0.3

0.0 -0.3

Tucker3_332 Subject Component 1 0.50

0.25

0.00

-0.25

-0.50

-0.75

Tucker3_332 Component 2

00 = Dr4 01 = Dr3 10 = Dr2 11 = Dr1 HiClas3_332

Consensus among doctors

In d iv id u a l d if fe re n ce s b et w ee n d o ct o rs

(62)

Gastro –SVD-Biplot

(means across doctors - attributes centred)

-0.6 -0.4 -0.2 0 0.2 0.4

-0.6 -0.4 -0.2 0 0.2 0.4 Second Component

First Component

24/06/08 13:41:59

RlvPn

NoSiEf RelSaf

FlxDoz

NotCst RlvSym

PromHl

Prophy

Tagamet ZantacPepCid

Sulcrt

Cytotc Axid

Losec

-0.6 -0.4 -0.2 0 0.2 0.4

-0.6 -0.4 -0.2 0 0.2 0.4 First Component

RlvPn NoSiEf

RelSaf

FlxDoz

NotCst RlvSym

PromHl Prophy

Tagamet Zantac

PepCid Sulcrt

Cytotc Axid

Losec

Third Component

Referenties

GERELATEERDE DOCUMENTEN

Longitudinal assessment of child behaviour based on three-way binary dataKroonenberg, P.M.; Büyükkurt, K.B.; Mesman, J..

Abstract: k-Adic formulations (for groups of objects of size k) of a variety of 2- adic similarity coefficients (for pairs of objects) for binary (presence/absence) data are

The proposed extension—which includes the earlier model as a special case—is obtained by adapting the multilevel latent class model for categorical responses to the three-way

To us, WR 46 is “strange” because of its short-periodic photometric double-wave, its peculiar radial-velocity be- haviour, from being constant to large amplitude cycles, and

Exploiting the link between this process and the 1D Random Field Ising Model (RFIM), we are able to identify the Gibbs potential of the resulting Hidden Markov process.. Moreover,

By setting the distance between the two categories of every response variable to be equal, the MLD model becomes equivalent to a marginal model for multivariate binary data

Applying the Pearson and Deviance goodness-of-fit tests to a model with two categorical predictors revealed a distinct patter in the statistical power of the test

PMKT2, a new killer toxin from Pichia membranifaciens, and its promising biotechnological properties for control of the spoilage yeast