• No results found

Seed by Example

N/A
N/A
Protected

Academic year: 2021

Share "Seed by Example"

Copied!
28
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Seed by Example

A conjoint solution to the cold-start problem in recommender systems

by

(2)

Recommender systems

• Important driver of online revenue

• Used by Amazon, Netflix, Reddit, Linkedin

• Clearly a marketing topic, yet usually discussed in

machine learning and information retrieval domains

(3)

Research goals

• Reduce cold-start by using seed data

• Performance comparison between seeded and

unseeded recommender

(4)

Collaborative filtering

• Find preference patterns between similar users

• ‘Birds of a feather flock together’

Marketing Marketing Research Product Management Consultant Product
 Management Consultant

?

?

(5)

Cold-start problem

Marketing Marketing Research Product Management Consultant

?

Consultant Marketing

?

• No information about new user

(6)

• No overlap, no similarity Marketing Marketing Research

?

Consultant Product
 Management

?

?

?

Annemarie Maurice Sophie Krik

(7)

Marketing

?

?

?

?

?

?

?

• New online service has no data

Annemarie Maurice Sophie Krik

(8)

What if there were users that rated every job,

always!?

• Conjoint analysis to compute utility levels for every job • Extract latent classes (underlying segments)

• Transform utility to choice probability (MNL Logit)

• Compute choice probability for each job, cut-off: 50%

• Generate seed node from transformed latent class profiles


(9)

Conjoint experiment

• Existing dataset from marketing engineering course

winter 2014/2015

N = 158

• Fractional factorial conjoint choice elicitation task

(10)
(11)

Model selection

• Estimate 8 models

• Numeric or nominal attributes


(12)

Model selection

• Log-likelihood ratio > test against null model and

each other

• Model 7 selected for latent class analysis


(13)

Latent Class Analysis

(14)

Latent Class Analysis

• More than 5 classes (kNN = 5)

• Estimate several models with different classes

(15)

Multinomial logit

• Transform utility to choice probability

(16)
(17)

Latent Class Analysis

(18)

Latent Class Analysis

(19)
(20)

Seeded Collaborative Filter

No Yes Yes ? Job 1 Yes Yes ? ? Job 2 No No ? ? Job 3 Yes No ? No Job 4 No Yes ? ? Job 5 ? ?

• Absence of overlap solved

(21)

Implementation

• CF implemented with Raccoon

• Experiment designed with custom survey HTML /

CSS / JavaScript

(22)

Collaborative filter

(23)

k-Nearest Neighbor

(24)

Jaccard similarity

• Similarity measure between user and kNNs

• Divide intersection by union (overlap / total)

• Common binary like / dislike similarity measure

(25)

Performance test

• 2 groups


CF without seed nodes (N = 50)
 CF with seed nodes (N = 50)

(26)
(27)

Conclusion

• Seeding outperforms not seeding

• Cold start partially addressed

• Managers can use this method without changes to algorithm • Step towards full conjoint recommender


(hybrid top N-recommender with conjoint choice sets)


(28)

Referenties

GERELATEERDE DOCUMENTEN

To be concluded, based on our primary and secondary findings of price and dual-response attribute, as well as the evaluation of model fit and its relation

In order to get a better insight of data and have a model that can explain the underlying needs of job seekers, an aggregated model is built, in the model, every variable list

Preference class q membership is determined using of respondents’ age, gender, and occupation, whereas scale class s membership is determined based on respondent engagement

Ook voor niet typische soorten is nog duidelijk een behandelingseffect aanwezig in 2019 (Tab. 3.5), maar dit effect kon door totale afwezigheid van deze soorten in de

tijden, langere standtijden, betere maatnauwkeurigheid, een betere oppervlaktekwaliteit en dikwijls lagere bewerkings- kosten worden bereikt. Het feit of lagere

Bovendien is het ook belangrijk het draagvlak in de samenleving te onderzoeken en ook hier ervaringen op te doen die belangrijk zijn voor toekomstige initiatieven. Hoe kunt u

Gedurende twee jaar (2004-2005) werd, onder andere, de gasvormige emissies (ammoniak, broeikasgassen, geur) uit verschillende bronnen (stal, mesttoediening, mestopslag) gemeten.

The first phase of this study used Grounded Theory to propose a theoretical framework that illustrated how ICV provides a firm with a strategic process that effectively