SlideShare a Scribd company logo
1 of 42
Download to read offline
Admixture of Poisson MRFs: A New Topic
Model with Word Dependencies
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon
April 30, 2015
* Presenter
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Analyzing Large Collections of Documents
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Bag of Words Matrix:
- Removes order and
syntax information
- Unrealistic but powerful
2
Model
Computation
3
Summary
4
Collection of
Documents
1
)
Examples:
1. Research papers
2. News articles
3. Twitter posts
1
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Analyzing Large Collections of Documents
Model
Computation
3
Summary
4
Collection of
Documents
1
)
Examples:
1. Research papers
2. News articles
3. Twitter posts
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Bag of Words Matrix:
- Removes order and
syntax information
- Unrealistic but powerful
2
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Analyzing Large Collections of Documents
Summary
4
Collection of
Documents
1
)
Examples:
1. Research papers
2. News articles
3. Twitter posts
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Bag of Words Matrix:
- Removes order and
syntax information
- Unrealistic but powerful
2
Model
Computation
3
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Analyzing Large Collections of Documents
Collection of
Documents
1
)
Examples:
1. Research papers
2. News articles
3. Twitter posts
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Bag of Words Matrix:
- Removes order and
syntax information
- Unrealistic but powerful
2
Model
Computation
3
Summary
4
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Research Paper Example - Top Words
Top Words
(Frequency)
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Collection of
Documents
Digital
Representation
Model
Computation Summary
Examples:
1. Research papers
2. News articles
3. Twitter posts
Bag of Words Matrix:
- Removes order and
syntax information
- Unrealistic but powerful
1
Titles of Research Papers:
1. Machine Learning (ICML, NIPS)
2. Communication Networks (INFOCOM)
3. Programming Languages (PLDI, CAV, POPL, OOPSLA)
2
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Research Paper Example - Topic Modeling
Topic
Modeling
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Collection of
Documents
Digital
Representation
Model
Computation Summary
Examples:
1. Research papers
2. News articles
3. Twitter posts
Bag of Words Matrix:
- Removes order and
syntax information
- Unrealistic but powerful
1
Titles of Research Papers:
1. Machine Learning (ICML, NIPS)
2. Communication Networks (INFOCOM)
3. Programming Languages (PLDI, CAV, POPL, OOPSLA)
2
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Applications for Topic Modeling
Applications
1. Summarize/Visualize [Hall et al. 2008]
2. Word sense disambiguation [Boyd-Graber et al. 2007]
3. Multi-lingual understanding [Mimno et al. 2009]
4. Information retrieval [Wei & Croft 2006]
Different domains
1. Genetics [Pritchard et al. 2000 (14,000 citations)]
2. Computer vision [Li et al. 2010]
3. Social networks [Airoldi et al. 2008]
4. Social science surveys [Roberts et al. 2014]
5. Social E-commerce [Hu et al. 2014]
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Latent Semantic Analysis (LSA)
2
k p
n
k
k k
U VT
Σ
Singular Value
Decomposition
3
Low Dimensional
Document
Representation
“Latent Topic”
4
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
n
p
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Latent Semantic Analysis (LSA)
3
Low Dimensional
Document
Representation
“Latent Topic”
4
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Singular Value
Decomposition
n
p
2
k p
n
k
k k
U VT
Σ
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Latent Semantic Analysis (LSA)
“Latent Topic”
4
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Singular Value
Decomposition
n
p
2
k p
n
k
k k
U VT
Σ
3
Low Dimensional
Document
Representation
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Latent Semantic Analysis (LSA)
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
Singular Value
Decomposition
n
p
2
k p
n
k
k k
U VT
Σ
“Latent Topic”
4
3
Low Dimensional
Document
Representation
Positive and negative values difficult to interpret
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Probabilistic Topic Models
3
Topic Weights
per Document
“Topics”
(Word weights)
4
1
Doc 1
Doc 2
Doc 3
Doc 4
...
“networks”
“learning”
“program
m
ing”
…
Digital
Representation
n
p
2
k p
n
k
Probabilistic Topic
Models
Related through
probability model
Probability vectors are much easier to interpret
LDA - Added Bayesian priors for regularization
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Comparison of 2D Projections
SVD dimensions are difficult
to interpret
APM has smooth
distribution compared to
LDA
SVD
LDA
APM
comm−net.6978
mach−learn.8925
prog−lang.6618
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Extensions/Variants
1. Add time information [Blei & Lafferty 2006]
2. Add author information [Rosen-Zvi et al. 2004]
3. Add document category information [Mcauliffe & Blei 2008]
4. Automatically discover number of topics [Teh et al. 2006]
5. Model correlation between topics [Blei & Lafferty 2006]
6. . . .
7. . . .
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Brief History - Extensions/Variants
1. Add time information [Blei & Lafferty 2006]
2. Add author information [Rosen-Zvi et al. 2004]
3. Add document category information [Mcauliffe & Blei 2008]
4. Automatically discover number of topics [Teh et al. 2006]
5. Model correlation between topics [Blei & Lafferty 2006]
6. . . .
7. . . .
Previous models - topics only have weights for single words
Our model - topics have weights for pairs of words
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Interpreting Topics
LDA 3 topics
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Interpreting Topics
LDA 3 topics
LDA 6 topics
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Interpreting Topics
LDA 3 topics
LDA 30 topics
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Interpreting Topics
LDA 3 topics
LDA 30 topics
APM 3 topics
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Overview of APM
Admixture of Poisson MRFS
(APM)
Multinomial Admixture Poisson MRF
Gaussian
MRF
Mixture
LDA
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Overview of APM
Admixture of Poisson MRFS
(APM)
Multinomial Admixture Poisson MRF
Gaussian
MRF
Mixture
LDA
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Mixtures
Multiple sub-populations
The sub-populations are usually unknown a priori
Each individual from the population comes from exactly one
subpopulation
Figure source: Kalai, Moitra, and Valiant. Disentangling Gaussians. Communications
of the ACM. 2012.
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Admixtures
Mixtures - Draws from single
component distribution. (Top)
Admixtures - Draws from a
distribution whose parameters are a
convex combination of component
parameters. (Bottom)
x2
x1
"Documents"
Mixture
Components
x2
x1
Dense
"Topic"
Sparse
"Document"
Dense
"Document”
Sparse
"Topic"
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Overview of APM
Admixture of Poisson MRFS
(APM)
Multinomial Admixture Poisson MRF
Gaussian
MRF
Mixture
LDA
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Gaussian MRFs
Allows for dependencies between variables
What if the data dimension is large?
If dimension is 1000, 10002
/2 =500,000 parameters
Assume some conditional independence between variables.
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Independent PMRFIndependent PMRF
Count of Word 1
CountofWord2
0 2 4 6 8
8
6
4
2
0
1. Each conditional (”slice”)
of a PMRF is 1-D Poisson.
2. Distinct from Gaussian
MRF
3. Positive dependencies can
model word co-occurence.
Positive Dependency PMRFPMRF Positive Dependency
Count of Word 1
CountofWord2
0 2 4 6 8
8
6
4
2
0
Negative Dependency PMRFPMRF Negative Dependency
Count of Word 1
CountofWord2
0 2 4 6 8
8
6
4
2
0
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Poisson MRFs [Yang et al., 2012]
P(A | B, C) P(B | A, C) P(C | A, B)
P(A, B, C) ??
If we assume the node conditional distributions are Poisson,
does there exist a joint MRF distribution
that has these conditionals?
Poisson MRF joint distribution:
Pr
PMRF
(x | θ, Θ) ∝ exp θT
x + xT
Θx −
p
s=1
ln(xs!) .
Node conditionals are 1-D Poissons:
Pr(xs | x−s, θs, Θs) ∝ exp{ (θs + xT
Θs
ηs
) xs − ln(xs!) }.
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Overview of APM
Admixture of Poisson MRFS
(APM)
Multinomial Admixture Poisson MRF
Gaussian
MRF
Mixture
LDA
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Admixture of Poisson MRFs (APM) [Inouye et al. 2014]
APM replaces standard Multinomial with Poisson MRF
Pr
APM
(x, w, θ1...k
, Θ1...k
)
= Pr
PMRF
x ¯θ =
k
j=1
wj θj
, ¯Θ =
k
j=1
wj Θj
Pr
Dir
(w)
k
j=1
Pr(θj
, Θj
)
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
APM Algorithm
1. Optimization problem is not convex
2. Want to exploit parallel computing
3. Large optimization problem: APM has O(kp2) parameters
versus O(kp) for LDA
LDA(k = 5, p = 1000) ⇒ 5,000 parameters
APM(k = 5, p = 1000) ⇒ 5,000,000 parameters
APM(k = 5, NNZ(Θ) = 10 per word) ⇒ 50,000 free
parameters
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Parallel Alternating Newton-like Algorithm
1. Split the algorithm into alternating convex problems
arg min
Φ1,Φ2,··· ,Φp
−
1
n
p
s=1
tr(Ψs
Φs
) −
n
i=1
exp(zT
i Φs
wi ) +
p
s=1
λ vec(Φs
)1 1
arg min
w1,w2,··· ,wn∈∆k
−
1
n
n
i=1
ψT
i wi −
p
s=1
exp(zT
i Φs
wi )
where zi = [1 xT
i ]T
Ψs
= f (X, W)
φj
s = [θj
s (Θj
s )T
]T
ψi = f (X, Φ1...k
)
Φs
= [φ1
s φ2
s · · · φk
s ]
2. Subproblems in summation can be computed in parallel
3. Use fast Newton-like optimization method [Hsieh et al. 2014]
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Timing Results on Wikipedia Dataset (k = 5, λ = 0.5)
1
3.1
3.4
0.6
2.2 2.2
0
1
2
3
4
n = 20,000
p = 5,000
# of Words = 50M
n = 100,000
p = 5,000
# of Words = 133M
n = 20,000
p = 10,000
# of Words = 57M
Time(hrs)
APM Training Time on Wikipedia Dataset
1st Iter. Avg. Next 3 Iter.
Algorithm scales approximately as O(np2)
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Parallel Speedup
0
5
10
15
20
0 5 10 15 20
Speedup
# of MATLAB Workers
Parallel Speedup on BNC Dataset
Perfect Speedup
Actual Speedup
BNC dataset has n = 4049 and p = 1646
Speedup could be O(min(n, p)) on distributed system
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Evaluating APM: No Direct Evaluation of Edge Parameters
Previous metrics evaluate the similarity of word pairs
[Newman et al. 2010, Mimno et al. 2011, Aletras and Court 2013]
Averaged statistic for all 10
2 pairs of top words computed
Attempted to correlate with human judgment
Unlike previous topic models, APM explicitly models
dependencies between words
How can we semantically evaluate the parameters for these
dependencies?
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Evocation [Boyd-Graber et al. 2006]
Evocation denotes the idea of which words “evoke” or “bring
to mind” other words
Different types of evocation:
1. Rose - Flower (example)
2. Brave - Noble (kind)
3. Yell - Talk (manner)
4. Eggs - Bacon (co-occurence)
5. Snore - Sleep (setting)
6. Wet - Desert (antonymy)
7. Work - Lazy (exclusivity)
8. Banana - Kiwi (likeness)
Distinctive from word similarity or synonymy
Collected human scores for approximately 15% of word pairs
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Evocation Metric Illustration
Word Pair H M
w1 ↔ w2
w1 ↔ w3
w1 ↔ w4
w2 ↔ w3
w2 ↔ w4
w3 ↔ w4
w2 ↔ w3
w2 ↔ w4
w3 ↔ w4
w1 ↔ w3
w1 ↔ w2
w1 ↔ w4
Word Pair H M
w2 ↔ w4
w3 ↔ w4
w1 ↔ w3
w1 ↔ w4
Word Pair H M
Rank by model weights M Sum top-m human scores H
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Models for Comparison
APM: Admixture of Poisson MRFs
APM-LowReg: Very small regularization parameter
APM-HeldOut: Chooses λ from held-out documents
CTM: Correlated Topic Models
HDP: Hierarchical Dirichlet Process (Non-parametric)
LDA: Latent Dirichlet Allocation
RSM: Replicated Softmax (Undirected Topic Model)
RND: Random baseline
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Evocation Metric Results
k = 1 3 5 10 25 50 k = 1 3 5 10 25
Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (Evoc. of Avg. Topic)
APM APM-LowReg APM-HeldOut CTM HDP LDA RSM RND
0
200
400
600
800
1000
1200
1400
1600
k = 1 3 5 10 25 50 k = 1 3 5
Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (Ev
Evocation(m=50)
APM APM-LowReg APM-HeldOut CTM HDP LDA
0
200
400
600
800
1000
1200
1400
1600
k = 1 3 5 10 25 50 k = 1 3 5
Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (E
Evocation(m=50)
APM APM-LowReg APM-HeldOut CTM HDP LDA
0
200
400
600
800
1000
1200
1400
1600
k = 1 3 5 10 25 50 k = 1 3 5
Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (Ev
Evocation(m=50)
APM APM-LowReg APM-HeldOut CTM HDP LDA
5 10 25 50 k = 1 3 5 10 25 50
(Avg. Evoc. of Topics) Evoc-2 (Evoc. of Avg. Topic)
PM-LowReg APM-HeldOut CTM HDP LDA RSM RND
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Evocation Metric Top Word Pairs
Table: Top 20 Word Pairs for Best LDA
Human
Score
Human
Score
Human
Score
100 run.v ↔ car.n 38 woman.n ↔ man.n 100 telephone.n
82 teach.v ↔ school.n 38 give.v ↔ church.n 97 husband.n
69 school.n ↔ class.n 38 wife.n ↔ man.n 82 residential.a
63 van.n ↔ car.n 38 engine.n ↔ car.n 76 politics.n
51 hour.n ↔ day.n 35 publish.v ↔ book.n 75 steel.n
50 teach.v ↔ student.n 32 west.n ↔ state.n 75 job.n
44 house.n ↔ government.n 32 year.n ↔ day.n 75 room.n
44 week.n ↔ day.n 25 member.n ↔ give.v 72 aunt.n
38 university.n ↔ institution.n 25 dog.n ↔ animal.n 72 printer.n
38 state.n ↔ government.n 25 seat.n ↔ car.n 60 love.v
Word Pair Word Pair Wo
Table: Top 20 Word Pairs for Best APM
Human
Score
Human
Score
n.n ↔ man.n 100 telephone.n ↔ call.n 57 question.n ↔ answer.n
e.v ↔ church.n 97 husband.n ↔ wife.n 57 prison.n ↔ cell.n
e.n ↔ man.n 82 residential.a ↔ home.n 51 mother.n ↔ baby.n
e.n ↔ car.n 76 politics.n ↔ political.a 50 sun.n ↔ earth.n
h.v ↔ book.n 75 steel.n ↔ iron.n 50 west.n ↔ east.n
.n ↔ state.n 75 job.n ↔ employment.n 44 weekend.n ↔ sunday.n
r.n ↔ day.n 75 room.n ↔ bedroom.n 41 wine.n ↔ drink.v
.n ↔ give.v 72 aunt.n ↔ uncle.n 38 south.n ↔ north.n
g.n ↔ animal.n 72 printer.n ↔ print.v 38 morning.n ↔ afternoon.n
.n ↔ car.n 60 love.v ↔ love.n 38 engine.n ↔ car.n
Word Pair Word Pair Word Pair
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Current and Future Work
networksnetworks
learninglearning
basedbased
usingusing
analysisanalysis
networknetwork
wirelesswireless
datadata
modelmodel
multimulti
controlcontrol
efficientefficient
timetime
performanceperformance
routingrouting
distributeddistributed
optimaloptimal
algorithmsalgorithms
algorithmalgorithm
sensorsensor
traffictraffic
schedulingscheduling
highhigh
largelarge
multiplemultiple
mobilemobile
atmatm
packetpacket
delaydelay
allocationallocation
flowflow
protocolprotocol
accessaccess
multicastmulticast
energyenergy
channelchannel
realreal
scalescale
powerpower
locallocal
hochoc
raterate
randomrandom
serviceservice
evaluationevaluation
codingcoding
radioradio
bandwidthbandwidth
opticaloptical
speedspeed
videovideo
endend
peerpeer
resourceresource
cloudcloud
computingcomputing
congestioncongestion
hophop
distributiondistribution
contentcontent
cognitivecognitive
switchswitch
spectrumspectrum
switchingswitching
privacyprivacy
wdmwdm
layerlayer
streamingstreaming
locationlocation
queueingqueueing
engineeringengineering
inputinput
crosscross
areaarea
qualityquality
loadload
wavelengthwavelength
preservingpreserving
admissionadmission
assignmentassignment
reliablereliable
switchesswitches
macmac
faultfault
toleranttolerant
balancingbalancing
switchedswitched
varyingvarying
registerregister
widewide
centercenter
networksnetworks
learninglearning
basedbased
usingusing
analysisanalysis
networknetwork
wirelesswireless
datadata
modelmodel
multimulti
systemssystems
modelsmodels
timetime
neuralneural
objectobject
optimaloptimal
informationinformation
highhigh
bayesianbayesian
largelarge
optimizationoptimization
multiplemultiple
inferenceinference
linearlinear
nonnon
sparsesparse
clusteringclustering
estimationestimation
selectionselection
kernelkernel
supportsupport
stochasticstochastic
scalescale
gaussiangaussian
featurefeature
markovmarkov
processprocess
processesprocesses
randomrandom
codingcoding
classclass
decisiondecision
recognitionrecognition machinesmachines
predictionprediction
visualvisual
vectorvector
lowlow
supervisedsupervised
structuredstructured
policypolicy
treestrees
functionfunction
approximateapproximate
continuouscontinuous
semisemi
gradientgradient
reductionreduction
maximummaximum
latentlatent
dimensionaldimensional
matrixmatrix
convexconvex
propagationpropagation
marginmargin
graphicalgraphical
variablevariable
hiddenhidden
variationalvariational
tasktask
componentcomponent
mixturemixture
speechspeech
spectralspectral
rankrank
theoretictheoretic
neuronsneurons
fieldsfields
densitydensity
vlsivlsi
instanceinstance
analoganalog
montemonte
messagemessage
carlocarlo topictopic
labellabel
entropyentropy
neighborneighbor
nearnear
dirichletdirichlet
spikingspiking
seriesseries
beliefbelief
factorizationfactorization
dynamicaldynamical
partiallypartially
descentdescent
differencedifference
nearestnearest
dimensionalitydimensionality
passingpassing
completioncompletion principalprincipal
leastleast
boltzmannboltzmann
likelihoodlikelihood
squaressquares
observableobservable
networksnetworkslearninglearning
basedbased
usingusing
analysisanalysis
networknetworkwirelesswireless
datadata
modelmodel
multimulti
systemssystems
timetime
approachapproach
programmingprogramming
objectobjectdistributeddistributed
languagelanguage
designdesign
orientedoriented
systemsystem
informationinformation
highhigh
softwaresoftware
programsprograms
inferenceinference
verificationverification
checkingchecking
flowflow
codecode
typetype
realrealprogramprogram
languageslanguages
orderorder
machinemachine
levellevel
temporaltemporal
studystudy
domaindomain
virtualvirtual
logiclogic
generationgeneration
implementationimplementation
casecase
staticstatic
hybridhybrid
abstractabstract
structuresstructures
formalformal
higherhigher
sessionsession
specificspecific
collectioncollection
firstfirst
garbagegarbage
extendedextended
posterposter
aspectaspect
1. Visualization
2. Better inference of
parameters
3. Extension to other domains
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
Thanks for listening!
Admixture of Poisson MRFS
(APM)
Multinomial Admixture Poisson MRF
Gaussian
MRF
Mixture
LDA
David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs

More Related Content

What's hot

An Evolution of Deep Learning Models for AI2 Reasoning Challenge
An Evolution of Deep Learning Models for AI2 Reasoning ChallengeAn Evolution of Deep Learning Models for AI2 Reasoning Challenge
An Evolution of Deep Learning Models for AI2 Reasoning ChallengeTraian Rebedea
 
Opinion mining for social media and news items in Romanian
Opinion mining for social media and news items in RomanianOpinion mining for social media and news items in Romanian
Opinion mining for social media and news items in RomanianTraian Rebedea
 
Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1Traian Rebedea
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and ChallengesJens Lehmann
 
Deep learning Type Inference for Dynamic Programming Languages
Deep learning Type Inference for Dynamic Programming Languages Deep learning Type Inference for Dynamic Programming Languages
Deep learning Type Inference for Dynamic Programming Languages Amir M. Mir
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLPGVS Chaitanya
 
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...Europeana
 
From TREC to Watson: is open domain question answering a solved problem?
From TREC to Watson: is open domain question answering a solved problem?From TREC to Watson: is open domain question answering a solved problem?
From TREC to Watson: is open domain question answering a solved problem?Constantin Orasan
 
LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4Hendrik Drachsler
 
Keynote reusability measurement and social community analysis from mooc con...
Keynote   reusability measurement and social community analysis from mooc con...Keynote   reusability measurement and social community analysis from mooc con...
Keynote reusability measurement and social community analysis from mooc con...HannibalHsieh
 
Research on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsResearch on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsDenis Parra Santander
 
Learning Analytics for Communities of Inquiry
Learning Analytics for Communities of InquiryLearning Analytics for Communities of Inquiry
Learning Analytics for Communities of InquiryVitomir Kovanovic
 
A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...
A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...
A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...Vitomir Kovanovic
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Shalin Hai-Jew
 
Challenges in transfer learning in nlp
Challenges in transfer learning in nlpChallenges in transfer learning in nlp
Challenges in transfer learning in nlpLaraOlmosCamarena
 
Machine learning and Bioinformatics (IMC007)
Machine learning and Bioinformatics (IMC007)Machine learning and Bioinformatics (IMC007)
Machine learning and Bioinformatics (IMC007)butest
 

What's hot (20)

An Evolution of Deep Learning Models for AI2 Reasoning Challenge
An Evolution of Deep Learning Models for AI2 Reasoning ChallengeAn Evolution of Deep Learning Models for AI2 Reasoning Challenge
An Evolution of Deep Learning Models for AI2 Reasoning Challenge
 
Opinion mining for social media and news items in Romanian
Opinion mining for social media and news items in RomanianOpinion mining for social media and news items in Romanian
Opinion mining for social media and news items in Romanian
 
Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1
 
Question answering
Question answeringQuestion answering
Question answering
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and Challenges
 
Deep learning Type Inference for Dynamic Programming Languages
Deep learning Type Inference for Dynamic Programming Languages Deep learning Type Inference for Dynamic Programming Languages
Deep learning Type Inference for Dynamic Programming Languages
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLP
 
Nvivo
NvivoNvivo
Nvivo
 
ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3
 
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
 
From TREC to Watson: is open domain question answering a solved problem?
From TREC to Watson: is open domain question answering a solved problem?From TREC to Watson: is open domain question answering a solved problem?
From TREC to Watson: is open domain question answering a solved problem?
 
LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4
 
Keynote reusability measurement and social community analysis from mooc con...
Keynote   reusability measurement and social community analysis from mooc con...Keynote   reusability measurement and social community analysis from mooc con...
Keynote reusability measurement and social community analysis from mooc con...
 
Research on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsResearch on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and Lists
 
Fuschi current Research and Developments
Fuschi current Research and DevelopmentsFuschi current Research and Developments
Fuschi current Research and Developments
 
Learning Analytics for Communities of Inquiry
Learning Analytics for Communities of InquiryLearning Analytics for Communities of Inquiry
Learning Analytics for Communities of Inquiry
 
A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...
A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...
A Novel Model of Cognitive Presence Assessment Using Automated Learning Analy...
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
 
Challenges in transfer learning in nlp
Challenges in transfer learning in nlpChallenges in transfer learning in nlp
Challenges in transfer learning in nlp
 
Machine learning and Bioinformatics (IMC007)
Machine learning and Bioinformatics (IMC007)Machine learning and Bioinformatics (IMC007)
Machine learning and Bioinformatics (IMC007)
 

Viewers also liked

Topic modeling using big data analytics
Topic modeling using big data analytics Topic modeling using big data analytics
Topic modeling using big data analytics Farheen Nilofer
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)Svitlana volkova
 
Tutorial on Relationship Mining In Online Social Networks
Tutorial on Relationship Mining In Online Social NetworksTutorial on Relationship Mining In Online Social Networks
Tutorial on Relationship Mining In Online Social Networkspjing2
 
Using Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York TimesUsing Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York TimesVivian S. Zhang
 
Introduction à l'analyse de réseaux avec R
Introduction à l'analyse de réseaux avec RIntroduction à l'analyse de réseaux avec R
Introduction à l'analyse de réseaux avec RLaurent Beauguitte
 
Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesGautier Marti
 

Viewers also liked (6)

Topic modeling using big data analytics
Topic modeling using big data analytics Topic modeling using big data analytics
Topic modeling using big data analytics
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)
 
Tutorial on Relationship Mining In Online Social Networks
Tutorial on Relationship Mining In Online Social NetworksTutorial on Relationship Mining In Online Social Networks
Tutorial on Relationship Mining In Online Social Networks
 
Using Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York TimesUsing Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York Times
 
Introduction à l'analyse de réseaux avec R
Introduction à l'analyse de réseaux avec RIntroduction à l'analyse de réseaux avec R
Introduction à l'analyse de réseaux avec R
 
Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time Series
 

Similar to Admixture of Poisson MRFs: A New Topic Model with Word Dependencies

Make our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebMake our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebFranck Michel
 
Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1IPLODProject
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Bhaskar Mitra
 
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...kevig
 
Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...
Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...
Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...IRJET Journal
 
Social Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIASocial Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIAInsight_Altmetrics
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Julien PLU
 
Survey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse DictionarySurvey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse DictionaryEditor IJMTER
 
Names project update
Names project updateNames project update
Names project updateAmanda Hill
 
Toxic Comment Classification
Toxic Comment ClassificationToxic Comment Classification
Toxic Comment Classificationijtsrd
 
Query-Focused Extractive Text Summarization for Multi-Topic Document
Query-Focused Extractive Text Summarization for Multi-Topic DocumentQuery-Focused Extractive Text Summarization for Multi-Topic Document
Query-Focused Extractive Text Summarization for Multi-Topic Documentssuserf66333
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking Mohamed BEN ELLEFI
 
Mining knowledge graphs to map heterogeneous relations between the internet o...
Mining knowledge graphs to map heterogeneous relations between the internet o...Mining knowledge graphs to map heterogeneous relations between the internet o...
Mining knowledge graphs to map heterogeneous relations between the internet o...IJECEIAES
 
Eprints Special Session - DC-2006, Mexico
Eprints Special Session - DC-2006, MexicoEprints Special Session - DC-2006, Mexico
Eprints Special Session - DC-2006, MexicoEduserv Foundation
 
Inter-university Upper atmosphere Global Observation NETwork (IUGONET)
Inter-university Upper atmosphere Global Observation NETwork (IUGONET) Inter-university Upper atmosphere Global Observation NETwork (IUGONET)
Inter-university Upper atmosphere Global Observation NETwork (IUGONET) Iugo Net
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibEl Habib NFAOUI
 
A pragmatic view on Semantic Technologies
A pragmatic view on Semantic TechnologiesA pragmatic view on Semantic Technologies
A pragmatic view on Semantic TechnologiesRoberto García
 

Similar to Admixture of Poisson MRFs: A New Topic Model with Word Dependencies (20)

Make our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebMake our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the Web
 
Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
 
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
 
Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...
Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...
Mining Users Rare Sequential Topic Patterns from Tweets based on Topic Extrac...
 
Social Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIASocial Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIA
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
 
Survey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse DictionarySurvey On Building A Database Driven Reverse Dictionary
Survey On Building A Database Driven Reverse Dictionary
 
Eprints Application Profile
Eprints Application ProfileEprints Application Profile
Eprints Application Profile
 
Names project update
Names project updateNames project update
Names project update
 
Toxic Comment Classification
Toxic Comment ClassificationToxic Comment Classification
Toxic Comment Classification
 
Linked sensor data
Linked sensor dataLinked sensor data
Linked sensor data
 
Query-Focused Extractive Text Summarization for Multi-Topic Document
Query-Focused Extractive Text Summarization for Multi-Topic DocumentQuery-Focused Extractive Text Summarization for Multi-Topic Document
Query-Focused Extractive Text Summarization for Multi-Topic Document
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
 
Mining knowledge graphs to map heterogeneous relations between the internet o...
Mining knowledge graphs to map heterogeneous relations between the internet o...Mining knowledge graphs to map heterogeneous relations between the internet o...
Mining knowledge graphs to map heterogeneous relations between the internet o...
 
The RDFIndex-MTSR 2013
The RDFIndex-MTSR 2013The RDFIndex-MTSR 2013
The RDFIndex-MTSR 2013
 
Eprints Special Session - DC-2006, Mexico
Eprints Special Session - DC-2006, MexicoEprints Special Session - DC-2006, Mexico
Eprints Special Session - DC-2006, Mexico
 
Inter-university Upper atmosphere Global Observation NETwork (IUGONET)
Inter-university Upper atmosphere Global Observation NETwork (IUGONET) Inter-university Upper atmosphere Global Observation NETwork (IUGONET)
Inter-university Upper atmosphere Global Observation NETwork (IUGONET)
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
 
A pragmatic view on Semantic Technologies
A pragmatic view on Semantic TechnologiesA pragmatic view on Semantic Technologies
A pragmatic view on Semantic Technologies
 

Recently uploaded

High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...kumargunjan9515
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...gajnagarg
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...gajnagarg
 
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...HyderabadDolls
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxronsairoathenadugay
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...HyderabadDolls
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...kumargunjan9515
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...HyderabadDolls
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...vershagrag
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?RemarkSemacio
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numberssuginr1
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...HyderabadDolls
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareGraham Ware
 

Recently uploaded (20)

High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 

Admixture of Poisson MRFs: A New Topic Model with Word Dependencies

  • 1. Admixture of Poisson MRFs: A New Topic Model with Word Dependencies David Inouye*, Pradeep Ravikumar, Inderjit Dhillon April 30, 2015 * Presenter David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 2. Analyzing Large Collections of Documents Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Bag of Words Matrix: - Removes order and syntax information - Unrealistic but powerful 2 Model Computation 3 Summary 4 Collection of Documents 1 ) Examples: 1. Research papers 2. News articles 3. Twitter posts 1 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 3. Analyzing Large Collections of Documents Model Computation 3 Summary 4 Collection of Documents 1 ) Examples: 1. Research papers 2. News articles 3. Twitter posts 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Bag of Words Matrix: - Removes order and syntax information - Unrealistic but powerful 2 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 4. Analyzing Large Collections of Documents Summary 4 Collection of Documents 1 ) Examples: 1. Research papers 2. News articles 3. Twitter posts 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Bag of Words Matrix: - Removes order and syntax information - Unrealistic but powerful 2 Model Computation 3 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 5. Analyzing Large Collections of Documents Collection of Documents 1 ) Examples: 1. Research papers 2. News articles 3. Twitter posts 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Bag of Words Matrix: - Removes order and syntax information - Unrealistic but powerful 2 Model Computation 3 Summary 4 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 6. Research Paper Example - Top Words Top Words (Frequency) Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Collection of Documents Digital Representation Model Computation Summary Examples: 1. Research papers 2. News articles 3. Twitter posts Bag of Words Matrix: - Removes order and syntax information - Unrealistic but powerful 1 Titles of Research Papers: 1. Machine Learning (ICML, NIPS) 2. Communication Networks (INFOCOM) 3. Programming Languages (PLDI, CAV, POPL, OOPSLA) 2 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 7. Research Paper Example - Topic Modeling Topic Modeling Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Collection of Documents Digital Representation Model Computation Summary Examples: 1. Research papers 2. News articles 3. Twitter posts Bag of Words Matrix: - Removes order and syntax information - Unrealistic but powerful 1 Titles of Research Papers: 1. Machine Learning (ICML, NIPS) 2. Communication Networks (INFOCOM) 3. Programming Languages (PLDI, CAV, POPL, OOPSLA) 2 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 8. Applications for Topic Modeling Applications 1. Summarize/Visualize [Hall et al. 2008] 2. Word sense disambiguation [Boyd-Graber et al. 2007] 3. Multi-lingual understanding [Mimno et al. 2009] 4. Information retrieval [Wei & Croft 2006] Different domains 1. Genetics [Pritchard et al. 2000 (14,000 citations)] 2. Computer vision [Li et al. 2010] 3. Social networks [Airoldi et al. 2008] 4. Social science surveys [Roberts et al. 2014] 5. Social E-commerce [Hu et al. 2014] David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 9. Brief History - Latent Semantic Analysis (LSA) 2 k p n k k k U VT Σ Singular Value Decomposition 3 Low Dimensional Document Representation “Latent Topic” 4 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation n p David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 10. Brief History - Latent Semantic Analysis (LSA) 3 Low Dimensional Document Representation “Latent Topic” 4 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Singular Value Decomposition n p 2 k p n k k k U VT Σ David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 11. Brief History - Latent Semantic Analysis (LSA) “Latent Topic” 4 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Singular Value Decomposition n p 2 k p n k k k U VT Σ 3 Low Dimensional Document Representation David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 12. Brief History - Latent Semantic Analysis (LSA) 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation Singular Value Decomposition n p 2 k p n k k k U VT Σ “Latent Topic” 4 3 Low Dimensional Document Representation Positive and negative values difficult to interpret David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 13. Brief History - Probabilistic Topic Models 3 Topic Weights per Document “Topics” (Word weights) 4 1 Doc 1 Doc 2 Doc 3 Doc 4 ... “networks” “learning” “program m ing” … Digital Representation n p 2 k p n k Probabilistic Topic Models Related through probability model Probability vectors are much easier to interpret LDA - Added Bayesian priors for regularization David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 14. Comparison of 2D Projections SVD dimensions are difficult to interpret APM has smooth distribution compared to LDA SVD LDA APM comm−net.6978 mach−learn.8925 prog−lang.6618 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 15. Brief History - Extensions/Variants 1. Add time information [Blei & Lafferty 2006] 2. Add author information [Rosen-Zvi et al. 2004] 3. Add document category information [Mcauliffe & Blei 2008] 4. Automatically discover number of topics [Teh et al. 2006] 5. Model correlation between topics [Blei & Lafferty 2006] 6. . . . 7. . . . David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 16. Brief History - Extensions/Variants 1. Add time information [Blei & Lafferty 2006] 2. Add author information [Rosen-Zvi et al. 2004] 3. Add document category information [Mcauliffe & Blei 2008] 4. Automatically discover number of topics [Teh et al. 2006] 5. Model correlation between topics [Blei & Lafferty 2006] 6. . . . 7. . . . Previous models - topics only have weights for single words Our model - topics have weights for pairs of words David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 17. Interpreting Topics LDA 3 topics David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 18. Interpreting Topics LDA 3 topics LDA 6 topics David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 19. Interpreting Topics LDA 3 topics LDA 30 topics David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 20. Interpreting Topics LDA 3 topics LDA 30 topics APM 3 topics David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 21. Overview of APM Admixture of Poisson MRFS (APM) Multinomial Admixture Poisson MRF Gaussian MRF Mixture LDA David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 22. Overview of APM Admixture of Poisson MRFS (APM) Multinomial Admixture Poisson MRF Gaussian MRF Mixture LDA David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 23. Mixtures Multiple sub-populations The sub-populations are usually unknown a priori Each individual from the population comes from exactly one subpopulation Figure source: Kalai, Moitra, and Valiant. Disentangling Gaussians. Communications of the ACM. 2012. David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 24. Admixtures Mixtures - Draws from single component distribution. (Top) Admixtures - Draws from a distribution whose parameters are a convex combination of component parameters. (Bottom) x2 x1 "Documents" Mixture Components x2 x1 Dense "Topic" Sparse "Document" Dense "Document” Sparse "Topic" David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 25. Overview of APM Admixture of Poisson MRFS (APM) Multinomial Admixture Poisson MRF Gaussian MRF Mixture LDA David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 26. Gaussian MRFs Allows for dependencies between variables What if the data dimension is large? If dimension is 1000, 10002 /2 =500,000 parameters Assume some conditional independence between variables. David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 27. Independent PMRFIndependent PMRF Count of Word 1 CountofWord2 0 2 4 6 8 8 6 4 2 0 1. Each conditional (”slice”) of a PMRF is 1-D Poisson. 2. Distinct from Gaussian MRF 3. Positive dependencies can model word co-occurence. Positive Dependency PMRFPMRF Positive Dependency Count of Word 1 CountofWord2 0 2 4 6 8 8 6 4 2 0 Negative Dependency PMRFPMRF Negative Dependency Count of Word 1 CountofWord2 0 2 4 6 8 8 6 4 2 0 David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 28. Poisson MRFs [Yang et al., 2012] P(A | B, C) P(B | A, C) P(C | A, B) P(A, B, C) ?? If we assume the node conditional distributions are Poisson, does there exist a joint MRF distribution that has these conditionals? Poisson MRF joint distribution: Pr PMRF (x | θ, Θ) ∝ exp θT x + xT Θx − p s=1 ln(xs!) . Node conditionals are 1-D Poissons: Pr(xs | x−s, θs, Θs) ∝ exp{ (θs + xT Θs ηs ) xs − ln(xs!) }. David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 29. Overview of APM Admixture of Poisson MRFS (APM) Multinomial Admixture Poisson MRF Gaussian MRF Mixture LDA David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 30. Admixture of Poisson MRFs (APM) [Inouye et al. 2014] APM replaces standard Multinomial with Poisson MRF Pr APM (x, w, θ1...k , Θ1...k ) = Pr PMRF x ¯θ = k j=1 wj θj , ¯Θ = k j=1 wj Θj Pr Dir (w) k j=1 Pr(θj , Θj ) David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 31. APM Algorithm 1. Optimization problem is not convex 2. Want to exploit parallel computing 3. Large optimization problem: APM has O(kp2) parameters versus O(kp) for LDA LDA(k = 5, p = 1000) ⇒ 5,000 parameters APM(k = 5, p = 1000) ⇒ 5,000,000 parameters APM(k = 5, NNZ(Θ) = 10 per word) ⇒ 50,000 free parameters David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 32. Parallel Alternating Newton-like Algorithm 1. Split the algorithm into alternating convex problems arg min Φ1,Φ2,··· ,Φp − 1 n p s=1 tr(Ψs Φs ) − n i=1 exp(zT i Φs wi ) + p s=1 λ vec(Φs )1 1 arg min w1,w2,··· ,wn∈∆k − 1 n n i=1 ψT i wi − p s=1 exp(zT i Φs wi ) where zi = [1 xT i ]T Ψs = f (X, W) φj s = [θj s (Θj s )T ]T ψi = f (X, Φ1...k ) Φs = [φ1 s φ2 s · · · φk s ] 2. Subproblems in summation can be computed in parallel 3. Use fast Newton-like optimization method [Hsieh et al. 2014] David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 33. Timing Results on Wikipedia Dataset (k = 5, λ = 0.5) 1 3.1 3.4 0.6 2.2 2.2 0 1 2 3 4 n = 20,000 p = 5,000 # of Words = 50M n = 100,000 p = 5,000 # of Words = 133M n = 20,000 p = 10,000 # of Words = 57M Time(hrs) APM Training Time on Wikipedia Dataset 1st Iter. Avg. Next 3 Iter. Algorithm scales approximately as O(np2) David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 34. Parallel Speedup 0 5 10 15 20 0 5 10 15 20 Speedup # of MATLAB Workers Parallel Speedup on BNC Dataset Perfect Speedup Actual Speedup BNC dataset has n = 4049 and p = 1646 Speedup could be O(min(n, p)) on distributed system David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 35. Evaluating APM: No Direct Evaluation of Edge Parameters Previous metrics evaluate the similarity of word pairs [Newman et al. 2010, Mimno et al. 2011, Aletras and Court 2013] Averaged statistic for all 10 2 pairs of top words computed Attempted to correlate with human judgment Unlike previous topic models, APM explicitly models dependencies between words How can we semantically evaluate the parameters for these dependencies? David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 36. Evocation [Boyd-Graber et al. 2006] Evocation denotes the idea of which words “evoke” or “bring to mind” other words Different types of evocation: 1. Rose - Flower (example) 2. Brave - Noble (kind) 3. Yell - Talk (manner) 4. Eggs - Bacon (co-occurence) 5. Snore - Sleep (setting) 6. Wet - Desert (antonymy) 7. Work - Lazy (exclusivity) 8. Banana - Kiwi (likeness) Distinctive from word similarity or synonymy Collected human scores for approximately 15% of word pairs David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 37. Evocation Metric Illustration Word Pair H M w1 ↔ w2 w1 ↔ w3 w1 ↔ w4 w2 ↔ w3 w2 ↔ w4 w3 ↔ w4 w2 ↔ w3 w2 ↔ w4 w3 ↔ w4 w1 ↔ w3 w1 ↔ w2 w1 ↔ w4 Word Pair H M w2 ↔ w4 w3 ↔ w4 w1 ↔ w3 w1 ↔ w4 Word Pair H M Rank by model weights M Sum top-m human scores H David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 38. Models for Comparison APM: Admixture of Poisson MRFs APM-LowReg: Very small regularization parameter APM-HeldOut: Chooses λ from held-out documents CTM: Correlated Topic Models HDP: Hierarchical Dirichlet Process (Non-parametric) LDA: Latent Dirichlet Allocation RSM: Replicated Softmax (Undirected Topic Model) RND: Random baseline David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 39. Evocation Metric Results k = 1 3 5 10 25 50 k = 1 3 5 10 25 Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (Evoc. of Avg. Topic) APM APM-LowReg APM-HeldOut CTM HDP LDA RSM RND 0 200 400 600 800 1000 1200 1400 1600 k = 1 3 5 10 25 50 k = 1 3 5 Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (Ev Evocation(m=50) APM APM-LowReg APM-HeldOut CTM HDP LDA 0 200 400 600 800 1000 1200 1400 1600 k = 1 3 5 10 25 50 k = 1 3 5 Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (E Evocation(m=50) APM APM-LowReg APM-HeldOut CTM HDP LDA 0 200 400 600 800 1000 1200 1400 1600 k = 1 3 5 10 25 50 k = 1 3 5 Evoc-1 (Avg. Evoc. of Topics) Evoc-2 (Ev Evocation(m=50) APM APM-LowReg APM-HeldOut CTM HDP LDA 5 10 25 50 k = 1 3 5 10 25 50 (Avg. Evoc. of Topics) Evoc-2 (Evoc. of Avg. Topic) PM-LowReg APM-HeldOut CTM HDP LDA RSM RND David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 40. Evocation Metric Top Word Pairs Table: Top 20 Word Pairs for Best LDA Human Score Human Score Human Score 100 run.v ↔ car.n 38 woman.n ↔ man.n 100 telephone.n 82 teach.v ↔ school.n 38 give.v ↔ church.n 97 husband.n 69 school.n ↔ class.n 38 wife.n ↔ man.n 82 residential.a 63 van.n ↔ car.n 38 engine.n ↔ car.n 76 politics.n 51 hour.n ↔ day.n 35 publish.v ↔ book.n 75 steel.n 50 teach.v ↔ student.n 32 west.n ↔ state.n 75 job.n 44 house.n ↔ government.n 32 year.n ↔ day.n 75 room.n 44 week.n ↔ day.n 25 member.n ↔ give.v 72 aunt.n 38 university.n ↔ institution.n 25 dog.n ↔ animal.n 72 printer.n 38 state.n ↔ government.n 25 seat.n ↔ car.n 60 love.v Word Pair Word Pair Wo Table: Top 20 Word Pairs for Best APM Human Score Human Score n.n ↔ man.n 100 telephone.n ↔ call.n 57 question.n ↔ answer.n e.v ↔ church.n 97 husband.n ↔ wife.n 57 prison.n ↔ cell.n e.n ↔ man.n 82 residential.a ↔ home.n 51 mother.n ↔ baby.n e.n ↔ car.n 76 politics.n ↔ political.a 50 sun.n ↔ earth.n h.v ↔ book.n 75 steel.n ↔ iron.n 50 west.n ↔ east.n .n ↔ state.n 75 job.n ↔ employment.n 44 weekend.n ↔ sunday.n r.n ↔ day.n 75 room.n ↔ bedroom.n 41 wine.n ↔ drink.v .n ↔ give.v 72 aunt.n ↔ uncle.n 38 south.n ↔ north.n g.n ↔ animal.n 72 printer.n ↔ print.v 38 morning.n ↔ afternoon.n .n ↔ car.n 60 love.v ↔ love.n 38 engine.n ↔ car.n Word Pair Word Pair Word Pair David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 41. Current and Future Work networksnetworks learninglearning basedbased usingusing analysisanalysis networknetwork wirelesswireless datadata modelmodel multimulti controlcontrol efficientefficient timetime performanceperformance routingrouting distributeddistributed optimaloptimal algorithmsalgorithms algorithmalgorithm sensorsensor traffictraffic schedulingscheduling highhigh largelarge multiplemultiple mobilemobile atmatm packetpacket delaydelay allocationallocation flowflow protocolprotocol accessaccess multicastmulticast energyenergy channelchannel realreal scalescale powerpower locallocal hochoc raterate randomrandom serviceservice evaluationevaluation codingcoding radioradio bandwidthbandwidth opticaloptical speedspeed videovideo endend peerpeer resourceresource cloudcloud computingcomputing congestioncongestion hophop distributiondistribution contentcontent cognitivecognitive switchswitch spectrumspectrum switchingswitching privacyprivacy wdmwdm layerlayer streamingstreaming locationlocation queueingqueueing engineeringengineering inputinput crosscross areaarea qualityquality loadload wavelengthwavelength preservingpreserving admissionadmission assignmentassignment reliablereliable switchesswitches macmac faultfault toleranttolerant balancingbalancing switchedswitched varyingvarying registerregister widewide centercenter networksnetworks learninglearning basedbased usingusing analysisanalysis networknetwork wirelesswireless datadata modelmodel multimulti systemssystems modelsmodels timetime neuralneural objectobject optimaloptimal informationinformation highhigh bayesianbayesian largelarge optimizationoptimization multiplemultiple inferenceinference linearlinear nonnon sparsesparse clusteringclustering estimationestimation selectionselection kernelkernel supportsupport stochasticstochastic scalescale gaussiangaussian featurefeature markovmarkov processprocess processesprocesses randomrandom codingcoding classclass decisiondecision recognitionrecognition machinesmachines predictionprediction visualvisual vectorvector lowlow supervisedsupervised structuredstructured policypolicy treestrees functionfunction approximateapproximate continuouscontinuous semisemi gradientgradient reductionreduction maximummaximum latentlatent dimensionaldimensional matrixmatrix convexconvex propagationpropagation marginmargin graphicalgraphical variablevariable hiddenhidden variationalvariational tasktask componentcomponent mixturemixture speechspeech spectralspectral rankrank theoretictheoretic neuronsneurons fieldsfields densitydensity vlsivlsi instanceinstance analoganalog montemonte messagemessage carlocarlo topictopic labellabel entropyentropy neighborneighbor nearnear dirichletdirichlet spikingspiking seriesseries beliefbelief factorizationfactorization dynamicaldynamical partiallypartially descentdescent differencedifference nearestnearest dimensionalitydimensionality passingpassing completioncompletion principalprincipal leastleast boltzmannboltzmann likelihoodlikelihood squaressquares observableobservable networksnetworkslearninglearning basedbased usingusing analysisanalysis networknetworkwirelesswireless datadata modelmodel multimulti systemssystems timetime approachapproach programmingprogramming objectobjectdistributeddistributed languagelanguage designdesign orientedoriented systemsystem informationinformation highhigh softwaresoftware programsprograms inferenceinference verificationverification checkingchecking flowflow codecode typetype realrealprogramprogram languageslanguages orderorder machinemachine levellevel temporaltemporal studystudy domaindomain virtualvirtual logiclogic generationgeneration implementationimplementation casecase staticstatic hybridhybrid abstractabstract structuresstructures formalformal higherhigher sessionsession specificspecific collectioncollection firstfirst garbagegarbage extendedextended posterposter aspectaspect 1. Visualization 2. Better inference of parameters 3. Extension to other domains David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs
  • 42. Thanks for listening! Admixture of Poisson MRFS (APM) Multinomial Admixture Poisson MRF Gaussian MRF Mixture LDA David Inouye*, Pradeep Ravikumar, Inderjit Dhillon Admixture of Poisson MRFs