Dynamic Topic Modeling via Non-negative Matrix Factorization
Derek Greene
University College Dublin
Overview
• Topic Modeling
• Non-negative Matrix Factorization
• Dynamic Topic Modeling
• Proposed Approach
• Dynamic Topic Modeling via Non-negative Matrix Factorization
• Application
• Topic Modeling European Parliamentary Speeches
September 2016 2
Topic Modeling
• Goal: Discover hidden thematic structure in a corpus of text (e.g. tweets, Facebook posts, news articles, political speeches).

• Unsupervised approach, no prior annotation required.
[Figure: Input → Data Preparation → Topic Modeling Algorithm → Output (Topic 1, Topic 2, …, Topic k).]
• Output of topic modeling is a set of k topics. Each topic has:

1. A descriptor, based on highest-ranked terms for the topic.

2. Membership weights for all documents relative to the topic.
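A minimal sketch of how both parts of a topic could be read off a fitted model, assuming the model provides one weight per term and one weight per document for that topic. The helper name `top_ranked` and all weights below are invented for illustration.

```python
def top_ranked(weights, labels, top=3):
    """Return the `top` labels with the largest weights, highest first."""
    ranked = sorted(zip(labels, weights), key=lambda pair: pair[1], reverse=True)
    return [label for label, _ in ranked[:top]]

# Hypothetical weights for a single topic (illustration only).
terms = ["research", "disease", "patient", "health", "budget"]
term_weights = [0.0, 0.7, 0.9, 0.8, 0.1]
documents = ["document 1", "document 2", "document 3"]
document_weights = [0.05, 0.60, 0.45]

descriptor = top_ranked(term_weights, terms)              # 1. topic descriptor
members = top_ranked(document_weights, documents, top=2)  # 2. strongest documents
print(descriptor)
print(members)
```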
Topic Modeling with NMF
• Non-negative Matrix Factorization (NMF): Family of linear algebra
algorithms for identifying the latent structure in data represented
as a non-negative matrix (Lee & Seung, 1999).

• NMF can be applied for topic modeling, where the input is a
document-term matrix, typically TF-IDF normalized.
• Input: Document-term matrix A; User-specified number of topics k.

• Output: Two k-dimensional factors W and H approximating A.
[Figure: NMF approximates the input matrix A (n documents x m terms) by the product of factor W (n documents x k topics) and factor H (k topics x m terms).]
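The factorization can be written compactly using the dimensions from the figure (n documents, m terms, k topics). The Frobenius-norm objective below is an assumption: it is one common way to measure the approximation error, and the slides do not state which objective is used.

```latex
A \in \mathbb{R}_{\ge 0}^{n \times m}, \quad
W \in \mathbb{R}_{\ge 0}^{n \times k}, \quad
H \in \mathbb{R}_{\ge 0}^{k \times m}, \qquad
A \approx W H

\min_{W \ge 0,\; H \ge 0} \; \lVert A - W H \rVert_F^2
```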
Example: NMF Topic Modeling
• Apply standard NMF to document-term matrix A (6 rows x 10
columns) for k=3 topics…
[Figure: document-term matrix A. Rows: document 1 to document 6. Columns: research, stem, education, disease, patient, health, budget, finance, banking, bonds.]
Example: NMF Topic Modeling
[Figure: Factor H, weights for terms (research, stem, education, disease, patient, health, budget, finance, banking, bonds) on Topic 1, Topic 2, Topic 3. Factor W, weights for documents (document 1 to document 6) on Topic 1, Topic 2, Topic 3.]
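A rough sketch of this example in Python with NumPy. The matrix values are invented (the values behind the slide's figure are not recoverable), and the solver is a plain multiplicative-update NMF for the Frobenius objective, used here as a stand-in because the slides do not say which NMF algorithm was applied.

```python
import numpy as np

terms = ["research", "stem", "education", "disease", "patient",
         "health", "budget", "finance", "banking", "bonds"]

# Invented 6 x 10 document-term matrix (illustration only).
A = np.array([
    [1, 1, 1, 0, 0, 0, 0, 0, 0, 0],
    [1, 0, 1, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 1, 0, 0, 0, 0],
    [0, 1, 0, 1, 1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 1, 1, 1, 0, 1],
], dtype=float)

def nmf(A, k, iterations=200, seed=0, eps=1e-9):
    """Multiplicative-update NMF (a stand-in solver): A ~ W @ H."""
    rng = np.random.default_rng(seed)
    W = rng.random((A.shape[0], k))   # documents x topics
    H = rng.random((k, A.shape[1]))   # topics x terms
    for _ in range(iterations):
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H

W, H = nmf(A, k=3)

# Descriptor of each topic: the three highest-weighted terms in its row of H.
for t, row in enumerate(H, start=1):
    top = [terms[j] for j in np.argsort(row)[::-1][:3]]
    print(f"Topic {t}: {', '.join(top)}")
```

Each row of W then gives the membership weights of one document across the three topics.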
(D. Blei, 2012)
Dynamic Topic Models
• Standard topic modeling approaches assume the order of
documents does not matter. Not suitable for time-stamped data.

• Dynamic topic modeling: Approaches to track how language
changes and topics evolve over time in a time-stamped corpus.
[Figure: Inaugural address]
Dynamic Topic Modeling via Non-negative Matrix Factorization
Proposed Approach
• Two-Level approach: Link together related topics found in
different time windows to track topics over time.
Rank  Topic in Window 1  Topic in Window 2  Topic in Window 3
1     eurozone           greece             greece
2     greece             debt               russia
3     imf                germany            debt
4     loan               reparations        eu
5     debt               eu                 loan
Divide corpus into 𝜏 time windows of equal duration (e.g. days,
weeks, months, quarters, or years).

Level 1: Apply NMF topic modeling to documents in each
window to produce window topics.

Level 2: Apply another layer of NMF to all topics from Level 1 to
find dynamic topics which span multiple time windows.
Proposed Approach
• Key Idea for Level 2:
• View the topic basis vectors (rows of factor H) found in
each time window as “topic documents”.

• Construct a new combined representation from these H
factors. Similar to the idea of “stacking” in supervised ensembles.

• Apply NMF to this new representation.
[Figure: the 𝜏 time window datasets yield 𝜏 NMF H factors (Factor H from Window 1, Window 2, Window 3, …, Window 𝜏), which are combined into a topic-term matrix of n’ topic documents x m’ terms.]
Example: Dynamic Topic Modeling
Topic-term matrix for 2 time window results, each with 3 topics.
[Figure: Topic-term matrix heatmap. Rows: Window1-01, Window1-02, Window1-03 (topics for time window 1) and Window2-01, Window2-02, Window2-03 (topics for time window 2). Columns: health, patient, disease, citizen, research, education, budget, finance, banking.]
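The Level 2 construction could be sketched as follows: per-window H factors are stacked over the union of their vocabularies, and NMF is applied to the stacked matrix. The H values are invented, `stack_window_topics` is a hypothetical helper, the multiplicative-update solver is a stand-in, and assigning each window topic to its highest-weighted dynamic topic is an assumption made for this sketch.

```python
import numpy as np

def nmf(A, k, iterations=200, seed=0, eps=1e-9):
    """Multiplicative-update NMF (a stand-in solver): A ~ W @ H."""
    rng = np.random.default_rng(seed)
    W = rng.random((A.shape[0], k))
    H = rng.random((k, A.shape[1]))
    for _ in range(iterations):
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H

def stack_window_topics(window_factors):
    """Stack per-window H factors into one topic-term matrix.

    window_factors: list of (terms, H) pairs, one per time window, where
    H has one row per window topic and one column per entry in `terms`.
    """
    all_terms = sorted({t for terms, _ in window_factors for t in terms})
    column = {t: j for j, t in enumerate(all_terms)}
    rows, labels = [], []
    for w, (terms, H) in enumerate(window_factors, start=1):
        idx = [column[t] for t in terms]
        for i, topic in enumerate(H, start=1):
            row = np.zeros(len(all_terms))
            row[idx] = topic          # terms unseen in this window stay at 0
            rows.append(row)
            labels.append(f"Window{w}-{i:02d}")
    return all_terms, np.vstack(rows), labels

# Hypothetical Level 1 output: two windows with 3 topics each.
window_factors = [
    (["health", "patient", "disease", "research", "education", "budget", "finance"],
     np.array([[0.9, 0.8, 0.7, 0.0, 0.0, 0.0, 0.0],
               [0.0, 0.0, 0.1, 0.9, 0.8, 0.0, 0.0],
               [0.0, 0.0, 0.0, 0.0, 0.1, 0.9, 0.8]])),
    (["health", "patient", "citizen", "research", "education", "finance", "banking"],
     np.array([[0.8, 0.9, 0.3, 0.0, 0.0, 0.0, 0.0],
               [0.0, 0.0, 0.2, 0.8, 0.9, 0.0, 0.0],
               [0.0, 0.0, 0.1, 0.0, 0.0, 0.9, 0.8]])),
]

all_terms, B, labels = stack_window_topics(window_factors)
U, V = nmf(B, k=3)   # Level 2: rows of V are dynamic topics over all terms
for label, d in zip(labels, U.argmax(axis=1)):
    print(f"{label} -> dynamic topic {d + 1}")
```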
Application: European Parliament
Collaboration with Dr. James Cross, UCD School of Politics & International Relations
Exploring the European Parliament Agenda
• Directly elected parliamentary
institution of the EU.

• 8th term began in July 2014.

• 751 Members of European
Parliament (MEPs) from 28
member states.
• 12 plenary sessions per year are held in Strasbourg.

• During sessions, members may speak after being called by the
President. Speaking time available to MEPs is strictly limited.

• MEPs use speeches to state their positions on policies, to
explain votes, and to demonstrate to their electorates that they
are representing their interests in Europe.
Data Collection
• In Autumn 2014 we collected
~400k records from EuroParl.

• Covers activities of MEPs in the
European Parliament during
terms 5-7 (1999-2014).

• Focus on records of speeches
in plenary. Accounts for 54.3%
of all Europarl records.
http://europarl.europa.eu
Data Collection
• Original corpus contains 269,696 plenary speeches.

• Identified subset of 210,247 English language speeches, either
native or translated.
• Divided these into 60 “time window” datasets. Each time
window is a quarter from 1999-Q3 to 2014-Q2.
[Figure: number of speeches per time window (quarter number).]
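A small sketch of the quarterly windowing, using only the Python standard library. The function names and the sample speech records are hypothetical; only the window range (1999-Q3 to 2014-Q2) comes from the slide.

```python
from collections import defaultdict
from datetime import date

def quarter_label(d):
    """Map a date to a quarter label such as '1999-Q3'."""
    return f"{d.year}-Q{(d.month - 1) // 3 + 1}"

def quarter_range(first, last):
    """List quarter labels from (year, quarter) `first` to `last`, inclusive."""
    year, quarter = first
    labels = []
    while (year, quarter) <= last:
        labels.append(f"{year}-Q{quarter}")
        year, quarter = (year, quarter + 1) if quarter < 4 else (year + 1, 1)
    return labels

windows = quarter_range((1999, 3), (2014, 2))
print(len(windows))  # 60

# Hypothetical speech records: (date, text).
speeches = [
    (date(2003, 2, 12), "first speech text"),
    (date(2003, 3, 20), "second speech text"),
    (date(2009, 12, 15), "third speech text"),
]

# Group speech texts by the time window they fall into.
by_window = defaultdict(list)
for day, text in speeches:
    by_window[quarter_label(day)].append(text)
```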
Time Window Topic Modeling
• Applied NMF to document-term matrix for the speeches in
each of the 60 time windows. 

• Use an automated topic coherence approach to choose the number
of topics k for each window (O’Callaghan et al., 2015).

➡ Output: 60 sets of time window topics.
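The selection of k can be pictured with the following sketch, which scores each candidate model by the mean pairwise similarity between the top terms of its topic descriptors and keeps the best-scoring k. This is only an illustration of the general idea: the scoring scheme, the function names, and the toy `similarity` function are assumptions, not the measure from the cited paper.

```python
from itertools import combinations

def descriptor_coherence(descriptor, similarity):
    """Mean pairwise similarity of a descriptor's terms (needs >= 2 terms)."""
    pairs = list(combinations(descriptor, 2))
    return sum(similarity(a, b) for a, b in pairs) / len(pairs)

def model_coherence(descriptors, similarity):
    """Average descriptor coherence over all topics in one model."""
    scores = [descriptor_coherence(d, similarity) for d in descriptors]
    return sum(scores) / len(scores)

def choose_k(models, similarity):
    """models maps each candidate k to its list of topic descriptors."""
    return max(models, key=lambda k: model_coherence(models[k], similarity))

# Toy similarity: 1.0 if two terms share a hand-made group, else 0.0.
group = {"health": "medical", "patient": "medical", "disease": "medical",
         "budget": "finance", "finance": "finance", "banking": "finance"}

def similarity(a, b):
    return 1.0 if group[a] == group[b] else 0.0

# Hypothetical descriptors for two candidate values of k.
models = {
    1: [["health", "budget", "patient"]],
    2: [["health", "patient", "disease"], ["budget", "finance", "banking"]],
}
best_k = choose_k(models, similarity)
print(best_k)  # 2
```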
Time Window Topic Modeling
Example Topic: 2003-Q1
Top 10 terms suggest that this
topic relates to the Iraq war.
Top 10 speeches for this topic
provide the context.
Dynamic Topic Modeling Results
• Applying dynamic topic modeling to the resulting topic-term
matrix with parameter selection yields 57 dynamic topics,
which show the varied nature of the European Parliament’s agenda…
Example: Climate Change
[Figure: number of speeches per year (2000-2014) for the Climate Change dynamic topic. Annotations: Montreal, Climate Change Package, Copenhagen, Cancun.]
Example: Financial & Euro Crisis
[Figure: number of speeches per year (2000-2014) for two dynamic topics, Financial crisis and Euro crisis, with marked points A, B, C, D.]
Dynamic Topics by Politician
We associate each MEP with dynamic topics based on the number of
that MEP’s speeches assigned to the window topics of each dynamic topic.
[Figure: Pat Cox (Ireland), top 10 most relevant dynamic topics.]
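The counting behind this association might look like the sketch below. It assumes each speech has already been assigned to one window topic and each window topic to one dynamic topic; the function name, identifiers, and records are all hypothetical.

```python
from collections import Counter, defaultdict

def dynamic_topics_by_mep(speeches, dynamic_of):
    """Count, per MEP, the speeches that fall under each dynamic topic.

    speeches:   iterable of (mep, window_topic) pairs, one per speech
    dynamic_of: dict mapping a window topic to its dynamic topic
    """
    counts = defaultdict(Counter)
    for mep, window_topic in speeches:
        dynamic_topic = dynamic_of.get(window_topic)
        if dynamic_topic is not None:   # skip window topics with no dynamic topic
            counts[mep][dynamic_topic] += 1
    return counts

# Hypothetical assignments (illustration only).
dynamic_of = {"Window1-01": "D1", "Window2-02": "D1", "Window1-03": "D2"}
speeches = [
    ("MEP A", "Window1-01"),
    ("MEP A", "Window2-02"),
    ("MEP A", "Window1-03"),
    ("MEP B", "Window1-03"),
]

counts = dynamic_topics_by_mep(speeches, dynamic_of)
print(counts["MEP A"].most_common(10))   # up to 10 most relevant dynamic topics
```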
Dynamic Topics by Country
[Figure: dynamic topics for Ireland and Cyprus.]
More Information
European Parliament Speeches - Topic Explorer

http://erdos.ucd.ie/europarl
Python Code and Documentation

https://github.com/derekgreene/dynamic-nmf
D. Greene, J. P. Cross, “Unveiling the Political Agenda of the
European Parliament Plenary: A Topical Analysis,” in Proc. ACM Web
Science’15, 2015.
derek.greene@ucd.ie @derekgreene
D. Greene, J. P. Cross. “Exploring the political agenda of the
European parliament using a dynamic topic modeling approach”,
Political Analysis, 2017 (in press).
References
• D. Blei, A. Y. Ng, M. Jordan. “Latent Dirichlet allocation”. Journal of
Machine Learning Research, 3:993–1022, 2003.

• D. Blei. “Probabilistic topic models”. Communications of the ACM, 2012.

• D. D. Lee & H. S. Seung. “Learning the parts of objects by non-negative
matrix factorization”. Nature, 401:788–91, 1999.

• D. O’Callaghan, D. Greene, J. Carthy & P. Cunningham. “An analysis of the
coherence of descriptors in topic modeling”. Expert Systems with
Applications (ESWA), 2015.

• W. X. Zhao et al. “Comparing Twitter and traditional media using
topic models”. Advances in Information Retrieval, 2011.

• J. Grimmer. “A Bayesian Hierarchical Topic Model for Political Texts:
Measuring Expressed Agendas in Senate Press Releases”. Political
Analysis, 18(1):1–35, 2010.
September 2016 24

More Related Content

What's hot

Personalized Playlists at Spotify
Personalized Playlists at SpotifyPersonalized Playlists at Spotify
Personalized Playlists at SpotifyRohan Agrawal
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsRoelof Pieters
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLPRupak Roy
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsYONG ZHENG
 
Count-Distinct Problem
Count-Distinct ProblemCount-Distinct Problem
Count-Distinct ProblemKai Zhang
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsJustin Basilico
 
Hamilton 1994 time series analysis
Hamilton 1994 time series analysisHamilton 1994 time series analysis
Hamilton 1994 time series analysisOzan Baskan
 
Simple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in MahoutSimple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in MahoutData Science London
 
Machine learning @ Spotify - Madison Big Data Meetup
Machine learning @ Spotify - Madison Big Data MeetupMachine learning @ Spotify - Madison Big Data Meetup
Machine learning @ Spotify - Madison Big Data MeetupAndy Sloane
 
From Idea to Execution: Spotify's Discover Weekly
From Idea to Execution: Spotify's Discover WeeklyFrom Idea to Execution: Spotify's Discover Weekly
From Idea to Execution: Spotify's Discover WeeklyChris Johnson
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Alexandros Karatzoglou
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFMLconf
 
Scala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsScala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsChris Johnson
 
Social network analysis part ii
Social network analysis part iiSocial network analysis part ii
Social network analysis part iiTHomas Plotkowiak
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain
 
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...Hakka Labs
 

What's hot (20)

Personalized Playlists at Spotify
Personalized Playlists at SpotifyPersonalized Playlists at Spotify
Personalized Playlists at Spotify
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 
Session-Based Recommender Systems
Session-Based Recommender SystemsSession-Based Recommender Systems
Session-Based Recommender Systems
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLP
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender Systems
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Count-Distinct Problem
Count-Distinct ProblemCount-Distinct Problem
Count-Distinct Problem
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing Recommendations
 
Hamilton 1994 time series analysis
Hamilton 1994 time series analysisHamilton 1994 time series analysis
Hamilton 1994 time series analysis
 
Simple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in MahoutSimple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in Mahout
 
Machine learning @ Spotify - Madison Big Data Meetup
Machine learning @ Spotify - Madison Big Data MeetupMachine learning @ Spotify - Madison Big Data Meetup
Machine learning @ Spotify - Madison Big Data Meetup
 
From Idea to Execution: Spotify's Discover Weekly
From Idea to Execution: Spotify's Discover WeeklyFrom Idea to Execution: Spotify's Discover Weekly
From Idea to Execution: Spotify's Discover Weekly
 
Topic Modeling
Topic ModelingTopic Modeling
Topic Modeling
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SF
 
Scala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music RecommendationsScala Data Pipelines for Music Recommendations
Scala Data Pipelines for Music Recommendations
 
Social network analysis part ii
Social network analysis part iiSocial network analysis part ii
Social network analysis part ii
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
 

Viewers also liked

Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...
Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...
Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...Ly Nguyen
 
Topic detection & tracking
Topic detection & trackingTopic detection & tracking
Topic detection & trackingGeorge Ang
 
An insight for Mobile Cloud Computing (MCC)
An insight for Mobile Cloud Computing (MCC)An insight for Mobile Cloud Computing (MCC)
An insight for Mobile Cloud Computing (MCC)Yuvaraj Ilangovan
 
Mobile Cloud Computing 2012
Mobile Cloud Computing 2012 Mobile Cloud Computing 2012
Mobile Cloud Computing 2012 Bhavya Siddappa
 
Topic modeling using big data analytics
Topic modeling using big data analyticsTopic modeling using big data analytics
Topic modeling using big data analyticsFarheen Nilofer
 
Evaluating the impacts of (Mobile) Cloud Computing on OSS/BSS Landscape
Evaluating the impacts of (Mobile) Cloud Computing on OSS/BSS LandscapeEvaluating the impacts of (Mobile) Cloud Computing on OSS/BSS Landscape
Evaluating the impacts of (Mobile) Cloud Computing on OSS/BSS LandscapeMuhammad Imran Awan
 
Details About Mobile Cloud Computing
Details About Mobile Cloud ComputingDetails About Mobile Cloud Computing
Details About Mobile Cloud Computingvaishnavi_sv
 
Principal component analysis and matrix factorizations for learning (part 1) ...
Principal component analysis and matrix factorizations for learning (part 1) ...Principal component analysis and matrix factorizations for learning (part 1) ...
Principal component analysis and matrix factorizations for learning (part 1) ...zukun
 
Mobile cloud computing
Mobile cloud computingMobile cloud computing
Mobile cloud computingFatih Özlü
 
Mobile cloud Computing
Mobile cloud ComputingMobile cloud Computing
Mobile cloud ComputingPooja Sharma
 
SpringPeople Introduction to Apache Hadoop
SpringPeople Introduction to Apache HadoopSpringPeople Introduction to Apache Hadoop
SpringPeople Introduction to Apache HadoopSpringPeople
 
Mobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big PictureMobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big PictureReza Rahimi
 
Mobile cloud computing; Future of Cloud Computing
Mobile cloud computing; Future of Cloud ComputingMobile cloud computing; Future of Cloud Computing
Mobile cloud computing; Future of Cloud ComputingVineet Garg
 
Data Architectures for Robust Decision Making
Data Architectures for Robust Decision MakingData Architectures for Robust Decision Making
Data Architectures for Robust Decision MakingGwen (Chen) Shapira
 
Mobile Cloud Computing
Mobile Cloud ComputingMobile Cloud Computing
Mobile Cloud ComputingSimeon Oriko
 
Mobile Cloud Computing
Mobile Cloud ComputingMobile Cloud Computing
Mobile Cloud ComputingVikas Kottari
 

Viewers also liked (20)

Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...
Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...
Hot Topic Detection and Technology Trend Tracking for Patents utilizing Term ...
 
Topic detection & tracking
Topic detection & trackingTopic detection & tracking
Topic detection & tracking
 
An insight for Mobile Cloud Computing (MCC)
An insight for Mobile Cloud Computing (MCC)An insight for Mobile Cloud Computing (MCC)
An insight for Mobile Cloud Computing (MCC)
 
Mobile Cloud Computing 2012
Mobile Cloud Computing 2012 Mobile Cloud Computing 2012
Mobile Cloud Computing 2012
 
Is hadoop for you
Is hadoop for youIs hadoop for you
Is hadoop for you
 
Topic modeling using big data analytics
Topic modeling using big data analyticsTopic modeling using big data analytics
Topic modeling using big data analytics
 
Twitter with hadoop for oow
Twitter with hadoop for oowTwitter with hadoop for oow
Twitter with hadoop for oow
 
Evaluating the impacts of (Mobile) Cloud Computing on OSS/BSS Landscape
Evaluating the impacts of (Mobile) Cloud Computing on OSS/BSS LandscapeEvaluating the impacts of (Mobile) Cloud Computing on OSS/BSS Landscape
Evaluating the impacts of (Mobile) Cloud Computing on OSS/BSS Landscape
 
Details About Mobile Cloud Computing
Details About Mobile Cloud ComputingDetails About Mobile Cloud Computing
Details About Mobile Cloud Computing
 
Principal component analysis and matrix factorizations for learning (part 1) ...
Principal component analysis and matrix factorizations for learning (part 1) ...Principal component analysis and matrix factorizations for learning (part 1) ...
Principal component analysis and matrix factorizations for learning (part 1) ...
 
Mobile cloud computing
Mobile cloud computingMobile cloud computing
Mobile cloud computing
 
Mobile cloud Computing
Mobile cloud ComputingMobile cloud Computing
Mobile cloud Computing
 
SpringPeople Introduction to Apache Hadoop
SpringPeople Introduction to Apache HadoopSpringPeople Introduction to Apache Hadoop
SpringPeople Introduction to Apache Hadoop
 
Mobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big PictureMobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big Picture
 
Mobile cloud computing; Future of Cloud Computing
Mobile cloud computing; Future of Cloud ComputingMobile cloud computing; Future of Cloud Computing
Mobile cloud computing; Future of Cloud Computing
 
Data Architectures for Robust Decision Making
Data Architectures for Robust Decision MakingData Architectures for Robust Decision Making
Data Architectures for Robust Decision Making
 
Have your cake and eat it too
Have your cake and eat it tooHave your cake and eat it too
Have your cake and eat it too
 
Mobile Cloud Computing
Mobile Cloud ComputingMobile Cloud Computing
Mobile Cloud Computing
 
Kafka for DBAs
Kafka for DBAsKafka for DBAs
Kafka for DBAs
 
Mobile Cloud Computing
Mobile Cloud ComputingMobile Cloud Computing
Mobile Cloud Computing
 

Similar to Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene)

ICWE2013 - Discovering links between political debates and media
ICWE2013 - Discovering links between political debates and mediaICWE2013 - Discovering links between political debates and media
ICWE2013 - Discovering links between political debates and mediagjhouben
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers ProjectEuropeana Newspapers
 
Connecting political data to media data
Connecting political data to media dataConnecting political data to media data
Connecting political data to media dataLaura Hollink
 
The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012Europeana Newspapers
 
Connecting political data to media data
Connecting political data to media dataConnecting political data to media data
Connecting political data to media dataLaura Hollink
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampWim Peters
 
Information Extraction from EuroParliament and UK Parliament data
Information Extraction from EuroParliament and UK Parliament dataInformation Extraction from EuroParliament and UK Parliament data
Information Extraction from EuroParliament and UK Parliament dataWim Peters
 
slides_ZU_Text_mining_final (MEDIUM).pdf
slides_ZU_Text_mining_final (MEDIUM).pdfslides_ZU_Text_mining_final (MEDIUM).pdf
slides_ZU_Text_mining_final (MEDIUM).pdfPetr Korab
 
(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...
(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...
(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...Pontificia Universidad Javeriana Cali
 
The META-NET Strategic Research Agenda for Multilingual Europe 2020
The META-NET Strategic Research Agenda for Multilingual Europe 2020The META-NET Strategic Research Agenda for Multilingual Europe 2020
The META-NET Strategic Research Agenda for Multilingual Europe 2020Georg Rehm
 
Data Science Capstone - Global Economics
Data Science Capstone - Global EconomicsData Science Capstone - Global Economics
Data Science Capstone - Global EconomicsMeagan Thompson
 
Topics of interest for IWPT'01.doc
Topics of interest for IWPT'01.docTopics of interest for IWPT'01.doc
Topics of interest for IWPT'01.docbutest
 
Groningen nl pgroep
Groningen nl pgroepGroningen nl pgroep
Groningen nl pgroepmaartenmarx
 
“How to write a successful proposal” .
“How to write a successful proposal” .“How to write a successful proposal” .
“How to write a successful proposal” .crysatal16
 

Similar to Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene) (20)

The Europeana Newspapers Project
The Europeana Newspapers ProjectThe Europeana Newspapers Project
The Europeana Newspapers Project
 
Reasoning with Reasoning (STRiX 2014)
Reasoning with Reasoning (STRiX 2014)Reasoning with Reasoning (STRiX 2014)
Reasoning with Reasoning (STRiX 2014)
 
Online Seminar Kickoff
Online Seminar KickoffOnline Seminar Kickoff
Online Seminar Kickoff
 
ICWE2013 - Discovering links between political debates and media
ICWE2013 - Discovering links between political debates and mediaICWE2013 - Discovering links between political debates and media
ICWE2013 - Discovering links between political debates and media
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers Project
 
Connecting political data to media data
Connecting political data to media dataConnecting political data to media data
Connecting political data to media data
 
NECTAR_VRE1
NECTAR_VRE1NECTAR_VRE1
NECTAR_VRE1
 
The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012The Europeana Newspapers Presentation - Cyberspace 2012
The Europeana Newspapers Presentation - Cyberspace 2012
 
Connecting political data to media data
Connecting political data to media dataConnecting political data to media data
Connecting political data to media data
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative Camp
 
Information Extraction from EuroParliament and UK Parliament data
Information Extraction from EuroParliament and UK Parliament dataInformation Extraction from EuroParliament and UK Parliament data
Information Extraction from EuroParliament and UK Parliament data
 
slides_ZU_Text_mining_final (MEDIUM).pdf
slides_ZU_Text_mining_final (MEDIUM).pdfslides_ZU_Text_mining_final (MEDIUM).pdf
slides_ZU_Text_mining_final (MEDIUM).pdf
 
(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...
(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...
(Lecture notes in economics and mathematical systems 600) ralf borndörfer, an...
 
The META-NET Strategic Research Agenda for Multilingual Europe 2020
The META-NET Strategic Research Agenda for Multilingual Europe 2020The META-NET Strategic Research Agenda for Multilingual Europe 2020
The META-NET Strategic Research Agenda for Multilingual Europe 2020
 
Open Access and Open Assessment: The Future of Peer Reviewed Journals? Experi...
Open Access and Open Assessment: The Future of Peer Reviewed Journals? Experi...Open Access and Open Assessment: The Future of Peer Reviewed Journals? Experi...
Open Access and Open Assessment: The Future of Peer Reviewed Journals? Experi...
 
DM2E DHAB Meeting: Report on the 2nd Project Review Meeting
DM2E DHAB Meeting: Report on the 2nd Project Review MeetingDM2E DHAB Meeting: Report on the 2nd Project Review Meeting
DM2E DHAB Meeting: Report on the 2nd Project Review Meeting
 
Data Science Capstone - Global Economics
Data Science Capstone - Global EconomicsData Science Capstone - Global Economics
Data Science Capstone - Global Economics
 
Topics of interest for IWPT'01.doc
Topics of interest for IWPT'01.docTopics of interest for IWPT'01.doc
Topics of interest for IWPT'01.doc
 
Groningen nl pgroep
Groningen nl pgroepGroningen nl pgroep
Groningen nl pgroep
 
“How to write a successful proposal” .
“How to write a successful proposal” .“How to write a successful proposal” .
“How to write a successful proposal” .
 

More from Sebastian Ruder

Frontiers of Natural Language Processing
Frontiers of Natural Language ProcessingFrontiers of Natural Language Processing
Frontiers of Natural Language ProcessingSebastian Ruder
 
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftStrong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftSebastian Ruder
 
On the Limitations of Unsupervised Bilingual Dictionary Induction
On the Limitations of Unsupervised Bilingual Dictionary InductionOn the Limitations of Unsupervised Bilingual Dictionary Induction
On the Limitations of Unsupervised Bilingual Dictionary InductionSebastian Ruder
 
Neural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain ShiftNeural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain ShiftSebastian Ruder
 
Successes and Frontiers of Deep Learning
Successes and Frontiers of Deep LearningSuccesses and Frontiers of Deep Learning
Successes and Frontiers of Deep LearningSebastian Ruder
 
Optimization for Deep Learning
Optimization for Deep LearningOptimization for Deep Learning
Optimization for Deep LearningSebastian Ruder
 
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoHuman Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoSebastian Ruder
 
Machine intelligence in HR technology: resume analysis at scale - Adrian Mihai
Machine intelligence in HR technology: resume analysis at scale - Adrian MihaiMachine intelligence in HR technology: resume analysis at scale - Adrian Mihai
Machine intelligence in HR technology: resume analysis at scale - Adrian MihaiSebastian Ruder
 
Hashtagger+: Real-time Social Tagging of Streaming News - Dr. Georgiana Ifrim
Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene)

  • 1. Dynamic Topic Modeling via Non-negative Matrix Factorization. Derek Greene, University College Dublin.
  • 2. Overview
      • Topic Modeling
          • Non-negative Matrix Factorization
          • Dynamic Topic Modeling
      • Proposed Approach
          • Dynamic Topic Modeling via Non-negative Matrix Factorization
      • Application
          • Topic Modeling European Parliamentary Speeches
      • September 2016
  • 3. Topic Modeling
      • Goal: Discover hidden thematic structure in a corpus of text (e.g. tweets, Facebook posts, news articles, political speeches).
      • Unsupervised approach; no prior annotation required.
      • [Figure: pipeline from Input through Data Preparation and the Topic Modeling Algorithm to Output (Topic 1, Topic 2, ..., Topic k)]
      • The output of topic modeling is a set of k topics. Each topic has:
          1. A descriptor, based on the highest-ranked terms for the topic.
          2. Membership weights for all documents relative to the topic.
  • 4. Topic Modeling with NMF
      • Non-negative Matrix Factorization (NMF): a family of linear algebra algorithms for identifying the latent structure in data represented as a non-negative matrix (Lee & Seung, 1999).
      • NMF can be applied to topic modeling, where the input is a document-term matrix, typically TF-IDF normalized.
      • Input: document-term matrix A; user-specified number of topics k.
      • Output: two k-dimensional factors W and H approximating A.
      • [Figure: input matrix A (n x m, documents x terms) ≈ factor W (n x k, documents x topics) · factor H (k x m, topics x terms)]
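The factorization on this slide can be sketched with the widely used multiplicative update rules for the Frobenius-norm objective. This is a minimal NumPy sketch under stated assumptions, not the implementation used in the talk: the function name `nmf`, the iteration count, the random initialization and the toy matrix are all chosen here for illustration.

```python
import numpy as np

def nmf(A, k, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix A (n x m) as W (n x k) @ H (k x m)."""
    rng = np.random.default_rng(seed)
    n, m = A.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        # Multiplicative updates: entries are only ever scaled by
        # non-negative ratios, so W and H stay non-negative.
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Toy document-term matrix (assumed data): 6 documents x 10 terms,
# built from three blocks of co-occurring terms.
A = np.zeros((6, 10))
A[0:2, 0:3] = 1.0
A[2:4, 3:6] = 1.0
A[4:6, 6:10] = 1.0

W, H = nmf(A, k=3)
error = np.linalg.norm(A - W @ H)  # Frobenius reconstruction error
```

In practice a library routine with a better initialization would normally replace the hand-written loop; the sketch only shows the shapes of W and H and how they relate to A.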
  • 5. Example: NMF Topic Modeling
      • Apply standard NMF to document-term matrix A (6 rows x 10 columns) for k=3 topics.
      • [Figure: matrix A with rows document 1 to document 6 and columns research, stem, education, disease, patient, health, budget, finance, banking, bonds]
  • 6. Example: NMF Topic Modeling
      • [Figure: Factor H (weights for terms), with rows Topic 1 to Topic 3 and columns research, stem, education, disease, patient, health, budget, finance, banking, bonds]
      • [Figure: Factor W (weights for documents), with rows document 1 to document 6 and columns Topic 1 to Topic 3]
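Reading the two factors can be sketched as follows: a topic descriptor is the set of highest-weighted terms in a row of H, and a document's strongest topic is the largest entry in its row of W. The numeric values of `H` and `W` below are made up for illustration (the slide figures do not survive in this transcript), and the helper name `descriptors` is hypothetical.

```python
import numpy as np

terms = ["research", "stem", "education", "disease", "patient",
         "health", "budget", "finance", "banking", "bonds"]

# Hypothetical factor H (3 topics x 10 terms); values are illustrative only.
H = np.array([
    [0.9, 0.7, 0.8, 0.0, 0.0, 0.1, 0.1, 0.0, 0.0, 0.0],
    [0.1, 0.0, 0.0, 0.9, 0.8, 0.7, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.0, 0.0, 0.0, 0.8, 0.9, 0.7, 0.6],
])

# Hypothetical factor W (6 documents x 3 topics); values are illustrative only.
W = np.array([
    [0.9, 0.0, 0.1],
    [0.8, 0.2, 0.0],
    [0.1, 0.9, 0.0],
    [0.0, 0.7, 0.3],
    [0.0, 0.1, 0.9],
    [0.2, 0.0, 0.8],
])

def descriptors(H, terms, top=3):
    """Return the `top` highest-weighted terms for each topic (row of H)."""
    order = np.argsort(-H, axis=1)[:, :top]
    return [[terms[j] for j in row] for row in order]

topic_terms = descriptors(H, terms)
doc_topic = W.argmax(axis=1)  # index of the strongest topic for each document
```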
  • 7. Dynamic Topic Models
      • Standard topic modeling approaches assume that the order of documents does not matter, so they are not suitable for time-stamped data.
      • Dynamic topic modeling: approaches to track how language changes and topics evolve over time in a time-stamped corpus.
      • [Figure (D. Blei, 2012): inaugural address example]
  • 8. Dynamic Topic Modeling via Non-negative Matrix Factorization
  • 9. Proposed Approach
      • Two-level approach: link together related topics found in different time windows to track topics over time.
      • Divide the corpus into 𝜏 time windows of equal duration (e.g. days, weeks, months, quarters, or years).
      • Level 1: Apply NMF topic modeling to the documents in each window to produce window topics.
      • Level 2: Apply another layer of NMF to all topics from Level 1 to find dynamic topics which span multiple time windows.
      • Example of related topics in three windows (top 5 terms each):

        Rank   Window 1    Window 2      Window 3
        1      eurozone    greece        greece
        2      greece      debt          russia
        3      imf         germany       debt
        4      loan        reparations   eu
        5      debt        eu            loan
  • 10. Proposed Approach
      • Key idea for Level 2:
          • View the topic basis vectors (the rows of each topics x terms factor H) found in each time window as “topic documents”.
          • Construct a new combined representation from these H factors, similar to the idea of “stacking” in supervised ensembles.
          • Apply NMF to this new representation.
      • [Figure: 𝜏 time window datasets → 𝜏 NMF H factors (Factor H from Window 1, Window 2, Window 3, …, Window 𝜏) → topic-term matrix with n’ topic documents and m’ terms]
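The combined representation for Level 2 can be sketched as below. This is one possible construction under stated assumptions, not the talk's code: each window is assumed to have its own vocabulary, the combined matrix is built over the union of those vocabularies, and the function name `stack_window_topics`, the random factors and the label format are illustrative.

```python
import numpy as np

def stack_window_topics(window_factors):
    """Stack per-window H factors into one topic-term matrix.

    window_factors: list of (H, vocab) pairs, where H has shape (k_i, m_i)
    and vocab lists the m_i terms of that window in column order.
    Returns (B, all_terms, labels); B has one row per window topic.
    """
    all_terms = sorted({t for _, vocab in window_factors for t in vocab})
    col = {t: j for j, t in enumerate(all_terms)}
    rows, labels = [], []
    for w, (H, vocab) in enumerate(window_factors, start=1):
        idx = [col[t] for t in vocab]
        for i, topic in enumerate(H, start=1):
            row = np.zeros(len(all_terms))
            row[idx] = topic  # terms absent from this window stay at 0
            rows.append(row)
            labels.append(f"Window{w}-{i:02d}")
    return np.vstack(rows), all_terms, labels

# Two windows with three topics each; weights are random placeholders.
vocab1 = ["health", "patient", "disease", "research",
          "education", "budget", "finance", "banking"]
vocab2 = ["health", "patient", "citizen", "research",
          "education", "budget", "finance", "banking"]
rng = np.random.default_rng(0)
H1 = rng.random((3, len(vocab1)))
H2 = rng.random((3, len(vocab2)))

B, all_terms, labels = stack_window_topics([(H1, vocab1), (H2, vocab2)])
```

The matrix `B` plays the role of a document-term matrix whose "documents" are window topics, so it can be passed to any NMF routine to obtain the dynamic topics.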
  • 11. Example: Dynamic Topic Modeling
      • Topic-term matrix for 2 time window results, each with 3 topics.
      • [Figure: topic-term matrix heatmap. Rows: Window1-01, Window1-02, Window1-03 (topics for time window 1) and Window2-01, Window2-02, Window2-03 (topics for time window 2). Columns: health, patient, disease, citizen, research, education, budget, finance, banking.]
  • 12. Application: European Parliament
      • Collaboration with Dr. James Cross, UCD School of Politics & International Relations.
  • 13. Exploring the European Parliament Agenda
      • Directly elected parliamentary institution of the EU.
      • 8th term began in July 2014.
      • 751 Members of the European Parliament (MEPs) from 28 member states.
      • 12 plenary sessions per year are held in Strasbourg.
      • During sessions, members may speak after being called by the President. Speaking time available to MEPs is strictly limited.
      • MEPs use speeches to state their positions on policies, to explain votes, and to demonstrate to their electorates that they are representing their interests in Europe.
  • 14. Data Collection
      • In autumn 2014 we collected ~400k records from Europarl (http://europarl.europa.eu).
      • The records cover the activities of MEPs in the European Parliament during terms 5-7 (1999-2014).
      • Focus on records of speeches in plenary, which account for 54.3% of all Europarl records.
  • 15. Data Collection
      • Original corpus contains 269,696 plenary speeches.
      • Identified a subset of 210,247 English-language speeches, either native or translated.
      • Divided these into 60 “time window” datasets. Each time window is a quarter, from 1999-Q3 to 2014-Q2.
      • [Figure: number of speeches per time window (quarter number)]
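Splitting a time-stamped corpus into quarterly windows can be sketched with the standard library alone. The helper names and the three placeholder speeches are assumptions for illustration; only the window range (1999-Q3 to 2014-Q2, giving 60 quarters) comes from the slide.

```python
from collections import defaultdict
from datetime import date

def quarter_label(d):
    """Map a date to a label such as '1999-Q3'."""
    return f"{d.year}-Q{(d.month - 1) // 3 + 1}"

def quarter_range(start_year, start_q, end_year, end_q):
    """List every quarter label from the start to the end quarter, inclusive."""
    labels = []
    y, q = start_year, start_q
    while (y, q) <= (end_year, end_q):
        labels.append(f"{y}-Q{q}")
        y, q = (y, q + 1) if q < 4 else (y + 1, 1)
    return labels

def split_by_quarter(speeches):
    """Group (date, text) pairs into per-quarter lists of texts."""
    grouped = defaultdict(list)
    for d, text in speeches:
        grouped[quarter_label(d)].append(text)
    return dict(grouped)

windows = quarter_range(1999, 3, 2014, 2)

# Placeholder speeches, not real records from the corpus.
speeches = [
    (date(1999, 7, 20), "speech a"),
    (date(1999, 9, 14), "speech b"),
    (date(2003, 2, 12), "speech c"),
]
grouped = split_by_quarter(speeches)
```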
  • 16. Time Window Topic Modeling
      • Applied NMF to the document-term matrix for the speeches in each of the 60 time windows.
      • Used an automated topic coherence approach to choose the number of topics k for each window (O’Callaghan et al., 2015).
      • ➡ Output: 60 sets of time window topics.
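Choosing k by coherence can be sketched as: fit a model for each candidate k, score each model's topic descriptors, and keep the k with the best score. The slide does not spell out the measure, so the sketch below assumes a simple one, the mean pairwise cosine similarity between term vectors of a descriptor. The term vectors and the candidate descriptors are made-up toy values, and all function names are hypothetical.

```python
import numpy as np
from itertools import combinations

def topic_coherence(descriptor, vectors):
    """Mean pairwise cosine similarity between the term vectors of one descriptor."""
    sims = []
    for a, b in combinations(descriptor, 2):
        va, vb = vectors[a], vectors[b]
        sims.append(va @ vb / (np.linalg.norm(va) * np.linalg.norm(vb)))
    return float(np.mean(sims))

def model_coherence(descriptors, vectors):
    """Average coherence over all topics of one model."""
    return float(np.mean([topic_coherence(d, vectors) for d in descriptors]))

def choose_k(candidates, vectors):
    """candidates maps k -> list of topic descriptors; return the best-scoring k."""
    return max(candidates, key=lambda k: model_coherence(candidates[k], vectors))

# Toy 2-dimensional term vectors (assumed; real ones would come from an embedding model).
vectors = {
    "health": np.array([1.0, 0.0]),
    "patient": np.array([0.9, 0.1]),
    "budget": np.array([0.0, 1.0]),
    "finance": np.array([0.1, 0.9]),
}

# Hypothetical descriptors produced by models with k=1 and k=2 topics.
candidates = {
    1: [["health", "patient", "budget", "finance"]],
    2: [["health", "patient"], ["budget", "finance"]],
}
best_k = choose_k(candidates, vectors)
```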
  • 17. Time Window Topic Modeling
      • Example topic: 2003-Q1.
      • The top 10 terms suggest that this topic relates to the Iraq war.
      • The top 10 speeches for this topic provide the context.
  • 18. Dynamic Topic Modeling Results
      • Applying dynamic topic modeling to the resulting topic-term matrix, with parameter selection, yields 57 dynamic topics which show the varied nature of the European Parliament’s agenda.
  • 19. Example: Climate Change
      • [Figure: number of speeches per year (2000-2014) for the climate change topic, with annotated points Climate Change Package, Cancun, Copenhagen and Montreal]
  • 20. Example: Financial & Euro Crisis
      • [Figure: number of speeches per year (2000-2014) for the topics "Financial crisis" and "Euro crisis", with points marked A, B, C and D]
  • 21. Dynamic Topics by Politician
      • We associate MEPs with dynamic topics based on the number of the MEP's speeches associated with each dynamic topic's window topics.
      • [Figure: Pat Cox (Ireland), top 10 most relevant dynamic topics]
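The association between MEPs and dynamic topics can be sketched as a counting step. The sketch assumes that each speech has already been assigned to one window topic and each window topic to at most one dynamic topic; how those assignments are made is not stated on the slide. The function names, topic labels and MEP names are placeholders.

```python
from collections import Counter, defaultdict

def mep_topic_counts(speeches, window_to_dynamic):
    """Count, per MEP, the speeches that fall under each dynamic topic.

    speeches: iterable of (mep, window_topic) pairs, one per speech.
    window_to_dynamic: maps a window topic label to a dynamic topic label.
    """
    counts = defaultdict(Counter)
    for mep, window_topic in speeches:
        dynamic = window_to_dynamic.get(window_topic)
        if dynamic is not None:  # skip window topics without a dynamic topic
            counts[mep][dynamic] += 1
    return counts

def top_dynamic_topics(counts, mep, n=10):
    """Return the n dynamic topics with the most speeches for one MEP."""
    return counts[mep].most_common(n)

# Placeholder assignments, not real data from the corpus.
window_to_dynamic = {"W1-01": "D1", "W2-03": "D1", "W2-01": "D2"}
speeches = [
    ("MEP A", "W1-01"),
    ("MEP A", "W2-03"),
    ("MEP A", "W2-01"),
    ("MEP B", "W2-01"),
    ("MEP B", "W3-02"),  # no dynamic topic, ignored
]
counts = mep_topic_counts(speeches, window_to_dynamic)
ranking = top_dynamic_topics(counts, "MEP A", n=2)
```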
  • 22. Dynamic Topics by Country
      • [Figure: panels for Ireland and Cyprus]
  • 23. More Information
      • European Parliament Speeches - Topic Explorer: http://erdos.ucd.ie/europarl
      • Python code and documentation: https://github.com/derekgreene/dynamic-nmf
      • D. Greene, J. P. Cross. “Unveiling the Political Agenda of the European Parliament Plenary: A Topical Analysis”. In Proc. ACM Web Science’15, 2015.
      • D. Greene, J. P. Cross. “Exploring the political agenda of the European parliament using a dynamic topic modeling approach”. Political Analysis, 2017 (in press).
      • Contact: derek.greene@ucd.ie, @derekgreene
  • 24. References
      • D. Blei, A. Y. Ng, M. Jordan. “Latent Dirichlet allocation”. Journal of Machine Learning Research, 3:993–1022, 2003.
      • D. Blei. “Probabilistic topic models”. Communications of the ACM, 2012.
      • D. D. Lee, H. S. Seung. “Learning the parts of objects by non-negative matrix factorization”. Nature, 401:788–91, 1999.
      • D. O’Callaghan, D. Greene, J. Carthy, P. Cunningham. “An analysis of the coherence of descriptors in topic modeling”. Expert Systems with Applications (ESWA), 2015.
      • W. X. Zhao et al. “Comparing twitter and traditional media using topic models”. Advances in Information Retrieval, 2011.
      • J. Grimmer. “A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases”. Political Analysis, 18(1):1–35, 2010.