SlideShare une entreprise Scribd logo
1  sur  11
Mining OpenStack Community Network
with Neo4j
OpenStack Summit Atlanta
vBrownBag | May 13, 2014
Kamesh Raghavendra
kamesh@netapp.com
2
What?
Property graph representation of the global OpenStack
community network including:
• People
• Developers, Customers, Service Operators
• Interactions
• Mailing lists, Blueprints, Code check-ins
• Contexts
• Location, Parent Organization, Project
Opened to the community in the form of canned and
adhoc graph queries
3
Who?
Joint initiative between:
Where?
http://www.ccbd-pes.in/openstack-tool.html
http://www.ccbd-pes.in/openstack-graph-query.html
When?
Project active since March 2014, v1.0 by Paris summit
Collaborators welcome!
4
Motivations
• Product strategy & management
• Discover patterns of OpenStack consumption & deployment
• Demographic trends across organizations, industry
verticals & geography
• Segment consumers by demography
• Analyze multi-faceted roles
• Community members playing roles of consumers,
developers & service operators – more often multiple at the
same time
5
OpenStack Data Sources Integrated
• Mail Archives [58,702: http://openstack.markmail.org/]
• Support Form [10,344: https://ask.openstack.org/en/questions/]
• Bug Tracker [6,520: https://bugs.launchpad.net/openstack]
• Blueprints [6,311: https://blueprints.launchpad.net/openstack]
• More sources being integrated
6
OpenStack Network Graph Data Model
Hosted on Neo4j 2.0.3 Community Edition Server
Demographic Context
• Parent organization
• Country
• Industry Vertical
Interaction Context
• Project
• Sentiment
Person
7
OpenStack Network Graph Query Demo
https://www.youtube.com/watch?v=8N-D1IRcUo4
8
Sample Graph Queries
Which are most popular OpenStack projects in Japan?
MATCH (C:COUNTRY)--()--()--()--(Q)--(N:Project)
WHERE C.NAME=’Japan' WITH C,COUNT(Q) AS Count,N ORDER BY Count DESC
RETURN N.TAGNAME,Count
Which are the most popular industries in UK adopting OpenStack?
MATCH (I:INDUSTRY)--(O:ORGANIZATION)--(C:COUNTRY)
WHERE C.NAME='Uk' WITH COUNT(I) AS S,I ORDER BY S DESC
RETURN I.NAME,S LIMIT 5
9
More Sample Graph Queries
Who are the top 5 weekend contributors?
MATCH (O:ORGANIZATION)--(D:DOMAIN)--(P:PERSON)--(Q)
WHERE Q.TIMESTAMP=~".*Sat.*" OR Q.TIMESTAMP=~".*Sun.*" WITH COUNT(Q) AS N,
P, O ORDER BY N DESC
RETURN P.FULL_NAME AS Name, O.NAME AS Organization, N AS
Weekend_Contributions LIMIT 5
10
Road Ahead
• Enhance & automate data ETL
• Integrate more data sources
• Extract more contexts – sentiment, expertise, role
• Enhance query user experience – schema, syntax
• Bring popular queries as canned reports
Seeking early users & collaborators to accelerate
development
Thank You

Contenu connexe

Similaire à OpenStack Community Network Mining with Neo4j

Open Source Social Software
Open Source Social SoftwareOpen Source Social Software
Open Source Social Software
Josie Fraser
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
Marcus Hanwell
 

Similaire à OpenStack Community Network Mining with Neo4j (20)

OpenStack: five years in
OpenStack: five years inOpenStack: five years in
OpenStack: five years in
 
OpenStack 5th Birthday
OpenStack 5th BirthdayOpenStack 5th Birthday
OpenStack 5th Birthday
 
CrossRef Distributed Usage Logging Pilot
CrossRef Distributed Usage Logging PilotCrossRef Distributed Usage Logging Pilot
CrossRef Distributed Usage Logging Pilot
 
nstitutional repositories, item and research data metrics
nstitutional repositories, item and research data metricsnstitutional repositories, item and research data metrics
nstitutional repositories, item and research data metrics
 
Webinar "Introduction to OpenStack"
Webinar "Introduction to OpenStack"Webinar "Introduction to OpenStack"
Webinar "Introduction to OpenStack"
 
OpenStack in Action 4! Heidi Bretz - State of OpenStack, what's new, the tech...
OpenStack in Action 4! Heidi Bretz - State of OpenStack, what's new, the tech...OpenStack in Action 4! Heidi Bretz - State of OpenStack, what's new, the tech...
OpenStack in Action 4! Heidi Bretz - State of OpenStack, what's new, the tech...
 
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth SciencesValues & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
 
Reco4J @ London Meetup (June 26th)
Reco4J @ London Meetup (June 26th)Reco4J @ London Meetup (June 26th)
Reco4J @ London Meetup (June 26th)
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation Repositories
 
The Agile and Open Source Way (AgileTour Brussels)
The Agile and Open Source Way (AgileTour Brussels)The Agile and Open Source Way (AgileTour Brussels)
The Agile and Open Source Way (AgileTour Brussels)
 
Developing on OpenStack Startup Edmonton
Developing on OpenStack Startup EdmontonDeveloping on OpenStack Startup Edmonton
Developing on OpenStack Startup Edmonton
 
Open Source Social Software
Open Source Social SoftwareOpen Source Social Software
Open Source Social Software
 
OpenStack Documentation in the Open
OpenStack Documentation in the OpenOpenStack Documentation in the Open
OpenStack Documentation in the Open
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
 
OpenStack Documentation Projects and Processes
OpenStack Documentation Projects and ProcessesOpenStack Documentation Projects and Processes
OpenStack Documentation Projects and Processes
 
Open World Forum - The Agile and Open Source Way
Open World Forum - The Agile and Open Source WayOpen World Forum - The Agile and Open Source Way
Open World Forum - The Agile and Open Source Way
 
Guide to open source
Guide to open source Guide to open source
Guide to open source
 
Introduction to LoCloud
Introduction to LoCloud Introduction to LoCloud
Introduction to LoCloud
 
OmniSuggest: A Ubiquitous Cloud-Based Context-Aware Recommendation System for...
OmniSuggest: A Ubiquitous Cloud-Based Context-Aware Recommendation System for...OmniSuggest: A Ubiquitous Cloud-Based Context-Aware Recommendation System for...
OmniSuggest: A Ubiquitous Cloud-Based Context-Aware Recommendation System for...
 
Open stack 5th birthday slide deck
Open stack 5th birthday slide deckOpen stack 5th birthday slide deck
Open stack 5th birthday slide deck
 

Dernier

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Dernier (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

OpenStack Community Network Mining with Neo4j

  • 1. Mining OpenStack Community Network with Neo4j OpenStack Summit Atlanta vBrownBag | May 13, 2014 Kamesh Raghavendra kamesh@netapp.com
  • 2. 2 What? Property graph representation of the global OpenStack community network including: • People • Developers, Customers, Service Operators • Interactions • Mailing lists, Blueprints, Code check-ins • Contexts • Location, Parent Organization, Project Opened to the community in the form of canned and adhoc graph queries
  • 4. 4 Motivations • Product strategy & management • Discover patterns of OpenStack consumption & deployment • Demographic trends across organizations, industry verticals & geography • Segment consumers by demography • Analyze multi-faceted roles • Community members playing roles of consumers, developers & service operators – more often multiple at the same time
  • 5. 5 OpenStack Data Sources Integrated • Mail Archives [58,702: http://openstack.markmail.org/] • Support Form [10,344: https://ask.openstack.org/en/questions/] • Bug Tracker [6,520: https://bugs.launchpad.net/openstack] • Blueprints [6,311: https://blueprints.launchpad.net/openstack] • More sources being integrated
  • 6. 6 OpenStack Network Graph Data Model Hosted on Neo4j 2.0.3 Community Edition Server Demographic Context • Parent organization • Country • Industry Vertical Interaction Context • Project • Sentiment Person
  • 7. 7 OpenStack Network Graph Query Demo https://www.youtube.com/watch?v=8N-D1IRcUo4
  • 8. 8 Sample Graph Queries Which are most popular OpenStack projects in Japan? MATCH (C:COUNTRY)--()--()--()--(Q)--(N:Project) WHERE C.NAME=’Japan' WITH C,COUNT(Q) AS Count,N ORDER BY Count DESC RETURN N.TAGNAME,Count Which are the most popular industries in UK adopting OpenStack? MATCH (I:INDUSTRY)--(O:ORGANIZATION)--(C:COUNTRY) WHERE C.NAME='Uk' WITH COUNT(I) AS S,I ORDER BY S DESC RETURN I.NAME,S LIMIT 5
  • 9. 9 More Sample Graph Queries Who are the top 5 weekend contributors? MATCH (O:ORGANIZATION)--(D:DOMAIN)--(P:PERSON)--(Q) WHERE Q.TIMESTAMP=~".*Sat.*" OR Q.TIMESTAMP=~".*Sun.*" WITH COUNT(Q) AS N, P, O ORDER BY N DESC RETURN P.FULL_NAME AS Name, O.NAME AS Organization, N AS Weekend_Contributions LIMIT 5
  • 10. 10 Road Ahead • Enhance & automate data ETL • Integrate more data sources • Extract more contexts – sentiment, expertise, role • Enhance query user experience – schema, syntax • Bring popular queries as canned reports Seeking early users & collaborators to accelerate development