SlideShare une entreprise Scribd logo
1  sur  21
Career Path Analysis with Neo4j
Presentation by:

30 Beach St #2
Quincy, MA 02170
www.collegeminer.com
www.patheer.com

Matthew S. Harris
Co-Founder, Technology & Infrastructure
matthew.harris@collegeminer.com
matthew.harris@patheer.com
Twitter @harrisminer
About Me
Matthew Harris
Co-Founder, Technology & Infrastructure
• 10+ years database architecture and application development
• Masters Degree in Business Intelligence and Data Mining
• Experience in other startups
• Research at Boston University

Past Experience
Introduction

Build data analytics tools focused on analyzing career outcomes and paths.
• Founded in 2011
• Located in Boston, MA
• Original Premise - Do students get jobs related to their major?
• What can I do with my major?
New Tool
Live your passion, discover your path!
• Private Beta Launched in May 2013
• Public Beta Launch on November 8, 2013 in Boston, MA
• www.patheer.com

Goals & Focus
Holistic career analysis, planning, and recommendation tool.
1. Don’t get weeded out
2. Avoid painful job searches
3. Discover and plan the path to your dream career
Patheer Hierarchy
User Data

Market Data
20gb/day -city

Resume

Job Postings

Extraction & Parsing
User Activity

Resumes
Data Processing

Data Stores

Data Relay
Precog

Analysis Engine

Application

MongoDB

Neo4j

MS SQL
User Capabilities
Resume Analysis
• Understand how parsers work
• Analyze how complete your resume is
according to parser

Job Matches
• Get jobs that match your background
• Analyze why you don’t qualify for a particular job

Research
• Jobs

• Schools

• Companies

• Cities

Career Path Analysis
• View and analyze your career path
• Analyze what others did to reach your career goal
• Get recommendations on how to reach your career goal
Problem & Solution
How to store and analyze this data?
• Not transactional
• Somewhat relational
• Unstructured/Semi-structured data
• Direct and indirect connections
• Real-time and batch
• Flexible/Partial schema

• 3 instances
• Relationships
• Paths
• Weighted Paths

• Neo4jClient (C# Library)
• Shout out to Tatham Oddie!

• Customized data processing
• Mostly depth-first analysis
Relationships
Career Path Analysis
1. View and analyze your career path
2. Analyze what others did to reach your career goal
3. Get recommendations on how to reach your career goal
User Career Path
What does my career path look like?
START person1=node(*)
MATCH m = person1-[p:PATH]->x
WHERE p.UserID! = {userid}
RETURN p order by p.Date asc;

User 1

Bachelors
Degree

Database
Analyst

Database
Admin
User Career Path
Add your career goal

?

User 1
Database
Analyst
Bachelors
Degree

Database
Architect
Database
Admin
Career Path Analysis
1. View and analyze your career path
2. Analyze what others did to reach your career goal
3. Get recommendations on how to reach your career goal
Career Pathing with Neo4j
User X

Bachelors
Degree

Database
Analyst

Database
Admin

Masters
Degree

Database
Architect

User Y

Data
Analyst

Bachelors
Degree

Database
Develop

Database
Admin

Database
Architect

User Z

Bachelors
Degree

Masters
Degree

Database
Admin

Database
Develop

Database
Architect
Career Pathing with Neo4j
Database
Analyst

User X

Database
Admin

User Y

User Z

Data
Analyst

Database
Develop

Bachelors
Degree

Masters
Degree

Database
Architect
Career Pathing with Neo4j
Database
Develop

Database
Analyst

Data
Analyst

1
1

1
1

Users

1

1
1

Database
Admin

2

Bachelors
Degree

1

1
1

1
Masters
Degree

1

Database
Architect
Analysis
What are the top 3 traversals for each job group?
• Not an easy task!
• Variable path lengths with unique traversals
• Can’t simply do allPaths or x – [p:PATH*]-> y
• Unique identifiers
• Where x.pathnumber + 1 = y.pathnumber

• A* and Dijkstra
• Only least cost/cheapest path
• Need most cost (most traversed)

• Customized Solution/Query
•
•
•
•

Batch process nightly for all end nodes
Calculate sum of path weights (still testing optimal solution)
Store top 3 results in Precog (backend)
Application queries Precog
Career Path Analysis
1. View and analyze your career path
2. Analyze what others did to reach your career goal
3. Get recommendations on how to reach your career goal
User Career Path Recommendations
Recommendations based off of:
• Current Position –[p:path*1..?]->Career Goal
• User background (from resume)
• Real-time market data

1

• User relationships and connections

User 1
Database
Analyst
Bachelors
Degree

Database
Admin

2
3

Database
Architect
Timeline
Beta Launch!
• November 8, 2013
• Greater Boston Area
• All Job Categories and Industries

City Expansion
• Spring 2014
• All Job Categories and Industries

New Features (Coming Soon!)
• Research College Majors
• Career Path Explorer
Career Path Explorer Teaser
1. How do I become a…?
• Start at End Node and work backwards

?

Database
Architect

Bachelors
Degree

?

?

Database
Architect

2. What can I do with my degree?
• Start at Start Node and work forward

3. Advanced Search
• Select Start and End Nodes
Bachelors
Degree
THANK YOU!!

Presentation by:

30 Beach St #2
Quincy, MA 02170
www.collegeminer.com
www.patheer.com

Matthew S. Harris
Co-Founder, Technology & Infrastructure
matthew.harris@collegeminer.com
matthew.harris@patheer.com
Twitter @harrisminer

Contenu connexe

Tendances

Graph database Use Cases
Graph database Use CasesGraph database Use Cases
Graph database Use CasesMax De Marzi
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph SolutionNeo4j
 
Jobs at Scale: Developing and Analyzing JTBD Surveys
Jobs at Scale: Developing and Analyzing JTBD SurveysJobs at Scale: Developing and Analyzing JTBD Surveys
Jobs at Scale: Developing and Analyzing JTBD SurveysBetsy Bland
 
Neo4j Training Introduction
Neo4j Training IntroductionNeo4j Training Introduction
Neo4j Training IntroductionMax De Marzi
 
Intermediate Cypher.pdf
Intermediate Cypher.pdfIntermediate Cypher.pdf
Intermediate Cypher.pdfNeo4j
 
Building the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSXBuilding the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSXNeo4j
 
Training Series - Intro to Neo4j
Training Series - Intro to Neo4jTraining Series - Intro to Neo4j
Training Series - Intro to Neo4jNeo4j
 
Introducing Neo4j
Introducing Neo4jIntroducing Neo4j
Introducing Neo4jNeo4j
 
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine LearnGraphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine LearnNeo4j
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Supply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion DeckSupply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion DeckNeo4j
 
Introduction: Relational to Graphs
Introduction: Relational to GraphsIntroduction: Relational to Graphs
Introduction: Relational to GraphsNeo4j
 
Neo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and BiomarkersNeo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and BiomarkersNeo4j
 
Azure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data FlowsAzure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data FlowsThomas Sykes
 
Scaling Databricks to Run Data and ML Workloads on Millions of VMs
Scaling Databricks to Run Data and ML Workloads on Millions of VMsScaling Databricks to Run Data and ML Workloads on Millions of VMs
Scaling Databricks to Run Data and ML Workloads on Millions of VMsMatei Zaharia
 
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...Neo4j
 
Graph-Based Customer Journey Analytics with Neo4j
Graph-Based Customer Journey Analytics with Neo4jGraph-Based Customer Journey Analytics with Neo4j
Graph-Based Customer Journey Analytics with Neo4jNeo4j
 
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...Edureka!
 
Education data mining presentation
Education data mining presentationEducation data mining presentation
Education data mining presentationNishabhanot1
 

Tendances (20)

Graph database Use Cases
Graph database Use CasesGraph database Use Cases
Graph database Use Cases
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph Solution
 
Jobs at Scale: Developing and Analyzing JTBD Surveys
Jobs at Scale: Developing and Analyzing JTBD SurveysJobs at Scale: Developing and Analyzing JTBD Surveys
Jobs at Scale: Developing and Analyzing JTBD Surveys
 
Neo4j Training Introduction
Neo4j Training IntroductionNeo4j Training Introduction
Neo4j Training Introduction
 
Intermediate Cypher.pdf
Intermediate Cypher.pdfIntermediate Cypher.pdf
Intermediate Cypher.pdf
 
Building the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSXBuilding the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSX
 
Training Series - Intro to Neo4j
Training Series - Intro to Neo4jTraining Series - Intro to Neo4j
Training Series - Intro to Neo4j
 
Introducing Neo4j
Introducing Neo4jIntroducing Neo4j
Introducing Neo4j
 
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine LearnGraphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Supply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion DeckSupply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion Deck
 
Introduction: Relational to Graphs
Introduction: Relational to GraphsIntroduction: Relational to Graphs
Introduction: Relational to Graphs
 
Neo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and BiomarkersNeo4j for Discovering Drugs and Biomarkers
Neo4j for Discovering Drugs and Biomarkers
 
Azure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data FlowsAzure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data Flows
 
Power BI as a storyteller
Power BI as a storytellerPower BI as a storyteller
Power BI as a storyteller
 
Scaling Databricks to Run Data and ML Workloads on Millions of VMs
Scaling Databricks to Run Data and ML Workloads on Millions of VMsScaling Databricks to Run Data and ML Workloads on Millions of VMs
Scaling Databricks to Run Data and ML Workloads on Millions of VMs
 
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
 
Graph-Based Customer Journey Analytics with Neo4j
Graph-Based Customer Journey Analytics with Neo4jGraph-Based Customer Journey Analytics with Neo4j
Graph-Based Customer Journey Analytics with Neo4j
 
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
 
Education data mining presentation
Education data mining presentationEducation data mining presentation
Education data mining presentation
 

Similaire à Career Path Analysis with Neo4j Graph Database

Communicating Design
Communicating DesignCommunicating Design
Communicating Design悠識學院
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboardpetrknoth
 
The User Is Always Right (Usually): 4 User Research Methods That Get Results
The User Is Always Right (Usually): 4 User Research Methods That Get ResultsThe User Is Always Right (Usually): 4 User Research Methods That Get Results
The User Is Always Right (Usually): 4 User Research Methods That Get ResultsMichael Hartman
 
Slides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesSlides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesParang Saraf
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkReproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkAdaryl "Bob" Wakefield, MBA
 
Using MS Power BI to create full, interactive reports using Brightspace Data ...
Using MS Power BI to create full, interactive reports using Brightspace Data ...Using MS Power BI to create full, interactive reports using Brightspace Data ...
Using MS Power BI to create full, interactive reports using Brightspace Data ...D2L Barry
 
Content Strategy and User Experience: Combination in Practice
Content Strategy and User Experience: Combination in PracticeContent Strategy and User Experience: Combination in Practice
Content Strategy and User Experience: Combination in PracticeAmy Grace Wells
 
Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discoverymarkgrover
 
Elements Of Web Strategy
Elements Of Web StrategyElements Of Web Strategy
Elements Of Web StrategyBen Zipkin
 
Start Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptxStart Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptxEngineerSaqlain
 
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...Qi Guo
 
Data Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptxData Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptxTs. Heshalini Rajagopal
 
Career in Data Using Tableau
Career in Data Using TableauCareer in Data Using Tableau
Career in Data Using TableauJen Vaughan
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentationTao Feng
 

Similaire à Career Path Analysis with Neo4j Graph Database (20)

Communicating Design
Communicating DesignCommunicating Design
Communicating Design
 
Conrad - Separating the Wheat from the Chaff
Conrad - Separating the Wheat from the ChaffConrad - Separating the Wheat from the Chaff
Conrad - Separating the Wheat from the Chaff
 
Building Your Analytics Toolbox
Building Your Analytics ToolboxBuilding Your Analytics Toolbox
Building Your Analytics Toolbox
 
Resume Screening
Resume ScreeningResume Screening
Resume Screening
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboard
 
The User Is Always Right (Usually): 4 User Research Methods That Get Results
The User Is Always Right (Usually): 4 User Research Methods That Get ResultsThe User Is Always Right (Usually): 4 User Research Methods That Get Results
The User Is Always Right (Usually): 4 User Research Methods That Get Results
 
Slides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesSlides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data Perspectives
 
Beyond User Research
Beyond User ResearchBeyond User Research
Beyond User Research
 
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and SparkReproducible Research with R, The Tidyverse, Notebooks, and Spark
Reproducible Research with R, The Tidyverse, Notebooks, and Spark
 
Using MS Power BI to create full, interactive reports using Brightspace Data ...
Using MS Power BI to create full, interactive reports using Brightspace Data ...Using MS Power BI to create full, interactive reports using Brightspace Data ...
Using MS Power BI to create full, interactive reports using Brightspace Data ...
 
Content Strategy and User Experience: Combination in Practice
Content Strategy and User Experience: Combination in PracticeContent Strategy and User Experience: Combination in Practice
Content Strategy and User Experience: Combination in Practice
 
Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discovery
 
Big data – solution architect
Big data – solution architectBig data – solution architect
Big data – solution architect
 
Elements Of Web Strategy
Elements Of Web StrategyElements Of Web Strategy
Elements Of Web Strategy
 
Start Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptxStart Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptx
 
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
 
Data Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptxData Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptx
 
Career in Data Using Tableau
Career in Data Using TableauCareer in Data Using Tableau
Career in Data Using Tableau
 
Art
ArtArt
Art
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentation
 

Plus de Neo4j

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansNeo4j
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...Neo4j
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosNeo4j
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Neo4j
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Neo4j
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeNeo4j
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsNeo4j
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j
 

Plus de Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Dernier (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Career Path Analysis with Neo4j Graph Database

  • 1. Career Path Analysis with Neo4j Presentation by: 30 Beach St #2 Quincy, MA 02170 www.collegeminer.com www.patheer.com Matthew S. Harris Co-Founder, Technology & Infrastructure matthew.harris@collegeminer.com matthew.harris@patheer.com Twitter @harrisminer
  • 2. About Me Matthew Harris Co-Founder, Technology & Infrastructure • 10+ years database architecture and application development • Masters Degree in Business Intelligence and Data Mining • Experience in other startups • Research at Boston University Past Experience
  • 3. Introduction Build data analytics tools focused on analyzing career outcomes and paths. • Founded in 2011 • Located in Boston, MA • Original Premise - Do students get jobs related to their major? • What can I do with my major?
  • 4. New Tool Live your passion, discover your path! • Private Beta Launched in May 2013 • Public Beta Launch on November 8, 2013 in Boston, MA • www.patheer.com Goals & Focus Holistic career analysis, planning, and recommendation tool. 1. Don’t get weeded out 2. Avoid painful job searches 3. Discover and plan the path to your dream career
  • 5. Patheer Hierarchy User Data Market Data 20gb/day -city Resume Job Postings Extraction & Parsing User Activity Resumes Data Processing Data Stores Data Relay Precog Analysis Engine Application MongoDB Neo4j MS SQL
  • 6. User Capabilities Resume Analysis • Understand how parsers work • Analyze how complete your resume is according to parser Job Matches • Get jobs that match your background • Analyze why you don’t qualify for a particular job Research • Jobs • Schools • Companies • Cities Career Path Analysis • View and analyze your career path • Analyze what others did to reach your career goal • Get recommendations on how to reach your career goal
  • 7. Problem & Solution How to store and analyze this data? • Not transactional • Somewhat relational • Unstructured/Semi-structured data • Direct and indirect connections • Real-time and batch • Flexible/Partial schema • 3 instances • Relationships • Paths • Weighted Paths • Neo4jClient (C# Library) • Shout out to Tatham Oddie! • Customized data processing • Mostly depth-first analysis
  • 9. Career Path Analysis 1. View and analyze your career path 2. Analyze what others did to reach your career goal 3. Get recommendations on how to reach your career goal
  • 10. User Career Path What does my career path look like? START person1=node(*) MATCH m = person1-[p:PATH]->x WHERE p.UserID! = {userid} RETURN p order by p.Date asc; User 1 Bachelors Degree Database Analyst Database Admin
  • 11. User Career Path Add your career goal ? User 1 Database Analyst Bachelors Degree Database Architect Database Admin
  • 12. Career Path Analysis 1. View and analyze your career path 2. Analyze what others did to reach your career goal 3. Get recommendations on how to reach your career goal
  • 13. Career Pathing with Neo4j User X Bachelors Degree Database Analyst Database Admin Masters Degree Database Architect User Y Data Analyst Bachelors Degree Database Develop Database Admin Database Architect User Z Bachelors Degree Masters Degree Database Admin Database Develop Database Architect
  • 14. Career Pathing with Neo4j Database Analyst User X Database Admin User Y User Z Data Analyst Database Develop Bachelors Degree Masters Degree Database Architect
  • 15. Career Pathing with Neo4j Database Develop Database Analyst Data Analyst 1 1 1 1 Users 1 1 1 Database Admin 2 Bachelors Degree 1 1 1 1 Masters Degree 1 Database Architect
  • 16. Analysis What are the top 3 traversals for each job group? • Not an easy task! • Variable path lengths with unique traversals • Can’t simply do allPaths or x – [p:PATH*]-> y • Unique identifiers • Where x.pathnumber + 1 = y.pathnumber • A* and Dijkstra • Only least cost/cheapest path • Need most cost (most traversed) • Customized Solution/Query • • • • Batch process nightly for all end nodes Calculate sum of path weights (still testing optimal solution) Store top 3 results in Precog (backend) Application queries Precog
  • 17. Career Path Analysis 1. View and analyze your career path 2. Analyze what others did to reach your career goal 3. Get recommendations on how to reach your career goal
  • 18. User Career Path Recommendations Recommendations based off of: • Current Position –[p:path*1..?]->Career Goal • User background (from resume) • Real-time market data 1 • User relationships and connections User 1 Database Analyst Bachelors Degree Database Admin 2 3 Database Architect
  • 19. Timeline Beta Launch! • November 8, 2013 • Greater Boston Area • All Job Categories and Industries City Expansion • Spring 2014 • All Job Categories and Industries New Features (Coming Soon!) • Research College Majors • Career Path Explorer
  • 20. Career Path Explorer Teaser 1. How do I become a…? • Start at End Node and work backwards ? Database Architect Bachelors Degree ? ? Database Architect 2. What can I do with my degree? • Start at Start Node and work forward 3. Advanced Search • Select Start and End Nodes Bachelors Degree
  • 21. THANK YOU!! Presentation by: 30 Beach St #2 Quincy, MA 02170 www.collegeminer.com www.patheer.com Matthew S. Harris Co-Founder, Technology & Infrastructure matthew.harris@collegeminer.com matthew.harris@patheer.com Twitter @harrisminer