SlideShare une entreprise Scribd logo
1  sur  12
Télécharger pour lire hors ligne
Semantic-assisted Analysis and 
Search in Customer Specifications 
Martin Voigt, Daniel Hladky 
September 2014 
1 
ONTOS LINKED DATA INFORMATION WORKBENCH 
Extraction & Analysis 
Indexing 
Information & 
Knowledge Management 
Search 
Engineer 
Storage 
Sales 
Portal 
Multilingual 
Specifications
I speakabout… 
The Problem, 
Our Solution, 
Insights & Further Work. 
2
The Problem 
AviComp Controls GmbH 
 leading engineering contractor 
for rotating machinery controls 
3 
Customers 
Engineers 
Sales 
> 100k Technical 
Specifications 
http://www.avicomp.com/capabilities/turbo-compressor-controls.html
The Problem 
Analysis: 1) task, 2) current solution, 3) ideas 
Problems 
Multiple, inefficient tools 
Heterogeneity 
Knowledge management & transfer 
4 
http://answerhub.com/article/ the-cost-of-knowledge-loss/
Our Solution 
5 
ONTOS LINKED DATA INFORMATION WORKBENCH 
Extraction & Analysis 
Indexing 
Information & 
Knowledge Management 
Search 
Engineer 
Storage 
Sales 
Portal 
Multilingual 
Specifications 
http://www.ontos.com/products/ontosldiw/
Our Solution 
Extraction& Analysis 
Homogenization: PDF conversion (Apache POI) & OCR (CuneiForm) 
Text extraction (Apache Tika) 
Language detection (language-detection API) 
Text preparation, e.g., remove headers & footers 
SKOS-based concept identification 
6 
Lorem ipsum dolor sit amet, consetetursadipscing 
elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam 
erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet 
clitakasdgubergren, no sea takimata 
sanctusestLorem ipsum dolor sit 
elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam 
erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet 
clitakasdgubergren, no sea takimata 
elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam 
erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet 
clitakasdgubergren, no sea takimata ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
Our Solution 
Storage via OntoQUAD 
 Triple and/or QuadStore, SPARQL 1.1, … 
Indexing 
 Full text search, result grouping, faceted browsing, 
SKOS-based label expansion, … 
 Apache Solr with lucene-skos plugin 
(https://github.com/behas/lucene-SKOS) 
7 
ONTOS LINKED DATA INFORMATION WORKBENCH 
Extraction & Analysis 
Indexing 
Information & 
Knowledge Management 
Search 
Engineer 
Storage 
Sales 
Portal 
Multilingual 
Specifications
Our Solution 
Knowledge Management 
via OntoDixbut SKOS-only 
8 
ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
Our Solution 
Search 
via AJAX Solr(https://github.com/evolvingweb/ajax-solr) 
9 
ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
Insights & Further Work 
Iterative development with early customer testing lowers usage barrier 
Lessons learned 
Development of a knowledge base 
Faceted search user interface 
Faceted search on RDF 
Multilingual disambiguationmechanisms 
10
Q&A 
Martin Voigt 
Ontos AG / GmbH 
Nidau(CH) / Leipzig (DE) 
T:+49 341 21559-10 
M:+49 178 40 222 58 
E: martin.voigt@ontos.com 
11
About Ontos 
12 
12 
DoW – CTI Project 
Ontos Group 
Key Facts 
- Established 2001 
- 15+ employees 
- Share in Eventos RU 
(30 people) 
- 5± Mio CHF turnover 
Industry 
- Media/News 
- Law Enforcement 
- Government 
- (Russia)

Contenu connexe

Similaire à Semantic-assisted Analysis and Search in Customer Specifications

An AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementAn AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
Databricks
 
Berlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowlingBerlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowling
Jim Dowling
 
The power of faceted search in alfresco
The power of faceted search in alfrescoThe power of faceted search in alfresco
The power of faceted search in alfresco
XeniT Solutions nv
 
SharePoint 2013 Dev Features
SharePoint 2013 Dev FeaturesSharePoint 2013 Dev Features
SharePoint 2013 Dev Features
Ricardo Wilkins
 
OFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAININGOFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAINING
satish_kumar646
 

Similaire à Semantic-assisted Analysis and Search in Customer Specifications (20)

Microsoft Power BI and Cortana Analytics user group meetings with Alteryx
Microsoft Power BI and Cortana Analytics user group meetings with AlteryxMicrosoft Power BI and Cortana Analytics user group meetings with Alteryx
Microsoft Power BI and Cortana Analytics user group meetings with Alteryx
 
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botifyapidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
 
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Vertex AI - Unified ML Platform for the entire AI workflow on Google CloudVertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
 
PoolParty Overview
PoolParty OverviewPoolParty Overview
PoolParty Overview
 
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementAn AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
 
ActiveWarehouse/ETL - BI & DW for Ruby/Rails
ActiveWarehouse/ETL - BI & DW for Ruby/RailsActiveWarehouse/ETL - BI & DW for Ruby/Rails
ActiveWarehouse/ETL - BI & DW for Ruby/Rails
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressions
 
Webinar: Open Source Business Intelligence Intro
Webinar: Open Source Business Intelligence IntroWebinar: Open Source Business Intelligence Intro
Webinar: Open Source Business Intelligence Intro
 
Berlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowlingBerlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowling
 
The power of faceted search in alfresco
The power of faceted search in alfrescoThe power of faceted search in alfresco
The power of faceted search in alfresco
 
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
 
Sharepoint 2013-applied architecture from the field v3 (public)
Sharepoint 2013-applied architecture from the field v3 (public)Sharepoint 2013-applied architecture from the field v3 (public)
Sharepoint 2013-applied architecture from the field v3 (public)
 
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
 
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
 
SharePoint 2013 Dev Features
SharePoint 2013 Dev FeaturesSharePoint 2013 Dev Features
SharePoint 2013 Dev Features
 
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonSharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
 
OFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAININGOFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAINING
 
Discussion for Anomaly & Prediction Engine
Discussion for Anomaly & Prediction EngineDiscussion for Anomaly & Prediction Engine
Discussion for Anomaly & Prediction Engine
 
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data PlatformsData Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
 

Dernier

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
 

Dernier (20)

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
ManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide Deck
 
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxBUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
LEVEL 5 - SESSION 1 2023 (1).pptx - PDF 123456
LEVEL 5   - SESSION 1 2023 (1).pptx - PDF 123456LEVEL 5   - SESSION 1 2023 (1).pptx - PDF 123456
LEVEL 5 - SESSION 1 2023 (1).pptx - PDF 123456
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 

Semantic-assisted Analysis and Search in Customer Specifications

  • 1. Semantic-assisted Analysis and Search in Customer Specifications Martin Voigt, Daniel Hladky September 2014 1 ONTOS LINKED DATA INFORMATION WORKBENCH Extraction & Analysis Indexing Information & Knowledge Management Search Engineer Storage Sales Portal Multilingual Specifications
  • 2. I speakabout… The Problem, Our Solution, Insights & Further Work. 2
  • 3. The Problem AviComp Controls GmbH  leading engineering contractor for rotating machinery controls 3 Customers Engineers Sales > 100k Technical Specifications http://www.avicomp.com/capabilities/turbo-compressor-controls.html
  • 4. The Problem Analysis: 1) task, 2) current solution, 3) ideas Problems Multiple, inefficient tools Heterogeneity Knowledge management & transfer 4 http://answerhub.com/article/ the-cost-of-knowledge-loss/
  • 5. Our Solution 5 ONTOS LINKED DATA INFORMATION WORKBENCH Extraction & Analysis Indexing Information & Knowledge Management Search Engineer Storage Sales Portal Multilingual Specifications http://www.ontos.com/products/ontosldiw/
  • 6. Our Solution Extraction& Analysis Homogenization: PDF conversion (Apache POI) & OCR (CuneiForm) Text extraction (Apache Tika) Language detection (language-detection API) Text preparation, e.g., remove headers & footers SKOS-based concept identification 6 Lorem ipsum dolor sit amet, consetetursadipscing elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet clitakasdgubergren, no sea takimata sanctusestLorem ipsum dolor sit elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet clitakasdgubergren, no sea takimata elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet clitakasdgubergren, no sea takimata ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
  • 7. Our Solution Storage via OntoQUAD  Triple and/or QuadStore, SPARQL 1.1, … Indexing  Full text search, result grouping, faceted browsing, SKOS-based label expansion, …  Apache Solr with lucene-skos plugin (https://github.com/behas/lucene-SKOS) 7 ONTOS LINKED DATA INFORMATION WORKBENCH Extraction & Analysis Indexing Information & Knowledge Management Search Engineer Storage Sales Portal Multilingual Specifications
  • 8. Our Solution Knowledge Management via OntoDixbut SKOS-only 8 ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
  • 9. Our Solution Search via AJAX Solr(https://github.com/evolvingweb/ajax-solr) 9 ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
  • 10. Insights & Further Work Iterative development with early customer testing lowers usage barrier Lessons learned Development of a knowledge base Faceted search user interface Faceted search on RDF Multilingual disambiguationmechanisms 10
  • 11. Q&A Martin Voigt Ontos AG / GmbH Nidau(CH) / Leipzig (DE) T:+49 341 21559-10 M:+49 178 40 222 58 E: martin.voigt@ontos.com 11
  • 12. About Ontos 12 12 DoW – CTI Project Ontos Group Key Facts - Established 2001 - 15+ employees - Share in Eventos RU (30 people) - 5± Mio CHF turnover Industry - Media/News - Law Enforcement - Government - (Russia)