SlideShare une entreprise Scribd logo
1  sur  46
Washington DC, November 2011
George Roth, Adonis Damian
www.recognos.com
 A document management system (DMS) is a computer system (or
  set of computer programs) used to track and store electronic
  documents and/or images of paper documents. It is usually also
  capable of keeping track of the different versions created by different
  users (history tracking). The term has some overlap with the concepts
  of content management systems. It is often viewed as a component
  of enterprise content management (ECM) systems and related to
  digital asset management, document imaging, workflow systems and
  records management systems.
 Make the formatted equivalent with non-formatted !




November 2011
CLASSICAL           NEW
   Metadata           Compliance
   Integration        Accessibility
   Capture            Interactivity
   Indexing           Augmentation
   Storage            Translation
   Retrieval          Linking – Relationships
   Distribution       Sentiment Analysis
   Security           New Search (Semantic Tagging, Deep
   Workflow            Search, NL Questions)
   Collaboration
   Versioning
   Search
   Publishing
   …




November 2011
   Volume
   Labor extensive
   The “research project” – 40% – 60% data
    gathering
   Metadata independent of content
   Shallow Search
   Hard to understand by non-experts


November 2011
   NLP Natural Language Processing –
    understand the meaning of documents
    (statistic, machine learning, hybrid, graph
    based)
   Semantic Search – tagging
   Data Integration
   Sentiment Analysis
   Linked Open Data – Linked Data
   Inference - Reasoning

November 2011
   Inside – Controlled Environment - TRUST
   Inside – Security issues
   Same techniques as outside the enterprise
   Integrates non-formatted with formatted
    data
   Easy to measure the effects - ROI
   Add on to the existing KM models
   Emerging area – Semantic technologies
    started on the www
November 2011
New features will become commodity in 2-3 years

   Compliance
   Data Extraction, Comparison, Change
    Analysis
   Interactivity
   Augmentation
   Translation
   Linking – Relationships
   Sentiment Analysis
   New Search (Semantic Tagging, Deep Search,
    NL Questions)
November 2011
   Microsoft: Powerset (Bing), Fast Search, Jinni
   Google: Freebase, Needlebase
   Apple: SIRI
   Etc…




November 2011
 Embedded Compliance Rules




November 2011
 Example there is a rule: – email –
Rule 0134C: “Not allowed to mention a percentage as a
  profit promise investing with the firm”
 In an email:
“ Dear John, Our company has an amazing method to
  invest, so that you will make at least 10% profit in 3
  months !!!! “
 The email was stopped – sent to Compliance with the
  message: “Violation of the Rule 0134C”



November 2011
   MFIP data extraction
   Link to the original document




November 2011
 Data Extraction, Comparison,
    Change Analysis



November 2011
November 2011
November 2011
   Create Alarm when Trading Policy Changes
   Create Alarm when Commissions Change
    (fields)
   Create Alarms when member of the Board
    Changes




November 2011
 Interactivity




November 2011
November 2011
 Augmentation




November 2011
November 2011
 Automated Translation




November 2011
   Google Translate
     Great for simple translation – emails, non
        technical documents

   Language Weaver
     Specialized translation through machine learning
     Train the system per domains



November 2011
 Sentiment Analysis




November 2011
   Media Sentry
   Open Amplify, Expert Systems, Lymbix
   NLP and machine learning




November 2011
November 2011
 Search




November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
 Complex App Samples




November 2011
November 2011
WWW

                 Google        Meltwaters                                            Forums /
                                               Twitter           Facebook                                Websites
                 Alerts          Alerts                                               Blogs




                           Exchange
                              Server



                                                         External Data Pull


                          Exchange                 Twitter          Facebook              80legs                 Diffbot
                            Adapter               Adapter             Adapter            Adapter                Adapter




                Internal Message Storage

                                        File
                                      Server


                                                                      Natural Language Processing


                                                                                                     Uploaded
                                                                                ESSEX               Taxonomy




                Web User Interface
                                                                                Data Storage


                                                                                   MS SQL Server




November 2011
   Amdocs AIDA (AMDOCS Intelligent Decision Automation)




November 2011
November 2011
Display Linked Data   Ask a question –   Entity Lookup
                       semantic search

November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
   Interactive - Exists
   Search – Semantic Search, Q&A
   Semantic Tagging – Summarization
   LOD with domains
   Linked : People, Companies, Locations,
    Specific Terms
   Example a travel book


November 2011
The following technologies were used:
- iQser – GIN
- Clark & Parsia – Spanner, StarDog
- Expert System – NLP
- GATE
- Smart Logic – Enterprise Query Platform – Fast Search – Microsoft
  Sharepoint 11
- Revelytix
- Cognition
- Franz Systems
- DiffBot
- Ontotext




November 2011
George Roth
President and CEO Recognos Inc.
San Francisco
www.recognos.com
groth@recognos.com
Drew Warren
CEO Recognos Financial
New York
dwarren@recognosfinancial.com
www.recognosfinancial.com



November 2011

Contenu connexe

Similaire à Semantic Technology in Document Management

Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Shakir Majeed Khan
 
SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011Shakir Majeed Khan
 
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagSPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagKnowledge Management Associates, LLC
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebNuxeo
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverChris McNulty
 
SharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockSharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockJeff Shuey
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101Chris McNulty
 
SharePoint & ERM
SharePoint & ERMSharePoint & ERM
SharePoint & ERMNick Inglis
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Artificial Intelligence Institute at UofSC
 
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonSharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonJoel Oleson
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebAmit Sheth
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebAmit Sheth
 
SharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsSharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsShakir Majeed Khan
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationKim Negaard
 
Productie Sharepoint Presentatie
Productie Sharepoint PresentatieProductie Sharepoint Presentatie
Productie Sharepoint PresentatieJan van der Kolk
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupEPC Group
 

Similaire à Semantic Technology in Document Management (20)

Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management
 
SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011
 
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagSPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
 
SharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockSharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David Strock
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101
 
SharePoint & ERM
SharePoint & ERMSharePoint & ERM
SharePoint & ERM
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonSharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Sp tech con-admin101
Sp tech con-admin101Sp tech con-admin101
Sp tech con-admin101
 
SharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsSharePoint 2010- Changing business needs
SharePoint 2010- Changing business needs
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar Presentation
 
Asap session 1
Asap session 1Asap session 1
Asap session 1
 
Productie Sharepoint Presentatie
Productie Sharepoint PresentatieProductie Sharepoint Presentatie
Productie Sharepoint Presentatie
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
 

Dernier

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Dernier (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Semantic Technology in Document Management

  • 1. Washington DC, November 2011 George Roth, Adonis Damian www.recognos.com
  • 2.  A document management system (DMS) is a computer system (or set of computer programs) used to track and store electronic documents and/or images of paper documents. It is usually also capable of keeping track of the different versions created by different users (history tracking). The term has some overlap with the concepts of content management systems. It is often viewed as a component of enterprise content management (ECM) systems and related to digital asset management, document imaging, workflow systems and records management systems.  Make the formatted equivalent with non-formatted ! November 2011
  • 3. CLASSICAL NEW  Metadata  Compliance  Integration  Accessibility  Capture  Interactivity  Indexing  Augmentation  Storage  Translation  Retrieval  Linking – Relationships  Distribution  Sentiment Analysis  Security  New Search (Semantic Tagging, Deep  Workflow Search, NL Questions)  Collaboration  Versioning  Search  Publishing  … November 2011
  • 4. Volume  Labor extensive  The “research project” – 40% – 60% data gathering  Metadata independent of content  Shallow Search  Hard to understand by non-experts November 2011
  • 5. NLP Natural Language Processing – understand the meaning of documents (statistic, machine learning, hybrid, graph based)  Semantic Search – tagging  Data Integration  Sentiment Analysis  Linked Open Data – Linked Data  Inference - Reasoning November 2011
  • 6. Inside – Controlled Environment - TRUST  Inside – Security issues  Same techniques as outside the enterprise  Integrates non-formatted with formatted data  Easy to measure the effects - ROI  Add on to the existing KM models  Emerging area – Semantic technologies started on the www November 2011
  • 7. New features will become commodity in 2-3 years  Compliance  Data Extraction, Comparison, Change Analysis  Interactivity  Augmentation  Translation  Linking – Relationships  Sentiment Analysis  New Search (Semantic Tagging, Deep Search, NL Questions) November 2011
  • 8. Microsoft: Powerset (Bing), Fast Search, Jinni  Google: Freebase, Needlebase  Apple: SIRI  Etc… November 2011
  • 9.  Embedded Compliance Rules November 2011
  • 10.  Example there is a rule: – email – Rule 0134C: “Not allowed to mention a percentage as a profit promise investing with the firm”  In an email: “ Dear John, Our company has an amazing method to invest, so that you will make at least 10% profit in 3 months !!!! “ The email was stopped – sent to Compliance with the message: “Violation of the Rule 0134C” November 2011
  • 11. MFIP data extraction  Link to the original document November 2011
  • 12.  Data Extraction, Comparison, Change Analysis November 2011
  • 15. Create Alarm when Trading Policy Changes  Create Alarm when Commissions Change (fields)  Create Alarms when member of the Board Changes November 2011
  • 21. Google Translate  Great for simple translation – emails, non technical documents  Language Weaver  Specialized translation through machine learning  Train the system per domains November 2011
  • 23. Media Sentry  Open Amplify, Expert Systems, Lymbix  NLP and machine learning November 2011
  • 32.  Complex App Samples November 2011
  • 34. WWW Google Meltwaters Forums / Twitter Facebook Websites Alerts Alerts Blogs Exchange Server External Data Pull Exchange Twitter Facebook 80legs Diffbot Adapter Adapter Adapter Adapter Adapter Internal Message Storage File Server Natural Language Processing Uploaded ESSEX Taxonomy Web User Interface Data Storage MS SQL Server November 2011
  • 35. Amdocs AIDA (AMDOCS Intelligent Decision Automation) November 2011
  • 37. Display Linked Data Ask a question – Entity Lookup semantic search November 2011
  • 44. Interactive - Exists  Search – Semantic Search, Q&A  Semantic Tagging – Summarization  LOD with domains  Linked : People, Companies, Locations, Specific Terms  Example a travel book November 2011
  • 45. The following technologies were used: - iQser – GIN - Clark & Parsia – Spanner, StarDog - Expert System – NLP - GATE - Smart Logic – Enterprise Query Platform – Fast Search – Microsoft Sharepoint 11 - Revelytix - Cognition - Franz Systems - DiffBot - Ontotext November 2011
  • 46. George Roth President and CEO Recognos Inc. San Francisco www.recognos.com groth@recognos.com Drew Warren CEO Recognos Financial New York dwarren@recognosfinancial.com www.recognosfinancial.com November 2011