Enterprise Amnesia vs. Enterprise Intelligence. Jeff Jonas, IBM Distinguished Engineer; Chief Scientist, IBM Entity Analytics. November 18, 2010, DEFRAG 2010
Big Data – New Physics
Background
Sensemaking on Streams
Sensemaking Blunders
National Security Sensemaking Disasters
State of the Union: Enterprise Amnesia
Trend: Organizations Are Getting Dumber (chart labels: Time; Computing Power Growth; Sensemaking Algorithms; Available Observation Space; Context; Enterprise Amnesia)
Trend: Organizations Are Getting Dumber (chart labels: Time; Computing Power Growth; Sensemaking Algorithms; Available Observation Space; Context; WHY?)
Algorithms at Dead End. You Can’t Squeeze Knowledge Out of a Pixel.
No Context [email_address]
Information without context is hardly actionable.
Lack of Context – Consequences
Information in Context … and Accumulating: Top 200 Customer; Job Applicant; Identity Thief; Term No-Rehire; [email_address]
Context Accumulation Requires Feature Extraction: Video; LP#: “Not Cop”; Douglas William Barr, Sr.; Gene Barr, Donn Pinsonne, Royce Butler, Robert Lee Edwards; DOB: 11 Mar 1936; POB: Cleveland, Ohio; Add: 3755 N. Nellis Blvd
Some Pieces Just Don’t Relate … (yet)
Although … Observations Add Up: “Not Cop”; Doug Barr, Sr.; DOB: 11 Mar 1936; Add: Las Vegas
Observations Add Up: “Not Cop”; Doug Barr, Sr.; DOB: 11 Mar 1936; Add: Las Vegas
From Pixels to Pictures to Insight (diagram labels: Observations; Contextualization; Persistent Context; Relevance Detection; Consumer, i.e. an analyst, a system, the sensor itself, etc.)
The Brain!
The Puzzle Metaphor
How Context Accumulates
Overstated Population (chart labels: Observations; Unique Identities; True Population)
Counting Is Difficult. File 1: Mark Smith; 6/12/1978; 443-43-0000. File 2: Mark R Smith; (707) 433-0000; DL: 00001234.
The Rise and Fall of a Population (chart labels: Observations; Unique Identities; True Population)
Data Triangulation. File 1: Mark Smith; 6/12/1978; 443-43-0000. File 2: Mark R Smith; (707) 433-0000; DL: 00001234. New Record: Mark Randy Smith; 443-43-0000; DL: 00001234.
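A minimal sketch of the triangulation on this slide. It assumes that two records describe the same identity whenever they share an SSN or a driver's license number; that rule, the field names, and the union-find bookkeeping are illustrative assumptions, not the method used in the talk.

```python
from itertools import combinations

# Records transcribed from the slide; the field names are hypothetical.
RECORDS = [
    {"id": "file1", "name": "Mark Smith", "dob": "6/12/1978", "ssn": "443-43-0000"},
    {"id": "file2", "name": "Mark R Smith", "phone": "(707) 433-0000", "dl": "00001234"},
    {"id": "new", "name": "Mark Randy Smith", "ssn": "443-43-0000", "dl": "00001234"},
]

STRONG_KEYS = ("ssn", "dl")  # assumption: sharing either one is enough to match


def share_strong_key(a, b):
    """True if both records carry the same value for any strong key."""
    return any(k in a and k in b and a[k] == b[k] for k in STRONG_KEYS)


def count_identities(records):
    """Count distinct identities by merging records that share a strong key."""
    parent = {r["id"]: r["id"] for r in records}

    def find(x):
        # Follow parent links to the group representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(records, 2):
        if share_strong_key(a, b):
            parent[find(a["id"])] = find(b["id"])

    return len({find(r["id"]) for r in records})


print(count_identities(RECORDS[:2]))  # File 1 and File 2 alone: 2
print(count_identities(RECORDS))      # after the new record arrives: 1
```

With only File 1 and File 2 there is no shared feature, so the count is two. The new record shares an SSN with one and a license number with the other, so three observations collapse into one identity.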
Counting is Essential to Prediction
Counting: Degrees of Difficulty. Exactly Same: Bob Jones 123455 / Bob Jones 123455. Fuzzy: Bob Jones 123455 / Robert T Jonnes 000123455. Incompatible Features: Bob Jones 123455 / [email_address]. Deceit: Bob Jones 123455 / Ken Wells 550119.
And Deceit Revealed (chart labels: Observations; Unique Identities; True Population; 6 Liars Busted Here!)
Demonstration
Is This Voter Deceased? VOTER: George F Balston; YOB: 1951; D/L: 4801; 13070 SW Karen Blvd Apt 7, Beaverton, OR 97005; Last voted: 2008. DECEASED PERSON: George Balston; YOB: 1951; SSN: 5598; DOD: 1995. Under voter-matching best practices, a match on name and year of birth alone is insufficient proof that two records describe the same person: many different people in the U.S. share a name and year of birth. Human review is required. Unfortunately, there are thousands of cases just like this, and state election offices don’t have the staff (or budget) to review such volumes manually.
Now Consider This Tertiary DMV Record. VOTER: George F Balston; YOB: 1951; D/L: 4801; 13070 SW Karen Blvd Apt 7, Beaverton, OR 97005; Last voted: 2008. DECEASED PERSON: George Balston; YOB: 1951; SSN: 5598; DOD: 1995. DMV: George F Balston; YOB: 1951; SSN: 5598; D/L: 4801; 3043 SW Clementine Blvd Apt 210, Beaverton, OR 97005. The DMV record contains enough features to match the voter record (name, year of birth and driver’s license), the deceased person record (name, year of birth and SSN), or both. For the sake of argument, let’s say it matches the voter best.
Is This Voter/DMV Person Deceased? VOTER: George F Balston; YOB: 1951; D/L: 4801; 13070 SW Karen Blvd Apt 7, Beaverton, OR 97005; Last voted: 2008. DMV: George F Balston; YOB: 1951; SSN: 5598; D/L: 4801; 3043 SW Clementine Blvd Apt 210, Beaverton, OR 97005. DECEASED PERSON: George Balston; YOB: 1951; SSN: 5598; DOD: 1995. The combined voter/DMV record now shares a name, year of birth and SSN with the deceased person record. Under voter-matching best practices, this evidence would be sufficient to determine that this voter is in fact deceased. This case no longer needs human review.
Context Accumulates! VOTER: George F Balston; YOB: 1951; D/L: 4801; 13070 SW Karen Blvd Apt 7, Beaverton, OR 97005; Last voted: 2008. DMV: George F Balston; YOB: 1951; SSN: 5598; D/L: 4801; 3043 SW Clementine Blvd Apt 210, Beaverton, OR 97005. DECEASED PERSON: George Balston; YOB: 1951; SSN: 5598; DOD: 1995. As features accumulate, it becomes easier to match future identity records. As events and transactions accumulate, detection of relevance improves. Here we can see that George, who died in 1995, voted in 2008.
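The demonstration can be sketched in a few lines. The matching rule below (same first and last name, same year of birth, plus one shared identifier) is an assumption distilled from the slide text, and the function and field names are hypothetical; a real matcher would be far more careful.

```python
def name_key(name):
    """Assumed name rule: compare first and last tokens, ignoring middle initials."""
    parts = name.lower().split()
    return (parts[0], parts[-1])


def is_match(entity, record):
    """Assumed rule: name + year of birth + at least one shared identifier."""
    if name_key(entity["name"]) != name_key(record["name"]):
        return False
    if entity["yob"] != record["yob"]:
        return False
    return any(
        entity.get(k) is not None and entity.get(k) == record.get(k)
        for k in ("ssn", "dl")
    )


def absorb(entity, record):
    """Accumulate context: keep every feature seen so far."""
    merged = dict(entity)
    for key, value in record.items():
        merged.setdefault(key, value)
    return merged


# Values transcribed from the slides.
voter = {"name": "George F Balston", "yob": 1951, "dl": "4801", "last_voted": 2008}
deceased = {"name": "George Balston", "yob": 1951, "ssn": "5598", "dod": 1995}
dmv = {"name": "George F Balston", "yob": 1951, "ssn": "5598", "dl": "4801"}

print(is_match(voter, deceased))  # False: only name and year of birth agree
if is_match(voter, dmv):          # driver's license agrees
    voter = absorb(voter, dmv)    # the voter entity now carries an SSN
print(is_match(voter, deceased))  # True: the SSN now agrees as well
if is_match(voter, deceased):
    voter = absorb(voter, deceased)
print(voter["last_voted"] > voter["dod"])  # True: voted after the date of death
```

The first comparison fails and the second succeeds with no change to the rule; the only thing that changed is how much the voter entity knows.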
Major Moving Parts (diagram labels): Persistent Context; Context Analysis; Relevance Detection; Feature Extraction & Classification; Publish; Notice; Respond. CONSUMERS: Operational Systems, Business Intelligence, Data Marts, Data Mining, Pattern Discovery, Predictive Modeling, Case Management, Visualization, Etc. Answers to questions. Observations: Structured, Unstructured, Audio/Video, Geospatial, Biometrics, Etc. Questions: Search, Discovery, Context Requests, Etc.
1st principle: If you do not process every new piece of key data (perception) first like a query … then you will not know if it matters … until someone asks.
“The Data is a Query” Beats “Boil the Ocean” (diagram labels: Marketing Department, Prospect Database; Human Resources Department, Employee Database; Corporate Security Department, Investigations Database; Batch Analytics)
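One way to picture the first principle: every arriving observation is looked up against what is already known before it is stored. The class below is a toy sketch under that assumption; `PersistentContext`, `observe`, and the sample records are all invented for illustration.

```python
class PersistentContext:
    """Toy context store: an index from (feature, value) to the sources that reported it."""

    def __init__(self):
        self._index = {}

    def observe(self, source, features):
        """Use the new observation as a query first, then persist it."""
        hits = []
        for item in features.items():
            for other_source in self._index.get(item, []):
                if other_source != source:
                    hits.append((other_source,) + item)
        for item in features.items():
            self._index.setdefault(item, []).append(source)
        return hits


# Invented sample data, loosely following the three databases on the slide.
ctx = PersistentContext()
ctx.observe("investigations", {"name": "pat doe", "phone": "555-0100"})
ctx.observe("prospects", {"name": "lee roe", "phone": "555-0199"})
alerts = ctx.observe("employees", {"name": "sam poe", "phone": "555-0100"})
print(alerts)  # [('investigations', 'phone', '555-0100')]
```

The connection between the new employee and the open investigation surfaces at the moment the employee record arrives, without anyone running a batch comparison of the three databases.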
2nd principle: Treat queries like data to avoid having to ask every question every day.
New Think: Data and Query Equality Traditional Intelligent Systems Queries find queries!
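A sketch of the second principle, assuming it means that a question is stored alongside the data so that later observations (and later questions) can find it. The class and method names are hypothetical, and reading "Queries find queries!" as "a new asker learns who asked before" is an interpretation of the slide.

```python
class ContextWithStandingQueries:
    """Toy store in which questions persist just like observations."""

    def __init__(self):
        self._data = {}     # (key, value) -> sources that reported it
        self._queries = {}  # (key, value) -> people who asked about it

    def ask(self, who, key, value):
        """Return (known sources, earlier askers), then remember the question."""
        earlier = list(self._queries.get((key, value), []))
        self._queries.setdefault((key, value), []).append(who)
        return list(self._data.get((key, value), [])), earlier

    def observe(self, source, key, value):
        """Store the observation and return who should be notified."""
        self._data.setdefault((key, value), []).append(source)
        return list(self._queries.get((key, value), []))


ctx = ContextWithStandingQueries()
print(ctx.ask("analyst_a", "phone", "555-0100"))      # ([], [])
print(ctx.ask("analyst_b", "phone", "555-0100"))      # ([], ['analyst_a'])
print(ctx.observe("employees", "phone", "555-0100"))  # ['analyst_a', 'analyst_b']
```

Neither analyst has to repeat the question tomorrow: the stored question is matched when the relevant observation finally arrives.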
3rd principle: Enterprise awareness is computationally most efficient when performed at the moment the observation is perceived.
Big Data – New Physics
Context Accumulation + Big Data = New Physics
“G2” My Skunk Works Effort
My G2 Effort
“Key Features” Enable Expert Counting
Consider Lying Identical Twins: two passports, each reading #123, Sue, 3/3/84, Uberstan, Exp 2011; Fingerprint; DNA; Most Trusted Authority: “Same person – trust me.”
Space & Time Enables Absolute Disambiguation: Name, Address, Date of Birth, Phone, Passport, Nationality, Biometric, Etc. (When, Where); Make, Model, Year, License Plate No., VIN, Owner, Etc. (When, Where); Device ID, Make, Model, Firmware Vers., Asset ID, Etc. (When, Where)
Life Arcs Are Also Telling: Bill Smith, 4/13/67, Salem, OR; Bill Smith, 4/13/67, Seattle, WA. Address History: Tampa, FL 2008-2008; Biloxi, MS 2005-2008; NY, NY 1996-2005; Tampa, FL 1984-1996. Address History: San Diego, CA 2005-2009; San Fran, CA 2005-2005; Phoenix, AZ 1990-2005; San Jose, CA 1982-1990.
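A rough sketch of how two life arcs can be compared, assuming that one person cannot live in two different cities across the same multi-year span. The function name and the strict-overlap rule are assumptions; the address histories are transcribed from the slide in the order listed.

```python
def conflicting_spans(history_a, history_b):
    """Return (city_a, city_b, start, end) where the two histories place someone
    in different cities and the overlap is more than a single boundary year."""
    conflicts = []
    for city_a, start_a, end_a in history_a:
        for city_b, start_b, end_b in history_b:
            start, end = max(start_a, start_b), min(end_a, end_b)
            # Strict inequality: one shared year could simply be the year of a move.
            if city_a != city_b and start < end:
                conflicts.append((city_a, city_b, start, end))
    return conflicts


first_history = [
    ("Tampa, FL", 2008, 2008),
    ("Biloxi, MS", 2005, 2008),
    ("NY, NY", 1996, 2005),
    ("Tampa, FL", 1984, 1996),
]
second_history = [
    ("San Diego, CA", 2005, 2009),
    ("San Fran, CA", 2005, 2005),
    ("Phoenix, AZ", 1990, 2005),
    ("San Jose, CA", 1982, 1990),
]

for conflict in conflicting_spans(first_history, second_history):
    print(conflict)
```

Under these assumptions the two histories conflict over four multi-year spans, which suggests that the two Bill Smiths are different people despite the identical name and date of birth.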
OMG
Space-Time-Travel
Consequences
Like Magic …
Closing Thoughts
To Beat the Competition … Human Capital Tools Data First Fastest Sensemaking
Key Points
“The data will find the data … and the relevance will find you.”
Data Finding Data: “Jump to the right 1 foot!” Observations of migratory birds; Data about where you are right now
When this technology serves … you and your doctor … LOVE! … the police looking at you … HATE!
Wish This On The Enemy (chart labels: Time; Computing Power Growth; Sensemaking Algorithms; Available Observation Space; Context; Enterprise Amnesia)
Enterprise Intelligence: The Way Forward (chart labels: Time; Computing Power Growth; Sensemaking Algorithms; Available Observation Space; Context; Context Accumulation)
Better Prediction to Discard (chart labels: Time; Computing Power Growth; Sensemaking Algorithms; Available Observation Space; Context; Context Accumulation; New/Useful Information; Data Reduction)
Related Blog Posts
Blogging At: www.JeffJonas.TypePad.com (Information Management, Privacy, National Security and Triathlons)
