SlideShare une entreprise Scribd logo
1  sur  70
Enterprise Amnesia  vs. Enterprise Intelligence Jeff Jonas,  IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics [email_address] November 18, 2010 DEFRAG 2010
Big Data – New Physics ,[object Object],[object Object],[object Object],[object Object],[object Object]
Background ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Sensemaking on Streams ,[object Object],[object Object],[object Object],[object Object]
Sensemaking Blunders ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
National Security Sensemaking Disasters ,[object Object],[object Object],[object Object],[object Object]
State of the Union: Enterprise Amnesia
[object Object],[object Object]
[object Object],[object Object]
Trend: Organizations Are Getting Dumber Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Enterprise Amnesia
Trend: Organizations Are Getting Dumber Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context WHY?
Algorithms at Dead End.  You Can’t  Squeeze Knowledge  Out of a Pixel.
No Context [email_address]
Information without context is hardly actionable.
[object Object],[object Object]
Lack of Context – Consequences ,[object Object],[object Object],[object Object],[object Object]
Information in Context … and Accumulating  Top 200 Customer Job  Applicant Identity Thief  Term No-Rehire [email_address]
Context Accumulation Requires Feature Extraction Video LP#: “Not Cop” Douglas William Barr, Sr. Gene barr, Donn Pinsonne Royce Butler, Robert Lee Edwards DOB: 11 Mar 1936 POB: Cleveland, Ohio Add: 3755 N. Nellis Blvd
Some Pieces Just Don’t Relate … (yet)
Although … Observations Add Up “ Not Cop” Doug Barr, Sr. DOB: 11 Mar 1936 Add: Las Vegas
Observations Add Up “ Not Cop” Doug Barr, Sr. DOB: 11 Mar 1936 Add: Las Vegas
From Pixels to Pictures to Insight  Observations Contextualization Persistent Context Relevance Detection Consumer (An analyst, a system,  the sensor itself, etc.)
[object Object],[object Object],[object Object],The Brain!
The Puzzle Metaphor ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
How Context Accumulates ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Overstated Population Observations Unique Identities True Population
Counting Is Difficult Mark Smith 6/12/1978 443-43-0000 Mark R Smith (707) 433-0000 DL: 00001234 File 1 File 2
The Rise and Fall of a Population Observations Unique Identities True Population
Data Triangulation  Mark Smith 6/12/1978 443-43-0000 Mark R Smith (707) 433-0000 DL: 00001234 File 1 File 2 Mark Randy Smith 443-43-0000 DL: 00001234 New Record
Counting is Essential to Prediction ,[object Object],[object Object],[object Object],[object Object]
Counting: Degrees of Difficulty Exactly  Same Fuzzy Incompatible Features Deceit Bob Jones 123455 Bob Jones 123455 Bob Jones 123455 Robert T Jonnes 000123455 Bob Jones 123455 [email_address] Bob Jones 123455 Ken Wells 550119
And Deceit Revealed Observations Unique Identities True Population 6 Liars  Busted Here!
Demonstration
VOTER George F Balston YOB: 1951  D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DECEASED PERSON George Balston YOB: 1951  SSN: 5598 DOD: 1995 Is This Voter Deceased? When it comes to best practices in voter matching, if only a name and year of birth match, this is insufficient proof of a match.  Many different people in the U.S. share a name and year of birth. Human review is required. Unfortunately, there are thousands and thousands of cases just like this and state election offices don’t have the staff (or budget) to manually review such volumes.
VOTER George F Balston YOB: 1951  D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DECEASED PERSON George Balston YOB: 1951  SSN: 5598 DOD: 1995 Now Consider This Tertiary DMV Record DMV George F Balston YOB: 1951  SSN: 5598  D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 The DMV record contains enough features to match both the voter (name, year of birth and driver’s license) and/or the deceased persons record (name, year of birth and SSN).  For the sake of argument, let’s say it matches the voter best.
VOTER George F Balston YOB: 1951  D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DMV George F Balston YOB: 1951  SSN: 5598  D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 DECEASED PERSON George Balston YOB: 1951  SSN: 5598 DOD: 1995 Is This Voter/DMV Person Deceased? The voter/DMV record now shares a name, year of birth and SSN with the deceased person record.  In voter matching best practices, this evidence  would be  sufficient to make a determination that this voter is in fact deceased.  This case no longer needs human review.
VOTER George F Balston YOB: 1951  D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DMV George F Balston YOB: 1951  SSN: 5598  D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 DECEASED PERSON George Balston YOB: 1951  SSN: 5598 DOD: 1995 Context Accumulates! As features accumulate it becomes easier to match future identity records. As events and transactions accumulate – detection of relevance improves.  Here we can see George  who died in 1995 voted in 2008.
Major Moving Parts Persistent Context Context Analysis Relevance  Detection Feature Extraction & Classification Publish Notice Respond CONSUMERS Operational Systems Business Intelligence Data Marts Data Mining Pattern Discovery Predictive Modeling Case Management Visualization Etc. Answers to questions Observations Structured Unstructured Audio/Video Geospatial Biometrics Etc. Questions Search, Discovery, Context Requests Etc.
1 st  principle If you do not process every new piece of key data (perception) first like a query … then you will not know if it matters … until someone asks.
“The Data is a Query”  Beats  “Boil the Ocean” Marketing Department Prospect Database Employee Database Human  Resources  Department Corporate Security  Department Investigations Database Batch Analytics
2 nd  principle  Treat queries like data to avoid having to ask every question every day.
New Think: Data and Query Equality ,[object Object],[object Object],[object Object],Traditional Intelligent Systems Queries find queries!
3rd principle  Enterprise awareness is computationally most efficient when performed at the moment the observation is perceived.
Big Data – New Physics
Context Accumulation + Big Data = New Physics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
“ G2” My Skunk Works Effort
My G2 Effort ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
“Key Features” Enable Expert Counting ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Consider Lying Identical Twins #123 Sue 3/3/84 Uberstan Exp 2011 PASSPORT #123 Sue 3/3/84 Uberstan Exp 2011 PASSPORT Fingerprint DNA Most Trusted Authority “ Same person –  trust me.” Most Trusted Authority
[object Object],[object Object]
Space & Time Enables  Absolute  Disambiguation ,[object Object],Name Make Device ID Address Model Make Date of Birth Year Model Phone License Plate No. Firmware Vers. Passport VIN Asset ID Nationality Owner Etc. Biometric Etc. Etc. When When When Where Where Where
Life Arcs Are Also Telling Bill Smith 4/13/67 Salem, OR Bill Smith 4/13/67 Seattle, WA Address History Tampa, FL 2008-2008 Biloxi, MS 2005-2008 NY, NY 1996-2005 Tampa, FL 1984-1996 Address History San Diego, CA 2005-2009 San Fran, CA 2005-2005 Phoenix, AZ 1990-2005 San Jose, CA 1982-1990
OMG
Space-Time-Travel ,[object Object],[object Object],[object Object],[object Object]
Consequences ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Like Magic … ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Closing Thoughts
[object Object]
To Beat the Competition … Human  Capital Tools Data First Fastest Sensemaking
[object Object],[object Object]
Key Points ,[object Object],[object Object],[object Object],[object Object]
” The data will find the data … and the relevance will find you.”
Data Finding Data “ Jump to the right 1 foot!” Observations of migratory birds Data about where you are right now
…  you and your doctor … When this technology  serves … …  the police looking at you … LOVE! HATE!
Wish This On The Enemy Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Enterprise Amnesia
Enterprise Intelligence: The Way Forward Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Context  Accumulation
Better Prediction to Discard Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Context  Accumulation New/Useful Information Data Reduction
Related Blog Posts ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Blogging At: www.JeffJonas.TypePad.com Information Management Privacy National Security  and Triathlons
Enterprise Amnesia  vs. Enterprise Intelligence Jeff Jonas,  IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics [email_address] November 18, 2010 DEFRAG 2010

Contenu connexe

Similaire à Defrag 2010-distrib

Confessions (and Lessons) of a "Recovering" Data Broker
Confessions (and Lessons) of a "Recovering" Data BrokerConfessions (and Lessons) of a "Recovering" Data Broker
Confessions (and Lessons) of a "Recovering" Data Brokermetanautix
 
DEFCON 23 - Michael Schrenk - applied intelligence
DEFCON 23 - Michael Schrenk - applied intelligenceDEFCON 23 - Michael Schrenk - applied intelligence
DEFCON 23 - Michael Schrenk - applied intelligenceFelipe Prado
 
How to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero APIHow to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero APIAzavea
 
DAY 1 Morning 1. Introductions 2. Confirm project .docx
DAY 1 Morning 1. Introductions 2. Confirm project .docxDAY 1 Morning 1. Introductions 2. Confirm project .docx
DAY 1 Morning 1. Introductions 2. Confirm project .docxsimonithomas47935
 
Fishreel Final Lessons Learned
Fishreel Final Lessons LearnedFishreel Final Lessons Learned
Fishreel Final Lessons LearnedH4Diadmin
 
Fishreel Lessons Learned H4D Stanford 2016
Fishreel Lessons Learned H4D Stanford 2016 Fishreel Lessons Learned H4D Stanford 2016
Fishreel Lessons Learned H4D Stanford 2016 Stanford University
 
It's not the documents; it's the DATA
It's not the documents; it's the DATAIt's not the documents; it's the DATA
It's not the documents; it's the DATAJ T "Tom" Johnson
 
Tapping the Data Deluge with R
Tapping the Data Deluge with RTapping the Data Deluge with R
Tapping the Data Deluge with RJeffrey Breen
 
Genevieve Bell @ CMC Media & Data
Genevieve Bell @ CMC Media & DataGenevieve Bell @ CMC Media & Data
Genevieve Bell @ CMC Media & DataMedia Perspectives
 
Strata Conference NY: The Accidental Chief Privacy Officer
Strata Conference NY: The Accidental Chief Privacy OfficerStrata Conference NY: The Accidental Chief Privacy Officer
Strata Conference NY: The Accidental Chief Privacy OfficerJim Adler
 
C4 cnewsletter[jan2015]
C4 cnewsletter[jan2015]C4 cnewsletter[jan2015]
C4 cnewsletter[jan2015]C4CFED
 
Million Dollar Baby Essay Questions
Million Dollar Baby Essay QuestionsMillion Dollar Baby Essay Questions
Million Dollar Baby Essay QuestionsHeather Lopez
 
Dedupe, Merge and Purge: the art of normalization
Dedupe, Merge and Purge: the art of normalizationDedupe, Merge and Purge: the art of normalization
Dedupe, Merge and Purge: the art of normalizationTyler Bell
 
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...Grammarly
 

Similaire à Defrag 2010-distrib (20)

Jeff jonas big data new physics
Jeff jonas big data new physicsJeff jonas big data new physics
Jeff jonas big data new physics
 
SFScon 22 - Paolo Pinto - Real Life Data Anonymization.pdf
SFScon 22 - Paolo Pinto - Real Life Data Anonymization.pdfSFScon 22 - Paolo Pinto - Real Life Data Anonymization.pdf
SFScon 22 - Paolo Pinto - Real Life Data Anonymization.pdf
 
Confessions (and Lessons) of a "Recovering" Data Broker
Confessions (and Lessons) of a "Recovering" Data BrokerConfessions (and Lessons) of a "Recovering" Data Broker
Confessions (and Lessons) of a "Recovering" Data Broker
 
DEFCON 23 - Michael Schrenk - applied intelligence
DEFCON 23 - Michael Schrenk - applied intelligenceDEFCON 23 - Michael Schrenk - applied intelligence
DEFCON 23 - Michael Schrenk - applied intelligence
 
How to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero APIHow to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero API
 
DAY 1 Morning 1. Introductions 2. Confirm project .docx
DAY 1 Morning 1. Introductions 2. Confirm project .docxDAY 1 Morning 1. Introductions 2. Confirm project .docx
DAY 1 Morning 1. Introductions 2. Confirm project .docx
 
Backgrounds for Churches and Nonprofits
Backgrounds for Churches and NonprofitsBackgrounds for Churches and Nonprofits
Backgrounds for Churches and Nonprofits
 
Fishreel Final Lessons Learned
Fishreel Final Lessons LearnedFishreel Final Lessons Learned
Fishreel Final Lessons Learned
 
Fishreel Lessons Learned H4D Stanford 2016
Fishreel Lessons Learned H4D Stanford 2016 Fishreel Lessons Learned H4D Stanford 2016
Fishreel Lessons Learned H4D Stanford 2016
 
It's not the documents; it's the DATA
It's not the documents; it's the DATAIt's not the documents; it's the DATA
It's not the documents; it's the DATA
 
Tapping the Data Deluge with R
Tapping the Data Deluge with RTapping the Data Deluge with R
Tapping the Data Deluge with R
 
mineria de datos
mineria de datosmineria de datos
mineria de datos
 
mineria datos
mineria datosmineria datos
mineria datos
 
Data Journalism 101 - Day 1 by Michael J. Berens
Data Journalism 101 - Day 1 by Michael J. BerensData Journalism 101 - Day 1 by Michael J. Berens
Data Journalism 101 - Day 1 by Michael J. Berens
 
Genevieve Bell @ CMC Media & Data
Genevieve Bell @ CMC Media & DataGenevieve Bell @ CMC Media & Data
Genevieve Bell @ CMC Media & Data
 
Strata Conference NY: The Accidental Chief Privacy Officer
Strata Conference NY: The Accidental Chief Privacy OfficerStrata Conference NY: The Accidental Chief Privacy Officer
Strata Conference NY: The Accidental Chief Privacy Officer
 
C4 cnewsletter[jan2015]
C4 cnewsletter[jan2015]C4 cnewsletter[jan2015]
C4 cnewsletter[jan2015]
 
Million Dollar Baby Essay Questions
Million Dollar Baby Essay QuestionsMillion Dollar Baby Essay Questions
Million Dollar Baby Essay Questions
 
Dedupe, Merge and Purge: the art of normalization
Dedupe, Merge and Purge: the art of normalizationDedupe, Merge and Purge: the art of normalization
Dedupe, Merge and Purge: the art of normalization
 
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...
 

Dernier

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Dernier (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Defrag 2010-distrib

  • 1. Enterprise Amnesia vs. Enterprise Intelligence Jeff Jonas, IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics [email_address] November 18, 2010 DEFRAG 2010
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7. State of the Union: Enterprise Amnesia
  • 8.
  • 9.
  • 10. Trend: Organizations Are Getting Dumber Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Enterprise Amnesia
  • 11. Trend: Organizations Are Getting Dumber Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context WHY?
  • 12. Algorithms at Dead End. You Can’t Squeeze Knowledge Out of a Pixel.
  • 14. Information without context is hardly actionable.
  • 15.
  • 16.
  • 17. Information in Context … and Accumulating Top 200 Customer Job Applicant Identity Thief Term No-Rehire [email_address]
  • 18. Context Accumulation Requires Feature Extraction Video LP#: “Not Cop” Douglas William Barr, Sr. Gene barr, Donn Pinsonne Royce Butler, Robert Lee Edwards DOB: 11 Mar 1936 POB: Cleveland, Ohio Add: 3755 N. Nellis Blvd
  • 19. Some Pieces Just Don’t Relate … (yet)
  • 20. Although … Observations Add Up “ Not Cop” Doug Barr, Sr. DOB: 11 Mar 1936 Add: Las Vegas
  • 21. Observations Add Up “ Not Cop” Doug Barr, Sr. DOB: 11 Mar 1936 Add: Las Vegas
  • 22. From Pixels to Pictures to Insight Observations Contextualization Persistent Context Relevance Detection Consumer (An analyst, a system, the sensor itself, etc.)
  • 23.
  • 24.
  • 25.
  • 26. Overstated Population Observations Unique Identities True Population
  • 27. Counting Is Difficult Mark Smith 6/12/1978 443-43-0000 Mark R Smith (707) 433-0000 DL: 00001234 File 1 File 2
  • 28. The Rise and Fall of a Population Observations Unique Identities True Population
  • 29. Data Triangulation Mark Smith 6/12/1978 443-43-0000 Mark R Smith (707) 433-0000 DL: 00001234 File 1 File 2 Mark Randy Smith 443-43-0000 DL: 00001234 New Record
  • 30.
  • 31. Counting: Degrees of Difficulty Exactly Same Fuzzy Incompatible Features Deceit Bob Jones 123455 Bob Jones 123455 Bob Jones 123455 Robert T Jonnes 000123455 Bob Jones 123455 [email_address] Bob Jones 123455 Ken Wells 550119
  • 32. And Deceit Revealed Observations Unique Identities True Population 6 Liars Busted Here!
  • 34. VOTER George F Balston YOB: 1951 D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DECEASED PERSON George Balston YOB: 1951 SSN: 5598 DOD: 1995 Is This Voter Deceased? When it comes to best practices in voter matching, if only a name and year of birth match, this is insufficient proof of a match. Many different people in the U.S. share a name and year of birth. Human review is required. Unfortunately, there are thousands and thousands of cases just like this and state election offices don’t have the staff (or budget) to manually review such volumes.
  • 35. VOTER George F Balston YOB: 1951 D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DECEASED PERSON George Balston YOB: 1951 SSN: 5598 DOD: 1995 Now Consider This Tertiary DMV Record DMV George F Balston YOB: 1951 SSN: 5598 D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 The DMV record contains enough features to match both the voter (name, year of birth and driver’s license) and/or the deceased persons record (name, year of birth and SSN). For the sake of argument, let’s say it matches the voter best.
  • 36. VOTER George F Balston YOB: 1951 D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DMV George F Balston YOB: 1951 SSN: 5598 D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 DECEASED PERSON George Balston YOB: 1951 SSN: 5598 DOD: 1995 Is This Voter/DMV Person Deceased? The voter/DMV record now shares a name, year of birth and SSN with the deceased person record. In voter matching best practices, this evidence would be sufficient to make a determination that this voter is in fact deceased. This case no longer needs human review.
  • 37. VOTER George F Balston YOB: 1951 D/L: 4801 13070 SW Karen Blvd Apt 7 Beaverton, OR 97005 Last voted: 2008 DMV George F Balston YOB: 1951 SSN: 5598 D/L: 4801 3043 SW Clementine Blvd Apt 210 Beaverton, OR 97005 DECEASED PERSON George Balston YOB: 1951 SSN: 5598 DOD: 1995 Context Accumulates! As features accumulate it becomes easier to match future identity records. As events and transactions accumulate – detection of relevance improves. Here we can see George who died in 1995 voted in 2008.
  • 38. Major Moving Parts Persistent Context Context Analysis Relevance Detection Feature Extraction & Classification Publish Notice Respond CONSUMERS Operational Systems Business Intelligence Data Marts Data Mining Pattern Discovery Predictive Modeling Case Management Visualization Etc. Answers to questions Observations Structured Unstructured Audio/Video Geospatial Biometrics Etc. Questions Search, Discovery, Context Requests Etc.
  • 39. 1 st principle If you do not process every new piece of key data (perception) first like a query … then you will not know if it matters … until someone asks.
  • 40. “The Data is a Query” Beats “Boil the Ocean” Marketing Department Prospect Database Employee Database Human Resources Department Corporate Security Department Investigations Database Batch Analytics
  • 41. 2 nd principle Treat queries like data to avoid having to ask every question every day.
  • 42.
  • 43. 3rd principle Enterprise awareness is computationally most efficient when performed at the moment the observation is perceived.
  • 44. Big Data – New Physics
  • 45.
  • 46. “ G2” My Skunk Works Effort
  • 47.
  • 48.
  • 49. Consider Lying Identical Twins #123 Sue 3/3/84 Uberstan Exp 2011 PASSPORT #123 Sue 3/3/84 Uberstan Exp 2011 PASSPORT Fingerprint DNA Most Trusted Authority “ Same person – trust me.” Most Trusted Authority
  • 50.
  • 51.
  • 52. Life Arcs Are Also Telling Bill Smith 4/13/67 Salem, OR Bill Smith 4/13/67 Seattle, WA Address History Tampa, FL 2008-2008 Biloxi, MS 2005-2008 NY, NY 1996-2005 Tampa, FL 1984-1996 Address History San Diego, CA 2005-2009 San Fran, CA 2005-2005 Phoenix, AZ 1990-2005 San Jose, CA 1982-1990
  • 53. OMG
  • 54.
  • 55.
  • 56.
  • 58.
  • 59. To Beat the Competition … Human Capital Tools Data First Fastest Sensemaking
  • 60.
  • 61.
  • 62. ” The data will find the data … and the relevance will find you.”
  • 63. Data Finding Data “ Jump to the right 1 foot!” Observations of migratory birds Data about where you are right now
  • 64. … you and your doctor … When this technology serves … … the police looking at you … LOVE! HATE!
  • 65. Wish This On The Enemy Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Enterprise Amnesia
  • 66. Enterprise Intelligence: The Way Forward Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Context Accumulation
  • 67. Better Prediction to Discard Time Computing Power Growth Sensemaking Algorithms Available Observation Space Context Context Accumulation New/Useful Information Data Reduction
  • 68.
  • 69. Blogging At: www.JeffJonas.TypePad.com Information Management Privacy National Security and Triathlons
  • 70. Enterprise Amnesia vs. Enterprise Intelligence Jeff Jonas, IBM Distinguished Engineer Chief Scientist, IBM Entity Analytics [email_address] November 18, 2010 DEFRAG 2010