The Economics of Artificial
Intelligence and Machine Learning
for Semantic Enrichment
Monday, 8 April 2019, Nice
IC-SDV Conference 2019
Jay Ven Eman, Ph.D., CEO
Access Innovations, Inc. / Data Harmony
j_ven_eman@accessinn.com
www.accessinn.com
+1.505.998.0800
Albuquerque, NM USA
Access Innovations, Inc.
The Science behind the Semantics™
www.accessinn.com
© 2010. Access Innovations, Inc. All Rights Reserved.
Summary
❖ AI/ML/DL hold promise for “Content”
❖ But big, headline-grabbing failures
❖ Costs can run to the billions
❖ Choose carefully
❖ Choose narrowly
❖ Focus on improving content for customer utility and
process workflow improvements
© 2013. Access Innovations, Inc. All rights reserved.
Some of Our Current Clients
IOP
AI – It’ll Be Awesome!
Headlines – the good, the bad, & the ugly
❖ The Good
❖ “Can Artificial Intelligence Help Reduce False-positive
Mammograms?”
❖ “You Might Want Artificial Intelligence Reading Your Next
Mammogram”
❖ “When AI writes the Court Rulings”
❖ “Fast and Accurate Annotations of Short Texts with Wikipedia
Pages”
Headlines – the good, the bad, & the ugly
❖ The Bad
❖ “Without Humans, Artificial Intelligence is Still Pretty Stupid”
❖ “The Future of AI Depends on a Huge Workforce of Human
Teachers” and “Why AI is Useless Without Human Beings”
❖ “Google Has Picked an Answer for You – Too Bad It’s Often
Wrong”
❖ “Artificial Intelligence Still Isn’t a Game Changer?”
❖ “Google, Smoogle. Reference Librarians Are Busier Than Ever”
Headlines – the good, the bad, & the ugly
❖ The Ugly
❖ “Some AI Lessons from Watson’s Failure at MD Anderson”
❖ “MD Anderson Benches IBM Watson In Setback for Artificial
Intelligence in Medicine”
❖ “Artificial Intelligence and Bad Data”
❖ “Sky-high Salaries Are the Weapons in the AI Talent War”
❖ And for ‘STM’ – SciGen – “Tech society retracts 29 articles, ousts
three editors for ‘systematic violation’ of peer review policies”
IBM Watson and MD Anderson Cancer Center
❖ “Teaching a machine to read a record is a lot harder than anyone
thought.”
❖ First, know that by the end Watson was getting good results!
❖ The Fail
▪ Not enough data
▪ Inconsistent and bad data
▪ Incompatible systems (Watson vs. Epic’s EHR)
▪ Changing objectives: oops, need to retrain Watson!
▪ Lack of AI knowledge and expertise
▪ Cost overruns – US$62 million spent before tabling the project
Get back to business basics
❖ “Don’t believe the hype – AI is just a tool at the end of the day,
but a very clever tool…”
❖ The Hype
❖ “…find growth and accelerate innovation within an open data environment”
❖ “…breaking the silos of the status quo…”
❖ “Adopting a holistic data strategy…”
❖ “…providing next generation…”
❖ “IBM Watson capabilities to unlock previously unavailable data insights”
Get back to business basics
❖ Stick to basic business practices
❖ Use cases
❖ Business cases
❖ “Plan your dive. Dive your plan.”
❖ Without a big budget, keep your expectations in check – narrow your
focus
❖ Do you have US$62 million to blow?
Get back to business basics
❖ Use cases
❖ Anti-SciGen, fake news detection, submissions analysis
❖ Auto text generation and summarizations (big in the news business)
❖ Court rulings (e.g. Prometea – Argentina)
❖ Washington Post, Associated Press (e.g. sports summaries)
❖ Machine automated indexing (MAI), semantic enrichment
❖ Image analysis and recognition, info-graphics
❖ Author & institution disambiguation, entity extraction, ‘triples’
generation
Some notions of cost
❖ Headline costs – beware
❖ Software – free to hundreds of millions
❖ Support? Think what Red Hat did for Linux
❖ “Genuine” AI/ML/DL software?
Some notions of cost continued
❖ Data quality costs
❖ Very large data sets (thousands to millions) must be gathered and
curated
❖ Data sets must be conceptually and contextually unique (e.g., 20 to
40 items for each semantic node)
❖ Corrupt and inconsistent data needs normalizing and cleanup
❖ Remove biased data
❖ Format consistency
Some notions of cost continued
❖ Training costs – people, time
❖ Look for software that makes training possible in-house
❖ Again, think what Red Hat did for Linux and Data Harmony
❖ Need a good user interface for training tasks, app maintenance
❖ US$0.03 to US$0.15 per piece at outsourcing services, but up to US$2,000
per piece for specialist work, for example tagging a medical image
❖ Staff size – (Facebook – 20k and growing!)
❖ The cost of change – more retraining costs
❖ API - Academic Performance Index
❖ API - Active Pharmaceutical Ingredient
❖ API - Application Programming Interface
❖ API - American Petroleum Institute
Training-costs case study
❖ Objectives for this project
▪ Improve productivity
▪ Improve discovery
❖ Goals
▪ Lower cost per item
▪ Improve discovery to 85% or better for recall and precision
❖ Process compared automatically clustering the content using
NLP only vs. semantically enriching the content
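One way to check the 85% goal is to compare the terms a system assigns to a document against the terms a human indexer chose. The sketch below is a minimal illustration in Python. The function name and the example term sets are hypothetical and are not data from the case study.

```python
def precision_recall(assigned, reference):
    """Return (precision, recall) for one document's index terms.

    precision = share of machine-assigned terms that the human also chose
    recall    = share of the human's terms that the machine also assigned
    """
    assigned, reference = set(assigned), set(reference)
    true_positives = len(assigned & reference)
    precision = true_positives / len(assigned) if assigned else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    return precision, recall


# Hypothetical term sets for a single document (illustration only).
machine_terms = {"oncology", "machine learning", "electronic health records", "radiology"}
human_terms = {"oncology", "machine learning", "electronic health records", "clinical trials"}

GOAL = 0.85  # the 85% target named on the slide

p, r = precision_recall(machine_terms, human_terms)
meets_goal = p >= GOAL and r >= GOAL
print(f"precision={p:.2f} recall={r:.2f} meets goal: {meets_goal}")
```

In practice the two figures would be averaged over a representative sample of documents rather than judged on a single item.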
Training-costs case study continued
❖ NLP-type AI system
▪ 7500 semantic nodes to train
▪ 7500 labor hours to curate training sets
❖ 20 to 40 items per node needed
❖ Review 60 automatically generated items to get to 20 “unique”
❖ Retrain is still 1 hour per node
❖ Hybrid NLP/rules layer with curated taxonomy*
▪ 7500 semantic nodes to train
▪ 125 labor hours to curate automatically generated rules layer
▪ Retrain is <5 minutes per node
*Data Harmony®
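The labor figures above can be compared with a few lines of arithmetic. The Python sketch below uses only the numbers on the slide, plus one clearly hypothetical assumption: that 10% of the nodes need retraining after a change. The hybrid retraining time is taken at its stated upper bound of 5 minutes per node.

```python
# Figures taken from the slide.
NODES = 7500                            # semantic nodes to train
NLP_CURATION_HOURS = 7500               # NLP-only: curate training sets
HYBRID_CURATION_HOURS = 125             # hybrid: curate generated rules layer
NLP_RETRAIN_HOURS_PER_NODE = 1.0        # NLP-only retraining per node
HYBRID_RETRAIN_HOURS_PER_NODE = 5 / 60  # slide says "<5 minutes"; upper bound


def retrain_hours(nodes_changed, hours_per_node):
    """Labor hours to retrain a given number of nodes."""
    return nodes_changed * hours_per_node


# Initial curation effort.
curation_ratio = NLP_CURATION_HOURS / HYBRID_CURATION_HOURS
hybrid_minutes_per_node = HYBRID_CURATION_HOURS * 60 / NODES

# HYPOTHETICAL assumption, not from the slides: 10% of nodes change.
nodes_changed = NODES // 10

nlp_retrain = retrain_hours(nodes_changed, NLP_RETRAIN_HOURS_PER_NODE)
hybrid_retrain = retrain_hours(nodes_changed, HYBRID_RETRAIN_HOURS_PER_NODE)

print(f"Initial curation: NLP-only takes {curation_ratio:.0f}x the hybrid labor")
print(f"Hybrid curation averages {hybrid_minutes_per_node:.1f} min per node")
print(f"Retraining {nodes_changed} nodes: {nlp_retrain:.0f} h vs at most {hybrid_retrain:.1f} h")
```

On the slide's own numbers, initial curation is 60 times more labor-intensive for the NLP-only system, and the hybrid approach averages about one minute of curation per node.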
And, finally…
❖ Like MD Anderson, know when to cut your losses
❖ Good luck!
And that good luck will come from
good planning and good
execution!
Thank you!

IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning for Automatic Categorization and Semantic Enrichment - Jay Ven Eman (CEO, Access Innovations, USA)

  • 1. The Economics of Artificial Intelligence and Machine Learning for Semantic Enrichment Monday, 8 April 2019, Nice IC-SDV Conference 2019 Jay Ven Eman, Ph.D., CEO Access Innovations, Inc. / Data Harmony j_ven_eman@accessinn.com www.accessinn.com +1.505.998.0800 Albuquerque, NM USA Access Innovations, Inc. The Science behind the Semantics™ www.accessinn.com
  • 2. © 2010. Access Innovations, Inc. All Rights Reserved. Summary ❖ AI/ML/DL hold promise for “Content” ❖ But big, headline grabbing failures ❖ Costs can run to the billions ❖ Choose carefully ❖ Choose narrowly ❖ Focus on improving content for customer utility and process workflow improvements
  • 3. © 2013. Access Innovations, Inc. All rights reserved. Some of Our Current Clients IOP
  • 4. © 2010. Access Innovations, Inc. All Rights Reserved. Access Innovations, Inc.- The Science behind the Semantics™ - www.accessinn.com AI – It’ll Be Awesome!
  • 5. © 2010. Access Innovations, Inc. All Rights Reserved. Headlines – the good, the bad, & the ugly ❖ The Good ❖ “Can Artificial Intelligence Help Reduce False-positive Mammograms?” ❖ “You Might Want Artificial Intelligence Reading Your Next Mammogram” ❖ “When AI writes the Court Rulings” ❖ “Fast and Accurate Annotations of Short Texts with Wikipedia Pages” Access Innovations, Inc.- The Science behind the Semantics™ - www.accessinn.com
  • 6. © 2010. Access Innovations, Inc. All Rights Reserved. Headlines – the good, the bad, & the ugly ❖ The Bad ❖ “Without Humans, Artificial Intelligence is Still Pretty Stupid” ❖ “The Future of AI Depends on a Huge Workforce of Human Teachers” and “Why AI is Useless Without Human Beings” ❖ “Google Has Picked an Answer for You – Too Bad It’s Often Wrong” ❖ “Artificial Intelligence Still Isn’t a Game Changer?” ❖ “Google, Smoogle. Reference Librarians Are Busier Than Ever” Access Innovations, Inc.- The Science behind the Semantics™ - www.accessinn.com
  • 7. Headlines – the good, the bad, & the ugly ❖ The Ugly ❖ “Some AI Lessons from Watson’s Failure at MD Anderson” ❖ “MD Anderson Benches IBM Watson In Setback for Artificial Intelligence in Medicine” ❖ “Artificial Intelligence and Bad Data” ❖ “Sky-high Salaries Are the Weapons in the AI Talent War” ❖ And for ‘STM’ – SciGen – “Tech society retracts 29 articles, ousts three editors for ‘systematic violation’ of peer review policies”
  • 8. IBM Watson and MD Anderson Cancer Center ❖ “Teaching a machine to read a record is a lot harder than anyone thought.” ❖ First, know that Watson was getting good results! ❖ The Fail ▪ Not enough data ▪ Inconsistent and bad data ▪ Incompatible systems (Watson and EPIC’s EHR) ▪ Changing objectives: oops, need to retrain Watson! ▪ Lack of AI knowledge and expertise ▪ Cost overruns – US$62 million spent before tabling the project
  • 10. IBM Watson and MD Anderson Cancer Center ❖ “Teaching a machine to read a record is a lot harder than anyone thought.” ❖ First, know that by the end Watson was getting good results! ❖ The Fail ▪ Not enough data ▪ Inconsistent and bad data ▪ Incompatible systems (Watson and EPIC’s EHR) ▪ Changing objectives: oops, need to retrain Watson! ▪ Lack of AI knowledge and expertise ▪ Cost overruns – US$62 million spent before tabling the project
  • 11. Get back to business basics ❖ “Don’t believe the hype – AI is just a tool at the end of the day, but a very clever tool…” ❖ The Hype ❖ “…find growth and accelerate innovation within an open data environment” ❖ “…breaking the silos of the status quo…” ❖ “Adopting a holistic data strategy…” ❖ “…providing next generation…” ❖ “IBM Watson capabilities to unlock previously unavailable data insights”
  • 12. Get back to business basics ❖ Stick to basic business practices ❖ Use cases ❖ Business cases ❖ “Plan your dive. Dive your plan.” ❖ Without a big budget, keep your expectations in check – narrow your focus ❖ Do you have US$62 million to blow?
  • 13. Get back to business basics ❖ Use cases ❖ Anti-SciGen, fake news detection, submissions analysis ❖ Auto text generation and summarization (big in the news business) ❖ Court rulings (e.g. Prometea – Argentina) ❖ Washington Post, Associated Press (e.g. sports summaries) ❖ Machine automated indexing (MAI), semantic enrichment ❖ Image analysis and recognition, info-graphics ❖ Author & institution disambiguation, entity extraction, ‘triples’ generation
  • 14. Some notions of cost ❖ Headline costs – beware ❖ Software – free to hundreds of millions ❖ Support? Think what Red Hat did for Linux ❖ “Genuine” AI/ML/DL software?
  • 15. Some notions of cost continued ❖ Data quality costs ❖ Very large data sets (thousands to millions) must be gathered and curated ❖ Data sets must be conceptually and contextually unique (e.g. 20 to 40 items for each semantic node) ❖ Corrupt and inconsistent data needs normalizing and cleanup ❖ Remove biased data ❖ Format consistency
  • 16. Some notions of cost continued ❖ Training costs – people, time ❖ Look for software that makes training possible in-house ❖ Again, think what Red Hat did for Linux and Data Harmony ❖ Need a good user interface for training tasks, app maintenance ❖ US$0.03 to US$0.15 per piece at outsourcing services, but up to US$2,000 for, say, tagging a medical image ❖ Staff size (Facebook: 20k and growing!) ❖ The cost of change – more retraining costs
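To put the per-piece outsourcing rates in perspective, here is a minimal sketch that turns them into a budget range. The rates come from the slide; the function name and the 100,000-piece volume are assumptions chosen only for illustration.

```python
# Per-piece outsourcing rates quoted on the slide, held in US cents
# so the arithmetic stays exact.
LOW_CENTS_PER_PIECE = 3      # US$0.03
HIGH_CENTS_PER_PIECE = 15    # US$0.15


def labeling_cost_range(pieces: int) -> tuple:
    """Return the (low, high) outsourced labeling cost in US dollars."""
    if pieces < 0:
        raise ValueError("pieces must be non-negative")
    low = pieces * LOW_CENTS_PER_PIECE / 100
    high = pieces * HIGH_CENTS_PER_PIECE / 100
    return (low, high)


if __name__ == "__main__":
    # 100,000 pieces is a hypothetical volume, not a figure from the talk.
    low, high = labeling_cost_range(100_000)
    print(f"US${low:,.0f} to US${high:,.0f}")
```

The sketch leaves out the high end of the scale (up to US$2,000 per item for specialist work such as medical images), which would need a separate line item in any real estimate.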
  • 18. ❖ API - Academic Performance Index ❖ API - Active Pharmaceutical Ingredient ❖ API - Application Programming Interface ❖ API - American Petroleum Institute
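The four expansions of "API" show why context is needed to tag a term correctly. The toy sketch below picks an expansion by counting cue words in the surrounding text. The cue lists and the function name are assumptions made up for this illustration; it is not how Data Harmony or any particular product works.

```python
import re
from typing import Optional

# Toy cue words per expansion; these lists are illustrative assumptions.
CUES = {
    "Academic Performance Index": {"school", "students", "district", "scores"},
    "Active Pharmaceutical Ingredient": {"drug", "dosage", "tablet", "formulation"},
    "Application Programming Interface": {"software", "endpoint", "developer", "request"},
    "American Petroleum Institute": {"oil", "gas", "drilling", "refinery"},
}


def expand_api(text: str) -> Optional[str]:
    """Return the expansion whose cue words overlap most with the text.

    Returns None when no cue word appears, i.e. the context is too thin
    to decide.
    """
    words = set(re.findall(r"[a-z]+", text.lower()))
    scores = {sense: len(words & cues) for sense, cues in CUES.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    print(expand_api("The API content of each tablet met the dosage limit."))
    print(expand_api("The developer sent a request to the API endpoint."))
    print(expand_api("API"))
```

A hand-written cue list like this is a crude stand-in for a curated rules layer, but it shows the kind of evidence such a layer relies on.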
  • 19. Training-costs case study ❖ Objectives for this project ▪ Improve productivity ▪ Improve discovery ❖ Goals ▪ Lower cost per item ▪ Improve discovery to 85% or better for recall and precision ❖ The process compared automatically clustering the content using NLP only against semantically enriching the content
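The 85% goal is stated in terms of recall and precision. As a reminder of the standard definitions, precision is the share of assigned terms that are correct, and recall is the share of correct terms that were assigned. The sketch below checks one document against the target; the function names and the sample terms are hypothetical, and only the 85% threshold comes from the slide.

```python
TARGET = 0.85  # "85% or better for recall and precision" (from the slide)


def precision_recall(assigned: set, relevant: set) -> tuple:
    """Return (precision, recall) for one document's index terms."""
    hits = len(assigned & relevant)
    precision = hits / len(assigned) if assigned else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def meets_target(assigned: set, relevant: set, target: float = TARGET) -> bool:
    """True only when both precision and recall reach the target."""
    precision, recall = precision_recall(assigned, relevant)
    return precision >= target and recall >= target


if __name__ == "__main__":
    # Hypothetical terms: what the system assigned vs. what an indexer chose.
    assigned = {"machine learning", "taxonomies", "indexing", "oncology"}
    relevant = {"machine learning", "taxonomies", "indexing", "semantic enrichment"}
    print(precision_recall(assigned, relevant))
    print(meets_target(assigned, relevant))
```

In practice the two figures would be averaged over a test collection rather than judged on a single document.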
  • 20. Training-costs case study continued ❖ NLP-type AI system ▪ 7500 semantic nodes to train ▪ 7500 labor hours to curate training sets ❖ 20 to 40 items per node needed ❖ Review 60 automatically generated items to get to 20 “unique” ❖ Retrain is still 1 hour per node ❖ Hybrid NLP/rules layer with curated taxonomy* ▪ 7500 semantic nodes to train ▪ 125 labor hours to curate automatically generated rules layer ▪ Retrain is <5 minutes per node *Data Harmony®
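The case-study figures can be compared with a few lines of arithmetic. The node count, labor hours, and retraining times below are taken from the slide; the function names and the 750-node retraining scenario are assumptions added for illustration, and the hybrid retraining time uses the slide's "<5 minutes" as an upper bound.

```python
NODES = 7500                       # semantic nodes to train (from the slide)
NLP_CURATION_HOURS = 7500          # NLP-only system (from the slide)
HYBRID_CURATION_HOURS = 125        # hybrid NLP/rules system (from the slide)
NLP_RETRAIN_MIN_PER_NODE = 60      # "1 hour per node"
HYBRID_RETRAIN_MIN_PER_NODE = 5    # upper bound for "<5 minutes per node"


def curation_minutes_per_node(total_hours: float, nodes: int = NODES) -> float:
    """Average initial curation effort per node, in minutes."""
    return total_hours * 60 / nodes


def retrain_hours(nodes_changed: int, minutes_per_node: float) -> float:
    """Labor hours needed to retrain a given number of nodes."""
    return nodes_changed * minutes_per_node / 60


if __name__ == "__main__":
    print(curation_minutes_per_node(NLP_CURATION_HOURS))     # minutes per node, NLP only
    print(curation_minutes_per_node(HYBRID_CURATION_HOURS))  # minutes per node, hybrid
    print(NLP_CURATION_HOURS / HYBRID_CURATION_HOURS)        # initial effort ratio

    # Hypothetical scenario: 10% of the nodes (750) change and need retraining.
    print(retrain_hours(750, NLP_RETRAIN_MIN_PER_NODE))
    print(retrain_hours(750, HYBRID_RETRAIN_MIN_PER_NODE))   # at most this many hours
```

On the slide's numbers, initial curation works out to 60 minutes per node for the NLP-only system against 1 minute per node for the hybrid system, a 60-fold difference in labor before any retraining is counted.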
  • 21. And, finally… ❖ Like MD Anderson, know when to cut your losses ❖ Good luck! And that good luck will come from good planning and good execution!
  • 22. Thank you! Jay Ven Eman, Ph.D., CEO Access Innovations, Inc. / Data Harmony j_ven_eman@accessinn.com www.accessinn.com +1.505.998.0800 Albuquerque, NM USA Access Innovations, Inc. The Science behind the Semantics™ www.accessinn.com