Mining Data from Images and Video for
Indexing and Analysis
Bill Brouwer 01/14/13

01/14/14

wjb19@psu.edu

Current Role at PSU

Computational Scientist, Research Computing and Cyberinfrastructure
(RCC), Penn State 06/2011-present
-Consultant, High Performance Computing (HPC)
-Teaching & Personal Research
-CUDA, C/C++ programming, code profiling/optimization
-Co-writer/recipient of awards
-Local XSEDE Campus Champion
-Publications & Presentations
-Maintain/use ~100 open source examples in software stack

Overview
Objective
-Knowledge Discovery & Data Mining (KDD)
-Machine vs Humans
Example Problem
-Quantification in root structures
Methods
-Computer Vision Algorithms
-H.264/AVC codec
Solution
-avpipe

Knowledge Discovery & Data Mining (KDD)
Goal: simply put, to learn things from data; the data first needs to
be in a database or other usable state
Hard enough for text documents, much harder for images/video
because the content is binary data
Even when metadata from tagging allows indexing and retrieval,
large amounts of image data remain difficult to analyze
Want to make both indexing and analysis easier through
software; useful data can be created from binary content using
machines or humans

Machine: Examples
SKYTree
-Startup that recently secured ~$18M in Series A funding; provides
solutions to 'big data' problems, deriving value from disparate data
using machine learning (ML)
Roistr
-Startup dedicated to 'meaning discovery'
-Good for product recommendation problems, e.g., take a customer's
Twitter feed and on this basis recommend some books to read
Plot2txt
-Personal start-up devoted to mining technical content from images
using unsupervised ML
-Works well on spectroscopic and oil & gas data

Humans: Amazon Mechanical Turk
Crowdsourced solution to problems that are hard for machines;
tasks are referred to as Human Intelligence Tasks (HITs)
Turkers are the workers (the masses) to whom other users submit
tasks via a web interface
Task examples include image tagging, comparison, writing
product descriptions
Not really scalable; humans are expensive and bad at accurate
measurement, e.g., extracting quantitative data from images

Quantifying Root Structure
Extract frames and for each:
-Detect edges for structures of interest
-Create VTK file of volumes for subsequent visualization
& measurement
Problem provided by J. Yang (Brown/Lynch lab)
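
The per-frame edge detection step can be sketched as below. This is a minimal illustration, not the method used for the root data: the slides do not name a specific detector, so a Sobel operator is assumed here, with the gradient magnitude approximated as |gx| + |gy|. The function name sobel_edges, the threshold value, and the list-of-rows frame layout are all hypothetical choices.

```python
from typing import List

Frame = List[List[int]]  # grayscale frame as rows of 0-255 intensities


def sobel_edges(frame: Frame, threshold: int = 128) -> Frame:
    """Return a binary edge map (1 = edge) from the Sobel gradient.

    The magnitude is approximated as |gx| + |gy|; border pixels are
    left at 0 because the 3x3 kernel does not fit there.
    """
    h, w = len(frame), len(frame[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column
            gx = (frame[y - 1][x + 1] + 2 * frame[y][x + 1] + frame[y + 1][x + 1]
                  - frame[y - 1][x - 1] - 2 * frame[y][x - 1] - frame[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row
            gy = (frame[y + 1][x - 1] + 2 * frame[y + 1][x] + frame[y + 1][x + 1]
                  - frame[y - 1][x - 1] - 2 * frame[y - 1][x] - frame[y - 1][x + 1])
            if abs(gx) + abs(gy) >= threshold:
                edges[y][x] = 1
    return edges


# A 6x6 test frame: dark left half, bright right half
frame = [[0, 0, 0, 255, 255, 255] for _ in range(6)]
edge_map = sobel_edges(frame)
for row in edge_map:
    print(row)
```

On this test frame the interior rows are marked in the two columns that straddle the dark/bright boundary, which is the vertical edge one would expect.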

Methods
Edge Detection
Connected Components
Binarization/thresholding
Threaded computation & synchronization
Ubiquitous H.264/AVC codec, common to HD format playback
and transmission
-Associated IP issues have made development/deployment of
software tricky/expensive
-Cisco recently open-sourced an implementation:
http://blogs.cisco.com/collaboration/open-source-h-264-removes-barriers-webrtc/
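
Binarization followed by connected components can be sketched as below. This is an illustration under assumptions: a fixed global threshold, 4-connectivity, and a breadth-first flood fill rather than the classic two-pass labeling; the slides do not say which variant avpipe uses. The names binarize and label_components are hypothetical.

```python
from collections import deque
from typing import List, Tuple

Frame = List[List[int]]


def binarize(frame: Frame, threshold: int) -> Frame:
    """Map each pixel to 1 if it is at or above the threshold, else 0."""
    return [[1 if p >= threshold else 0 for p in row] for row in frame]


def label_components(binary: Frame) -> Tuple[Frame, int]:
    """Label 4-connected foreground regions with 1, 2, ... via flood fill.

    Returns the label image (0 = background) and the number of regions.
    """
    h, w = len(binary), len(binary[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for y in range(h):
        for x in range(w):
            if binary[y][x] != 1 or labels[y][x] != 0:
                continue
            # Unlabeled foreground pixel: start a new region here
            count += 1
            labels[y][x] = count
            queue = deque([(y, x)])
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and binary[ny][nx] == 1 and labels[ny][nx] == 0):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count


frame = [
    [200, 200, 10, 10],
    [200, 10, 10, 220],
    [10, 10, 10, 220],
]
binary = binarize(frame, 128)
labels, n = label_components(binary)
```

Here the bright L-shaped block in the top-left corner and the bright column on the right come out as two separate regions, so n is 2. Per-region pixel counts from the label image are one simple route to the quantitative measurements mentioned earlier.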

Solution: avpipe
Pipeline: stdin -> decode -> threads -> encode(?) -> stdout, with extracted data sent to out
Takes AVI stream from stdin, decodes it and sends frames to
threads
Frames after operation may be re-encoded and sent to stdout
Data output extracted from frames may be saved to file/sent
to stderr
Chain avpipe instances together using pipes
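
The shape of such a filter can be sketched as below. This is not avpipe's code: to stay self-contained it assumes raw 8-bit grayscale frames of a known size instead of an AVI stream, so the decode and encode stages are omitted. The names read_frames, process_frame, and run_filter are hypothetical, and the per-frame operation (invert the image, count bright pixels) is only a stand-in.

```python
import io
from concurrent.futures import ThreadPoolExecutor
from typing import BinaryIO, Iterator, TextIO, Tuple


def read_frames(stream: BinaryIO, frame_bytes: int) -> Iterator[bytes]:
    """Yield fixed-size raw frames until the stream ends."""
    while True:
        buf = stream.read(frame_bytes)
        if len(buf) < frame_bytes:
            return  # end of stream (a trailing partial frame is dropped)
        yield buf


def process_frame(frame: bytes) -> Tuple[bytes, int]:
    """Stand-in operation: invert the frame and count bright pixels."""
    bright = sum(1 for p in frame if p >= 128)
    inverted = bytes(255 - p for p in frame)
    return inverted, bright


def run_filter(src: BinaryIO, dst: BinaryIO, data: TextIO,
               width: int, height: int, workers: int = 4) -> int:
    """Read frames from src, process them in worker threads, write the
    processed frames to dst and one line of extracted data per frame to data.
    Returns the number of frames handled.
    """
    handled = 0
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map returns results in input order, so frames stay in
        # sequence. It also consumes its input eagerly; a real tool would
        # bound the number of frames in flight.
        results = pool.map(process_frame, read_frames(src, width * height))
        for index, (out_frame, bright) in enumerate(results):
            dst.write(out_frame)
            data.write(f"{index}\t{bright}\n")
            handled += 1
    return handled


# Two 2x2 frames held in memory stand in for the piped stream
src = io.BytesIO(bytes([0, 255, 200, 10]) + bytes([130, 130, 130, 0]))
dst = io.BytesIO()
data = io.StringIO()
count = run_filter(src, dst, data, width=2, height=2)
```

In a real pipe the same function would be called with sys.stdin.buffer, sys.stdout.buffer, and sys.stderr, which lets several instances be chained with shell pipes while the extracted data stays out of the frame stream. In CPython the thread pool here shows the structure only; pure-Python pixel loops do not run in parallel across threads.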

Project Status
Basic framework released on GitHub
-https://github.com/wjb19/avpipe
Currently incorporating:
-Codec
-Binarization & connected-component labeling (CCL)
-VTK output using library developed by Burak Korkut
http://liberlocus.blogspot.com/
Other applications??
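
For orientation, VTK output of a stack of per-frame label images can be sketched as below. This does not use or imitate the library mentioned above; it writes the legacy ASCII VTK format directly, and it assumes that frames are stacked along z with unit spacing. The function name write_vtk_structured_points and the scalar name "labels" are hypothetical.

```python
import io
from typing import List, TextIO

Volume = List[List[List[int]]]  # indexed as volume[z][y][x]


def write_vtk_structured_points(out: TextIO, volume: Volume,
                                name: str = "labels") -> None:
    """Write an integer volume as a legacy ASCII VTK STRUCTURED_POINTS dataset."""
    nz, ny, nx = len(volume), len(volume[0]), len(volume[0][0])
    out.write("# vtk DataFile Version 3.0\n")
    out.write("stacked frame labels\n")  # free-form title line
    out.write("ASCII\n")
    out.write("DATASET STRUCTURED_POINTS\n")
    out.write(f"DIMENSIONS {nx} {ny} {nz}\n")
    out.write("ORIGIN 0 0 0\n")
    out.write("SPACING 1 1 1\n")  # assumed unit spacing between samples
    out.write(f"POINT_DATA {nx * ny * nz}\n")
    out.write(f"SCALARS {name} int 1\n")
    out.write("LOOKUP_TABLE default\n")
    # The legacy format expects x to vary fastest, then y, then z
    for plane in volume:
        for row in plane:
            out.write(" ".join(str(v) for v in row) + "\n")


# Two 2x2 label images stacked into a small volume
volume = [
    [[0, 1], [1, 1]],
    [[0, 0], [2, 2]],
]
buf = io.StringIO()
write_vtk_structured_points(buf, volume)
vtk_text = buf.getvalue()
```

Passing an open text file instead of the StringIO buffer produces a .vtk file that visualization tools reading the legacy format should be able to load.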
