SlideShare une entreprise Scribd logo
1  sur  40
Télécharger pour lire hors ligne
Big Data Analytics
Using the information effectively
9/10/2013 | 2 | ©2013 Ciber, Inc.
Agenda
• Big Data
• Making Sense of it all
• A Framework of Understanding
• Topical information
• Non Topical Information
• Analytics
• Examples
• Getting there
• Q&A
9/10/2013 | 3 | ©2013 Ciber, Inc.
Social Network Diagram
• Contextual analytics is one of the hottest areas of interest pertaining to big
data today
• Smart companies know there is tremendous value in contextual analytics.
But aggregating, categorizing, summarizing, exploring and contextualizing
unstructured data is a big undertaking.
9/10/2013 | 4 | ©2012 Ciber, Inc.
Big Data
9/10/2013 | 5 | ©2013 Ciber, Inc.
What is the Big Data market?
Source: “Big Data Market Size and Vendor Revenues”, Wikibon, Jeff Kelly, David Valante, David Elgyer, Feb 2013 – actual data through 2011
Acronyms: TBD = to be determined; SI = systems integrator; BPO = business process outsourcing
9/10/2013 | 6 | ©2013 Ciber, Inc.
Sample Industry Applications of Big Data
Telco
Call Detail Record
(CDR) analytics for:
• Customer service
• Network planning
• Regulatory
compliance
Financial Services
Transaction analytics
for:
• Fraud detection
• Customer retention
• Distribution network
planning (Branch,
ATM, Call Center)
• Regulatory
compliance
• Consumer card /
Merchant activity
Utilities
Network / Process
analytics for:
• Grid monitoring /
reliability studies
• Preventive
Maintenance
• Power production
monitoring /
planning
Retail
Product analytics for:
• Market Basket analytics
• SKU trending
• Competitive analyses
• Context-aware buying
• Social indicators of
brand
Healthcare
Patient analytics for:
• Cost of care
reduction
• Quality of care
improvement
• Claims optimization
• Service provider
consistency
• Outcome
diagnostics
• Regulatory
compliance
9/10/2013 | 7 | ©2013 Ciber, Inc.
Why Big Data? Insights from Analysis
• Time college football products to win customers
– WalmartLabs: social media buzz indicates when customers are
getting excited about the upcoming season and their team(s).
Combined with ShopyCat app provides targeted promos on team
items.
• Detecting nosocomial infections before they kill
infants
– Toronto hospital – Nosocomial infections can be life-threatening
to premature infants if not treated quickly. Neonatal monitoring
with real-time analytics can detect heart beat patterns that
identify an infection before symptoms appear.
9/10/2013 | 8 | ©2013 Ciber, Inc.
Wal-Mart handles more
than 1 million customer
transactions every hour
which import into
databases containing
more than 2.5 petabytes
Volume Velocity Variety
1M/hour
In addition to all procedure,
claims and payment
systems’ structured data
add unstructured data in
EMRs, patient monitoring
devices, publications, drug
structures, social network
comments, carrier health
sites, post-treatment care
records…
80%
Exist in the digital
universe as of early 2013
1 zettabyte =
1,000 exabytes
1,000,000 petabytes
10^9 terabytes
10^12 gigabytes
2.7zettabytes
What Drives Big Data Analytics
9/10/2013 | 9 | ©2013 Ciber, Inc.
EngineeringSocial/Mobile
The Big Data Ecosystem
Enterprise Systems
Customer Loyalty &
Service Systems
Customer
Case Files
E-MailsAudioImagesProvisioning
Systems
Variety Veracity
Velocity Volume
Analysis
Business Outcomes
Predictive Analytics
CEP
Operational Control
Simulation
Social Analytics
Digital Marketing
WEB Analytics
Blogs,
Communities
9/10/2013 | 10 | ©2013 Ciber, Inc.
Hadoop and other options
• A strategy for bringing together
hardware and software
• What choices are available and
how do you choose the best
option?
• How do I govern it?
9/10/2013 | 11 | ©2013 Ciber, Inc.
Big Data Toolscape
9/10/2013 | 12 | ©2013 Ciber, Inc.
There are Many Use Cases for a Big Data Platform
 Social Media - Product/brand Sentiment
analysis
 Brand strategy
 Market analysis
 RFID tracking & analysis
 Transaction analysis to create insight-
based product/service offerings
 Multimodal surveillance
 Cyber security
 Fraud modeling & detection
 Risk modeling & management
 Regulatory reporting
Innovate New Products
at Speed and Scale
Know Everything about your Customer
 Social media customer sentiment
analysis
 Promotion optimization
 Segmentation
 Customer profitability
 Click-stream analysis
 CDR processing
 Multi-channel interaction analysis
 Loyalty program analytics
 Churn prediction
Run Zero Latency Operations
 Smart Grid/meter management
 Distribution load forecasting
 Sales reporting
 Inventory & merchandising optimization
 Options trading
 ICU patient monitoring
 Disease surveillance
 Transportation network optimization
 Store performance
 Environmental analysis
 Experimental research
Instant Awareness of
Risk and Fraud
Exploit Instrumented Assets
 Network analytics
 Asset management and predictive issue resolution
 Website analytics
 IT log analysis
Back
9/10/2013 | 13 | ©2013 Ciber, Inc.
Processing and Archiving Strategies
• Store forever
• Selective storage
• Throw away after processing
9/10/2013 | 14 | ©2012 Ciber, Inc.
Making Sense of it all
9/10/2013 | 15 | ©2013 Ciber, Inc.
Making sense of it all
• Clarity of purpose
• Definition of scope
• Allocation of resources
• Concrete result expectations
• Comparative Analytical Measures (e.g.
KPIs)
– Rationalization of measures into actionable
items and hierarchical groups
– Defining predictive analytics workspaces
!
!
!
9/10/2013 | 16 | ©2013 Ciber, Inc.
Role of the Data Scientist
• Creating Intelligent Tagging
• Selecting tools for analysis
• Defining algorithms and data mining
techniques
9/10/2013 | 17 | ©2012 Ciber, Inc.
A Framework of Understanding
9/10/2013 | 18 | ©2013 Ciber, Inc.
What is Contextualization ?
• Context is the interrelated conditions in which
something exists or occurs . Helping define context
is Environment, Setting, Timeline, Genre
• Why is context important?
– Consistency needed in returned result sets
– The context describes the internal or external “framework”
– Internal contextual information is crucial
– External contextual information is knowledge that which
cannot be gotten from the text of the item itself
– Time and resources are wasted in searching irrelevant
and non-material information
9/10/2013 | 19 | ©2013 Ciber, Inc.
Problems in searching data
• Voluminous
• Ambiguous meanings
• Inconsistent tagging
• Multiple item types – text, formatted, PDF, TIFF,
graphical, blogs, mashups
• Knowledge of what is wanted is required to
understand and return the proper result sets
• Differentiation is necessary between
– Real-time needs (e.g. fraud detection, medical Emergency
room procedures)
– Near-time needs (sometime in the near timeline)
– Relaxed-time (some clearly defined future period)
9/10/2013 | 20 | ©2013 Ciber, Inc.
Topical information
• Topical information is generally
visible in the data stream
–Keywords, data ranges, etc.
9/10/2013 | 21 | ©2013 Ciber, Inc.
Non-topical information
• Has to be retrieved outside the item
– Although topic is crucial to the relevance of an
item, non-topical criteria plays an important
role in the determination of relevance and
significance
– The identification and use of non-content (or
“context”) descriptors is necessary
– How widely agreed upon are the values of a
given criterion among users (or user groups)?
9/10/2013 | 22 | ©2013 Ciber, Inc.
Non-topical information cont’d
–What is the degree to which an attribute-
value is “public” or “private”?
• How useful is each criterion for the search
tasks to be addressed by the specific query
system?
• How easily can a criterion be identified and
assigned to an item?
• What methods can be applied for refining
and speeding retrievals?
9/10/2013 | 23 | ©2013 Ciber, Inc.
Descriptors - The defining of disambiguity
• Do the content descriptors correspond
or relate to non-topical relevance
criteria of the system’s users?
• Will users see a relationship between
their relevance criteria and these
descriptors, and use these descriptors
in their search queries?
9/10/2013 | 24 | ©2013 Ciber, Inc.
Content descriptors
• Content descriptors (topical relevance
criteria)
– “Public” knowledge:
• People of similar cultural backgrounds would
(more or less) agree on the meanings.
However, context descriptors (which can
function as non-topical relevance criteria) can
vary widely in the degree to which their
attribute-values are considered public or
private.
9/10/2013 | 25 | ©2013 Ciber, Inc.
Public Knowledge Examples
• “Has pictures” is a criterion that could be
considered “public” as most people could agree
on whether or not a document “has pictures”, if
given a specific document to evaluate.
• On the other hand, the criterion of “Regency
Era” is highly situationally dependent - i.e. a
limited subset of the public has knowledge of it -
(specifically the period between 1811 and 1820,
when King George III was deemed unfit to rule
and his son - the Prince of Wales - ruled as his
proxy as Prince Regent)
9/10/2013 | 26 | ©2013 Ciber, Inc.
Genres refine taxonomy
• Genre is a “folk typology”
• Item categories must enjoy widespread recognition by their
intended user groups to qualify as genres.
– Examples: Resumes, Ballet, Music, Chemical formulae, statistical
results
• Groups of people agree on and define Genres by mutual
consent (Explicitly and Implicitly)
– E.g. Taxonomies (plants, accounting, medical), laws, voting, polls
• Genres give rise to sub-genres with increasing granularity
– E.g. Music, classical, romantic, new age, atonal
– Genres and sub-genres may contain common elements
• E.g. classical music and romantic music may have an intersection of data
points
9/10/2013 | 27 | ©2013 Ciber, Inc.
Genre knowledge
• Genre is a type based on purpose, form and
content.
– E.g. The “resume” genre is for soliciting employment,
divided into sections with contextual descriptors
• Knowing a particular item’s genre also infers
significant things about an item, sometimes enough
to a make a judgment regarding the Item’s
relevance to an information need
– E.g. The phrase “Classically Trained Musician” infers
knowledge to read music and understand musical
terminology along with additional shades of musical
knowledge
9/10/2013 | 28 | ©2012 Ciber, Inc.
Analytics
9/10/2013 | 29 | ©2013 Ciber, Inc.
Historical Analytics
• Presentation of historical data
– Dashboards, Drill-downs, interactive reports, static reports
– New methods and devices
– Identifying the metrics that affect key objectives
– Synchronizing those metrics through an organization
– Creating user tools to show effects of good (and bad)
choices
– Tying the financial, operational, and sales worlds together
– Analyzing to predict the future
– Refining models for accuracy
9/10/2013 | 30 | ©2013 Ciber, Inc.
Predictive Analytics
• Manipulation of data
– Dashboards, Drill-downs, interactive reports
– New methods and devices
– Varying the metrics that affect key objectives
– Synchronizing the impact of metrics through an
organization
– Creating user tools to show effects of good (and bad)
choices
– Tying the financial, operational, and sales worlds together
– Creating models that show potential future scenarios
– Refining models for accuracy using advanced tools and
statistics
9/10/2013 | 31 | ©2012 Ciber, Inc.
Examples
9/10/2013 | 32 | ©2013 Ciber, Inc.
Examples of Harnessing Data Resources
Retailer reduces time to run
queries by 80% to optimize
inventory
Stock Exchange cuts queries
from 26 hours to 2 minutes
on 2 PB
Government cuts acoustic
analysis from hours to
70 Milliseconds
Utility avoids power failures by
analyzing
10 PB of data in minutes
Telco analyses streaming
network data to reduce
hardware costs by 90%
Hospital analyses streaming
vitals to detect illness
24 hours earlier
Big data challenges exist in every organization today
9/10/2013 | 33 | ©2013 Ciber, Inc.
In Order to Realize New Opportunities, You Need to
Think Beyond Traditional Sources of Data
Transactional and
Application Data
Machine Data Social Data
 Volume
 Structured
 Throughput
 Velocity
 Semi-structured
 Ingestion
 Variety
 Highly unstructured
 Veracity
Enterprise
Content
 Variety
 Highly unstructured
 Volume
9/10/2013 | 34 | ©2013 Ciber, Inc.
• Data at rest – oceans
• Collection of what has streamed
• Web logs, emails, social media
• Unstructured documents: forms, claims
• Structured data from disparate systems
• Data in movement - streams
• Twitter / Facebook comments
• Stock market data
• Sensors: Vital signs of a newly-born
Two Sample Types of Big Data
9/10/2013 | 35 | ©2012 Ciber, Inc.
Getting there
9/10/2013 | 36 | ©2013 Ciber, Inc.
Leveraging Big Data Requires Multiple Platform
Capabilities
Manage & store huge
volume of any data
Hadoop File System
MapReduce
Manage streaming data Stream Computing
Analyze unstructured data Text Analytics Engine
Data WarehousingStructure and control data
Integrate and govern all
data sources
Integration, Data Quality, Security,
Lifecycle Management, MDM
Understand and navigate
federated big data sources
Federated Discovery and Navigation
9/10/2013 | 37 | ©2013 Ciber, Inc.
Outcomes Utilizing Big Data Capabilities
To Analyze Any Big
Data Type
With Unique CapabilitiesAchieve Breakthrough
Outcomes
Content
Transactional
/ Application
Data
Machine Data
Social Media
Data
Visualization
and Discovery
Know Everything
About Your
Customers
Run Zero-latency
Operations
Innovate new
products at Speed
and Scale
Instant Awareness
of Fraud and Risk
Exploit
Instrumented
Assets
Hadoop
Data
Warehousing
Stream
Computing
Integration and
Governance
Text Analytics
9/10/2013 | 38 | ©2013 Ciber, Inc.
Big Data Platform and Entry Points
2 – Analyze Raw Rata
5 – Analyze Streaming
Data
1 – Unlock Big Data
3 – Simplify your
warehouse
4 – Reduce costs with
Hadoop
9/10/2013 | 39 | ©2013 Ciber, Inc.
Q & A
Contact
Richard Gristak, Senior Director of Business Intelligence – rgristak@ciber.com
9/10/2013 | 40 | ©2012 Ciber, Inc.
Thank you

Contenu connexe

Tendances

Improving Policy Coherence and Accessibility through Semantic Web Technologie...
Improving Policy Coherence and Accessibility through Semantic Web Technologie...Improving Policy Coherence and Accessibility through Semantic Web Technologie...
Improving Policy Coherence and Accessibility through Semantic Web Technologie...Edward Curry
 
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraAlgorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraNozha Boujemaa
 
LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)Wolfgang Greller
 
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesSetting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesBYTE Project
 
Transforming the European Data Economy: A Strategic Research and Innovation A...
Transforming the European Data Economy: A Strategic Research and Innovation A...Transforming the European Data Economy: A Strategic Research and Innovation A...
Transforming the European Data Economy: A Strategic Research and Innovation A...Edward Curry
 
Big Data Public-Private Forum_General Presentation
Big Data Public-Private Forum_General PresentationBig Data Public-Private Forum_General Presentation
Big Data Public-Private Forum_General PresentationBIG Project
 
Educating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experienceEducating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experienceResearch Data Alliance
 
Towards Unified and Native Enrichment in Event Processing Systems
Towards Unified and Native Enrichment in Event Processing SystemsTowards Unified and Native Enrichment in Event Processing Systems
Towards Unified and Native Enrichment in Event Processing SystemsEdward Curry
 
The Big Data Value PPP: A Standardisation Opportunity for Europe
The Big Data Value PPP: A Standardisation Opportunity for EuropeThe Big Data Value PPP: A Standardisation Opportunity for Europe
The Big Data Value PPP: A Standardisation Opportunity for EuropeEdward Curry
 
Crowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data ManagementCrowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data ManagementEdward Curry
 
Towards a BIG Data Public Private Partnership
Towards a BIG Data Public Private PartnershipTowards a BIG Data Public Private Partnership
Towards a BIG Data Public Private PartnershipEdward Curry
 
Ethics of Big Data
Ethics of Big DataEthics of Big Data
Ethics of Big DataMatti Vesala
 
Governance of Big Data
Governance of Big DataGovernance of Big Data
Governance of Big DataAlberto Asquer
 
AAMAS-2017 8-12 May, 2017, Sao Paulo, Brazil
AAMAS-2017 8-12 May, 2017, Sao Paulo, BrazilAAMAS-2017 8-12 May, 2017, Sao Paulo, Brazil
AAMAS-2017 8-12 May, 2017, Sao Paulo, BrazilCharith Perera
 
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Edward Curry
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Research Data Alliance
 
Open Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesOpen Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesSlim Turki, Dr.
 
Big data - Key Enablers, Drivers & Challenges
Big data - Key Enablers, Drivers & ChallengesBig data - Key Enablers, Drivers & Challenges
Big data - Key Enablers, Drivers & ChallengesShilpi Sharma
 

Tendances (20)

Improving Policy Coherence and Accessibility through Semantic Web Technologie...
Improving Policy Coherence and Accessibility through Semantic Web Technologie...Improving Policy Coherence and Accessibility through Semantic Web Technologie...
Improving Policy Coherence and Accessibility through Semantic Web Technologie...
 
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive EraAlgorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
 
LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)
 
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesSetting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
 
Transforming the European Data Economy: A Strategic Research and Innovation A...
Transforming the European Data Economy: A Strategic Research and Innovation A...Transforming the European Data Economy: A Strategic Research and Innovation A...
Transforming the European Data Economy: A Strategic Research and Innovation A...
 
Big Data Public-Private Forum_General Presentation
Big Data Public-Private Forum_General PresentationBig Data Public-Private Forum_General Presentation
Big Data Public-Private Forum_General Presentation
 
Educating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experienceEducating Data Scientists: the SoBigData master experience
Educating Data Scientists: the SoBigData master experience
 
Towards Unified and Native Enrichment in Event Processing Systems
Towards Unified and Native Enrichment in Event Processing SystemsTowards Unified and Native Enrichment in Event Processing Systems
Towards Unified and Native Enrichment in Event Processing Systems
 
The Big Data Value PPP: A Standardisation Opportunity for Europe
The Big Data Value PPP: A Standardisation Opportunity for EuropeThe Big Data Value PPP: A Standardisation Opportunity for Europe
The Big Data Value PPP: A Standardisation Opportunity for Europe
 
Crowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data ManagementCrowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data Management
 
Towards a BIG Data Public Private Partnership
Towards a BIG Data Public Private PartnershipTowards a BIG Data Public Private Partnership
Towards a BIG Data Public Private Partnership
 
Ethics of Big Data
Ethics of Big DataEthics of Big Data
Ethics of Big Data
 
Governance of Big Data
Governance of Big DataGovernance of Big Data
Governance of Big Data
 
AAMAS-2017 8-12 May, 2017, Sao Paulo, Brazil
AAMAS-2017 8-12 May, 2017, Sao Paulo, BrazilAAMAS-2017 8-12 May, 2017, Sao Paulo, Brazil
AAMAS-2017 8-12 May, 2017, Sao Paulo, Brazil
 
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
 
Privacy in the Age of Big Data
Privacy in the Age of Big DataPrivacy in the Age of Big Data
Privacy in the Age of Big Data
 
4D Geospatial Analytics in Digital Healthcare PDF
4D Geospatial Analytics in Digital Healthcare PDF4D Geospatial Analytics in Digital Healthcare PDF
4D Geospatial Analytics in Digital Healthcare PDF
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
 
Open Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesOpen Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and Opportunities
 
Big data - Key Enablers, Drivers & Challenges
Big data - Key Enablers, Drivers & ChallengesBig data - Key Enablers, Drivers & Challenges
Big data - Key Enablers, Drivers & Challenges
 

En vedette

When a Collection is Not a Collection ASA 2014
When a Collection is Not a Collection ASA 2014When a Collection is Not a Collection ASA 2014
When a Collection is Not a Collection ASA 2014Jill Emery
 
All that counts
All that countsAll that counts
All that countsJill Emery
 
Breaking Silos: Staffing for the OA Library SPARC 2014
Breaking Silos: Staffing for the OA Library SPARC 2014 Breaking Silos: Staffing for the OA Library SPARC 2014
Breaking Silos: Staffing for the OA Library SPARC 2014 Jill Emery
 
NPG OA Presentation
NPG OA PresentationNPG OA Presentation
NPG OA PresentationJill Emery
 
Je transforming collections
Je transforming collectionsJe transforming collections
Je transforming collectionsJill Emery
 
Rcjeks halfway open
Rcjeks halfway openRcjeks halfway open
Rcjeks halfway openJill Emery
 

En vedette (8)

When a Collection is Not a Collection ASA 2014
When a Collection is Not a Collection ASA 2014When a Collection is Not a Collection ASA 2014
When a Collection is Not a Collection ASA 2014
 
All that counts
All that countsAll that counts
All that counts
 
A Life Exposed
A Life ExposedA Life Exposed
A Life Exposed
 
Ecosystems
EcosystemsEcosystems
Ecosystems
 
Breaking Silos: Staffing for the OA Library SPARC 2014
Breaking Silos: Staffing for the OA Library SPARC 2014 Breaking Silos: Staffing for the OA Library SPARC 2014
Breaking Silos: Staffing for the OA Library SPARC 2014
 
NPG OA Presentation
NPG OA PresentationNPG OA Presentation
NPG OA Presentation
 
Je transforming collections
Je transforming collectionsJe transforming collections
Je transforming collections
 
Rcjeks halfway open
Rcjeks halfway openRcjeks halfway open
Rcjeks halfway open
 

Similaire à Minn twdi 9 9

Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applicationsPadma Metta
 
Internet of things ecosystem: The quest for value
Internet of things ecosystem: The quest for valueInternet of things ecosystem: The quest for value
Internet of things ecosystem: The quest for valueDeloitte United States
 
Big data analytics and its impact on internet users
Big data analytics and its impact on internet usersBig data analytics and its impact on internet users
Big data analytics and its impact on internet usersStruggler Ever
 
Smart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart dataSmart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart datacaniceconsulting
 
The Future Of Big Data In Business – 4 Emerging Trends In 2022.pptx
The Future Of Big Data In Business – 4 Emerging Trends In 2022.pptxThe Future Of Big Data In Business – 4 Emerging Trends In 2022.pptx
The Future Of Big Data In Business – 4 Emerging Trends In 2022.pptxArpitGautam20
 
A deep dive into digital lifesytles Allot Communications - Eyal Yaron
A deep dive into digital lifesytles Allot Communications - Eyal YaronA deep dive into digital lifesytles Allot Communications - Eyal Yaron
A deep dive into digital lifesytles Allot Communications - Eyal YaronEyal Yaron
 
Big Data, Big Investment
Big Data, Big InvestmentBig Data, Big Investment
Big Data, Big InvestmentGGV Capital
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataAkshata Humbe
 
IMA meeting accounting for big data
IMA meeting accounting for big dataIMA meeting accounting for big data
IMA meeting accounting for big dataJames Deiotte
 
SoBigData. European Research Infrastructure for Big Data and Social Mining
SoBigData. European Research Infrastructure for Big Data and Social MiningSoBigData. European Research Infrastructure for Big Data and Social Mining
SoBigData. European Research Infrastructure for Big Data and Social MiningResearch Data Alliance
 
Big Data Challenges and solutions.pptx
 Big Data Challenges and solutions.pptx Big Data Challenges and solutions.pptx
Big Data Challenges and solutions.pptxjawaria11
 
Big Data, Analytics and Data Science
Big Data, Analytics and Data ScienceBig Data, Analytics and Data Science
Big Data, Analytics and Data Sciencedlamb3244
 
What is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsWhat is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsShilpaKrishna6
 

Similaire à Minn twdi 9 9 (20)

Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applications
 
Internet of things ecosystem: The quest for value
Internet of things ecosystem: The quest for valueInternet of things ecosystem: The quest for value
Internet of things ecosystem: The quest for value
 
Big data analytics and its impact on internet users
Big data analytics and its impact on internet usersBig data analytics and its impact on internet users
Big data analytics and its impact on internet users
 
Smart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart dataSmart Data Module 1 introduction to big and smart data
Smart Data Module 1 introduction to big and smart data
 
Big data applications
Big data applicationsBig data applications
Big data applications
 
Big data Mining
Big data MiningBig data Mining
Big data Mining
 
The Future Of Big Data In Business – 4 Emerging Trends In 2022.pptx
The Future Of Big Data In Business – 4 Emerging Trends In 2022.pptxThe Future Of Big Data In Business – 4 Emerging Trends In 2022.pptx
The Future Of Big Data In Business – 4 Emerging Trends In 2022.pptx
 
big-data.pdf
big-data.pdfbig-data.pdf
big-data.pdf
 
A deep dive into digital lifesytles Allot Communications - Eyal Yaron
A deep dive into digital lifesytles Allot Communications - Eyal YaronA deep dive into digital lifesytles Allot Communications - Eyal Yaron
A deep dive into digital lifesytles Allot Communications - Eyal Yaron
 
Big Data, Big Investment
Big Data, Big InvestmentBig Data, Big Investment
Big Data, Big Investment
 
Unit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdfUnit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdf
 
Unit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdfUnit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdf
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
IMA meeting accounting for big data
IMA meeting accounting for big dataIMA meeting accounting for big data
IMA meeting accounting for big data
 
Identifying the new frontier of big data as an enabler for T&T industries: Re...
Identifying the new frontier of big data as an enabler for T&T industries: Re...Identifying the new frontier of big data as an enabler for T&T industries: Re...
Identifying the new frontier of big data as an enabler for T&T industries: Re...
 
SoBigData. European Research Infrastructure for Big Data and Social Mining
SoBigData. European Research Infrastructure for Big Data and Social MiningSoBigData. European Research Infrastructure for Big Data and Social Mining
SoBigData. European Research Infrastructure for Big Data and Social Mining
 
Big Data Challenges and solutions.pptx
 Big Data Challenges and solutions.pptx Big Data Challenges and solutions.pptx
Big Data Challenges and solutions.pptx
 
Big Data, Analytics and Data Science
Big Data, Analytics and Data ScienceBig Data, Analytics and Data Science
Big Data, Analytics and Data Science
 
What is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsWhat is big data ? | Big Data Applications
What is big data ? | Big Data Applications
 

Dernier

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Dernier (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Minn twdi 9 9

  • 1. Big Data Analytics Using the information effectively
  • 2. 9/10/2013 | 2 | ©2013 Ciber, Inc. Agenda • Big Data • Making Sense of it all • A Framework of Understanding • Topical information • Non Topical Information • Analytics • Examples • Getting there • Q&A
  • 3. 9/10/2013 | 3 | ©2013 Ciber, Inc. Social Network Diagram • Contextual analytics is one of the hottest areas of interest pertaining to big data today • Smart companies know there is tremendous value in contextual analytics. But aggregating, categorizing, summarizing, exploring and contextualizing unstructured data is a big undertaking.
  • 4. 9/10/2013 | 4 | ©2012 Ciber, Inc. Big Data
  • 5. 9/10/2013 | 5 | ©2013 Ciber, Inc. What is the Big Data market? Source: “Big Data Market Size and Vendor Revenues”, Wikibon, Jeff Kelly, David Valante, David Elgyer, Feb 2013 – actual data through 2011 Acronyms: TBD = to be determined; SI = systems integrator; BPO = business process outsourcing
  • 6. 9/10/2013 | 6 | ©2013 Ciber, Inc. Sample Industry Applications of Big Data Telco Call Detail Record (CDR) analytics for: • Customer service • Network planning • Regulatory compliance Financial Services Transaction analytics for: • Fraud detection • Customer retention • Distribution network planning (Branch, ATM, Call Center) • Regulatory compliance • Consumer card / Merchant activity Utilities Network / Process analytics for: • Grid monitoring / reliability studies • Preventive Maintenance • Power production monitoring / planning Retail Product analytics for: • Market Basket analytics • SKU trending • Competitive analyses • Context-aware buying • Social indicators of brand Healthcare Patient analytics for: • Cost of care reduction • Quality of care improvement • Claims optimization • Service provider consistency • Outcome diagnostics • Regulatory compliance
  • 7. 9/10/2013 | 7 | ©2013 Ciber, Inc. Why Big Data? Insights from Analysis • Time college football products to win customers – WalmartLabs: social media buzz indicates when customers are getting excited about the upcoming season and their team(s). Combined with ShopyCat app provides targeted promos on team items. • Detecting nosocomial infections before they kill infants – Toronto hospital – Nosocomial infections can be life-threatening to premature infants if not treated quickly. Neonatal monitoring with real-time analytics can detect heart beat patterns that identify an infection before symptoms appear.
  • 8. 9/10/2013 | 8 | ©2013 Ciber, Inc. Wal-Mart handles more than 1 million customer transactions every hour which import into databases containing more than 2.5 petabytes Volume Velocity Variety 1M/hour In addition to all procedure, claims and payment systems’ structured data add unstructured data in EMRs, patient monitoring devices, publications, drug structures, social network comments, carrier health sites, post-treatment care records… 80% Exist in the digital universe as of early 2013 1 zettabyte = 1,000 exabytes 1,000,000 petabytes 10^9 terabytes 10^12 gigabytes 2.7zettabytes What Drives Big Data Analytics
  • 9. 9/10/2013 | 9 | ©2013 Ciber, Inc. EngineeringSocial/Mobile The Big Data Ecosystem Enterprise Systems Customer Loyalty & Service Systems Customer Case Files E-MailsAudioImagesProvisioning Systems Variety Veracity Velocity Volume Analysis Business Outcomes Predictive Analytics CEP Operational Control Simulation Social Analytics Digital Marketing WEB Analytics Blogs, Communities
  • 10. 9/10/2013 | 10 | ©2013 Ciber, Inc. Hadoop and other options • A strategy for bringing together hardware and software • What choices are available and how do you choose the best option? • How do I govern it?
  • 11. 9/10/2013 | 11 | ©2013 Ciber, Inc. Big Data Toolscape
  • 12. 9/10/2013 | 12 | ©2013 Ciber, Inc. There are Many Use Cases for a Big Data Platform  Social Media - Product/brand Sentiment analysis  Brand strategy  Market analysis  RFID tracking & analysis  Transaction analysis to create insight- based product/service offerings  Multimodal surveillance  Cyber security  Fraud modeling & detection  Risk modeling & management  Regulatory reporting Innovate New Products at Speed and Scale Know Everything about your Customer  Social media customer sentiment analysis  Promotion optimization  Segmentation  Customer profitability  Click-stream analysis  CDR processing  Multi-channel interaction analysis  Loyalty program analytics  Churn prediction Run Zero Latency Operations  Smart Grid/meter management  Distribution load forecasting  Sales reporting  Inventory & merchandising optimization  Options trading  ICU patient monitoring  Disease surveillance  Transportation network optimization  Store performance  Environmental analysis  Experimental research Instant Awareness of Risk and Fraud Exploit Instrumented Assets  Network analytics  Asset management and predictive issue resolution  Website analytics  IT log analysis Back
  • 13. 9/10/2013 | 13 | ©2013 Ciber, Inc. Processing and Archiving Strategies • Store forever • Selective storage • Throw away after processing
  • 14. 9/10/2013 | 14 | ©2012 Ciber, Inc. Making Sense of it all
  • 15. 9/10/2013 | 15 | ©2013 Ciber, Inc. Making sense of it all • Clarity of purpose • Definition of scope • Allocation of resources • Concrete result expectations • Comparative Analytical Measures (e.g. KPIs) – Rationalization of measures into actionable items and hierarchical groups – Defining predictive analytics workspaces ! ! !
  • 16. 9/10/2013 | 16 | ©2013 Ciber, Inc. Role of the Data Scientist • Creating Intelligent Tagging • Selecting tools for analysis • Defining algorithms and data mining techniques
  • 17. 9/10/2013 | 17 | ©2012 Ciber, Inc. A Framework of Understanding
  • 18. 9/10/2013 | 18 | ©2013 Ciber, Inc. What is Contextualization ? • Context is the interrelated conditions in which something exists or occurs . Helping define context is Environment, Setting, Timeline, Genre • Why is context important? – Consistency needed in returned result sets – The context describes the internal or external “framework” – Internal contextual information is crucial – External contextual information is knowledge that which cannot be gotten from the text of the item itself – Time and resources are wasted in searching irrelevant and non-material information
  • 19. 9/10/2013 | 19 | ©2013 Ciber, Inc. Problems in searching data • Voluminous • Ambiguous meanings • Inconsistent tagging • Multiple item types – text, formatted, PDF, TIFF, graphical, blogs, mashups • Knowledge of what is wanted is required to understand and return the proper result sets • Differentiation is necessary between – Real-time needs (e.g. fraud detection, medical Emergency room procedures) – Near-time needs (sometime in the near timeline) – Relaxed-time (some clearly defined future period)
  • 20. 9/10/2013 | 20 | ©2013 Ciber, Inc. Topical information • Topical information is generally visible in the data stream –Keywords, data ranges, etc.
  • 21. 9/10/2013 | 21 | ©2013 Ciber, Inc. Non-topical information • Has to be retrieved outside the item – Although topic is crucial to the relevance of an item, non-topical criteria plays an important role in the determination of relevance and significance – The identification and use of non-content (or “context”) descriptors is necessary – How widely agreed upon are the values of a given criterion among users (or user groups)?
  • 22. 9/10/2013 | 22 | ©2013 Ciber, Inc. Non-topical information cont’d –What is the degree to which an attribute- value is “public” or “private”? • How useful is each criterion for the search tasks to be addressed by the specific query system? • How easily can a criterion be identified and assigned to an item? • What methods can be applied for refining and speeding retrievals?
  • 23. 9/10/2013 | 23 | ©2013 Ciber, Inc. Descriptors - The defining of disambiguity • Do the content descriptors correspond or relate to non-topical relevance criteria of the system’s users? • Will users see a relationship between their relevance criteria and these descriptors, and use these descriptors in their search queries?
  • 24. 9/10/2013 | 24 | ©2013 Ciber, Inc. Content descriptors • Content descriptors (topical relevance criteria) – “Public” knowledge: • People of similar cultural backgrounds would (more or less) agree on the meanings. However, context descriptors (which can function as non-topical relevance criteria) can vary widely in the degree to which their attribute-values are considered public or private.
  • 25. 9/10/2013 | 25 | ©2013 Ciber, Inc. Public Knowledge Examples • “Has pictures” is a criterion that could be considered “public” as most people could agree on whether or not a document “has pictures”, if given a specific document to evaluate. • On the other hand, the criterion of “Regency Era” is highly situationally dependent - i.e. a limited subset of the public has knowledge of it - (specifically the period between 1811 and 1820, when King George III was deemed unfit to rule and his son - the Prince of Wales - ruled as his proxy as Prince Regent)
  • 26. 9/10/2013 | 26 | ©2013 Ciber, Inc. Genres refine taxonomy • Genre is a “folk typology” • Item categories must enjoy widespread recognition by their intended user groups to qualify as genres. – Examples: Resumes, Ballet, Music, Chemical formulae, statistical results • Groups of people agree on and define Genres by mutual consent (Explicitly and Implicitly) – E.g. Taxonomies (plants, accounting, medical), laws, voting, polls • Genres give rise to sub-genres with increasing granularity – E.g. Music, classical, romantic, new age, atonal – Genres and sub-genres may contain common elements • E.g. classical music and romantic music may have an intersection of data points
  • 27. 9/10/2013 | 27 | ©2013 Ciber, Inc. Genre knowledge • Genre is a type based on purpose, form and content. – E.g. The “resume” genre is for soliciting employment, divided into sections with contextual descriptors • Knowing a particular item’s genre also infers significant things about an item, sometimes enough to a make a judgment regarding the Item’s relevance to an information need – E.g. The phrase “Classically Trained Musician” infers knowledge to read music and understand musical terminology along with additional shades of musical knowledge
  • 28. 9/10/2013 | 28 | ©2012 Ciber, Inc. Analytics
  • 29. 9/10/2013 | 29 | ©2013 Ciber, Inc. Historical Analytics • Presentation of historical data – Dashboards, Drill-downs, interactive reports, static reports – New methods and devices – Identifying the metrics that affect key objectives – Synchronizing those metrics through an organization – Creating user tools to show effects of good (and bad) choices – Tying the financial, operational, and sales worlds together – Analyzing to predict the future – Refining models for accuracy
  • 30. 9/10/2013 | 30 | ©2013 Ciber, Inc. Predictive Analytics • Manipulation of data – Dashboards, Drill-downs, interactive reports – New methods and devices – Varying the metrics that affect key objectives – Synchronizing the impact of metrics through an organization – Creating user tools to show effects of good (and bad) choices – Tying the financial, operational, and sales worlds together – Creating models that show potential future scenarios – Refining models for accuracy using advanced tools and statistics
  • 31. 9/10/2013 | 31 | ©2012 Ciber, Inc. Examples
  • 32. 9/10/2013 | 32 | ©2013 Ciber, Inc. Examples of Harnessing Data Resources Retailer reduces time to run queries by 80% to optimize inventory Stock Exchange cuts queries from 26 hours to 2 minutes on 2 PB Government cuts acoustic analysis from hours to 70 Milliseconds Utility avoids power failures by analyzing 10 PB of data in minutes Telco analyses streaming network data to reduce hardware costs by 90% Hospital analyses streaming vitals to detect illness 24 hours earlier Big data challenges exist in every organization today
  • 33. 9/10/2013 | 33 | ©2013 Ciber, Inc. In Order to Realize New Opportunities, You Need to Think Beyond Traditional Sources of Data Transactional and Application Data Machine Data Social Data  Volume  Structured  Throughput  Velocity  Semi-structured  Ingestion  Variety  Highly unstructured  Veracity Enterprise Content  Variety  Highly unstructured  Volume
  • 34. 9/10/2013 | 34 | ©2013 Ciber, Inc. • Data at rest – oceans • Collection of what has streamed • Web logs, emails, social media • Unstructured documents: forms, claims • Structured data from disparate systems • Data in movement - streams • Twitter / Facebook comments • Stock market data • Sensors: Vital signs of a newly-born Two Sample Types of Big Data
  • 35. 9/10/2013 | 35 | ©2012 Ciber, Inc. Getting there
  • 36. 9/10/2013 | 36 | ©2013 Ciber, Inc. Leveraging Big Data Requires Multiple Platform Capabilities Manage & store huge volume of any data Hadoop File System MapReduce Manage streaming data Stream Computing Analyze unstructured data Text Analytics Engine Data WarehousingStructure and control data Integrate and govern all data sources Integration, Data Quality, Security, Lifecycle Management, MDM Understand and navigate federated big data sources Federated Discovery and Navigation
  • 37. 9/10/2013 | 37 | ©2013 Ciber, Inc. Outcomes Utilizing Big Data Capabilities To Analyze Any Big Data Type With Unique CapabilitiesAchieve Breakthrough Outcomes Content Transactional / Application Data Machine Data Social Media Data Visualization and Discovery Know Everything About Your Customers Run Zero-latency Operations Innovate new products at Speed and Scale Instant Awareness of Fraud and Risk Exploit Instrumented Assets Hadoop Data Warehousing Stream Computing Integration and Governance Text Analytics
  • 38. 9/10/2013 | 38 | ©2013 Ciber, Inc. Big Data Platform and Entry Points 2 – Analyze Raw Rata 5 – Analyze Streaming Data 1 – Unlock Big Data 3 – Simplify your warehouse 4 – Reduce costs with Hadoop
  • 39. 9/10/2013 | 39 | ©2013 Ciber, Inc. Q & A Contact Richard Gristak, Senior Director of Business Intelligence – rgristak@ciber.com
  • 40. 9/10/2013 | 40 | ©2012 Ciber, Inc. Thank you