SlideShare une entreprise Scribd logo
1  sur  7
Télécharger pour lire hors ligne
What about Little Data?
Big Data Forum Lightning Brief
Matthew Carroll, GM of 42six
March 19, 2014
Who the heck are we?
2!
Technologies
Accumulo (active contributor)
Ozone (active contributor)
Storm
Niagara Files
Apache Kafka
Titan (active contributor)
CouchDB
MongoDB (active contributor)
Cassandra
HBase
Neo4J
OpenStack
Puppet
Programs
RTRG
Intelink
Kickawesome
* Ozone & Apps Mall
* Ozone Mobile (DISA)
* Red Disk (Army-INSCOM)
* ORION (DIA)
FireTruck (DIA-DCTC)
Coral Reef (Army/NMEC)
WE ARE CSC’S BIG DATA SERVICES ARM PROVIDING
OPERATIONALLY FOCUSED SOLUTIONS AND CONSULTING TO THE
U.S. INTELLIGENCE COMMUNITY, INTERAGENCY & PRIVATE SECTOR.
Big Data’s Little Brother
3!
Big Data is what organizations know
about entities — be they people,
places, things, etc. Data is
aggregated from a large number of
sources, assembled into a massive
data store, and analyzed for
patterns. The results are more
accurate predictions, more targeted
communications, and more
personalized services.
Good For: General questions, recommendations
for all users, SNA, anomaly detection
Little Data is what we know about
ourselves. What we search. Who
we know. What we care about. How
we spend our time. We’ve always
had a sense for these things — after
all, it’s our jobs. But thanks to the
combination of social and cloud
technologies, it’s easier than ever
to gain insight into our own
behavior.
Good For: Individual recommendations,
personal goals, efficiency analysis, individual
pattern detection
Big Data gets all the attention but there is value, if not more, in Little
Data…
So how do I collect Little Data?
4!
AS THE GOVERNMENT MIGRATES TO DISTIRBUTED SYSTEMS,
SPECIFICALLY PaaS, ENGINEERING TEAMS NEED TO FOCUS ON
INDIVIDUAL BASED LOGGING & ANALYTICS.
1.  Simple APIs to insert logging of user activity including time in
app, how many searches executed, common search terms, etc.
2.  Insert “actions” into traditional web apps like was this report
interesting or value percentage.
3.  Hook into task management systems when you can…track what
the user is working on, where they are and for how long.
4.  Build personalized analytics pages with time and space
visualizations to help users see context in their activity
Some Examples
5!
Graph Clustering: What are the groups
that make up my activity?
Geospatial: What are the locations
associated with my activity?
Temporal Activity: When are you
active and what are you doing?
Textual: What do you search, what are
your common topics?
Ok I have my Little Data
6!
ONCE HOOKED INTO THE ARCHITECTURE AND USERS HAVE
ACCESS TO THEIR DATA, SETUP PERSONALIZED ANALYTIC
ENGINES FOR REMINDERS AND AUTOMATED ALERTING
1.  Design GUI for event-based processing first, i.e. node.js, etc. Think through
dynamic updates based on user actions not necessarily on new data in the
system
2.  Think like IFTTT – set the stage for the user to define desired personalized
goals, e.g. 20% search, 50% read, 30% write.
3.  Design data models with users in mind. Index data in a way to key off of
users. Treat users as first class citizens.
4.  Think about useful information to guide general user experience, i.e.
reminder that you typically search for “X” term when in this app on Fridays.
Questions!
Contact Us
Matthew Carroll
GM, 42six
matt@42six.com
@mcarroll_

Contenu connexe

Tendances

Big data Big Analytics
Big data Big AnalyticsBig data Big Analytics
Big data Big Analytics
Ajay Ohri
 

Tendances (20)

Big Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning GuruBig Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning Guru
 
Big Data Tutorial V4
Big Data Tutorial V4Big Data Tutorial V4
Big Data Tutorial V4
 
Big data Ppt
Big data PptBig data Ppt
Big data Ppt
 
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyGuest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
 
9 facts about statice's data anonymization solution
9 facts about statice's data anonymization solution9 facts about statice's data anonymization solution
9 facts about statice's data anonymization solution
 
BIG DATA
BIG DATABIG DATA
BIG DATA
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry Council
 
Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabati
 
Swift 20slides Ned 2010
Swift 20slides Ned 2010Swift 20slides Ned 2010
Swift 20slides Ned 2010
 
Big data abstract
Big data abstractBig data abstract
Big data abstract
 
Big Data Analysis Patterns - TriHUG 6/27/2013
Big Data Analysis Patterns - TriHUG 6/27/2013Big Data Analysis Patterns - TriHUG 6/27/2013
Big Data Analysis Patterns - TriHUG 6/27/2013
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big Data Course - BigData HUB
Big Data Course - BigData HUBBig Data Course - BigData HUB
Big Data Course - BigData HUB
 
Data, AI, and Tokens: Ocean Protocol
Data, AI, and Tokens: Ocean ProtocolData, AI, and Tokens: Ocean Protocol
Data, AI, and Tokens: Ocean Protocol
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big Data
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data Big Analytics
Big data Big AnalyticsBig data Big Analytics
Big data Big Analytics
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 

Similaire à Big data forum 19 march 2014

INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_public
Attila Barta
 
Activity Streaming as Information X-Docking
Activity Streaming as Information X-DockingActivity Streaming as Information X-Docking
Activity Streaming as Information X-Docking
Kai Riemer
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
Rohit Dubey
 

Similaire à Big data forum 19 march 2014 (20)

INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_public
 
JIMS Rohini IT Flash Monthly Newsletter - October Issue
JIMS Rohini IT Flash Monthly Newsletter  - October IssueJIMS Rohini IT Flash Monthly Newsletter  - October Issue
JIMS Rohini IT Flash Monthly Newsletter - October Issue
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
Database Essay
Database EssayDatabase Essay
Database Essay
 
Big data with hadoop
Big data with hadoopBig data with hadoop
Big data with hadoop
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Activity Streaming as Information X-Docking
Activity Streaming as Information X-DockingActivity Streaming as Information X-Docking
Activity Streaming as Information X-Docking
 
Big data
Big dataBig data
Big data
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattoo
 
Big Data Technologies.pdf
Big Data Technologies.pdfBig Data Technologies.pdf
Big Data Technologies.pdf
 
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
 
Big Data
Big DataBig Data
Big Data
 
1
11
1
 

Dernier

Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 

Dernier (20)

Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The InsideCollecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
Collecting & Temporal Analysis of Behavioral Web Data - Tales From The Inside
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 

Big data forum 19 march 2014

  • 1. What about Little Data? Big Data Forum Lightning Brief Matthew Carroll, GM of 42six March 19, 2014
  • 2. Who the heck are we? 2! Technologies Accumulo (active contributor) Ozone (active contributor) Storm Niagara Files Apache Kafka Titan (active contributor) CouchDB MongoDB (active contributor) Cassandra HBase Neo4J OpenStack Puppet Programs RTRG Intelink Kickawesome * Ozone & Apps Mall * Ozone Mobile (DISA) * Red Disk (Army-INSCOM) * ORION (DIA) FireTruck (DIA-DCTC) Coral Reef (Army/NMEC) WE ARE CSC’S BIG DATA SERVICES ARM PROVIDING OPERATIONALLY FOCUSED SOLUTIONS AND CONSULTING TO THE U.S. INTELLIGENCE COMMUNITY, INTERAGENCY & PRIVATE SECTOR.
  • 3. Big Data’s Little Brother 3! Big Data is what organizations know about entities — be they people, places, things, etc. Data is aggregated from a large number of sources, assembled into a massive data store, and analyzed for patterns. The results are more accurate predictions, more targeted communications, and more personalized services. Good For: General questions, recommendations for all users, SNA, anomaly detection Little Data is what we know about ourselves. What we search. Who we know. What we care about. How we spend our time. We’ve always had a sense for these things — after all, it’s our jobs. But thanks to the combination of social and cloud technologies, it’s easier than ever to gain insight into our own behavior. Good For: Individual recommendations, personal goals, efficiency analysis, individual pattern detection Big Data gets all the attention but there is value, if not more, in Little Data…
  • 4. So how do I collect Little Data? 4! AS THE GOVERNMENT MIGRATES TO DISTIRBUTED SYSTEMS, SPECIFICALLY PaaS, ENGINEERING TEAMS NEED TO FOCUS ON INDIVIDUAL BASED LOGGING & ANALYTICS. 1.  Simple APIs to insert logging of user activity including time in app, how many searches executed, common search terms, etc. 2.  Insert “actions” into traditional web apps like was this report interesting or value percentage. 3.  Hook into task management systems when you can…track what the user is working on, where they are and for how long. 4.  Build personalized analytics pages with time and space visualizations to help users see context in their activity
  • 5. Some Examples 5! Graph Clustering: What are the groups that make up my activity? Geospatial: What are the locations associated with my activity? Temporal Activity: When are you active and what are you doing? Textual: What do you search, what are your common topics?
  • 6. Ok I have my Little Data 6! ONCE HOOKED INTO THE ARCHITECTURE AND USERS HAVE ACCESS TO THEIR DATA, SETUP PERSONALIZED ANALYTIC ENGINES FOR REMINDERS AND AUTOMATED ALERTING 1.  Design GUI for event-based processing first, i.e. node.js, etc. Think through dynamic updates based on user actions not necessarily on new data in the system 2.  Think like IFTTT – set the stage for the user to define desired personalized goals, e.g. 20% search, 50% read, 30% write. 3.  Design data models with users in mind. Index data in a way to key off of users. Treat users as first class citizens. 4.  Think about useful information to guide general user experience, i.e. reminder that you typically search for “X” term when in this app on Fridays.
  • 7. Questions! Contact Us Matthew Carroll GM, 42six matt@42six.com @mcarroll_