SlideShare une entreprise Scribd logo
1  sur  15
S Anand




VISUALISING DATA
Gramener
A data analytics and visualisation company




We handle terabyte-size data   via non-traditional analytics and visualise it in real-time.




 Gramener visualises           Gramener transforms your data into concise dashboards
                               that make your business problem & solution visually obvious.
        your data              We help you find insights quickly, based on cognitive research,
                               and our visualisations guide you towards actionable decisions.
WHY VISUALISE?

Consider an Organizational          2010       Bangalore      Delhi       Hyderabad       Mumbai
Sales report shown alongside       Month      Price Sales Price Sales Price Sales Price Sales
                                       Jan    10.0   8.04   10.0   9.14   10.0   7.46    8.0   6.58
It shows performance of 4
                                       Feb     8.0   6.95    8.0   8.14    8.0   6.77    8.0   5.76
branches with average price
and sales across 4 cities              Mar    13.0   7.58   13.0   8.74   13.0 12.74     8.0   7.71
                                       Apr     9.0   8.81    9.0   8.77    9.0   7.11    8.0   8.84
Each of the branches change            May    11.0   8.33   11.0   9.26   11.0   7.81    8.0   8.47
prices every month with a              Jun    14.0   9.96   14.0   8.10   14.0   8.84    8.0   7.04
corresponding change in the             Jul    6.0   7.24    6.0   6.13    6.0   6.08    8.0   5.25
sales value
                                       Aug     4.0   4.26    4.0   3.10    4.0   5.39   19.0 12.50

Basic analytics of these               Sep    12.0 10.84    12.0   9.13   12.0   8.15    8.0   5.56
numbers reveal consistent              Oct     7.0   4.82    7.0   7.26    7.0   6.42    8.0   7.91
performance    across   4              Nov     5.0   5.68    5.0   4.74    5.0   5.73    8.0   6.89
branches.                          Average     9.0   7.50    9.0   7.50    9.0   7.50    9.0   7.50

Further, these sales figures       Variance   10.0   3.75   10.0   3.75   10.0   3.75   10.0   3.75
have a consistent Correlation
and Linear regression across all
cities
BECAUSE NUMBERS DON’T TELL THE FULL STORY

Plotting the same data
shows markedly different
behaviour.

Bangalore    sales      has
generally increased     with
price.

Hyderabad has a perfect
increase in sales with price,
except for one aberration.

Delhi, however, shows a
decline in sales as price is
increased beyond a certain
point.

Mumbai sales fluctuated a
lot despite a constant price,
except for one month.
DETECTING FRAUD




                 “
                     We know meter readings are
                     incorrect, for various reasons.
                     We don’t, however, have the
                     concrete proof we need to start the
                     process of meter reading
ENERGY UTILITY       automation.
                     Part of our problem is the volume
                     of data that needs to be analysed.
                     The other is the inexperience in
                     tools or analyses to identify such
                     patterns.
This plot shows the frequency of all meter readings from
  Why would                                                    Apr-2010 to Mar-2011. An unusually large number of
these happen?
                                                                 readings are aligned with the tariff slab boundaries.




This clearly shows            Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11 Mar-11
collusion of some form          217     219    200   200     200    200   200     200    200    350    200   200
with the customers.             250     200    200   200     201    200   200     200    250    200    200   150
                                250     150    150   200     200    200   200     200    200    200    200   150
This happens with specific      150     200    200   200     200    200   200     200    200    200    200     50
customers, not randomly.        200     200    200   150     180    150     50    100     50     70    100   100
Here are such customers’        100     100    100   100     100    100   100     100    100    100    110   100
                                100     150    123   123      50    100     50    100    100    100    100   100
meter readings.
                                   0    111    100   100     100    100   100     100    100    100     50     50
                                   0    100     27   100      50    100   100     100    100    100     70   100
If we define the “extent of
                                   1      1      1   100      99     50   100     100    100    100    100   100
fraud” as the percentage
excess of the 100 unit
meter reading,      Section Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11 Mar-11
the value varies Section 1    70%    97% 136% 65%        110% 116% 121% 107% 114%            88%    74% 109%
considerably        Section 2 66%    92% New section
                                            66% 87%       70%     64% is
                                                                  … and 63%    50%    58%    38%    41%    54%
                                        manager arrives        transferred50%
                                                                           out
across sections, Section 3    90%    46%    47% 43%       28%     31%          32%    19%    38%     8%    34%
                    Section 4 44%    24%    36% 39%       21%     18%     24%  49%    56%    44%    31%    14%
and time
                  Section 5     4%     63%   -27%   20%    41%     82%     26%      34%     43%     2%     37%     15%
                  Section 6    18%     23%    30%   21%    28%     33%     39%      41%     39%    18%      0%     33%
… with some
                  Section 7    36%     51%    33%   33%    27%     35%     10%      39%     12%     5%     15%     14%
explainable       Section 8    22%     21%    28%   12%    24%     27%     10%      31%     13%    11%     22%     17%
anamolies.        Section 9    19%     35%    14%    9%    16%     32%     37%      12%      9%     5%     -3%     11%
MONITORING COSTS




           “
               Our raw material cost varies
               considerably across farms, though
               we share best practices.
               We have over 5,000 farms. The
CONTRACT       raw material cost report is a 75-
               page Excel report that no one
 FARMING       reads.
               Also, we gain no insights as to how
               the productivity changes over time
PREDICTING MARKS




            What determines a child’s marks?
            Do girls score better than boys?
            Does the choice of subject matter?

EDUCATION   Does the medium of instruction matter?
            Does community or religion matter?
            Does their birthday matter?
            Does the first letter of their name matter?
… and peaks
Based on the results of the 20 lakh                                      for Sep-borns
students taking the Class XII exams                     The marks
at Tamil Nadu over the last 3 years,                  shoot up for Aug
                                                           borns
it appears that the month you were
born in can make a difference of as
much as 120 marks out of 1,200.                                                 120 marks out of
                                                                                1200 explainable
                                                                                by month of birth




                                                                     June borns
                                                                   score the lowest




                                              An identical pattern was observed in 2009 and 2010…
“It’s simply that in Canada the eligibility
cutoff for age-class hockey is January 1. A
boy who turns ten on January 2, then,
could be playing alongside someone who
doesn’t turn ten until the end of the year—
and at that age, in preadolescence, a
twelve-month gap in age represents an
enormous difference in physical maturity.”

        -- Malcolm Gladwell, Outliers         … and across districts, gender, subjects, and class X & XII.
SECURITIES   FINDING PATTERNS




             Which securities move together?
             How should I diversify?
             What should I sell to reduce risk?
             What’s a reliable predictor of a security?
68% correlation
              between AUD & EUR



Plot of 6 month daily
 AUD - EUR values                    … that move
                                  counter-cyclically to
                                        indices



                                  Block of correlated
                                      currencies

         … clustered
         hierarchically
VISUALISING CHANGE




            What was the weather in India like…
EDUCATION
 WEATHER     THE LAST 100 YEARS?
VIDEO

http://youtu.be/WT0Aq41BaOQ
www.gramener.com
blog.gramener.com

Contenu connexe

Similaire à Visualising Data: ISB Solstice 2011

Result sensex valuation october 2011
Result sensex valuation october 2011Result sensex valuation october 2011
Result sensex valuation october 2011
Tanesh Gagnani
 
Market neutral model daily model - long short
Market neutral model   daily model - long shortMarket neutral model   daily model - long short
Market neutral model daily model - long short
Nadav Goldman
 
Beauty - cosmetics - personal care category review february 2013
Beauty - cosmetics - personal care category review february 2013Beauty - cosmetics - personal care category review february 2013
Beauty - cosmetics - personal care category review february 2013
HIPERCOM
 
Market neutral model daily model - equities vs. market
Market neutral model   daily model - equities vs. marketMarket neutral model   daily model - equities vs. market
Market neutral model daily model - equities vs. market
Nadav Goldman
 
Comparativo estudios de_opinion
Comparativo estudios de_opinionComparativo estudios de_opinion
Comparativo estudios de_opinion
Martin Triana
 
Buzz Numbers Actionable Business Intelligence From Online Conversations
Buzz Numbers   Actionable Business Intelligence From Online ConversationsBuzz Numbers   Actionable Business Intelligence From Online Conversations
Buzz Numbers Actionable Business Intelligence From Online Conversations
Nick Court
 
What is the future of instore brand performance consumer behaviour
What is the future of instore brand performance  consumer behaviourWhat is the future of instore brand performance  consumer behaviour
What is the future of instore brand performance consumer behaviour
LindaSidia
 
Distimo Monthly Report Webinar - The Impact of Price Changes
Distimo Monthly Report Webinar - The Impact of Price ChangesDistimo Monthly Report Webinar - The Impact of Price Changes
Distimo Monthly Report Webinar - The Impact of Price Changes
Distimo
 

Similaire à Visualising Data: ISB Solstice 2011 (20)

Data analytics in decision making
Data analytics in decision makingData analytics in decision making
Data analytics in decision making
 
The value of storytelling through data
The value of storytelling through dataThe value of storytelling through data
The value of storytelling through data
 
Result sensex valuation october 2011
Result sensex valuation october 2011Result sensex valuation october 2011
Result sensex valuation october 2011
 
Market neutral model daily model - long short
Market neutral model   daily model - long shortMarket neutral model   daily model - long short
Market neutral model daily model - long short
 
Humanizing Data Storytelling for Greater Business Impact
Humanizing Data Storytelling for Greater Business ImpactHumanizing Data Storytelling for Greater Business Impact
Humanizing Data Storytelling for Greater Business Impact
 
Beauty - cosmetics - personal care category review february 2013
Beauty - cosmetics - personal care category review february 2013Beauty - cosmetics - personal care category review february 2013
Beauty - cosmetics - personal care category review february 2013
 
Market neutral model daily model - equities vs. market
Market neutral model   daily model - equities vs. marketMarket neutral model   daily model - equities vs. market
Market neutral model daily model - equities vs. market
 
Sensex valuation, Aug 2011
Sensex valuation, Aug 2011Sensex valuation, Aug 2011
Sensex valuation, Aug 2011
 
St louis parcel negotiating 2012
St louis parcel negotiating 2012St louis parcel negotiating 2012
St louis parcel negotiating 2012
 
Data Visualisation for Data journalism at Melbourne University
Data Visualisation for Data journalism at Melbourne UniversityData Visualisation for Data journalism at Melbourne University
Data Visualisation for Data journalism at Melbourne University
 
Comparativo estudios de_opinion
Comparativo estudios de_opinionComparativo estudios de_opinion
Comparativo estudios de_opinion
 
T & L MATHS ACTIVITY : VOLUME YR 2
T & L MATHS ACTIVITY : VOLUME YR 2T & L MATHS ACTIVITY : VOLUME YR 2
T & L MATHS ACTIVITY : VOLUME YR 2
 
074 ehs110912 f
074 ehs110912 f074 ehs110912 f
074 ehs110912 f
 
Why running makes you smarter
Why running makes you smarterWhy running makes you smarter
Why running makes you smarter
 
It's More Fun Investing in the Philippines
It's More Fun Investing in the PhilippinesIt's More Fun Investing in the Philippines
It's More Fun Investing in the Philippines
 
Buzz Numbers Actionable Business Intelligence From Online Conversations
Buzz Numbers   Actionable Business Intelligence From Online ConversationsBuzz Numbers   Actionable Business Intelligence From Online Conversations
Buzz Numbers Actionable Business Intelligence From Online Conversations
 
2017 YBC Presentation - M&W
2017 YBC Presentation - M&W2017 YBC Presentation - M&W
2017 YBC Presentation - M&W
 
What is the future of instore brand performance consumer behaviour
What is the future of instore brand performance  consumer behaviourWhat is the future of instore brand performance  consumer behaviour
What is the future of instore brand performance consumer behaviour
 
Distimo Monthly Report Webinar - The Impact of Price Changes
Distimo Monthly Report Webinar - The Impact of Price ChangesDistimo Monthly Report Webinar - The Impact of Price Changes
Distimo Monthly Report Webinar - The Impact of Price Changes
 
Planning Forum - UCAS & Planners
Planning Forum - UCAS & PlannersPlanning Forum - UCAS & Planners
Planning Forum - UCAS & Planners
 

Plus de Gramener

Plus de Gramener (20)

6 Methods to Improve Your Manufacturing Process with Computer Vision
6 Methods to Improve Your Manufacturing Process with Computer Vision6 Methods to Improve Your Manufacturing Process with Computer Vision
6 Methods to Improve Your Manufacturing Process with Computer Vision
 
Detecting Manufacturing Defects with Computer Vision
Detecting Manufacturing Defects with Computer VisionDetecting Manufacturing Defects with Computer Vision
Detecting Manufacturing Defects with Computer Vision
 
How to Identify the Right Key Opinion Leaders (KOLs) in Pharma & Healthcare
How to Identify the Right Key Opinion Leaders (KOLs) in Pharma  & HealthcareHow to Identify the Right Key Opinion Leaders (KOLs) in Pharma  & Healthcare
How to Identify the Right Key Opinion Leaders (KOLs) in Pharma & Healthcare
 
Automated Barcode Generation System in Manufacturing
Automated Barcode Generation System in ManufacturingAutomated Barcode Generation System in Manufacturing
Automated Barcode Generation System in Manufacturing
 
The Role of Technology to Save Biodiversity
The Role of Technology to Save BiodiversityThe Role of Technology to Save Biodiversity
The Role of Technology to Save Biodiversity
 
Enable Storytelling with Power BI & Comicgen Plugin
Enable Storytelling with Power BI  & Comicgen PluginEnable Storytelling with Power BI  & Comicgen Plugin
Enable Storytelling with Power BI & Comicgen Plugin
 
The Most Effective Method For Selecting Data Science Projects
The Most Effective Method For Selecting Data Science ProjectsThe Most Effective Method For Selecting Data Science Projects
The Most Effective Method For Selecting Data Science Projects
 
Low Code Platform To Build Data & AI Products
Low Code Platform To Build Data & AI ProductsLow Code Platform To Build Data & AI Products
Low Code Platform To Build Data & AI Products
 
5 Key Foundations To Build An Effective CX Program
5 Key Foundations To Build An Effective CX Program5 Key Foundations To Build An Effective CX Program
5 Key Foundations To Build An Effective CX Program
 
Using Power BI To Improve Media Buying & Ad Performance
Using Power BI To Improve Media Buying & Ad PerformanceUsing Power BI To Improve Media Buying & Ad Performance
Using Power BI To Improve Media Buying & Ad Performance
 
Recession Proofing With Data : Webinar
Recession Proofing With Data : WebinarRecession Proofing With Data : Webinar
Recession Proofing With Data : Webinar
 
Engage Your Audience With PowerPoint Decks: Webinar
Engage Your Audience With PowerPoint Decks: WebinarEngage Your Audience With PowerPoint Decks: Webinar
Engage Your Audience With PowerPoint Decks: Webinar
 
Structure Your Data Science Teams For Best Outcomes
Structure Your Data Science Teams For Best OutcomesStructure Your Data Science Teams For Best Outcomes
Structure Your Data Science Teams For Best Outcomes
 
Dawn Of Geospatial AI - Webinar
Dawn Of Geospatial AI - WebinarDawn Of Geospatial AI - Webinar
Dawn Of Geospatial AI - Webinar
 
5 Steps To Become A Data-Driven Organization : Webinar
5 Steps To Become A Data-Driven Organization : Webinar5 Steps To Become A Data-Driven Organization : Webinar
5 Steps To Become A Data-Driven Organization : Webinar
 
5 Steps To Measure ROI On Your Data Science Initiatives - Webinar
 5 Steps To Measure ROI On Your Data Science Initiatives - Webinar 5 Steps To Measure ROI On Your Data Science Initiatives - Webinar
5 Steps To Measure ROI On Your Data Science Initiatives - Webinar
 
Saving Lives with Geospatial AI - Pycon Indonesia 2020
Saving Lives with Geospatial AI - Pycon Indonesia 2020Saving Lives with Geospatial AI - Pycon Indonesia 2020
Saving Lives with Geospatial AI - Pycon Indonesia 2020
 
Driving Transformation in Industries with Artificial Intelligence (AI)
Driving Transformation in Industries with Artificial Intelligence (AI)Driving Transformation in Industries with Artificial Intelligence (AI)
Driving Transformation in Industries with Artificial Intelligence (AI)
 
The Art of Storytelling Using Data Science
The Art of Storytelling Using Data ScienceThe Art of Storytelling Using Data Science
The Art of Storytelling Using Data Science
 
Storyfying your Data: How to go from Data to Insights to Stories
Storyfying your Data: How to go from Data to Insights to StoriesStoryfying your Data: How to go from Data to Insights to Stories
Storyfying your Data: How to go from Data to Insights to Stories
 

Dernier

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Dernier (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 

Visualising Data: ISB Solstice 2011

  • 2. Gramener A data analytics and visualisation company We handle terabyte-size data via non-traditional analytics and visualise it in real-time. Gramener visualises Gramener transforms your data into concise dashboards that make your business problem & solution visually obvious. your data We help you find insights quickly, based on cognitive research, and our visualisations guide you towards actionable decisions.
  • 3. WHY VISUALISE? Consider an Organizational 2010 Bangalore Delhi Hyderabad Mumbai Sales report shown alongside Month Price Sales Price Sales Price Sales Price Sales Jan 10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58 It shows performance of 4 Feb 8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76 branches with average price and sales across 4 cities Mar 13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71 Apr 9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84 Each of the branches change May 11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47 prices every month with a Jun 14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04 corresponding change in the Jul 6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25 sales value Aug 4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50 Basic analytics of these Sep 12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56 numbers reveal consistent Oct 7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91 performance across 4 Nov 5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89 branches. Average 9.0 7.50 9.0 7.50 9.0 7.50 9.0 7.50 Further, these sales figures Variance 10.0 3.75 10.0 3.75 10.0 3.75 10.0 3.75 have a consistent Correlation and Linear regression across all cities
  • 4. BECAUSE NUMBERS DON’T TELL THE FULL STORY Plotting the same data shows markedly different behaviour. Bangalore sales has generally increased with price. Hyderabad has a perfect increase in sales with price, except for one aberration. Delhi, however, shows a decline in sales as price is increased beyond a certain point. Mumbai sales fluctuated a lot despite a constant price, except for one month.
  • 5. DETECTING FRAUD “ We know meter readings are incorrect, for various reasons. We don’t, however, have the concrete proof we need to start the process of meter reading ENERGY UTILITY automation. Part of our problem is the volume of data that needs to be analysed. The other is the inexperience in tools or analyses to identify such patterns.
  • 6. This plot shows the frequency of all meter readings from Why would Apr-2010 to Mar-2011. An unusually large number of these happen? readings are aligned with the tariff slab boundaries. This clearly shows Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11 Mar-11 collusion of some form 217 219 200 200 200 200 200 200 200 350 200 200 with the customers. 250 200 200 200 201 200 200 200 250 200 200 150 250 150 150 200 200 200 200 200 200 200 200 150 This happens with specific 150 200 200 200 200 200 200 200 200 200 200 50 customers, not randomly. 200 200 200 150 180 150 50 100 50 70 100 100 Here are such customers’ 100 100 100 100 100 100 100 100 100 100 110 100 100 150 123 123 50 100 50 100 100 100 100 100 meter readings. 0 111 100 100 100 100 100 100 100 100 50 50 0 100 27 100 50 100 100 100 100 100 70 100 If we define the “extent of 1 1 1 100 99 50 100 100 100 100 100 100 fraud” as the percentage excess of the 100 unit meter reading, Section Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11 Mar-11 the value varies Section 1 70% 97% 136% 65% 110% 116% 121% 107% 114% 88% 74% 109% considerably Section 2 66% 92% New section 66% 87% 70% 64% is … and 63% 50% 58% 38% 41% 54% manager arrives transferred50% out across sections, Section 3 90% 46% 47% 43% 28% 31% 32% 19% 38% 8% 34% Section 4 44% 24% 36% 39% 21% 18% 24% 49% 56% 44% 31% 14% and time Section 5 4% 63% -27% 20% 41% 82% 26% 34% 43% 2% 37% 15% Section 6 18% 23% 30% 21% 28% 33% 39% 41% 39% 18% 0% 33% … with some Section 7 36% 51% 33% 33% 27% 35% 10% 39% 12% 5% 15% 14% explainable Section 8 22% 21% 28% 12% 24% 27% 10% 31% 13% 11% 22% 17% anamolies. Section 9 19% 35% 14% 9% 16% 32% 37% 12% 9% 5% -3% 11%
  • 7. MONITORING COSTS “ Our raw material cost varies considerably across farms, though we share best practices. We have over 5,000 farms. The CONTRACT raw material cost report is a 75- page Excel report that no one FARMING reads. Also, we gain no insights as to how the productivity changes over time
  • 8.
  • 9. PREDICTING MARKS What determines a child’s marks? Do girls score better than boys? Does the choice of subject matter? EDUCATION Does the medium of instruction matter? Does community or religion matter? Does their birthday matter? Does the first letter of their name matter?
  • 10. … and peaks Based on the results of the 20 lakh for Sep-borns students taking the Class XII exams The marks at Tamil Nadu over the last 3 years, shoot up for Aug borns it appears that the month you were born in can make a difference of as much as 120 marks out of 1,200. 120 marks out of 1200 explainable by month of birth June borns score the lowest An identical pattern was observed in 2009 and 2010… “It’s simply that in Canada the eligibility cutoff for age-class hockey is January 1. A boy who turns ten on January 2, then, could be playing alongside someone who doesn’t turn ten until the end of the year— and at that age, in preadolescence, a twelve-month gap in age represents an enormous difference in physical maturity.” -- Malcolm Gladwell, Outliers … and across districts, gender, subjects, and class X & XII.
  • 11. SECURITIES FINDING PATTERNS Which securities move together? How should I diversify? What should I sell to reduce risk? What’s a reliable predictor of a security?
  • 12. 68% correlation between AUD & EUR Plot of 6 month daily AUD - EUR values … that move counter-cyclically to indices Block of correlated currencies … clustered hierarchically
  • 13. VISUALISING CHANGE What was the weather in India like… EDUCATION WEATHER THE LAST 100 YEARS?