SlideShare une entreprise Scribd logo
1  sur  6
Télécharger pour lire hors ligne
Summary	
  
Data	
  coding	
  ,	
  analysis,	
  archiving,	
  and	
  
   sharing	
  for	
  open	
  collabora9on	
  


                Richard	
  Aslin	
  
            University	
  of	
  Rochester	
  
1.	
  	
  What	
  is	
  your	
  hypothesis?	
  
•  9/11	
  occurred	
  because	
  the	
  intelligence	
  
   community	
  suffered	
  from	
  a	
  “failure	
  of	
  
   imagina9on”	
  
   –  BoGom-­‐up	
  data	
  mining	
  (“connec9ng	
  the	
  dots”)	
  
   –  Top-­‐down	
  predic9ons	
  (“what	
  are	
  vulnerabili9es??”)	
  
•  Clearly,	
  you	
  need	
  both	
  
•  Must	
  apply	
  approaches	
  itera9vely	
  and	
  repeatedly	
  
2.	
  	
  Observa9ons	
  are	
  DVs	
  
•  Are	
  the	
  paGerns	
  you	
  “see”	
  the	
  ones	
  that	
  are	
  
   “relevant”	
  or	
  causal?	
  	
  
•  Problem	
  of	
  data	
  sparsity	
  and	
  false	
  correla9ons	
  
•  Hypothesis	
  tes9ng	
  requires	
  an	
  experiment	
  
   (manipula9ng	
  an	
  IV)	
  
•  Tension	
  between	
  “ecology”	
  and	
  “control	
  of	
  
   variables”	
  (sociology	
  of	
  preferred	
  methods)	
  
3.	
  	
  How	
  expand	
  hypothesis	
  space?	
  
•  If	
  large/standard	
  datasets,	
  then	
  evalua9on	
  
   becomes	
  stagnant	
  (only	
  evaluated	
  with	
  that	
  
   dataset)	
  
•  If	
  evalua9on	
  only	
  uses	
  standard	
  (sta9s9cal)	
  
   tools,	
  same	
  problem	
  of	
  stagna9on	
  
•  Is	
  clever	
  visualiza9on	
  the	
  key	
  to	
  hypothesis	
  
   forma9on,	
  even	
  if	
  “simple”	
  variables?	
  

               TED	
  talk	
  by	
  Deb	
  Roy	
  from	
  MIT	
  
4.	
  	
  When	
  do	
  you	
  give	
  up?	
  
•  Reliance	
  on	
  visual	
  paGern	
  recogni9on	
  by	
  
   human	
  coder	
  may	
  not	
  reveal	
  relevant	
  
   (informa9ve)	
  features	
  (sound	
  spectrogram	
  
   cannot	
  be	
  “read”)	
  
•  Failure	
  at	
  macro	
  level	
  prompts	
  search	
  for	
  info	
  
   at	
  micro	
  level	
  (fMRI	
  univariate	
  vs.	
  mul9variate	
  
   analysis):	
  need	
  to	
  “drill	
  down”	
  
•  Failure	
  at	
  micro	
  level	
  may	
  indicate	
  
   indeterminacy	
  of	
  causal	
  hierarchy	
  (Fodor)	
  
5.	
  	
  Rules	
  of	
  sharing	
  
•  When	
  does	
  “your”	
  data	
  become	
  accessible	
  by:	
  
    –  Your	
  collaborators	
  
    –  Friends	
  who	
  ask	
  
    –  Strangers	
  
    –  Anyone	
  
•  Who	
  gets	
  credit?	
  
•  How	
  should	
  junior	
  researchers	
  “share”?	
  	
  
   Especially	
  with	
  senior	
  labs	
  that	
  have	
  $$$.	
  

Contenu connexe

Similaire à Aslin.discussion

NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015Jackie Wirz, PhD
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 
Data Science Folk Knowledge
Data Science Folk KnowledgeData Science Folk Knowledge
Data Science Folk KnowledgeKrishna Sankar
 
NeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingNeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingKrzysztof Gorgolewski
 
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Bertram Ludäscher
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Jisc
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelKrzysztof Gorgolewski
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
Altman pitt 2013_v3
Altman pitt 2013_v3Altman pitt 2013_v3
Altman pitt 2013_v3Micah Altman
 
(One Possible) Future of Scholarly Communication
(One Possible) Future of Scholarly Communication(One Possible) Future of Scholarly Communication
(One Possible) Future of Scholarly CommunicationMicah Altman
 
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona Elsevier
 
Editing Digital Imagery in Research: Exploring the Fidelity-to-Artificiality...
Editing Digital Imagery in Research:  Exploring the Fidelity-to-Artificiality...Editing Digital Imagery in Research:  Exploring the Fidelity-to-Artificiality...
Editing Digital Imagery in Research: Exploring the Fidelity-to-Artificiality...Shalin Hai-Jew
 
Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"summersocialwebshop
 

Similaire à Aslin.discussion (20)

NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Data Science Folk Knowledge
Data Science Folk KnowledgeData Science Folk Knowledge
Data Science Folk Knowledge
 
Biswa research
Biswa researchBiswa research
Biswa research
 
NeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingNeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimaging
 
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next level
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
Altman pitt 2013_v3
Altman pitt 2013_v3Altman pitt 2013_v3
Altman pitt 2013_v3
 
(One Possible) Future of Scholarly Communication
(One Possible) Future of Scholarly Communication(One Possible) Future of Scholarly Communication
(One Possible) Future of Scholarly Communication
 
From byte to mind
From byte to mindFrom byte to mind
From byte to mind
 
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
 
Editing Digital Imagery in Research: Exploring the Fidelity-to-Artificiality...
Editing Digital Imagery in Research:  Exploring the Fidelity-to-Artificiality...Editing Digital Imagery in Research:  Exploring the Fidelity-to-Artificiality...
Editing Digital Imagery in Research: Exploring the Fidelity-to-Artificiality...
 
Jsm big-data
Jsm big-dataJsm big-data
Jsm big-data
 
Waves keynote2c
Waves keynote2cWaves keynote2c
Waves keynote2c
 
Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"
 

Plus de Jesse Lingeman

Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral PatternsIts About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral PatternsJesse Lingeman
 
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDASupporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDAJesse Lingeman
 
Messinger.openshapa.091511
Messinger.openshapa.091511Messinger.openshapa.091511
Messinger.openshapa.091511Jesse Lingeman
 
Hoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.pptHoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.pptJesse Lingeman
 
Alibali mult data streams a
Alibali mult data streams aAlibali mult data streams a
Alibali mult data streams aJesse Lingeman
 

Plus de Jesse Lingeman (12)

Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral PatternsIts About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
 
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDASupporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
 
Messinger.openshapa.091511
Messinger.openshapa.091511Messinger.openshapa.091511
Messinger.openshapa.091511
 
Mac whinney macw
Mac whinney macwMac whinney macw
Mac whinney macw
 
Hoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.pptHoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.ppt
 
Gray 110916 ns-fwkshp
Gray 110916 ns-fwkshpGray 110916 ns-fwkshp
Gray 110916 ns-fwkshp
 
Davis kean.open shapa
Davis kean.open shapaDavis kean.open shapa
Davis kean.open shapa
 
Borner links
Borner linksBorner links
Borner links
 
Altman links
Altman linksAltman links
Altman links
 
Alibali mult data streams a
Alibali mult data streams aAlibali mult data streams a
Alibali mult data streams a
 
Test1
Test1Test1
Test1
 
Test2
Test2Test2
Test2
 

Dernier

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 

Dernier (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Aslin.discussion

  • 1. Summary   Data  coding  ,  analysis,  archiving,  and   sharing  for  open  collabora9on   Richard  Aslin   University  of  Rochester  
  • 2. 1.    What  is  your  hypothesis?   •  9/11  occurred  because  the  intelligence   community  suffered  from  a  “failure  of   imagina9on”   –  BoGom-­‐up  data  mining  (“connec9ng  the  dots”)   –  Top-­‐down  predic9ons  (“what  are  vulnerabili9es??”)   •  Clearly,  you  need  both   •  Must  apply  approaches  itera9vely  and  repeatedly  
  • 3. 2.    Observa9ons  are  DVs   •  Are  the  paGerns  you  “see”  the  ones  that  are   “relevant”  or  causal?     •  Problem  of  data  sparsity  and  false  correla9ons   •  Hypothesis  tes9ng  requires  an  experiment   (manipula9ng  an  IV)   •  Tension  between  “ecology”  and  “control  of   variables”  (sociology  of  preferred  methods)  
  • 4. 3.    How  expand  hypothesis  space?   •  If  large/standard  datasets,  then  evalua9on   becomes  stagnant  (only  evaluated  with  that   dataset)   •  If  evalua9on  only  uses  standard  (sta9s9cal)   tools,  same  problem  of  stagna9on   •  Is  clever  visualiza9on  the  key  to  hypothesis   forma9on,  even  if  “simple”  variables?   TED  talk  by  Deb  Roy  from  MIT  
  • 5. 4.    When  do  you  give  up?   •  Reliance  on  visual  paGern  recogni9on  by   human  coder  may  not  reveal  relevant   (informa9ve)  features  (sound  spectrogram   cannot  be  “read”)   •  Failure  at  macro  level  prompts  search  for  info   at  micro  level  (fMRI  univariate  vs.  mul9variate   analysis):  need  to  “drill  down”   •  Failure  at  micro  level  may  indicate   indeterminacy  of  causal  hierarchy  (Fodor)  
  • 6. 5.    Rules  of  sharing   •  When  does  “your”  data  become  accessible  by:   –  Your  collaborators   –  Friends  who  ask   –  Strangers   –  Anyone   •  Who  gets  credit?   •  How  should  junior  researchers  “share”?     Especially  with  senior  labs  that  have  $$$.