SlideShare une entreprise Scribd logo
1  sur  74
Télécharger pour lire hors ligne
made available by Paul Keller under a CC-BY-2.0 license

Thursday, 14 July 2011
well .. sort of.

                         made available by Paul Keller under a CC-BY-2.0 license

Thursday, 14 July 2011
data in the digital age

                                   kaitlin thaney
                         austin big data user group, 13 july 2011
                                       austin, texas


Thursday, 14 July 2011
xi. background




Thursday, 14 July 2011
about me
              expat
           technologist
           open science
                            about me

                         sameAs

Thursday, 14 July 2011
Thursday, 14 July 2011
technology company
                            publisher link
                          london, nyc, tokyo



Thursday, 14 July 2011
investment arm
                          incubator role
                           in-house dev


Thursday, 14 July 2011
tiered approach
                             build to scale
                         researcher-focused


Thursday, 14 July 2011
<text>



Thursday, 14 July 2011
about


                                 1815
Thursday, 14 July 2011
first geological map
                                 “strata”
                               reputation


Thursday, 14 July 2011
data ...
                         metadata / markup
                          experimentation
                              metrics


Thursday, 14 July 2011
1. science is changing



Thursday, 14 July 2011
1. science is changing
                         (and the research workflow)




Thursday, 14 July 2011
research
                                       idea

                         publish                   lit review


     share results                                  materials

                                       retest
                          analyze                  experiment

                                    collect data
Thursday, 14 July 2011
blocking points
                                   (to name a few ... )
                                          idea

                         publish                     lit review


     share results                                     materials

                                          retest
                          analyze                    experiment

                                      collect data
Thursday, 14 July 2011
types of information
                                 (will revisit later)
                                         idea
                    articles
                                                       content
                  proceedings        prof activities
                                      mentorship
                                        patents
     share results                                the non-digital “stuff”

                                         retest
                          analysis                      protocols
                         synthesis                     parameters
                                      datasets
Thursday, 14 July 2011
text text
                            text




Thursday, 14 July 2011
Thursday, 14 July 2011
remaining
                         roadblocks
                         specialisation of tools (+/-)
                               interoperability
                                 accessibility
                              design decisions
                              the “social issue”



Thursday, 14 July 2011
2. focus areas



Thursday, 14 July 2011
(3)

Thursday, 14 July 2011
knowledge discovery
                    software applications
                research management


Thursday, 14 July 2011
knowledge discovery
                    software applications
                research management


Thursday, 14 July 2011
data ...
                     content, compounds,
                          collections


Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
text text
                            text




Thursday, 14 July 2011
can fine-tune




Thursday, 14 July 2011
name disambiguation




Thursday, 14 July 2011
10,11-dihydro-5-methyl-5H-dibenzo[b,e][1,4]diazepin-11-one




Thursday, 14 July 2011
knowledge discovery
                    software applications
                research management


Thursday, 14 July 2011
gigabytes, not
                            terabytes



Thursday, 14 July 2011
CC-BY-2.0 - Plaxco Lab - http://www.flickr.com/photos/34857812@N04/



Thursday, 14 July 2011
Thursday, 14 July 2011
better
                          tracking
                         is needed


Thursday, 14 July 2011
ordering
                                      +
                                 processing

                         the non-digital



Thursday, 14 July 2011
protocols
                         parameters
                         calibration
                          literature

Thursday, 14 July 2011
the non-digital



                          parameters



Thursday, 14 July 2011
the non-digital



                             about



Thursday, 14 July 2011
not just tracking,
                         but organisation +
                              analysis


Thursday, 14 July 2011
Thursday, 14 July 2011
000s
       100s
       200s
       300s
       400s
       500s
       600s
Thursday, 14 July 2011
000s              ...
       100s              philosophy/psych
       200s              religion
       300s              social sci
       400s              ...
       500s              lang, natsci, maths
       600s              tech/appliedsci
Thursday, 14 July 2011
000s              computersci
       100s              philosophy/psych
       200s              religion
       300s               problems
                         social sci
       400s              ...
       500s              lang, natsci, maths
       600s              tech/appliedsci
Thursday, 14 July 2011
spatial, topical mapping
             arbitrary, heuristic
                difficult to edit



Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
knowledge discovery
                    software applications
                research management


Thursday, 14 July 2011
data capture
                         (of a different sort)



Thursday, 14 July 2011
tools for decision makers
                         (research admin / funders)

                          using technology to spur
                                cultural shift


Thursday, 14 July 2011
existing system is imperfect




Thursday, 14 July 2011
“Right now we're going through a
               Cambrian explosion of metrics.”
                                     - Johan Bollen




                                Nature 465, 864-866 (2010) | doi:10.1038/465864a




Thursday, 14 July 2011
a wealth of mechanisms exist ...

                                citation / impact factor
                                        h - index
                          weighted citations (eigenfactor, sjr)
                               “betweenness centrality”
                                    alt-metrics, etc.




Thursday, 14 July 2011
challenges :
                           harmonisation
                          track /maintain
                          judgement calls
                         external pressures


Thursday, 14 July 2011
measurements still
                         stuck in the paper
                             metaphor



Thursday, 14 July 2011
“ what do we want
                         on the back of our
                         (science) baseball *

                               cards? “
                                - paul groth (et al.)


                             * UK folks, think Top Trumps


Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
Thursday, 14 July 2011
4. our goal



Thursday, 14 July 2011
software that
                         understands science



Thursday, 14 July 2011
software that
                 understands scientists



Thursday, 14 July 2011
reflect changes in
                          digital research

               account for new “data”


Thursday, 14 July 2011
more efficient research
              increase productivity
               accelerate discovery
                  shift culture


Thursday, 14 July 2011
thank you.

                  k.thaney@digital-science.com
                     www.digital-science.com



Thursday, 14 July 2011

Contenu connexe

En vedette

En vedette (8)

Fed2013_Managing Workplace Productivity
Fed2013_Managing Workplace ProductivityFed2013_Managing Workplace Productivity
Fed2013_Managing Workplace Productivity
 
From The Pillory To The Joneses Using Peer Pressure To Improve Your Security ...
From The Pillory To The Joneses Using Peer Pressure To Improve Your Security ...From The Pillory To The Joneses Using Peer Pressure To Improve Your Security ...
From The Pillory To The Joneses Using Peer Pressure To Improve Your Security ...
 
Lean 101
Lean 101Lean 101
Lean 101
 
Why Games Will Sit At The Head Of The Media Table
Why Games Will Sit At The Head Of The Media TableWhy Games Will Sit At The Head Of The Media Table
Why Games Will Sit At The Head Of The Media Table
 
Week 1 Lecture @ UMBC
Week 1 Lecture @ UMBCWeek 1 Lecture @ UMBC
Week 1 Lecture @ UMBC
 
Reporting on Research: The Ethical Obligations of the Editor
Reporting on Research: The Ethical Obligations of the EditorReporting on Research: The Ethical Obligations of the Editor
Reporting on Research: The Ethical Obligations of the Editor
 
Architecture without an end state
Architecture without an end stateArchitecture without an end state
Architecture without an end state
 
Manueverable architecture
Manueverable architectureManueverable architecture
Manueverable architecture
 

Similaire à "Data in the Digital Age" - Hadoop Big Data Meetup

Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011
Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011
Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011
Finalist - open IT oplossingen
 
Oop design magma rails 2011
Oop design   magma rails 2011Oop design   magma rails 2011
Oop design magma rails 2011
MagmaConf
 
Koss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser appsKoss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser apps
Evil Martians
 
100701 1st
100701 1st100701 1st
100701 1st
picshbj
 

Similaire à "Data in the Digital Age" - Hadoop Big Data Meetup (20)

The Digital Toolbox - a discussion -Science Online '11
The Digital Toolbox - a discussion -Science Online '11The Digital Toolbox - a discussion -Science Online '11
The Digital Toolbox - a discussion -Science Online '11
 
Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011
Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011
Inspiratiemiddag_Vincent_Everts_Finalist generatie_einstein_komt_eraan_07042011
 
Persuasive Speaking
Persuasive SpeakingPersuasive Speaking
Persuasive Speaking
 
Benjamin Button Effect July 2011
Benjamin Button Effect July 2011Benjamin Button Effect July 2011
Benjamin Button Effect July 2011
 
Spectrum of IT BPO Services in the Philippines
Spectrum of IT BPO Services in the PhilippinesSpectrum of IT BPO Services in the Philippines
Spectrum of IT BPO Services in the Philippines
 
You're doing it wrong
You're doing it wrongYou're doing it wrong
You're doing it wrong
 
Can Metadata Keep Libraries Relevant?
Can Metadata Keep Libraries Relevant?Can Metadata Keep Libraries Relevant?
Can Metadata Keep Libraries Relevant?
 
Events+Me
Events+MeEvents+Me
Events+Me
 
Replacing Telco DB/DW to Hadoop and Hive
Replacing Telco DB/DW to Hadoop and HiveReplacing Telco DB/DW to Hadoop and Hive
Replacing Telco DB/DW to Hadoop and Hive
 
Visual Communication That Works! (PDF)
Visual Communication That Works! (PDF)Visual Communication That Works! (PDF)
Visual Communication That Works! (PDF)
 
Project management
Project managementProject management
Project management
 
Oop design magma rails 2011
Oop design   magma rails 2011Oop design   magma rails 2011
Oop design magma rails 2011
 
Where do you see ICT?
Where do you see ICT?Where do you see ICT?
Where do you see ICT?
 
Sinsai.info Global ICT summit
Sinsai.info   Global ICT summitSinsai.info   Global ICT summit
Sinsai.info Global ICT summit
 
Koss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser appsKoss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser apps
 
Beyond Page Objects
Beyond Page ObjectsBeyond Page Objects
Beyond Page Objects
 
Jeremiah Pliché's PBE 2011
Jeremiah Pliché's PBE 2011Jeremiah Pliché's PBE 2011
Jeremiah Pliché's PBE 2011
 
How to speed-code a success story
How to speed-code a success storyHow to speed-code a success story
How to speed-code a success story
 
Standing on the Shoulders of Hackers
Standing on the Shoulders of HackersStanding on the Shoulders of Hackers
Standing on the Shoulders of Hackers
 
100701 1st
100701 1st100701 1st
100701 1st
 

Plus de Kaitlin Thaney

Making the web work for science - RIT Dean's Lecture Series
Making the web work for science - RIT Dean's Lecture SeriesMaking the web work for science - RIT Dean's Lecture Series
Making the web work for science - RIT Dean's Lecture Series
Kaitlin Thaney
 
Making the web work for science - University of Queensland
Making the web work for science - University of QueenslandMaking the web work for science - University of Queensland
Making the web work for science - University of Queensland
Kaitlin Thaney
 

Plus de Kaitlin Thaney (20)

Megaphones to (No)where: On Sustaining Change
Megaphones to (No)where:  On Sustaining ChangeMegaphones to (No)where:  On Sustaining Change
Megaphones to (No)where: On Sustaining Change
 
Lessons in Resilience - International Women's Day Keynote @ Brooklyn College
Lessons in Resilience - International Women's Day Keynote @ Brooklyn CollegeLessons in Resilience - International Women's Day Keynote @ Brooklyn College
Lessons in Resilience - International Women's Day Keynote @ Brooklyn College
 
Building Capacity for Open Science
Building Capacity for Open ScienceBuilding Capacity for Open Science
Building Capacity for Open Science
 
Fueling the Open Movement - Compute Midwest
Fueling the Open Movement - Compute MidwestFueling the Open Movement - Compute Midwest
Fueling the Open Movement - Compute Midwest
 
Shifting Scientific Practice - ORCID 2015
Shifting Scientific Practice - ORCID 2015Shifting Scientific Practice - ORCID 2015
Shifting Scientific Practice - ORCID 2015
 
Mozilla Science Lab 101
Mozilla Science Lab 101Mozilla Science Lab 101
Mozilla Science Lab 101
 
Building capacity for open science - COASP Meeting
Building capacity for open science - COASP MeetingBuilding capacity for open science - COASP Meeting
Building capacity for open science - COASP Meeting
 
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
 
Leveraging the power of the web - Open Repositories 2015
Leveraging the power of the web - Open Repositories 2015Leveraging the power of the web - Open Repositories 2015
Leveraging the power of the web - Open Repositories 2015
 
Building capacity for open, data-driven science - Grand Rounds
Building capacity for open, data-driven science - Grand RoundsBuilding capacity for open, data-driven science - Grand Rounds
Building capacity for open, data-driven science - Grand Rounds
 
National Data Integrity Conference - Making the web work for science
National Data Integrity Conference - Making the web work for scienceNational Data Integrity Conference - Making the web work for science
National Data Integrity Conference - Making the web work for science
 
Capturing Contribution - ARCS
Capturing Contribution - ARCSCapturing Contribution - ARCS
Capturing Contribution - ARCS
 
Making the web work for science - RIT Dean's Lecture Series
Making the web work for science - RIT Dean's Lecture SeriesMaking the web work for science - RIT Dean's Lecture Series
Making the web work for science - RIT Dean's Lecture Series
 
Piloting Contributorship Badges for Science
Piloting Contributorship Badges for SciencePiloting Contributorship Badges for Science
Piloting Contributorship Badges for Science
 
"Designing for Truth, Scale and Sustainability" - WSSSPE2 Keynote
"Designing for Truth, Scale and Sustainability" - WSSSPE2 Keynote"Designing for Truth, Scale and Sustainability" - WSSSPE2 Keynote
"Designing for Truth, Scale and Sustainability" - WSSSPE2 Keynote
 
"Making the Web Work for Science" - NCI CBIIT
"Making the Web Work for Science" - NCI CBIIT"Making the Web Work for Science" - NCI CBIIT
"Making the Web Work for Science" - NCI CBIIT
 
"Building Capacity for Open Research" - AAMC
"Building Capacity for Open Research" - AAMC"Building Capacity for Open Research" - AAMC
"Building Capacity for Open Research" - AAMC
 
Making the web work for science - eResearch nz
Making the web work for science - eResearch nzMaking the web work for science - eResearch nz
Making the web work for science - eResearch nz
 
Making the web work for science - University of Queensland
Making the web work for science - University of QueenslandMaking the web work for science - University of Queensland
Making the web work for science - University of Queensland
 
Discoverability and Web-Enabled Science - #ScholarAfrica
Discoverability and Web-Enabled Science - #ScholarAfricaDiscoverability and Web-Enabled Science - #ScholarAfrica
Discoverability and Web-Enabled Science - #ScholarAfrica
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

"Data in the Digital Age" - Hadoop Big Data Meetup