SlideShare une entreprise Scribd logo
1  sur  39
hoard.it : Stealing your data
Or... “Where is your online value?”
Or... “Originality sucks”
Dan Zambonini
www.boxuk.com

Museums and the Web 2009, Indianapolis, April 16
WARNING
WARNING
1. I am playing Devil’s Advocate

2. These are‘thoughts in progress’
Introduction
1. The hoard.it project

2. Museums and the Web:
   where’s the value?
Introduction
1. The hoard.it project

2. Museums and the Web:
   where’s the value?
2.5 - 15%
2.5 - 15%
Cross-Collections Projects

  “Search through the cultural collections of Europe”



            “explore and comment on collections”


     “find and explore digital collections from museums”


                   “Discover cultural objects, collections”
Why is this a Problem?
1. Some duplication of effort
  • £25,000 - £100,000 to put collections online
  • £1,500 - £6,500 per cross-collection project
2. Potential end-user confusion
3. Usually only include larger institutions
4. Is there really a need?
Our Approach
• Use data that already exists
   • No cost/duplication of effort
• No input or changes from museums
   • Lightweight, open to all
• Re-expose the data programmatically
   • Enable easy re-use
How it works
Screen-Scraper + Spider
How it works
Screen-Scraper + Spider
How it works
Screen-Scraper + Spider
Difficulties and Limitations
•   Must have collections online
•   Must have a consistent template
•   Slow; not real-time
•   Technical variations (encoding, standards)
•   Rudimentary: Flash/Forms a barrier
Difficulties: Normalization
•   Dates
    •   circa 19th century, 1960s, 2008-01, 1Jan ’52, 2000 BC, 30s, April 4 1934,
        04-76, 1783-25-04, 10-11-64, about 200 AD, Victorian, 1100-1150, ...

    •   http://feeds.boxuk.com/convert/date/


•   Location
    •   Points of interest, cities, towns, countries, administrative regions, political
        regions, ancient names, continents, postal codes, co-ordinates, ...

    •   http://developer.yahoo.com/geo/
The Data
   Virtual Museum of Canada!

     Carnegie Museum of Art!

          Smithsonian NASM!

 National Museum of Australia!

      National Portrait Gallery!

        Imperial War Museum!

National Museums of Scotland!

                     Ingenious!

  Museum of London: E20CL!

               British Museum!

  Victoria and Albert Museum!

   National Maritime Museum!

                  Powerhouse!

             Science Museum!

             24 Hour Museum!

            Freebase: Events!

    Wikipedia: List of Painters!

                                   0!   2000!   4000!   6000!   8000!   10000!   12000!   14000!   16000!
The Data
   Virtual Museum of Canada!

     Carnegie Museum of Art!

          Smithsonian NASM!

 National Museum of Australia!

      National Portrait Gallery!

        Imperial War Museum!

National Museums of Scotland!

                     Ingenious!

  Museum of London: E20CL!

               British Museum!

  Victoria and Albert Museum!

   National Maritime Museum!

                  Powerhouse!

             Science Museum!

             24 Hour Museum!

            Freebase: Events!

    Wikipedia: List of Painters!

                                   0!   2000!   4000!   6000!   8000!   10000!   12000!   14000!   16000!


                                                                            70,000 objects
The Data
 • URL            100%
 • Identifier     95%
 • Title          100%
 • Description    70%
 • Image          85%
 • Creator        50%
 • Created Date   75%
 • Copyright      50%
 • Dimensions     45%
 • Subject        65%
 • Location       45%
 • Materials      65%
Data Mining - Location
                                       65%   Europe
                                       15%   Asia
                                       14%   North America
                                       4%    Oceania




Percentage of objects from the same continent as museum:

• North America: 85%
• Europe:        75%
• Oceania:       65%
% of objects by continent of origin!




             0!
                  10!
                        20!
                                  30!
                                          40!
                                                  50!
                                                          60!
                                                                     70!
                                                                           80!
                                                                                 90!
        -1000!
         -900!
         -800!
         -700!
         -600!
         -500!
         -400!
         -300!
         -200!
         -100!
            0!
          100!
          200!
          300!
          400!
          500!




Year!
          600!
          700!
          800!
          900!
         1000!
         1100!
         1200!
         1300!
         1400!
         1500!
         1600!
         1700!
         1800!
         1900!
         2000!
                                 Asia!
                                 Africa!
                                 Europe!
                                 Oceania!
                                 North America!
                                 South America!
                                                                                       Data Mining - Date/Location
% of objects by material!




                      0!
                           5!
                                10!
                                                15!
                                                                  20!
                                                                            25!
                                                                                  30!
                                                                                        35!
                                                                                              40!
              0!
         10
              0!
         20
              0!
         30
              0!
         40
              0!
         50
              0!
         60
              0!
         70
              0!
         80
              0!
         90
              0!
        10
          00
               !




Year!
        11
           0  0!
        12
             00
                  !
        13
             00
                  !
        14
             00
                  !
        15
          00
               !
        16
             00
                  !
        17
             00
                  !
        18
             00
                  !
        19
             00
                  !
        20
             00
                  !
                                                          Clay!

                                                  Gold!

                                      Silver!
                                                                   Stone!
                                                                                                    Data Mining - Date/Material
How it has been used
•   Experiments: http://hoard.it/labs/




•   UK Museums on the
    Web 2008 Hack Day


•   Who knows...?
                                         Photo courtesy of Brian Kelly
How it has been used
Next steps...
Next steps...


 ABSOLUTELY
  NOTHING
Do you offer anything?
dbPedia, Freebase
What can you offer?
•   Expertise
•   Media
•   The Physical Space
•   Reputation and Trust
•   Audience
•   Voice, Exposure and Influence
What’s changed?
“...not all information should flow everywhere; only the
meaningful should be transmitted.

But in the network economy only signals in real time (or
close to it) are truly meaningful.

Examine the speed of knowledge in your system. How
can it be brought closer to real time? If this requires the
cooperation of subcontractors, distant partners, and far-
flung customers, so much the better.”

Kevin Kelly
http://www.kk.org/newrules/blog/2009/04/if-you-are-not-in-real-time-yo.php
What’s changed?


                  !quot;#$%#$&
!quot;#$%&




                  '($(&
                  )%*+,-%.&




          '()%&
What’s changed?
What’s changed?

 EXECUTION
    not
   IDEAS
What’s changed?

              !quot;#$%&'()
              *+#,)




                      !quot;#$%&'(
                      )*#+%$%&'(
                      ,--.**%+%$&'(
                      /0.(1%20&(3.#"4.*(
                      5.*%26(
UK Newspaper Example
                                ,-./012345quot;
                                 #!quot;
                                  +quot;
                                  *quot;
                F44:G2.:=quot;                        6278925:quot;
                                  )quot;
                                  (quot;
                                  'quot;
                                                                               H2-1Iquot;JKL.8==quot;
                                  "
                                                                               H2-1Iquot;A2-1quot;
                                  %quot;
                                                                               H2-1Iquot;A-..4.quot;
                                  $quot;
                                                                               H2-1Iquot;CM2.quot;
                                  #quot;
                                                                               H2-1Iquot;>8187.2LBquot;
                                  !quot;
D5-E08quot;D=8.=quot;                                                 ;2/8<44:quot;;25=quot;
                                                                               ;-525/-21quot;>-G8=quot;
                                                                               >B8quot;N02.O-25quot;
                                                                               >B8quot;P5O8L85O85Mquot;
                                                                               >B8quot;C05quot;
                                                                               >B8quot;>-G8=quot;



         9CCquot;C0<=/.-<8.=quot;                         >?-@8.quot;;4114?8.=quot;




                             A85345=quot;-5quot;$&quot;B.=quot;
For example
•   Let your patrons collaborate
•   Let your patrons run your space
•   Give local communities a voice
•   Provide advice and guidance
•   Collect & distribute niche knowledge
•   ...


•   You know better than I do.
What has to change?
•   A focus on proven user needs
•   Re-usable services, not more data
•   Smaller projects
•   Iterative approaches
•   A real commitment to the web platform
•   (At least some) In-house development
How do we get there?
•   Should web projects generate revenue?
•   Don’t be afraid of re-inventing the wheel
•   Demand all projects use/expose APIs that
    are easy (REST not SOAP/OAI) and publicized
•   Show early, show often
•   Annoy funding bodies to support more,
    smaller, longer (i.e. iterative) ‘boring’ projects,
    and less ‘big, audacious’ projects.
Summary
•   We stole your data...
•   But then so are lots of other people...
•   So produce value elsewhere.


•   Ideas are harmful: do what’s proven...
•   But do it brilliantly.
•   And to do that, we need change.
Thank you
      www.boxuk.com


      dan@boxuk.com


    twitter.com/zambonini
Thank you
      www.boxuk.com


      dan@boxuk.com


    twitter.com/zambonini

Contenu connexe

En vedette

Intranet y sus beneficios
Intranet y sus beneficiosIntranet y sus beneficios
Intranet y sus beneficiosAndrewwcc
 
Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Asyst News
 
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...museums and the web
 
Saudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionSaudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionBrandon Dooley
 
Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Penso Ideias
 
Practica colas (if, else)
Practica colas (if, else)Practica colas (if, else)
Practica colas (if, else)Eli Diaz
 
Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)miguelmunguia
 
Inspiring Shopper Behaviours
Inspiring Shopper BehavioursInspiring Shopper Behaviours
Inspiring Shopper BehavioursOgilvy Consulting
 

En vedette (11)

Intranet y sus beneficios
Intranet y sus beneficiosIntranet y sus beneficios
Intranet y sus beneficios
 
12 san francisco museum of modern art
12 san francisco museum of modern art12 san francisco museum of modern art
12 san francisco museum of modern art
 
Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"
 
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
 
Saudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionSaudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solution
 
Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)
 
PicNic no Monet
PicNic no MonetPicNic no Monet
PicNic no Monet
 
Practica colas (if, else)
Practica colas (if, else)Practica colas (if, else)
Practica colas (if, else)
 
Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)
 
Proyecto Final
Proyecto FinalProyecto Final
Proyecto Final
 
Inspiring Shopper Behaviours
Inspiring Shopper BehavioursInspiring Shopper Behaviours
Inspiring Shopper Behaviours
 

Similaire à Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent

RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailRTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailAlberto Bacchelli
 
The Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkThe Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkDigital Sparks
 
Cloud computing
Cloud computingCloud computing
Cloud computingtimesheet1
 
TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"Karla Witte
 
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...WRI Ross Center for Sustainable Cities
 
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」Takashi Iba
 
Jane Jacobs and the Voice of the Monstrous Hybrid
Jane Jacobs and the Voice of the Monstrous Hybrid Jane Jacobs and the Voice of the Monstrous Hybrid
Jane Jacobs and the Voice of the Monstrous Hybrid Jed Sundwall
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009adminfbgroup
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009guest3117009
 

Similaire à Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent (9)

RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailRTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
 
The Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkThe Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 Talk
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"
 
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
 
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
 
Jane Jacobs and the Voice of the Monstrous Hybrid
Jane Jacobs and the Voice of the Monstrous Hybrid Jane Jacobs and the Voice of the Monstrous Hybrid
Jane Jacobs and the Voice of the Monstrous Hybrid
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
 

Plus de museums and the web

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siumuseums and the web
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...museums and the web
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museummuseums and the web
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...museums and the web
 
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...museums and the web
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...museums and the web
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...museums and the web
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guidemuseums and the web
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...museums and the web
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Trackingmuseums and the web
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...museums and the web
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...museums and the web
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculptingmuseums and the web
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...museums and the web
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...museums and the web
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...museums and the web
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network museums and the web
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...museums and the web
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...museums and the web
 

Plus de museums and the web (20)

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siu
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museum
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
 
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
 
MW2011 Best of the Web Awards
MW2011 Best of the Web AwardsMW2011 Best of the Web Awards
MW2011 Best of the Web Awards
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculpting
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
 

Dernier

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Dernier (20)

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent

  • 1. hoard.it : Stealing your data Or... “Where is your online value?” Or... “Originality sucks” Dan Zambonini www.boxuk.com Museums and the Web 2009, Indianapolis, April 16
  • 3. WARNING 1. I am playing Devil’s Advocate 2. These are‘thoughts in progress’
  • 4. Introduction 1. The hoard.it project 2. Museums and the Web: where’s the value?
  • 5. Introduction 1. The hoard.it project 2. Museums and the Web: where’s the value?
  • 8. Cross-Collections Projects “Search through the cultural collections of Europe” “explore and comment on collections” “find and explore digital collections from museums” “Discover cultural objects, collections”
  • 9. Why is this a Problem? 1. Some duplication of effort • £25,000 - £100,000 to put collections online • £1,500 - £6,500 per cross-collection project 2. Potential end-user confusion 3. Usually only include larger institutions 4. Is there really a need?
  • 10. Our Approach • Use data that already exists • No cost/duplication of effort • No input or changes from museums • Lightweight, open to all • Re-expose the data programmatically • Enable easy re-use
  • 14. Difficulties and Limitations • Must have collections online • Must have a consistent template • Slow; not real-time • Technical variations (encoding, standards) • Rudimentary: Flash/Forms a barrier
  • 15. Difficulties: Normalization • Dates • circa 19th century, 1960s, 2008-01, 1Jan ’52, 2000 BC, 30s, April 4 1934, 04-76, 1783-25-04, 10-11-64, about 200 AD, Victorian, 1100-1150, ... • http://feeds.boxuk.com/convert/date/ • Location • Points of interest, cities, towns, countries, administrative regions, political regions, ancient names, continents, postal codes, co-ordinates, ... • http://developer.yahoo.com/geo/
  • 16. The Data Virtual Museum of Canada! Carnegie Museum of Art! Smithsonian NASM! National Museum of Australia! National Portrait Gallery! Imperial War Museum! National Museums of Scotland! Ingenious! Museum of London: E20CL! British Museum! Victoria and Albert Museum! National Maritime Museum! Powerhouse! Science Museum! 24 Hour Museum! Freebase: Events! Wikipedia: List of Painters! 0! 2000! 4000! 6000! 8000! 10000! 12000! 14000! 16000!
  • 17. The Data Virtual Museum of Canada! Carnegie Museum of Art! Smithsonian NASM! National Museum of Australia! National Portrait Gallery! Imperial War Museum! National Museums of Scotland! Ingenious! Museum of London: E20CL! British Museum! Victoria and Albert Museum! National Maritime Museum! Powerhouse! Science Museum! 24 Hour Museum! Freebase: Events! Wikipedia: List of Painters! 0! 2000! 4000! 6000! 8000! 10000! 12000! 14000! 16000! 70,000 objects
  • 18. The Data • URL 100% • Identifier 95% • Title 100% • Description 70% • Image 85% • Creator 50% • Created Date 75% • Copyright 50% • Dimensions 45% • Subject 65% • Location 45% • Materials 65%
  • 19. Data Mining - Location 65% Europe 15% Asia 14% North America 4% Oceania Percentage of objects from the same continent as museum: • North America: 85% • Europe: 75% • Oceania: 65%
  • 20. % of objects by continent of origin! 0! 10! 20! 30! 40! 50! 60! 70! 80! 90! -1000! -900! -800! -700! -600! -500! -400! -300! -200! -100! 0! 100! 200! 300! 400! 500! Year! 600! 700! 800! 900! 1000! 1100! 1200! 1300! 1400! 1500! 1600! 1700! 1800! 1900! 2000! Asia! Africa! Europe! Oceania! North America! South America! Data Mining - Date/Location
  • 21. % of objects by material! 0! 5! 10! 15! 20! 25! 30! 35! 40! 0! 10 0! 20 0! 30 0! 40 0! 50 0! 60 0! 70 0! 80 0! 90 0! 10 00 ! Year! 11 0 0! 12 00 ! 13 00 ! 14 00 ! 15 00 ! 16 00 ! 17 00 ! 18 00 ! 19 00 ! 20 00 ! Clay! Gold! Silver! Stone! Data Mining - Date/Material
  • 22. How it has been used • Experiments: http://hoard.it/labs/ • UK Museums on the Web 2008 Hack Day • Who knows...? Photo courtesy of Brian Kelly
  • 23. How it has been used
  • 26. Do you offer anything? dbPedia, Freebase
  • 27. What can you offer? • Expertise • Media • The Physical Space • Reputation and Trust • Audience • Voice, Exposure and Influence
  • 28. What’s changed? “...not all information should flow everywhere; only the meaningful should be transmitted. But in the network economy only signals in real time (or close to it) are truly meaningful. Examine the speed of knowledge in your system. How can it be brought closer to real time? If this requires the cooperation of subcontractors, distant partners, and far- flung customers, so much the better.” Kevin Kelly http://www.kk.org/newrules/blog/2009/04/if-you-are-not-in-real-time-yo.php
  • 29. What’s changed? !quot;#$%#$& !quot;#$%& '($(& )%*+,-%.& '()%&
  • 32. What’s changed? !quot;#$%&'() *+#,) !quot;#$%&'( )*#+%$%&'( ,--.**%+%$&'( /0.(1%20&(3.#&quot;4.*( 5.*%26(
  • 33. UK Newspaper Example ,-./012345quot; #!quot; +quot; *quot; F44:G2.:=quot; 6278925:quot; )quot; (quot; 'quot; H2-1Iquot;JKL.8==quot; &quot; H2-1Iquot;A2-1quot; %quot; H2-1Iquot;A-..4.quot; $quot; H2-1Iquot;CM2.quot; #quot; H2-1Iquot;>8187.2LBquot; !quot; D5-E08quot;D=8.=quot; ;2/8<44:quot;;25=quot; ;-525/-21quot;>-G8=quot; >B8quot;N02.O-25quot; >B8quot;P5O8L85O85Mquot; >B8quot;C05quot; >B8quot;>-G8=quot; 9CCquot;C0<=/.-<8.=quot; >?-@8.quot;;4114?8.=quot; A85345=quot;-5quot;$&quot;B.=quot;
  • 34. For example • Let your patrons collaborate • Let your patrons run your space • Give local communities a voice • Provide advice and guidance • Collect & distribute niche knowledge • ... • You know better than I do.
  • 35. What has to change? • A focus on proven user needs • Re-usable services, not more data • Smaller projects • Iterative approaches • A real commitment to the web platform • (At least some) In-house development
  • 36. How do we get there? • Should web projects generate revenue? • Don’t be afraid of re-inventing the wheel • Demand all projects use/expose APIs that are easy (REST not SOAP/OAI) and publicized • Show early, show often • Annoy funding bodies to support more, smaller, longer (i.e. iterative) ‘boring’ projects, and less ‘big, audacious’ projects.
  • 37. Summary • We stole your data... • But then so are lots of other people... • So produce value elsewhere. • Ideas are harmful: do what’s proven... • But do it brilliantly. • And to do that, we need change.
  • 38. Thank you www.boxuk.com dan@boxuk.com twitter.com/zambonini
  • 39. Thank you www.boxuk.com dan@boxuk.com twitter.com/zambonini

Notes de l'éditeur