Stella Wisdom's slides for a talk to UCL BASc students on 02/03/2015.
Including information on BL Labs, Mechanical Curator, Mechanical Comedian, David Normal and Off the Map
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Stella Wisdom's slides for a talk to UCL BASc students on 02/03/2015
1. Digital Research at
the British Library
Stella Wisdom, Digital Curator
@miss_wisdom
Blog: http://britishlibrary.typepad.co.uk/digital-scholarship/
2. www.bl.uk 2
Digital Scholarship at British Library
“The production, use and
integration of digital content,
services and tools to facilitate
scholarship and research. It
allows research areas to be
investigated in new ways,
using new tools, leading to
new discoveries and analysis
to generate new
understanding”
-Adam Farquhar
Head of Digital Scholarship
Created in 2010, the department
works to enable….
• production of digital content
• sharing and integration of
digital content
• wider collaboration and
contribution around digital content
• complex analysis & facilitation of
new discoveries
4. www.bl.uk 4
Engaging with Labs
• Competitions
Labs Competition Deadline Thursday 30th of April 2015.
Winners will be chosen by Friday 29th of May 2015
http://labs.bl.uk/British+Library+Labs+Competition+2015
Labs Awards Deadline Monday 14th of September 2015
http://labs.bl.uk/British+Library+Labs+Awards+2015
• Events – Hack/ Data Days, Ideas Labs,Tag-a-thons
• Project proposals
5. www.bl.uk 5
Story of one digital collection…
What can68,000
books tell us?
Image: Artwork by Alicia Martin
6. www.bl.uk 6
Microsoft Partnership Digitisation
2006-8
• 68,000 volumes (47,000+ titles) published in the 19th
century mostly in English
• Bulk selection by storage location and subject classification
(English literature, geography, history and philosophy)
• Excluded authors active 1850-1901 and who died after
1936
• Output: 25 million pages (colour JPEG
2000, ALTO text files, metadata)
• Digitised content is public domain
7. www.bl.uk 7
Extracting Images from OCR
7
<?xml version="1.0"
encoding="UTF-8" ?>
- <mets:mets
xmlns:xsi="http://ww
w.w3.org/2001/XML
Schema-instance"
xmlns:mets="http://w
ww.loc.gov/METS/"
xsi:schemaLocation=
"http://www.loc.gov/
METS/
http://www.loc.gov/
standards/mets/ver
sion18/mets.xsd
info:lc/xmlns/premi
s-v2
Image snipped out
Algorithmically
From ALTO XML
Image taken from page 207 of 'London and its Environs. A
picturesque survey of the metropolis and the suburbs ...
Translated by Henry Frith. With ... illustrations'
ALTO XML
11. www.bl.uk 11
Flickr in numbers
Over 200,000,000 !!!
image views since launch December 13th 2013
Almost all images seen at least 10 times
>116712 tags added
18,567 images favourited
12. www.bl.uk 12
Free colouring in sheets for children
http://www.playingbythebook.net/2014/03/18/barbapapas-new-house-a-book-
so-good-im-featuring-it-for-a-second-time/
Strassburg und seine
Bauten. Herausgegeben
vom Architekten- und
Ingenieur-Verein für
Elsass-Lothringen. Mit 655
Abbildungen in Text, etc,
1894
13. www.bl.uk 13
Soundscapes inspired by the Flickr
collection
http://britishlibrary.typepad.co.uk/sound-and-vision/2014/10/inspired-by-flickr-air.html
24. www.bl.uk 24
Off the Map videogame competition for 2013
Three sub themes:
• Stonehenge, including a proposed plan to rebuild
Stonehenge
• Pyramids at Giza
• 17th Century London, including a survey map made
months after the Great Fire of 1666
26. www.bl.uk 26
John Leake, An exact surveigh of the streets lanes and churches contained within the
ruines of the City of London, 1667. Maps Crace port 2.58
27. www.bl.uk 27
2013 winning team: Pudding Lane Productions from De Montfort
University, Leicester who created an interpretation of 17th Century London
http://puddinglanedmuga.blogspot.co.uk/
http://youtu.be/SPY-hr-8-M0 (Flythrough starts at 0:50)
28. www.bl.uk 28
Gothic theme, tie-in with the Library's exhibition
Terror and Wonder: The Gothic Imagination
3 October 2014 - 20 January 2015
• Fonthill Abbey
Home of William Beckford, author of Vathek
• Edgar Allan Poe’s
Masque of the Red Death
• Whitby and its association with Bram Stoker’s
novel Dracula
Off the Map 2014
29. www.bl.uk 29
Off the Map
2014 Winners
• 2014 winning team:
Gothulus Rift
University of South
Wales
• Created a Fonthill Abbey
inspired game called Nix
using Oculus Rift
• Blog:
http://nixgamedevblog.bl
ogspot.co.uk
• YouTube flythrough:
http://youtu.be/8ESieZO
4VHw
31. www.bl.uk 31
Off the Map 2015
Alice’s Adventures Off the Map
Part of the British Library's celebrations for the 150th
anniversary of Alice in Wonderland
http://gamecity.org/alices-adventures-off-the-map/
32. www.bl.uk 32
@miss_wisdom
British Library Labs
http://labs.bl.uk
digitalresearch@bl.uk
Digital Scholarship Blog
http://britishlibrary.typepad.co.uk/digital-scholarship/
@bl_labs
Notes de l'éditeur
The work of Labs is really about a number of stories, stories about digital collections and about researchers wanting to ask fascinating research questions about them. Let’s now tell you a story about one collection and the intended and unintended consequences of working with it.
60 seconds
The Library digitised 68,000 predominantly 19th century books from our collections a few years ago (around 2.7 % of the physical total in that period). You can view them from our catalogue or read them on your <click>IPad via the Historical Books app developed by BiblioLabs.
There are 22 million individual page images, along with full text scans of these images, all of which contain untold quantity of useful data such as names of people, places, historical events, dates.
with no restrictions on use by Microsoft
So the question became then, what next? What can 68,000 books tell us?
60 seconds
As the books were scanned for text, this had a fortunate ‘side effect’ the software not only tries to detect the text on the page but also where the images might be. There had already been some interest in the images from the community of researchers. It seemed easy to extract them.
s part of the Labs competition, Matt Prior attended one of our hack events and when examining our book data and was very interested in the images from the books.
Meanwhile the algorithm that Ben had written to snip the images from the OCR scans was still churning away, how many were there going to be? The Mechanical Curator could publish them every hour, but was there somewhere we could put them all for people to browse when they wanted. Importantly if we did put them somewhere, could we get people to help us add descriptions to the individual images making them infinitely more discoverable.]
With an algorithm by Ben O’Steen we snipped out images from digitised books and put them on to Flickr on December 13 2013, there were over a million, but the problem we had was that we knew which books they came from (author/dates), but we didn’t’ have any information about the images. By releasing them onto flickr, we have got people to start tagging them and using them in very creative ways.
Hosting them internally was not an option and there was not sufficient metadata to put them on Wikipedia. Flickr seemed the obvious option as it is a platform that can support high usage, did not require metadata, allowed tagging and it is free for public domain images.
There has been since considerable news coverage in the press about the million images released on Flickr commons.
What was amazing is how this became a headline overnight, following a blogpost by Ben O’Steen and few tweets about it. without any official Library press release,
http://britishlibrary.typepad.co.uk/digital-scholarship/2013/12/a-million-first-steps.html
<click>The Independent, <click>Wired magazine, <click>The Guardian, <click>Popular Science and the <click>Mail online to name a very few.
The site was launched on Friday, December 13, 2013. On the first day only there were 5 million views of the Library’s Flickr Photostream and the site averages 20 million hits per month. So far there have been a staggering 200,000,000 views of the images, with over 116000 use generated tags.
The huge number of views has been sustained, with some work on our side.
The team had to be clever about keeping the images refreshed, every day there were new images appearing on top of the stream, we initiated some of the sub-collections, eg. random selections, curator picked images, popular high resolution images, images in need of tagging.
So Friday the 13th wasn’t so unlucky for us after all!
Top tags 17640 -- 'map' -- http://flickr.com/photos/britishlibrary/tags/map
6462 -- 'rotated' -- http://flickr.com/photos/britishlibrary/tags/rotated
4601 -- 'portrait' -- http://flickr.com/photos/britishlibrary/tags/portrait
1928 -- 'architecture' -- http://flickr.com/photos/britishlibrary/tags/architecture
Top Contributors: (282 contributors total) ===============
http://flickr.com/people/82452447@N00 - 34055 tags added
http://flickr.com/people/35468141611@N01 - 27509 tags added
Zoe Toft talks in her Playing by the Book blog about the way she used illustrations from a book about Strassburg Architecture to create colouring in sheets for her children. She used the trace function in Inkscape to create clean(er) black and white images, and the same programme to put them together in order to create her dream street.
Strassburg und seine Bauten. Herausgegeben vom Architekten- und Ingenieur-Verein für Elsass-Lothringen. Mit 655 Abbildungen in Text, etc, published in 1894 by Architekten- und Ingenieur-Verein für Elsass-Lothringen.
In a blog post British Library Sound Curator Cheryl Tipp invited people working with sounds to create audio recording inspired by the Flickr Collection and has been posting the creations on the BL Sounds blog (Each sound piece was no more than 3 minutes long, submitted as an mp3 and we needed to know which image inspired you and why)
http://britishlibrary.typepad.co.uk/sound-and-vision/2014/03/inspired-by-flickr.html#sthash.TgzVA4eK.dpuf
http://soundslikenoise.org/2014/03/22/the-devils-acres/
Top contributor Mario Klingemann has created artistic works through applying pattern recognition software using different criteria. Eg. 1800 Portraits sorted by pose; Female faces; arrangements by similarity of random images – portraits sorted by pose or by gender, flowers, circular pictures, colourful pages
Mario will speak about it at an event on 18 Dec
https://www.flickr.com/photos/quasimondo/
His web site is: http://mario-klingemann.tumblr.com/
And http://incubator.quasimondo.com/
He speaks about his project, how he came across the images and what he did with them.
How he learnt about the image = it was pure serendipity
Taking images out of the context of books creates potential to reinvent them in a new context.
http://youtu.be/3AOa98RsA2Q
http://www.youtube.com/watch?feature=player_detailpage&v=3AOa98RsA2Q#t=48
Make sure subtitles are on.
This is a surprising use of the images we put onto Flickr. Once a year in the summer, tens of thousands of participants gather in Nevada's Black Rock Desert to create Black Rock City, dedicated to community, art, self-expression, and self-reliance. They depart one week later, having left no trace whatsoever. [This year it took place between August 25 to September 1, Nevada, USA, the show ends by burning an effigy of wooden man! <click>]
American Artist David Normal used images from your Flickr Commons collection and worked on a set of collages called "Crossroads of Curiosity". The finished paintings based on these collages were presented in full colour as ' lightboxes at this year's Burning Man Festival, the theme for which was "Caravansary“. They were presented around the base of the effigy of the Burning Man in the heart of the festival.