SlideShare une entreprise Scribd logo
1  sur  87
Big data and the dark arts:
Demystifying the world of big data
Catherine Grout, Jisc
http://fc00.deviantart.net/fs71/f/2013/073/5/e/defence_against_the_dark_arts_lesson_by
_asiapasek-d5y0oc7.jpg
» Introduction to the topic and its importance education and
research
» Presentations from some key projects at the coal face of this issue
› COSMOS - Collaborative online social media observatory (Pete
Burnap)
› Mining Biodiversity - Enriching biodiversity heritage with text mining
and social media (Riza Batista-Navarro)
› Trees andTweets - combining twitter datawith family trees- (JackGrieve)
Structure of session
410/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
In 2012, Gartner updated its definition as follows:
"Big data is high volume, high velocity, and/or high variety
information assets that require new forms of processing to
enable enhanced decision making, insight discovery and
process optimization."[16] Additionally, a newV "Veracity" is
added by some organizations to describe it.[17]
(http://en.wikipedia.org/wiki/Big_data)
5
» Better use of Big data through high performance analytics could
add £216 billion to the UK economy by 2017 (CEBR via sas.com)
» Data has moved from a backroom issue to a boardroom issue
(strategy insight and competitive advantage)
chiefdataofficersummit.com/
» Therefore data ownership also a very important issue
» Tim Berners-Lee (as paraphrased in Guardian):
“the data we create about ourselves should be owned by each of us,
not the large companies that harvest it”
theguardian.com/technology/2014/oct/08/sir-tim-berners-lee-
speaks-out-on-data-ownership
Big data: big issue
610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Total investment is in the region of £550 m (2012-15)
» This is across all 7 research councils but also includes collaborative
programmes (17 programmes)
» Includes production of:
› Methodologies, tools and new aggregated datasets
› Infrastructure - giving access to public and private data
› Infrastructure - providing storage, compute
› Centres of Expertise - Capacity and skills development
» RCUK overview of Big data investments
rcuk.ac.uk/research/infrastructure/big-data/
RCUK “Big data” investment overview
710/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Power
» Responsibility
» Opportunity
Big data for Universities
810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Enterprise Data: about learners, researchers and staff and the
University as a business (including research grants)
› Held in structuredsystems,databasesbutmaybe notall interoperable
» Research Data (generally not structured or centrally held, Jisc
supporting universities to address this challenge “Research at Risk”)
› But Open Access publications (and some other material) in
Institutional Repositories (about 125 universities have one)
» Sensitive Data (e.g. medical data – securenetworks,anonymisedetc.)
» Activity data (data about performance, benchmarking, student
and researcher behaviour)
Big data for Universities
910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Big data enables much better analytics -
Key area for universities and for Jisc to support
» Jisc-HESA Business Intelligence Service (in development)
» LAMP (shared academic library analytics service)
» Effective Learner Analytics challenge
» All designed to help support effective analytics at institutional and
national (aggregate level)
Big data: analytics
1010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
11
» “Your recent Amazon purchases,
Tweet score and location history
makes you 23.5% welcome here.”
(Cartoon critical of big data application, byT. Gregorius
en.wikipedia.org/wiki/Big_data)
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Big data research is not all about analysing very big data
» It can be about bringing data together from different sources
» It can be about techniques from the big data field to build more
interesting ways of interacting with digital libraries
» It can be about using and building new techniques, tools to interact
with data and address research questions
» Project presentations will illustrate this
Big data: For research
1210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Issues around curation and preservation of research data (variable
size and condition)
» Performance of infrastructure required
» Why should we share and re-use research data?
» What tools, methodologies, techniques can be used?
» Do researchers have the rights skills to exploit data effectively
» How does all of the above impact on research and the research
process?
Big data: For research
1310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Two of the projects presenting today are part of
Digging into Data Challenge
» Digging into Data has been addressing many of the challenges
that were flagged earlier
» Digging into Data brings together 10 funders in four countries (UK,
US, Canada, NL)
» 36 projects funded since 2011
» Addresses “big data for research” in the humanities and social
sciences
Big data: Digging into data
Machine Anatomy 101 - UK funders & unviersities 17/10/2013 14
» Pioneered and legitimised big data based research in the humanities
– for computer scientists and others. (from zero to hero)
» “digital humanities” and “computational social sciences” working
together
» Engaged GLAM sector and others and encourage them to make
their data available in forms useful to researchers and to work with
them (encourages joint data curation)
Digging into data:Achievements so far
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 15
» Progress on the policy side toward reforming copyright and IP to
allow for big data research on cultural heritage materials - (more to
do here)
» International & multidisciplinary cooperation had high impact
(more than anticipated). Increased visibility also strengthened
research bringing new teams together)
Digging into data:Achievements so far
1610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Bringing data together to make Big data can create exciting
research opportunities
» Article in Nature 2013
» Mummies reveal that clogged arteries plagued the ancient world
» Based on Digging into Data programme project that brought
together CT scans on 137 mummies from four very different
ancient populations: Egyptian, Peruvian, the Ancestral Puebloans
of southwest America and the Unangans of the Aleutian Islands in
Alaska
» nature.com/news/mummies-reveal-that-clogged-arteries-
plagued-the-ancient-world-1.12568
Big data: For research
1710/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Big data: For research
1810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» “Big data” covers a very wide set of activities
» But has and is inspiring major investments and changes in practice
» Jisc is helping to support institutions in making the most of big
data through:
› Developing shared services, advice and guidance to help manage
research data effectively and comply with funders requirements
(Research at Risk Challenge)
› Promoting effective use of data analytics and delivering some key
analytics services
› Working with the Research Councils to help exploit the benefits
of big data for research
Big data: In summary
1910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Gerd Leonhard , Big Data and the Future of
Journalismflickr.com/photos/gleonhard/8978372783/
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Find out more…
Contact…
Catherine Grout
Head of change – research, Jisc
catherine.grout@jisc.ac.uk
Collaborative Online Social Media
Observatory
COSMOS
Dr. Pete Burnap (@pbFeed)
Cardiff School of Computer Science and
Informatics
Cardiff University, UK
With Matthew Williams, Jeffrey Morgan, Omer Rana, Luke Sloan, Alex Voss
Adam Edwards, William Housley and Rob Procter
What is COSMOS?
• Aim to establish a coordinated interdisciplinary response to “Big
Social Data”
• Led from Cardiff (Computer Science and Social Sciences),
Warwick and St. Andrews
• Additional input from Edinburgh, UCL, Leeds, Manchester and
Wolverhampton
• Brings together social, computer, political, health and
mathematical scientists to study the methodological, theoretical,
and empirical dimensions of Big Data in technical, social and policy
contexts
• Developing a research programme to help understand and explain
how social processes and interactions manifest on the Web, with
a focus upon the challenges posed by big social data to government,
digital economy and civil society,
• Development of new methodological tools and technical/data
solutions for UK academia and public sector…a Web Observatory
What is COSMOS?
• COSMOS has attracted 17 research grants
amounting to over £1.25M in funding from
JISC/ESRC/EPSRC/AHRC/and £500K from the
public and private sectors (DoH/FSA/HPC Wales).
• A significant proportion of these funds have been
awarded to collect and analyse social media data in
the contexts of Societal Safety and Security e.g.
social tension, hate speech, crime reporting and
fear of crime, suicidal ideation
Research Programme
Digital Social Research Tools, Tension Indicators and Safer
Communities: A demonstration of COSMOS (ESRC DSR)
COSMOS: Supporting Empirical Social Scientific Research with a
Virtual Research Environment (JISC)
Small items of research equipment at Cardiff University (EPSRC)
Hate Speech and Social Media: Understanding Users, Networks and
Information Flows (ESRC Google)
Social Media and Prediction: Crime Sensing, Data Integration and
Statistical Modelling (ESRC NCRM)
Understanding the Role of Social Media in the Aftermath of Youth
Suicides (Department of Health)
Scaling the Computational Analysis of “Big Social Data” & Massive
Temporal Social Media Datasets (HPC Wales)
Digital Wildfire: (Mis)information flows, propagation and responsible
governance, (ESRC Global Uncertainties)
Public perceptions of the UK food system: public understanding and
engagement, and the impact of crises and scares (ESRC/FSA)
2011
2016
COSMOS Web Observatory
Integrated
Open (“plug and play”)
Scalable (MongoDB data stores/
Hadoop Back End)
Burnap, P. et al. (2014) ‘COSMOS: Towards an Integrated and Scalable Service for Analyzing Social Media
on Demand’, International Journal of Parallel, Emergent and Distributed Systems
Usable – developed with social
scientists for social scientists
Reproducible/Citable Research
- export/share workflow
Web Observatory Features
• Data Collection
– Persistent connection to Twitter 1% Stream (~4 billion)
– ONS/Police API
– Drag and drop RSS
– Import CSV/JSON
• Data Transformation
– Word Frequency
– Point data frequency over time
– Social Network Analysis
– Geospatial Clustering
– Sentiment Analysis
– …API to plug new modules and benchmark tools
Observing Events
Observing Events
COSMOS Infrastructure
COSMOS Desktop
•Small local datasets
•Users’ API credentials
•Local analysis
•Sept ‘14 launch (>100 dl’s in 17
countries)
COSMOS Cloud
•Scalable storage
• Massive datasets
•Scalable compute
• On-demand nodes
• Fast search & retrieve
• Fast analysis
•Workflow management
•Collaboration support
•2015 launch
Web Observatory Examples
• Policy/impact driven (benefit to society/economy)
• Focus on ethical research into human safety and
security
• Augment terrestrial methods
• Comparison to existing methods
• Experimental applied stats & machine learning
• Provide examples of machine intelligence tasks
integrated into social research workflow…
• Radio 5 Live Hit List (#5LiveHitList) - biggest impact
stories across social media and online
Questions?
Pete Burnap (@pbFeed)
burnapp@cardiff.ac.uk
Mining Biodiversity:
Enriching biodiversity literature with
OCR corrections and text-mined
semantic metadata
Riza Batista-Navarro
National centre for text mining,
University of Manchester
Mining biodiversity
34
The Partners
A
A
B
B
C C
D
D
Social Media Lab
E
E
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
35
» Transform BHL into a next-generation social digital library
» Bring together strengths from multiple disciplines:
› Text mining
› Machine learning
› Data visualisation
› History
› Library and information science
› Social media
Project aims
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 36
What do we want to accomplish?
Social
Media
Semantic
Metadata
Visualisa-
tion
Mining biodiversity
37
» A consortium of botanical and natural history libraries
» Stores digitised legacy literature on biodiversity
» Currently holds 130,000 volumes = millions of pages (PDFs and
OCR-generated text)
» Open-access
Biodiversity Heritage Library (BHL)
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
38
» Supports keyword-based search
» Species annotated and linked to the Encyclopedia of Life
» Integrates automatic taxonomic name finding tools
» Data access through export functionalities andWeb services
BHL: Current features
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
39
BHL: Keyword-based search and Browsing
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
40
BHL: Metadata included in advanced search functionality
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
41
BHL: Page viewing
Page in PDF/image
format
OCR – generated
text
Annotated species
names
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
42
Enhanced BHL: Proposed search functionalities
Faceted search
Time-sensitive
search
Automatically
generated
questions
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
43
Enhanced BHL: Proposed page view
Page in PDF/image
format
OCR – corrected text
with annotations
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 44
Big data analytics: OCR correction and text mining
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 45
Big data analytics: Compilation and visualisation of (evolving) terms
Mining biodiversity
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 46
Big data analytics: Compilation and visualisation of (evolving) terms
Mining biodiversity
Sample OCR errors detected and corrected
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 47
Mining biodiversity
» Original
I mean by habit, that law in virtiie of
which all the actions and the characters
of living beings tend to repeat and to
T)err)etuatf
vi I'REFACE.
themselves, not only in tlie individual but
in its offspring.
» Result
I mean by habit, that law in virtue of
which all the actions and the characters
of living beings tend to repeat and to
perpetuate
vi PREFACE.
themselves, not only in the individual but
in its offspring.
Semantic metadata generation
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 48
Mining biodiversity
» Entity types
› Taxonomic entities
› Geographic locations
› Habitats
› Anatomical entities
› Qualities
› Temporal expressions
› Persons
» Association types
› Observation
› Habitation
› Nutrition
› Trait
Mining biodiversity
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 49
Examples of semantic metadata (annotations)
» Observation
» Habitation
Mining biodiversity
50
Examples of semantic metadata (annotations)
» Nutrition
» Trait
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
51
» Web-based, graphicalTM workbench
» Conforms with the Unstructured Information Management
Architecture (UIMA) standard
» Facilitates the straightforward integration of various analytics into
workflows
» Allows for the validation of annotations
: Automatic annotation by text mining (TM)
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Mining biodiversity
52
Main interface
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
53
Reconfigurable, reusable, modular workflows
Mining biodiversity
ENVO
Catalogue
of Life
PATO
GAZ
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
54
Validation interface
Mining biodiversity
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
55
» Semantic metadata is generated and visualised using big data
analytics
» Enhanced searching through historical archives is facilitated
» Outcomes
› More informative search results
› Discovery of novel associations
In summary…
Mining biodiversity
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Find out more…
Contact…
Riza Batista-Navarro
Research associate, NaCTeM
riza.batista@manchester.ac.uk
nactem.ac.uk/
Big data for lexical research
Jack Grieve, Aston University
» The problem with analyzing the lexicon is that most words are very
rare. For example, a majority of the 100,000 most common words
in English occur on average less than once per 25 million words.
However, even the largest standard linguistic datasets (e.g. the
British National Corpus) are smaller than 100 million words
» To observe the usage of most words, we therefore require access
to incredibly large corpora, which is now possible with the
availability big data
Big data for lexical research
5810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» Today, I’m going to demonstrate how taking advantage of big data
mined from Twitter allows us to study for the first time how newly
emerging words enter and spread within in language
» In particular, I’ll be analysing a 8.9 billion word corpus ofAmerican
Tweets posted by over 7 million different users using geo-enabled
smart phones fromOctober 2013 – November 2014, which was
collected for the Digging into Data Challenge
Big data for lexical research
5910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» To find newly emerging words we looked for words that were very
rare at the start of the period represented by our corpus but that
rose considerably over the course of this period by analysing the
relative frequency of the 67,000 most common words in our corpus
over each day of the corpus
Finding newly emerging words
6010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
6110/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
6210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» “Unbothered by the negativity and foolishness”
» “I starting to enjoying being unbothered”
» “What's that new s**t bitches are saying. Unbothered whatever
that means”
» “I'm always Unbothered I have no need to worry about the
next person.”
» “I'm so unbothered omg I've never felt more in my zone”
» “The FACTThat BeyoncéWas So Unbothered About
Michelle Falling”
Unbothered examples
6310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 64
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 65
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 66
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 67
10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 68
6910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7110/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» In addition to finding newly emerging words, we can also map the
spread of these words across space for the first time, by taking
advantage of the geocoded information provided byTwitter,
which consists of a longitude and latitude for each tweet
Mapping newly emerging words
7210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7410/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7510/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7710/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
7910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
8010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
8110/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
8210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
8310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
8410/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
8510/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
» By taking of advantage of big data we are thus able to investigate
language in far greater detail than was previously possible, including
identifying and mapping the spread of newly emerging words
» Big data is therefore incredibly useful for understanding complex
systems that involve very large numbers of rare events, including
the lexicon of modern languages
Conclusion
8610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
Find out more…
Contact…
Jack Grieve
Aston University
j.grieve1@aston.ac.uk
@JWGrieve

Contenu connexe

Tendances

Harnessing the power of indoor positioning technology - Jisc Digital Festival...
Harnessing the power of indoor positioning technology - Jisc Digital Festival...Harnessing the power of indoor positioning technology - Jisc Digital Festival...
Harnessing the power of indoor positioning technology - Jisc Digital Festival...Jisc
 
The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016Jisc
 
Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13 Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13 PaolaMarchionni
 
SafeShare - Networkshop44
SafeShare - Networkshop44SafeShare - Networkshop44
SafeShare - Networkshop44Jisc
 
Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...
Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...
Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...Jisc
 
Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Martin Hamilton
 
Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...
Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...
Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...Jisc
 
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...Total cost of ownership: reducing the cost of gold open access - Jisc Digital...
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...Jisc
 
Finding the right cloud solution for your organisation
Finding the right cloud solution for your organisationFinding the right cloud solution for your organisation
Finding the right cloud solution for your organisationJisc
 
Telephony is changing - is your institution ready? - Jisc Digital Festival 2015
Telephony is changing - is your institution ready? - Jisc Digital Festival 2015 Telephony is changing - is your institution ready? - Jisc Digital Festival 2015
Telephony is changing - is your institution ready? - Jisc Digital Festival 2015 Jisc
 
Research data spring: clipper
Research data spring: clipperResearch data spring: clipper
Research data spring: clipperJisc RDM
 
The Janet network: your digital utility - Jisc Digifest 2016
The Janet network: your digital utility - Jisc Digifest 2016The Janet network: your digital utility - Jisc Digifest 2016
The Janet network: your digital utility - Jisc Digifest 2016Jisc
 
BRISSKit: biomedical research made easy - Jisc Digital Festival 2015
BRISSKit: biomedical research made easy - Jisc Digital Festival 2015BRISSKit: biomedical research made easy - Jisc Digital Festival 2015
BRISSKit: biomedical research made easy - Jisc Digital Festival 2015Jisc
 
The user -driven evolution of Janet - Jisc Digifest 2016
The user -driven evolution of Janet - Jisc Digifest 2016The user -driven evolution of Janet - Jisc Digifest 2016
The user -driven evolution of Janet - Jisc Digifest 2016Jisc
 
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015Martin Hamilton
 
Open Science at the European Commission
Open Science at the European CommissionOpen Science at the European Commission
Open Science at the European CommissionCarl-Christian Buhr
 
Introducing the Jisc National HPC Agreement
Introducing the Jisc National HPC AgreementIntroducing the Jisc National HPC Agreement
Introducing the Jisc National HPC AgreementMartin Hamilton
 
Reading lists as open data - Meeting the Reading List Challenge 2016
Reading lists as open data - Meeting the Reading List Challenge 2016Reading lists as open data - Meeting the Reading List Challenge 2016
Reading lists as open data - Meeting the Reading List Challenge 2016Martin Hamilton
 
Directions in research data management - Jisc Digital Festival 2015
Directions in research data management - Jisc Digital Festival 2015Directions in research data management - Jisc Digital Festival 2015
Directions in research data management - Jisc Digital Festival 2015Jisc
 
HPC in the cloud comes of age - Red Oak HPC Seminar
HPC in the cloud comes of age - Red Oak HPC SeminarHPC in the cloud comes of age - Red Oak HPC Seminar
HPC in the cloud comes of age - Red Oak HPC SeminarMartin Hamilton
 

Tendances (20)

Harnessing the power of indoor positioning technology - Jisc Digital Festival...
Harnessing the power of indoor positioning technology - Jisc Digital Festival...Harnessing the power of indoor positioning technology - Jisc Digital Festival...
Harnessing the power of indoor positioning technology - Jisc Digital Festival...
 
The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016
 
Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13 Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13
 
SafeShare - Networkshop44
SafeShare - Networkshop44SafeShare - Networkshop44
SafeShare - Networkshop44
 
Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...
Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...
Getting value from institutional repositories: IRUS UK - Jisc Digital Festiva...
 
Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?
 
Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...
Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...
Using jisc's JUSP and CCM services effectively to manage resources - Jisc Dig...
 
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...Total cost of ownership: reducing the cost of gold open access - Jisc Digital...
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...
 
Finding the right cloud solution for your organisation
Finding the right cloud solution for your organisationFinding the right cloud solution for your organisation
Finding the right cloud solution for your organisation
 
Telephony is changing - is your institution ready? - Jisc Digital Festival 2015
Telephony is changing - is your institution ready? - Jisc Digital Festival 2015 Telephony is changing - is your institution ready? - Jisc Digital Festival 2015
Telephony is changing - is your institution ready? - Jisc Digital Festival 2015
 
Research data spring: clipper
Research data spring: clipperResearch data spring: clipper
Research data spring: clipper
 
The Janet network: your digital utility - Jisc Digifest 2016
The Janet network: your digital utility - Jisc Digifest 2016The Janet network: your digital utility - Jisc Digifest 2016
The Janet network: your digital utility - Jisc Digifest 2016
 
BRISSKit: biomedical research made easy - Jisc Digital Festival 2015
BRISSKit: biomedical research made easy - Jisc Digital Festival 2015BRISSKit: biomedical research made easy - Jisc Digital Festival 2015
BRISSKit: biomedical research made easy - Jisc Digital Festival 2015
 
The user -driven evolution of Janet - Jisc Digifest 2016
The user -driven evolution of Janet - Jisc Digifest 2016The user -driven evolution of Janet - Jisc Digifest 2016
The user -driven evolution of Janet - Jisc Digifest 2016
 
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015
 
Open Science at the European Commission
Open Science at the European CommissionOpen Science at the European Commission
Open Science at the European Commission
 
Introducing the Jisc National HPC Agreement
Introducing the Jisc National HPC AgreementIntroducing the Jisc National HPC Agreement
Introducing the Jisc National HPC Agreement
 
Reading lists as open data - Meeting the Reading List Challenge 2016
Reading lists as open data - Meeting the Reading List Challenge 2016Reading lists as open data - Meeting the Reading List Challenge 2016
Reading lists as open data - Meeting the Reading List Challenge 2016
 
Directions in research data management - Jisc Digital Festival 2015
Directions in research data management - Jisc Digital Festival 2015Directions in research data management - Jisc Digital Festival 2015
Directions in research data management - Jisc Digital Festival 2015
 
HPC in the cloud comes of age - Red Oak HPC Seminar
HPC in the cloud comes of age - Red Oak HPC SeminarHPC in the cloud comes of age - Red Oak HPC Seminar
HPC in the cloud comes of age - Red Oak HPC Seminar
 

En vedette

Get involved with codesign - Jisc Digital Festival 2015
Get involved with codesign - Jisc Digital Festival 2015Get involved with codesign - Jisc Digital Festival 2015
Get involved with codesign - Jisc Digital Festival 2015Jisc
 
How technology can help top prepare learners for the world of work - Jisc Dig...
How technology can help top prepare learners for the world of work - Jisc Dig...How technology can help top prepare learners for the world of work - Jisc Dig...
How technology can help top prepare learners for the world of work - Jisc Dig...Jisc
 
Open access: changes in the global research market - Jisc Digital Festival 2015
Open access: changes in the global research market - Jisc Digital Festival 2015Open access: changes in the global research market - Jisc Digital Festival 2015
Open access: changes in the global research market - Jisc Digital Festival 2015Jisc
 
Uncovering research - what's the standard - Jisc Digital Festival 2015
Uncovering research - what's the standard - Jisc Digital Festival 2015Uncovering research - what's the standard - Jisc Digital Festival 2015
Uncovering research - what's the standard - Jisc Digital Festival 2015Jisc
 
Mobile learning in practice - Jisc Digital Festival 2015
Mobile learning in practice - Jisc Digital Festival 2015Mobile learning in practice - Jisc Digital Festival 2015
Mobile learning in practice - Jisc Digital Festival 2015Jisc
 
Get involved - Jisc Digital Festival 2015
Get involved - Jisc Digital Festival 2015Get involved - Jisc Digital Festival 2015
Get involved - Jisc Digital Festival 2015Jisc
 
Good practice in learning analytics - Jisc Digital Festival 2015
Good practice in learning analytics - Jisc Digital Festival 2015Good practice in learning analytics - Jisc Digital Festival 2015
Good practice in learning analytics - Jisc Digital Festival 2015Jisc
 
Maximised discovery of institutions digital collections - Jisc Digital Festiv...
Maximised discovery of institutions digital collections - Jisc Digital Festiv...Maximised discovery of institutions digital collections - Jisc Digital Festiv...
Maximised discovery of institutions digital collections - Jisc Digital Festiv...Jisc
 
The cost of curation - Jisc Digital Festival 2015
The cost of curation - Jisc Digital Festival 2015The cost of curation - Jisc Digital Festival 2015
The cost of curation - Jisc Digital Festival 2015Jisc
 
Open access, universities as publishers - Jisc Digital Festival 2015
Open access, universities as publishers - Jisc Digital Festival 2015Open access, universities as publishers - Jisc Digital Festival 2015
Open access, universities as publishers - Jisc Digital Festival 2015Jisc
 
Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015Jisc
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
Staff-student partnership working to effect institutional change - Jisc Digit...
Staff-student partnership working to effect institutional change - Jisc Digit...Staff-student partnership working to effect institutional change - Jisc Digit...
Staff-student partnership working to effect institutional change - Jisc Digit...Jisc
 
Call for participants - Jisc Digital Festival 2015
Call for participants - Jisc Digital Festival 2015Call for participants - Jisc Digital Festival 2015
Call for participants - Jisc Digital Festival 2015Jisc
 
Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015
Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015
Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015Jisc
 
Are learning technologies fit for purpose?
Are learning technologies fit for purpose?Are learning technologies fit for purpose?
Are learning technologies fit for purpose?Jisc
 
Telephony is changing - Jisc Digital Festival 2015
Telephony is changing - Jisc Digital Festival 2015Telephony is changing - Jisc Digital Festival 2015
Telephony is changing - Jisc Digital Festival 2015Jisc
 
Risk Management - Jisc Digital Festival 2015
Risk Management - Jisc Digital Festival 2015Risk Management - Jisc Digital Festival 2015
Risk Management - Jisc Digital Festival 2015Jisc
 
The changing role of the IT leader - Jisc Digital Festival 2015
The changing role of the IT leader - Jisc Digital Festival 2015The changing role of the IT leader - Jisc Digital Festival 2015
The changing role of the IT leader - Jisc Digital Festival 2015Jisc
 
Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...
Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...
Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...Jisc
 

En vedette (20)

Get involved with codesign - Jisc Digital Festival 2015
Get involved with codesign - Jisc Digital Festival 2015Get involved with codesign - Jisc Digital Festival 2015
Get involved with codesign - Jisc Digital Festival 2015
 
How technology can help top prepare learners for the world of work - Jisc Dig...
How technology can help top prepare learners for the world of work - Jisc Dig...How technology can help top prepare learners for the world of work - Jisc Dig...
How technology can help top prepare learners for the world of work - Jisc Dig...
 
Open access: changes in the global research market - Jisc Digital Festival 2015
Open access: changes in the global research market - Jisc Digital Festival 2015Open access: changes in the global research market - Jisc Digital Festival 2015
Open access: changes in the global research market - Jisc Digital Festival 2015
 
Uncovering research - what's the standard - Jisc Digital Festival 2015
Uncovering research - what's the standard - Jisc Digital Festival 2015Uncovering research - what's the standard - Jisc Digital Festival 2015
Uncovering research - what's the standard - Jisc Digital Festival 2015
 
Mobile learning in practice - Jisc Digital Festival 2015
Mobile learning in practice - Jisc Digital Festival 2015Mobile learning in practice - Jisc Digital Festival 2015
Mobile learning in practice - Jisc Digital Festival 2015
 
Get involved - Jisc Digital Festival 2015
Get involved - Jisc Digital Festival 2015Get involved - Jisc Digital Festival 2015
Get involved - Jisc Digital Festival 2015
 
Good practice in learning analytics - Jisc Digital Festival 2015
Good practice in learning analytics - Jisc Digital Festival 2015Good practice in learning analytics - Jisc Digital Festival 2015
Good practice in learning analytics - Jisc Digital Festival 2015
 
Maximised discovery of institutions digital collections - Jisc Digital Festiv...
Maximised discovery of institutions digital collections - Jisc Digital Festiv...Maximised discovery of institutions digital collections - Jisc Digital Festiv...
Maximised discovery of institutions digital collections - Jisc Digital Festiv...
 
The cost of curation - Jisc Digital Festival 2015
The cost of curation - Jisc Digital Festival 2015The cost of curation - Jisc Digital Festival 2015
The cost of curation - Jisc Digital Festival 2015
 
Open access, universities as publishers - Jisc Digital Festival 2015
Open access, universities as publishers - Jisc Digital Festival 2015Open access, universities as publishers - Jisc Digital Festival 2015
Open access, universities as publishers - Jisc Digital Festival 2015
 
Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
Staff-student partnership working to effect institutional change - Jisc Digit...
Staff-student partnership working to effect institutional change - Jisc Digit...Staff-student partnership working to effect institutional change - Jisc Digit...
Staff-student partnership working to effect institutional change - Jisc Digit...
 
Call for participants - Jisc Digital Festival 2015
Call for participants - Jisc Digital Festival 2015Call for participants - Jisc Digital Festival 2015
Call for participants - Jisc Digital Festival 2015
 
Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015
Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015
Showcasing uk teaching resources: Jorum - Jisc Digital Festival 2015
 
Are learning technologies fit for purpose?
Are learning technologies fit for purpose?Are learning technologies fit for purpose?
Are learning technologies fit for purpose?
 
Telephony is changing - Jisc Digital Festival 2015
Telephony is changing - Jisc Digital Festival 2015Telephony is changing - Jisc Digital Festival 2015
Telephony is changing - Jisc Digital Festival 2015
 
Risk Management - Jisc Digital Festival 2015
Risk Management - Jisc Digital Festival 2015Risk Management - Jisc Digital Festival 2015
Risk Management - Jisc Digital Festival 2015
 
The changing role of the IT leader - Jisc Digital Festival 2015
The changing role of the IT leader - Jisc Digital Festival 2015The changing role of the IT leader - Jisc Digital Festival 2015
The changing role of the IT leader - Jisc Digital Festival 2015
 
Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...
Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...
Internet safety - how Jisc is helping providers to stay safe online - Jisc Di...
 

Similaire à Big data and the dark arts - Jisc Digital Media 2015

Rising tide of data update 20171024
Rising tide of data update 20171024Rising tide of data update 20171024
Rising tide of data update 20171024Keith Russell
 
Rising tide of data update
Rising tide of data update Rising tide of data update
Rising tide of data update ARDC
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhurymaredata
 
Open Data and Big Data Capacity Building Initiative
Open Data and Big Data Capacity Building InitiativeOpen Data and Big Data Capacity Building Initiative
Open Data and Big Data Capacity Building InitiativeCIARD Movement
 
Introducing Data and Text Mining at DigiFest
Introducing Data and Text Mining at DigiFestIntroducing Data and Text Mining at DigiFest
Introducing Data and Text Mining at DigiFestJisc RDM
 
Goebel.jst.big.data.jan10 12.2017.4
Goebel.jst.big.data.jan10 12.2017.4Goebel.jst.big.data.jan10 12.2017.4
Goebel.jst.big.data.jan10 12.2017.4Randy Goebel
 
e-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE Francee-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE FranceJean-François Lutz
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor networkparry prabhu
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
EPFL Open Research Data - a Jisc perspective
EPFL Open Research Data - a Jisc perspectiveEPFL Open Research Data - a Jisc perspective
EPFL Open Research Data - a Jisc perspectiveChristopher Brown
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policyHistoric Environment Scotland
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014
 
COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)Comit Projects Ltd
 
Text and data mining - the opportunities and the EU conundrum - why aren’t we...
Text and data mining - the opportunities and the EU conundrum - why aren’t we...Text and data mining - the opportunities and the EU conundrum - why aren’t we...
Text and data mining - the opportunities and the EU conundrum - why aren’t we...FutureTDM
 
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...Katie Whipkey
 

Similaire à Big data and the dark arts - Jisc Digital Media 2015 (20)

Rising tide of data update 20171024
Rising tide of data update 20171024Rising tide of data update 20171024
Rising tide of data update 20171024
 
Rising tide of data update
Rising tide of data update Rising tide of data update
Rising tide of data update
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
 
Open Data and Big Data Capacity Building Initiative
Open Data and Big Data Capacity Building InitiativeOpen Data and Big Data Capacity Building Initiative
Open Data and Big Data Capacity Building Initiative
 
Introducing Data and Text Mining at DigiFest
Introducing Data and Text Mining at DigiFestIntroducing Data and Text Mining at DigiFest
Introducing Data and Text Mining at DigiFest
 
Goebel.jst.big.data.jan10 12.2017.4
Goebel.jst.big.data.jan10 12.2017.4Goebel.jst.big.data.jan10 12.2017.4
Goebel.jst.big.data.jan10 12.2017.4
 
CODATA, Open Science Policies and Capacity Building by Simon Hodson
CODATA, Open Science Policies and Capacity Building by Simon HodsonCODATA, Open Science Policies and Capacity Building by Simon Hodson
CODATA, Open Science Policies and Capacity Building by Simon Hodson
 
e-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE Francee-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE France
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
WORLD CAT AS BIG DATA
WORLD CAT AS  BIG DATAWORLD CAT AS  BIG DATA
WORLD CAT AS BIG DATA
 
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon HodsonCODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
 
EPFL Open Research Data - a Jisc perspective
EPFL Open Research Data - a Jisc perspectiveEPFL Open Research Data - a Jisc perspective
EPFL Open Research Data - a Jisc perspective
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policy
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
 
COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)COMIT Sept 2016 - Open Data (Paul Wilkinson)
COMIT Sept 2016 - Open Data (Paul Wilkinson)
 
Text and data mining - the opportunities and the EU conundrum - why aren’t we...
Text and data mining - the opportunities and the EU conundrum - why aren’t we...Text and data mining - the opportunities and the EU conundrum - why aren’t we...
Text and data mining - the opportunities and the EU conundrum - why aren’t we...
 
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
Guidance for Incorporating Big Data into Humanitarian Operations - 2015 - web...
 

Plus de Jisc

Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
International students’ digital experience: understanding and mitigating the ...
International students’ digital experience: understanding and mitigating the ...International students’ digital experience: understanding and mitigating the ...
International students’ digital experience: understanding and mitigating the ...Jisc
 
Digital Storytelling Community Launch!.pptx
Digital Storytelling Community Launch!.pptxDigital Storytelling Community Launch!.pptx
Digital Storytelling Community Launch!.pptxJisc
 
Open Access book publishing understanding your options (1).pptx
Open Access book publishing understanding your options (1).pptxOpen Access book publishing understanding your options (1).pptx
Open Access book publishing understanding your options (1).pptxJisc
 
Scottish Universities Press supporting authors with requirements for open acc...
Scottish Universities Press supporting authors with requirements for open acc...Scottish Universities Press supporting authors with requirements for open acc...
Scottish Universities Press supporting authors with requirements for open acc...Jisc
 
How Bloomsbury is supporting authors with UKRI long-form open access requirem...
How Bloomsbury is supporting authors with UKRI long-form open access requirem...How Bloomsbury is supporting authors with UKRI long-form open access requirem...
How Bloomsbury is supporting authors with UKRI long-form open access requirem...Jisc
 
Jisc Northern Ireland Strategy Forum 2023
Jisc Northern Ireland Strategy Forum 2023Jisc Northern Ireland Strategy Forum 2023
Jisc Northern Ireland Strategy Forum 2023Jisc
 
Jisc Scotland Strategy Forum 2023
Jisc Scotland Strategy Forum 2023Jisc Scotland Strategy Forum 2023
Jisc Scotland Strategy Forum 2023Jisc
 
Jisc stakeholder strategic update 2023
Jisc stakeholder strategic update 2023Jisc stakeholder strategic update 2023
Jisc stakeholder strategic update 2023Jisc
 
JISC Presentation.pptx
JISC Presentation.pptxJISC Presentation.pptx
JISC Presentation.pptxJisc
 
Community-led Open Access Publishing webinar.pptx
Community-led Open Access Publishing webinar.pptxCommunity-led Open Access Publishing webinar.pptx
Community-led Open Access Publishing webinar.pptxJisc
 
The Open Access Community Framework (OACF) 2023 (1).pptx
The Open Access Community Framework (OACF) 2023 (1).pptxThe Open Access Community Framework (OACF) 2023 (1).pptx
The Open Access Community Framework (OACF) 2023 (1).pptxJisc
 
Are we onboard yet University of Sussex.pptx
Are we onboard yet University of Sussex.pptxAre we onboard yet University of Sussex.pptx
Are we onboard yet University of Sussex.pptxJisc
 
JiscOAWeek_LAIR_slides_October2023.pptx
JiscOAWeek_LAIR_slides_October2023.pptxJiscOAWeek_LAIR_slides_October2023.pptx
JiscOAWeek_LAIR_slides_October2023.pptxJisc
 
UWP OA Week Presentation (1).pptx
UWP OA Week Presentation (1).pptxUWP OA Week Presentation (1).pptx
UWP OA Week Presentation (1).pptxJisc
 
An introduction to Cyber Essentials
An introduction to Cyber EssentialsAn introduction to Cyber Essentials
An introduction to Cyber EssentialsJisc
 

Plus de Jisc (20)

Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
International students’ digital experience: understanding and mitigating the ...
International students’ digital experience: understanding and mitigating the ...International students’ digital experience: understanding and mitigating the ...
International students’ digital experience: understanding and mitigating the ...
 
Digital Storytelling Community Launch!.pptx
Digital Storytelling Community Launch!.pptxDigital Storytelling Community Launch!.pptx
Digital Storytelling Community Launch!.pptx
 
Open Access book publishing understanding your options (1).pptx
Open Access book publishing understanding your options (1).pptxOpen Access book publishing understanding your options (1).pptx
Open Access book publishing understanding your options (1).pptx
 
Scottish Universities Press supporting authors with requirements for open acc...
Scottish Universities Press supporting authors with requirements for open acc...Scottish Universities Press supporting authors with requirements for open acc...
Scottish Universities Press supporting authors with requirements for open acc...
 
How Bloomsbury is supporting authors with UKRI long-form open access requirem...
How Bloomsbury is supporting authors with UKRI long-form open access requirem...How Bloomsbury is supporting authors with UKRI long-form open access requirem...
How Bloomsbury is supporting authors with UKRI long-form open access requirem...
 
Jisc Northern Ireland Strategy Forum 2023
Jisc Northern Ireland Strategy Forum 2023Jisc Northern Ireland Strategy Forum 2023
Jisc Northern Ireland Strategy Forum 2023
 
Jisc Scotland Strategy Forum 2023
Jisc Scotland Strategy Forum 2023Jisc Scotland Strategy Forum 2023
Jisc Scotland Strategy Forum 2023
 
Jisc stakeholder strategic update 2023
Jisc stakeholder strategic update 2023Jisc stakeholder strategic update 2023
Jisc stakeholder strategic update 2023
 
JISC Presentation.pptx
JISC Presentation.pptxJISC Presentation.pptx
JISC Presentation.pptx
 
Community-led Open Access Publishing webinar.pptx
Community-led Open Access Publishing webinar.pptxCommunity-led Open Access Publishing webinar.pptx
Community-led Open Access Publishing webinar.pptx
 
The Open Access Community Framework (OACF) 2023 (1).pptx
The Open Access Community Framework (OACF) 2023 (1).pptxThe Open Access Community Framework (OACF) 2023 (1).pptx
The Open Access Community Framework (OACF) 2023 (1).pptx
 
Are we onboard yet University of Sussex.pptx
Are we onboard yet University of Sussex.pptxAre we onboard yet University of Sussex.pptx
Are we onboard yet University of Sussex.pptx
 
JiscOAWeek_LAIR_slides_October2023.pptx
JiscOAWeek_LAIR_slides_October2023.pptxJiscOAWeek_LAIR_slides_October2023.pptx
JiscOAWeek_LAIR_slides_October2023.pptx
 
UWP OA Week Presentation (1).pptx
UWP OA Week Presentation (1).pptxUWP OA Week Presentation (1).pptx
UWP OA Week Presentation (1).pptx
 
An introduction to Cyber Essentials
An introduction to Cyber EssentialsAn introduction to Cyber Essentials
An introduction to Cyber Essentials
 

Dernier

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 

Dernier (20)

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 

Big data and the dark arts - Jisc Digital Media 2015

  • 1.
  • 2. Big data and the dark arts: Demystifying the world of big data Catherine Grout, Jisc
  • 4. » Introduction to the topic and its importance education and research » Presentations from some key projects at the coal face of this issue › COSMOS - Collaborative online social media observatory (Pete Burnap) › Mining Biodiversity - Enriching biodiversity heritage with text mining and social media (Riza Batista-Navarro) › Trees andTweets - combining twitter datawith family trees- (JackGrieve) Structure of session 410/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 5. In 2012, Gartner updated its definition as follows: "Big data is high volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization."[16] Additionally, a newV "Veracity" is added by some organizations to describe it.[17] (http://en.wikipedia.org/wiki/Big_data) 5
  • 6. » Better use of Big data through high performance analytics could add £216 billion to the UK economy by 2017 (CEBR via sas.com) » Data has moved from a backroom issue to a boardroom issue (strategy insight and competitive advantage) chiefdataofficersummit.com/ » Therefore data ownership also a very important issue » Tim Berners-Lee (as paraphrased in Guardian): “the data we create about ourselves should be owned by each of us, not the large companies that harvest it” theguardian.com/technology/2014/oct/08/sir-tim-berners-lee- speaks-out-on-data-ownership Big data: big issue 610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 7. » Total investment is in the region of £550 m (2012-15) » This is across all 7 research councils but also includes collaborative programmes (17 programmes) » Includes production of: › Methodologies, tools and new aggregated datasets › Infrastructure - giving access to public and private data › Infrastructure - providing storage, compute › Centres of Expertise - Capacity and skills development » RCUK overview of Big data investments rcuk.ac.uk/research/infrastructure/big-data/ RCUK “Big data” investment overview 710/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 8. » Power » Responsibility » Opportunity Big data for Universities 810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 9. » Enterprise Data: about learners, researchers and staff and the University as a business (including research grants) › Held in structuredsystems,databasesbutmaybe notall interoperable » Research Data (generally not structured or centrally held, Jisc supporting universities to address this challenge “Research at Risk”) › But Open Access publications (and some other material) in Institutional Repositories (about 125 universities have one) » Sensitive Data (e.g. medical data – securenetworks,anonymisedetc.) » Activity data (data about performance, benchmarking, student and researcher behaviour) Big data for Universities 910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 10. » Big data enables much better analytics - Key area for universities and for Jisc to support » Jisc-HESA Business Intelligence Service (in development) » LAMP (shared academic library analytics service) » Effective Learner Analytics challenge » All designed to help support effective analytics at institutional and national (aggregate level) Big data: analytics 1010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 11. 11 » “Your recent Amazon purchases, Tweet score and location history makes you 23.5% welcome here.” (Cartoon critical of big data application, byT. Gregorius en.wikipedia.org/wiki/Big_data) 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 12. » Big data research is not all about analysing very big data » It can be about bringing data together from different sources » It can be about techniques from the big data field to build more interesting ways of interacting with digital libraries » It can be about using and building new techniques, tools to interact with data and address research questions » Project presentations will illustrate this Big data: For research 1210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 13. » Issues around curation and preservation of research data (variable size and condition) » Performance of infrastructure required » Why should we share and re-use research data? » What tools, methodologies, techniques can be used? » Do researchers have the rights skills to exploit data effectively » How does all of the above impact on research and the research process? Big data: For research 1310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 14. » Two of the projects presenting today are part of Digging into Data Challenge » Digging into Data has been addressing many of the challenges that were flagged earlier » Digging into Data brings together 10 funders in four countries (UK, US, Canada, NL) » 36 projects funded since 2011 » Addresses “big data for research” in the humanities and social sciences Big data: Digging into data Machine Anatomy 101 - UK funders & unviersities 17/10/2013 14
  • 15. » Pioneered and legitimised big data based research in the humanities – for computer scientists and others. (from zero to hero) » “digital humanities” and “computational social sciences” working together » Engaged GLAM sector and others and encourage them to make their data available in forms useful to researchers and to work with them (encourages joint data curation) Digging into data:Achievements so far 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 15
  • 16. » Progress on the policy side toward reforming copyright and IP to allow for big data research on cultural heritage materials - (more to do here) » International & multidisciplinary cooperation had high impact (more than anticipated). Increased visibility also strengthened research bringing new teams together) Digging into data:Achievements so far 1610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 17. » Bringing data together to make Big data can create exciting research opportunities » Article in Nature 2013 » Mummies reveal that clogged arteries plagued the ancient world » Based on Digging into Data programme project that brought together CT scans on 137 mummies from four very different ancient populations: Egyptian, Peruvian, the Ancestral Puebloans of southwest America and the Unangans of the Aleutian Islands in Alaska » nature.com/news/mummies-reveal-that-clogged-arteries- plagued-the-ancient-world-1.12568 Big data: For research 1710/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 18. Big data: For research 1810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 19. » “Big data” covers a very wide set of activities » But has and is inspiring major investments and changes in practice » Jisc is helping to support institutions in making the most of big data through: › Developing shared services, advice and guidance to help manage research data effectively and comply with funders requirements (Research at Risk Challenge) › Promoting effective use of data analytics and delivering some key analytics services › Working with the Research Councils to help exploit the benefits of big data for research Big data: In summary 1910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 20. Gerd Leonhard , Big Data and the Future of Journalismflickr.com/photos/gleonhard/8978372783/ 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 21. Find out more… Contact… Catherine Grout Head of change – research, Jisc catherine.grout@jisc.ac.uk
  • 22. Collaborative Online Social Media Observatory COSMOS Dr. Pete Burnap (@pbFeed) Cardiff School of Computer Science and Informatics Cardiff University, UK With Matthew Williams, Jeffrey Morgan, Omer Rana, Luke Sloan, Alex Voss Adam Edwards, William Housley and Rob Procter
  • 23. What is COSMOS? • Aim to establish a coordinated interdisciplinary response to “Big Social Data” • Led from Cardiff (Computer Science and Social Sciences), Warwick and St. Andrews • Additional input from Edinburgh, UCL, Leeds, Manchester and Wolverhampton • Brings together social, computer, political, health and mathematical scientists to study the methodological, theoretical, and empirical dimensions of Big Data in technical, social and policy contexts • Developing a research programme to help understand and explain how social processes and interactions manifest on the Web, with a focus upon the challenges posed by big social data to government, digital economy and civil society, • Development of new methodological tools and technical/data solutions for UK academia and public sector…a Web Observatory
  • 24. What is COSMOS? • COSMOS has attracted 17 research grants amounting to over £1.25M in funding from JISC/ESRC/EPSRC/AHRC/and £500K from the public and private sectors (DoH/FSA/HPC Wales). • A significant proportion of these funds have been awarded to collect and analyse social media data in the contexts of Societal Safety and Security e.g. social tension, hate speech, crime reporting and fear of crime, suicidal ideation
  • 25. Research Programme Digital Social Research Tools, Tension Indicators and Safer Communities: A demonstration of COSMOS (ESRC DSR) COSMOS: Supporting Empirical Social Scientific Research with a Virtual Research Environment (JISC) Small items of research equipment at Cardiff University (EPSRC) Hate Speech and Social Media: Understanding Users, Networks and Information Flows (ESRC Google) Social Media and Prediction: Crime Sensing, Data Integration and Statistical Modelling (ESRC NCRM) Understanding the Role of Social Media in the Aftermath of Youth Suicides (Department of Health) Scaling the Computational Analysis of “Big Social Data” & Massive Temporal Social Media Datasets (HPC Wales) Digital Wildfire: (Mis)information flows, propagation and responsible governance, (ESRC Global Uncertainties) Public perceptions of the UK food system: public understanding and engagement, and the impact of crises and scares (ESRC/FSA) 2011 2016
  • 26. COSMOS Web Observatory Integrated Open (“plug and play”) Scalable (MongoDB data stores/ Hadoop Back End) Burnap, P. et al. (2014) ‘COSMOS: Towards an Integrated and Scalable Service for Analyzing Social Media on Demand’, International Journal of Parallel, Emergent and Distributed Systems Usable – developed with social scientists for social scientists Reproducible/Citable Research - export/share workflow
  • 27. Web Observatory Features • Data Collection – Persistent connection to Twitter 1% Stream (~4 billion) – ONS/Police API – Drag and drop RSS – Import CSV/JSON • Data Transformation – Word Frequency – Point data frequency over time – Social Network Analysis – Geospatial Clustering – Sentiment Analysis – …API to plug new modules and benchmark tools
  • 30. COSMOS Infrastructure COSMOS Desktop •Small local datasets •Users’ API credentials •Local analysis •Sept ‘14 launch (>100 dl’s in 17 countries) COSMOS Cloud •Scalable storage • Massive datasets •Scalable compute • On-demand nodes • Fast search & retrieve • Fast analysis •Workflow management •Collaboration support •2015 launch
  • 31. Web Observatory Examples • Policy/impact driven (benefit to society/economy) • Focus on ethical research into human safety and security • Augment terrestrial methods • Comparison to existing methods • Experimental applied stats & machine learning • Provide examples of machine intelligence tasks integrated into social research workflow… • Radio 5 Live Hit List (#5LiveHitList) - biggest impact stories across social media and online
  • 33. Mining Biodiversity: Enriching biodiversity literature with OCR corrections and text-mined semantic metadata Riza Batista-Navarro National centre for text mining, University of Manchester
  • 34. Mining biodiversity 34 The Partners A A B B C C D D Social Media Lab E E 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 35. Mining biodiversity 35 » Transform BHL into a next-generation social digital library » Bring together strengths from multiple disciplines: › Text mining › Machine learning › Data visualisation › History › Library and information science › Social media Project aims 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 36. Mining biodiversity 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 36 What do we want to accomplish? Social Media Semantic Metadata Visualisa- tion
  • 37. Mining biodiversity 37 » A consortium of botanical and natural history libraries » Stores digitised legacy literature on biodiversity » Currently holds 130,000 volumes = millions of pages (PDFs and OCR-generated text) » Open-access Biodiversity Heritage Library (BHL) 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 38. Mining biodiversity 38 » Supports keyword-based search » Species annotated and linked to the Encyclopedia of Life » Integrates automatic taxonomic name finding tools » Data access through export functionalities andWeb services BHL: Current features 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 39. Mining biodiversity 39 BHL: Keyword-based search and Browsing 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 40. Mining biodiversity 40 BHL: Metadata included in advanced search functionality 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 41. Mining biodiversity 41 BHL: Page viewing Page in PDF/image format OCR – generated text Annotated species names 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 42. Mining biodiversity 42 Enhanced BHL: Proposed search functionalities Faceted search Time-sensitive search Automatically generated questions 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 43. Mining biodiversity 43 Enhanced BHL: Proposed page view Page in PDF/image format OCR – corrected text with annotations 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 44. Mining biodiversity 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 44 Big data analytics: OCR correction and text mining
  • 45. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 45 Big data analytics: Compilation and visualisation of (evolving) terms Mining biodiversity
  • 46. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 46 Big data analytics: Compilation and visualisation of (evolving) terms Mining biodiversity
  • 47. Sample OCR errors detected and corrected 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 47 Mining biodiversity » Original I mean by habit, that law in virtiie of which all the actions and the characters of living beings tend to repeat and to T)err)etuatf vi I'REFACE. themselves, not only in tlie individual but in its offspring. » Result I mean by habit, that law in virtue of which all the actions and the characters of living beings tend to repeat and to perpetuate vi PREFACE. themselves, not only in the individual but in its offspring.
  • 48. Semantic metadata generation 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 48 Mining biodiversity » Entity types › Taxonomic entities › Geographic locations › Habitats › Anatomical entities › Qualities › Temporal expressions › Persons » Association types › Observation › Habitation › Nutrition › Trait
  • 49. Mining biodiversity 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 49 Examples of semantic metadata (annotations) » Observation » Habitation
  • 50. Mining biodiversity 50 Examples of semantic metadata (annotations) » Nutrition » Trait 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 51. Mining biodiversity 51 » Web-based, graphicalTM workbench » Conforms with the Unstructured Information Management Architecture (UIMA) standard » Facilitates the straightforward integration of various analytics into workflows » Allows for the validation of annotations : Automatic annotation by text mining (TM) 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 52. Mining biodiversity 52 Main interface 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 53. 53 Reconfigurable, reusable, modular workflows Mining biodiversity ENVO Catalogue of Life PATO GAZ 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 54. 54 Validation interface Mining biodiversity 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 55. 55 » Semantic metadata is generated and visualised using big data analytics » Enhanced searching through historical archives is facilitated » Outcomes › More informative search results › Discovery of novel associations In summary… Mining biodiversity 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 56. Find out more… Contact… Riza Batista-Navarro Research associate, NaCTeM riza.batista@manchester.ac.uk nactem.ac.uk/
  • 57. Big data for lexical research Jack Grieve, Aston University
  • 58. » The problem with analyzing the lexicon is that most words are very rare. For example, a majority of the 100,000 most common words in English occur on average less than once per 25 million words. However, even the largest standard linguistic datasets (e.g. the British National Corpus) are smaller than 100 million words » To observe the usage of most words, we therefore require access to incredibly large corpora, which is now possible with the availability big data Big data for lexical research 5810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 59. » Today, I’m going to demonstrate how taking advantage of big data mined from Twitter allows us to study for the first time how newly emerging words enter and spread within in language » In particular, I’ll be analysing a 8.9 billion word corpus ofAmerican Tweets posted by over 7 million different users using geo-enabled smart phones fromOctober 2013 – November 2014, which was collected for the Digging into Data Challenge Big data for lexical research 5910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 60. » To find newly emerging words we looked for words that were very rare at the start of the period represented by our corpus but that rose considerably over the course of this period by analysing the relative frequency of the 67,000 most common words in our corpus over each day of the corpus Finding newly emerging words 6010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 61. 6110/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 62. 6210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 63. » “Unbothered by the negativity and foolishness” » “I starting to enjoying being unbothered” » “What's that new s**t bitches are saying. Unbothered whatever that means” » “I'm always Unbothered I have no need to worry about the next person.” » “I'm so unbothered omg I've never felt more in my zone” » “The FACTThat BeyoncéWas So Unbothered About Michelle Falling” Unbothered examples 6310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 64. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 64
  • 65. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 65
  • 66. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 66
  • 67. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 67
  • 68. 10/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham 68
  • 69. 6910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 70. 7010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 71. 7110/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 72. » In addition to finding newly emerging words, we can also map the spread of these words across space for the first time, by taking advantage of the geocoded information provided byTwitter, which consists of a longitude and latitude for each tweet Mapping newly emerging words 7210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 73. 7310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 74. 7410/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 75. 7510/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 76. 7610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 77. 7710/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 78. 7810/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 79. 7910/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 80. 8010/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 81. 8110/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 82. 8210/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 83. 8310/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 84. 8410/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 85. 8510/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 86. » By taking of advantage of big data we are thus able to investigate language in far greater detail than was previously possible, including identifying and mapping the spread of newly emerging words » Big data is therefore incredibly useful for understanding complex systems that involve very large numbers of rare events, including the lexicon of modern languages Conclusion 8610/03/2015 Jisc Digital Festival, 9-10 March 2015, ICC Birmingham
  • 87. Find out more… Contact… Jack Grieve Aston University j.grieve1@aston.ac.uk @JWGrieve