Media Suite: Unlocking Archives for Mixed
Media Scholarly Research
Roeland Ordelman - Technical coordinator CLARIAH Media Suite
Netherlands Institute for Sound and Vision / University of Twente
The Netherlands
Media Studies
Focus on both “institutional”
data collections and collections
created by scholars
Which data are in the Media Suite V3?
Radio & Television (1.88M items) Newspapers (60M pages)
Film (1129 films) Oral History (2744 interviews)
MULTIMEDIA
Which data are in the Media Suite V3?
MIXED MEDIA
RESEARCH PILOTS
Cross-Medial Analysis of WW2
Eyewitness Testimonies
Cross-media research of public debates
on drugs and regulation
Me and Myself: Tracing first person in
documentary history in AV-collections
Annotating EYE’s Jean Desmet Collection:
Towards Mixed Media Analysis in Digital Media History
Narrativizing Disruption: How exploratory search can support
media researchers to interpret ‘disruptive’ media events as lucid narratives
Remediation in Sports News
clariah.nl/projecten/research-pilots
Media Suite: enabling Mixed Media Scholarly Research
with Multi-media Data in a Sustainable Infrastructure
CLARIAH Centers
Common Lab Research Infrastructure for the Arts and Humanities
SUSTAINABLE
✓ Available after the project
✓ Maintenance and support
✓ Updates and upgrades
Architecture principles
1. Centers are responsible for data quality and for facilitating
access to data
2. Authorized access using a federated authentication
mechanism
3. Data is connected to a shared “workspace” (VRE) for
various forms of analysis …
4. … that provides exports of data in various formats for
using tools outside the closed environment
5. The Media Suite provides the interface on the underlying
architecture
1. Centers are responsible for data
quality and facilitate access to data
REGISTER COLLECTION
HARVEST COLLECTION METADATA
SEARCH COLLECTION
Collection Owner
Media Suite
Scholar
CKAN: web-based
open source management
system for the storage and
distribution of open data
Open Archives Initiative (OAI)
ISSUE:
Persistent link to
source file
ISSUE:
IPR (e.g., no
subtitles)
Example: DANS registers set Oral History
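The harvesting step can be illustrated with a minimal sketch. It assumes the collection owner exposes metadata as OAI-PMH `ListRecords` responses with Dublin Core records (the slides mention OAI only in general terms). It parses a hand-written sample response instead of contacting a real endpoint. The identifiers, titles and the function name `parse_records` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical, hand-written ListRecords response (not real DANS data).
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:example.org:interview-1</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example interview one</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header>
        <identifier>oai:example.org:interview-2</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example interview two</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def parse_records(xml_text):
    """Return one dict per harvested record: its identifier and its titles."""
    root = ET.fromstring(xml_text.strip())
    records = []
    for record in root.findall("./oai:ListRecords/oai:record", NS):
        identifier = record.findtext("./oai:header/oai:identifier", namespaces=NS)
        titles = [el.text for el in record.findall(".//dc:title", NS)]
        records.append({"identifier": identifier, "titles": titles})
    return records


if __name__ == "__main__":
    for rec in parse_records(SAMPLE_RESPONSE):
        print(rec["identifier"], rec["titles"])
```

In practice the harvested fields would still need the manual description work that the slides call "metadata archeology"; the sketch only shows the mechanical part.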
“METADATA ARCHEOLOGY”
Manual effort to describe metadata fields
ISSUE:
Resources
manual effort
Tools for inspection of metadata
2. Authorized access using a federated
authentication mechanism
Secure play-out and viewing
ISSUE:
Not always
available
Federated login
3. Data is connected to a shared
“workspace” (VRE) for analysis
ISSUE:
Currently semi-shared
WORKSPACE
✓ Create virtual personal mixed media collections
✓ Create projects
✓ Store annotations
✓ Upload personal collections
✓ Advanced data analysis (Jupyter Notebooks)
✓ Advanced data processing
✓ Export annotations
Data analysis: Jupyter Notebooks or NLP
ISSUE:
Robust pipelines
Write your own (Python)
code to analyze the data
in the Media Suite
ISSUE:
expertise
Example
output
Jupyter
Notebook
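As an illustration of the kind of Python a scholar might write in a notebook, the sketch below counts per year how many records mention a search term (a simple distant-reading view). The records, field names and the function `hits_per_year` are hypothetical and do not reflect the actual Media Suite data model or API.

```python
from collections import Counter

# Hypothetical records standing in for items retrieved from a collection.
RECORDS = [
    {"year": 1968, "text": "Debate on drugs and regulation in parliament"},
    {"year": 1972, "text": "New regulation announced"},
    {"year": 1972, "text": "Sports news of the week"},
    {"year": 1985, "text": "Drugs policy and regulation revisited"},
]


def hits_per_year(records, term):
    """Count, per year, the records whose text contains `term` as a word."""
    term = term.lower()
    counts = Counter()
    for rec in records:
        if term in rec["text"].lower().split():
            counts[rec["year"]] += 1
    # Sort by year so the result reads as a timeline.
    return dict(sorted(counts.items()))


if __name__ == "__main__":
    print(hits_per_year(RECORDS, "regulation"))
```

Word matching here is a plain whitespace split; a real analysis would likely need tokenization and normalization suited to the language of the collection.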
Auto Metadata Extraction –
Large scale speech recognition
350K hours processed
until now
Poster slam 11:00 – 11:30 tomorrow
4. Provide exports of data for tools outside
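A minimal sketch of what such an export could look like, assuming annotations are simple time-coded labels and the target format is CSV. The field names (`item_id`, `start_s`, `end_s`, `label`) and the function `annotations_to_csv` are hypothetical; the actual export formats of the Media Suite are not specified in the slides.

```python
import csv
import io

# Hypothetical annotations made on time-based media items.
ANNOTATIONS = [
    {"item_id": "item-001", "start_s": 12.0, "end_s": 30.5, "label": "eyewitness"},
    {"item_id": "item-002", "start_s": 0.0, "end_s": 8.0, "label": "first person"},
]


def annotations_to_csv(annotations):
    """Serialize annotations to CSV text that spreadsheet or statistics tools can read."""
    fields = ["item_id", "start_s", "end_s", "label"]
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(annotations)
    return buffer.getvalue()


if __name__ == "__main__":
    print(annotations_to_csv(ANNOTATIONS))
```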
Media Suite is just an
interface on the
underlying
infrastructure…
Speech Suite
Media Suite: Unlocking Archives for Mixed Media
Scholarly Research
Co-development
Community
building
User stories!
Short iterations
(sprints) of 2 weeks:
development &
testing
• Information Specialist
• Experienced DH Researcher
Liaisons part of
development team:
Workshops, hack-a-thons, data-a-thons
Discussing issues on Gitter
Tracking issues on GitHub
SCHOLARLY PRIMITIVES
Unsworth, 2000
Blanke and Hedges, 2013
“Unlock data”
Distant reading
Close reading
1. Discovery & Inspection of data sets hidden in archives
2. Discovery of items in large archival data sets
3. Accessing items (play, view) from restricted data sets
4. Discovery of segments in time-based media
5. Relating and comparing data on the segment level
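Steps 4 and 5 can be sketched as a search over time-coded transcript segments, such as those produced by speech recognition. The `Segment` structure, the sample data and the helper names are hypothetical; this is a sketch under those assumptions, not the Media Suite implementation.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    item_id: str    # identifier of the media item the segment belongs to
    start_s: float  # segment start, in seconds
    end_s: float    # segment end, in seconds
    text: str       # transcript text for this segment


# Hypothetical segments from two different items.
SEGMENTS = [
    Segment("interview-1", 0.0, 14.2, "We lived near the harbour before the war"),
    Segment("interview-1", 14.2, 31.0, "The bombing started early in the morning"),
    Segment("broadcast-7", 95.0, 110.5, "Eyewitnesses recall the bombing of the harbour"),
]


def find_segments(segments, term):
    """Return the segments whose transcript mentions `term` (case-insensitive)."""
    term = term.lower()
    return [s for s in segments if term in s.text.lower()]


def timecode(seconds):
    """Format a position in seconds as MM:SS for jumping into the player."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


if __name__ == "__main__":
    # Segments from different items can then be compared side by side.
    for seg in find_segments(SEGMENTS, "bombing"):
        print(seg.item_id, timecode(seg.start_s), seg.text)
```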
Search Oral History in Media Suite
Project
Search
Bookmark
Save
Bookmark
Save
Query
Bookmark view View Source
Annotation view View Source Alignment
ISSUE:
Complex
interface
Private collection Apply enrichment or a “pipeline”
To appear:
Content-based Cross-media
Recommendations
1. Registered collections: persistent link (data management)
2. Registered collections: rights don’t permit (legal)
3. Metadata archeology: manual resources (funding)
4. Play-out/view: not always available (funding)
5. Shared workspace: semi-shared (infra development)
6. Advanced analysis: expertise scholars (training)
7. Advanced analysis: robust pipelines (benchmarking)
8. Workspace: complex interface (interaction design)
Issues/investments
Main contribution: enabling mixed media scholarly
research for “institutional” multimedia collections
Bringing the Tools to the Data: in progress but already
useful:
✓ Unlocking the data, enabling distant/close reading
✓ Supporting the scholarly primitives
✓ Providing a workspace for saving annotations, creating
collections and options for (advanced) analysis
Summary…
Research coordination: Julia Noordegraaf @jjnoordegraaf
Technical coordination: Roeland Ordelman @roelandordelman
DEMO & QUESTIONS AT THE BAZAR
mediasuite.clariah.nl

Student profile product demonstration on grades, ability, well-being and mind...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 

Media Suite: Unlocking Archives for Mixed Media Scholarly Research

  • 1. Media Suite: Unlocking Archives for Mixed Media Scholarly Research. Roeland Ordelman, Technical coordinator CLARIAH Media Suite, Netherlands Institute for Sound and Vision / University of Twente, The Netherlands
  • 2. Media Studies Focus on both “institutional” data collections and collections created by scholars
  • 3. What data is in Media Suite V3? Radio & Television (1.88M items); Newspapers (60M pages); Film (1129 films); Oral History (2744 interviews). MULTIMEDIA
  • 4. What data is in Media Suite V3? MIXED MEDIA
  • 5. RESEARCH PILOTS: Cross-Medial Analysis of WW2 Eyewitness Testimonies; Cross-media research of public debates on drugs and regulation; Me and Myself: Tracing first person in documentary history in AV-collections; Annotating EYE’s Jean Desmet Collection: Towards Mixed Media Analysis in Digital Media History; Narrativizing Disruption: How exploratory search can support media researchers to interpret ‘disruptive’ media events as lucid narratives; Remediation in Sports News. clariah.nl/projecten/research-pilots
  • 6. Media Suite: enabling Mixed Media Scholarly Research with Multi-media Data in a Sustainable Infrastructure
  • 7. CLARIAH Centers. Common Lab Research Infrastructure for the Arts and Humanities. SUSTAINABLE: ✓ Available after the project ✓ Maintenance and support ✓ Updates and upgrades
  • 8.
  • 9. Architecture principles: 1. Centers are responsible for data quality and for facilitating access to data 2. Authorized access using a federated authentication mechanism 3. Data is connected to a shared “workspace” (VRE) for various forms of analysis … 4. … that provides exports of data in various formats for use with tools outside the closed environment 5. The Media Suite provides the interface on the underlying architecture
  • 10. 1. Centers are responsible for data quality and facilitate access to data
  • 11. REGISTER COLLECTION (Collection Owner), HARVEST COLLECTION METADATA (Media Suite), SEARCH COLLECTION (Scholar). CKAN: web-based open source management system for the storage and distribution of open data. Open Archives Initiative (OAI). ISSUE: Persistent link to source file. ISSUE: IPR (e.g., no subtitles)
  • 12. Example: DANS registers set Oral History
  • 13. “METADATA ARCHEOLOGY”: Manual effort to describe metadata fields. ISSUE: Resources for manual effort
  • 14. Tools for inspection of metadata
  • 15.
  • 16. 2. Authorized access using a federated authentication mechanism
  • 17.
  • 18. Secure play-out and viewing ISSUE: Not always available
  • 20. 3. Data is connected to a shared “workspace” (VRE) for analysis. ISSUE: Currently semi-shared
  • 21. WORKSPACE: ✓ Create virtual personal mixed media collections ✓ Create projects ✓ Stores annotations ✓ Upload personal collections ✓ Advanced Data Analysis (Jupyter Notebooks) ✓ Advanced Data processing ✓ Export annotations
  • 22. Data analysis: Jupyter Notebooks or NLP. ISSUE: Robust pipelines
  • 23. Write your own (Python) code to analyze the data in the Media Suite. ISSUE: Expertise
  • 25. Auto Metadata Extraction – Large scale speech recognition 350K hours processed until now
  • 26. Poster slam 11:00 – 11:30 tomorrow
  • 27. 4. Provide exports of data for tools outside
  • 28. Media Suite is just an interface on the underlying infrastructure… Speech Suite
  • 29. Media Suite: Unlocking Archives for Mixed Media Scholarly Research
  • 30. Co-development. Community building. User stories! Short iterations (sprints) of 2 weeks: development & testing. Liaisons part of development team: • Information Specialist • Experienced DH Researcher. Workshops, hackathons, data-a-thons
  • 35. 1. Discovery & Inspection of data sets hidden in archives 2. Discovery of items in large archival data sets 3. Accessing items (play, view) from restricted data sets 4. Discovery of segments in time-based media 5. Relating and comparing data on the segment level. Distant reading / Close reading
  • 36. Search Oral History in Media Suite
  • 37.
  • 38.
  • 39.
  • 40.
  • 42.
  • 44. Annotation view. View Source Alignment. ISSUE: Complex interface
  • 45. Private collection Apply enrichment or a “pipeline”
  • 47. Issues/investments: 1. Registered collections: persistent link (data management) 2. Registered collections: rights don’t permit (legal) 3. Metadata archeology: manual resources (funding) 4. Play-out/view: not always available (funding) 5. Shared workspace: semi-shared (infra development) 6. Advanced analysis: expertise scholars (training) 7. Advanced analysis: robust pipelines (benchmarking) 8. Workspace: complex interface (interaction design)
  • 48. Summary… Main contribution: enabling mixed media scholarly research for “institutional” multimedia collections. Bringing the Tools to the Data: in progress but already useful: ✓ Unlocking the data, enabling distant/close reading ✓ Supporting the scholarly primitives ✓ Providing a workspace for saving annotations, creating collections and options for (advanced) analysis
  • 49. Research coordination: Julia Noordegraaf @jjnoordegraaf Technical coordination: Roeland Ordelman @roelandordelman DEMO & QUESTIONS AT THE BAZAR mediasuite.clariah.nl
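
Slide 11 describes harvesting collection metadata through the Open Archives Initiative (OAI) interface. The sketch below illustrates what one harvesting step could look like with the OAI-PMH `ListRecords` verb. It is not the Media Suite's actual harvester: the endpoint `https://example.org/oai`, the set name `oral-history`, the sample response, and both function names are hypothetical, and the response is parsed from an inline string so the example runs without network access.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical response for illustration only; a real harvest would fetch it.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:interview-1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Hypothetical interview</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
""".strip()


def build_list_records_url(base_url, set_spec, metadata_prefix="oai_dc"):
    """Build an OAI-PMH ListRecords request URL for one registered set."""
    query = urlencode(
        {"verb": "ListRecords", "metadataPrefix": metadata_prefix, "set": set_spec}
    )
    return f"{base_url}?{query}"


def parse_records(xml_text):
    """Extract the identifier and Dublin Core titles from each harvested record."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        )
        titles = [el.text for el in record.iterfind(".//dc:title", NS) if el.text]
        records.append({"identifier": identifier, "titles": titles})
    return records


if __name__ == "__main__":
    print(build_list_records_url("https://example.org/oai", "oral-history"))
    print(parse_records(SAMPLE_RESPONSE))
```

A complete harvester would also follow the `resumptionToken` that OAI-PMH uses for paging through large sets; that step is omitted here to keep the sketch short.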