SlideShare une entreprise Scribd logo
1  sur  13
RSI Archive: our experience working with
Speech to Text and Semantic Analysis
Sarah-Haye Aziz and Lorenzo Vassallo
May 17, 2013
2
Come al solito, anche la recente
inaugurazione dell'ultima monumentale
opera di quell'eccezionale scultore che
Giacomo Manzù, vale a dire la nuova porta
del Duomo di Rotterdam avuto il sapore
di un avvenimento straordinario di
risonanza internazionale e per lavori in
corso Fabio Bonetti è riuscito ad avvicinare
l'insigne maestro bergamasco, a buon
diritto ritenuto ormai uno dei più alti
interpreti del nostro tempo, artista fra i più
grandi del secolo e non solo per la misura
del suo talento ma anche per il rigore
morale di cui è sempre stato esempio in
anni di sorta, tormentata, ispirata
attività.
Credits: Giacomo Manzù, Fabio Bonetti
Geographic Therms: Rotterdam
Themes: arte, cultura, intrattenimento
Errors
è
as
Audio Transcription
ha
Categorization
3
Outline
1. Why an automatic indexing system?
2. The project timeline
3. Two paths: system and archivists workflow overview
4. Does it work? We learned that...
5. Next steps
6. Some advices
4
Why an automatic indexing system?
RSI has a consolidated cataloguing system (CMM)
with a well-defined human workflow from 2008
RSI has plenty undocumented historical material
and no capacity to document it.
Increase (plus) the documented material adding
an automation but not substituting (vs) the archivist.
Not vs but plus!
5
Archivists and Technicians Synergy
Project timeline
DeploymentDeploymentTuningTuningAnalysis & StartupAnalysis & Startup
Workflow DesignWorkflow Design
Language ModelLanguage Model
Tv & Radio
Programmes Choice
Tv & Radio
Programmes Choice
Workflow ReviewWorkflow Review
Transcription TestTranscription Test
System TestSystem Test
6
Documenting a material: two paths
Ingestion
Catalogue
Publishing
Transcription Engine
Audio + Key frames
Semantic Engine
Audio
and Video
Key frames
Archivist
Documentation
+
Refinements
Speech
Transcription
Text +
Sequences
Categorization
Text + Sequences
Credits
SIA
Themes +
Geographical therms
Human
audio listening
and
transcription
+
Archivist
documentation
7
The two paths for the archivist
Start ?
Invoke
Indexing
Human Task
on Catalogue
Detailed documentation
Manual creation of
logical sequences
Automated
Transcription and
Categorisation
Detailed documentation
Automatic creation of
logical sequences
Publish
Doc
Level
Basic Human
Limited set of
documented metadata
High Human
with Automation
Limited set of
documented metadata
Automatic creation of
logical sequences
?
?
Human Task
on Catalogue
Yes
No
Doc
Level
High Human
Basic Human
with Automation
8
The archivist – Francesco Veri
9
Does it work? Yes! But…
Differences between Radio and TV
Background Music/Noise does not help the transcription.
Based only on silences and
without key frames, the system
creates too many sequences.
Key frames help to locate a
change of context.
Speech rhythm and pauses are different between and .
10
Next steps (1) – Capitalize Editorial Texts
Semantic Engine
Categorization CatalogueAudio
+
Editorial Texts
11
Next steps (2) – Capitalize 24h Radio Logging
24h Radio Logging
0 24
SIA
(Transcription and
Semantic Engine)
Transcription &
Categorization
Catalogue
Automatic
Cut
12
If you... some advice
Involve the Archivists
Take a different approach in Radio and TV
Choose the right Tv & Radio Programmes
13
sarah-haye.aziz@rsi.ch
lorenzo.vassallo@rsi.ch

Contenu connexe

Similaire à Presentation 17 may morning case study 2 sarahhaye aziz

Muehlberger - PrestoPrime case study 2 @EUscreen Mykonos
Muehlberger - PrestoPrime case study 2 @EUscreen MykonosMuehlberger - PrestoPrime case study 2 @EUscreen Mykonos
Muehlberger - PrestoPrime case study 2 @EUscreen Mykonos
EUscreen
 
Searching information in a collection of video-lectures
Searching information in a collection of video-lecturesSearching information in a collection of video-lectures
Searching information in a collection of video-lectures
ronchet
 
Lectures On Demand: delivering traditional lectures over the web
Lectures On Demand: delivering traditional lectures over the webLectures On Demand: delivering traditional lectures over the web
Lectures On Demand: delivering traditional lectures over the web
ronchet
 
Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02
mohankota
 

Similaire à Presentation 17 may morning case study 2 sarahhaye aziz (20)

Muehlberger - PrestoPrime case study 2 @EUscreen Mykonos
Muehlberger - PrestoPrime case study 2 @EUscreen MykonosMuehlberger - PrestoPrime case study 2 @EUscreen Mykonos
Muehlberger - PrestoPrime case study 2 @EUscreen Mykonos
 
Apache Stanbol Incubation Proposal
Apache Stanbol Incubation ProposalApache Stanbol Incubation Proposal
Apache Stanbol Incubation Proposal
 
F/LOSS in Norwegian libraries
F/LOSS in Norwegian librariesF/LOSS in Norwegian libraries
F/LOSS in Norwegian libraries
 
OSMC 2014 | Processing millions of logs with Logstash and integrating with El...
OSMC 2014 | Processing millions of logs with Logstash and integrating with El...OSMC 2014 | Processing millions of logs with Logstash and integrating with El...
OSMC 2014 | Processing millions of logs with Logstash and integrating with El...
 
Knoxbug2016
Knoxbug2016Knoxbug2016
Knoxbug2016
 
PHP is the king, nodejs is the prince and Python is the fool - Alessandro Cin...
PHP is the king, nodejs is the prince and Python is the fool - Alessandro Cin...PHP is the king, nodejs is the prince and Python is the fool - Alessandro Cin...
PHP is the king, nodejs is the prince and Python is the fool - Alessandro Cin...
 
PHP is the King, nodejs the prince and python the fool
PHP is the King, nodejs the prince and python the foolPHP is the King, nodejs the prince and python the fool
PHP is the King, nodejs the prince and python the fool
 
Searching information in a collection of video-lectures
Searching information in a collection of video-lecturesSearching information in a collection of video-lectures
Searching information in a collection of video-lectures
 
IETF remote participation via Meetecho @ WebRTC Meetup Stockholm
IETF remote participation via Meetecho @ WebRTC Meetup StockholmIETF remote participation via Meetecho @ WebRTC Meetup Stockholm
IETF remote participation via Meetecho @ WebRTC Meetup Stockholm
 
OSMC 2014: Processing millions of logs with Logstash and integrating with Ela...
OSMC 2014: Processing millions of logs with Logstash and integrating with Ela...OSMC 2014: Processing millions of logs with Logstash and integrating with Ela...
OSMC 2014: Processing millions of logs with Logstash and integrating with Ela...
 
Multilingualism ifla 2014 08
Multilingualism ifla 2014 08Multilingualism ifla 2014 08
Multilingualism ifla 2014 08
 
WebRTC, RED and Janus @ ClueCon21
WebRTC, RED and Janus @ ClueCon21WebRTC, RED and Janus @ ClueCon21
WebRTC, RED and Janus @ ClueCon21
 
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
 
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
 
Sem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, Sweden
Sem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, SwedenSem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, Sweden
Sem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, Sweden
 
Lectures On Demand: delivering traditional lectures over the web
Lectures On Demand: delivering traditional lectures over the webLectures On Demand: delivering traditional lectures over the web
Lectures On Demand: delivering traditional lectures over the web
 
DepositMOre: Applying tools to increase full-text content in institutional re...
DepositMOre: Applying tools to increase full-text content in institutional re...DepositMOre: Applying tools to increase full-text content in institutional re...
DepositMOre: Applying tools to increase full-text content in institutional re...
 
Visualizing Developer Interactions [VISSOFT2014]
Visualizing Developer Interactions [VISSOFT2014]Visualizing Developer Interactions [VISSOFT2014]
Visualizing Developer Interactions [VISSOFT2014]
 
Mini Project- Audio Enhancement
Mini Project- Audio EnhancementMini Project- Audio Enhancement
Mini Project- Audio Enhancement
 
Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02Miniproject audioenhancement-100223094301-phpapp02
Miniproject audioenhancement-100223094301-phpapp02
 

Plus de Nederlands Instituut voor Beeld en Geluid

Plus de Nederlands Instituut voor Beeld en Geluid (11)

Presentation 17 may morning keynote cees snoek
Presentation 17 may morning keynote cees snoekPresentation 17 may morning keynote cees snoek
Presentation 17 may morning keynote cees snoek
 
Presentation 17 may afternoon casestudy 2 liam wylie
Presentation 17 may afternoon casestudy 2 liam wyliePresentation 17 may afternoon casestudy 2 liam wylie
Presentation 17 may afternoon casestudy 2 liam wylie
 
Presentation 16 may casestudy 2 evalisgreen kaisa unander
Presentation 16 may casestudy 2 evalisgreen kaisa unanderPresentation 16 may casestudy 2 evalisgreen kaisa unander
Presentation 16 may casestudy 2 evalisgreen kaisa unander
 
Presentation 16 may morning semantic linking rutger verhoeven
Presentation 16 may morning semantic linking rutger verhoevenPresentation 16 may morning semantic linking rutger verhoeven
Presentation 16 may morning semantic linking rutger verhoeven
 
Presentation 16 may morning casestudy 2 xavier jacques jourion
Presentation 16 may morning casestudy 2 xavier jacques jourionPresentation 16 may morning casestudy 2 xavier jacques jourion
Presentation 16 may morning casestudy 2 xavier jacques jourion
 
Presentation 16 may morning casestudy 1 maarten de rijke
Presentation 16 may morning casestudy 1 maarten de rijkePresentation 16 may morning casestudy 1 maarten de rijke
Presentation 16 may morning casestudy 1 maarten de rijke
 
Presentation 16 may morning keynote seth van hooland
Presentation 16 may morning keynote seth van hoolandPresentation 16 may morning keynote seth van hooland
Presentation 16 may morning keynote seth van hooland
 
Presentation 16 may keynote karin bredenberg
Presentation 16 may keynote karin bredenbergPresentation 16 may keynote karin bredenberg
Presentation 16 may keynote karin bredenberg
 
Presentation 16 may casestudy daniel steinmeier
Presentation 16 may casestudy daniel steinmeierPresentation 16 may casestudy daniel steinmeier
Presentation 16 may casestudy daniel steinmeier
 
Presentation 16 may casestudy 2 evalisgreen kaisa unander
Presentation 16 may casestudy 2 evalisgreen kaisa unanderPresentation 16 may casestudy 2 evalisgreen kaisa unander
Presentation 16 may casestudy 2 evalisgreen kaisa unander
 
Presentation 16 may archive achievements awards tom de smet
Presentation 16 may archive achievements awards tom de smetPresentation 16 may archive achievements awards tom de smet
Presentation 16 may archive achievements awards tom de smet
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Presentation 17 may morning case study 2 sarahhaye aziz

  • 1. RSI Archive: our experience working with Speech to Text and Semantic Analysis Sarah-Haye Aziz and Lorenzo Vassallo May 17, 2013
  • 2. 2 Come al solito, anche la recente inaugurazione dell'ultima monumentale opera di quell'eccezionale scultore che Giacomo Manzù, vale a dire la nuova porta del Duomo di Rotterdam avuto il sapore di un avvenimento straordinario di risonanza internazionale e per lavori in corso Fabio Bonetti è riuscito ad avvicinare l'insigne maestro bergamasco, a buon diritto ritenuto ormai uno dei più alti interpreti del nostro tempo, artista fra i più grandi del secolo e non solo per la misura del suo talento ma anche per il rigore morale di cui è sempre stato esempio in anni di sorta, tormentata, ispirata attività. Credits: Giacomo Manzù, Fabio Bonetti Geographic Therms: Rotterdam Themes: arte, cultura, intrattenimento Errors è as Audio Transcription ha Categorization
  • 3. 3 Outline 1. Why an automatic indexing system? 2. The project timeline 3. Two paths: system and archivists workflow overview 4. Does it work? We learned that... 5. Next steps 6. Some advices
  • 4. 4 Why an automatic indexing system? RSI has a consolidated cataloguing system (CMM) with a well-defined human workflow from 2008 RSI has plenty undocumented historical material and no capacity to document it. Increase (plus) the documented material adding an automation but not substituting (vs) the archivist. Not vs but plus!
  • 5. 5 Archivists and Technicians Synergy Project timeline DeploymentDeploymentTuningTuningAnalysis & StartupAnalysis & Startup Workflow DesignWorkflow Design Language ModelLanguage Model Tv & Radio Programmes Choice Tv & Radio Programmes Choice Workflow ReviewWorkflow Review Transcription TestTranscription Test System TestSystem Test
  • 6. 6 Documenting a material: two paths Ingestion Catalogue Publishing Transcription Engine Audio + Key frames Semantic Engine Audio and Video Key frames Archivist Documentation + Refinements Speech Transcription Text + Sequences Categorization Text + Sequences Credits SIA Themes + Geographical therms Human audio listening and transcription + Archivist documentation
  • 7. 7 The two paths for the archivist Start ? Invoke Indexing Human Task on Catalogue Detailed documentation Manual creation of logical sequences Automated Transcription and Categorisation Detailed documentation Automatic creation of logical sequences Publish Doc Level Basic Human Limited set of documented metadata High Human with Automation Limited set of documented metadata Automatic creation of logical sequences ? ? Human Task on Catalogue Yes No Doc Level High Human Basic Human with Automation
  • 8. 8 The archivist – Francesco Veri
  • 9. 9 Does it work? Yes! But… Differences between Radio and TV Background Music/Noise does not help the transcription. Based only on silences and without key frames, the system creates too many sequences. Key frames help to locate a change of context. Speech rhythm and pauses are different between and .
  • 10. 10 Next steps (1) – Capitalize Editorial Texts Semantic Engine Categorization CatalogueAudio + Editorial Texts
  • 11. 11 Next steps (2) – Capitalize 24h Radio Logging 24h Radio Logging 0 24 SIA (Transcription and Semantic Engine) Transcription & Categorization Catalogue Automatic Cut
  • 12. 12 If you... some advice Involve the Archivists Take a different approach in Radio and TV Choose the right Tv & Radio Programmes