Exploding information
• Recent studies show that most stored data is in the form of multimedia.
• The large volume of multimedia data makes it difficult to handle manually.
• An automatic method is needed to organize and use it appropriately.
Figure note: One hour of TV broadcast across the world amounts to 100 petabytes.
Source: http://www.sims.berkeley.edu/research/projects/how-much-info/summary.html#tv
Audio indexing
● Reasons for choosing audio data for study
    – Easier to process
    – Contains significant information
● Indexing – a method of organizing data for further search and retrieval. Example – book indexing
● Audio indexing – indexing non-text data using its audio part
● Audio classification – an important step in building an audio indexing system
[Figure: An audio indexing system]
Source: J. Makhoul et al., “Speech and language technologies for audio indexing and retrieval”, in Proc. of the IEEE, 88(8), pp. 1338-1353, 2000.
Levels of information in audio signal
● Subsegmental information
    – Related to excitation source characteristics
● Segmental information
    – Related to system / physiological characteristics
● Suprasegmental information
    – Related to behavioural characteristics of audio
Missing component in existing approaches and its importance
● Features derived based on spectral analysis
    – Carry significant properties of audio data at the segmental level
    – Miss information present at the subsegmental and suprasegmental levels
● Perceptually significant information in the linear prediction (LP) residual of the signal
    – Complementary in nature to the spectral information
    – Suprasegmental information is not being used in current systems
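The LP residual referred to above is the error that remains after each sample is predicted as a linear combination of the previous p samples. The following is a minimal sketch, not the authors' implementation: it assumes the autocorrelation method applied to the whole clip and a default prediction order of 10 (the slides state neither the order nor any framing), and the function names `lp_coefficients` and `lp_residual` are hypothetical.

```python
import numpy as np

def lp_coefficients(signal, order):
    """Autocorrelation-method LP: solve R a = r for the predictor coefficients."""
    s = np.asarray(signal, dtype=float)
    # Autocorrelation of the signal at lags 0..order
    r = np.array([np.dot(s[: len(s) - k], s[k:]) for k in range(order + 1)])
    # Symmetric Toeplitz matrix with entries R[i, j] = r[|i - j|]
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    return np.linalg.solve(r[idx], r[1:])

def lp_residual(signal, order=10):
    """Residual e[n] = s[n] - sum_k a_k * s[n - k] (order is an assumed value)."""
    s = np.asarray(signal, dtype=float)
    a = lp_coefficients(s, order)
    predicted = np.zeros_like(s)
    for k, a_k in enumerate(a, start=1):
        predicted[k:] += a_k * s[:-k]
    return s - predicted
```

In practice the analysis is usually done on short overlapping frames rather than the whole clip; that step is omitted here to keep the sketch short.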
EXPLORING SUPRASEGMENTAL FEATURES USING LP RESIDUAL FOR AUDIO CLIP CLASSIFICATION

Anvita Bajpai
anvita@mailcity.com
Applied Research Group, Satyam Computer Services Ltd., Bangalore

B. Yegnanarayana
yegna@iiit.ac.in
International Institute of Information Technology, Hyderabad
Audio clip classification
● Closed set problem
● To classify a given audio clip into one of the following predefined categories
    – Advertisement, Cartoon, Cricket, Football, News
● Issues in audio clip classification
    – Feature extraction
        ● Effective representation of data to capture all significant properties of audio for the task
        ● Robust under various conditions
    – Classification
        ● Formulation of a distance measure and rule/models
            – Training models for the task
            – Testing – actual classification task
            – Combining evidence from different systems
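The classification steps listed above (train models, test with a distance measure, combine evidence from different systems) can be illustrated with a toy sketch. The slides do not specify the distance measure, the models, or the combination rule, so everything below is an assumption chosen for illustration: class centroids as models, Euclidean distance, and a weighted sum of normalized scores. All function names are hypothetical.

```python
import math

def train_centroids(examples):
    """examples: dict mapping class name -> list of equal-length feature vectors."""
    centroids = {}
    for label, vectors in examples.items():
        dim = len(vectors[0])
        centroids[label] = [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]
    return centroids

def scores(centroids, x):
    """Convert Euclidean distances to similarity scores that sum to 1."""
    sims = {label: math.exp(-math.dist(c, x)) for label, c in centroids.items()}
    total = sum(sims.values())
    return {label: s / total for label, s in sims.items()}

def combine(score_dicts, weights):
    """Weighted sum of per-system scores; returns (winning class, combined scores)."""
    combined = {
        label: sum(w * s[label] for s, w in zip(score_dicts, weights))
        for label in score_dicts[0]
    }
    return max(combined, key=combined.get), combined
```

Because the problem is closed-set, the clip is always assigned to the best-scoring of the predefined categories; there is no reject option in this sketch.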
Presence of audio-specific information in LP residual
[Figure: original signal (Aa1.wav) and its LP residual (Aa_res.wav)]
Suprasegmental information in Hilbert envelope of LP residual of audio signal
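The Hilbert envelope of a signal is the magnitude of its analytic signal, that is, sqrt(r[n]^2 + r_h[n]^2), where r_h is the Hilbert transform of r. A minimal sketch of that computation using only NumPy's FFT follows; the function name `hilbert_envelope` is hypothetical, and the input can be any real sequence, such as an LP residual.

```python
import numpy as np

def hilbert_envelope(x):
    """Magnitude of the analytic signal, computed in the frequency domain."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # zero out negative frequencies.
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1 : n // 2] = 2.0
    else:
        h[1 : (n + 1) // 2] = 2.0
    analytic = np.fft.ifft(spectrum * h)
    return np.abs(analytic)
```

The envelope removes the rapid sign changes of the residual and keeps its slowly varying amplitude, which is what makes longer-term structure easier to see.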
Suprasegmental information in LP residual for audio clip classification
[Figure: Autocorrelation samples of Hilbert envelope of LP residual for 5 audio classes]
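The autocorrelation samples referred to here can be computed directly from an envelope sequence. The sketch below is one plausible formulation, not necessarily the one used for the figure: it assumes the mean is removed first and that the sequence is normalized by its lag-0 value, and the name `normalized_autocorrelation` is hypothetical.

```python
import numpy as np

def normalized_autocorrelation(envelope, max_lag):
    """Autocorrelation at lags 0..max_lag, scaled so the lag-0 value is 1."""
    e = np.asarray(envelope, dtype=float)
    e = e - e.mean()  # assumption: remove the DC offset of the envelope
    r = np.array([np.dot(e[: len(e) - k], e[k:]) for k in range(max_lag + 1)])
    return r / r[0]
```

A periodic envelope produces a peak at the lag equal to its period, so the location and height of the peaks summarize regularity over spans longer than a single frame.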
Statistics of autocorrelation sequence
Correction – the statistics shown here are for peaks of the autocorrelation sequence of the Hilbert envelope (HE), not of the LP residual.
Statistics of autocorrelation sequence (continued)
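The slides do not say which statistics of the autocorrelation peaks were computed. As a hedged illustration only, the sketch below picks the largest local maximum beyond a minimum lag in each autocorrelation sequence and reports the mean and standard deviation of the peak strength and the mean peak lag. The function names, the `min_lag` parameter, and the choice of statistics are all assumptions.

```python
import statistics

def largest_peak(acf, min_lag):
    """Return (lag, value) of the largest local maximum at or beyond min_lag, or None."""
    best = None
    for k in range(max(min_lag, 1), len(acf) - 1):
        if acf[k] > acf[k - 1] and acf[k] >= acf[k + 1]:
            if best is None or acf[k] > best[1]:
                best = (k, acf[k])
    return best

def peak_statistics(acf_frames, min_lag):
    """Summarize peak strength and lag over a list of autocorrelation sequences."""
    peaks = [largest_peak(a, min_lag) for a in acf_frames]
    peaks = [p for p in peaks if p is not None]
    strengths = [v for _, v in peaks]
    lags = [k for k, _ in peaks]
    return {
        "mean_strength": statistics.mean(strengths),
        "std_strength": statistics.pstdev(strengths),
        "mean_lag": statistics.mean(lags),
    }
```

Computed per class, such summaries could serve as one source of evidence alongside spectral features; `peak_statistics` raises `statistics.StatisticsError` if no sequence contains a peak.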
Summary & Conclusions
● Need to organize multimedia data because of its large volume and need in real-life applications
● Shown presence of audio-specific suprasegmental information in LP residual
● Need to explore methods to use the suprasegmental information as additional evidence for the audio clip classification task

Anvita Wisp 2007 Presentation
