SlideShare a Scribd company logo
1 of 24
Voice morphing

              Presented
                  By
          H.Mohammed.Sabir
             09AT1A0461

              Supervised
                  By
             Shreedhar Sir
SEMINAR OUTLINES

What   It is?
Need   of Voice Morphing
Description      the Morphing.
Technical       details of Morphing.
Application       areas.
What is Voice Morphing ??


   Voice morphing is a technique for modifying a (source)
    speaker's speech to sound as if it were spoken by a
    different (target) speaker.


   In Simpler terms it is being able to change the speech of
    one speaker to that of another speaker.


   Technology developed at the Los Alamos National
    Laboratory in New Mexico, USA by George Papcun


   Applications for Voice Morphing range from recreational
    ones to security ones.
What it actually performs ?
   It is a technique to modify a source speaker's
    speech to sound as if it was spoken by a target
    speaker.
   Voice morphing enables speech patterns to be
    cloned
   And an accurate copy of a person's voice can
    be made that can wishes to say, anything in the
    voice of someone else.
Need of voice morphing

   Text To Speech (TTS)
   In public speech systems
   For special effects ( just like video or image morphing is
    done ).
   To diminish Ethnical barriers.
How to Morph Voice ??


   We need to effectively change the pitch from that of a male
    speaker to that of a female speaker. If we reminisce the
    excitation signal has information about the speaker.

   We find the LPC coefficients for the Source and Target Signals
    and using these coefficients we are going to interpolate
    between the two Signals.

   We get the New LPC (linear predictive coding) coefficients
    using the formula

         new lpc coeff = [const*(lpc source) + (1-const)(lpc
    target)]

   0 <= const <= 1

                                                                     …
How to Morph Speech ?? (contd…)



    The pitch of a female speaker will be close to twice that of
     the male speaker. In our example the pitch of the male
     speaker is 141Hz and that of the female speaker is 210Hz.


    So we need to develop some time stretching algorithm so
     that we can implement pitch shifting. We obtain the residue
     of the source signal and stretch it according to the value of
     the const. The const indicates what is the position of morphed
     signal in between the source and target.


    For example if const = 0.2 then the morphed signal will be
     closer in pitch to the source signal and a value of 0.8 for const
     will result in a pitch that is closer to the target signal.
How do we shift the Pitch ??
   We break the residue signal into small windows and introduce fade in
    and fade out for each block. We recombine everything to form the pitch
    shifted signal. Based on the alpha we can time stretch the residue
    according to our requirements.




    How do we Morph finally ??

•   We now have the pitch shifted residue signal and the new
      LPC coefficients. We should resample the pitch shifted
      signal so that it is played at a faster rate. [Remember
      when we pitch shift then the residue will last longer]. If
      we inverse filter the resampled pitch shifted residue then
      we can effect morphing.
Block Diagram
Time Domain Plots of Source and Target featuring the Pitch
Matching and Warping

   DTW(Dynamic Time Warping)


    - Dynamic Time Warping (DTW) is used to
    find the best match between the pitch of
    the two sounds.
Signal Re-Estimation

   Loss during Signal re-estimation


    -Due to signals being transformation into the
    cepstral domain, a magnitude function is
    used. This results in a loss of phase
    information in the representation of the
    data.
Limitations
 
Lots   of normalizing problems.
Some     applications require extensive sound libraries.
Different   languages require different phonetics.
It   is very seldom complete.
Advantages

   Allows speech model to be duplicated and an exact
    copy of a person’s voice.


   Powerful combat zone weapon.
Disadvantages

   Use to pull out the useful information.


   It hides the actual identity of the user.
Conclusion
   The approach we have adopted separates the sounds into two
    forms:

    - Spectral   envelope information
    - Pitch and voicing information.
   Dynamic Time Warping
    - Aligns the sounds with respect to their pitches.
   Signal re-estimation algorithm.
    - Frames are converted back into a time domain
    waveform.
Application Areas
   Fake telephone conversations as evidence in courts of
    law.


   Powerful battlefield weapon.

    - Provide
            fake orders to the enemy's troops,
    appearing to come from their own
    commanders.
Future Scope
   Extending the functionality of tool.
    - Create a powerful and flexible morphing
    tool.

   Increased user interaction.
    - Graphical User Interface could be
    designed and integrated to make the
    package more ‘user-friendly’.
BIBLIOGRAPHY:
• Ye, H. and S. Young (2003). "Perceptually Weighted Linear
  Transformations for Voice Conversion". Eurospeech 2003,
  Geneva. 
• Ye, H. and S. Young (2004). "High Quality Voice Morphing".
  Int Conference Acoustics Speech and Signal Processing,
  Montreal, Canada. 
• High quality Voice Morphing Hui Yeand Steve Young.
• Quality-enhanced Voice Morphing
Thank you!!!
Questions??

More Related Content

What's hot

Voice Morphing
Voice MorphingVoice Morphing
Voice MorphingSayyed Z
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technologyJeet Das
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technologySrijanKumar18
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data miningJimit Rupani
 
Silent sound technology final report
Silent sound technology final reportSilent sound technology final report
Silent sound technology final reportLohit Dalal
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Silent sound technology NEW
Silent sound technology NEW Silent sound technology NEW
Silent sound technology NEW Neha Tyagi
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By MatlabAnkit Gujrati
 
silent sound new by RAJ NIRANJAN
silent sound new by RAJ NIRANJANsilent sound new by RAJ NIRANJAN
silent sound new by RAJ NIRANJANRaj Niranjan
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Silentsound documentation
Silentsound documentationSilentsound documentation
Silentsound documentationRaj Niranjan
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition systemavinash raibole
 
Abstract Silent Sound Technology
Abstract   Silent Sound TechnologyAbstract   Silent Sound Technology
Abstract Silent Sound Technologyvishnu murthy
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionAhmed Moawad
 
silent sound technology
silent sound technologysilent sound technology
silent sound technologykamesh0007
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySrijanKumar18
 

What's hot (20)

Voice Morphing
Voice MorphingVoice Morphing
Voice Morphing
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
Silent Sound Technology
Silent Sound TechnologySilent Sound Technology
Silent Sound Technology
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Silent sound technology final report
Silent sound technology final reportSilent sound technology final report
Silent sound technology final report
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Silent sound technology NEW
Silent sound technology NEW Silent sound technology NEW
Silent sound technology NEW
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
silent sound new by RAJ NIRANJAN
silent sound new by RAJ NIRANJANsilent sound new by RAJ NIRANJAN
silent sound new by RAJ NIRANJAN
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Silentsound documentation
Silentsound documentationSilentsound documentation
Silentsound documentation
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition system
 
Abstract Silent Sound Technology
Abstract   Silent Sound TechnologyAbstract   Silent Sound Technology
Abstract Silent Sound Technology
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
silent sound technology
silent sound technologysilent sound technology
silent sound technology
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 

Viewers also liked

Viewers also liked (16)

FINAL REVIEW
FINAL REVIEWFINAL REVIEW
FINAL REVIEW
 
Vlsi technology-dinesh
Vlsi technology-dineshVlsi technology-dinesh
Vlsi technology-dinesh
 
brain chip technology
brain chip technologybrain chip technology
brain chip technology
 
VLSI
VLSI VLSI
VLSI
 
VLSI Training presentation
VLSI Training presentationVLSI Training presentation
VLSI Training presentation
 
Brain chips ppt
Brain chips pptBrain chips ppt
Brain chips ppt
 
Brain chips ppt
Brain chips pptBrain chips ppt
Brain chips ppt
 
Brain chips
Brain chipsBrain chips
Brain chips
 
Vlsi design and fabrication ppt
Vlsi design and fabrication  pptVlsi design and fabrication  ppt
Vlsi design and fabrication ppt
 
Chip morphing
Chip morphingChip morphing
Chip morphing
 
Basics Of VLSI
Basics Of VLSIBasics Of VLSI
Basics Of VLSI
 
Build Features, Not Apps
Build Features, Not AppsBuild Features, Not Apps
Build Features, Not Apps
 
All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11
 
Fractal robotics
Fractal  roboticsFractal  robotics
Fractal robotics
 
March 3 2004 for the ai cie
March 3 2004 for the ai cieMarch 3 2004 for the ai cie
March 3 2004 for the ai cie
 
Airborn internet
Airborn internetAirborn internet
Airborn internet
 

Similar to Voice morphing-101113123852-phpapp01

voice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfvoice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfDeepthiDeepu668278
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization worksMuhammad Taqi
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentationchakravarthy Gopi
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresSivaranjan Goswami
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesissipij
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language ProcessingVikalp Mahendra
 
Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Harshal Ladhe
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...IRJET Journal
 

Similar to Voice morphing-101113123852-phpapp01 (20)

An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
voice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfvoice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdf
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
G010424248
G010424248G010424248
G010424248
 
Linear Predictive Coding
Linear Predictive CodingLinear Predictive Coding
Linear Predictive Coding
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Linear predictive coding documentation
Linear predictive coding  documentationLinear predictive coding  documentation
Linear predictive coding documentation
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
An Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech featuresAn Introduction to Various Features of Speech SignalSpeech features
An Introduction to Various Features of Speech SignalSpeech features
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesis
 
voice morphing.pptx
voice morphing.pptxvoice morphing.pptx
voice morphing.pptx
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Automatic Speech Recognion
Automatic Speech RecognionAutomatic Speech Recognion
Automatic Speech Recognion
 
Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)Speech compression using loosy predictive coding (lpc)
Speech compression using loosy predictive coding (lpc)
 
50120140501002
5012014050100250120140501002
50120140501002
 
B45010811
B45010811B45010811
B45010811
 
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
Speech Enhancement Based on Spectral Subtraction Involving Magnitude and Phas...
 
A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)
 

Recently uploaded

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 

Recently uploaded (20)

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 

Voice morphing-101113123852-phpapp01

  • 1. Voice morphing Presented By H.Mohammed.Sabir 09AT1A0461 Supervised By Shreedhar Sir
  • 2. SEMINAR OUTLINES What It is? Need of Voice Morphing Description the Morphing. Technical details of Morphing. Application areas.
  • 3. What is Voice Morphing ??  Voice morphing is a technique for modifying a (source) speaker's speech to sound as if it were spoken by a different (target) speaker.  In Simpler terms it is being able to change the speech of one speaker to that of another speaker.  Technology developed at the Los Alamos National Laboratory in New Mexico, USA by George Papcun  Applications for Voice Morphing range from recreational ones to security ones.
  • 4. What it actually performs ?  It is a technique to modify a source speaker's speech to sound as if it was spoken by a target speaker.  Voice morphing enables speech patterns to be cloned  And an accurate copy of a person's voice can be made that can wishes to say, anything in the voice of someone else.
  • 5. Need of voice morphing  Text To Speech (TTS)  In public speech systems  For special effects ( just like video or image morphing is done ).  To diminish Ethnical barriers.
  • 6. How to Morph Voice ??  We need to effectively change the pitch from that of a male speaker to that of a female speaker. If we reminisce the excitation signal has information about the speaker.  We find the LPC coefficients for the Source and Target Signals and using these coefficients we are going to interpolate between the two Signals.  We get the New LPC (linear predictive coding) coefficients using the formula new lpc coeff = [const*(lpc source) + (1-const)(lpc target)]  0 <= const <= 1 …
  • 7. How to Morph Speech ?? (contd…)  The pitch of a female speaker will be close to twice that of the male speaker. In our example the pitch of the male speaker is 141Hz and that of the female speaker is 210Hz.  So we need to develop some time stretching algorithm so that we can implement pitch shifting. We obtain the residue of the source signal and stretch it according to the value of the const. The const indicates what is the position of morphed signal in between the source and target.  For example if const = 0.2 then the morphed signal will be closer in pitch to the source signal and a value of 0.8 for const will result in a pitch that is closer to the target signal.
  • 8. How do we shift the Pitch ??  We break the residue signal into small windows and introduce fade in and fade out for each block. We recombine everything to form the pitch shifted signal. Based on the alpha we can time stretch the residue according to our requirements. How do we Morph finally ?? • We now have the pitch shifted residue signal and the new LPC coefficients. We should resample the pitch shifted signal so that it is played at a faster rate. [Remember when we pitch shift then the residue will last longer]. If we inverse filter the resampled pitch shifted residue then we can effect morphing.
  • 10. Time Domain Plots of Source and Target featuring the Pitch
  • 11.
  • 12. Matching and Warping  DTW(Dynamic Time Warping) - Dynamic Time Warping (DTW) is used to find the best match between the pitch of the two sounds.
  • 13.
  • 14.
  • 15. Signal Re-Estimation  Loss during Signal re-estimation -Due to signals being transformation into the cepstral domain, a magnitude function is used. This results in a loss of phase information in the representation of the data.
  • 16. Limitations   Lots of normalizing problems. Some applications require extensive sound libraries. Different languages require different phonetics. It is very seldom complete.
  • 17. Advantages  Allows speech model to be duplicated and an exact copy of a person’s voice.  Powerful combat zone weapon.
  • 18. Disadvantages  Use to pull out the useful information.  It hides the actual identity of the user.
  • 19. Conclusion  The approach we have adopted separates the sounds into two forms: - Spectral envelope information - Pitch and voicing information.  Dynamic Time Warping - Aligns the sounds with respect to their pitches.  Signal re-estimation algorithm. - Frames are converted back into a time domain waveform.
  • 20. Application Areas  Fake telephone conversations as evidence in courts of law.  Powerful battlefield weapon. - Provide fake orders to the enemy's troops, appearing to come from their own commanders.
  • 21. Future Scope  Extending the functionality of tool. - Create a powerful and flexible morphing tool.  Increased user interaction. - Graphical User Interface could be designed and integrated to make the package more ‘user-friendly’.
  • 22. BIBLIOGRAPHY: • Ye, H. and S. Young (2003). "Perceptually Weighted Linear Transformations for Voice Conversion". Eurospeech 2003, Geneva.  • Ye, H. and S. Young (2004). "High Quality Voice Morphing". Int Conference Acoustics Speech and Signal Processing, Montreal, Canada.  • High quality Voice Morphing Hui Yeand Steve Young. • Quality-enhanced Voice Morphing

Editor's Notes

  1. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA
  2. MAHATMA GANDHI MISSION ENGINEERING COLLEGE,NOIDA