Soumettre la recherche
Mettre en ligne
Autotuned voice cloning enabling multilingualism
•
0 j'aime
•
4 vues
IRJET Journal
Suivre
https://www.irjet.net/archives/V9/i11/IRJET-V9I11111.pdf
Lire moins
Lire la suite
Ingénierie
Signaler
Partager
Signaler
Partager
1 sur 3
Télécharger maintenant
Télécharger pour lire hors ligne
Recommandé
Review On Speech Recognition using Deep Learning
Review On Speech Recognition using Deep Learning
IRJET Journal
A survey on Enhancements in Speech Recognition
A survey on Enhancements in Speech Recognition
IRJET Journal
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET Journal
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IRJET Journal
Voice Assistant Using Python and AI
Voice Assistant Using Python and AI
IRJET Journal
IRJET - Sign Language Recognition System
IRJET - Sign Language Recognition System
IRJET Journal
Voice Based Email System For Visual Impaired Using AI
Voice Based Email System For Visual Impaired Using AI
IRJET Journal
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation System
IRJET Journal
Recommandé
Review On Speech Recognition using Deep Learning
Review On Speech Recognition using Deep Learning
IRJET Journal
A survey on Enhancements in Speech Recognition
A survey on Enhancements in Speech Recognition
IRJET Journal
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET Journal
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IRJET Journal
Voice Assistant Using Python and AI
Voice Assistant Using Python and AI
IRJET Journal
IRJET - Sign Language Recognition System
IRJET - Sign Language Recognition System
IRJET Journal
Voice Based Email System For Visual Impaired Using AI
Voice Based Email System For Visual Impaired Using AI
IRJET Journal
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation System
IRJET Journal
IRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for Blinds
IRJET Journal
IRJETDevice for Location Finder and Text Reader for Visually Impaired People
IRJETDevice for Location Finder and Text Reader for Visually Impaired People
IRJET Journal
IRJET- Device for Location Finder and Text Reader for Visually Impaired P...
IRJET- Device for Location Finder and Text Reader for Visually Impaired P...
IRJET Journal
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
IRJET Journal
IRJET- Text Reading for Visually Impaired Person using Raspberry Pi
IRJET- Text Reading for Visually Impaired Person using Raspberry Pi
IRJET Journal
Smart Sound Measurement and Control System for Smart City
Smart Sound Measurement and Control System for Smart City
IRJET Journal
Real Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object Detection
IRJET Journal
SURVEY ON SMART VIRTUAL VOICE ASSISTANT
SURVEY ON SMART VIRTUAL VOICE ASSISTANT
IRJET Journal
Voice-based Login System
Voice-based Login System
IRJET Journal
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
IRJET Journal
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
IRJET Journal
Design of a Communication System using Sign Language aid for Differently Able...
Design of a Communication System using Sign Language aid for Differently Able...
IRJET Journal
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET Journal
IRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound Recognition
IRJET Journal
IRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech Analysis
IRJET Journal
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
IRJET Journal
IRJET - Hand Gestures Recognition using Deep Learning
IRJET - Hand Gestures Recognition using Deep Learning
IRJET Journal
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET Journal
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
IRJET Journal
IRJET- My Buddy App: Communications between Smart Devices through Voice A...
IRJET- My Buddy App: Communications between Smart Devices through Voice A...
IRJET Journal
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
Contenu connexe
Similaire à Autotuned voice cloning enabling multilingualism
IRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for Blinds
IRJET Journal
IRJETDevice for Location Finder and Text Reader for Visually Impaired People
IRJETDevice for Location Finder and Text Reader for Visually Impaired People
IRJET Journal
IRJET- Device for Location Finder and Text Reader for Visually Impaired P...
IRJET- Device for Location Finder and Text Reader for Visually Impaired P...
IRJET Journal
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
IRJET Journal
IRJET- Text Reading for Visually Impaired Person using Raspberry Pi
IRJET- Text Reading for Visually Impaired Person using Raspberry Pi
IRJET Journal
Smart Sound Measurement and Control System for Smart City
Smart Sound Measurement and Control System for Smart City
IRJET Journal
Real Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object Detection
IRJET Journal
SURVEY ON SMART VIRTUAL VOICE ASSISTANT
SURVEY ON SMART VIRTUAL VOICE ASSISTANT
IRJET Journal
Voice-based Login System
Voice-based Login System
IRJET Journal
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
IRJET Journal
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
IRJET Journal
Design of a Communication System using Sign Language aid for Differently Able...
Design of a Communication System using Sign Language aid for Differently Able...
IRJET Journal
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET Journal
IRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound Recognition
IRJET Journal
IRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech Analysis
IRJET Journal
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
IRJET Journal
IRJET - Hand Gestures Recognition using Deep Learning
IRJET - Hand Gestures Recognition using Deep Learning
IRJET Journal
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET Journal
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
IRJET Journal
IRJET- My Buddy App: Communications between Smart Devices through Voice A...
IRJET- My Buddy App: Communications between Smart Devices through Voice A...
IRJET Journal
Similaire à Autotuned voice cloning enabling multilingualism
(20)
IRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for Blinds
IRJETDevice for Location Finder and Text Reader for Visually Impaired People
IRJETDevice for Location Finder and Text Reader for Visually Impaired People
IRJET- Device for Location Finder and Text Reader for Visually Impaired P...
IRJET- Device for Location Finder and Text Reader for Visually Impaired P...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
IRJET- Text Reading for Visually Impaired Person using Raspberry Pi
IRJET- Text Reading for Visually Impaired Person using Raspberry Pi
Smart Sound Measurement and Control System for Smart City
Smart Sound Measurement and Control System for Smart City
Real Time Sign Language Translation Using Tensor Flow Object Detection
Real Time Sign Language Translation Using Tensor Flow Object Detection
SURVEY ON SMART VIRTUAL VOICE ASSISTANT
SURVEY ON SMART VIRTUAL VOICE ASSISTANT
Voice-based Login System
Voice-based Login System
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
SPEECH RECOGNITION WITH LANGUAGE SPECIFICATION
Design of a Communication System using Sign Language aid for Differently Able...
Design of a Communication System using Sign Language aid for Differently Able...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- A Survey on Sound Recognition
IRJET- A Survey on Sound Recognition
IRJET - Threat Prediction using Speech Analysis
IRJET - Threat Prediction using Speech Analysis
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
IRJET - Hand Gestures Recognition using Deep Learning
IRJET - Hand Gestures Recognition using Deep Learning
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...
IRJET- My Buddy App: Communications between Smart Devices through Voice A...
IRJET- My Buddy App: Communications between Smart Devices through Voice A...
Plus de IRJET Journal
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
IRJET Journal
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
IRJET Journal
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
IRJET Journal
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
IRJET Journal
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
IRJET Journal
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
IRJET Journal
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
IRJET Journal
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
IRJET Journal
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
React based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
Plus de IRJET Journal
(20)
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
React based fullstack edtech web application
React based fullstack edtech web application
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Dernier
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdf
smsksolar
School management system project Report.pdf
School management system project Report.pdf
Kamal Acharya
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
SCMS School of Architecture
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Call Girls Mumbai
Online electricity billing project report..pdf
Online electricity billing project report..pdf
Kamal Acharya
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
bhaskargani46
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic Marks
Magic Marks
Thermal Engineering Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
DineshKumar4165
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
hamedmustafa094
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
NANDHAKUMARA10
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
BhangaleSonal
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
Quintin Balsdon
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
JIT KUMAR GUPTA
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
RishantSharmaFr
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
soginsider
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
smsksolar
Air Compressor reciprocating single stage
Air Compressor reciprocating single stage
Abc194748
Online food ordering system project report.pdf
Online food ordering system project report.pdf
Kamal Acharya
Dernier
(20)
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdf
School management system project Report.pdf
School management system project Report.pdf
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Online electricity billing project report..pdf
Online electricity billing project report..pdf
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic Marks
Thermal Engineering Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
Air Compressor reciprocating single stage
Air Compressor reciprocating single stage
Online food ordering system project report.pdf
Online food ordering system project report.pdf
Autotuned voice cloning enabling multilingualism
1.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 11 | Nov 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 739 Autotuned voice cloning enabling multilingualism Prof. Priyadarshani Doke1, Piyush Jaiswal2, Neha Karmal3, Vivek Patil4, Samnan Shaikh5 ALARD COLLEGE OF ENGINEERING & MANAGEMENT (ALARD Knowledge Park, Survey No. 50, Marunji, Near Rajiv Gandhi IT Park, Hinjewadi, Pune-411057) Approved by AICTE. Recognized by DTE. NAAC Accredited. Affiliated to SPPU (Pune University). ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - This article describes a neural network-based text-to-speech (TTS) synthesis system that can generate spoken audio in a variety of speakervoices, includingthose not seen during training. We show that the proposed model can convert natural-language text-to-speech into a target language, and synthesize and translatenaturaltext-to-speech. We quantify the importance oftrainedvoicemodulestoobtain the best generalization performance. Finally, using randomly selected speaker embeddings, we show that speech can be synthesized with new speaker voices used in training and that the model learned high-quality speaker representations. We have also introduced a multilingual system and auto-tuner that allows you to translate regular text into another language, which makes multilingualization possible Key Words: (Text to speech, Speech Synthesizer, Voice Cloning, Auto-tuner, Multilinguism) … 1. Introduction Voice cloning is the process in which one uses a computer to generate the speech of a real individual, creating a clone of their specific, unique voice using neural networks. A text-to- speech (TTS) system simply converts text to speech. In this project we are using TTS systems which are trained with datasets composed of texts and audio, thus, the system learns the sound (e.g., the waveform) of letters, words, and sentences. However, the resulting voice is the same as the one presented in the training dataset, which means that to produce a specific voice the TTS system needs to be trained with the target voice. Text is normal voice. Synthetic speech can be generated by concatenating recorded speech segments. In addition, synthesizers can combine voice models and other features of the human voice to produce a fully "synthesized" speech output. 1.1 Voice Cloning Voice cloning is the process in which one uses a computer to generate the speech of a real individual, creating a clone of their specific, unique voice using neural networks. This model is composed of an encoder, and decoder and converts the text into audio using a vocoder. After receiving the text data the model detects the endpoint and evaluates the voice according to the condition that the voice is detected clearly or not. We are also using an auto-tuner for altering the tone, and pitch and smootheningthevoice.Atthistime,composed of 60+ languages. According to the paper, the latest multi- lingual text-to-speech systems require a large amount of data for training or handling onlytwotothreelanguages, but in this model, training with a small amount of data through deep learning technique enabled the high performance of synthetic sounds and stable voice-cloning between multiple languages (English, French, Chinese and Russian). 1.2 TTS The goal of this paper is to make a TTS system that can induce natural speech for a variety of speakers in a data- effective manner. Speech synthesis is a technology that allows a computer to convert written text into speech via a microphone or telephone. As an arising technology, not all inventors are familiar with speech technology. We specifically address a zero-shot literacy setting, wheremany seconds of un-transcribed reference audio from a target speaker is used to synthesize new speech in that speaker’s voice, without streamlining any model parameters. Still, it's also important to note the eventuality for abuse of this technology, for illustration impersonating someone’s voice without their concurrence. To address safety enterprises harmonious with principles similar, we corroborate that voices generated by the proposed model can fluently be distinguished from real voices. 1.3 Text-To-Speech Synthesis A speech synthesis system is by description a system, which produces synthetic speech. It's implicitly clear, that this involves some kind of input. What isn't clear is the type of this input. However, which doesn't contain fresh phonetic and/ or phonological information thesystemmaybecalled a Text-To-Speech (TTS) system, If the input is plain text. As shown, the conflation starts from text input. currently this may be plain text or pronounced-up text e.g., HTML or commodity analogous like JSML (Java Synthesis Mark- up Language). 1.4 Auto Tuner Auto-Tune uses a proprietary device to measure and alter the pitch of vocal and instrumental music recordings and performances. The training data consists of performance
2.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 11 | Nov 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 740 pairs that are identical except for pitch. Such pairs are necessary for model training, but difficult to find naturally. Therefore, we construct input signals by detuning high- quality vocal performances and synthesize them by training a model to predict shifts that restore the original pitch. 3. Objectives of the study It aims to induce synthetic voices veritably analogous to an original voice. Grounded on deep- literacy ways, this technology takes advantage of a set of audios of the original voice in order to train a model able of generatingnewaudios that sound a suchlike The specific objects are 1. To enable the deaf and dumb to communicate and contribute to the growth of an association through synthesized voice. 2. To enable the eyeless and senior people enjoy a stoner- friendly computer interface. 3. To produce ultramodern technology appreciation and mindfulness by computer drivers. 4. To apply an insulated whole word speech synthesizerthat can convert text and responding with speech. 5. To measure and alter pitch in oral and necessary music recording and performances. 4. Scope of the study Use-case diagrams describe the high-level functions and scope of a system. These diagrams also identify the interactions between the system and its actors. A Use case diagram outlines how external entities i.e. user interactwith an internal software system. Diagram -1: Use Case Diagram A state diagram consists of states, transitions, events, and activities. It describes the different states that an object moves through or provide an abstract description of the behavior of a system. Diagram -2: State Diagram Activity diagramsaregraphicalrepresentationsofworkflows of stepwise activities and actions with support for choice, iteration and concurrency. Diagram -3: Activity Diagram
3.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 11 | Nov 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 741 3. CONCLUSION In this research paper, we have successfully studied about Auto-tuned voice cloning which enables Multilingualism. In future, we are planning to use this model in Google Maps, Transportation services for creating a familiar voice to sound very natural and able to understand instructions fast and easily. ACKNOWLEDGEMENT This paper was supported by Alard College of Engineering & Management, Pune 411057. Weareverythankful toall those who have provided us valuable guidance towards the completion of this Seminar Report on “Autotuned voice cloning enabling multilingualism” as part of the syllabus of our course. We express our sincere gratitude towards the cooperative department who has provided us with valuable assistance and requirements for the system development. We are very grateful and want to express our thanks to Prof. Priyadarshini Doke for guiding us in the right manner, correcting our doubts by giving us their time whenever we required, and providing their knowledge and experience in making this project. REFERENCES [1] Jiwon Seong and WooKey Lee, Suan Lee, “Multilingual Speech Synthesis for Voice Cloning” 2021 IEEE International Conference on Big Data and Smart Computing (BigComp)|978-1-7281-8924-6/20/$31.00 ©2021IEEE|DOI:10.1109/BigComp51126.2021.00067 [2] Sanna Wager1 , George Tzanetakis2,3 , Cheng-i Wang3 , Minje Kim1 “DEEPAUTOTUNER:APITCHCORRECTING NETWORK FOR SINGING PERFORMANCES” [3] Nal Kalchbrenner * 1 Erich Elsen * 2 Karen Simonyan 1 Seb Noury 1 Norman Casagrande 1 Edward Lockhart 1 Florian Stimberg 1 Aaron van den Oord ¨ 1 Sander Dieleman 1 Koray Kavukcuoglu “Efficient Neural Audio Synthesis” [4] Li Zhao , Li Zhao “Research on Voice Cloning with a Few Samples” 2020 International Conference on Computer Network, Electronic and Automation (ICCNEA) [5] Ye Jia∗ Yu Zhang∗ Ron J. Weiss∗ Quan Wang Jonathan Shen Fei Ren Zhifeng Chen Patrick Nguyen Ruoming Pang Ignacio Lopez Moreno Yonghui Wu “Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis” arXiv:1806.04558v4 [cs.CL] 2 Jan 2019 [6] Qicong Xie1 , Xiaohai Tian2 , Guanghou Liu1 , KunSong1 , Lei Xie1∗ , Zhiyong Wu3 , Hai Li4 , Song Shi4 , Haizhou Li2,5 , Fen Hong6 , Hui Bu7 , Xin Xu “THE MULTI- SPEAKER MULTI-STYLE VOICE CLONING CHALLENGE 2021” [7] Li Wan Quan Wang Alan Papir Ignacio Lopez Moreno “GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION” arXiv:1710.10467v5 [eess.AS] 9 Nov 2020 [8] Yuxuan Wang∗ , RJ Skerry-Ryan∗ , Daisy Stanton, Yonghui Wu, Ron J. Weiss† , Navdeep Jaitly, Zongheng Yang, Ying Xiao∗ , Zhifeng Chen, Samy Bengio† ,QuocLe, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous “TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS”: 6 Apr 2017 [9] Nwakanma Ifeanyi1 , Oluigbo Ikenna2 and Okpala Izunna3 “Text – To – Speech Synthesis (TTS)” IJRIT International Journal of Research in Information Technology,
Télécharger maintenant