程序師世界是廣大編程愛好者互助、分享、學習的平台,程序師世界有你更精彩!
首頁
編程語言
C語言|JAVA編程
Python編程
網頁編程
ASP編程|PHP編程
JSP編程
數據庫知識
MYSQL數據庫|SqlServer數據庫
Oracle數據庫|DB2數據庫
您现在的位置: 程式師世界 >> 編程語言 >  >> 更多編程語言 >> Python

Super simple to teach you how to clone sound in python (take Juan Fu as an example)

編輯:Python

Voice cloning is a popular deep learning application in the past two years , It allows you to learn the way an object speaks and its tone from a few seconds of audio , And use it to generate new voice .

Let's take a look at the SV2TTS Practice imitating the effect of reading the following sentence :

She is beginning to get many wrinkles around her eyes.

Training set :

clone 、 Imitation effect (She is beginning to get many wrinkles around her eyes.):

The result is right , If you don't know it is Python Generated , I thought it was really a blessing .

Let's teach you how to use  Real-Time-Voice-Cloning  The project clones the voice and generates the desired statement .

1. Get ready

You can go to Real-Time-Voice-Cloning Project download the code of this project and the model completed by pre training .( Be careful , need Python 3.6 above To run this project ):
https://github.com/CorentinJ/Real-Time-Voice-Cloning

If your network speed is poor , Can't download github Project and its pre training model , Can be in  Python Practical treasure   Official account back office reply   Clone voice   Download the complete project code and pre training model .

After downloading the project code , You also need to download two important dependencies :

  • PyTorch(> = 1.0.1)
    https://pytorch.org/get-started/locally/

  • ffmpeg
    https://ffmpeg.org/download.html

install PyTorch

among ,PyTorch The official guide to has been written very clearly , You can install it according to your own needs .

install ffmpeg

ffmpeg We have covered the installation of in detail in this article :Python Multiple music format conversion ( Batch ) Practical course , Here, I will explain the installation guide of each system again :

Mac ( Open the terminal (Terminal), use  homebrew  install ):

brew install ffmpeg --with-libvorbis --with-sdl2 --with-theora

Linux:

apt-get install ffmpeg libavcodec-extra

Windows:

1. Go to the following link , Click on windows The corresponding icon , Enter the download interface and click download Download button :
http://ffmpeg.org/download.html#build-windows

2. Unzip download okay zip File to specified directory
3. Put the extracted files in the directory bin Catalog ( contain ffmpeg.exe ) Added to the path In the environment variables

Installation module depends on

After the installation of the above two important dependencies , At terminal 、 Enter the project directory from the command line , install Python rely on :

pip install -r requirements.txt

This command will install all requirements.txt All dependencies in . After completing the above dependent installation , We can go on to the next step .

2. Download pre training model ( Optional )

If you use the project documents provided by us , You don't have to do this again , Because the pre training models have been put in .

If you of no avail Python The project code provided by the practical dictionary , You also need to download the pre training model :https://github.com/CorentinJ/Real-Time-Voice-Cloning/wiki/Pretrained-models

Unzip after download pretrained.zip Put the corresponding models into the corresponding positions of the project :

encoder\saved_models\pretrained.pt
synthesizer\saved_models\pretrained\pretrained.pt
vocoder\saved_models\pretrained\pretrained.pt

3. Try cloning voice

Pick any voice of the person you want to clone , Probably 30 About seconds , Put it in the project folder . Then run the command in this folder :

python demo_cli.py

If everything goes well , It will appear allowing you to select the training voice file :


At this point, enter a speech you have prepared , Wait for it to finish training , It will let you type the text you want to imitate :


Like in the picture above , I entered :

She is beginning to get many wrinkles around her eyes.

The cloning result will be read out automatically after the program is generated , If you don't hear the cloning results , No problem , The program will save it in the current folder , Name it demo_output_xx.wav.

Double click to open this file , It is the result of voice cloning generated by it , Listen to it , Is it the effect you want ?

If you don't achieve your desired results , Please check the training set for any noise 、 Is it long enough 、 Is there anyone else involved , These factors may lead to unsatisfactory cloning effect .

This is the end of our article , If you like today's Python Practical course , Stay tuned Python Practical treasure .

Any questions , You can reply in official account. : Add group , Answer accordingly Red letter verification information , Enter the mutual aid group and ask .

Originality is not easy. , I hope you can praise me and support me to continue to create , thank you !

Click below to read the original text for a better reading experience

Python Practical treasure  (pythondict.com)
It's not just a treasure book
Welcome to the official account :Python Practical treasure


  1. 上一篇文章:
  2. 下一篇文章:
Copyright © 程式師世界 All Rights Reserved