Ok this is pretty awesome
Quote: |
15.ai - A deep learning text-to-speech tool for generating natural high-quality voices of characters with minimal data (MIT)
This is a text-to-speech tool that you can use to generate 44.1 kHz voices of various characters. The voices are generated in real time using multiple audio synthesis algorithms and customized deep neural networks trained on very little available data (between 30 and 120 minutes of clean dialogue for each character). This project demonstrates a significant reduction in the amount of audio required to realistically clone voices while retaining their affective prosodies. |
https://15.ai/
https://15.ai/examples
https://15.ai/faq
May the NFOrce be with you always.