We've created an open-source alternative to ElevenLabs for voice cloning and multilingual TTS. Key features:
- Clone voices from 15-second samples
- 50+ pre-trained celebrity voice models
- Support for 100+ languages via Google Translator
- Speech recognition with Whisper
- One-click Windows installation
- AI cover generation with pre-trained models
Demo videos showing podcast creation and multilingual dubbing:
- https://youtu.be/z8g8LMhoh_o (Podcast)
- https://youtu.be/ZtyhrZHbW0Y (Original)
- https://youtu.be/CA4WYdkJrkQ (English)
- https://youtu.be/hSEe0trPtnQ (Spanish)
- https://youtu.be/qwExW2sReNc (Chinese)
Try it: https://github.com/abus-aikorea/voice-pro
Comments URL: https://news.ycombinator.com/item?id=42836934
Points: 5
# Comments: 1
https://github.com/abus-aikorea/voice-pro/blob/main/docs/README.eng.md