Overview of several open source AI projects

Ebook2Audiobook open source project

Automatic conversion of e-books to audiobooks supports voice cloning, multiple languages

ebook 2audiobook XTTS is an open source project that aims to automatically convert e-books to audiobooks and supports multiple languages, voice cloning and chapter information generation. This project combines Calibre (e-book conversion tool) and Coqui XTTS (text-to-speech engine) to complete the conversion through simple commands or Web interfaces, making it convenient for users to convert their e-books into audio files, suitable for daily book listening needs or personalized audiobook production.

Github:https://github.com/DrewThomasson/ebook2audiobookXTTS

Hertz-dev: The first open source model for session audio

Full duplex real-time voice interaction 120 milliseconds ultra-low latency

Hertz-dev is the first open source model of session audio developed by Standard Intelligence. hertz-dev is a full-duplex, audio-only Transformer basic model.

Its main function is to generate dialogue audio, that is, voice generation that simulates human dialogue. Supports full-duplex audio, which can simultaneously receive and generate audio, just like a phone call or real-time conversation, without having to wait for a sentence to be finished before replying.

GitHub:https://github.com/Standard-Intelligence/hertz-dev

Software name: Xiaobin AI Mattu

Software function: AI image processing

Supported platform: Windows
Software Introduction: A free open source AI image processing tool. Its main functions include one-click image matting, ID photo making and image format conversion.

You can do individual or batch matting by dragging, pasting pictures or links. Ability to efficiently process images in a variety of formats, including jpg, png, gif, webp and bmp.

You can also use the software to make ID photos that meet different specifications and perform secondary editing.

Original text:https://matting.20133075.xyz/

Website function: AI avatar moves

Website name: Discopixel
A greeting card service that uses artificial intelligence technology to provide interesting facial animations and videos.
Just upload a photo and share some interesting facts to generate personalized music videos.
The website is currently preparing for its next release and can be added to the waiting list.

Original text:https://discopixel.app/

Oil tubing:

Scroll to Top