AI converts video and audio content into multi-style documents with one click

AI-Media2Doc is an open source AI video graphic creation assistant designed to help users convert audio and video content into multiple styles of documents with one click. The project was developed by hanshuaikang, hosted on GitHub, adopts the MIT license, supports local deployment, and protects user privacy.
An open source Web tool based on the AI model aims to convert video and audio content into multiple styles of documents with one click, including small red books, public accounts, knowledge notes, mind maps, etc.

Project introduction

AI-Media2Doc provides a Web tool based on the AI macro model and capable of converting video and audio content into:

Picture and text notes in the style of Xiaohongshu
Articles on Weixin Official Accounts
Knowledge Notes
mind map
video caption
Various document forms such as content summary

The tool does not require login registration, and the front and back terminals can be deployed locally. Users can experience AI video/audio to document services at a very low cost.

Core functions

Fully open source: Adopt the MIT protocol and support local deployment, making it convenient for users to customize according to their needs.
Privacy protection: No login and registration are required, and task records are stored locally to ensure the security of user data.
Front-end processing: Using ffmpeg wasm technology, there is no need to install ffmpeg locally, improving convenience of use.
Multiple styles support: Support multiple document styles such as small red books, public accounts, knowledge notes, mind maps, and content summaries to meet the needs of different scenarios.
AI Dialogue: Support AI secondary Q & A for video content to improve the interactivity and depth of content.
Subtitle export: Supports conversion of video content into subtitles for subsequent editing and publishing.

🚀Usage

Users can deploy the tool locally by cloning the GitHub repository. The specific steps include:

Clone warehouse:

$git clone https://github.com/hanshuaikang/AI-Media2Doc.git
Go to the project directory and start the service:

$cd AI-Media2Doc
$docker-compose up
Access locally deployed services in a browser, upload audio and video files, select the required document style, and generate corresponding document content.

Project address

You can visit the project on GitHub for more information and the latest updates:

👉AI-Media2Doc GitHub repository

Github：https://github.com/hanshuaikang/AI-Media2Doc

Oil tubing: