--- title: Vidzly short_description: Transform raw footage into viral-ready content in seconds. thumbnail: https://cdn.tihado.com/app.png emoji: 🎬 colorFrom: blue colorTo: purple sdk: docker app_port: 7860 pinned: false tags: - mcp-in-action-track-creative - building-mcp-track-creative - video-editor - mcp-server - gradio - gemini - elevenlabs - gradio-mcp ---

Vidzly Logo

# 🎬 Vidzly - Your AI-Powered Short Video Creator > **Transform raw footage into viral-ready content in seconds. No skills required. No expensive gear needed. Just your vision and our AI.** ## ✨ What is Vidzly? Vidzly is an intelligent automation platform that revolutionizes short-form video creation. Whether you're a micro-influencer, content creator, or business owner, Vidzly transforms your raw clips into polished, engaging videos that stop the scroll.

▶️ Watch on YouTube

💼 View on LinkedIn

### 🚀 Why Vidzly? - **Zero Learning Curve**: No video editing skills? No problem. Use our intuitive web interface. - **AI-Powered Magic**: Advanced AI handles video analysis, cutting, transitions, music generation, and thumbnail creation automatically. - **Lightning Fast**: What takes hours in traditional editing software takes minutes with Vidzly. - **Professional Quality**: Get studio-quality results without the studio price tag. - **MCP Tools Integration**: All tools are available as MCP (Model Context Protocol) tools for AI agent integration. ### 🎯 Perfect For - 📱 Micro-influencers building their social media presence - 🎨 Content creators who want to focus on creativity, not editing - 💼 Small businesses creating marketing content - 🎓 Educators making engaging educational clips - 🎪 Anyone who wants to create stunning videos effortlessly ## 🎬 How It Works 1. **Upload Your Raw Footage** - Drop your clips through the Gradio web interface 2. **Describe Your Vision** - Optionally provide a description of the mood, style, or vibe you want 3. **AI-Powered Parallel Processing** - Our optimized workflow processes your videos efficiently using parallel execution: - **Parallel Video Analysis**: All videos are analyzed concurrently using Google Gemini AI to understand content, mood, and key moments - **Script Generation**: Creates intelligent composition scripts with scene sequences, transitions, and timing - **Parallel Asset Creation**: Music generation and frame extraction run simultaneously to save time - **Thumbnail Generation**: Creates engaging thumbnails with AI-generated text and stickers - **Video Composition**: Combines clips, music, and thumbnail into a polished final video with smooth transitions 4. **Get Your Masterpiece** - Receive a polished video with thumbnail overlay on the first frame ## 🛠️ Available Tools Vidzly provides a comprehensive suite of MCP tools accessible through a Gradio web interface: - 🎥 **Video Summarizer**: Uses Google Gemini AI to analyze video content and generate detailed summaries including key scenes, detected objects, mood tags, and recommended thumbnail timestamps - ✂️ **Video Clipper**: Extract specific segments from videos by specifying start and end times - 🖼️ **Frame Extractor**: Extract representative frames from videos, with AI-powered selection or manual timestamp specification - 🎨 **Thumbnail Generator**: Automatically generate engaging thumbnails with AI-generated text and stickers based on video frames and summaries - 🎬 **Video Composer**: Combine multiple video clips with transitions (fade, crossfade, cut) and optional background music according to a JSON script. Supports optional thumbnail image overlay on the first frame - 🎵 **Music Selector**: Generate background music and sound effects using ElevenLabs API based on mood, style, duration, BPM, and other parameters ## 🏗️ Architecture - **Web Interface**: Built with Gradio (with MCP server support) - **Parallel Workflow Engine**: Optimized tool orchestration with parallel execution for maximum performance: - **Concurrent Video Analysis**: Multiple videos analyzed simultaneously using ThreadPoolExecutor - **Intelligent Script Generation**: AI-powered scene planning with automatic timestamp validation - **Parallel Asset Creation**: Music generation and frame extraction run concurrently - **Smart Video Composition**: Validates and corrects timestamps, handles transitions, and ensures accurate durations - **AI Integration**: Google Gemini for video understanding, analysis, script generation, and thumbnail creation - **Audio Generation**: ElevenLabs API for music and sound effect generation - **Video Processing**: MoviePy for video editing, clipping, and composition with duration validation - **Image Processing**: OpenCV and Pillow for frame extraction and thumbnail generation - **Testing**: Comprehensive pytest test suite with unit and integration tests ## 👥 Team **Team Name:** Vidzly **Team Members:** - 🐮 Hồng Hạnh - [@tthhanh](https://huggingface.co/tthhanh) - AI Engineer - 🐔 Việt Tiến - [@tiena2cva](https://huggingface.co/tiena2cva) - AI Engineer - 🐻 Nhật Linh - [@Nlag](https://huggingface.co/NLag) - AI Engineer - 🐰 Phương Nhi - [@Daphneee17](https://huggingface.co/Daphneee17) - AI Engineer ## Setup This project uses [Poetry](https://python-poetry.org/) for dependency management. ### Installing Poetry If you don't have Poetry installed, you can install it using: ```bash curl -sSL https://install.python-poetry.org | python3 - ``` Or on macOS with Homebrew: ```bash brew install poetry ``` ### Installing Dependencies Once Poetry is installed, install the project dependencies: ```bash poetry install ``` This will create a virtual environment and install all dependencies specified in `pyproject.toml`. ### Activating the Virtual Environment To activate the Poetry virtual environment: ```bash poetry shell ``` Alternatively, you can run commands within the virtual environment without activating it: ```bash poetry run ``` ### Adding Dependencies To add a new dependency: ```bash poetry add ``` To add a development dependency: ```bash poetry add --group dev ``` ### Removing Dependencies To remove a dependency: ```bash poetry remove ``` ### Updating Dependencies To update all dependencies to their latest compatible versions: ```bash poetry update ``` ### Code Formatting with Black and Lefthook This project uses [Black](https://black.readthedocs.io/) for code formatting and [Lefthook](https://github.com/evilmartians/lefthook) for git hooks to automatically format code before commits. After installing dependencies, set up lefthook: ```bash poetry run lefthook install ``` This will install git hooks that will: - **Before commit**: Automatically format staged Python files with Black - **Before push**: Check that all Python files in `src/` and `tests/` are properly formatted To manually format code: ```bash poetry run black src/ tests/ ``` To check formatting without making changes: ```bash poetry run black --check src/ tests/ ``` ### Setting Up Environment Variables Create a `.env` file in the root directory and add your environment variables. ```bash GOOGLE_API_KEY=your_google_api_key ELEVENLABS_API_KEY=your_elevenlabs_api_key ``` **Note**: The application uses Google Gemini API for AI-powered video analysis and script generation. ### Running the Application Start the Gradio web interface: ```bash poetry run python src/app/app.py ``` This will launch a web interface with: - **Vidzly Tab**: Project introduction and overview - **MCP Tools Tab**: Access to all 6 video processing tools The application runs with MCP server support, allowing AI agents to interact with the tools programmatically. ## Testing This project includes comprehensive unit and integration tests. See [tests/README.md](tests/README.md) for detailed testing documentation. ### Running Tests Run all tests: ```bash poetry run pytest ``` Run with coverage: ```bash poetry run pytest --cov=src/app/tools --cov-report=html ``` Run specific test file: ```bash poetry run pytest tests/test_video_summarizer.py ``` ### Test Structure - **Unit Tests**: Mocked tests for input validation, error handling, and logic - **Integration Tests**: Real video file tests for actual functionality - All tools have corresponding test files in the `tests/` directory ## Technology Stack - **Python 3.12+**: Core language - **Gradio 6.0+**: Web interface with MCP support - **Sequential Workflow**: Step-by-step tool orchestration for video processing - **Google Gemini API**: Video understanding, analysis, script generation, and thumbnail creation - **ElevenLabs API**: Music and sound effect generation - **MoviePy 2.2.1**: Video editing, composition, and image overlay - **OpenCV 4.12+**: Video processing and frame extraction - **Pillow 11**: Image processing for thumbnails - **Poetry**: Dependency management - **pytest**: Testing framework