YouTube to Doc: Turn Videos into AI-Friendly Documentation
YouTube to Doc: Videos Become Documentation
Ever wanted to feed a YouTube tutorial into your AI coding assistant? YouTube to Doc transforms any YouTube video into structured, readable documentation that LLMs can easily parse. It extracts transcripts, metadata, descriptions, and even comments, turning video content into a format AI tools actually understand.
Perfect for developers who learn from video tutorials and want to reference them in their AI workflows.
Key Features
📺 Complete Video Processing
Extract comprehensive video information:
- Video Metadata - Title, duration, view count, channel info, thumbnails
- Full Transcripts - Automatic extraction in 9+ languages with timestamps
- Descriptions - Complete video descriptions with links
- Comments - Optional inclusion of top comments for additional context
🤖 AI-Optimized Output
Documentation is structured specifically for LLM consumption:
- Clean, formatted text that’s easy to tokenize
- Token count estimation using tiktoken
- Structured sections for metadata, transcript, and comments
- Perfect for feeding into Claude, GPT, or other AI coding assistants
⚡ Developer-Friendly
Built by developers, for developers:
- RESTful API - Programmatic access for automation
- Docker Ready - One-command deployment
- Rate Limiting - Built-in protection with slowapi
- Multi-Language - Supports English, Spanish, French, German, Italian, Portuguese, Japanese, Korean, and Chinese
Platforms
YouTube to Doc runs anywhere Docker or Python works:
- 🐧 Linux
- 🍎 macOS
- 🪟 Windows
Get Started
Option 1: Docker (Recommended)
# Clone the repository
git clone https://github.com/Solomonkassa/Youtube-to-Doc.git
cd youtubedoc
# Run with Docker Compose
docker-compose up -dOption 2: Local Installation
# Clone the repository
git clone https://github.com/Solomonkassa/Youtube-to-Doc.git
cd youtubedoc
# Install dependencies
pip install -r requirements.txt
# Run the application
uvicorn src.server.main:app --host 0.0.0.0 --port 8000 --reloadAPI Usage Example
import requests
url = "http://localhost:8000/"
data = {
"input_text": "https://www.youtube.com/watch?v=VIDEO_ID",
"max_transcript_length": 10000,
"language": "en",
"include_comments": False
}
response = requests.post(url, data=data)
print(response.text)🔗 GitHub: github.com/Solomonkassa/Youtube-to-Doc
Why This Tool Rocks
- Bridge Video & AI: Finally make YouTube tutorials accessible to your AI coding tools
- Multi-Language Support: Process videos in 9+ languages automatically
- Self-Hosted: Run it locally or on your own server-your data, your control
- Cloud-Ready: Docker deployment makes it easy to run on any cloud provider
- Open Source: MIT licensed-fork it, extend it, make it yours
Crepi il lupo! 🐺