VIDEO & VOICE (Modules 20–23)
VS field compares to the nearest peer. For a solo builder these power product
demos, ads, faceless content, and voiceovers — high-leverage for marketing without
a studio. Prices verified June 2026.
MODULE 20: Runway ML CATEGORY: Video DEPTH: SKIM ONE-LINE SUMMARY: A pro-grade AI video studio — text/image-to-video (Gen-4.x) plus an editing suite (inpainting, motion brush, camera control, green screen). VS NEAREST PEER (Kling): The "creative tool" not just a generator — finer control, editing features, and a workflow built for filmmakers/marketers. Kling often gives longer/cheaper clips with strong realism; Runway gives more direction and post tools. Use Runway when you need control + an editing pipeline. BEST FOR:
- Polished product/ad b-roll with directed camera moves and motion control.
- Editing existing footage (remove objects, extend shots, green-screen) AI-style.
- Client video deliverables where you need to iterate precisely, not roll dice. WEAKNESS: Credits don't roll over and burn fast (Gen-4.5 = 25 credits/sec); high-quality output is expensive at volume; learning curve on the editing tools. COST: ⚠ verify — Free (125 one-time credits); Standard $12/user/mo annual (625 cr/mo); Pro $28/user/mo (2,250 cr); Max $76/user/mo (9,500 cr). Gen-4.5 ~25 cr/sec, Gen-4 Turbo ~5 cr/sec. HANDS-ON TASK: Turn one Midjourney product still into a 4-second cinematic clip with a slow push-in (image-to-video + camera control). That's a reusable ad-intro recipe. GOTCHA: Credits don't roll over and Gen-4.5 eats them at 25/sec — a few seconds of premium video can drain a Standard month. Storyboard before generating; use Turbo for drafts, premium only for finals.
MODULE 21: Kling CATEGORY: Video DEPTH: SKIM ONE-LINE SUMMARY: A Chinese AI video model known for longer, realistic, high-motion clips — strong image-to-video, native audio, and (2026) 4K output. VS NEAREST PEER (Runway): Tends to win on raw clip realism, length, and price-per-second, especially image-to-video. Runway wins on editing/control. Kling is the "generate a great-looking clip cheaply" option; Runway is the "direct and edit it" option. BEST FOR:
- Cheaper, longer realistic clips for social/faceless content at volume.
- Image-to-video bringing your Midjourney/Flux stills to life.
- High-motion shots (action, camera movement) where realism matters. WEAKNESS: Credit math is confusing (first-time vs renewal pricing differs); free content can't be used commercially; less of an editing suite than Runway; data-residency optics (Chinese platform). COST: ⚠ verify — Free (66 daily credits, non-commercial); Standard ~$10/mo (offer ~$6.99); Pro ~$37/mo; Premier ~$92/mo; Ultra ~$180/mo. Kling 3.0 ~6–12 credits/sec by resolution/audio. HANDS-ON TASK: Take the same Midjourney still you used in Runway, generate a clip in Kling, and compare realism, length, and credit cost side by side. Pick your default per use case. GOTCHA: Free-tier clips are non-commercial. If you use Kling output in client or product marketing, you must be on a paid tier — check the license before shipping.
MODULE 22: ElevenLabs CATEGORY: Voice DEPTH: CORE ONE-LINE SUMMARY: The leading AI voice platform — ultra-realistic text-to-speech, instant voice cloning, dubbing, and a low-latency API for voice agents. VS NEAREST PEER (built-in TTS / OpenAI voice): Best-in-class realism and the widest voice toolkit (cloning, multilingual dubbing, voice design, streaming API). Others are catching up on quality but ElevenLabs has the deepest feature set and dev-friendly API. This is your voice layer. BEST FOR:
- Voiceovers for product demos, faceless YouTube/TikTok, and course narration.
- Voice cloning your own voice for scalable content (record once, generate forever).
- Adding a realistic voice to an app/agent via the API (FastAPI + ElevenLabs). WEAKNESS: Credits = characters and they burn fast (1 char = 1 credit on v2; Flash/Turbo at 0.5); higher-volume narration needs the $99 Pro tier; cloning realism can dip on emotional range. COST: ⚠ verify — Free (10k credits/mo); Starter $5/mo (commercial rights + instant cloning); Creator $22/mo (100k chars); Pro $99/mo (500k); Scale $330/mo (2M); Business $990/mo. Credits roll over up to 2 months. HANDS-ON TASK: Clone your own voice (Starter+), then generate a 60-second narration of your freelance pitch. You now have a reusable voice asset for every demo and course video. GOTCHA: Character-metering means long-form narration eats plans fast — a single 20-min course module can blow past Creator's 100k. Use Flash/Turbo models (0.5 credits/char) for drafts and budget the real plan against total minutes.
MODULE 23: Suno CATEGORY: Voice/Music DEPTH: SKIM ONE-LINE SUMMARY: Generates full songs — vocals, lyrics, instruments — from a text prompt. "Describe a song, get a song." VS NEAREST PEER (ElevenLabs / Udio): ElevenLabs does speech; Suno does music with vocals. vs Udio (its main rival): comparable; Suno is the better-known, faster-to-usable option. There's no real Claude-Code analog — it's a different medium entirely. BEST FOR:
- Royalty-clear background music and jingles for your videos/ads (with paid commercial rights).
- Intro/outro music and audio branding for content and courses.
- Fun, fast custom songs as marketing hooks or client freebies. WEAKNESS: Commercial rights require a paid plan; output quality varies and needs reroll luck; fine musical control is limited; credits cap monthly songs. COST: ⚠ verify — Free (50 credits/day, ~10 songs, non-commercial); Pro $10/mo ($8 annual, 2,500 cr ≈ 500 songs, commercial rights); Premier $30/mo ($24 annual, 10,000 cr, Suno Studio). HANDS-ON TASK: Generate a 30-second upbeat intro track for your course/brand, download it, and drop it under a Runway/Kling clip. You now have an end-to-end AI video+music asset. GOTCHA: Free-tier songs are NOT commercially usable. The moment a Suno track goes into client work or monetized content, you need Pro+ for the commercial license. Don't ship free-tier audio.
Video & Voice — at a glance
| Tool | Medium | Killer trait | Cost reflex | Reach for it when |
|---|---|---|---|---|
| Runway | Video | Control + editing suite | $12–76/mo, credits | directed, editable video |
| Kling | Video | Realism, length, price | ~$7–180/mo | cheap realistic clips at volume |
| ElevenLabs | Voice (speech) | Realism + cloning + API | $5–99/mo | voiceover, cloning, voice agents |
| Suno | Music+vocals | Full songs from text | $10–30/mo | music/jingles for content |
Solo-builder content combo: Midjourney still → Kling/Runway motion → ElevenLabs voiceover → Suno music = a full faceless video without a camera, mic-talent, or composer.
Sources (verified June 2026)
Next: 05-automation.md.