vibestack

VIDEO & VOICE (Modules 20–23)

The VS NEAREST PEER field compares each tool to its closest alternative. For a solo builder, these tools power product demos, ads, faceless content, and voiceovers: high-leverage marketing without a studio. Prices verified June 2026.

MODULE 20: Runway ML
CATEGORY: Video
DEPTH: SKIM
ONE-LINE SUMMARY: A pro-grade AI video studio: text/image-to-video (Gen-4.x) plus an editing suite (inpainting, motion brush, camera control, green screen).
VS NEAREST PEER (Kling): Runway is a creative tool, not just a generator: finer control, editing features, and a workflow built for filmmakers and marketers. Kling often gives longer, cheaper clips with strong realism; Runway gives more direction and post tools. Use Runway when you need control plus an editing pipeline.
BEST FOR:

  • Polished product/ad b-roll with directed camera moves and motion control.
  • AI-assisted editing of existing footage (remove objects, extend shots, green screen).
  • Client video deliverables where you need to iterate precisely, not roll dice.

WEAKNESS: Credits don't roll over and burn fast (Gen-4.5 = 25 credits/sec); high-quality output is expensive at volume; the editing tools have a learning curve.
COST (⚠ verify): Free (125 one-time credits); Standard $12/user/mo billed annually (625 cr/mo); Pro $28/user/mo (2,250 cr); Max $76/user/mo (9,500 cr). Gen-4.5 ~25 cr/sec, Gen-4 Turbo ~5 cr/sec.
HANDS-ON TASK: Turn one Midjourney product still into a 4-second cinematic clip with a slow push-in (image-to-video + camera control). That's a reusable ad-intro recipe.
GOTCHA: Credits don't roll over and Gen-4.5 eats them at 25/sec, so about 25 seconds of premium video drains a whole Standard month (625 credits). Storyboard before generating; use Turbo for drafts, premium only for finals.
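To make the credit math concrete, here is a minimal sketch that turns the plan allowances and per-second rates quoted in the COST line into seconds of video per month. The numbers are copied from this module and carry the same "verify" caveat; the function names are just illustrative.

```python
# Figures copied from the COST line above; verify before relying on them.
MONTHLY_CREDITS = {"Standard": 625, "Pro": 2250, "Max": 9500}
CREDITS_PER_SECOND = {"Gen-4.5": 25, "Gen-4 Turbo": 5}


def seconds_per_month(plan: str, model: str) -> float:
    """Seconds of video one month of credits buys on a single model."""
    return MONTHLY_CREDITS[plan] / CREDITS_PER_SECOND[model]


def clip_cost(seconds: float, model: str) -> float:
    """Credits consumed by one generation of the given length."""
    return seconds * CREDITS_PER_SECOND[model]


if __name__ == "__main__":
    for plan in MONTHLY_CREDITS:
        for model in CREDITS_PER_SECOND:
            secs = seconds_per_month(plan, model)
            print(f"{plan:8} {model:12} {secs:6.0f} s/month")
    # The hands-on task: one 4-second push-in, per attempt.
    print("4 s on Gen-4.5:", clip_cost(4, "Gen-4.5"), "credits per attempt")
```

Every reroll is billed again, so multiply `clip_cost` by the number of attempts you expect before you commit to a premium model.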

MODULE 21: Kling
CATEGORY: Video
DEPTH: SKIM
ONE-LINE SUMMARY: A Chinese AI video model known for longer, realistic, high-motion clips, with strong image-to-video, native audio, and (as of 2026) 4K output.
VS NEAREST PEER (Runway): Kling tends to win on raw clip realism, length, and price per second, especially for image-to-video. Runway wins on editing and control. Kling is the "generate a great-looking clip cheaply" option; Runway is the "direct and edit it" option.
BEST FOR:

  • Cheaper, longer realistic clips for social/faceless content at volume.
  • Image-to-video that brings your Midjourney/Flux stills to life.
  • High-motion shots (action, camera movement) where realism matters.

WEAKNESS: Credit math is confusing (first-time and renewal pricing differ); free content can't be used commercially; less of an editing suite than Runway; data-residency optics (it is a Chinese platform).
COST (⚠ verify): Free (66 daily credits, non-commercial); Standard ~$10/mo (offer ~$6.99); Pro ~$37/mo; Premier ~$92/mo; Ultra ~$180/mo. Kling 3.0 ~6–12 credits/sec depending on resolution and audio.
HANDS-ON TASK: Take the same Midjourney still you used in Runway, generate a clip in Kling, and compare realism, length, and credit cost side by side. Pick your default per use case.
GOTCHA: Free-tier clips are non-commercial. If you use Kling output in client or product marketing, you must be on a paid tier. Check the license before shipping.
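For the side-by-side comparison in the hands-on task, a small sketch like this can pre-compute what the same clip should cost in credits on each tool. It uses only the per-second rates quoted in Modules 20 and 21. Note the assumption baked in: Runway and Kling credits are separate currencies, and this handout does not list Kling's credits per plan, so the output compares credits, not dollars.

```python
# Per-second rates copied from the COST lines above; verify before budgeting.
KLING_CR_PER_SEC = (6, 12)  # Kling 3.0, low/high by resolution and audio
RUNWAY_CR_PER_SEC = {"Gen-4 Turbo": 5, "Gen-4.5": 25}
KLING_FREE_DAILY_CREDITS = 66


def kling_clip_credits(seconds: float) -> tuple[float, float]:
    """(low, high) credit cost of one Kling clip of the given length."""
    low, high = KLING_CR_PER_SEC
    return seconds * low, seconds * high


def kling_free_seconds_per_day() -> tuple[float, float]:
    """(worst, best) seconds of non-commercial video the free tier covers daily."""
    low, high = KLING_CR_PER_SEC
    return KLING_FREE_DAILY_CREDITS / high, KLING_FREE_DAILY_CREDITS / low


def side_by_side(seconds: float) -> dict[str, str]:
    """Credit cost of the same clip on each tool, as display strings."""
    low, high = kling_clip_credits(seconds)
    rows = {f"Runway {m}": f"{seconds * r:g} cr" for m, r in RUNWAY_CR_PER_SEC.items()}
    rows["Kling 3.0"] = f"{low:g}-{high:g} cr"
    return rows


if __name__ == "__main__":
    for tool, cost in side_by_side(4).items():
        print(f"{tool:20} {cost}")
    print("Kling free tier, seconds/day:", kling_free_seconds_per_day())
```

Log the credits each tool actually charged next to these estimates; the gap tells you how much rerolling your real workflow needs.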

MODULE 22: ElevenLabs
CATEGORY: Voice
DEPTH: CORE
ONE-LINE SUMMARY: The leading AI voice platform: ultra-realistic text-to-speech, instant voice cloning, dubbing, and a low-latency API for voice agents.
VS NEAREST PEER (built-in TTS / OpenAI voice): Best-in-class realism and the widest voice toolkit (cloning, multilingual dubbing, voice design, streaming API). Others are catching up on quality, but ElevenLabs has the deepest feature set and a dev-friendly API. This is your voice layer.
BEST FOR:

  • Voiceovers for product demos, faceless YouTube/TikTok, and course narration.
  • Voice cloning your own voice for scalable content (record once, generate forever).
  • Adding a realistic voice to an app/agent via the API (FastAPI + ElevenLabs).

WEAKNESS: Credits are characters and they burn fast (1 char = 1 credit on v2; 0.5 on Flash/Turbo); higher-volume narration needs the $99 Pro tier; cloning realism can dip on emotional range.
COST (⚠ verify): Free (10k credits/mo); Starter $5/mo (commercial rights + instant cloning); Creator $22/mo (100k chars); Pro $99/mo (500k); Scale $330/mo (2M); Business $990/mo. Credits roll over for up to 2 months.
HANDS-ON TASK: Clone your own voice (Starter+), then generate a 60-second narration of your freelance pitch. You now have a reusable voice asset for every demo and course video.
GOTCHA: Character metering means long-form narration eats plans fast. A 20-minute course module is on the order of 20k characters per take at a typical narration pace, and every retake is billed again, so a few full re-generations of one module can blow past Creator's 100k. Use Flash/Turbo models (0.5 credits/char) for drafts and budget the real plan against total minutes.
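Budgeting a plan against total minutes can be sketched as below. The pace of 1,000 characters per spoken minute is an assumption for illustration (measure your own scripts), and the plan list includes only the tiers whose credit allowance is quoted in the COST line; Starter and Business are left out because their allowances are not listed here.

```python
# ASSUMPTION: rough narration pace; replace with a count from your own scripts.
CHARS_PER_MINUTE = 1000
# Credits per character, as quoted above (v2 vs Flash/Turbo).
CREDITS_PER_CHAR = {"v2": 1.0, "flash_turbo": 0.5}
# Monthly allowances quoted in the COST line, smallest first.
PLAN_CREDITS = [
    ("Free", 10_000),
    ("Creator", 100_000),
    ("Pro", 500_000),
    ("Scale", 2_000_000),
]


def narration_credits(minutes: float, model: str = "v2", takes: int = 1) -> float:
    """Credits for `takes` full generations of a narration of this length."""
    return minutes * CHARS_PER_MINUTE * CREDITS_PER_CHAR[model] * takes


def smallest_plan(credits: float):
    """Name of the smallest listed plan that covers the spend, or None."""
    for name, allowance in PLAN_CREDITS:
        if credits <= allowance:
            return name
    return None


if __name__ == "__main__":
    one_take = narration_credits(20)
    five_takes = narration_credits(20, takes=5)
    course = narration_credits(20, takes=3) * 6  # six modules, three takes each
    for label, spend in [("1 take", one_take), ("5 takes", five_takes), ("course", course)]:
        print(f"{label:8} {spend:9,.0f} credits -> {smallest_plan(spend)}")
```

Free appears only for sizing; commercial rights are listed as starting at Starter. Drafting on Flash/Turbo halves the `takes` cost, which is usually where most of the spend goes.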

MODULE 23: Suno
CATEGORY: Voice/Music
DEPTH: SKIM
ONE-LINE SUMMARY: Generates full songs (vocals, lyrics, instruments) from a text prompt. "Describe a song, get a song."
VS NEAREST PEER (ElevenLabs / Udio): ElevenLabs does speech; Suno does music with vocals. Against Udio, its main rival, quality is comparable; Suno is the better-known option and gets you to a usable track faster. There's no real Claude Code analog; it's a different medium entirely.
BEST FOR:

  • Royalty-clear background music and jingles for your videos/ads (with paid commercial rights).
  • Intro/outro music and audio branding for content and courses.
  • Fun, fast custom songs as marketing hooks or client freebies.

WEAKNESS: Commercial rights require a paid plan; output quality varies and needs reroll luck; fine musical control is limited; credits cap monthly songs.
COST (⚠ verify): Free (50 credits/day, ~10 songs, non-commercial); Pro $10/mo ($8 annual, 2,500 cr ≈ 500 songs, commercial rights); Premier $30/mo ($24 annual, 10,000 cr, Suno Studio).
HANDS-ON TASK: Generate a 30-second upbeat intro track for your course/brand, download it, and drop it under a Runway/Kling clip. You now have an end-to-end AI video+music asset.
GOTCHA: Free-tier songs are NOT commercially usable. The moment a Suno track goes into client work or monetized content, you need Pro+ for the commercial license. Don't ship free-tier audio.
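Because output "needs reroll luck", the raw song count overstates what you can actually use. A rough sketch of that arithmetic follows; the 5 credits per song figure is only implied by this module's own numbers (50 credits ≈ 10 songs, 2,500 ≈ 500), and the number of generations per keeper is a made-up parameter you should replace with your own hit rate.

```python
# Allowances copied from the COST line above; verify before relying on them.
# Free is a daily allowance; Pro and Premier are monthly.
SUNO_CREDITS = {"Free": 50, "Pro": 2500, "Premier": 10000}
# Implied by the quoted figures: 50 credits ~ 10 songs, 2,500 ~ 500 songs.
CREDITS_PER_SONG = 5


def songs_per_period(plan: str) -> int:
    """Songs one allowance period (day for Free, month for paid) can generate."""
    return SUNO_CREDITS[plan] // CREDITS_PER_SONG


def keepers_per_period(plan: str, songs_per_keeper: int = 4) -> int:
    """Usable tracks if each keeper takes `songs_per_keeper` generations.

    ASSUMPTION: songs_per_keeper is a placeholder hit rate, not a Suno figure.
    """
    return songs_per_period(plan) // songs_per_keeper


if __name__ == "__main__":
    for plan in SUNO_CREDITS:
        print(f"{plan:8} {songs_per_period(plan):5} songs, "
              f"~{keepers_per_period(plan)} keepers")
```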

Video & Voice — at a glance

Tool       | Medium         | Killer trait            | Cost reflex        | Reach for it when
Runway     | Video          | Control + editing suite | $12–76/mo, credits | directed, editable video
Kling      | Video          | Realism, length, price  | ~$7–180/mo         | cheap realistic clips at volume
ElevenLabs | Voice (speech) | Realism + cloning + API | $5–99/mo           | voiceover, cloning, voice agents
Suno       | Music+vocals   | Full songs from text    | $10–30/mo          | music/jingles for content

Solo-builder content combo: Midjourney still → Kling/Runway motion → ElevenLabs voiceover → Suno music = a full faceless video without a camera, voice talent, or a composer.
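Since the combo chains several tools, one free-tier asset can make the whole video unshippable. Here is a hedged sketch of a pre-ship license check built only from what the modules above state: Kling and Suno free tiers are non-commercial, and ElevenLabs lists commercial rights as starting at Starter (so its free tier is treated as non-commercial by inference). The `Asset` type and tier labels are hypothetical; anything this handout does not cover is flagged for a manual check.

```python
from dataclasses import dataclass

# Free tiers this handout describes as non-commercial, and on what basis.
NON_COMMERCIAL_FREE = {
    "Kling": "stated in Module 21",
    "Suno": "stated in Module 23",
    "ElevenLabs": "inferred: commercial rights are listed from Starter up",
}


@dataclass(frozen=True)
class Asset:
    stage: str  # e.g. "motion", "voiceover"
    tool: str
    tier: str   # "free" or "paid" (hypothetical labels for this sketch)


def preflight(assets: list[Asset]) -> list[str]:
    """Return one warning per asset that should not ship commercially as-is."""
    problems = []
    for asset in assets:
        if asset.tier.lower() != "free":
            continue
        basis = NON_COMMERCIAL_FREE.get(asset.tool)
        if basis:
            problems.append(f"{asset.stage}: {asset.tool} free tier is non-commercial ({basis})")
        else:
            problems.append(f"{asset.stage}: {asset.tool} free-tier license not covered here; check it")
    return problems


if __name__ == "__main__":
    video = [
        Asset("still", "Midjourney", "paid"),
        Asset("motion", "Kling", "free"),
        Asset("voiceover", "ElevenLabs", "paid"),
        Asset("music", "Suno", "paid"),
    ]
    for line in preflight(video) or ["clear to ship (per this handout's notes)"]:
        print(line)
```

This only encodes the notes in this handout; the actual license terms of each tool remain the authority.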

Sources (verified June 2026)

Next: 05-automation.md.