Video Production Glossary - AI & Veo 3 Terms Explained

A

AI Video Generation

Technology

The process of using artificial intelligence to automatically create video content from text descriptions, scripts, or prompts. AI video generators like Veo 3 use machine learning models trained on millions of videos to understand and recreate realistic scenes, movements, and visual elements.

Example: Typing "a woman walking through a busy Tokyo street at night" into Veo 3 will generate a video showing exactly that scene.

Aspect Ratio

Technical

The proportional relationship between a video's width and height, expressed as two numbers separated by a colon (e.g., 16:9, 9:16, 1:1). Different platforms prefer different aspect ratios.

Common ratios: 16:9 (YouTube, traditional video), 9:16 (TikTok, Instagram Reels), 1:1 (Instagram feed)

B

B-Roll

Production

Supplementary footage used to enhance the main narrative or cover cuts in the primary footage (A-Roll). B-roll provides visual variety and context, often showing what's being discussed rather than just the speaker.

Example: While narrating about coffee production, B-roll might show coffee beans being harvested, roasted, and ground.

C

Character Consistency

Veo 3 Specific

Maintaining the same physical appearance, clothing, and characteristics of characters across multiple scenes in AI-generated videos. Essential for professional-looking content, as AI tends to generate slightly different character appearances in each independent scene.

How it works: Provide detailed character descriptions (age, hair, clothing, etc.) that Veo 3 applies to every scene featuring that character.

Cinematic

Style

A visual style that mimics professional film production, characterized by carefully composed shots, intentional camera movements, dramatic lighting, and a polished aesthetic. Often includes shallow depth of field and color grading.

Close-Up (CU)

Camera Work

A camera shot that tightly frames a subject, typically showing a person's face from shoulders up or focusing on a specific object detail. Close-ups emphasize emotion, detail, or importance.

D

Dialogue

Script Writing

Spoken words between characters in a video or film. In Veo 3, dialogue must be 95 characters or less per scene and is rendered as text overlays or subtitles synchronized with the video.

Veo 3 requirement: "Hello, how are you today?" (26 characters) ✓ Valid
"I wanted to tell you about this amazing thing that happened to me yesterday afternoon when I was walking home" (108 characters) ✗ Too long, needs splitting

Depth of Field

Camera Work

The range of distance in a shot that appears acceptably sharp. Shallow depth of field (blurred background) focuses attention on the subject, while deep depth of field keeps more of the scene in focus.

E

Establishing Shot

Camera Work

A wide shot that shows the entire setting or location, establishing where a scene takes place. Typically used at the beginning of a scene to orient the viewer.

Example: Before showing characters inside a coffee shop, an establishing shot might show the exterior of the building and surrounding street.

F

Frame Rate (FPS)

Technical

Frames Per Second - the number of individual images displayed per second in a video. Common rates are 24fps (cinematic), 30fps (standard video), and 60fps (smooth motion).

G

Generative AI

Technology

Artificial intelligence systems that can create new content (images, videos, text, audio) based on training data and user prompts. Veo 3 is a generative AI system specialized in video creation.

I

Inference

AI Technology

The process where an AI model generates output based on input. In Veo 3, inference is when the model creates video frames from your text prompt or script.

L

Lighting

Production

The illumination in a scene, which dramatically affects mood, visibility, and visual quality. Key types include natural (sunlight), artificial (studio lights), hard (sharp shadows), and soft (diffused).

M

Medium Shot

Camera Work

A camera framing that shows a person from approximately waist up. Balances showing the subject's expressions with their body language and some surrounding context.

Motion Blur

Technical

The visual streaking of moving objects in video, occurring naturally when objects move faster than the camera's shutter speed can freeze. Adds realism and smoothness to motion.

N

Narrative Flow

Script Writing

The smooth, logical progression of a story from one scene or moment to the next. Good narrative flow keeps viewers engaged and makes the story easy to follow.

P

Prompt Engineering

AI Technology

The art and science of crafting effective text prompts to guide AI systems toward desired outputs. Good prompts are specific, descriptive, and structured to match how the AI model processes information.

Weak prompt: "A person in a room"
Strong prompt: "A 30-year-old woman in professional attire sitting at a modern desk in a bright office, natural window lighting, medium shot, working on laptop"

Pacing

Editing

The rhythm and speed at which a story unfolds, controlled through scene length, cuts, and timing. Fast pacing creates excitement; slow pacing builds tension or allows emotional moments to resonate.

R

Resolution

Technical

The number of pixels in a video, typically expressed as width × height (e.g., 1920×1080, 3840×2160). Higher resolution means more detail but larger file sizes. Common standards: 1080p (Full HD), 4K (Ultra HD).

S

Scene

Structure

A single continuous segment of video action occurring in one location at one time. In Veo 3, each prompt generates one scene. Multiple scenes are combined to create a complete video.

Scene Atoms

Veo 3 Specific

Veo 3's advanced feature that breaks prompts into smaller, more manageable components (atoms) for more precise control over scene generation. Allows detailed specification of camera work, lighting, subject actions, and timing.

Storyboard

Pre-Production

A visual plan for video production, typically a series of sketches or images representing key frames or scenes. Shows what will appear on screen, camera angles, and sequence of shots.

T

Transition

Editing

The method of moving from one scene to another. Common types include cut (instant change), fade (gradual appearance/disappearance), dissolve (blend between scenes), and wipe (one scene replaces another with a moving edge).

Text-to-Video

AI Technology

AI technology that generates video content directly from written text descriptions. Veo 3 is a text-to-video system that converts prompts and scripts into visual video sequences.

V

Veo 3

Platform

Google's advanced AI video generation model that creates high-quality videos from text prompts. Veo 3 features improved realism, better physics simulation, character consistency capabilities, and support for various video styles and formats.

Visual Continuity

Production

Maintaining consistent visual elements (lighting, colors, character appearance, props, environment) across scenes. Essential for professional videos as inconsistencies break immersion and confuse viewers.

Voice-Over (VO)

Audio

Narration or dialogue recorded separately and added to video, where the speaker is not visible on screen. Commonly used for tutorials, documentaries, and explainer videos.