GENERATIVE VIDEO / SIMULATION / VFX

SYNTHETIC SURVEILLANCE

AMBIENT.AI

AI-generated CCTV footage, built to demo and stress-test behavioral security-detection models.

Synthetic Surveillance: every generated camera feed, one mid-frame each, tiled like a security operations video wall

Overview

A series of simulated CCTV “incident sequence” reels built to demonstrate Ambient.ai’s behavioral-detection platform. A single threat actor moves through an escalating breach across five surveillance stages (reconnaissance, perimeter breach, transit, entry, and interior climax), each captured as if by a different security camera on one coherent site, and each catching a behavioral precursor before the threat reaches an occupied area. The implicit promise: with the system deployed, the final stage never happens. Every frame is AI-generated, letting us stage scenarios that would be dangerous, costly, or impossible to film with real actors and cameras, produced safely, with no one ever at risk. The pipeline combined tightly art-directed image generation, manual image editing, AI video generation, compositing, masking, and effects work.

Corporate Campus: Escalation

The flagship “Planned Breach”: one threat actor across five cameras, vehicle-gate recon, a badge-reader tailgate, the secured wing, the executive suite entry, then the interior climax.

Corporate Campus

The wider camera network across the corporate campus.

Education: Escalation

“The Campus Intrusion”: street-side recon, a weak fence junction, and transit toward an occupied wing.

Education

The surrounding campus cameras.

Museum

Galleries, exhibit storage, and after-hours interiors for cultural institutions.

Data Center

Server floors and the interior infrastructure that surrounds them.

Manufacturing

Factory floors, raw-materials storage, and loading docks.

Energy & Utilities

Control rooms, access gates, and hardened perimeters for critical infrastructure.

Healthcare

Hospital-site cameras: entrance, pharmacy access corridors, parking structure, and restricted clinical areas.

Financial

Banking-hall, ATM, and lot environments.

How It Was Made

Every reel is fully synthetic: no cameras, no actors, no one ever at risk. Getting footage this directed out of today's models meant treating generative AI like a film set, a tightly art-directed, multi-stage pipeline with a human craftsman in the loop at every step.

Scene matrix spreadsheet: every scenario across industries, each with a category, location, and three prompt options

Ideation. Every scene starts in a spreadsheet, mapped by industry and location, each with multiple prompt variations to explore. Casting wide on paper is what gives the later selection real options to choose from.

Automation. A custom Python script reads the spreadsheet row by row and runs every prompt through Google's Gemini 3 Pro Image (Preview) model, batch-generating the base images for the entire matrix, turning hundreds of variations into an automated render queue instead of one-off manual generations.

Contact sheet: Corporate badge-controlled turnstile, Stage 2 options

Contact sheet: Education weak fence junction, Stage 2 options

Selection. The generated options are laid out as per-scene contact sheets, so framing, lighting, and the surveillance read can be compared side by side and the strongest base plate locked before any motion work begins.

Node-based generation pipeline: text prompts feeding image generation, Nano Banana retouching, and Kling video out, one branch per camera shot

Refinement. Chosen frames move into a node-based editing graph for cleanup and updates, retouching artifacts and holding visual consistency (wardrobe, environment, and camera character) across every shot in a sequence before it's taken to video.

Image-to-video iteration board — a branching tree of generated takes (V1, Fix Arms, Walking Direction, Box Fix) leading to a green-lit final version

Animation. The locked frame is taken into image-to-video, then iterated relentlessly — each branch a re-roll fixing one specific problem (arm motion, walk direction, a drifting box) until a take reads clean and gets the green light. The model gets directed like a shoot: many takes, ruthless selection.

Composite, Mask, Rotoscope

Generations rarely came out clean. When a take nailed one element but broke another, we cut the usable parts from multiple generations, masked out artifacts and inconsistencies, and rotoscoped key elements to keep the subject reading clearly at the top of frame. A frame-by-frame finishing pass layered on top of the AI.

More Projects

More Projects

MT. Iturbide

O.T.T.O.

AI PHOTOBOOTH

Creative Frontier Columbia × DeepMind