A young boy with brown hair wearing an orange jacket kneels in a shallow creek, examining an old, torn map laid on the water. Behind him, large, twisted tree roots extend into the water, forming a dark, natural archway.

A Theory Stack

The center is the delta.

A measurement framework for compositional reasoning in AI-generated imagery.

This documentation presents the Visual Thinking Lens (VTL) framework, a systematic approach to measuring geometric bias, compositional priors, and spatial reasoning in text-to-image models. At its core: visual intelligence isn't measured by aesthetic polish or semantic correctness, but by how an image holds structural tension, reveals geometry, and survives compositional interrogation.

These papers were developed through systematic testing across multiple platforms (Sora, MidJourney, GPT, SDXL, Gemini, Stable Diffusion, Firefly, OpenArt, Canva, Leonardo), comparative analysis in 5,000+ generated images, and reproducible measurement protocols. They document both findings and methodology, what works, what doesn't, and how to replicate the results.

What you'll find:

  • Empirical studies revealing compositional monoculture across AI platforms

  • Measurement infrastructure: geometric primitives, kernel coordinates, detection protocols

  • Practical applications: fingerprinting, steering, pre-failure detection, cross-platform comparison

  • Working notebooks and implementation code (GitHub)

The framework doesn't assume images "know" what they're doing. It measures what they are doing geometrically, provides coordinates for navigating alternatives, and offers scaffolding for rebuilding beyond defaults.

New to VTL? Begin here:

  1. Mass, Not Subject - Foundational concept (15 min)

  2. The Kernel Primitives - Core measurements (10 min)

  3. Monoculture in MidJourney - Empirical evidence (20 min)

Want practical application?

  1. Deformation Operator Playbook - Hands-on techniques

  2. The Off-Center Prior: Mapping Spatial Bias - Basin navigation

  3. Foreshortening Recipe Book - Constraint architecture

Researcher or engineer?

  1. Measuring Spatial Priors in Generative Image Models Using Geometry-Based Field Metrics - Complete technical spec

  2. Measuring Spatial Operating Envelopes Text-to-Image Generation Models - Phase 2: Measure stress and structural failure (Phase 1 for background)

  3. VCLI-G Documentation - Measurement methodology

  4. GitHub link - Reproducible implementations

Red Team?

  1. Behavioral Drift Detection for Generative Models - A geometric measurement system

  2. Canonical Mathematical Specification - A deterministic measurement device

  3. Breaking Compositional Monoculture: Surface Texture as a Centering Prior in Sora - Phase 1B Measuring escape in Sora

  4. Compositional Basins A Geometry-First Framework Measuring Structural Behavior - Phase 2B Measuring steering and controllability

Full documentation below. Papers are organized by focus: measurement infrastructure, empirical studies, practical applications, theoretical foundations. See GitHub for working code and notebooks.

CORE RESEARCH: Measurement & Evidence

Radial Collapse & Compositional Monoculture

Measurement Infrastructure

Diagnostic Methods

APPLICATIONS: Methods & Protocols

Steering & Navigation

Practical Playbooks

Advanced Methods

FOR RESEARCHERS & ENGINEERS

CASE STUDIES

ARCHIVE: Historical & WIP Work - Early VTL Development

ARCHIVE: Artistic Extensions Legacy Work Early VTL Development

[Note: work below this point documents the development of VTL before the geometric kernel framework. Concepts remain valid but measurement methods have been superseded by VCLI-G and full kernel primitives.]

ARCHIVE: PROOFS & WALKTHROUGHS: Legacy Work Early VTL Development


Author’s Note: This work measures compositional bias in AI-generated imagery through computational methods and provides operational frameworks for navigating beyond default priors.

The measurements (RCA-2, VCLI-G, LSI-lite, kernel primitives) produce reproducible numerical results. The methods (dialectical prompting, constraint architecture, basin navigation) are tested protocols for steering generative models. See Github for code. Everything can be run via python or recursively.

The mechanistic explanations, why compositional collapse occurs, how models resist certain geometric coordinates, are evidence-based inferences from measured outputs, not reverse-engineering of model internals.

The notebooks contain deterministic measurements. The papers contain testable hypotheses. The frameworks translate both into practical application.

Engage critically. Replicate the measurements. Test whether the methods work in your context.