UIAP — UI Agent Protocol

The missing protocol between web applications and AI agents. UIAP lets apps tell agents what's visible, what's possible, what's risky, and how to verify success — structured, safe, and transport-agnostic.

Getting Started Read the Spec Protocol Overview

The Problem

Today’s AI agents interact with web UIs in one of three ways — all of them fragile:

Screenshots + Vision models — slow, expensive, unreliable. The agent guesses what a button does.
Raw DOM / HTML scraping — brittle selectors, no business semantics. Breaks on every deploy.
ARIA / Accessibility tree — knows what an element is, but not what it does in business terms, how risky it is, or how to verify success.

There is no standard that answers: “What can I do here? Is it safe? How do I know it worked?”

UIAP closes this gap.

What UIAP Provides

Semantic, not Pixel-based

Agents receive a structured PageGraph with roles, states, affordances, and stable identifiers — no screenshot parsing, no fragile CSS selectors.

Action Safety Built-in

Every action declares a risk level. The SDK enforces policy evaluation, confirmation flows, and human handoff before anything irreversible happens.

Success Verification

Actions define expected outcomes: route changes, toasts, dialog closures. Agents verify success instead of hoping it worked.

Transport Agnostic

Works over WebSocket, HTTP/SSE, postMessage, or any custom binding. The protocol defines the contract, not the wire.

How It Works

App integrates the UIAP SDK

A few lines of code or data-uiap-* attributes instrument your UI with semantic meaning.
Session starts, capabilities are exchanged

The agent learns what the app can do: available actions, risk levels, execution modes, success signals.
Agent receives a PageGraph

A structured snapshot of the current UI state — routes, scopes, elements, states, affordances — not raw HTML.
Agent plans an action, SDK checks policy

Before execution, the policy layer decides: allow, confirm, deny, or hand off to the user.
Action executes, app sends feedback

The SDK runs the action (preferring app-native execution over DOM manipulation) and reports results + state deltas.
Agent verifies success via signals

Route changed? Toast appeared? Form submitted? The agent confirms the outcome before moving to the next step.

The Protocol at a Glance

UIAP is a suite of 13 specifications:

#	Spec	What it defines
1	Core	Message envelope, session lifecycle, errors, versioning
2	Capability Model	UI roles, states, affordances, actions, risk, signals
3	Web Profile	DOM, ARIA, PageGraph, iframes, Shadow DOM, routes
4	Action Runtime	Execution, verification, confirmation, result reporting
5	Policy Extension	Permissions, risk classes, sensitivity, audit
6	SDK API	Client-side JavaScript integration API
7	Workflow Extension	Multi-step flows, skills, recovery
8	Discovery Mapper	Automatic UI element discovery and classification
9	Authoring/Manifest	Configuration formats, validation, build/release
10	Conformance Suite	Test modules, harness model, evaluation rules
11	HTTP/REST Binding	HTTP+SSE transport binding
12	Agent Cognition	Entity schema, scope context, navigation map, scope queries
13	Agent Integration Guide	LLM agent integration patterns, context building, tool mapping

Not Another Framework

UIAP doesn’t replace existing standards — it fills the gap between them:

Standard	What it does	What’s missing for agents
ARIA	Describes roles, states, properties	No business actions, no risk levels, no success signals
MCP	Connects tools and knowledge to models	Doesn’t model live UI state or browser interaction
Playwright / WebDriver	Automates browser interaction	No semantic understanding, no policy, no verification
AG-UI	Agent-to-app event protocol	No canonical UI model, no action catalog, no safety layer

UIAP is adapter-capable: expose capabilities as MCP tools, mirror events to AG-UI, or delegate execution to WebDriver — the protocol is the contract, not the runtime.

Start Here

Quickstart Integrate UIAP into your app in 5 minutes.

Why UIAP? The rationale behind the protocol.

Core Specification Start with the protocol foundation.

Agent Integration Guide How an LLM agent consumes UIAP context.

Live Demo See UIAP in action — an instrumented app with agent control.