Operator LLM Playbook (Definitive)

This is the canonical reference for LLM-driven automation in Clawperator.

Use this doc for: - running the app through ACTION_AGENT_COMMAND - authoring/maintaining skill packages - integrating skill scripts with OpenClaw - understanding runtime components and naming

1) Runtime components (production)

These are runtime components (not debug-only):

com.clawperator.operator.runtime.OperatorCommandService
com.clawperator.operator.runtime.OperatorCommandReceiver

They own broadcast ingress for: - Action Namespace: com.clawperator.operator.ACTION_AGENT_COMMAND (stable) - Package Target: Varies by build (e.g., com.clawperator.operator or com.clawperator.operator.dev)

2) Command ingress contract

Reliability rule (required)

For app automation commands, default to: 1. close_app 2. open_app 3. wait for stabilization 4. add small post-navigation settle delays (~500–1500ms) before critical reads/clicks

Required fields

commandId: string
taskId: string
source: string
expectedFormat: "android-ui-automator" (Required for v1 compatibility)
actions: []

Determinism Doctrine

Validation First: No side effects if the payload is malformed.
Exactly One Envelope: Every command must emit a [Clawperator-Result].
No Retries: The runtime never retries a failed step; it reports the failure immediately to the Brain.
Stable IDs: Correlate commandId and taskId end-to-end.

Supported action types (current)

Action type	Key params	Notes
`open_app`	`applicationId: string`	Launches app by package ID
`close_app`	`applicationId: string`	Node runs `adb shell am force-stop` pre-flight; Android step always returns `success: false` (expected)
`enter_text`	`matcher: NodeMatcher`, `text: string`, `submit?: boolean`, `clear?: boolean`	CLI: `action type`. `submit: true` presses Enter after typing. `clear` is accepted by Node but currently ignored by Android
`click`	`matcher: NodeMatcher`, `clickType?: "default"\\|"long_click"\\|"focus"`	CLI: `action click`
`read_text`	`matcher: NodeMatcher`, `validator?: "temperature"`, `retry?: object`	CLI: `action read`. Result in `data.text`. Other validator values are rejected by the runtime
`wait_for_node`	`matcher: NodeMatcher`, `retry?: object`	CLI: `action wait`. Waits with internal retry
`snapshot_ui`	`retry?: object`	CLI: `observe snapshot`. Snapshot content in `data.text` as `hierarchy_xml`
`take_screenshot`	`path?: string`, `retry?: object`	Node captures screenshot via ADB and returns local file path
`scroll_and_click`	`target: NodeMatcher`, `container?: NodeMatcher`, `direction?`, `maxSwipes?`, `distanceRatio?`, `settleDelayMs?`, `findFirstScrollableChild?`, `clickAfter?: boolean`, `scrollRetry?: object`, `clickRetry?: object`	Scrolls until target is visible, then clicks by default. Set `clickAfter: false` to reveal the target without tapping it. `scrollRetry` defaults to UiScroll; `clickRetry` defaults to UiReadiness
`scroll`	`container?: NodeMatcher`, `direction?`, `distanceRatio?`, `settleDelayMs?`, `findFirstScrollableChild?`, `retry?: object`	Performs exactly one scroll gesture and reports `scroll_outcome` as `moved`, `edge_reached`, or `gesture_failed`
`scroll_until`	`container?: NodeMatcher`, `direction?`, `distanceRatio?`, `settleDelayMs?`, `maxScrolls?`, `maxDurationMs?`, `noPositionChangeThreshold?`, `findFirstScrollableChild?`	Bounded scroll loop that returns `termination_reason`. Use for feed pagination with explicit caps
`sleep`	`durationMs: number`	Pause between steps. Must fit within the execution `timeoutMs` budget

enter_text vs CLI action type: The CLI command is action type but the action type field in execution payloads is enter_text. These map to the same runtime action. When building execution payloads directly, always use enter_text.

NodeMatcher fields: resourceId, contentDescEquals, textEquals, textContains, contentDescContains, role. All fields are AND-combined. Prefer resourceId when available. Full reference in docs/node-api-for-agents.md.

Scroll targeting rule: If a screen contains nested or multiple scrollable containers, do not rely on auto-detect. Capture snapshot_ui, identify the intended list's resource-id, and pass it as params.container.

Visual verification with ADB screenshots (recommended)

Use screenshots alongside UI-tree logs when building/debugging skills.

adb exec-out screencap -p > ./tmp/ui-check.png

3) Skills-first packaging (PII-safe)

Canonical unit is a skill package, not a standalone recipe file.

Required structure

Skills are maintained in a dedicated sibling repository: ../clawperator-skills. Each skill follows this structure: - skills/<applicationId>.<intent>/SKILL.md - skills/<applicationId>.<intent>/scripts/*.sh

Nature of Skills

Due to the dynamic nature of mobile apps (A/B tests, server-side flags, unexpected popups), skills are treated as highly informed context for the Agent rather than purely deterministic scripts. - Agent Responsibility: The Agent uses skill templates as a baseline, modifying them at runtime to handle personal configurations (variable substitution) or UI drift.

Rules

No PII in committed skill artifacts.
Use variables/placeholders for user-specific labels (for example {{AC_TILE_NAME}}).
Prefer stable selectors first (resourceId), text matching second.
Keep fallback matching strategy documented in SKILL.md.
Keep skill-specific scripts/artifacts inside the skill folder.

4) Current skill set

com.google.android.apps.chromecast.app.get-aircon-status
com.google.android.apps.chromecast.app.set-aircon
com.globird.energy.get-usage
com.solaxcloud.starter.get-battery
com.theswitchbot.switchbot.get-bedroom-temperature

5) New skill authoring checklist

Start from a fresh app session (close_app then open_app).
Capture snapshot_ui and an ADB screenshot.
Identify robust selectors (resource-id first).
Create skills/<appId>.<intent>/SKILL.md.
Add scripts/*.sh deterministic wrapper(s).
Add optional artifacts/*.recipe.json template(s) if helpful.
Validate on device end-to-end.
Update this playbook if conventions changed.

6) Where to update docs

Skill model/design: docs/design/skill-design.md
Canonical LLM/operator usage: docs/design/operator-llm-playbook.md (this file)
App-specific skill packages: skills/<applicationId>.<intent>/...