Version: Latest

API Reference

Complete reference documentation for all MCP Server API endpoints.

Base URL

http://localhost:8000/api/v1

Authentication

Currently, the API does not require authentication. Future versions will support API key authentication.

Content Types

Request: application/json
Response: application/json
Encoding: UTF-8

Common Response Headers

Content-Type: application/json
X-Request-ID: <unique-request-id>
X-Response-Time: <milliseconds>

Endpoints

Health Check

`GET /api/v1/health`

Check server health and Brobot CLI connectivity.

Response

{
  "status": "ok",
  "version": "0.1.0",
  "brobot_connected": true,
  "timestamp": "2024-01-20T10:30:00Z"
}

Status Codes

200 OK: Server is healthy
503 Service Unavailable: Server is unhealthy

Example

curl http://localhost:8000/api/v1/health

Get State Structure

`GET /api/v1/state_structure`

Retrieve the complete state structure of the target application.

Response

{
  "states": [
    {
      "name": "main_menu",
      "description": "Application main menu",
      "images": [
        "main_menu_logo.png",
        "menu_button.png"
      ],
      "transitions": [
        {
          "from_state": "main_menu",
          "to_state": "login_screen",
          "action": "click_login",
          "probability": 0.95
        }
      ],
      "is_initial": true,
      "is_final": false
    }
  ],
  "current_state": "main_menu",
  "metadata": {
    "application": "MyApp",
    "version": "1.0.0",
    "last_updated": "2024-01-20T10:30:00Z"
  }
}

Response Schema

Field	Type	Description
`states`	array	List of all application states
`states[].name`	string	Unique state identifier
`states[].description`	string	Human-readable description
`states[].images`	array	Associated image patterns
`states[].transitions`	array	Possible state transitions
`states[].is_initial`	boolean	Whether this is a starting state
`states[].is_final`	boolean	Whether this is an ending state
`current_state`	string	Currently active state name
`metadata`	object	Additional information

Status Codes

200 OK: Success
500 Internal Server Error: CLI error

Example

import requests

response = requests.get("http://localhost:8000/api/v1/state_structure")
states = response.json()["states"]
print(f"Found {len(states)} states")

Get Observation

`GET /api/v1/observation`

Get current observation including screenshot and active states.

Query Parameters

Parameter	Type	Default	Description
`include_screenshot`	boolean	true	Include base64 screenshot
`format`	string	"png"	Screenshot format (png/jpg)

Response

{
  "timestamp": "2024-01-20T10:30:00Z",
  "active_states": [
    {
      "name": "dashboard",
      "confidence": 0.95,
      "matched_patterns": [
        "dashboard_header.png",
        "user_menu.png"
      ]
    },
    {
      "name": "notifications",
      "confidence": 0.72,
      "matched_patterns": [
        "notification_bell.png"
      ]
    }
  ],
  "screenshot": "iVBORw0KGgoAAAANS...",
  "screen_width": 1920,
  "screen_height": 1080,
  "metadata": {
    "capture_duration": 0.125,
    "analysis_duration": 0.087,
    "total_patterns_checked": 42,
    "patterns_matched": 3
  }
}

Response Schema

Field	Type	Description
`timestamp`	string	ISO 8601 timestamp
`active_states`	array	Currently active states
`active_states[].name`	string	State name
`active_states[].confidence`	float	Confidence score (0.0-1.0)
`active_states[].matched_patterns`	array	Patterns that matched
`screenshot`	string	Base64 encoded image
`screen_width`	integer	Screen width in pixels
`screen_height`	integer	Screen height in pixels
`metadata`	object	Performance metrics

Status Codes

200 OK: Success
500 Internal Server Error: Screenshot capture failed

Example

# Get observation without screenshot
response = requests.get(
    "http://localhost:8000/api/v1/observation",
    params={"include_screenshot": False}
)

# Save screenshot to file
import base64
obs = response.json()
if obs.get("screenshot"):
    img_data = base64.b64decode(obs["screenshot"])
    with open("screen.png", "wb") as f:
        f.write(img_data)

Execute Action

`POST /api/v1/execute`

Execute an automation action on the target application.

Request Body

{
  "action_type": "click",
  "parameters": {
    "image_pattern": "submit_button.png",
    "confidence": 0.9,
    "timeout": 5.0
  },
  "target_state": "form_submitted",
  "timeout": 10.0
}

Request Schema

Field	Type	Required	Description
`action_type`	string	Yes	Type of action to execute
`parameters`	object	Yes	Action-specific parameters
`target_state`	string	No	Expected state after action
`timeout`	float	No	Action timeout in seconds (default: 10.0)

Response

{
  "success": true,
  "action_type": "click",
  "duration": 0.523,
  "result_state": "form_submitted",
  "error": null,
  "metadata": {
    "click_location": {
      "x": 640,
      "y": 480
    },
    "pattern_found": true,
    "confidence": 0.92,
    "search_time": 0.234
  }
}

Response Schema

Field	Type	Description
`success`	boolean	Whether action succeeded
`action_type`	string	Type of action executed
`duration`	float	Execution time in seconds
`result_state`	string	State after execution
`error`	string	Error message if failed
`metadata`	object	Action-specific details

Status Codes

200 OK: Action completed (check success field)
400 Bad Request: Invalid parameters
422 Unprocessable Entity: Validation error
500 Internal Server Error: Execution error

Action Types

Click Action

Click on an image pattern or specific location.

Parameters

{
  "action_type": "click",
  "parameters": {
    "image_pattern": "button.png",  // OR use location
    "location": {"x": 500, "y": 300},
    "confidence": 0.9,
    "click_type": "single",  // single|double|right
    "offset": {"x": 0, "y": 0}
  }
}

Parameter	Type	Required	Description
`image_pattern`	string	No*	Image file to find
`location`	object	No*	Click coordinates
`confidence`	float	No	Min similarity (0.0-1.0)
`click_type`	string	No	Click type (default: single)
`offset`	object	No	Offset from pattern center

*Either image_pattern or location is required

Type Action

Type text at the current cursor location.

Parameters

{
  "action_type": "type",
  "parameters": {
    "text": "Hello, World!",
    "typing_speed": 300,  // chars per minute
    "clear_first": false,
    "press_enter": true
  }
}

Parameter	Type	Required	Description
`text`	string	Yes	Text to type
`typing_speed`	integer	No	Typing speed (default: 300)
`clear_first`	boolean	No	Clear field first
`press_enter`	boolean	No	Press Enter after typing

Drag Action

Drag from one location to another.

Parameters

{
  "action_type": "drag",
  "parameters": {
    "start_pattern": "drag_handle.png",  // OR start_x, start_y
    "start_x": 100,
    "start_y": 100,
    "end_pattern": "drop_zone.png",  // OR end_x, end_y
    "end_x": 500,
    "end_y": 500,
    "duration": 1.0,
    "button": "left"  // left|right|middle
  }
}

Parameter	Type	Required	Description
`start_pattern`	string	No*	Start position pattern
`start_x`, `start_y`	integer	No*	Start coordinates
`end_pattern`	string	No*	End position pattern
`end_x`, `end_y`	integer	No*	End coordinates
`duration`	float	No	Drag duration (default: 1.0)
`button`	string	No	Mouse button (default: left)

*Either pattern or coordinates required for start/end

Wait Action

Wait for a specific state or condition.

Parameters

{
  "action_type": "wait",
  "parameters": {
    "state_name": "dashboard",
    "timeout": 30.0,
    "check_interval": 1.0,
    "stability_time": 2.0
  }
}

Parameter	Type	Required	Description
`state_name`	string	Yes	State to wait for
`timeout`	float	No	Max wait time (default: 30.0)
`check_interval`	float	No	Check frequency (default: 1.0)
`stability_time`	float	No	Required stable time

Error Handling

Error Response Format

{
  "detail": "Detailed error message",
  "type": "error_type",
  "loc": ["field", "name"],
  "ctx": {
    "additional": "context"
  }
}

Common Error Codes

Status Code	Error Type	Description
`400`	`bad_request`	Invalid request format
`422`	`validation_error`	Parameter validation failed
`404`	`not_found`	Resource not found
`408`	`timeout`	Request timeout
`500`	`internal_error`	Server error
`503`	`service_unavailable`	Brobot CLI unavailable

Error Examples

Validation Error (422)

{
  "detail": [
    {
      "loc": ["body", "action_type"],
      "msg": "field required",
      "type": "value_error.missing"
    }
  ]
}

CLI Error (500)

{
  "detail": "Brobot CLI error: Pattern not found",
  "type": "cli_error",
  "ctx": {
    "pattern": "nonexistent.png",
    "search_time": 5.0
  }
}

Rate Limiting

Future versions will implement rate limiting:

X-RateLimit-Limit: 100
X-RateLimit-Remaining: 99
X-RateLimit-Reset: 1642521600

Pagination

Future versions will support pagination for large responses:

{
  "data": [...],
  "pagination": {
    "page": 1,
    "per_page": 20,
    "total": 100,
    "pages": 5
  }
}

Webhooks

Future versions will support webhooks for state changes:

{
  "event": "state_changed",
  "timestamp": "2024-01-20T10:30:00Z",
  "data": {
    "from_state": "login",
    "to_state": "dashboard",
    "trigger": "user_action"
  }
}

Client Libraries

Python

from brobot_client import BrobotClient

client = BrobotClient("http://localhost:8000")
result = client.click("button.png")

JavaScript (Coming Soon)

const client = new BrobotClient("http://localhost:8000");
await client.click("button.png");

cURL Examples

# Get observation
curl http://localhost:8000/api/v1/observation

# Execute click
curl -X POST http://localhost:8000/api/v1/execute \
  -H "Content-Type: application/json" \
  -d '{"action_type":"click","parameters":{"image_pattern":"button.png"}}'

# Pretty print with jq
curl http://localhost:8000/api/v1/state_structure | jq .

API Versioning

The API uses URL versioning. Current version: v1

Future versions will maintain backwards compatibility or provide migration guides.

OpenAPI Specification

Access the OpenAPI (Swagger) specification:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc
OpenAPI JSON: http://localhost:8000/openapi.json

Support

GitHub Issues: Report API issues
Discord: Get help on Discord

Base URL​

Authentication​

Content Types​

Common Response Headers​

Endpoints​

Health Check​

GET /api/v1/health​

Get State Structure​

GET /api/v1/state_structure​

Get Observation​

GET /api/v1/observation​

Execute Action​

POST /api/v1/execute​

Action Types​

Click Action​

Type Action​

Drag Action​

Wait Action​

Error Handling​

Error Response Format​

Common Error Codes​

Error Examples​

Rate Limiting​

Pagination​

Webhooks​

Client Libraries​

Python​

JavaScript (Coming Soon)​

cURL Examples​

API Versioning​

OpenAPI Specification​

Support​

Base URL

Authentication

Content Types

Common Response Headers

Endpoints

Health Check

`GET /api/v1/health`

Get State Structure

`GET /api/v1/state_structure`

Get Observation

`GET /api/v1/observation`

Execute Action

`POST /api/v1/execute`

Action Types

Click Action

Type Action

Drag Action

Wait Action

Error Handling

Error Response Format

Common Error Codes

Error Examples

Rate Limiting

Pagination

Webhooks

Client Libraries

Python

JavaScript (Coming Soon)

cURL Examples

API Versioning

OpenAPI Specification

Support