codex-app-server-sdk

High-level async Python client for codex app-server.

It gives you a convenient conversation API over stdio or websocket without having to manage raw protocol events yourself.

Documentation: https://emsi.github.io/codex-app-server-sdk/

Highlights

simple one-shot turns with chat_once(...)
step-streaming turns with chat(...): yields completed steps (thinking, exec, codex, etc.), not token deltas
built-in thread/turn lifecycle handling
thread-scoped config + forking via ThreadHandle
inactivity timeout continuation for long-running turns
turn cancellation with unread-step/event drain via cancel(...)
optional low-level request(...) access when needed

Install

Install uv (if needed):

curl -LsSf https://astral.sh/uv/install.sh | sh

Install the package from PyPI:

uv add codex-app-server-sdk

Or use uv's pip-compatible interface to install into the active environment:

uv pip install codex-app-server-sdk

Documentation

Docs site: https://emsi.github.io/codex-app-server-sdk/
PyPI: https://pypi.org/project/codex-app-server-sdk/

Quick start

Stdio

import asyncio
from codex_app_server_sdk import CodexClient


async def main() -> None:
    async with CodexClient.connect_stdio() as client:
        result = await client.chat_once("Hello from Python")
        print(result.final_text)


asyncio.run(main())

By default, stdio transport runs:

command: codex app-server

You can override this via:

connect_stdio(command=[...])
environment variable: CODEX_APP_SERVER_CMD

Websocket

import asyncio
from codex_app_server_sdk import CodexClient


async def main() -> None:
    async with CodexClient.connect_websocket() as client:
        result = await client.chat_once("Hello over websocket")
        print(result.final_text)


asyncio.run(main())

Websocket defaults:

URL: CODEX_APP_SERVER_WS_URL or ws://127.0.0.1:8765
Bearer token: CODEX_APP_SERVER_TOKEN (optional)
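
The defaults above amount to an environment lookup with a fallback. Below is a minimal sketch of that resolution using only the standard library. resolve_websocket_settings is a hypothetical helper (not part of the SDK). Two things are assumptions here: that explicit arguments take precedence over the environment, and that the token is sent as an Authorization: Bearer header (inferred from the "Bearer token" wording).

```python
import os
from typing import Mapping, Optional

DEFAULT_WS_URL = "ws://127.0.0.1:8765"


def resolve_websocket_settings(
    url: Optional[str] = None,
    token: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> tuple[str, dict[str, str]]:
    """Hypothetical helper: return (url, headers) for a websocket connection."""
    env = os.environ if env is None else env
    # Assumption: explicit arguments win, then the environment, then the default URL.
    resolved_url = url or env.get("CODEX_APP_SERVER_WS_URL") or DEFAULT_WS_URL
    resolved_token = token or env.get("CODEX_APP_SERVER_TOKEN")
    # Assumption: the optional token travels as a standard Bearer authorization header.
    headers = {"Authorization": f"Bearer {resolved_token}"} if resolved_token else {}
    return resolved_url, headers
```

With an empty environment this yields the default URL and no headers; setting both variables yields the configured URL plus one Authorization header.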

Continuation on inactivity timeout

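Both high-level APIs (chat_once(...) and chat(...)) can resume the same running turn after an inactivity timeout: catch CodexTurnInactiveError and pass its continuation back in.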

import asyncio
from codex_app_server_sdk import CodexClient, CodexTurnInactiveError


async def main() -> None:
    async with CodexClient.connect_stdio(inactivity_timeout=120.0) as client:
        continuation = None
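        while True:
            try:
                if continuation is None:
                    # First attempt: start a new turn.
                    result = await client.chat_once("Do a longer task")
                else:
                    # Later attempts: resume the same turn; pass no other turn-start arguments.
                    result = await client.chat_once(continuation=continuation)
                print(result.final_text)
                break
            except CodexTurnInactiveError as exc:
                # Keep the continuation token so the next loop iteration resumes this turn.
                continuation = exc.continuation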
                idle = (
                    f"{exc.idle_seconds:.1f}s"
                    if exc.idle_seconds is not None
                    else "unknown"
                )
                print(
                    f"[warn] turn inactive for {idle}; resuming "
                    f"(thread_id={continuation.thread_id}, turn_id={continuation.turn_id})"
                )


asyncio.run(main())

Advanced thread control (cwd/instructions/model/fork)

Use explicit thread handles when you need thread-scoped configuration.

import asyncio
from codex_app_server_sdk import CodexClient, ThreadConfig, TurnOverrides


async def main() -> None:
    async with CodexClient.connect_stdio() as client:
        thread = await client.start_thread(
            ThreadConfig(
                cwd="/home/me/project",
                base_instructions="You are concise.",
                developer_instructions="Prefer rg over grep.",
                model="gpt-5",
            )
        )

        result = await thread.chat_once("Summarize the repo layout.")
        print(result.final_text)

        await thread.update_defaults(ThreadConfig(model="gpt-5.1-codex-mini"))
        forked = await thread.fork(
            overrides=ThreadConfig(
                developer_instructions="Focus on tests first.",
            )
        )

        async for step in forked.chat(
            "Run a quick diagnostics pass.",
            turn_overrides=TurnOverrides(effort="low"),
        ):
            print(step.step_type, step.text)


asyncio.run(main())

Configuration scopes and semantics

CodexClient: connection/session scope (transport, request routing, lifecycle).
ThreadHandle + ThreadConfig: thread scope (cwd, baseInstructions, developerInstructions, model, etc.).
TurnOverrides: per-turn scope (cwd, model, effort, summary, ...).

`UNSET` vs `None`

UNSET (default): omit field from request payload; keep server default/current value.
None: send JSON null explicitly (where protocol allows) to reset/clear.

Example:

from codex_app_server_sdk import ThreadConfig, UNSET

cfg = ThreadConfig(
    model=UNSET,  # omit key
    developer_instructions=None,  # send explicit null
)
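
To make the distinction concrete, here is a minimal sketch of how a sentinel can separate "omit the key" from "send null" when building a request payload. The _UnsetType class, the local UNSET object, and build_payload are stand-ins written for this illustration. The SDK's own UNSET and serialization code may be implemented differently.

```python
import json
from typing import Any


class _UnsetType:
    """Stand-in sentinel type; the SDK's real UNSET may be implemented differently."""

    def __repr__(self) -> str:
        return "UNSET"


UNSET = _UnsetType()  # local stand-in, not imported from the SDK


def build_payload(**fields: Any) -> dict[str, Any]:
    """Drop fields left at UNSET; keep explicit None so it serializes as JSON null."""
    return {key: value for key, value in fields.items() if value is not UNSET}


payload = build_payload(model=UNSET, developerInstructions=None, cwd="/tmp/project")
print(json.dumps(payload))  # {"developerInstructions": null, "cwd": "/tmp/project"}
```

The identity check (is not UNSET) is what keeps None meaningful: a plain truthiness test would discard both values.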

Continuation constraints

When resuming with continuation=..., do not pass extra turn-start arguments in that same call. Specifically, do not pass: text, thread_id, user, metadata, thread_config, or turn_overrides.

Apply thread changes via thread.update_defaults(...) or start a new/forked thread before continuing with a new turn.
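
The constraint can be pictured as a small argument guard. This is a sketch only: check_continuation_args is a hypothetical function, and the exception type the SDK actually raises (if any) is not stated here, so ValueError is an assumption.

```python
from typing import Any, Optional

# Argument names listed in the constraint above.
_TURN_START_ARGS = ("text", "thread_id", "user", "metadata", "thread_config", "turn_overrides")


def check_continuation_args(continuation: Optional[Any], **kwargs: Any) -> None:
    """Hypothetical guard: reject turn-start arguments when resuming a turn."""
    if continuation is None:
        return  # starting a fresh turn: any turn-start argument is allowed
    extras = [name for name in _TURN_START_ARGS if kwargs.get(name) is not None]
    if extras:
        # Assumption: ValueError is used here purely for illustration.
        raise ValueError(f"continuation cannot be combined with: {', '.join(extras)}")
```

Calling check_continuation_args(token, text="hi") raises, while check_continuation_args(token) and check_continuation_args(None, text="hi") both pass.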

Example clients

More complete examples are under examples/.

All thread_* examples print lifecycle progress checkpoints by default so long operations are visible. Use --quiet on those scripts for minimal output.

Rich step-stream example (thinking/exec/codex blocks)

This is the recommended example for the step-oriented API and continuation behavior.

Stdio:

uv run python examples/chat_steps_rich.py

Websocket:

uv run python examples/chat_steps_rich.py --transport websocket --url ws://127.0.0.1:8765

With extra payload summaries:

uv run python examples/chat_steps_rich.py --show-data

Cancel timed-out turns instead of auto-resume:

uv run python examples/chat_steps_rich.py --cancel-on-timeout

Common options:

--transport {stdio,websocket}
--cmd "codex app-server" (stdio mode)
--url ws://127.0.0.1:8765 (websocket mode)
--token "$CODEX_APP_SERVER_TOKEN" (websocket mode)
--prompt "..."
--user "..."
--inactivity-timeout 120
--show-data
--cancel-on-timeout

Advanced thread config + fork example

uv run python examples/thread_config_and_fork.py \
  --transport stdio \
  --cwd . \
  --base-instructions "Be concise." \
  --developer-instructions "Prioritize correctness."

Websocket:

uv run python examples/thread_config_and_fork.py \
  --transport websocket \
  --url ws://127.0.0.1:8765

Quiet mode:

uv run python examples/thread_config_and_fork.py --quiet

Resume-by-id example

uv run python examples/thread_resume_by_id.py \
  --transport stdio \
  --thread-id <existing-thread-id> \
  --prompt "Continue the previous conversation."

Quiet mode:

uv run python examples/thread_resume_by_id.py --thread-id <existing-thread-id> --quiet

Concurrent thread handles example

This example starts two new threads and runs turns concurrently on those fresh ThreadHandles over one shared client connection (it does not call thread/resume for the newly started threads).

uv run python examples/thread_concurrent_handles.py --transport stdio

Quiet mode:

uv run python examples/thread_concurrent_handles.py --quiet

Thread/model/config ops showcase

This example uses the newly exposed helper APIs:

thread/read, thread/list, thread/name/set, thread/archive
model/list
config/read
endpoint-aware summaries with explicit <not-provided> / null values
optional thread model update reporting with --set-model
config/read prints origin_entries: count of config keys that include provenance metadata (which layer/file provided that effective value)

uv run python examples/thread_ops_showcase.py \
  --transport stdio \
  --prompt "Give a 3-bullet summary." \
  --thread-name "showcase-thread"

Websocket:

uv run python examples/thread_ops_showcase.py \
  --transport websocket \
  --url ws://127.0.0.1:8765

Report the intended model update and whether the model is visible in the thread snapshot before and after:

uv run python examples/thread_ops_showcase.py --set-model gpt-5.3-codex

With raw payload dumps:

uv run python examples/thread_ops_showcase.py --show-data

Quiet mode:

uv run python examples/thread_ops_showcase.py --quiet

Stdio example (multi-turn, one thread)

uv run python examples/chat_session_stdio.py

Custom command and prompts:

uv run python examples/chat_session_stdio.py \
  --cmd "codex app-server" \
  --prompt "First prompt" \
  --prompt "Second prompt"

Websocket example (multi-turn, one thread)

uv run python examples/chat_session_websocket.py

With explicit endpoint/token:

uv run python examples/chat_session_websocket.py \
  --url ws://127.0.0.1:8765 \
  --token "$CODEX_APP_SERVER_TOKEN"

Or via environment:

export CODEX_APP_SERVER_WS_URL=ws://127.0.0.1:8765
export CODEX_APP_SERVER_TOKEN=your-token
uv run python examples/chat_session_websocket.py

API reference (quick)

`CodexClient` (`src/codex_app_server_sdk/client.py`)

connect_stdio(...): create a stdio-configured client (unstarted).
connect_websocket(...): create a websocket-configured client (unstarted).
start(): connect transport and start receive loop (idempotent).
initialize(params=None, timeout=None): perform JSON-RPC initialize handshake with default-merged params (protocolVersion, clientInfo, capabilities) and return normalized InitializeResult.
request(method, params=None, timeout=None): low-level JSON-RPC request helper.
start_thread(config=None): create thread and return ThreadHandle.
resume_thread(thread_id, overrides=None): resume thread and return ThreadHandle.
fork_thread(thread_id, overrides=None): fork thread and return ThreadHandle.
set_thread_defaults(thread_id, overrides): apply thread-level overrides via thread/resume.
read_thread(thread_id, include_turns=True): read one thread.
list_threads(...): list threads with optional filters.
set_thread_name(thread_id, name): rename thread.
archive_thread(thread_id) / unarchive_thread(thread_id): archive lifecycle controls.
rollback_thread(thread_id, num_turns=...): drop recent turns from thread history.
compact_thread(thread_id): request context compaction.
chat(text=None, thread_id=None, user=None, metadata=None, thread_config=None, turn_overrides=None, inactivity_timeout=None, continuation=None): async iterator yielding completed non-delta step blocks.
chat_once(text=None, thread_id=None, user=None, metadata=None, thread_config=None, turn_overrides=None, inactivity_timeout=None, continuation=None): send one user message and wait for the completed turn.
cancel(continuation, timeout=None): interrupt running turn, return unread steps/events, and clean turn state.
steer_turn(thread_id=..., expected_turn_id=..., input_items=...): steer active turn input.
start_review(thread_id=..., target=..., delivery=None): run review mode.
list_models(...): discover available models.
exec_command(command, ...): run one command via server command API.
read_config(...), read_config_requirements(), write_config_value(...), batch_write_config(...): config APIs.
interrupt_turn(turn_id, timeout=None): low-level turn interruption request.
close(): cancel receive loop and close transport.

`Transport` and implementations (`src/codex_app_server_sdk/transport.py`)

Transport.connect/send/recv/close: abstract interface.
StdioTransport: line-delimited JSON over subprocess stdin/stdout.
WebSocketTransport: JSON messages over websocket frames.
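
For readers unfamiliar with line-delimited JSON, the framing used by the stdio transport can be sketched as one JSON document per line. The helpers below (write_message, read_message) are hypothetical and operate on an in-memory buffer rather than a subprocess. The message contents are illustrative and not a statement of the exact wire format.

```python
import io
import json
from typing import Any, BinaryIO, Optional


def write_message(stream: BinaryIO, message: dict[str, Any]) -> None:
    """Write one JSON message followed by a newline."""
    # json.dumps escapes newlines inside strings, so each message stays on one line.
    stream.write(json.dumps(message).encode("utf-8") + b"\n")
    stream.flush()


def read_message(stream: BinaryIO) -> Optional[dict[str, Any]]:
    """Read one message; return None at end of stream."""
    line = stream.readline()
    if not line:
        return None
    return json.loads(line)


buffer = io.BytesIO()
write_message(buffer, {"id": 1, "method": "initialize", "params": {}})
write_message(buffer, {"id": 2, "method": "thread/start", "params": {"note": "line1\nline2"}})
buffer.seek(0)
first = read_message(buffer)
second = read_message(buffer)
```

The embedded newline in the second message survives the round trip because it is escaped inside the JSON string and never appears as a raw line break.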

Data models (`src/codex_app_server_sdk/models.py`)

InitializeResult: parsed initialize response (protocol_version, server_info, capabilities, raw).
ConversationStep: completed step from chat(...) (step_type, item_type, text, item_id, thread_id, turn_id, data).
ChatResult: buffered turn output (thread_id, turn_id, final_text, raw_events, assistant_item_id, completion_source).
ChatContinuation: continuation token for timed-out running turns (thread_id, turn_id, cursor, mode).
CancelResult: cancellation result with unread steps/raw_events plus terminal flags.
ThreadConfig: thread-level config for thread/start, thread/resume, thread/fork (cwd, base_instructions, developer_instructions, model, ...).
TurnOverrides: per-turn overrides forwarded to turn/start (cwd, model, effort, ...).
UNSET: sentinel for “omit this field from request payload.”
ApprovalPolicy: literal type for approval policy values (untrusted, on-failure, on-request, never).

`ThreadHandle` (`src/codex_app_server_sdk/client.py`)

thread_id: bound thread id.
defaults: local thread config snapshot.
chat_once(...): convenience one-turn call bound to this thread.
chat(...): step-streaming call bound to this thread.
update_defaults(overrides): apply thread defaults between messages.
fork(overrides=None): fork thread and get a new handle.
read(include_turns=True): low-level thread/read helper.
set_name(name), archive(), unarchive(), rollback(num_turns), compact(): thread lifecycle/history helpers.
start_review(target, delivery=None): thread-bound review API.

Exceptions (`src/codex_app_server_sdk/errors.py`)

CodexError: base exception.
CodexTransportError: transport/connectivity problems.
CodexTimeoutError: request timeout (and base for timeout-related flow).
CodexTurnInactiveError: per-turn inactivity timeout with resumable continuation.
CodexProtocolError: protocol/JSON-RPC error (optional code and data).

Behavior notes

This version does not expose token-delta streaming as a public API.
chat(...) provides async streaming of completed step blocks (non-delta) from live item/completed notifications only.
chat(...) intentionally does not merge thread/read snapshot items for the same turn, avoiding duplicate blocks when snapshot item IDs differ from live event item IDs.
chat_once(...) resolves final text from completed agentMessage items (item/completed), with thread/read(includeTurns=true) fallback.
turn_timeout has been intentionally removed to avoid conflicting timeout semantics.
Turn waits are controlled by inactivity_timeout (or unbounded when None).
cancel(...) interrupts a continuation turn, returns unread buffered data, and cleans internal session state so the same thread can be reused safely.
Advanced thread-level config/fork uses protocol v2 methods (thread/start, thread/resume, thread/fork) exposed via ThreadHandle and ThreadConfig.
metadata is applied on turn/start payloads for message turns; thread-level config uses schema-aligned fields on thread methods.
The preferred lifecycle is the context-manager form (async with CodexClient.connect_*() as client); manual start()/close() remains available for advanced control.
The client uses modern thread/turn methods (thread/start, thread/resume, turn/start, turn/interrupt).
initialize currently sends protocolVersion: "1" as handshake metadata.
Websocket transport targets websockets (>=16,<17), uses additional_headers, and disables compression by default (compression=None) for codex app-server compatibility.
After dependency changes, run uv sync to refresh the virtual environment.
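
The note above that turn waits are governed by inactivity_timeout describes an idle clock that restarts on every event, as opposed to a cap on total turn duration. A minimal asyncio sketch of that idea follows. read_events and TurnInactive are hypothetical stand-ins, and the SDK's internal wait logic may differ.

```python
import asyncio
from typing import AsyncIterator, Optional


class TurnInactive(Exception):
    """Stand-in for illustration; not the SDK's CodexTurnInactiveError."""


async def read_events(
    queue: "asyncio.Queue[Optional[str]]",
    inactivity_timeout: Optional[float],
) -> AsyncIterator[str]:
    """Yield events until a None end marker; the idle clock restarts after every event."""
    while True:
        try:
            # timeout=None makes wait_for wait indefinitely (the unbounded case).
            event = await asyncio.wait_for(queue.get(), timeout=inactivity_timeout)
        except asyncio.TimeoutError:
            raise TurnInactive(f"no events for {inactivity_timeout}s") from None
        if event is None:
            return
        yield event


async def demo() -> list[str]:
    queue: "asyncio.Queue[Optional[str]]" = asyncio.Queue()

    async def producer() -> None:
        # Four events 0.1s apart: the total (0.4s) exceeds the 0.3s timeout,
        # but no single gap does, so the reader never trips.
        for name in ("step-1", "step-2", "step-3", "step-4"):
            await asyncio.sleep(0.1)
            await queue.put(name)
        await queue.put(None)

    task = asyncio.create_task(producer())
    seen = [event async for event in read_events(queue, inactivity_timeout=0.3)]
    await task
    return seen


events = asyncio.run(demo())
```

If the producer stalls for longer than the timeout between two events, the reader raises instead, which is the point at which a continuation would be handed back to the caller.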

Initialize handshake (`initialize()`)

initialize() performs the protocol handshake and returns InitializeResult.

chat_once(...) and chat(...) call initialize() automatically on first use.
Call initialize() explicitly when you want to fail fast before the first turn, inspect server metadata, or send custom init params.

Default initialize payload

When params=None, the client sends:

{
  "protocolVersion": "1",
  "clientInfo": {
    "name": "codex-app-server-sdk",
    "version": "0.1.0"
  },
  "capabilities": {
    "optOutNotificationMethods": [
      "codex/event/agent_message_content_delta",
      "codex/event/reasoning_content_delta",
      "codex/event/item_started",
      "codex/event/item_completed",
      "codex/event/task_started",
      "codex/event/task_complete"
    ]
  }
}

Custom init params (`initialize(params=...)`)

Supported/customizable keys:

protocolVersion: str
clientInfo: dict (commonly name, version, plus optional extra fields)
capabilities: dict
capabilities.optOutNotificationMethods: list[str]
any additional top-level keys are passed through unchanged

Merge rules:

the payload starts from the default block above;
caller params are shallow-merged at top level;
if caller provides capabilities as a dict and omits optOutNotificationMethods, defaults are auto-injected;
if caller provides capabilities.optOutNotificationMethods, caller value is preserved;
if caller sets capabilities to None or a non-dict value, no injection is applied.
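
The merge rules above can be restated as a short function. This is a hypothetical re-implementation for illustration (merge_initialize_params is not an SDK name), with a shortened default payload. It follows the rules as listed and makes no claim about the client's actual code.

```python
import copy
from typing import Any, Optional

# Shortened stand-in for the default payload shown earlier.
DEFAULT_INIT_PARAMS: dict[str, Any] = {
    "protocolVersion": "1",
    "clientInfo": {"name": "codex-app-server-sdk", "version": "0.1.0"},
    "capabilities": {
        "optOutNotificationMethods": [
            "codex/event/agent_message_content_delta",
            "codex/event/reasoning_content_delta",
        ]
    },
}


def merge_initialize_params(params: Optional[dict[str, Any]] = None) -> dict[str, Any]:
    """Hypothetical re-implementation of the merge rules listed above."""
    merged = copy.deepcopy(DEFAULT_INIT_PARAMS)
    if not params:
        return merged
    # Shallow merge: each caller-supplied top-level key replaces the default wholesale.
    merged.update(copy.deepcopy(params))
    caps = merged.get("capabilities")
    # Re-inject the default opt-out list only when capabilities is a dict that lacks it.
    if isinstance(caps, dict) and "optOutNotificationMethods" not in caps:
        caps["optOutNotificationMethods"] = list(
            DEFAULT_INIT_PARAMS["capabilities"]["optOutNotificationMethods"]
        )
    return merged
```

One consequence of a shallow merge, under these rules: passing clientInfo with only a name replaces the whole default clientInfo object, so the default version is not carried over.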

`InitializeResult` fields

protocol_version: extracted from protocolVersion or protocol_version in server result
server_info: extracted from serverInfo or server_info
capabilities: extracted from capabilities
raw: full raw initialize result payload
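
The field extraction described above, which accepts either camelCase or snake_case keys, might look like the following sketch. InitializeResultSketch and parse_initialize_result are stand-ins for illustration and are not the SDK's classes. Preferring the camelCase spelling when both are present is an assumption.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class InitializeResultSketch:
    """Stand-in for illustration; not the SDK's InitializeResult class."""

    protocol_version: Optional[str]
    server_info: Optional[dict[str, Any]]
    capabilities: Optional[dict[str, Any]]
    raw: dict[str, Any]


def _first_present(raw: dict[str, Any], *keys: str) -> Any:
    """Return the value of the first key found in raw, or None."""
    for key in keys:
        if key in raw:
            return raw[key]
    return None


def parse_initialize_result(raw: dict[str, Any]) -> InitializeResultSketch:
    return InitializeResultSketch(
        # Assumption: the camelCase spelling wins when both spellings are present.
        protocol_version=_first_present(raw, "protocolVersion", "protocol_version"),
        server_info=_first_present(raw, "serverInfo", "server_info"),
        capabilities=raw.get("capabilities"),
        raw=raw,
    )


result = parse_initialize_result(
    {"protocol_version": "1", "serverInfo": {"name": "example-server"}}
)
```

Keeping raw alongside the normalized fields means callers can still reach server-specific keys that the normalized view does not model.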

Example: explicit initialize

import asyncio
from codex_app_server_sdk import CodexClient


async def main() -> None:
    async with CodexClient.connect_stdio() as client:
        init = await client.initialize(
            {
                "clientInfo": {
                    "name": "my-client",
                    "version": "0.3.0",
                },
                "capabilities": {
                    "optOutNotificationMethods": [
                        "codex/event/agent_message_content_delta"
                    ]
                },
            }
        )
        print(init.protocol_version)


asyncio.run(main())
