mirror of https://github.com/coder/coder.git synced 2026-06-02 20:48:20 +00:00

Files

T

DevCats 094fe971ad chore(aibridge): add AWS PRM user-agent attribution for Bedrock calls (#25221 )

Adds middleware in `withAWSBedrockOptions` that appends the AWS Partner
Revenue Measurement (PRM) attribution string to the User-Agent header on
every Bedrock API call made through AI Bridge.

This is the AI Bridge counterpart to the Terraform provisioner change
merged in #23138. Together, they ensure all AWS API calls made by Coder
(both workspace infrastructure via Terraform and LLM inference via
Bedrock) include PRM attribution.

## How it works

- A middleware is added before `bedrock.WithConfig(awsCfg)` that reads
the existing `User-Agent` header and appends
`sdk-ua-app-id/APN_1.1%2Fpc_cdfmjwn8i6u8l9fwz8h82e4w3%24`
- Only affects Bedrock calls; OpenAI and direct Anthropic API calls are
unaffected
- Uses `option.WithMiddleware` rather than `option.WithHeader` because
the existing User-Agent (set by the Anthropic SDK) must be preserved and
appended to, not replaced

## Tests

- **Positive**: `TestAWSBedrockIntegration` verifies PRM attribution is
present in the User-Agent on Bedrock requests
- **Negative**: `TestAnthropicMessages` verifies PRM attribution is
absent on non-Bedrock requests

## References

- Companion Terraform provisioner PR: #23138 (merged)
- Backport: #24052 (merged)
- Preserve existing `AWS_SDK_UA_APP_ID`: #24606 (open)
- Original `coder/aibridge` PR:
https://github.com/coder/aibridge/pull/224 (superseded by this PR since
aibridge was moved into coder/coder via #24190)
- [AWS SDK Application ID
docs](https://docs.aws.amazon.com/sdkref/latest/guide/feature-appid.html)
- [AWS PRM Automated User
Agent](https://prm.partner.aws.dev/automated-user-agent.html) (partner
login required)

> Generated with [Coder Agents](https://coder.com/agents)

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>

2026-05-28 11:08:00 -05:00

circuitbreaker

feat: remove 429 from aibridge circuit breaker failure conditions (#24701 )

2026-04-30 09:31:32 +01:00

config

feat: add automatic key failover for AI Bridge OpenAI (#24847 )

2026-05-07 15:35:46 +01:00

context

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

fixtures

feat: add automatic key failover for AI Bridge OpenAI (#24847 )

2026-05-07 15:35:46 +01:00

intercept

chore(aibridge): add AWS PRM user-agent attribution for Bedrock calls (#25221 )

2026-05-28 11:08:00 -05:00

internal

chore(aibridge): add AWS PRM user-agent attribution for Bedrock calls (#25221 )

2026-05-28 11:08:00 -05:00

keypool

refactor(aibridge): clean up keypool and provider error handling (#25609 )

2026-05-25 18:58:29 +01:00

mcp

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

mcpmock

chore: 'go generate' mockgen to use go tool wrapper (#25490 )

2026-05-19 14:53:13 +00:00

metrics

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

provider

refactor(aibridge): remove InjectAuthHeader in favor of KeyFailoverConfig (#25618 )

2026-05-25 19:10:38 +01:00

recorder

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

tracing

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

utils

feat: add automatic key failover for AI Bridge passthrough (#24920 )

2026-05-07 15:46:36 +01:00

AGENTS.md

chore: update aibridge/AGENTS.md to reflect it is now part of coder/coder repo (#24822 )

2026-04-29 15:51:07 +02:00

api.go

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

bridge_test.go

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

bridge.go

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

client_test.go

fix(aibridge): track Charm Crush client and session ID (#24630 )

2026-04-22 19:02:31 +02:00

client.go

fix(aibridge): track Charm Crush client and session ID (#24630 )

2026-04-22 19:02:31 +02:00

passthrough_internal_test.go

refactor(aibridge): remove InjectAuthHeader in favor of KeyFailoverConfig (#25618 )

2026-05-25 19:10:38 +01:00

passthrough.go

feat: add automatic key failover for AI Bridge passthrough (#24920 )

2026-05-07 15:46:36 +01:00

README.md

chore: update AI Gateway docs (#24805 )

2026-04-29 18:28:09 +02:00

session_test.go

fix(aibridge): track Charm Crush client and session ID (#24630 )

2026-04-22 19:02:31 +02:00

session.go

fix(aibridge): track Charm Crush client and session ID (#24630 )

2026-04-22 19:02:31 +02:00

sse_parser.go

chore: move aibridge library code into coder repo (#24190 )

2026-04-22 17:01:01 +02:00

README.md

aibridge

aibridge provides an HTTP handler that intercepts AI client requests bound for upstream AI providers (Anthropic, OpenAI, Copilot). It records token usage, prompts, and tool invocations per user. Optionally supports centralized MCP tool injection with allowlist/denylist filtering.

The handler is mounted by a host process. Today that host is coderd, which mounts the handler at /api/v2/aibridge/<provider>/*. Running aibridge as a separate process is planned for the future.

Architecture

┌─────────────────┐     ┌───────────────────────────────────────────┐
│    AI Client    │     │                    aibridge               │
│  (Claude Code,  │────▶│  ┌─────────────────┐    ┌─────────────┐   │
│   Cursor, etc.) │     │  │  RequestBridge  │───▶│  Providers  │   │
└─────────────────┘     │  │  (http.Handler) │    │  (Anthropic │   │
                        │  └─────────────────┘    │   OpenAI)   │   │
                        │                         └──────┬──────┘   │
                        │                                │          │
                        │                                ▼          │    ┌─────────────┐
                        │  ┌─────────────────┐    ┌─────────────┐   │    │  Upstream   │
                        │  │    Recorder     │◀───│ Interceptor │─── ───▶│    API      │
                        │  │ (tokens, tools, │    │ (streaming/ │   │    │ (Anthropic  │
                        │  │  prompts)       │    │  blocking)  │   │    │   OpenAI)   │
                        │  └────────┬────────┘    └──────┬──────┘   │    └─────────────┘
                        │           │                    │          │
                        │           ▼             ┌──────▼──────┐   │
                        │  ┌ ─ ─ ─ ─ ─ ─ ─ ┐      │  MCP Proxy  │   │
                        │  │    Database   │      │   (tools)   │   │
                        │  └ ─ ─ ─ ─ ─ ─ ─ ┘      └─────────────┘   │
                        └───────────────────────────────────────────┘

Components

RequestBridge: The main http.Handler that routes requests to providers
Provider: Defines bridged routes (intercepted) and passthrough routes (proxied)
Interceptor: Handles request/response processing and streaming
Recorder: Interface for capturing usage data (tokens, prompts, tools)
MCP Proxy (optional): Connects to MCP servers to list tool, inject them into requests, and invoke them in an inner agentic loop

Request Flow

Client sends request to /anthropic/v1/messages or /openai/v1/chat/completions
Actor extraction: Request must have an actor in context (via AsActor()). The host is responsible for authenticating the caller before invoking the handler.
Upstream call: Request forwarded to the AI provider
Response relay: Response streamed/sent to client
Recording: Token usage, prompts, and tool invocations recorded

With MCP enabled: Tools from configured MCP servers are centrally defined and injected into requests (prefixed bmcp_). Allowlist/denylist regex patterns control which tools are available. When the model selects an injected tool, the gateway invokes it in an inner agentic loop, and continues the conversation loop until complete.

Passthrough routes (/v1/models, /v1/messages/count_tokens) are reverse-proxied directly.

Observability

Prometheus Metrics

Create metrics with NewMetrics(prometheus.Registerer):

Metric	Type	Description
`interceptions_total`	Counter	Intercepted request count
`interceptions_inflight`	Gauge	Currently processing requests
`interceptions_duration_seconds`	Histogram	Request duration
`passthrough_total`	Counter	Non-intercepted requests forwarded to the upstream
`prompts_total`	Counter	User prompt count
`tokens_total`	Counter	Token usage (input, output, cache read/write, provider extras)
`injected_tool_invocations_total`	Counter	Injected MCP tool invocations performed by the handler
`non_injected_tool_selections_total`	Counter	Client-defined tool selections returned by the model
`circuit_breaker_state`	Gauge	Circuit breaker state per provider/endpoint (0=closed, 0.5=half, 1=open)
`circuit_breaker_trips_total`	Counter	Times the circuit breaker transitioned to open
`circuit_breaker_rejects_total`	Counter	Requests rejected due to an open circuit breaker

Recorder Interface

Implement Recorder to persist usage data to your database:

aibridge_interceptions - request metadata (provider, model, initiator, timestamps)
aibridge_token_usages - input/output and cache read/write token counts per response
aibridge_user_prompts - user prompts
aibridge_tool_usages - tool invocations (injected and client-defined)
aibridge_model_thoughts - model reasoning content (thinking, reasoning summaries, commentary)

type Recorder interface {
    RecordInterception(ctx context.Context, req *InterceptionRecord) error
    RecordInterceptionEnded(ctx context.Context, req *InterceptionRecordEnded) error
    RecordTokenUsage(ctx context.Context, req *TokenUsageRecord) error
    RecordPromptUsage(ctx context.Context, req *PromptUsageRecord) error
    RecordToolUsage(ctx context.Context, req *ToolUsageRecord) error
    RecordModelThought(ctx context.Context, req *ModelThoughtRecord) error
}

Supported Routes

Each provider instance is mounted under /api/v2/aibridge/<name>, where <name> is the provider's configured name. For example, with an Anthropic provider named my-anthropic, its /messages endpoint would be reachable at /api/v2/aibridge/my-anthropic/v1/messages.

If a name is not set, the route path defaults to the provider's type: anthropic, openai, or copilot. The table below uses the default names.

(/*) denotes a route that handles both the exact path and any subpaths. A trailing /* denotes subpaths only.

Provider	Route	Type
Anthropic	`/anthropic/v1/messages`	Bridged (intercepted)
Anthropic	`/anthropic/v1/messages/count_tokens`	Passthrough
Anthropic	`/anthropic/v1/models(/*)`	Passthrough
Anthropic	`/anthropic/api/event_logging/*`	Passthrough
OpenAI	`/openai/v1/chat/completions`	Bridged (intercepted)
OpenAI	`/openai/v1/responses`	Bridged (intercepted)
OpenAI	`/openai/v1/responses/*`	Passthrough
OpenAI	`/openai/v1/conversations(/*)`	Passthrough
OpenAI	`/openai/v1/models(/*)`	Passthrough
Copilot	`/copilot/chat/completions`	Bridged (intercepted)
Copilot	`/copilot/responses`	Bridged (intercepted)
Copilot	`/copilot/models(/*)`	Passthrough
Copilot	`/copilot/agents/*`	Passthrough
Copilot	`/copilot/mcp/*`	Passthrough
Copilot	`/copilot/.well-known/*`	Passthrough