coder

mirror of https://github.com/coder/coder.git synced 2026-06-03 21:18:24 +00:00

Author	SHA1	Message	Date
Hugo Dutka	84527390c6	feat: chat desktop backend (#23005 ) Implement the backend for the desktop feature for agents. - Adds a new `/api/experimental/chats/$id/desktop` endpoint to coderd which exposes a VNC stream from a [portabledesktop](https://github.com/coder/portabledesktop) process running inside the workspace - Adds a new `spawn_computer_use_agent` tool to chatd, which spawns a subagent that has access to the `computer` tool which lets it interact with the `portabledesktop` process running inside the workspace - Adds the plumbing to make the above possible There's a follow up frontend PR here: https://github.com/coder/coder/pull/23006	2026-03-13 19:49:34 +01:00
Mathias Fredriksson	bdbcd3428b	feat(coderd/chatd): unify chat storage on SDK parts and fix file-reference rendering (#22958 ) File-reference parts in user messages were flattened to `TextContent` at write time because fantasy has no file-reference content type. The frontend never saw them as structured parts. This moves all write paths (user, assistant, tool) from fantasy envelope format to `codersdk.ChatMessagePart`. The streaming layer (`chatloop`) is untouched, the conversion happens at the serialization boundary in `persistStep`. Old rows are still readable. `ParseContent` uses a structural heuristic (`isFantasyEnvelopeFormat`) to distinguish legacy envelopes from SDK parts. We chose this over try/fallback because fantasy envelopes partially unmarshal into `ChatMessagePart` (the `type` field matches) while silently losing content. A guard test enforces that no SDK part can produce the envelope shape. This is forward-only: new rows are unreadable by old code. Chat is behind a feature flag so rollback risk is contained. Also adds a typed `ChatMessageRole` to replace raw strings and `fantasy.MessageRole` casts at the persistence boundary. The type covers `ChatMessage.Role`, `ChatStreamMessagePart.Role`, the `PublishMessagePart` callback chain, and all DB write sites. `fantasy.MessageRole` remains only where we build `fantasy.Message` structs for LLM dispatch. Separately, `ProviderMetadata` was leaking to SSE clients via `publishMessagePart`. `StripInternal` now runs on both the SSE and REST paths, covering this. Other cleanup: - Old `db2sdk.contentBlockToPart` silently dropped metadata on text/reasoning/tool-call content. New code preserves it. - `providerMetadataToOptions` now logs warnings instead of silently returning nil. - `db2sdk` shrinks from ~250 lines of parallel conversion to ~15 lines delegating to `chatprompt.ParseContent()`, removing the `fantasy` import entirely. Refs #22821	2026-03-13 17:53:26 +02:00
Kyle Carberry	42c12176a0	fix(chatd): persist interrupted tool call steps instead of losing them (#23011 ) ## Problem When a chat is interrupted while tools are executing, the step content (text, reasoning, tool calls, and partial tool results) was being lost. Two gaps existed: 1. During tool execution: `executeTools` returns with error results for interrupted tools, but the subsequent `PersistStep(ctx, ...)` fails on the canceled context and returns `ErrInterrupted` without persisting anything. 2. PersistStep race: If the context is canceled between the post-tool interrupt check and the `PersistStep` call, the same loss occurs. This is inconsistent with how we handle stream interruptions (which properly flush and persist partial content via `persistInterruptedStep`) and how [coder/blink](https://github.com/coder/blink) handles interruptions (always inserting the response message regardless of execution phase). ## Fix Two changes in `chatloop.go`: - Post-tool-execution interrupt check: After `executeTools` returns, check if the context was interrupted and route through `persistInterruptedStep` (which uses `context.WithoutCancel` internally) to save the accumulated content. - PersistStep fallback: If `PersistStep` returns `ErrInterrupted`, retry via `persistInterruptedStep` so partial content is not lost. ## Tests - `TestRun_InterruptedDuringToolExecutionPersistsStep`: Verifies that when a tool is blocked and the chat is interrupted, the step (text + reasoning + tool call + tool error result) is persisted via the interrupt-safe path. - `TestRun_PersistStepInterruptedFallback`: Verifies that when `PersistStep` itself returns `ErrInterrupted`, the step is retried via the fallback path and content is saved.	2026-03-12 16:59:16 -04:00
Kyle Carberry	072e9a212f	fix(chatloop): keep provider-executed tool results in assistant message (#23012 ) ## Problem When a step contains both provider-executed tool calls (e.g. Anthropic web search) and local tool calls in parallel, the next loop iteration fails with the Anthropic API claiming the regular tool call has no result. However, sending a new user message (which reloads messages from the DB) works fine. ## Root cause `toResponseMessages` was placing all tool results into the tool-role message, regardless of `ProviderExecuted`. When Fantasy's Anthropic provider later converted these messages for the API, it moved the provider tool result from the tool message to the end of the previous assistant message (`prevMsg.Content = append(...)`). This placed `web_search_tool_result` after the regular `tool_use` block: ``` assistant: [server_tool_use(A), tool_use(B), web_search_tool_result(A)] ← wrong order user: [tool_result(B)] ``` The persistence layer in `chatd.go` already handles this correctly — provider-executed tool results stay in the assistant message, producing the expected ordering: ``` assistant: [server_tool_use(A), web_search_tool_result(A), tool_use(B)] ← correct order user: [tool_result(B)] ``` This is why reloading from the DB fixed it. ## Fix In the `ContentTypeToolResult` case of `toResponseMessages`, route provider-executed results to `assistantParts` instead of `toolParts`, matching the persistence layer's behavior. ## Testing Added `TestToResponseMessages_ProviderExecutedToolResultInAssistantMessage` which verifies that mixed provider+local tool results are split correctly between the assistant and tool messages.	2026-03-12 20:22:09 +00:00
Kyle Carberry	57dc23f603	feat(chatd): add provider-native web search tools to chats (#22909 ) ## What Adds provider-native web search tools to the chat system. Anthropic, OpenAI, and Google all offer server-side web search — this wires them up as opt-in per-model config options using the existing `ChatModelProviderOptions` JSONB column (no migration). Web search is off by default. ## Config Set `web_search_enabled: true` in the model config provider options: ```json { "provider_options": { "anthropic": { "web_search_enabled": true, "allowed_domains": ["docs.coder.com", "github.com"] } } } ``` Available options per provider: - Anthropic: `web_search_enabled`, `allowed_domains`, `blocked_domains` - OpenAI: `web_search_enabled`, `search_context_size` (`low`/`medium`/`high`), `allowed_domains` - Google: `web_search_enabled` ## Backend - `codersdk/chats.go` — new fields on the per-provider option structs - `coderd/chatd/chatd.go` — `buildProviderTools()` reads config, creates `ProviderDefinedTool` entries (uses `anthropic.WebSearchTool()` helper from fantasy) - `coderd/chatd/chatloop/chatloop.go` — `ProviderTools` on `RunOptions`, merged into `Call.Tools`. Provider-executed tool calls skip local execution. `StreamPartTypeToolResult` with `ProviderExecuted: true` is accumulated inline (matching fantasy's own agent.go pattern) instead of post-stream synthesis. - `coderd/chatd/chatprompt/` — `MarshalToolResult` carries `ProviderMetadata` through DB persistence so multi-turn round-trips work (Anthropic needs `encrypted_content` back) ## Frontend - Source citations render inline at the tool-call position (not bottom-of-message), using `ToolCollapsible` so they look like other tool cards — collapsed "Searched N results" with globe icon, expand to see source pills - Provider-executed tool calls/results are hidden from the normal tool card UI - Tool-role messages with only provider-executed results return `null` (no empty bubble) - Both persisted (messageParsing.ts) and streaming (streamState.ts) paths group consecutive `source` parts into a single `{ type: "sources" }` render block ## Fantasy changes The fantasy fork (`kylecarbs/fantasy` branch `cj/go1.25`) has the Anthropic tool code merged in, but will hopefully go upstream from: https://github.com/charmbracelet/fantasy/pull/163	2026-03-11 21:33:15 +00:00
Kyle Carberry	1f37df4db3	perf(chatd): fix six scale bottlenecks identified by benchmarking (#22957 ) ## Summary Scale-tested the `chatd` package with mock-based benchmarks to identify performance bottlenecks. This PR fixes 6 of the 8 identified issues, ranked by severity. ## Changes ### 1. Parallel tool execution (HIGH) — `chatloop.go` `executeTools` ran tool calls sequentially. Now dispatches all calls concurrently via goroutines with `sync.WaitGroup`. Results are pre-allocated by index (no mutex needed). `onResult` callbacks fire as each tool completes. ### 2. Pubsub-backed subagent await (HIGH) — `subagent.go` `awaitSubagentCompletion` polled the DB every 200ms. Now subscribes to the child chat's `ChatStreamNotifyChannel` via pubsub for near-instant notifications. Fallback poll reduced to 5s. Falls back to 200ms only when `pubsub == nil` (single-instance / in-memory). ### 3. Per-chat stream locking (MEDIUM) — `chatd.go` Replaced single global `streamMu` + `map[uuid.UUID]*chatStreamState` with `sync.Map` where each `chatStreamState` has its own `sync.Mutex`. Zero cross-chat contention. ### 4. Batch chat acquisition (MEDIUM) — `chatd.go` `processOnce` acquired 1 chat per tick. Now loops up to `maxChatsPerAcquire = 10` per tick, avoiding idle time when many chats are pending. ### 5. Reduced heartbeat frequency (LOW-MEDIUM) — `chatd.go` `chatHeartbeatInterval` changed from 30s to 60s. Safe given the 5-minute `DefaultInFlightChatStaleAfter`. ### 6. O(depth) descendant check (LOW) — `subagent.go` Replaced top-down BFS (`O(total_descendants)` queries) with bottom-up parent-chain walk (`O(depth)` queries). Includes cycle protection. ## Not addressed (intentionally) - Message serialization overhead - Buffer eviction (`buffer[1:]` pattern)	2026-03-11 14:00:08 -04:00
Kyle Carberry	0a026fde39	refactor: remove reasoning title extraction from chat pipeline (#22926 ) Removes the backend and frontend logic that extracted compact titles from reasoning/thinking blocks. The `Title` field on `ChatMessagePart` remains for other part types (e.g. source), but reasoning blocks no longer have titles derived from first-line markdown bold text or provider metadata summaries. Backend: - Remove `ReasoningTitleFromFirstLine`, `reasoningTitleFromContent`, `reasoningSummaryTitle`, `compactReasoningSummaryTitle`, and `reasoningSummaryHeadline` from chatprompt - Simplify `marshalContentBlock` to plain `json.Marshal` (no title injection) - Remove title tracking maps and `setReasoningTitleFromText` from chatloop stream processing - Remove `reasoningStoredTitle` from db2sdk - Remove related tests from db2sdk_test Frontend: - Remove `mergeThinkingTitles` from blockUtils - Simplify `appendTextBlock` to always merge consecutive thinking blocks - Remove `applyStreamThinkingTitle` from streamState - Simplify reasoning/thinking stream handler to ignore title-only parts - Update tests accordingly Net: -487 lines / +42 lines	2026-03-11 11:01:26 +00:00
Kyle Carberry	eecb7d0b66	fix: resolve bugs in chatd streaming system (#22720 ) Split from #22693 per review feedback. Fixes multiple bugs in coderd/chatd and sub-packages including race conditions, transaction safety, stream buffer bounds, retry limits, and enterprise relay improvements. See commit message for full list.	2026-03-06 21:02:25 +00:00
Kyle Carberry	5630390d94	fix(chatd): enable compaction between steps and re-enter after summarization (#22640 ) ## Problem Three bugs with chat summarization (compaction) share a single root cause: `ReloadMessages` was never wired up in the production `chatloop.Run()` call. ### Bug 1: Compaction never fires between steps The inline compaction guard in `chatloop.go` requires both `Compaction` and `ReloadMessages` to be non-nil: ```go if opts.Compaction != nil && opts.ReloadMessages != nil { ``` Since `ReloadMessages` was only set in tests, inline compaction was dead code in production. Long multi-step turns could blow through the context window. ### Bug 2: Compaction only occurs at end of turn The post-run safety net doesn't check `ReloadMessages`, so it was the only compaction path that fired: ```go if !alreadyCompacted && opts.Compaction != nil { // no ReloadMessages check ``` This meant compaction only happened once, after the entire agent turn finished. ### Bug 3: Agent stops after summarization After post-run compaction, `Run()` unconditionally returned `nil`. `processChat` then set the chat status to `waiting` (done). The agent never had a chance to continue with its fresh summarized context. ## Fix 1. Wire up `ReloadMessages` in `chatd.go`: reloads persisted messages from the database and re-applies system prompts (subagent instruction, workspace AGENTS.md). 2. Wrap the step loop in an outer compaction loop: when compaction fires on the model's final step (`compactedOnFinalStep`), reload messages and `continue` the outer loop so the agent re-enters with summarized context. 3. Track `compactedOnFinalStep` to distinguish inline compaction on the last step (needs re-entry) from inline compaction mid-loop followed by more tool-call steps (agent already consumed the compacted context, no re-entry needed). 4. Add `maxCompactionRetries = 3` to prevent infinite compaction loops. ## Testing - All 7 existing compaction tests pass unchanged. - Added `PostRunCompactionReEntersStepLoop` test: verifies that when a text-only response triggers compaction, the outer loop re-enters and the agent makes a second stream call with fresh context.	2026-03-04 22:28:23 -05:00
Kyle Carberry	ddfe630757	refactor(chatd): replace fantasy.Agent with custom agent loop (#22507 ) ## Summary Replaces fantasy's `Agent` abstraction with a direct step loop calling `LanguageModel.Stream()`. Fantasy is retained as the provider abstraction layer (streaming parsers, types, tool schema) but we no longer use `fantasy.Agent`, `AgentStreamCall`, `AgentResult`, or `StepResult`. ## Problems solved \| Problem \| Before \| After \| \|---\|---\|---\| \| Sentinel prompt hack \| fantasy.Agent requires non-empty Prompt → UUID sentinel generated and stripped in PrepareStep \| Messages passed directly to `model.Stream()` \| \| Discarded PersistStep errors \| `_ = opts.OnStepFinish(result)` silently swallows errors \| Errors propagate directly from `PersistStep()` \| \| Shadow draft state \| ~160 LOC tracking content in parallel because fantasy doesn't expose in-progress content on interruption \| `stepResult` owns content directly; `flushActiveState()` is trivial \| \| Nested retry layers \| fantasy's 2-attempt retry nested inside chatretry's indefinite retry \| Single `chatretry.Retry` layer \| \| Callback-mediated compaction \| Mutex + boolean flag + coordination between OnStepFinish/PrepareStep callbacks \| Inline `if` statement between steps \| \| Duplicate compaction paths \| `compactStep()` + `maybeCompact()` sharing ~80% logic \| Single `tryCompact()` function \| ## Changes ### `coderd/chatd/chatloop/chatloop.go` — Rewritten - Removed: `fantasy.NewAgent()`, `AgentStreamCall`, sentinel prompt, shadow draft state (~160 LOC of closures), `compactedMu`/`compacted` flag, `PrepareStepResult` - Added: `stepResult` struct, `processStepStream()` (stream consumer), `executeTools()` (sequential tool execution), `flushActiveState()` (interrupt handling), `buildToolDefinitions()`, `toResponseMessages()` - Changed: `Run()` return type from `(fantasy.AgentResult, error)` to `error` (callers already discarded the result) - Preserved*: Anthropic prompt caching, reasoning title extraction, `extractContextLimit()`, `ErrInterrupted` semantics ### `coderd/chatd/chatloop/compaction.go` — Simplified - Merged `compactStep()` + `maybeCompact()` → single `tryCompact()` - Removed `[]StepResult` parameter from `generateCompactionSummary()` (caller provides complete message list) - Kept helper functions: `normalizedCompactionConfig`, `contextTokensFromUsage`, `resolveContextLimit`, `shouldCompact` ### `coderd/chatd/chatd.go` — Caller updates - Removed `AgentStreamCall` construction - Changed `_, err = chatloop.Run(...)` to `err = chatloop.Run(...)` - Model parameters moved from `AgentStreamCall` fields to `RunOptions` fields ### Tests — 4 new tests - `MidLoopCompactionReloadsMessages` — compaction fires mid-loop, messages reloaded - `PostRunCompactionSkippedAfterMidLoop` — no double compaction - `MultiStepToolExecution` — tools execute between steps, results feed next step - `PersistStepErrorPropagates` — persistence errors propagate (was silently discarded)	2026-03-02 18:51:57 -05:00
Kyle Carberry	2bdacae5f5	feat(chatd): add LLM stream retry with exponential backoff (#22418 ) ## Summary Adds automatic retry with exponential backoff for transient LLM errors during chat streaming and title generation. Inspired by [coder/mux](https://github.com/coder/mux)'s retry mechanism. ## Key Behaviors - Infinite retries with exponential backoff: 1s → 2s → 4s → ... → 60s cap - Deterministic delays (no jitter) - Error classification: retryable (429, 5xx, overloaded, rate limit, network errors) vs non-retryable (auth, quota, context exceeded, model not found, canceled) - Retry status published to SSE stream so frontend can show "Retrying in Xs..." UI - Title generation retries silently (best-effort, nil onRetry callback) ## New Package: `coderd/chatd/chatretry/` \| File \| Purpose \| \|------\|---------\| \| `classify.go` \| `IsRetryable(err)` and `StatusCodeRetryable(code)` \| \| `backoff.go` \| `Delay(attempt)` — exponential doubling with 60s cap \| \| `retry.go` \| `Retry(ctx, fn, onRetry)` — infinite loop with context-aware timer \| ## Test Helpers: `coderd/chatd/chattest/errors.go` Anthropic and OpenAI error response builders for use in chattest providers: - `AnthropicErrorResponse()`, `AnthropicOverloadedResponse()`, `AnthropicRateLimitResponse()` - `OpenAIErrorResponse()`, `OpenAIRateLimitResponse()`, `OpenAIServerErrorResponse()` ## SDK Changes: `codersdk/chats.go` - New `ChatStreamEventType: "retry"` - New `ChatStreamRetry` struct with `Attempt`, `DelayMs`, `Error`, `RetryingAt` fields - TypeScript types auto-generated ## Changed Files - `coderd/chatd/chatloop/chatloop.go` — wraps `agent.Stream()` in `chatretry.Retry()` - `coderd/chatd/chatd.go` — publishes retry events to SSE stream with logging - `coderd/chatd/title.go` — wraps `model.Generate()` in silent retry - `coderd/chatd/chattest/anthropic.go` / `openai.go` — error injection support ## Tests 42 tests covering classification (33), backoff (9), and retry scenarios (8).	2026-02-27 18:34:33 -05:00
Kyle Carberry	edee917d88	feat: add experimental agents support (#22290 ) feat: add AI chat system with agent tools and chat UI Introduce the chatd subsystem and Agents UI for AI-powered chat within Coder workspaces. - Add chatd package with chat loop, message compaction, prompt management, and LLM provider integration (OpenAI, Anthropic) - Add agent tools: create workspace, list/read templates, read/write/ edit files, execute commands - Add chat API endpoints with streaming, message editing, and durable reconnection - Add database schema and migrations for chats, chat messages, chat providers, and chat model configs - Add RBAC policies and dbauthz enforcement for chat resources - Add Agents UI pages with conversation timeline, queued messages list, diff viewer, and model configuration panel - Add comprehensive test coverage including coderd integration tests, chatd unit tests, and Storybook stories - Gate feature behind experiments flag --------- Co-authored-by: Cian Johnston <cian@coder.com> Co-authored-by: Danielle Maywood <danielle@themaywoods.com> Co-authored-by: Jeremy Ruppel <jeremy@coder.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-27 16:50:56 +00:00

12 Commits