coder

mirror of https://github.com/coder/coder.git synced 2026-06-05 05:58:20 +00:00

Author	SHA1	Message	Date
Kyle Carberry	3812b504fc	fix(coderd/x/chatd): prevent nil required field in MCP tool schemas for OpenAI (#23538 )	2026-03-24 18:29:41 -04:00
Mathias Fredriksson	9dc2e180a2	test(coderd/x/chatd): add coverage for awaitSubagentCompletion (#23527 ) Nine subtests covering the poll loop, pubsub notification path, timeout, context cancellation, descendant auth check, and both error-status branches in handleSubagentDone. Wire p.clock through awaitSubagentCompletion's timer and ticker so future tests can use quartz mock clock. Tests use channel-based coordination and context.WithTimeout instead of time.Sleep. Coverage: awaitSubagentCompletion 0%->70.3%, handleSubagentDone 0%->100%, checkSubagentCompletion 0%->77.8%, latestSubagentAssistantMessage 0%->78.9%.	2026-03-24 22:19:18 +00:00
Michael Suchacz	4f571f8fff	fix: inline synthetic paste attachments as bounded prompt text (#23523 ) ## Summary Large pasted text that the UI collapses into an attachment chip was completely invisible to the LLM. Providers only accept specific MIME types (images, PDFs) in file content blocks — a `text/plain` `FilePart` is silently dropped, so the model received nothing for pasted content. ## Fix Detect paste-originated text files by their `pasted-text-{timestamp}.txt` filename pattern and convert them to `fantasy.TextPart` with a bounded 128 KiB inline body and truncation notice. Binary uploads and real uploaded text files keep their existing `FilePart` semantics. The detection uses the existing frontend naming convention (`pasted-text-YYYY-MM-DD-HH-MM-SS.txt`) combined with a text-like MIME check for defense-in-depth. A TODO marks this for migration to explicit origin metadata. <details> <summary>Review notes: intentionally skipped findings</summary> A 10-reviewer deep review was run on this change. The following findings were raised and intentionally dropped after cross-check. Documenting them here so future reviewers do not re-flag the same concerns: "Unresolved file IDs cause silent data loss" (Edge Case Analyst P1) — When a file ID is not in the resolver map, `name` stays empty and paste detection fails. This is pre-existing behavior for ALL file types (not introduced by this change). The resolver calls `GetChatFilesByIDs` which returns whatever rows exist; missing IDs simply fall through to an empty `FilePart`. The Contract Auditor independently traced this path and confirmed the fallback is safe. If the file was deleted between message construction and conversion, the model already saw nothing before this patch — this change does not make it worse. "String builder pre-allocation overhead" (Performance Analyst P1) — Misidentified scope. `formatSyntheticPasteText` is only called when `isSyntheticPaste` returns true (actual synthetic pastes), not for every file part. The `Grow()` call is correct and efficient. "Constant naming violates Uber style" (Style Reviewer P1) — Over-severity. `syntheticPasteInlineBudget` is standard Go camelCase for unexported constants, consistent with the Uber guide and surrounding code. "`IsSyntheticPasteForTest` naming is misleading" (Style Reviewer P2) — This is the standard Go `export_test.go` pattern. The `ForTest` suffix is conventional. </details>	2026-03-24 21:39:42 +01:00
Kyle Carberry	dda985150d	feat: add MCP server config ID to tool-call message parts (#23522 )	2026-03-24 20:29:36 +00:00
Kyle Carberry	e34162945a	fix(coderd/x/chatd): normalize OAuth2 token type to canonical Bearer case (#23516 ) Linear's MCP server (`mcp.linear.app`) returns `token_type="bearer"` (lowercase) in its OAuth2 token response but rejects requests that use the lowercase form in the `Authorization` header. RFC 6750 says the scheme is case-insensitive, but Linear enforces capital-B `Bearer`. Confirmed by running the actual Linear MCP OAuth flow end-to-end: - `Authorization: Bearer <token>` → 42 tools, works - `Authorization: bearer <token>` → 401 invalid_token This is a one-line fix: normalize any case variant of `bearer` to `Bearer` before building the `Authorization` header, matching the behavior of the mcp-go library's own OAuth handler.	2026-03-24 14:32:06 -04:00
Michael Suchacz	19e86628da	feat: add propose_plan tool for markdown plan proposals (#23452 ) Adds a `propose_plan` tool that presents a workspace markdown file as a dedicated plan card in the agent UI. The workflow is: the agent uses `write_file`/`edit_files` to build a plan file (e.g. `/home/coder/PLAN.md`), then calls `propose_plan(path)` to present it. The backend reads the file via `ReadFile` and the frontend renders it as an expanded markdown preview card. Backend (`coderd/x/chatd/chattool/proposeplan.go`): new tool registered as root-chat-only. Validates `.md` suffix, requires an absolute path, reads raw file content from the workspace agent. Includes 1 MiB size cap. Frontend (`site/src/components/ai-elements/tool/`): dedicated `ProposePlanTool` component with `ToolCollapsible` + `ScrollArea` + `Response` markdown renderer, expanded by default. Custom icon (`ClipboardListIcon`) and filename-based label. System prompt (`coderd/x/chatd/prompt.go`): added `<planning>` section guiding the agent to research → write plan file → iterate → call `propose_plan`.	2026-03-24 15:06:22 +01:00
Michael Suchacz	02356c61f6	fix: use previous_response_id chaining for OpenAI store=true follow-ups (#23450 ) OpenAI Responses follow-up turns were replaying full assistant/tool history even when `store=true`, which breaks after reasoning + provider-executed `web_search` output. This change persists the OpenAI response ID on assistant messages, then in `coderd/x/chatd` switches `store=true` follow-ups to `previous_response_id` chaining with a system + new-user-only prompt. `store=false` and missing-ID cases still fall back to manual replay. It also updates the fake OpenAI server and integration coverage for the chaining contract, and carries the rebased path move to `coderd/x/chatd` plus the migration renumber needed after rebasing onto `main`.	2026-03-24 14:57:40 +01:00
Kyle Carberry	13241a58ba	fix(coderd/x/chatd/mcpclient): use dedicated HTTP transport per MCP connection (#23494 ) ## Problem `TestConnectAll_MultipleServers` flakes with: ``` net/http: HTTP/1.x transport connection broken: http: CloseIdleConnections called ``` Each MCP client connection implicitly uses `http.DefaultTransport`. When `httptest.Server.Close()` runs during parallel test cleanup, it calls `CloseIdleConnections` on `http.DefaultTransport`, breaking in-flight connections from other goroutines or parallel tests sharing that transport. ## Fix Clone the default transport for each MCP connection via `http.DefaultTransport.(*http.Transport).Clone()`, passed through `WithHTTPBasicClient` (StreamableHTTP) and `WithHTTPClient` (SSE). This scopes idle connection cleanup to a single MCP server so it cannot disrupt unrelated connections. Fixes coder/internal#1420	2026-03-24 09:22:45 -04:00
Michael Suchacz	82f965a0ae	feat: per-user per-model chat compaction threshold overrides (#23412 ) ## What Adds per-user per-model auto-compaction threshold overrides. Users can now customize the percentage of context window usage that triggers chat compaction, independently for each enabled model. ## Why The compaction threshold was previously only configurable at the deployment level (`chat_model_configs.compression_threshold`). Different users have different preferences — some want aggressive compaction to keep costs low, others prefer higher thresholds to retain more context. This gives users control without requiring admin intervention. ## Architecture Storage: Reuses the existing `user_configs` table (no migration needed). Overrides are stored as key/value pairs with keys shaped `chat_compaction_threshold:<modelConfigID>` and integer percent values. API: Three new experimental endpoints under `/api/experimental/chats/config/`: - `GET /user-compaction-thresholds` — list all overrides for the current user - `PUT /user-compaction-thresholds/{modelConfig}` — upsert an override (validates model exists and is enabled, validates 0–100 range) - `DELETE /user-compaction-thresholds/{modelConfig}` — clear an override (idempotent) Runtime resolution: In `coderd/chatd/chatd.go`, a new `resolveUserCompactionThreshold()` helper runs at the start of each chat turn (inside `runChat()`), after the model config is resolved but before `CompactionOptions` is built. If a valid override exists, it replaces `modelConfig.CompressionThreshold`. The threshold source (`user_override` vs `model_default`) is logged with each compaction event. Precedence: `effectiveThreshold = userOverride ?? modelConfig.CompressionThreshold` UI: New "Context Compaction" subsection in the Agents → Settings → Behavior tab, placed after Personal Instructions. Shows one row per enabled model with the system default, a number input for the override, and Save/Reset controls. ## Testing - 9 API subtests covering CRUD, validation (boundary values 0/100, out-of-range rejection), upsert behavior, idempotent delete, user isolation, and non-existent model config - 4 dbauthz tests (16 scenarios) verifying `ActionReadPersonal` / `ActionUpdatePersonal` on all query methods - 4 Storybook stories with play functions (Default, WithOverrides, Loading, Error) <details> <summary>Implementation plan</summary> ### Phase 1 — Tests - Backend API tests in `coderd/chats_test.go` (9 subtests) - Database auth wrapper tests in `coderd/database/dbauthz/dbauthz_test.go` (4 methods) - Frontend stories in `UserCompactionThresholdSettings.stories.tsx` (4 stories) ### Phase 2 — Backend preference surface - 4 SQL queries in `coderd/database/queries/users.sql` (list, get, upsert, delete) - `make gen` to propagate into generated artifacts - Auth/metrics wrappers in dbauthz and dbmetrics - SDK types and client methods in `codersdk/chats.go` - HTTP handlers and routes in `coderd/chats.go` and `coderd/coderd.go` - Key prefix constant shared between handlers and runtime ### Phase 3 — Runtime override - `resolveUserCompactionThreshold()` helper in `coderd/chatd/chatd.go` - Override injection in `runChat()` before building `CompactionOptions` - `threshold_source` field added to compaction log ### Phase 4 — Settings UI - API client methods and React Query hooks in `site/src/api/` - `UserCompactionThresholdSettings` component extracted from `SettingsPageContent` - Per-model mutation tracking (only the active row disables during save) - 100% warning, "System default" label, helpful empty state copy ### Phase 5 — Refactor and review fixes - Consolidated key prefix constant in `codersdk` - Explicit PUT range validation (not just struct tags) - GET handler gracefully skips malformed rows instead of 500 - Boundary value, upsert, and non-existent model config tests - UX improvements: per-model mutation state, aria-live on errors </details>	2026-03-24 00:48:18 +01:00
Michael Suchacz	c389c2bc5c	fix(coderd/x/chatd): stabilize auto-promotion flake (#23448 ) TestInterruptAutoPromotionIgnoresLaterUsageLimitIncrease still relied on wall-clock polling after the acquire loop moved to a mock clock, so it could assert before chatd finished its asynchronous cleanup and auto-promotion work. Wait on explicit request-start signals and on the server's in-flight chat work before asserting the intermediate and final database state. This keeps the test synchronized with the actual processor lifecycle instead of scheduler timing. Closes https://github.com/coder/internal/issues/1406	2026-03-23 19:17:58 +00:00
Mathias Fredriksson	138bc41563	fix: improve process tool descriptions to prefer foreground execution (#23395 ) The tool descriptions pushed agents toward backgrounding anything over 5 seconds, including builds, tests, and installs where you actually want to wait for the result. This led to unnecessary process_output round-trips and missed the foreground timeout-to-reattach workflow entirely. Reframe background mode as the exception (persistent processes with no natural exit) and foreground with an appropriate timeout as the default. Replace "background process" with "tracked process" in process_output, process_list, and process_signal since they work on all tracked processes regardless of how they were started.	2026-03-23 17:54:30 +00:00
Cian Johnston	80a172f932	chore: move chatd and related packages to /x/ subpackage (#23445 ) - Moves `coderd/chatd/`, `coderd/gitsync/`, `enterprise/coderd/chatd/` under `x/` parent directories to signal instability - Adds `Experimental:` glue code comments in `coderd/coderd.go` > 🤖 This PR was created with the help of Coder Agents, and was reviewed by my human. 🧑‍💻	2026-03-23 17:34:43 +00:00

12 Commits