coder

mirror of https://github.com/coder/coder.git synced 2026-06-02 20:48:20 +00:00

Author	SHA1	Message	Date
Mathias Fredriksson	07be354683	feat(dogfood/coder): restart container unless stopped (#25382) Add `restart = "unless-stopped"` to the dogfood workspace container so it automatically recovers from crashes, daemon restarts or server reboots without requiring manual intervention.	2026-05-15 12:35:52 +03:00
Jaayden Halko	e8cfff40b4	feat(site): add theme mode frontend foundation (#25181 ) ## Summary - Add theme mode helpers for legacy migration, active theme resolution, draft conversion, and mode switching. - Add `usePreferredColorScheme` and refactor `ThemeProvider` to use the shared theme mode resolver. - Add reusable Appearance theme picker components plus isolated Storybook coverage. ## Dependencies - Stacked on #25180. ## Validation - `pnpm -C site exec vitest run --project=unit src/theme/themeMode.test.ts src/theme/usePreferredColorScheme.test.tsx` - `pnpm -C site lint:types` - `pnpm -C site lint:knip` - Pre-commit hook passed on the branch commit.	2026-05-15 10:04:48 +01:00
Danny Kopping	c6ab379c32	fix(aibridge/intercept/messages): convert enabled thinking to adaptive for Bedrock Opus 4.7+ (#25335 ) Disclaimer: implemented by a Coder Agent using Claude Opus 4.6/4.7 Fixes [coder/aibridge#280](https://github.com/coder/aibridge/issues/280). Claude Opus 4.7 (and future adaptive-only Bedrock models) reject the legacy `thinking.type: "enabled"` + `budget_tokens` shape with a 400. Claude Code falls back to that shape when it cannot read the upstream model's capability metadata, which is exactly the case when AI Bridge sits between the client and Bedrock. Pinning back to Opus 4.6 is the only operator workaround today. This is the counterpart to the `adaptive -> enabled` conversion added in [coder/aibridge#225](https://github.com/coder/aibridge/pull/225) for older Bedrock models. ## Behavior - New `bedrockModelRequiresAdaptiveThinking()` helper matches Opus 4.7 (covers `us.anthropic.claude-opus-4-7`, ARN-style application inference profile names that include the model ID, etc.). - New `RequestPayload.convertEnabledThinkingForBedrock()` rewrites `thinking: {type: enabled, budget_tokens: N}` to `thinking: {type: adaptive}`. The budget hint is dropped; an explicit `output_config.effort` from the caller is preserved naturally because we never touch that field. We deliberately do not derive an effort label from the budget (see decision log). - `removeUnsupportedBedrockFields` learns a variadic `exemptFields` parameter. Adaptive-only models support `output_config` natively (no beta flag required), so `augmentRequestForBedrock` exempts that field for those models. - Bedrock Opus 4.7 accepts `output_config.effort` but rejects `output_config.format` (structured outputs) with the same "Extra inputs are not permitted" 400. The generic strip pass operates at top-level granularity only, so a small targeted pass drops `output_config.format` after the top-level strip for adaptive-only models. 
The whole Bedrock thinking-type shim block carries a header comment flagging it as temporary; a planned native Bedrock provider removes the impedance mismatch and lets us delete it. ## Out of scope The issue calls out a possible follow-up around `Anthropic-Beta: interleaved-thinking-2025-05-14` for adaptive-only models; best evidence is that Opus 4.7 still accepts those flags, so this PR is a no-op there. <details> <summary>Decision log</summary> - `bedrockModelSupportsAdaptiveThinking` now also returns `true` for adaptive-only models. That keeps the existing `convertAdaptiveThinkingForBedrock` branch from running on Opus 4.7 (which would otherwise be incorrect; `adaptive` is the supported native type there), and the new `convertEnabledThinkingForBedrock` runs only for adaptive-only models via the explicit `bedrockModelRequiresAdaptiveThinking` switch case. The two model sets are disjoint by construction. - The reverse conversion does not derive `output_config.effort` from `budget_tokens / max_tokens`. The two thinking shapes encode different intents (`enabled+budget` is "give me exactly N tokens," `adaptive[+effort]` is "model, pick a budget, optionally biased") and there is no canonical mapping between them. An earlier draft of this PR derived effort via midpoints of an invented anchor table; it was symmetric-looking but lossy and required a lot of scaffolding (sorted anchors, init-time invariant guard, round-trip tests) to keep two halves consistent. The reverse direction now just rewrites the shape, which is honest about the information loss and matches platform-defined adaptive behavior when no effort hint is present. - `output_config.format` is stripped only for adaptive-only models. Other Bedrock models either don't get `output_config` through at all (top-level strip handles them) or accept it via a beta flag that may imply broader feature support. Easy to widen if the same 400 shows up elsewhere. 
- I chose `variadic exemptFields ...string` over passing the model down to `removeUnsupportedBedrockFields`, to keep that function focused on stripping and to localise the model-aware policy in `augmentRequestForBedrock`. </details>	2026-05-15 10:11:41 +02:00
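The `enabled` to `adaptive` rewrite described in the commit above can be illustrated with a minimal Go sketch. It assumes the request body has already been decoded into a generic map; `convertEnabledThinking` is a hypothetical name chosen for illustration and is not the aibridge `RequestPayload.convertEnabledThinkingForBedrock` implementation.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// convertEnabledThinking is a hypothetical helper (not the aibridge code).
// It rewrites thinking: {type: "enabled", budget_tokens: N} into
// thinking: {type: "adaptive"} and reports whether it changed the payload.
func convertEnabledThinking(payload map[string]any) bool {
	thinking, ok := payload["thinking"].(map[string]any)
	if !ok {
		return false // no thinking block, nothing to convert
	}
	if kind, _ := thinking["type"].(string); kind != "enabled" {
		return false // already adaptive, disabled, or unknown
	}
	// The budget hint is dropped. No effort label is derived from it,
	// and output_config is left untouched.
	payload["thinking"] = map[string]any{"type": "adaptive"}
	return true
}

func main() {
	// Example request body; the field values are illustrative only.
	raw := []byte(`{
		"model": "us.anthropic.claude-opus-4-7",
		"max_tokens": 4096,
		"thinking": {"type": "enabled", "budget_tokens": 2048},
		"output_config": {"effort": "high"}
	}`)

	var payload map[string]any
	if err := json.Unmarshal(raw, &payload); err != nil {
		panic(err)
	}

	changed := convertEnabledThinking(payload)

	out, err := json.Marshal(payload)
	if err != nil {
		panic(err)
	}
	fmt.Println("changed:", changed)
	fmt.Println(string(out))
}
```

Because the sketch only replaces the `thinking` key, a caller-supplied `output_config.effort` survives the rewrite, which is the behavior the commit message describes.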
Thomas Kosiewski	96ea2465b7	build(coderd/database/gen/dump): fall back to embedded postgres without docker (#25332 ) Generating `coderd/database/dump.sql` previously required a Docker-compatible socket via `ory/dockertest`. Contributors using runtimes that don't expose one (e.g. Apple's `container` CLI) hit a panic during `make gen`: ``` build: panic: open containerized database failed: open container: could not start resource: dial unix /var/run/docker.sock: connect: no such file or directory ``` Fall back to `fergusstrange/embedded-postgres` (already a direct module dep, used by `scripts/develop/dbrecovery.go`) when `dbtestutil.OpenContainerized` fails. The server's timezone is forced to UTC so `timestamptz` DEFAULT expressions canonicalize identically to the Docker-based path; otherwise the host's local TZ leaks into the dump as values like `'0001-12-31 23:06:32+00 BC'`. `PGDumpSchemaOnly` still needs `pg_dump` v13.x on PATH (the embedded-postgres archive ships only `initdb`/`postgres`/`pg_ctl`). When neither `pg_dump` nor `docker` is available, the existing error is supplemented with install hints for `mise`, `brew`, and `apt`. CI keeps using the Docker path unchanged; the fallback is local-dev-only and produces a byte-identical `dump.sql`. 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Signed-off-by: Thomas Kosiewski <tk@coder.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 09:39:05 +02:00
Ethan	5e701d3075	test: fix TestWatcher_SharedParentRefcount on macOS (#25379 ) `TestWatcher_SharedParentRefcount` was deterministically broken on macOS: `t.TempDir()` lives under `/var` which is a symlink to `/private/var`, but the watcher canonicalizes paths via `filepath.EvalSymlinks` before storing them, so the test's `w.dirs[dir]` lookup missed and returned `0` instead of `2`. Adds `testutil.TempDirResolved`, a shared helper that returns `t.TempDir()` with symlinks resolved and falls back to the raw temp dir on error (Windows-friendly). Migrates the matching inline `EvalSymlinks(t.TempDir())` callsites in `agent/agentgit/agentgit_test.go` to use it. Closes https://github.com/coder/internal/issues/1531	2026-05-15 17:37:08 +10:00
Ethan	a59b951565	test: skip stale notification chatd flakes (#25376 ) These chatd tests are flaking for the same stale control-notification race tracked by CODAGT-353, so this change skips the newly reflaking advisor-chain and `TestPatchChatMessage/ChangesModel` tests and rewrites the older `TODO(hugodutka)` skips to point at the same root cause. This keeps the known flakes documented consistently until the chatd notification-flow refactor lands. Closes CODAGT-427 Closes https://github.com/coder/internal/issues/1510	2026-05-15 17:36:48 +10:00
Callum Styan	81212470fd	feat: implement basic (MVP version) of a fake agent + manager (#25070) This PR introduces a "fake agent" + manager, which can be used during scaletests to run a single executable that acts as many workspace agents. The goal is to provide a much lighter-weight implementation of a workspace, in terms of resource cost and startup time, when executing scaletests. --------- Signed-off-by: Callum Styan <callumstyan@gmail.com> Co-authored-by: Mux <noreply@coder.com>	2026-05-14 14:46:36 -07:00
Yevhenii Shcherbina	238968cfa0	feat: add per-group AI budget table and endpoints (#25203 ) Closes https://linear.app/codercom/issue/AIGOV-284/add-group-budgets-table-and-crud-api ## Summary Adds the `group_ai_budgets` table and the following endpoints: - `GET /api/v2/groups/{group}/ai/budget` - `PUT /api/v2/groups/{group}/ai/budget` - `DELETE /api/v2/groups/{group}/ai/budget` Each group may have at most one budget row. If no row exists, no budget is enforced. ### Feature gate Added `RequireFeatureMW(FeatureAIBridge)` on the `/ai/budget` sub-route. ## RBAC Authorization reuses `rbac.ResourceGroup` with the existing `.InOrganization(...).WithID(...)` scoping model. The `dbauthz` wrappers load the parent `groups` row and authorize against it. No new resource type is introduced. As a result, anyone with `group:update` permissions (Owner, OrgAdmin, or UserAdmin within the organization) can manage AI budgets for that group. ## Read access for group members `database.Group.RBACObject()` grants `policy.ActionRead` to all members of the group through the group ACL: ```go func (g Group) RBACObject() rbac.Object { return rbac.ResourceGroup.WithID(g.ID). InOrg(g.OrganizationID). // Group members can read the group. WithGroupACL(map[string][]policy.Action{ g.ID.String(): { policy.ActionRead, }, }) } ``` Because the `GET` endpoint authorizes against the same loaded `Group` object, any group member can call: ```text GET /api/v2/groups/{group}/ai/budget ``` `PUT` and `DELETE` remain admin-only. The group ACL grants only `ActionRead`, so write operations continue to require role-based `group:update` permissions. ## Alternative considered A dedicated `rbac.ResourceGroupAiBudget` resource would allow budget management to be separated from general group administration. We decided not to add that complexity for now.	2026-05-14 15:54:37 -04:00
Garrett Delfosse	d97f5ae2a6	fix: add ESR support to release calendar script (#25205 ) The `update-release-calendar.sh` script did not account for Extended Support Release (ESR) versions. Running it would drop ESR entries (e.g. 2.24) from the calendar entirely or mark them as "Not Supported" instead of "Extended Support Release". ## Changes - Add `ESR_VERSIONS` array for tracking active ESR minor versions - Add `is_esr_version()` helper to check ESR membership - Extract `generate_release_row()` to reduce duplication - Prepend ESR versions older than the standard window - Override "Not Supported" status for ESR versions within the window > [!NOTE] > When new ESR versions are designated or old ones reach end of life, update the `ESR_VERSIONS` array at the top of the script. <!-- This PR was authored by Coder Agents -->	2026-05-14 15:35:30 -04:00
Tyler	d79cfcfe61	fix(site): move docs link to primary header on observability page (#25313 ) Move the "Read the docs" button from the Audit Logging subsection up to the primary Observability header's `actions` prop, matching the layout pattern used by General, Network, and other deployment settings pages. Also updates the docs URL from `/admin/security/audit-logs` to `/admin/monitoring` to reflect the page-level scope. > Generated by Coder Agents on behalf of @designertyler --------- Co-authored-by: TJ <tracy@coder.com>	2026-05-14 13:57:01 -05:00
Kayla はな	a43690d29b	chore: add storybook to .mcp.json (#25352)	2026-05-14 12:40:23 -06:00
Kayla はな	df5e16ed6d	fix(dogfood): install rust-src component (#25349)	2026-05-14 12:06:00 -06:00
Danielle Maywood	68baf84b8c	fix: hide empty execute tool calls (#25346)	2026-05-14 18:19:12 +01:00
35C4n0r	2871a02352	fix: use actual ai_task instances for HasAITasks (#25197 ) Previously, `hasAITaskResources()` scanned the Terraform graph for `coder_ai_task` node labels. The graph includes resource definitions regardless of `count`, so templates with `count = 0` were incorrectly marked as `HasAITasks = true`, causing them to appear on the `/tasks` page when no AI task resources would be created. Replace the graph-based check with `len(aiTasks) > 0`. The `aiTasks` slice is populated from state modules where Terraform has already evaluated `count`, so it correctly reflects actual resource instances. ref: https://linear.app/codercom/issue/ECO-39/make-coder-tasks-respect-count > Generated with [Coder Agents](https://coder.com/agents) --------- Signed-off-by: 35C4n0r <work.jaykumar@gmail.com> Signed-off-by: Jay Kumar <jay.kumar@coder.com>	2026-05-14 21:49:05 +05:30
Danielle Maywood	9ddfafe2b1	feat: add chat ACL database foundation (#25080)	2026-05-14 17:18:50 +01:00
Nick Vigilante	507ece3bc4	docs: Fix the display of the tab block in External Workspaces (#25341) Fixes DOCS-169	2026-05-14 12:04:45 -04:00
Cian Johnston	15c958fea2	fix(testutil): ensure FakeSink does not swallow logs (#25185 ) `FakeSink` was silently capturing log entries without forwarding them to `testing.TB.Log`. This made debugging test failures harder because logs were invisible in `go test -v` output. Store `testing.TB` in `FakeSink` and call `t.Log` on each entry, guarded by a check to avoid logging after the test has finished. Split out from #25012. > 🤖 Generated with [Coder Agents](https://coder.com)	2026-05-14 16:51:44 +01:00
Spike Curtis	132fa87bf3	fix: only embed Azure roots on darwin (#25312) Partially reverts #25136 for non-darwin platforms. In general we want to avoid pinning trust roots to embedded certs, since that limits operational flexibility. If Azure changes CAs, operators should, at most, need to update the OS trust store to keep Coder working correctly. Embedding roots means they would instead need to upgrade the Coder binary. Since Coder Server on macOS is not really supported for production use, embedding only in that case to ease development and testing is OK.	2026-05-14 11:45:21 -04:00
Ethan	e37bf4f7be	ci: bump paralleltestctx to v0.0.2 (#25323 ) ## Summary - bump `github.com/coder/paralleltestctx` from v0.0.1 to v0.0.2 - pick up the latest paralleltestctx timeout-context detection improvements in `go tool ... paralleltestctx` runs	2026-05-15 00:14:38 +10:00
Danny Kopping	841b777ccd	feat: add ai_providers table, queries, dbauthz, audit, RBAC (#24892)	2026-05-14 16:10:46 +02:00
Jaayden Halko	acf57b3b35	feat(site): add legacy auto sync helper and tritan cleanup (#25180 ) ## Summary - Add `legacyAutoToSync` to map legacy auto-family theme preferences to light and dark sync pairs. - Expand colorblind theme tests for legacy auto-family handling. - Move tritan success and git-added surfaces onto the sky-blue palette in theme roles and CSS variables. ## Dependencies - None. This can merge independently before the dropdown work. ## Validation - `pnpm -C site exec vitest run --project=unit src/theme/colorblind.test.ts` - `pnpm -C site lint:types` - Pre-commit hook passed on the branch commit.	2026-05-14 14:43:13 +01:00
Danielle Maywood	25a803221e	feat: add shell tool display mode preference (#25029)	2026-05-14 14:25:07 +01:00
Jaayden Halko	3a070a83dd	fix(site/src/pages/AgentsPage): theme diff changed-line backgrounds (#25179 ) ## Summary - Update AgentsPage diff viewer CSS so changed lines use theme-aware git added and deleted surfaces. - Add unit coverage for the changed-line CSS variables and line-type selectors. ## Dependencies - None. This is independent of the theme mode API and dropdown stack. ## Validation - `pnpm -C site exec vitest run --project=unit src/pages/AgentsPage/components/ChatElements/tools/utils.test.ts` - Pre-commit hook passed on the branch commit.	2026-05-14 14:11:30 +01:00
Cian Johnston	da2fa082bb	fix(coderd/httpapi): CloseRead on test conns to ensure pings pong (#25184 ) The `websocketPair` test helper was not calling `CloseRead` on either side of the connection. Without `CloseRead`, the websocket library does not process control frames (ping/pong), so the heartbeat tests were passing only because no pings had yet failed, not because pings were actually succeeding. Add `CloseRead` on both the client and server connections so that pong frames are delivered in response to pings. Split out from #25012. > 🤖 Generated with [Coder Agents](https://coder.com)	2026-05-14 13:54:59 +01:00
Max Schwenk	f3e90b334d	fix(cli): show sync wait dependencies (#25089 ) ## Problem `coder exp sync want` and `coder exp sync start` both printed generic success messages, which hid the dependency units involved in startup coordination. Before, declaring dependencies with `sync want` printed: ```text Success ``` Before, `sync start` printed while waiting, then finished with another generic success message: ```text Waiting for dependencies of unit 'test-unit' to be satisfied... Success ``` ## Solution Print the dependency units in both cases, using wording that matches where the command is in the lifecycle. After, `sync want` prints the dependencies it declared for the unit: ```text Unit "test-unit" declared dependencies: [dep-unit] ``` After, `sync start` enumerates the dependencies while it is waiting, then prints the same dependencies after the unit starts executing: ```text Unit "test-unit" is waiting for dependencies to be satisfied: [dep-unit, dep-unit-2] Unit "test-unit" finished waiting for dependencies: [dep-unit, dep-unit-2] ``` The sync golden tests now cover the updated output, including multiple dependencies for `sync start`.	2026-05-14 14:45:20 +02:00
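The three messages quoted in the commit above share one shape: the unit name in double quotes followed by a bracketed, comma-separated dependency list. A minimal Go sketch of that formatting follows; the function names are hypothetical and this is not the `coder exp sync` implementation.

```go
package main

import (
	"fmt"
	"strings"
)

// formatDeps renders dependencies as "[a, b, c]".
func formatDeps(deps []string) string {
	return "[" + strings.Join(deps, ", ") + "]"
}

// The three helpers below are hypothetical; they only reproduce the
// message wording quoted in the commit description.
func declaredMessage(unit string, deps []string) string {
	return fmt.Sprintf("Unit %q declared dependencies: %s", unit, formatDeps(deps))
}

func waitingMessage(unit string, deps []string) string {
	return fmt.Sprintf("Unit %q is waiting for dependencies to be satisfied: %s", unit, formatDeps(deps))
}

func finishedMessage(unit string, deps []string) string {
	return fmt.Sprintf("Unit %q finished waiting for dependencies: %s", unit, formatDeps(deps))
}

func main() {
	fmt.Println(declaredMessage("test-unit", []string{"dep-unit"}))

	deps := []string{"dep-unit", "dep-unit-2"}
	fmt.Println(waitingMessage("test-unit", deps))
	fmt.Println(finishedMessage("test-unit", deps))
}
```

Note that `%q` escapes special characters in the unit name, which only matches the quoted output exactly for plain names such as `test-unit`.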
Michael Suchacz	cb37047dce	feat: dedicated /prompts endpoint for chat history cycle (#25083 ) Follow-up to #25004. The merged change cycles only through messages already loaded in the in-memory chat store (page size 50). Long chats and chats whose oldest turns have rolled out of the page lose access to their earlier prompts in the composer's up/down arrow cycle. This PR adds a dedicated server endpoint that returns the full prompt history, newest first, and rewires the composer to use it. ## What changed ### Endpoint `GET /api/experimental/chats/{chat}/prompts?limit=N` ```go type ChatPrompt struct { ID int64; Text string } type ChatPromptsResponse struct { Prompts []ChatPrompt } ``` - `limit`: `0..2000`. `0` (the default) is treated as the server-side default of 500; out-of-range values return `400`. Negative values are rejected by the SDK's `PositiveInt32` parser before reaching the handler. - Auth: parent-chat read in `dbauthz`, mirroring `GetChatMessagesByChatID`. - The SQL filters `role='user'`, `deleted=false`, `visibility IN ('user','both')`, guards the lateral with `jsonb_typeof(content) = 'array'` so legacy V0 scalar-string rows are silently skipped, then unrolls `content` JSONB with `WITH ORDINALITY` and concatenates only `type='text'` parts in original order via `string_agg(... ORDER BY ordinality)`. Messages whose joined text is whitespace-only are dropped via `HAVING ... ~ '\S'` so cycling never lands on a blank entry. ### Partial index (migration `000494`) ```sql CREATE INDEX idx_chat_messages_user_prompts ON chat_messages (chat_id, id DESC) WHERE deleted = false AND role = 'user' AND visibility IN ('user', 'both'); ``` The partial WHERE matches the query's filter exactly and the key order matches `ORDER BY id DESC`, so the planner gets both the filter and the ordering from the index without a sort step. 
`EXPLAIN ANALYZE` on a synthetic 51-chat × 5,000-message dataset (≈260k rows, 10k user prompts in the target chat, `random_page_cost=1.1`):

| | Plan | Buffers hit | Time |
|---|---|---|---|
| Without index | `Index Scan Backward using chat_messages_pkey`, 250,848 rows removed by filter | 6,683 | 32.4 ms |
| With index | `Index Scan using idx_chat_messages_user_prompts`, no filter | 38 | 1.3 ms |

≈25× faster, 175× fewer buffer hits. ### Frontend - `chatPromptsKey` / `chatPromptsQuery` factories in `site/src/api/queries/chats.ts` (`staleTime: 30s`, `enabled: chatId !== ""`, asks the server for 500 prompts). - `ChatPageContent.tsx` replaces the in-memory derivation with `useQuery(chatPromptsQuery(chatId ?? ""))`. The composer's existing `cycleHistorySnapshotRef` anchors the in-flight cycle so a refetch arriving mid-cycle cannot shift the indexed prompt out from under the user. - `getEditableUserMessagePayload` now concatenates user-message text parts verbatim, mirroring the server's `string_agg(part->>'text', '' ORDER BY ordinality)`, instead of routing through the streaming-oriented `parseMessageContent` / `appendText` pipeline (which drops whitespace-only chunks; that is correct for assistant streams but wrong for a user's persisted message). This keeps the cycle and the edit path in agreement on the same message. File blocks are still pulled separately via `parseMessageContent(...).blocks.filter(isEditableUserMessageFileBlock)`. - Cache invalidation in `createChatMessage.onSuccess`, `editChatMessage.onSettled`, and `useChatStore.upsertCacheMessages` (only when an upserted message has `role === "user"`). - Page-level stories pre-seed `chatPromptsKey(CHAT_ID)` from the same `messagesData` to keep them offline. 
## Tests - New `TestGetChatUserPrompts` in `coderd/exp_chats_test.go` with five subtests: - `NewestFirstFiltering` — multi-part concatenation, non-text parts skipped, whitespace-only filtered, soft-deleted excluded, `model`-only visibility excluded, assistant-role excluded by `cm.role = 'user'`, legacy V0 scalar row silently excluded by the `jsonb_typeof` guard, ordering newest first. - `LimitClampsResults` — explicit `limit=2` returns the two newest prompts. - `InvalidLimitRejected` — `limit=5000` is `400 Bad Request`. - `NotFoundForOtherUsers` — a separate user in the same org gets `404`, not the prompts. - `EmptyResultIsJSONArray` — zero-message chat and assistant-only chat both return `Prompts: []` (non-nil, empty). - New unit test in `messageParsing.test.ts` asserting that `getEditableUserMessagePayload(["hello", " ", "world"])` returns `"hello world"`, locking in the agreement with the SQL `string_agg`. - `dbauthz_test.go` adds the `MethodTestSuite.TestChats/GetChatUserPromptsByChatID` entry, asserting parent-chat `policy.ActionRead`. - `pnpm test src/pages/AgentsPage` — 1159 passed, 2 skipped. - `make gen` produces no diff. ## Manual verification Seeded a dev chat with Claude Sonnet 4.6 via the aibridge Anthropic provider and posted 20 user prompts end-to-end. Verified that the `/prompts` endpoint returns 20 rows newest-first, that `limit=10` clamps correctly, that `limit=0` uses the server default of 500, and that the up/down keyboard cycle in the composer walks the same sequence (and reverses correctly back to the empty draft). ## Out of scope - Cross-chat history. - Per-user opt-out for the cycle. - File-reference / attachment cycling — the cycle continues to reproduce plain text only, by design. 
<details> <summary>Implementation plan</summary> # CODAGT-319 Follow-up — Dedicated `/prompts` endpoint ## Context The merged feature ([#25004](https://github.com/coder/coder/pull/25004) / [`d32842f`](https://github.com/coder/coder/commit/d32842f)) cycles only through messages already loaded in the in-memory chat store, which is capped at the first 50 messages of the current page. Long chats and chats whose oldest turns have rolled out of the page can no longer recall their full prompt history. This follow-up exposes a dedicated server endpoint that returns the user-authored prompts in a chat, newest first, and rewires the composer to use it. ## Design ### Endpoint `GET /api/experimental/chats/{chat}/prompts?limit=N` Returns: ```go type ChatPrompt struct { ID int64 Text string } type ChatPromptsResponse struct { Prompts []ChatPrompt } ``` - `limit`: `0..2000`. `0` (the default) → server-side default of 500. The wire-level default is encoded in SQL as `COALESCE(NULLIF($limit, 0), 500)`. Negatives are rejected upstream by `PositiveInt32`; the handler only caps the upper bound. - Auth: parent-chat read in `dbauthz`, mirroring `GetChatMessagesByChatID`. - Listed under the experimental router so we can iterate without API guarantees. ### SQL The query lives in `coderd/database/queries/chats.sql` as `GetChatUserPromptsByChatID`: - Filters `role='user'`, `deleted=false`, `visibility IN ('user','both')` to mirror the composer's "what the user actually typed and can re-send" contract. - Guards the lateral with `jsonb_typeof(content) = 'array'` so legacy V0 rows whose content is a scalar JSON string (predates migration `000434`) are silently excluded instead of raising `"cannot extract elements from a scalar"`. - Unrolls `content` JSONB with `jsonb_array_elements WITH ORDINALITY` and concatenates only `type='text'` parts, preserving original order via `string_agg(... ORDER BY ordinality)`. - Casts the result to `text` so sqlc emits a `string` field instead of `[]byte`. 
- Drops whitespace-only prompts via `HAVING string_agg(...) ~ '\S'` so cycling never lands on a blank entry. - Orders by `cm.id DESC` (`id` is a sequence, so this is "newest first" without relying on `created_at`). ### Index New partial index added in migration `000494`: ```sql CREATE INDEX idx_chat_messages_user_prompts ON chat_messages (chat_id, id DESC) WHERE deleted = false AND role = 'user' AND visibility IN ('user', 'both'); ``` The partial WHERE clause matches the query's filter exactly, so the planner can use the index for both filtering and ordering without a sort step. ### Frontend - `chatPromptsKey(chatId)` and `chatPromptsQuery(chatId)` factories in `site/src/api/queries/chats.ts`. `staleTime: 30s`, `enabled: chatId !== ""`. Asks the server for 500 prompts (well below the 2000 max, plenty for the cycle). - `ChatPageContent.tsx` replaces the in-memory derivation with `useQuery(chatPromptsQuery(chatId ?? ""))`. The composer's `cycleHistorySnapshotRef` already takes a stable snapshot at cycle entry, so a refetch arriving mid-cycle cannot shift the indexed prompt out from under the user. - `getEditableUserMessagePayload` extracts the edit-path text from raw user-message parts (filter `type === "text"`, join verbatim) instead of going through `parseMessageContent` / `appendText`, which is built for assistant streams and intentionally drops whitespace-only chunks. Without this, cycling and clicking Edit on the same message could produce different draft text for messages with whitespace-only interleaved text parts. - Cache invalidation: `createChatMessage.onSuccess`, `editChatMessage.onSettled`, and `useChatStore.upsertCacheMessages` (when at least one upserted message has `role === "user"`) all invalidate `chatPromptsKey(chatId)`. 
### Tests - `TestGetChatUserPrompts` (`coderd/exp_chats_test.go`) covers: - `NewestFirstFiltering` — multi-part concatenation, non-text parts skipped, whitespace-only filtered, soft-deleted excluded, `model`-only visibility excluded, assistant-role excluded by `cm.role = 'user'`, legacy V0 scalar row silently excluded by the `jsonb_typeof` guard, ordering newest first. - `LimitClampsResults` — explicit `limit=2` returns the two newest prompts. - `InvalidLimitRejected` — `limit=5000` is `400 Bad Request`. - `NotFoundForOtherUsers` — a separate user in the same org gets `404`, not the prompts. - `EmptyResultIsJSONArray` — zero-message chat and assistant-only chat both return `Prompts: []` (non-nil, empty). - `messageParsing.test.ts` adds a unit test asserting that `getEditableUserMessagePayload(["hello", " ", "world"])` returns `"hello world"`, locking in the agreement with the SQL `string_agg`. - `dbauthz_test.go` adds the `MethodTestSuite.TestChats/GetChatUserPromptsByChatID` entry, asserting the parent-chat `policy.ActionRead`. ## Out of scope - Cross-chat history. - Per-user opt-out for the cycle. - File-reference / attachment cycling — the cycle still reproduces plain text only, by design. </details> <details> <summary>coder-agents-review history</summary> Four review rounds, eight unique findings, all addressed in this PR (approved twice). Rebased onto `main` twice after R4: first to pick up new migrations `000491` / `000492`, then again for `000493_idx_chat_diff_statuses_url_lower`. The prompts-index migration was renumbered `000491 → 000493 → 000494` via `coderd/database/migrations/fix_migration_numbers.sh`; no other diff changes. 
| Round | Head | Outcome |
|---|---|---|
| R1 | `725422ab` | `COMMENTED`: 7 findings (DEREM-1..7) |
| R2 | `ab2a8936` | `COMMENTED`: 1 new (DEREM-10) + 1 reraised (DEREM-5) |
| R3 | `648c5d1f` | `APPROVED`: 7 fixed, DEREM-5 deferred via #25125 |
| R4 | `93b6f450` | `APPROVED`: DEREM-5 also fixed in-PR, #25125 closed |

| ID | Where | Resolution |
|---|---|---|
| DEREM-1 | `chats.sql` | Added `jsonb_typeof(content) = 'array'` guard against V0 scalar rows |
| DEREM-2 | `exp_chats.go` | Removed dead `limit < 0` branch (SDK rejects upstream) |
| DEREM-3 | `useChatStore.ts` | Rewrote misleading invalidation comment |
| DEREM-4 | `exp_chats_test.go` | `NewestFirstFiltering` now inserts an assistant-role message so the `role='user'` filter is exercised end-to-end |
| DEREM-5 | `messageParsing.ts` | Rewrote `getEditableUserMessagePayload` to concatenate text parts verbatim, mirroring the SQL `string_agg` |
| DEREM-6 | `exp_chats.go` | Tightened swagger doc + error message to spell out the 0–2000 range |
| DEREM-7 | `exp_chats_test.go` | Added `EmptyResultIsJSONArray` subtest |
| DEREM-10 | `exp_chats_test.go` | `NewestFirstFiltering` now inserts a raw V0 scalar-content row; verified locally that removing the guard makes the test fail |

</details> --- This PR was created on behalf of @ibetitsmike by Coder Agents.	2026-05-14 12:43:12 +02:00
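Two rules from the `/prompts` commit above lend themselves to a small Go sketch: the `limit` handling (`0` means the default of 500, values above 2000 are rejected) and the text-part concatenation that mirrors `string_agg(... ORDER BY ordinality)` with whitespace-only prompts dropped. All names here are hypothetical, the real logic lives in the handler and SQL, and `strings.TrimSpace` is only an approximation of the PostgreSQL `~ '\S'` check. The sketch also rejects negative limits itself, whereas the commit says the SDK's `PositiveInt32` parser does that upstream.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

const (
	defaultPromptLimit = 500  // used when limit is 0
	maxPromptLimit     = 2000 // upper bound of the accepted range
)

var errLimitOutOfRange = errors.New("limit must be between 0 and 2000")

// effectiveLimit is a hypothetical helper that maps the wire-level limit
// to the value a query would use.
func effectiveLimit(limit int32) (int32, error) {
	if limit < 0 || limit > maxPromptLimit {
		return 0, errLimitOutOfRange
	}
	if limit == 0 {
		return defaultPromptLimit, nil
	}
	return limit, nil
}

// part is a simplified, hypothetical view of one message content part.
type part struct {
	Type string
	Text string
}

// promptText joins text parts verbatim in their original order and
// reports false when the result is empty or whitespace-only.
func promptText(parts []part) (string, bool) {
	var b strings.Builder
	for _, p := range parts {
		if p.Type == "text" {
			b.WriteString(p.Text)
		}
	}
	joined := b.String()
	if strings.TrimSpace(joined) == "" {
		return "", false
	}
	return joined, true
}

func main() {
	for _, l := range []int32{0, 2, 5000} {
		n, err := effectiveLimit(l)
		fmt.Println(l, n, err)
	}

	text, ok := promptText([]part{
		{Type: "text", Text: "hello"},
		{Type: "file", Text: "ignored"},
		{Type: "text", Text: " "},
		{Type: "text", Text: "world"},
	})
	fmt.Printf("%q %v\n", text, ok)
}
```

Joining verbatim, rather than trimming each part, is what keeps a whitespace-only part between two words from being lost.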
Thomas Kosiewski	f71bccf53f	ci(.github/actions/setup-node): verify active Node version (#25143) Updates the shared setup-node composite action to current Node 24-based releases of `pnpm/action-setup` and `actions/setup-node`. This avoids the deprecated Node 20 action runtime seen in CODAGT-178 while keeping the third-party actions pinned by SHA. Adds an explicit post-setup check that fails inside Setup Node when `node --version` is not `v22.19.0`, so self-hosted runner/toolcache mismatches are surfaced before `pnpm install` reports a dependency engine error. Closes https://github.com/coder/internal/issues/1457 Generated by Coder Agents.	2026-05-14 12:07:09 +02:00
Danielle Maywood	f7d1ecaece	refactor(site): inline single-use class string constant in AgentSetupNotice (#25086)	2026-05-14 10:18:50 +01:00
Cian Johnston	581f3bdd14	fix(coderd/httpapi): stop writing websocket frames to ResponseRecorder in test (#25284 ) The `mockEventSenderWrite` function in `newOneWayWriter()` wrote WebSocket frame data to both the `net.Pipe` and the `httptest.ResponseRecorder`. After `websocket.Accept()` calls `WriteHeader(101)`, the recorder rejects body writes with `"response status code does not allow body"`. When `HeartbeatClose` sends a ping, the control frame flush routes through the recorder, producing an ERROR-level log that `slogtest` catches as a test failure. Removed the `recorder.Write(b)` call from the write function. The recorder is only needed for header/status inspection; WebSocket frame data should only go through the `net.Pipe`. Closes https://github.com/coder/internal/issues/1521 > 🤖 Generated by Coder Agents	2026-05-14 09:15:14 +01:00
Jaayden Halko	024132e8a4	feat: add theme_mode, theme_light, theme_dark to UserAppearanceSettings (#25076 ) Part 1: Backend portion of a change broken into 2 PRs. Part 2: #25077 Adds three new UserAppearanceSettings fields (theme_mode, theme_light, theme_dark) on top of the existing theme_preference and terminal_font. Replaces GetUserThemePreference and GetUserTerminalFont with a single GetUserAppearanceSettings aggregate query. The PUT handler is wrapped in db.InTx so sync-mode's mode + slot writes can never half-apply.	2026-05-14 05:44:05 +01:00
Ethan	d147dd3bdd	feat(site/src/pages/AgentsPage/components): show workspace quota in usage indicator (#25168 ) This updates the Agents sidebar usage indicator to surface workspace quota alongside AI spend limits. When both signals are active, the compact trigger renders stacked bars in the same order as the dropdown instead of collapsing them into a single percent. The dropdown still shows the full labels, percentages, and details for each usage section, and Storybook coverage now exercises the combined sidebar state. <img width="315" height="261" alt="image" src="https://github.com/user-attachments/assets/e5cfc276-2cc0-4dc9-9400-6d1b829e75e2" /> <img width="320" height="243" alt="image" src="https://github.com/user-attachments/assets/506ae8ad-3d93-4857-9cdb-b3cf4142772d" /> <img width="314" height="353" alt="image" src="https://github.com/user-attachments/assets/5af3644f-f155-43a4-bae9-91b33a0a4333" /> <img width="322" height="349" alt="image" src="https://github.com/user-attachments/assets/9ae4ae55-55aa-4a2f-856e-f462793f389e" /> Relates to CODAGT-197	2026-05-14 11:59:00 +10:00
Ethan	a35f71cd8a	fix(coderd/x/chatd): retry HTTP/2 stream resets (#25170 ) Mid-stream HTTP/2 peer resets from LLM providers can arrive after a 200 streaming response has already emitted provisional parts. Previously those resets fell through as generic non-retryable errors because `stream ID` messages did not match retryable transport signals, and stream IDs could be misread as HTTP statuses. Classify retryable HTTP/2 RST_STREAM codes as transient timeout failures, ignore stream IDs during status extraction, and keep the existing `retry` event as the rollback boundary for provisional message parts so replacement attempts do not replay failed-attempt output. Closes CODAGT-382	2026-05-14 11:40:43 +10:00
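The classification described in a35f71cd8a can be sketched roughly as follows. This is an illustrative sketch only, not the code from the commit: the names `isRetryableStreamReset` and `extractStatus`, the regular expressions, and the set of codes treated as retryable are all assumptions. The message shape (`stream error: stream ID N; CODE`) follows how Go's HTTP/2 library formats stream errors.

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

var (
	// Matches the "stream ID <n>" fragment so it can be dropped before status extraction.
	streamIDRe = regexp.MustCompile(`stream ID \d+`)
	// Captures the RST_STREAM error code name that follows the stream ID.
	rstCodeRe = regexp.MustCompile(`stream error: stream ID \d+; ([A-Z_]+)`)
	// Matches a standalone 4xx/5xx HTTP status in an error message.
	statusRe = regexp.MustCompile(`\b([45]\d{2})\b`)
)

// retryableRSTCodes is an assumed set; the commit does not list which codes it treats as retryable.
var retryableRSTCodes = map[string]bool{
	"INTERNAL_ERROR": true,
	"REFUSED_STREAM": true,
	"CANCEL":         true,
}

// isRetryableStreamReset reports whether msg describes a peer reset with a code in the assumed set.
func isRetryableStreamReset(msg string) bool {
	m := rstCodeRe.FindStringSubmatch(msg)
	return m != nil && retryableRSTCodes[m[1]]
}

// extractStatus returns the first 4xx/5xx status in msg, or 0 if there is none.
// Stream IDs are removed first so "stream ID 503" is not misread as HTTP 503.
func extractStatus(msg string) int {
	cleaned := streamIDRe.ReplaceAllString(msg, "")
	m := statusRe.FindStringSubmatch(cleaned)
	if m == nil {
		return 0
	}
	code, _ := strconv.Atoi(m[1])
	return code
}

func main() {
	msg := "stream error: stream ID 503; INTERNAL_ERROR; received from peer"
	fmt.Println(isRetryableStreamReset(msg), extractStatus(msg)) // prints "true 0"
}
```

Matching on the formatted message keeps the sketch dependency-free; code with access to the typed error could inspect the error code directly instead.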
Kayla はな	2943bf5f21	fix(site): use ExternalImage for icon URLs (#25315 )	2026-05-13 17:52:05 -06:00
Danielle Maywood	ef1093d0dd	fix(site): hide sticky metadata user messages (#25316 )	2026-05-13 22:27:38 +01:00
Nick Vigilante	7aaa8485db	docs: update screenshot to point to generic URL (#25314 ) At present, the docs point to an internal URL, so I'm updating the screenshot to point to a fictitious address. Fixes DOCS-59	2026-05-13 17:20:09 -04:00
Michael Suchacz	d1a471e29e	fix(coderd/x/chatd): retune subagent selection guidance (#25311 ) > Mux working on behalf of Mike. ## Summary - retune chatd subagent guidance to prefer `general` for substantial delegated work, including read-only synthesis and planning support - narrow `explore` guidance to repository-local code lookup and bounded tracing - add regression tests for planning, spawn tool, and Plan Mode guidance text ## Tests - `go test ./coderd/x/chatd -run 'Test(DefaultSystemPromptPlanningGuidance_SteersSubagentSelection\|SpawnAgent_DescriptionSteersGeneralForSubstantialResearch\|SpawnAgent_PlanModeDescriptionOmitsComputerUse\|PlanningOverlaySubagentGuidance_UsesPlanModeSafeDescriptions\|ExploreSubagentIsReadOnly)$'` - `make lint` - `make test TEST_PACKAGES=./coderd/x/chatd RUN=Guidance && make test TEST_PACKAGES=./coderd/x/chatd RUN=Description` - pre-commit hook during `git commit`	2026-05-13 23:10:21 +02:00
Kayla はな	341051ceee	fix: exclude service accounts from license seat count (#24401 )	2026-05-13 13:55:53 -07:00
Zach	e0be9bf213	feat: surface missing coder_secret requirements on resolve-autostart (#25081 ) Adds `dynamicparameters.EvaluateSecretMismatch` as a shared helper on top of the existing renderer, then wires it into the resolve-autostart handler so the UI can surface unsatisfied `coder_secret` requirements in a template alongside parameter mismatches for autostart. The lifecycle executor changes will land in a follow-up that depends on this helper. The UI changes that consume the new `secret_mismatch` field are also a follow-up. Generated with assistance from Coder Agents.	2026-05-13 14:20:02 -06:00
Steven Masley	0f505aa4da	chore: unhide flag to force unix filepaths in config-ssh (#25142 ) Docs now include this flag. The flag is now also visible on Linux and macOS, even though it is effectively a `no-op` there. Closes https://github.com/coder/coder/issues/24205	2026-05-13 14:59:33 -05:00
Michael Suchacz	38f586107d	refactor: remove agents TUI (#25190 )	2026-05-13 21:30:11 +02:00
Kayla はな	660fa9478f	style(site): use shorthand for boolean JSX props (#25096 )	2026-05-13 10:56:50 -06:00
George K	49c6191bbe	fix(coderd/azureidentity): add Azure IMDS G2 chain certificates (#25243 ) Azure IMDS attested data signatures can now chain through Microsoft TLS G2 RSA CA OCSP intermediates, then through the cross-signed Microsoft TLS RSA Root G2 certificate, before reaching DigiCert Global Root G2. coderd did not bundle the new G2 OCSP intermediates or the cross-signed Microsoft TLS RSA Root G2 bridge certificate, so it could fail to build a trusted chain for affected IMDS signatures. Related to: https://linear.app/codercom/issue/PLAT-205/bug-azure-instance-identity-verification-is-broken	2026-05-13 09:07:44 -07:00
Danielle Maywood	7fe4d97fd0	fix(site): align streaming thinking spacing (#25291 )	2026-05-13 17:05:34 +01:00
Thomas Kosiewski	b9b8d763e3	refactor(site/src/pages/AgentsPage): break AgentChatPage circular dep (#25287 ) `AgentChatPageView.tsx` imported `getPersistedSidebarTabId` / `savePersistedSidebarTabId` from `AgentChatPage.tsx`, which already imports `AgentChatPageView`, closing a cycle that `pnpm run lint:circular-deps` reports but doesn't fail on (dpdm defaults to exit code 0; the script is missing `--exit-code circular:1`). Move the three sidebar-tab localStorage helpers and the key prefix into `utils/sidebarTabStorage.ts` alongside `draftStorage.ts` and the other per-chat storage modules. Pure code move, no behavior change. After this change, `pnpm run lint:circular-deps` reports zero cycles. --------- Signed-off-by: Thomas Kosiewski <tk@coder.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 17:13:32 +02:00
Jaayden Halko	36200b625e	fix: fix flaky storybook test (#25272 )	2026-05-13 16:08:18 +01:00
Kyle Carberry	5040ab6fca	feat: filter chats by diff URL via the q search parameter (#24970 ) Adds a `diff_url:` term to the `q` search parameter on `GET /api/experimental/chats` so callers can look up the chat associated with a particular pull request, merge request, or any other URL persisted on the chat's diff status. ``` q=diff_url:"https://github.com/coder/coder/pull/123" ``` Match is case-insensitive. When the URL lives on a delegated sub-agent's diff status, the parent chat is returned so the relationship surfaces from a single lookup. <details> <summary>Design notes</summary> - Forge-agnostic. Reuses the existing `chat_diff_statuses.url` column rather than introducing a `pr:` vocabulary, since the SDK already documents the URL as "may point to a pull request or a branch page depending on whether a PR has been opened." Works for GitHub PRs, GitLab MRs, branch pages, etc. - Composes with `archived:`. The two terms can be combined: `q=archived:true diff_url:"..."`. - Case handling. The parser used to lowercase the entire `q` string up front, which would mangle URL path segments. Switched to lowercasing only the field key inside `searchTerms` (already happens there) and keeping the value as the caller typed it. The SQL comparison lowercases on both sides. - Validation. `diff_url` must be a syntactically valid HTTP(S) URL with a non-empty host. No forge-specific validation. - Index. Adds `idx_chat_diff_statuses_url_lower` on `LOWER(url)` so the lookup is cheap even on large datasets. - Sub-agent fan-in. `EXISTS` clause matches when the URL lives on the chat itself or any chat with `root_chat_id` equal to the chat's id, so a delegated sub-agent's PR pulls in its parent. - Deferred. Sentinels like `pr:any` / `pr:none` and a forge-agnostic state filter (`diff_state:open\|merged\|closed`) were intentionally left out of this change. They couple cleanly to a second forge or a clearer product call, and shipping them now would lock in vocabulary we may want to revisit. 
</details> ## Tests - `coderd/searchquery`: parser tests for valid URLs, case handling (key insensitive, value preserved), composition with `archived:`, and validation errors (non-HTTP scheme, missing host, malformed URL). - `coderd/exp_chats_test.go`: end-to-end coverage hitting `ListChats`. Verifies a root chat matches its own URL, a parent chat surfaces when only a sub-agent has the URL, lookups are case-insensitive, non-matching URLs return empty, and invalid URLs return `400`. --- _This PR was authored by a Coder Agent on behalf of @kylecarbs._	2026-05-13 11:06:42 -04:00
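The `diff_url:` handling described in 5040ab6fca (lowercase only the field key, preserve the value, require an HTTP(S) URL with a non-empty host) might look roughly like the sketch below. The function names `parseTerm` and `validateDiffURL` are hypothetical and are not taken from `coderd/searchquery`; the real parser may tokenize and report errors differently.

```go
package main

import (
	"errors"
	"fmt"
	"net/url"
	"strings"
)

// parseTerm splits a "key:value" search term at the first colon.
// Only the key is lowercased; the value keeps the caller's casing so
// URL path segments are not mangled. Surrounding quotes are stripped.
func parseTerm(term string) (key, value string, ok bool) {
	k, v, found := strings.Cut(term, ":")
	if !found {
		return "", "", false
	}
	return strings.ToLower(k), strings.Trim(v, `"`), true
}

// validateDiffURL enforces the rule from the commit message:
// a syntactically valid HTTP(S) URL with a non-empty host.
func validateDiffURL(raw string) error {
	u, err := url.Parse(raw)
	if err != nil {
		return fmt.Errorf("malformed URL: %w", err)
	}
	if u.Scheme != "http" && u.Scheme != "https" {
		return errors.New("diff_url must use http or https")
	}
	if u.Host == "" {
		return errors.New("diff_url must include a host")
	}
	return nil
}

func main() {
	key, value, _ := parseTerm(`DIFF_URL:"https://github.com/coder/coder/pull/123"`)
	fmt.Println(key, value, validateDiffURL(value))
}
```

Case-insensitive matching is left to the comparison step (the commit lowercases both sides in SQL), which is why the sketch does not normalize the value.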
Seth Shelnutt	8eb7051987	fix(scripts/ironbank): update base image to UBI9 and remove urllib3 (CVE-2026-44431) (#25217 ) The IronBank Dockerfile used UBI8-minimal:8.7 as its base image. IronBank has migrated images to UBI9 base, and the bundled urllib3 1.26.5 in the image triggers CVE-2026-44431 (sensitive headers leaked on cross-origin redirects via the low-level API). This updates the base image from UBI8-minimal to UBI9-minimal and explicitly removes python3-urllib3 after package installation. Coder is a Go binary and does not invoke Python at runtime, so urllib3 is unused. Refs [ENT-4](https://linear.app/codercom/issue/ENT-4/ironbank-v23111-update-urllib3-from-1265-to-fix-cve-2026-44431), [ENT-51](https://linear.app/codercom/issue/ENT-51/ironbank-main-update-base-image-urllib3-cve-2026-44431), [CVE-2026-44431](https://nvd.nist.gov/vuln/detail/CVE-2026-44431) > Generated by Coder Agents <details><summary>Decision log</summary> - Base image: Moved from `ubi8-minimal:8.7` to `ubi9-minimal:9.6` to align with IronBank's UBI9 migration and reduce overall vulnerability surface. - urllib3 removal: Added explicit `microdnf remove python3-urllib3` with error suppression (`\|\| true`) so the build succeeds whether or not the package is present in the base image. This handles both the minimal and full UBI9 base image variants that IronBank may use. - Crypto policies: RHEL 9 uses the same `/etc/crypto-policies/back-ends/.config` paths as RHEL 8; no changes needed. - Build script: Updated the `registry.access.redhat.com` override from `ubi8/ubi-minimal:8.7` to `ubi9/ubi-minimal:9.6` for local builds. </details>	2026-05-13 10:41:56 -04:00
Jakub Domeracki	1a1f06aa79	fix: verify PKCS7 signature on Azure instance identity tokens (#25286 ) Migrates Azure instance identity verification from `go.mozilla.org/pkcs7` and `github.com/fullsailor/pkcs7` to `github.com/smallstep/pkcs7`, using `VerifyWithChainAtTime` to validate both the PKCS7 signature and the certificate chain in one call. The previous code only verified the signer certificate against a set of intermediates/roots but did not verify that the PKCS7 signature itself covered the content, meaning tampered payloads could be accepted. The `Options` struct is restructured to accept `Roots`, `Intermediates`, and `CurrentTime` as explicit fields instead of embedding `x509.VerifyOptions`. The test helper `NewAzureInstanceIdentity` now builds a realistic 3-level certificate chain (Root CA -> Intermediate CA -> Signing Cert) matching real Azure trust hierarchy. New tests (`TestValidate_TamperedContent`, `TestValidate_UntrustedCertWithValidSignature`) confirm tampered and untrusted envelopes are rejected. Addresses GHSA-6x44-w3xg-hqqf. > [!NOTE] > This PR was authored by Coder Agents. 
<details> <summary>Implementation Plan</summary> ### Files Changed \| File \| Summary \| \|------\|---------\| \| `coderd/azureidentity/azureidentity.go` \| Replace `signer.Verify()` with `VerifyWithChainAtTime`; restructure `Options` struct; add `ParseCertificates()` helper \| \| `coderd/azureidentity/azureidentity_test.go` \| Add `testCertChain` builder, tampered-content and untrusted-cert tests; update existing tests for new `Options` API \| \| `coderd/coderd.go` \| Change `AzureCertificates` field from `x509.VerifyOptions` to `azureidentity.Options` \| \| `coderd/workspaceresourceauth.go` \| Pass `api.AzureCertificates` directly instead of wrapping \| \| `coderd/coderdtest/coderdtest.go` \| Migrate to `smallstep/pkcs7`; build 3-level cert chain in test helper \| \| `go.mod` / `go.sum` \| Add `github.com/smallstep/pkcs7`; remove `fullsailor/pkcs7` and `go.mozilla.org/pkcs7` \| </details>	2026-05-13 14:14:07 +00:00
Danielle Maywood	b52c0bdb56	fix(site/src/pages/AgentsPage/components): unify live thinking spacing and sizing (#25192 )	2026-05-13 12:51:49 +01:00
Jakub Domeracki	57b11d405f	fix(coderd): harden Azure identity certificate fetch (#25274 ) Security improvements: - Restrict cert fetches to a host+port allowlist (Microsoft and DigiCert on 80/443). - Route requests through a dedicated `http.Client` that resolves the host once and dials the validated IP directly, preventing DNS rebinding. - Reject loopback, private (RFC 1918 / IPv6 ULA), link-local, multicast, unspecified, CGNAT, benchmarking, and IPv4-mapped IPv6 addresses. - Cap the certificate response body at 1 MiB. - Log the underlying error via slog and return a generic detail to the caller to prevent information disclosure.	2026-05-13 12:51:44 +02:00
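The address checks listed in 57b11d405f could be approximated with the standard `net/netip` package, as in the sketch below. This is an assumption-laden illustration rather than the code from the commit: `isBlockedAddr` and `extraBlocked` are hypothetical names, and the host+port allowlist, the resolve-once dialer, and the 1 MiB body cap are not shown.

```go
package main

import (
	"fmt"
	"net/netip"
)

// extraBlocked holds ranges that net/netip has no dedicated predicate for:
// CGNAT shared address space and the benchmarking range.
var extraBlocked = []netip.Prefix{
	netip.MustParsePrefix("100.64.0.0/10"), // CGNAT (RFC 6598)
	netip.MustParsePrefix("198.18.0.0/15"), // benchmarking (RFC 2544)
}

// isBlockedAddr reports whether a resolved IP should be rejected before dialing.
// Unparseable input is treated as blocked so the check fails closed.
func isBlockedAddr(s string) bool {
	addr, err := netip.ParseAddr(s)
	if err != nil {
		return true
	}
	// IPv4-mapped IPv6 addresses are rejected outright, per the commit message.
	if addr.Is4In6() {
		return true
	}
	// IsPrivate covers RFC 1918 ranges and IPv6 unique local addresses.
	if addr.IsLoopback() || addr.IsPrivate() || addr.IsLinkLocalUnicast() ||
		addr.IsMulticast() || addr.IsUnspecified() {
		return true
	}
	for _, p := range extraBlocked {
		if p.Contains(addr) {
			return true
		}
	}
	return false
}

func main() {
	for _, ip := range []string{"8.8.8.8", "10.0.0.1", "169.254.169.254", "::ffff:10.0.0.1"} {
		fmt.Println(ip, isBlockedAddr(ip))
	}
}
```

A check like this only helps against DNS rebinding if the connection is then made to the same validated IP, which is the role the commit assigns to its dedicated `http.Client`.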


14315 Commits