coder

mirror of https://github.com/coder/coder.git synced 2026-06-04 13:38:21 +00:00

Author	SHA1	Message	Date
Ethan	cda460f5df	perf(coderd/chatd): skip same-replica stream DB rereads (#23218 ) ## Problem Scaletest follow-up storms showed that the chat stream path was doing a same-replica DB reread for every durable message it had already delivered locally. In a 600-chat / 10-turn run, `/stream`-attributed `GetChatMessagesByChatID` calls reached about 14.2k across 5,400 follow-up turns — roughly 2.63 rereads per turn. The primary coderd replicas saturated their DB pools at 60/60 open connections during the storm window. The root cause: when pubsub was active, `Subscribe()` suppressed local durable `message` events and relied entirely on pubsub notify → `GetChatMessagesByChatID` for catch-up. Same-replica subscribers paid the full DB round-trip even though the persisting process was on the same replica. ## Solution Add a bounded per-chat durable message cache to `chatStreamState` so that same-replica subscribers can catch up from memory instead of the database. ### How it works 1. `publishMessage()` caches the SDK event in `chatStreamState` before local fanout and pubsub notify. 2. `publishEditedMessage()` replaces the cache with only the edited message, then publishes `FullRefresh`. 3. `Subscribe()` handles ordinary `AfterMessageID` notifies by first consulting the per-chat durable cache and only falling back to `GetChatMessagesByChatID` on cache miss. 4. `FullRefresh` always forces a DB reread (cache is bypassed). ### Safety properties - If the cache misses (e.g. message expired or remote replica), the DB catch-up still runs — no silent message loss. - `FullRefresh` (edits) always rereads from the database. - Remote replicas still use the pubsub + DB path unchanged. - The cache is bounded (`maxDurableMessageCacheSize = 256`) and scoped per chat — no unbounded memory growth. ## Impact This change removes the entire same-replica portion of the stream rereads. Based on the 600-chat follow-up run, the upper bound on saved work is the same-replica share of about 14.2k `GetChatMessagesByChatID` rereads, with the observed total stream reread rate at about 2.63 rereads per follow-up turn.	2026-03-19 14:02:00 +11:00
Cian Johnston	be1c06dec9	feat: add endpoint and CLI for users to view their own OIDC claims (#23053 ) - Adds a new API endpoint `GET /api/v2/users/oidc-claims` that returns only the merged claims (not the separate id_token/userinfo breakdown). Scoped exclusively to the authenticated user's own identity — no user parameter, so users cannot view each other's claims. - Adds a new CLI command:** `coder users oidc-claims` that hits the above endpoint. - The existing owner-only debug endpoint is preserved unchanged for admins who need the full claim breakdown. > 🤖 This PR was created with the help of Coder Agents, and will be reviewed by my human. 🧑‍💻	2026-03-18 22:10:04 +00:00
Hugo Dutka	d285a3e74e	fix: handle null bytes in chat messages (#22946 ) This PR fixes a bug where if a tool result contained binary data it wouldn't be persisted to the database. `jsonb` in Postgres is unable to store null bytes which are sometimes output by tool results. This change makes it so that we encode them with a special escape sequence before saving them to the database, and decode them on read. <img width="808" height="637" alt="Screenshot 2026-03-11 at 13 14 06" src="https://github.com/user-attachments/assets/9be353eb-ff26-40ec-9f0a-195022b11f43" />	2026-03-18 21:19:25 +01:00
Kyle Carberry	147d627505	fix: deduplicate PR insights, fix cost computation, simplify UI (#23251 ) ## Problem The `/agents/settings/insights` page had several issues: 1. Duplicate PRs in "Recent Pull Requests" — multiple chats referencing the same PR URL each produced a row 2. Wildly wrong costs — the cost subquery summed ALL messages across the entire chat tree (`GROUP BY root_chat_id`), so every chat in a tree got the same inflated total. When aggregated, the same tree cost was counted N× per PR in that tree 3. UI clutter — too many stat cards, too many table columns, mixed naming conventions ## Fix ### Backend (SQL) - Deduplicate by PR URL using `DISTINCT ON (COALESCE(cds.url, c.id::text))` across all 4 queries - Fix cost computation: use two CTEs — `pr_costs` sums cost from ALL chats that reference a PR (so review chats contribute), `deduped` picks one row per PR for state/additions/deletions via DISTINCT ON - Tests: 3 subtests covering multi-chat cost summing, different PRs no duplication, and duplicate URL counted once ### Frontend - 3 stat cards (down from 5): Merged, Merge rate, Cost / merge - 2-line chart (down from 3): created (dashed) + merged (solid) - 4-column model table (down from 7): Model, Merged, Merge rate, Cost/merge - 4-column recent table (down from 7): Title, Status, Cost, Created — with `table-fixed` to prevent overflow - Consistent naming: no mixed PR/PRs abbreviation, contextual labels since page title establishes context	2026-03-18 15:50:50 -04:00
Cian Johnston	14ed3e3644	feat: bump workspace last_used_at on chat heartbeat (#23205 ) - coderd: Wires `options.WorkspaceUsageTracker` into the chatd config. - chatd: Adds `UsageTracker` and calls `UsageTracker.Add(workspaceID)` on each heartbeat tick - chatd: adds tests to verify `last_used_at` bump behaviour > 🤖 This PR was created with the help of Coder Agents, and will be reviewed by my human. 🧑‍💻	2026-03-18 19:07:21 +00:00
Kyle Carberry	1f0d896fc9	feat: add deleted flag to chat messages for soft-delete (#23223 ) Adds a `deleted` boolean column to the `chat_messages` table. Messages are never physically deleted from the database — instead they are marked as deleted so that usage and cost data is preserved. ## Changes ### Migration - New migration (000444) adds `deleted boolean NOT NULL DEFAULT false` to `chat_messages` ### SQL queries - `DeleteChatMessagesAfterID` → `SoftDeleteChatMessagesAfterID` (UPDATE SET deleted=true instead of DELETE) - New `SoftDeleteChatMessageByID` query for single-message soft-delete - All read queries now filter `deleted = false`: - `GetChatMessageByID` - `GetChatMessagesByChatID` - `GetChatMessagesByChatIDDescPaginated` - `GetChatMessagesForPromptByChatID` (both CTE and main query) - `GetLastChatMessageByRole` - Cost/usage queries (`GetChatCostSummary`, `GetChatCostPerModel`, etc.) intentionally still include deleted messages to preserve accurate spend tracking ### EditMessage behavior - Previously: updated the message content in-place + hard-deleted subsequent messages - Now: soft-deletes the original message + soft-deletes subsequent messages + inserts a new message with the updated content - This preserves the original message data (tokens, cost, content) in the database	2026-03-18 14:37:09 -04:00
Kyle Carberry	cbe29e4e25	fix: encode non-ASCII filenames in chat file upload header (#23241 ) ## Problem Uploading a file on the `/agents` chat page fails with: ``` Failed to execute 'setRequestHeader' on 'XMLHttpRequest': String contains non ISO-8859-1 code point. ``` This happens when the image filename contains non-ASCII characters (e.g. CJK characters from macOS screenshots like `スクリーンショット.png`, accented characters, emoji, etc.). HTTP headers only support ISO-8859-1 code points, and the filename was being interpolated directly into the `Content-Disposition` header. ## Fix Use [RFC 5987](https://datatracker.ietf.org/doc/html/rfc5987) `filename=UTF-8''` encoding so the percent-encoded name is always valid in the header. A static ASCII `filename="file"` fallback is included for older clients. The server already uses Go's `mime.ParseMediaType` which decodes `filename` automatically, so no backend changes are needed. ### Before ```ts "Content-Disposition": `attachment; filename="${file.name}"` ``` ### After ```ts "Content-Disposition": `attachment; filename="file"; filename*=UTF-8''${encodeURIComponent(file.name)}` ``` ## Testing Added a server-side test (`TestGetChatFile/UnicodeFilename`) that uploads with a Japanese filename and verifies it round-trips correctly through the `Content-Disposition` header.	2026-03-18 14:11:30 -04:00
Kyle Carberry	90cf4f0a91	refactor: consolidate chat streaming endpoints under /stream (#23248 ) Moves per-chat streaming/watch endpoints under a `/stream` sub-route for better API consistency: \| Before \| After \| \|--------\|-------\| \| `GET /{chat}/stream` \| `GET /{chat}/stream/` \| \| `GET /{chat}/desktop` \| `GET /{chat}/stream/desktop` \| \| `GET /{chat}/git/watch` \| `GET /{chat}/stream/git` \| ### Changes - `coderd/coderd.go` — Route definitions: replaced flat routes with `r.Route("/stream", ...)` sub-router - `site/src/api/api.ts` — Updated WebSocket URLs for `watchChatGit` and `watchChatDesktop` - `coderd/chats_test.go` — Updated desktop test URL - `coderd/workspaceagents_internal_test.go` — Updated git watcher test URLs (route mounts + dial URLs) - `site/src/pages/AgentsPage/AgentDetail.stories.tsx` — Updated storybook WebSocket mock paths	2026-03-18 18:04:42 +00:00
Cian Johnston	0b13ba978a	fix: rename chat logger from coderd.chats.chat-processor to coderd.chatd.processor (#23246 ) - Rename logger `coderd.chats` to `coderd.chatd` in `coderd.go` - Rename sub-logger `chat-processor` to `processor` in `chatd/chatd.go`	2026-03-18 17:48:47 +00:00
Kyle Carberry	d4a072b61e	fix: address review comments on InsertChatMessages (#23239 ) Follow-up to #23220, addressing Cian's review comments: - SQL casing: Uppercase `UNNEST` to match `NULLIF`/`COALESCE` convention in the query. - Builder pattern: `chatMessage` struct now uses unexported fields with a `newChatMessage` constructor for required fields (role, content, visibility, modelConfigID, contentVersion) and chainable builder methods (`withCreatedBy`, `withCompressed`, `withUsage`, `withContextLimit`, `withTotalCostMicros`, `withRuntimeMs`) for optional/nullable fields. - Batch test in chats_test: Replaced the `for i := 0; i < 2` loop with a single batch insert of 2 messages to actually exercise the batch logic. - Multi-message querier test: Added `BatchInsertMultipleMessages` test verifying 3-message batch insert with role ordering, sequential IDs, nullable field semantics (NULL for zero UUIDs and zero ints), and token/cost assertions. --------- Co-authored-by: Cian Johnston <cian@coder.com>	2026-03-18 17:06:44 +00:00
Steven Masley	c46136ff73	chore: update coder/trivy override (#23230 ) Coder/preview does this update as well. Because it is a `replace`, we have to manually update our `replace` too	2026-03-18 12:03:56 -05:00
Cian Johnston	65b7658568	chore: extract testutil.FakeSink for slog test assertions (#23208 ) Follow-up to [review comment on #23025](https://github.com/coder/coder/pull/23025#discussion_r2930309487) from @mafredri. Extracts the repeated `logSink` / `fakeSink` test pattern into a shared `testutil.FakeSink` and migrates all existing call sites. > 🤖 This PR was created with the help of Coder Agents, and will be reviewed by my human. 🧑‍💻 --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-03-18 17:02:38 +00:00
Kyle Carberry	483adc59fe	feat: replace InsertChatMessage with batch InsertChatMessages (#23220 ) Replaces the singular `InsertChatMessage` query with `InsertChatMessages` that uses PostgreSQL's `unnest()` for batch inserts. This reduces the number of database round-trips when inserting multiple messages in a single transaction. ## Changes - SQL: New `InsertChatMessages :many` query using `unnest()` arrays following the existing codebase pattern (e.g., `InsertWorkspaceAgentStats`). Preserves the CTE that updates `chats.last_model_config_id` using the last non-null model config from the batch. Uses `NULLIF` for UUID columns to handle NULL foreign keys. - Go layers: Updated `querier.go`, `dbauthz.go`, `dbmetrics/querymetrics.go`, `dbmock/dbmock.go`, and `queries.sql.go` to use the new batch signature (`[]ChatMessage` return type, array params). - chatd.go: All call sites converted to batch inserts: - CreateChat: System prompt + user message batched into one call - persistStep: Assistant message + tool messages batched into one call - persistSummary: Hidden summary + assistant + tool messages batched into one call - Single-message sites use the same API with single-element arrays - Helper: New `appendChatMessage` function simplifies building batch params at each call site. - Tests: All test files updated to use the new API. Builds on top of #23213.	2026-03-18 16:27:07 +00:00
Kyle Carberry	d6fef96d72	feat: add PR insights analytics dashboard (#23215 ) ## What Adds a new admin-only PR Insights page for the `/agents` analytics view — a dashboard for engineering leaders to understand code shipped by AI agents. ### Backend - `GET /api/v2/chats/insights/pull-requests` — admin-only endpoint - 4 SQL queries in `chatinsights.sql` aggregating `chat_diff_statuses` joined with chat cost data (via root chat tree rollup) - Runs 5 parallel DB queries: current summary, previous summary (for trends), time series, per-model breakdown, recent PRs - SDK types auto-generate to TypeScript ### Frontend (`PRInsightsView`) - Stat cards: PRs created, Merged, Merge rate, Lines shipped, Cost/merged PR — with trend badges comparing to previous period - Activity chart: Stacked area chart (created/merged/closed) using git color tokens (`git-added-bright`, `git-merged-bright`, `git-deleted-bright`) - Model performance table: Per-model PR counts, inline merge rate bars, diff stats, cost breakdown - Recent PRs table: Status badges, review state icons, author info, external links - Time range filter: 7d/14d/30d/90d button group - 4 Storybook stories: Default, HighPerformance, LowVolume, NoPRs ### Data source All PR data comes from the existing `chat_diff_statuses` table (populated by the `gitsync.Worker` background job that polls GitHub every 120s). No new data collection required. ### Screenshot View in Storybook: `pages/AgentsPage/PRInsightsView`	2026-03-18 15:29:29 +00:00
Kyle Carberry	4dd8531f37	feat: track step runtime_ms on chat messages (#23219 ) ## Summary Adds a `runtime_ms` column to `chat_messages` that records the wall-clock duration (in milliseconds) of each LLM step. This covers LLM streaming, tool execution, and retries — the full time the agent is "alive" for a step. This is the foundation for billing by agent alive time. The column follows the same pattern as `total_cost_micros`: stored per assistant message, aggregatable with `SUM()` over time periods by user. ## Changes - Migration: adds nullable `runtime_ms bigint` to `chat_messages`. - chatloop: adds `Runtime time.Duration` field to `PersistedStep`, measures `time.Since(stepStart)` at the beginning of each step (covering stream + tool execution + retries). - chatd: passes `step.Runtime.Milliseconds()` to the assistant message `InsertChatMessage` call; all other message types (system, user, tool) get `NULL`. - Tests: adds `runtime > 0` assertion in chatloop tests. ## Billing query pattern Once ready, aggregation mirrors the existing cost queries: ```sql SELECT COALESCE(SUM(cm.runtime_ms), 0)::bigint AS total_runtime_ms FROM chat_messages cm JOIN chats c ON c.id = cm.chat_id WHERE c.owner_id = @user_id AND cm.created_at >= @start_time AND cm.created_at < @end_time AND cm.runtime_ms IS NOT NULL; ```	2026-03-18 10:57:35 -04:00
Kacper Sawicki	1e07ec49a6	feat: add merge_strategy support for coder_env resources (#23107 ) ## Description Implements the server-side merge logic for the `merge_strategy` attribute added to `coder_env` in [terraform-provider-coder v2.15.0](https://github.com/coder/terraform-provider-coder/pull/489). This allows template authors to control how duplicate environment variable names are combined across multiple `coder_env` resources. Relates to https://github.com/coder/coder/issues/21885 ## Supported strategies \| Strategy \| Behavior \| \|----------\|----------\| \| `replace` (default) \| Last value wins — backward compatible \| \| `append` \| Joins values with `:` separator (e.g. PATH additions) \| \| `prepend` \| Prepends value with `:` separator \| \| `error` \| Fails the build if the variable is already defined \| ## Example ```hcl resource "coder_env" "path_tools" { agent_id = coder_agent.dev.id name = "PATH" value = "/home/coder/tools/bin" merge_strategy = "append" } ``` ## Changes - Proto: Added `merge_strategy` field to `Env` message in `provisioner.proto` - State reader: Updated `agentEnvAttributes` struct and proto construction in `resources.go` - Merge logic: Added `mergeExtraEnvs()` function in `provisionerdserver.go` with strategy-aware merging for both agent envs and devcontainer subagent envs - Tests: 15 unit tests covering all strategies, edge cases (empty values, mixed strategies, multiple appends) - Dependency: Bumped `terraform-provider-coder` v2.14.0 → v2.15.0 - Fixtures: Updated `duplicate-env-keys` test fixtures and golden files ## Ordering When multiple resources `append` or `prepend` to the same key, they are processed in alphabetical order by Terraform resource address (per the determinism fix in #22706).	2026-03-18 15:43:28 +01:00
Steven Masley	84de391f26	chore: add tallyman events for ai seat tracking (#22689 ) AI seat tracking inserted as heartbeat into usage table.	2026-03-18 09:30:22 -05:00
Kyle Carberry	b83b93ea5c	feat: add workspace awareness system message on chat creation (#23213 ) When a chat is created via `chatd`, a system message is now inserted informing the model whether the chat was created with or without a workspace. With workspace: > This chat is attached to a workspace. You can use workspace tools like execute, read_file, write_file, etc. Without workspace: > There is no workspace associated with this chat yet. Create one using the create_workspace tool before using workspace tools like execute, read_file, write_file, etc. This is a model-only visibility system message (not shown to users) that helps the model understand its available capabilities upfront — particularly important for subagents spawned without a workspace, which previously would attempt to use workspace tools and fail. Changes: - `coderd/chatd/chatd.go`: Added workspace awareness constants and inserted the system message in `CreateChat` after the system prompt, before the initial user message. - `coderd/chatd/chatd_test.go`: Added `TestCreateChatInsertsWorkspaceAwarenessMessage` with sub-tests for both with-workspace and without-workspace cases.	2026-03-18 14:01:46 +00:00
Ethan	fc3508dc60	feat: configure acquire chat batch size (#23196 ) ## Summary - add a hidden deployment config option for chat acquire batch size (`CODER_CHAT_ACQUIRE_BATCH_SIZE` / `chat.acquireBatchSize`) - thread the configured value into chatd startup while preserving the existing default of `10` - clamp the deployment value to the `int32` range before passing it into chatd - regenerate the API/docs/types/testdata artifacts for the new config field ## Why `chatd` currently acquires pending chats in batches of `10` via a compile-time default. This change makes that batch size operator-configurable from deployment config, so we can tune acquisition behavior without another code change.	2026-03-19 00:54:32 +11:00
Kyle Carberry	d42008e93d	fix: persist partial assistant response when chat is interrupted mid-stream (#23193 ) ## Problem When a user cancels a streaming chat response mid-stream, the partial content disappears entirely — both from the UI and the database. The streamed text vanishes as if the response never happened. ## Root Causes Three issues combine to prevent partial message persistence on interrupt: ### 1. StreamPartTypeError only matched `context.Canceled` (`chatloop.go`) The interrupt detection in `processStepStream` checked: ```go errors.Is(part.Error, context.Canceled) && errors.Is(context.Cause(ctx), ErrInterrupted) ``` But some providers propagate `ErrInterrupted` directly as the stream error rather than wrapping it in `context.Canceled`. This caused the condition to fail, so `flushActiveState` was never called and partial text accumulated in `activeTextContent` was lost. ### 2. No post-loop interrupt check (`chatloop.go`) If the stream iterator stops yielding parts without producing a `StreamPartTypeError` (e.g., a provider that silently closes the response body on cancel), there was no check after the `for part := range stream` loop to detect the interrupt and flush active state. ### 3. Worker ownership check blocked interrupted persists (`chatd.go`) `InterruptChat` → `setChatWaiting` clears `worker_id` in the DB before the chatloop detects the interrupt. When `persistInterruptedStep` (using `context.WithoutCancel`) tried to write the partial message, the ownership check: ```go if !lockedChat.WorkerID.Valid \|\| lockedChat.WorkerID.UUID != p.workerID { return chatloop.ErrInterrupted // always blocks! } ``` unconditionally rejected the write. The error was silently logged as a warning. ## Fix - Broaden the `StreamPartTypeError` interrupt detection to match both `context.Canceled` and `ErrInterrupted` as the stream error. - Add a post-loop interrupt check in `processStepStream` that flushes active state when the context was canceled with `ErrInterrupted`. - Allow `persistStep` to write when the chat is in `waiting` status (interrupt) even if `worker_id` was cleared. The `pending` status (from `EditMessage`, where history is truncated) still correctly blocks stale writes. ## Testing Added `TestInterruptChatPersistsPartialResponse` — an end-to-end integration test that: 1. Streams partial text chunks from a mock LLM 2. Waits for the chatloop to publish `message_part` events (confirming chunks were processed) 3. Interrupts the chat mid-stream 4. Verifies the partial assistant message is persisted in the database with the expected text content	2026-03-18 11:48:28 +00:00
Atif Ali	bd5b62c976	feat: expose MCP tool annotations for tool grouping (#23195 ) ## Summary - add shared MCP annotation metadata to toolsdk tools - emit MCP tool annotations from both coderd and CLI MCP servers - cover annotation serialization in toolsdk, coderd MCP e2e, and CLI MCP tests ## Why - Coder already exposed MCP tools, but it did not populate MCP tool annotation hints (`readOnlyHint`, `destructiveHint`, `idempotentHint`, `openWorldHint`). - Hosts such as Claude Desktop use those hints to classify and group tools, so without them Coder tools can get lumped together. - This change adds a shared annotation source in `toolsdk` and has both MCP servers emit those hints through `mcp.Tool.Annotations`, avoiding drift between local and remote MCP implementations. ## Testing - Tested locally on Cladue Desktop and the tools are categorized correctly. <table> <tr> <td> Before <td> After <tr> <td> <img width="613" height="183" alt="image" src="https://github.com/user-attachments/assets/29d2e3fb-53bc-4ea7-bdb3-f10df4ef996b" /> <td> <img width="600" height="457" alt="image" src="https://github.com/user-attachments/assets/cc384036-c9a7-4db9-9400-43ad51920ff5" /> </table> Note: Done using Coder Agents, reviewed and tested by human locally	2026-03-18 10:21:45 +00:00
Hugo Dutka	2cf47ec384	feat: virtual desktop settings toggle backend (#23171 ) Adds a new `site_config` entry that controls whether the virtual desktop feature for Coder Agents is enabled. It can be set via a new `/api/experimental/chats/config/desktop-enabled` endpoint, which will be used by the frontend.	2026-03-18 09:35:13 +01:00
Ethan	11481d7bed	perf(coderd/chatd): reduce lock contention in instruction cache and persistStep (#23144 ) ## Summary Two targeted performance improvements to the chatd server, identified through benchmarking. ### 1. RWMutex for instruction cache The instruction cache is read on every chat turn to fetch the home instruction file for a workspace agent. Writes only occur on cache misses (once per agent per 5-minute TTL window), making the access pattern ~90%+ reads. Switching from `sync.Mutex` to `sync.RWMutex` and using `RLock`/`RUnlock` on the read path allows concurrent readers instead of serializing them. Benchmark (200 concurrent chats): \| \| ns/op \| \|---\|---\| \| Mutex \| 108 \| \| RWMutex \| 32 \| \| Speedup \| 3.4x \| ### 2. Hoist JSON marshaling out of persistStep transaction `MarshalParts`, `PartFromContent`, `CalculateTotalCostMicros`, and the `usageForCost` struct population are pure CPU work that ran inside the `FOR UPDATE` transaction in `persistStep`. They have zero dependency on the database transaction. Moving all marshal and cost-calculation calls above `p.db.InTx()` means the row lock is held only for `GetChatByIDForUpdate` + `InsertChatMessage` calls. Benchmark (16 goroutines contending on same lock): \| Tool calls \| Inside lock \| Outside lock \| Speedup \| \|---\|---\|---\|---\| \| 1 \| 13,977 ns/op \| 1,055 ns/op \| 13x \| \| 5 \| 38,203 ns/op \| 3,769 ns/op \| 10x \| \| 10 \| 67,353 ns/op \| 7,284 ns/op \| 9x \| \| 20 \| 145,864 ns/op \| 14,045 ns/op \| 10x \| No behavioral changes in either commit.	2026-03-18 16:12:14 +11:00
Kayla はな	49e5547c22	feat: add support for creating service accounts (#23140 )	2026-03-17 15:36:20 -06:00
George K	91ec0f1484	feat: add service_accounts workspace sharing mode (#23093 ) Introduce a three-way workspace sharing setting (none, everyone, service_accounts) replacing the boolean workspace_sharing_disabled. In service_accounts mode, only service account-owned workspaces can be shared while regular members' share permissions are removed. Adds a new organization-service-account system role with per-org permissions reconciled alongside the existing organization-member system role. Related to: https://linear.app/codercom/issue/PLAT-28/feat-service-accounts-sharing-mode-and-rbac-role --------- Co-authored-by: Steven Masley <Emyrk@users.noreply.github.com> Co-authored-by: Kayla はな <mckayla@hey.com>	2026-03-17 12:16:43 -07:00
Kyle Carberry	b779c9ee33	fix: use SQL-level auth filtering for chat listing (#23159 ) ## Problem The chat listing endpoint (`GetChatsByOwnerID`) was using `fetchWithPostFilter`, which fetches N rows from the database and then filters them in Go memory using RBAC checks. This causes a pagination bug: if the user requests `limit=25` but some rows fail the auth check, fewer than 25 rows are returned even though more authorized rows exist in the database. The client may incorrectly assume it has reached the end of the list. ## Solution Switch to the same pattern used by `GetWorkspaces`, `GetTemplates`, and `GetUsers`: `prepareSQLFilter` + `GetAuthorized*` variant. The RBAC filter is compiled to a SQL WHERE clause and injected into the query before `ORDER BY`/`LIMIT`, so the database returns exactly the requested number of authorized rows. Additionally, `GetChatsByOwnerID` is renamed to `GetChats` with `OwnerID` as an optional (nullable) filter parameter, matching the `GetWorkspaces` naming convention. ## Changes \| File \| Change \| \|------\|--------\| \| `queries/chats.sql` \| Renamed to `GetChats`, `owner_id` now optional via CASE/NULL, added `-- @authorize_filter` \| \| `queries.sql.go` \| Renamed constant, params struct (`GetChatsParams`), and method \| \| `querier.go` \| Interface method renamed \| \| `modelqueries.go` \| Added `chatQuerier` interface + `GetAuthorizedChats` impl \| \| `dbauthz/dbauthz.go` \| `GetChats` now uses `prepareSQLFilter` instead of `fetchWithPostFilter` \| \| `dbauthz/dbauthz_test.go` \| Updated tests for SQL filter pattern \| \| `dbmock/dbmock.go` \| Renamed + added mock for `GetAuthorizedChats` \| \| `dbmetrics/querymetrics.go` \| Renamed + added metrics wrapper \| \| `rbac/regosql/configs.go` \| Added `ChatConverter` (maps `org_owner` to empty string literal since `chats` has no `organization_id` column) \| \| `rbac/authz.go` \| Added `ConfigChats()` \| \| `chats.go` \| Handler uses renamed method with `uuid.NullUUID` \| \| `searchquery/search.go` \| Updated return type \| \| `gitsync/worker.go` \| Updated interface and call site \| \| Various test files \| Updated for renamed types \|	2026-03-17 12:46:24 -04:00
Kyle Carberry	075dfecd12	refactor: consolidate experimental chats API types (#23143 ) ## Summary Consolidates three areas of type duplication in the experimental chats API: ### 1. Merge archive/unarchive into `PATCH /{chat}` - Before: `POST /{chat}/archive` + `POST /{chat}/unarchive` (two endpoints, two handlers with mirrored logic) - After: `PATCH /{chat}` accepting `{ "archived": true/false }` via `UpdateChatRequest` - Removes one endpoint and ~30 lines of duplicated handler code ### 2. Collapse identical request/response prompt types - `ChatSystemPromptResponse` + `UpdateChatSystemPromptRequest` → `ChatSystemPrompt` - `UserChatCustomPromptResponse` + `UpdateUserChatCustomPromptRequest` → `UserChatCustomPrompt` - These pairs were field-for-field identical (single string field) ### 3. Merge duplicate reasoning options types - `ChatModelOpenRouterReasoningOptions` + `ChatModelVercelReasoningOptions` → `ChatModelReasoningOptions` - Same 4 fields, same types — only field ordering and enum value sets differed - Unified type uses the superset of enum values ### Files changed - `codersdk/chats.go` — SDK types and client methods - `coderd/chats.go` — Handler consolidation - `coderd/coderd.go` — Route change - `coderd/chats_test.go` — Test updates - `site/src/api/api.ts` — Frontend API client - `site/src/api/queries/chats.ts` — Query mutations - `site/src/api/queries/chats.test.ts` — Test mocks - `site/src/pages/AgentsPage/AgentsPage.tsx` — Call site - Generated files (`typesGenerated.ts`, `chatModelOptionsGenerated.json`) ### Testing - All Go tests pass (`TestArchiveChat`, `TestUnarchiveChat`, `TestChatSystemPrompt`) - All frontend tests pass (31/31 in `chats.test.ts`)	2026-03-17 14:31:11 +00:00
Ethan	41bd7acf66	perf(chatd): remove redundant chat rereads (#23161 ) ## Summary This PR removes two redundant chat rereads in `chatd`. ### Archive / unarchive - `archiveChat` and `unarchiveChat` already come through `httpmw.ChatParam`, so the handlers already have the `database.Chat` row. - Pass that row into `chatd.ArchiveChat` / `chatd.UnarchiveChat` instead of rereading by ID before publishing the sidebar events. ### End-of-turn cleanup - `processChat` no longer calls `GetChatByID` after the cleanup transaction just to refresh the chat snapshot. - Title generation already persists the generated title and emits its own `title_change` event. - To preserve best-effort title freshness for the cleanup path, the async title-generation goroutine stores the generated title in per-turn shared state and cleanup overlays it if available before publishing the `status_change` event and dispatching push notifications. ## Why - removes one DB read from archive / unarchive requests - removes one DB read from completed turns, which is the larger hot-path win - keeps the existing pubsub/event contract intact instead of broadening this into a larger event-model redesign ## Notes - `title_change` remains the authoritative title update for clients - cleanup does not wait for title generation; it uses the generated title only when it is already available	2026-03-18 00:52:06 +11:00
Ethan	a33605df58	perf(coderd/chatd): reuse workspace context within a turn (#23145 ) ## Summary - reuse workspace agent context within a single `runChat()` turn - remove duplicate latest-build agent lookups between `resolveInstructions()` and `getWorkspaceConn()` - avoid the extra `GetWorkspaceAgentByID` fetch when the selected `WorkspaceAgent` already has the needed metadata - add focused internal tests for reuse and refresh-on-dial-failure ## Why This came out of a 5000-chat / 10-turn scaletest on bravo against a single workspace. The run completed successfully, but coderd stayed DB-pool bound, and one workspace-backed hot path stood out: - `GetWorkspaceAgentsInLatestBuildByWorkspaceID ≈ 46.7k` - `GetWorkspaceByID ≈ 48.0k` - `GetWorkspaceAgentByID ≈ 2.2k` Within one `runChat()` turn, chatd was rediscovering the same workspace agent multiple times just to resolve instructions and open the workspace connection. ## What this changes This PR introduces a turn-local workspace context helper so a single acquired turn can: - resolve the selected workspace agent once - reuse that agent for instruction resolution - reuse the same `AgentConn` for workspace tools and reload/compaction This stays turn-local only, so a later turn on another replica still rebuilds fresh context from the DB. ## Expected impact This is an incremental improvement, not a full fix. It should reduce duplicated workspace-agent lookups and shave some DB pressure from a hot path for workspace-backed chats, while preserving multi-replica correctness. ## Testing - `go test ./coderd/chatd/...` - `golangci-lint run ./coderd/chatd/...`	2026-03-18 00:33:44 +11:00
Danny Kopping	365de3e367	feat: record model thoughts (#22676 ) Depends on https://github.com/coder/aibridge/pull/203 Closes https://github.com/coder/internal/issues/1337 --------- Signed-off-by: Danny Kopping <danny@coder.com>	2026-03-17 11:41:10 +00:00
Michael Suchacz	5d0eb772da	fix(cored): fix flaky TestInterruptAutoPromotionIgnoresLaterUsageLimitIncrease (#23147 )	2026-03-17 19:08:22 +11:00
Ethan	04fca84872	perf(coderd): reduce duplicated reads in push and webpush paths (#23115 ) ## Background A 5000-chat scaletest (~50k turns, ~2m45s wall time) completed successfully, but the main bottleneck was DB pool starvation from repeated reads, not individually expensive SQL. The push/webpush path showed a few especially noisy reads: - `GetLastChatMessageByRole` for push body generation - `GetEnabledChatProviders` + `GetChatModelConfigByID` for push summary model resolution - `GetWebpushSubscriptionsByUserID` for every webpush dispatch This PR keeps the optimizations that remove those duplicate reads while leaving stream behavior unchanged. ## What changes in this PR ### 1. Reuse resolved chat state for push notifications `maybeSendPushNotification` used to re-read the last assistant message and re-resolve the chat model/provider after `runChat` had already done that work. Now `runChat` returns the final assistant text plus the already-resolved model and provider keys, and the push goroutine uses that state directly. That removes the extra push-path reads for: - `GetLastChatMessageByRole` - the second `resolveChatModel` path - the provider/model lookups that came with that second resolution ### 2. Cache webpush subscriptions during dispatch `Dispatch()` previously hit `GetWebpushSubscriptionsByUserID` on every push. A small per-user in-memory cache now avoids those repeated reads. The follow-up fix keeps that optimization correct: `InvalidateUser()` bumps a per-user generation so an older in-flight fetch cannot repopulate the cache with pre-mutation data after subscribe/unsubscribe. That preserves the cache win without letting local subscription changes be silently overwritten by stale fetch results. ## Why this is safe - The push change only reuses data already produced during the same chat run. It does not change notification semantics; if there is no assistant text to summarize, the existing fallback body still applies. - The webpush change keeps the existing TTL and `410 Gone` cleanup behavior. The generation guard only prevents stale in-flight fetches from poisoning the shared cache after invalidation. - The final PR does not change stream setup, pubsub/relay behavior, or chat status snapshot timing. ## Deliberately not included - No stream-path optimization in `Subscribe`. - No inline pubsub message payloads. - No distributed cross-replica webpush cache invalidation.	2026-03-17 13:50:47 +11:00
Michael Suchacz	1031da9738	feat: add agent chat spend limiting (backend) (#23071 ) Introduces deployment-scoped spend limiting for Coder Agents, enabling administrators to control LLM costs at global, group, and individual user levels. ## Changes - Database migration (000437): `chat_usage_limit_config` (singleton), `chat_usage_limit_overrides` (per-user), `chat_usage_limit_group_overrides` (per-group) - Single-query limit resolution: individual override > min(group) > global default via `ResolveUserChatSpendLimit` - Fail-open enforcement in chatd with documented TOCTOU trade-off - Experimental API under `/api/experimental/chats/usage-limits` for CRUD on limits - `AsChatd` RBAC subject for narrowly-scoped daemon access (replaces `AsSystemRestricted`) - Generated TypeScript types for the frontend SDK ## Hierarchy 1. Individual user override (highest) 2. Minimum of group limits 3. Global default 4. Disabled / unlimited Currency stored as micro-dollars (`1,000,000` = $1.00). Frontend PR: #23072	2026-03-17 01:24:03 +01:00
Steven Masley	93b9d70a9b	chore: add audit log entry when ai seat is consumed (#22683 ) When an ai seat is consumed, an audit log entry is made. This only happens the first time a seat is used.	2026-03-16 15:30:25 -05:00
Kyle Carberry	6972d073a2	fix: improve background process handling for agent tools (#23132 ) ## Problem Models frequently use shell `&` instead of `run_in_background=true` when starting long-running processes through `/agents`, causing them to die shortly after starting. This happens because: 1. No guidance in tool schema — The `ExecuteArgs` struct had zero `description` tags. The model saw `run_in_background: boolean (optional)` with no explanation of when/why to use it. 2. Shell `&` is silently broken — `sh -c "command &"` forks the process, the shell exits immediately, and the forked child becomes an orphan not tracked by the process manager. 3. No process group isolation — The SSH subsystem sets `Setsid: true` on spawned processes, but the agent process manager set no `SysProcAttr` at all. Signals only hit the top-level `sh`, not child processes. ## Investigation Compared our implementation against openai/codex and coder/mux: \| Aspect \| codex \| mux \| coder/coder (before) \| \|--------\|-------\|-----\|---------------------\| \| Background flag \| Yield/resume with `session_id` \| `run_in_background` with rich description \| `run_in_background` with no description \| \| `&` handling \| `setsid()` + `killpg()` \| `detached: true` + `killProcessTree()` \| Nothing — orphaned children escape \| \| Process isolation \| `setsid()` on every spawn \| `set -m; nohup ... setsid` for background \| No `SysProcAttr` at all \| \| Signal delivery \| `killpg(pgid, sig)` — entire group \| `kill -15 -\$pid` — negative PID \| `proc.cmd.Process.Signal()` — PID only \| ## Changes ### Fix 1: Add descriptions to `ExecuteArgs` (highest impact) The model now sees explicit guidance: "Use for long-running processes like dev servers, file watchers, or builds. Do NOT use shell & — it will not work correctly." ### Fix 2: Update tool description The top-level execute tool description now reinforces: "Use run_in_background=true for long-running processes. Never use shell '&' for backgrounding." ### Fix 3: Detect trailing `&` and auto-promote to background Defense-in-depth: if the model still uses `command &`, we strip the `&` and promote to `run_in_background=true` automatically. Correctly distinguishes `&` from `&&`. ### Fix 4: Process group isolation (`Setpgid`) New platform-specific files (`proc_other.go` / `proc_windows.go`) following the same pattern as `agentssh/exec_other.go`. Every spawned process gets its own process group. ### Fix 5: Process group signaling `signal()` now uses `syscall.Kill(-pid, sig)` on Unix to signal the entire process group, ensuring child processes from shell pipelines are also cleaned up. ## Testing All existing `agent/agentproc` tests pass. Both packages compile cleanly.	2026-03-16 16:22:10 -04:00
Steven Masley	abf59ee7a6	feat: track ai seat usage (#22682 ) When a user uses an AI feature, we record them in the `ai_seat_state` as consuming a seat. Added in debouching to prevent excessive writes to the db for this feature. There is no need for frequent updates.	2026-03-16 12:36:26 -05:00
Steven Masley	cabb611fd9	chore: implement database crud for AI seat usage (#22681 ) Creates a new table `ai_seat_state` to keep track of when users consume an ai_seat. Once a user consumes an AI seat, they will forever in this table (as it stands today).	2026-03-16 11:53:20 -05:00
Kyle Carberry	741af057dc	feat: paginate chat messages endpoint with cursor-based infinite scroll (#23083 ) Adds cursor-based pagination to the chat messages endpoint. ## Backend - New `GetChatMessagesByChatIDPaginated` SQL query: returns messages in `id DESC` order with a `before_id` keyset cursor and configurable `limit` - Handler parses `?before_id=N&limit=N` query params, uses the `LIMIT N+1` trick to set `has_more` without a separate COUNT query - Queued messages only returned on the first page (no cursor) since they're always the most recent - SDK client updated with `ChatMessagesPaginationOptions` - Fully backward compatible: omitting params returns the 50 newest messages ## Frontend - Switches `getChatMessages` from `useQuery` to `useInfiniteQuery` with cursor chaining via `getNextPageParam` - Pages flattened and sorted by `id` ascending for chronological display - `MessagesPaginationSentinel` component uses `IntersectionObserver` (200px rootMargin prefetch) inside the existing `flex-col-reverse` scroll container - `flex-col-reverse` handles scroll anchoring natively when older messages are prepended — no manual `scrollTop` adjustment needed (same pattern as coder/blink) ## Why cursor-based instead of offset/limit Offset-based pagination breaks when new messages arrive while paginating backward (offsets shift, causing duplicates or missed messages). The `before_id` cursor is stable regardless of inserts — each page is deterministic.	2026-03-16 16:40:59 +00:00
Charlie Voiselle	e94de0bdab	fix(coderd): render HTML error page for OIDC email validation failures (#23059 ) ## Summary When the email address returned from an OIDC provider doesn't match the configured allowed domain list (or isn't verified), users previously saw raw JSON dumped directly in the browser — an ugly and confusing experience during a browser-redirect flow. This PR replaces those JSON responses with the same styled static HTML error page already used for group allow-list errors, signups-disabled, and wrong-login-type errors. ## Changes ### `coderd/userauth.go` Replaced 3 `httpapi.Write` calls in `userOIDC` with `site.RenderStaticErrorPage`: \| Error case \| Title shown \| \|---\|---\| \| Email domain not in allowed list \| "Unauthorized email" \| \| Malformed email (no `@`) with domain restrictions \| "Unauthorized email" \| \| `email_verified` is `false` \| "Email not verified" \| All render HTTP 403 with `HideStatus: true` and a "Back to login" action button. ### `coderd/userauth_test.go` - Updated `AssertResponse` callbacks on existing table-driven tests (`EmailNotVerified`, `NotInRequiredEmailDomain`, `EmailDomainForbiddenWithLeadingAt`) to verify HTML Content-Type and page content. - Extended `TestOIDCDomainErrorMessage` to additionally assert HTML rendering. - Added new `TestOIDCErrorPageRendering` with 3 subtests covering all error scenarios, verifying: HTML doctype, expected title/description, "Back to login" link, and absence of JSON markers. --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-03-16 11:56:59 -04:00
Kyle Carberry	6f97539122	fix: update sidebar diff status on WebSocket events (#23116 ) ## Problem The sidebar diff status (PR icon, +additions/-deletions, file count) was not updating in real-time. Users had to reload the page to see changes. Two root causes: 1. Frontend: The `diff_status_change` WebSocket handler in `AgentsPage.tsx` had an early `return` (line 398) that skipped `updateInfiniteChatsCache`, so the sidebar's cache was never updated. Even for other event types, the cache merge only spread `status` and `title` — never `diff_status`. 2. Server: `publishChatPubsubEvent` in `chatd.go` constructed a minimal `Chat` payload without `DiffStatus`, so even if the frontend consumed the event, `updatedChat.diff_status` would be `undefined`. ## Fix ### Server (`coderd/chatd/chatd.go`) - `publishChatPubsubEvent` now accepts an optional `*codersdk.ChatDiffStatus` parameter; when non-nil it's set on the outgoing `Chat` payload. - `PublishDiffStatusChange` fetches the diff status from the DB, converts it, and passes it through. - Added `convertDBChatDiffStatus` (mirrors `coderd/chats.go`'s converter to avoid circular import). - All other callers pass `nil`. ### Frontend (`site/src/pages/AgentsPage/AgentsPage.tsx`) - Removed the early `return` so `diff_status_change` events fall through to the cache update logic. - Added `isDiffStatusEvent` flag and spread `diff_status` into both the infinite chats cache (sidebar) and the individual chat cache.	2026-03-16 15:41:32 +00:00
Kyle Carberry	530872873e	chore: remove swagger annotations from experimental chat endpoints (#23120 ) The `/archive` and `/desktop` chat endpoints had swagger route comments (`@Summary`, `@ID`, `@Router`, etc.) that would cause them to appear in generated API docs. Since these live under `/experimental/chats`, they should not be documented. This removes the swagger annotations and adds the standard `// EXPERIMENTAL: this endpoint is experimental and is subject to change.` comment to `archiveChat` (the `watchChatDesktop` handler already had it, just needed the swagger block removed).	2026-03-16 08:41:13 -07:00
Cian Johnston	f8dff3f758	fix: improve push notification message shown on subscribe (#23052 ) Updates push notification message for test notification.	2026-03-16 14:52:31 +00:00
Kyle Carberry	27cbf5474b	refactor: remove /diff-status endpoint, include diff_status in chat payload (#23082 ) The `/chats/{chat}/diff-status` endpoint was redundant because: - The `Chat` type already has a `DiffStatus` field - Listing chats already resolves and returns `diff_status` - The `getChat` endpoint was the only one not resolving it (passing `nil`) ## Changes Backend: - `getChat` now calls `resolveChatDiffStatus` and includes the result in the response - Removed `getChatDiffStatus` handler, route (`GET /diff-status`), and SDK method - Tests updated to use `GetChat` instead of `GetChatDiffStatus` Frontend: - `AgentDetail.tsx`: uses `chatQuery.data?.diff_status` instead of separate query - `RemoteDiffPanel.tsx`: accepts `diffStatus` as a prop instead of fetching internally - `AgentsPage.tsx`: `diff_status_change` events now invalidate the chat query - Removed `chatDiffStatus` query, `chatDiffStatusKey`, and `getChatDiffStatus` API method	2026-03-16 14:40:22 +00:00
Ethan	c4db03f11a	perf(coderd/database): skip redundant chat row update in InsertChatMessage (#23111 ) ## Summary - add an `IS DISTINCT FROM` guard to `InsertChatMessage`'s `updated_chat` CTE so `chats.last_model_config_id` is only rewritten when the incoming `model_config_id` actually changes - regenerate the query layer - add focused regression coverage for the two meaningful behaviors: same-model inserts and real model switches - trim redundant message-field assertions so the new test stays focused on the guard behavior ## Proof this is an improvement This PR reduces work in the hottest chat write query without changing the insert behavior. ### Why the old query did unnecessary work Before this change, `InsertChatMessage` always ran this update whenever `model_config_id` was non-null: ```sql UPDATE chats SET last_model_config_id = sqlc.narg('model_config_id')::uuid WHERE id = @chat_id::uuid AND sqlc.narg('model_config_id')::uuid IS NOT NULL ``` That means the query rewrote the `chats` row even when `chats.last_model_config_id` was already equal to the incoming value. ### What changes in this PR This PR adds: ```sql AND chats.last_model_config_id IS DISTINCT FROM sqlc.narg('model_config_id')::uuid ``` So same-model inserts still insert the message, but they no longer perform a redundant `UPDATE chats`. ### Why this matters on the hot path From the chat scaletest investigation that motivated this change: - `InsertChatMessage` (+ `updated_chat` CTE) was the hottest write query - about 104k calls - about 0.69 ms average latency - about 71.8 s total DB execution time We also verified common callsites where the update is provably redundant: - `CreateChat` inserts the chat with `LastModelConfigID = opts.ModelConfigID`, then immediately inserts initial system/user messages with that same model config - follow-up user messages commonly pass `lockedChat.LastModelConfigID` straight into `InsertChatMessage` - assistant/tool/summary persistence keeps the current model in the common case; only real switches or fallback cases need the chat row update That means a meaningful fraction of executions of the hottest DB write query move from: - before: insert message + rewrite chat row - after: insert message only This should reduce row churn and write contention on `chats`, especially against other chat-row writers like `UpdateChatStatus` and `GetChatByIDForUpdate`.	2026-03-17 00:44:10 +11:00
Michael Suchacz	fbc8930fc3	fix(coderd): make chat cost summary tests deterministic (#23097 ) Fixes flaky `TestChatCostSummary_UnpricedMessages` (and siblings) by replacing implicit handler-default date windows with explicit time windows derived from database-assigned message timestamps. Root cause: Tests called `GetChatCostSummary` with empty options, triggering the handler to use `[time.Now()-30d, time.Now())` as the query window. The SQL filter's exclusive upper bound (`created_at < @end_date`) can exclude freshly-inserted messages when the handler's clock drifts even slightly past the message's `created_at`. Fix (test-only, `coderd/chats_test.go`): - `seedChatCostFixture` now captures `InsertChatMessage` return values and exposes `EarliestCreatedAt`/`LatestCreatedAt`. - Added `safeOptions()` helper that builds a padded ±1 min window around DB timestamps. - Updated 4 tests to use explicit date windows; `TestChatCostSummary_DateRange` unchanged. Validated with `go test -count=20` (100/100 passes).	2026-03-16 14:42:06 +01:00
Thomas Kosiewski	069d3e2beb	fix(coderd): require ssh access for workspace chats (#23094 ) ### Motivation - The chat creation flow associated a workspace agent for a chat if the requester could read the workspace, enabling privilege escalation where users without SSH/app-connect permissions could cause the daemon to open privileged agent connections and execute commands. - The intent is to ensure that attaching a workspace agent to a chat only happens when the requester has the workspace SSH permission so the chat daemon cannot be abused to bypass RBAC. ### Description - Require request-scoped authorization for workspace agent usage by changing `validateCreateChatWorkspaceSelection` to accept the `*http.Request` and calling `api.Authorize(r, policy.ActionSSH, workspace)` before selecting the workspace for a chat. - Pass the HTTP request into the validator from `postChats` so authorization is evaluated in the request context (`postChats` now calls `validateCreateChatWorkspaceSelection(ctx, r, req)`). - Add a regression test `WorkspaceAccessibleButNoSSH` in `coderd/chats_test.go` which creates an org-admin-scoped user (read access but no `ActionSSH`) and asserts that creating a chat with `WorkspaceID` is denied. ### Testing - Ran `gofmt -w coderd/chats.go coderd/chats_test.go` which succeeded. - Attempted to run repository pre-commit checks (`make pre-commit`) and targeted `go test` invocations; these checks could not be completed in this environment due to missing local tooling and environment constraints (protobuf include resolution, containerized DB access via Docker socket, and long-running golden generation tasks), so full CI/pre-commit verification and end-to-end test runs did not complete here. - Added a focused regression unit test (`WorkspaceAccessibleButNoSSH`) to prevent reintroduction of the authorization bypass; this test is included in the change and should be executed in CI where the full toolchain and test environment are available. ------ [Codex Task](https://chatgpt.com/codex/tasks/task_b_69b432502670832e91d14e937745de46)	2026-03-16 11:42:01 +01:00
Mathias Fredriksson	703b974757	fix(coderd): remove false devcontainers early access warning (#23056 ) The script source claimed Dev Containers are early access and told users to set CODER_AGENT_DEVCONTAINERS_ENABLE=true, which already defaults to true. Clear the script source and set RunOnStart to false since there is nothing to run.	2026-03-16 10:16:14 +02:00
Kyle Carberry	0d3e39a24e	feat: add head_branch to pull request diff status (#23076 ) Adds the `head_branch` field (the source/feature branch name of a PR) to the diff status pipeline. Previously only `base_branch` (target branch) and the head commit SHA were captured from the GitHub API, but not the head branch name itself. ## Changes - Migration 438: Add `head_branch` nullable TEXT column to `chat_diff_statuses` - gitprovider: Parse `head.ref` from the GitHub API response (alongside `head.sha`) and add `HeadBranch` to `PRStatus` - gitsync: Wire `HeadBranch` through `refreshOne()` into the DB upsert params - worker: Map `HeadBranch` in `chatDiffStatusFromRow()` - coderd: Convert `HeadBranch` in `convertChatDiffStatus()` - codersdk: Expose as `head_branch` (`string`, omitempty) in `ChatDiffStatus` API response - Tests*: Updated `github_test.go` pull JSON fixtures and assertions	2026-03-14 17:24:19 +00:00
Thomas Kosiewski	3f7f25b3ee	fix(chats): enforce desktop connect authorization (#23073 ) ### Motivation - The desktop watch handler opened a VNC stream using the chat's workspace ID while only relying on workspace read permissions, allowing read-only users to escalate to interactive desktop access. - Enforce connect-level authorization so only actors with `ActionApplicationConnect` or `ActionSSH` can open the desktop stream. ### Description - Added an explicit workspace lookup in `watchChatDesktop` using `GetWorkspaceByID` to obtain a workspace object for authorization. - Require the requester to be authorized for either `policy.ActionApplicationConnect` or `policy.ActionSSH` on the workspace before proceeding to locate agents or connect to the VNC stream, and return `403 Forbidden` when neither permission is present. - The change is minimal and localized to `coderd/chats.go` and does not alter other code paths or behavior when the requester has the necessary connect permissions. ### Testing - Ran `gofmt -w coderd/chats.go` to format the modified file, which succeeded. - Attempted to run the unit test `TestWatchChatDesktop/NoWorkspace` via `go test` in this environment but the test run did not complete within the environment constraints and did not produce a full pass result. - Attempted to run the repository pre-commit/gen steps but they could not complete due to missing developer tooling and services in this environment (e.g. `sqlc`, `mockgen`, `protoc` plugins and test services like Docker/Postgres), so full pre-commit validation did not finish here. - Code review and static validation confirm the added authorization check properly prevents read-only access from opening the desktop VNC stream. ------ [Codex Task](https://chatgpt.com/codex/tasks/task_b_69b46a4ac5c4832ea9d330aeba43c32d)	2026-03-14 17:53:05 +01:00
Michael Suchacz	969066b55e	feat(site): improve cost analytics view (#23069 ) Surfaces cache token data in the analytics views and fixes table spacing. ### Changes - Cache token columns: Added cache read and cache write token counts to all analytics views (user and admin), from SQL queries through Go SDK types to the frontend tables and summary cards. - Table spacing fix: Replaced the bare React fragment in `ChatCostSummaryView` with a `space-y-6` container so the model and chat breakdown tables no longer overlap. ### Data flow `chat_messages` table already stores `cache_read_tokens` and `cache_creation_tokens` (and uses them for cost calculation). This PR aggregates and displays them alongside input/output tokens in: - Summary cards (6 cards: Total Cost, Input, Output, Cache Read, Cache Write, Messages) - Per-model breakdown table - Per-chat breakdown table - Admin per-user table	2026-03-14 01:22:00 -05:00

1 2 3 4 5 ...

3410 Commits