coder

mirror of https://github.com/coder/coder.git synced 2026-06-03 13:08:25 +00:00

Author	SHA1	Message	Date
Cian Johnston	38017010ce	fix(coderd): disallow POSTing a workspace build on a deleted workspace (#20584 ) - Adds a check on /api/v2/workspacebuilds to disallow creating a START or STOP build if the workspace is deleted. - DELETEs are still allowed.	2025-10-30 13:32:18 +00:00
Cian Johnston	73dedcc765	fix: delete related task when deleting workspace (#20567 ) * Instead of prompting the user to start a deleted workspace (which is silly), prompt them to create a new task instead. * Adds a warning dialog when deleting a workspace * Updates provisionerdserver to delete the related task if a workspace is related to a task	2025-10-30 10:37:51 +00:00
Steven Masley	54497f4f6b	chore: add revocation endpoint to oauth well-known (#20561 ) The revocation endpoint was added to the apps endpoints, but not to the wider site ones. This is a site-wide oauth route.	2025-10-29 16:44:53 -05:00
Mathias Fredriksson	859e94d67a	fix: deprecate codersdk.AITaskPromptParameterName and reduce usage (#20501 ) Depends on coder/sqlc#1 Fixes coder/internal#979 Updates coder/internal#973	2025-10-29 18:59:12 +00:00
Mathias Fredriksson	303e9ef7de	fix: switch to coder/sqlc fork (#20536 ) Refs https://github.com/coder/sqlc/pull/1 Unblocks https://github.com/coder/coder/pull/20501 Upstream https://github.com/sqlc-dev/sqlc/pull/4159	2025-10-29 18:45:56 +02:00
Cian Johnston	1ebc217624	fix: update task link AppStatus using task_id (#20543 ) Fixes https://github.com/coder/coder/issues/20515 Alternative to https://github.com/coder/coder/pull/20519 Adds `task_id` to `workspaces_expanded` view and updates the "View Task" link in `AppStatuses` component. NOTE: this contains a migration	2025-10-29 15:45:45 +00:00
Danielle Maywood	06dbadab11	fix(coderd): ensure lifecycle executor has sufficient task permissions (#20539 ) We recently made a change to the `wsbuilder` to handle task related logic. Our test coverage for the lifecycle executor didn't handle this scenario and so we missed that it had insufficient permissions. This PR adds `Update` and `Read` permissions for `Task`s in the lifecycle executor, as well as an autostart/autostop test tailored to task workspaces to verify the change. --- Anthropic's Claude Sonnet 4.5 Thinking was involved in writing the tests	2025-10-29 15:44:35 +00:00
Cian Johnston	566146af72	fix(coderd): fix audit log resource link for tasks (#20545 ) Existing task audit log links were incorrect. As audit log links are generated on-the-fly, this does not require backfill.	2025-10-29 15:31:41 +00:00
Susana Ferreira	7e8fcb4b0f	perf: optimize prebuilds membership reconciliation to check orgs not presets (#20493 ) ## Description The membership reconciliation ensures the prebuilds system user is a member of all organizations with prebuilds configured. To support prebuilds quota management, each organization must have a prebuilds group that the system user belongs to. ## Problem Previously, membership reconciliation iterated over all presets to check and update membership status. This meant database queries `GetGroupByOrgAndName` and `InsertGroupMember` were executed for each preset. Since presets are unique combinations of `(organization, template, template version, preset)`, this resulted in several redundant checks for the same organization. In dogfood, `InsertGroupMember` was called thousands of times per day, even though memberships were already configured ([internal Grafana dashboard link](https://grafana.dev.coder.com/goto/46MZ1UgDg?orgId=1)). ## Solution This PR introduces `GetOrganizationsWithPrebuildStatus`, a single query that returns: * All unique organizations with prebuilds configured * Whether the prebuilds user is a member of each organization * Whether the prebuilds group exists in each organization * Whether the prebuilds user is in the prebuilds group The membership reconciliation logic now: * Fetches status for all organizations in one query * Only performs inserts for organizations missing required memberships or groups * Safely handles concurrent operations via unique constraint violations This reduces database load from `O(presets)` to `O(organizations)` per reconciliation loop, with a single read query when everything is configured. 
## Changes * Add `GetOrganizationsWithPrebuildStatus` SQL query * Update `membership.ReconcileAll` to use organization-based reconciliation instead of preset-based * Update tests to reflect new behavior Related to internal thread: https://codercom.slack.com/archives/C07GRNNRW03/p1760535570381369	2025-10-29 14:24:29 +00:00
Danny Kopping	b20fd6f2c1	chore: graduate aibridge API out of experimental (#20523 )	2025-10-29 07:18:54 -06:00
Susana Ferreira	aad1b401c1	feat: add prebuilds reconciliation duration metric (#20535 ) ## Description Adds `coderd_prebuilds_reconciliation_duration_seconds` histogram metric to track the duration of each prebuilds reconciliation cycle. This metric helps operators monitor reconciliation performance and identify potential bottlenecks. ## Changes - Added `ReconcileStats` struct to capture reconciliation cycle statistics - Updated `ReconcileAll()` to return stats including elapsed time - Added histogram metric `coderd_prebuilds_reconciliation_duration_seconds`	2025-10-29 12:52:30 +00:00
Danny Kopping	95a1ca898f	chore: remove aibridge experiment (#20520 ) Removes the experiment and all references to it	2025-10-29 06:18:38 -06:00
Susana Ferreira	c3e3bb58f2	feat: delete pending canceled prebuilds (#20499 ) ## Description PR https://github.com/coder/coder/pull/20387 introduced canceling pending prebuild jobs from inactive template versions to avoid provisioning obsolete workspaces. However, the associated prebuilds remained in the database with "Canceled" status, visible in the UI. This PR now orphan-deletes these canceled prebuilt workspaces. Since the canceled jobs were never processed by a provisioner, no Terraform resources were created, making orphan deletion safe. Orphan deletion always creates a provisioner job, but behaves differently based on provisioner availability: - If no provisioner daemon is available, the job is immediately marked as completed and the workspace is marked as deleted without any provisioner processing - If a provisioner daemon is available, it processes the delete job with empty Terraform state (no actual resources to destroy) The job cancellation and workspace deletion occur atomically in the same transaction. We don't split this into two separate reconciliation runs because there's no way to distinguish between system-canceled prebuilds and user-canceled workspaces. If we deleted canceled workspaces in a later run, we'd delete user-canceled workspaces that users may want to keep for troubleshooting. Note: This only applies to system-generated prebuilds from inactive template versions. ## Changes * Update `UpdatePrebuildProvisionerJobWithCancel` query to return job ID, workspace ID, template ID, and template version preset ID * Add `DeprovisionMode` enum to support orphan deletion in the provision flow * Update `ActionTypeCancelPending` handler to cancel jobs and orphan-delete associated workspaces atomically	2025-10-29 10:37:28 +00:00
Callum Styan	45c43d4ec4	fix: refactor agent resource monitoring API to avoid excessive calls to DB (#20430 ) This should resolve https://github.com/coder/internal/issues/728 by refactoring the ResourceMonitorAPI struct to only require querying the resource monitor once for memory and once for volumes, then using the stored monitors on the API struct from that point on. This should eliminate the vast majority of calls to `GetWorkspaceByAgentID` and `FetchVolumesResourceMonitorsUpdatedAfter`/`FetchMemoryResourceMonitorsUpdatedAfter` (millions of calls per week). Tests passed, and I ran an instance of coder via a workspace with a template that added resource monitoring every 10s. Note that this is the default docker container, so there are other sources of `GetWorkspaceByAgentID` db queries. Note that this workspace was running for ~15 minutes at the time I gathered this data. Over 30s for the `ResourceMonitor` calls:
```
coder@callum-coder-2:~/coder$ curl localhost:19090/metrics | grep ResourceMonitor | grep count
coderd_db_query_latencies_seconds_count{query="FetchMemoryResourceMonitorsByAgentID"} 2
coderd_db_query_latencies_seconds_count{query="FetchMemoryResourceMonitorsUpdatedAfter"} 2
coderd_db_query_latencies_seconds_count{query="FetchVolumesResourceMonitorsByAgentID"} 2
coderd_db_query_latencies_seconds_count{query="FetchVolumesResourceMonitorsUpdatedAfter"} 2
coderd_db_query_latencies_seconds_count{query="UpdateMemoryResourceMonitor"} 155
coderd_db_query_latencies_seconds_count{query="UpdateVolumeResourceMonitor"} 155
coder@callum-coder-2:~/coder$ curl localhost:19090/metrics | grep ResourceMonitor | grep count
coderd_db_query_latencies_seconds_count{query="FetchMemoryResourceMonitorsByAgentID"} 2
coderd_db_query_latencies_seconds_count{query="FetchMemoryResourceMonitorsUpdatedAfter"} 2
coderd_db_query_latencies_seconds_count{query="FetchVolumesResourceMonitorsByAgentID"} 2
coderd_db_query_latencies_seconds_count{query="FetchVolumesResourceMonitorsUpdatedAfter"} 2
coderd_db_query_latencies_seconds_count{query="UpdateMemoryResourceMonitor"} 158
coderd_db_query_latencies_seconds_count{query="UpdateVolumeResourceMonitor"} 158
```
And over 1m for the `GetWorkspaceByAgentID` calls, the majority of which are from the workspace metadata stats updates:
```
coder@callum-coder-2:~/coder$ curl localhost:19090/metrics | grep GetWorkspaceByAgentID | grep count
coderd_db_query_latencies_seconds_count{query="GetWorkspaceByAgentID"} 876
coder@callum-coder-2:~/coder$ curl localhost:19090/metrics | grep GetWorkspaceByAgentID | grep count
coderd_db_query_latencies_seconds_count{query="GetWorkspaceByAgentID"} 918
```
--------- Signed-off-by: Callum Styan <callumstyan@gmail.com>	2025-10-28 13:38:16 -07:00
Danielle Maywood	a1e7e105a4	chore: disable task notifications by default (#20518 ) Relates to https://github.com/coder/internal/issues/1098 Currently task notifications are incredibly noisy. We should disable them by default for the upcoming release whilst we iron them out.	2025-10-28 17:21:23 +00:00
Cian Johnston	659f89e079	feat(coderd): add owner-related fields to tasks_with_status view (#20471 ) Relates to https://github.com/coder/coder/pull/20431/files#diff-9cfc826a6ce7e77d977b2025482474dd263d12965b2a94479a74c7f1d872b782 If the workspace relating to a task was deleted, most of the workspace-related fields in `taskFromDBTaskAndWorkspace` will be zero-valued. However, we can still get information relating to the owner so that "created by" shows up correctly in the UI. Updates the `tasks_with_status` view with a join on `visible_users` to get owner-related info.	2025-10-28 14:29:29 +00:00
Mathias Fredriksson	a1fa58ac17	fix: update dbgen and dbfake task creation and toolsdk test fixtures (#20508 ) Depends on #20506 Fixes coder/internal#1103	2025-10-28 14:15:58 +02:00
Danny Kopping	d18441debe	feat: add AWS Bedrock support (#20507 ) Depends on https://github.com/coder/aibridge/pull/44 Closes https://github.com/coder/aibridge/issues/28 --------- Signed-off-by: Danny Kopping <danny@coder.com>	2025-10-28 03:38:14 +00:00
ケイラ	4f7b279fd8	feat: add an organization member permission level (#19953 )	2025-10-27 17:14:16 -06:00
Mathias Fredriksson	c3cbd977f1	fix(coderd/database/dbfake): use transaction for workspace builder (#20506 ) While investigating a flake I noticed that the dbfake workspace builder executes all database inserts without a transaction. Since our real wsbuilder implementation utilizes one, it makes sense to do so here as well. For example, our normal workspace <-> build relationship is such that a workspace cannot exist without at least one build. However, our GetWorkspaces query left joins workspace builds but has types that are non-nullable, leading to flakes like coder/internal#1103.	2025-10-28 01:06:52 +02:00
Dean Sheather	5a3ceb38f0	chore: add aibridge data to telemetry (#20449 ) - Adds a new table to keep track of which payloads have already been reported since we only report for the last clock hour - Adds a query to gather and aggregate all the data by provider/model/client Relates to https://github.com/coder/coder-telemetry-server/issues/27	2025-10-28 03:16:41 +11:00
ケイラ	d9c40d61c2	refactor: clean up policy.rego (#20366 )	2025-10-27 10:01:30 -06:00
Spike Curtis	af3ff825a1	test: track postgres database creation by package and test name (#20492 ) Adds columns to track package and test name to test_databases table, and populates them as databases are created using the Broker. In order to seamlessly work with existing `coder_database` databases with the old schema, the SQL that creates the table and columns is additive and idempotent, so we run it every time we initialize the Broker (once per test binary execution). We include a transaction-level advisory lock to prevent deadlocks before attempting to alter the schema. I was seeing deadlocks without this.	2025-10-27 14:31:32 +04:00
Paweł Banaszewski	50ba223aa1	feat: add db query for setting interception ended_at field (#20437 ) Adds UpdateAIBridgeInterceptionEnded query to mark interceptions as done. Needed for https://github.com/coder/internal/issues/1051	2025-10-27 09:51:37 +01:00
Cian Johnston	b8a0f97cab	chore(coderd): add test for deleting task with no workspace (#20466 )	2025-10-24 18:19:05 +01:00
Susana Ferreira	f6e86c6fdb	feat: cancel pending prebuilds from non-active template versions (#20387 ) ## Description This PR introduces an optimization to automatically cancel pending prebuild-related jobs from non-active template versions in the reconciliation loop. ## Problem Currently, when a template is configured with more prebuild instances than available provisioners, the provisioner queue can become flooded with pending prebuild jobs. This issue is worsened when provisioning/deprovisioning operations take a long time. When the prebuild reconciliation loop generates jobs faster than provisioners can process them, pending jobs accumulate in the queue. Since prebuilt workspaces should always run the latest active template version, pending prebuild jobs from non-active versions become obsolete once a new version is promoted. ## Solution The reconciliation loop cancels pending prebuild-related jobs from non-active template versions that match the following criteria: * Build number: 1 (initial build created by the reconciliation loop) * Job status: `pending` * Not yet picked up by a provisioner (`worker_id` is `NULL`) * Owned by the prebuilds system user * Workspace transition: `start` This prevents the queue from being cluttered with stale prebuild jobs that would provision workspaces on an outdated template version that would consequently need to be deprovisioned. ## Changes * Added new SQL query `CountPendingNonActivePrebuilds` to identify presets with pending jobs from non-active versions * Added new SQL query `UpdatePrebuildProvisionerJobWithCancel` to cancel jobs for a specific preset * New reconciliation action type `ActionTypeCancelPending` handles the cancellation logic * Cancellation is non-blocking: failures to cancel prebuild jobs are logged as errors and don't prevent other reconciliation actions ## Follow-up PR Canceling pending prebuild jobs leaves workspaces in a Canceled state. 
While no Terraform resources need to be destroyed (since jobs were canceled before provisioning started), these database records should still be cleaned up. This will be addressed in a follow-up PR. Closes: https://github.com/coder/coder/issues/20242	2025-10-24 15:27:49 +01:00
Mathias Fredriksson	51d3abb904	feat(site): use new task data model and endpoints (#20431 ) Updates the UI to use the new API endpoints for tasks and their new data model. Disclaimer: Since the base data model for tasks changed, we had to do quite a large refactor and I'm sorry for that 🙏, but you'll notice most of the changes are to adjust the types. Closes coder/internal#976 --------- Co-authored-by: Bruno Quaresma <bruno_nonato_quaresma@hotmail.com>	2025-10-24 10:45:19 -03:00
Thomas Kosiewski	c6e551f538	fix: renumber api key allow list migration (#20457 )	2025-10-24 11:54:51 +00:00
Thomas Kosiewski	f684831f56	feat: add allow list to API keys (#19972 ) Add API key allow list to the SDK This PR adds an allow list to API keys in the SDK. The allow list is a list of targets that the API key is allowed to access. If the allow list is empty, a default allow list with a single entry that allows access to all resources is created. The changes include: - Adding a default allow list when generating an API key if none is provided - Adding allow list to the API key response in the SDK - Converting database allow list entries to SDK format in the API response - Adding tests to verify the default allow list behavior Fixes #19854	2025-10-24 12:33:56 +01:00
dependabot[bot]	f947a34103	ci: bump the github-actions group across 1 directory with 15 updates (#20384 ) Co-authored-by: github-actions[bot] Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: M Atif Ali <atif@coder.com> Co-authored-by: Ethan Dickson <ethan@coder.com>	2025-10-24 16:06:44 +05:00
Danielle Maywood	e60112e54f	chore(coderd): introduce TaskAppID and deprecate AITaskSidebarAppID (#20336 ) As we're moving away from the SidebarAppID nomenclature, this PR introduces a new `TaskAppID` field to `codersdk.WorkspaceBuild` and deprecates the `AITaskSidebarAppID` field. They both contain the same value.	2025-10-24 10:57:32 +01:00
Danielle Maywood	5a31c590e6	fix(coderd/provisionerdserver): pipe through task id and prompt (#20408 ) Pipes through the Task's ID and prompt into the provisioner. This is required to support the new `coder_ai_task.prompt` field and modified `coder_ai_task.id` field.	2025-10-24 09:43:48 +01:00
Steven Masley	13ca9ead3a	chore!: ensure consistent secret token generation and hashing (#20388 ) This PR uses the same sha256 hashing technique as we use for APIKeys. So now all randomly generated secrets will be hashed with sha256 for consistency. This is a breaking change for the oauth tokens. Since oauth is only allowed for dev builds and experimental, this is ok.	2025-10-23 15:38:49 -05:00
Mathias Fredriksson	a106d67c07	feat(coderd): use task data model for list (#20394 ) Updates coder/internal#976	2025-10-23 20:22:51 +03:00
Mathias Fredriksson	2c6cbf15e2	feat(coderd): use task data model for send/logs (#20381 ) Updates coder/internal#976	2025-10-23 20:10:50 +03:00
Mathias Fredriksson	c6f63990cf	feat(coderd): use task data model when fetching a task (#20380 ) Updates coder/internal#976	2025-10-23 19:58:47 +03:00
Mathias Fredriksson	9855460524	feat(coderd): use new data model for task delete (#20334 ) Updates coder/internal#976	2025-10-23 19:45:18 +03:00
Mathias Fredriksson	79728c30fa	chore(coderd/database/migrations): migrate tasks to new data model (#20434 ) Updates coder/internal#976 Closes coder/internal#1078	2025-10-23 19:29:23 +03:00
Mathias Fredriksson	5c802c2627	feat(coderd): use task data model when creating a new task (#20275 ) Updates coder/internal#976	2025-10-23 19:12:09 +03:00
Hugo Dutka	e62c5db678	chore: remove references to dbtestutil.WillUsePostgres (#20436 ) Addresses https://github.com/coder/internal/issues/758. This PR only cleans up dead code, it makes no changes to test logic.	2025-10-23 14:24:54 +02:00
Paweł Banaszewski	4244b20823	feat: add ended_at column to aibridge_interceptions table (#20432 ) Needed for marking interceptions as done (https://github.com/coder/internal/issues/1051).	2025-10-23 13:29:05 +02:00
Jake Howell	d455f6ea2b	fix: rename `total` to `count` in `AIBridgeListInterceptionsResponse` (#20410 ) Thanks to the great work in #20393, we’ve successfully introduced offset-based pagination for this endpoint. However, the frontend expects a `count` field in the response rather than `total`. This PR updates the response payload to rename the returned key to `count` for consistency with frontend expectations and existing API patterns. This is necessary to unblock the work in #20331	2025-10-23 13:19:12 +11:00
Steven Masley	4bd7c7b7e0	feat: implement oauth2 RFC 7009 token revocation endpoint (#20362 ) Adds RFC 7009 token revocation endpoint	2025-10-22 15:18:42 -05:00
Marcin Tojek	f2a410566c	feat: add support buttons (#20339 ) Fixes: https://github.com/coder/coder/issues/16804	2025-10-22 15:35:16 +02:00
Dean Sheather	69c2c40512	chore: add user details to aibridge interception list endpoint (#20397 ) - Adds FK from `aibridge_interceptions.initiator_id` to `users.id` - This is enforced by deleting any rows that don't have any users. Since this is an experimental feature AND coder never deletes user rows I think this is acceptable. - Adds `name` as a property on `codersdk.MinimalUser` - This matches the `visible_users` view in the database. I'm unsure why `name` wasn't already included given that `username` is. - Adds a new `initiator` field to `codersdk.AIBridgeInterception` which contains `codersdk.MinimalUser` (ID, username, name, avatar URL) - Removes `initiator_id` from `codersdk.AIBridgeInterception` - Should be fine since we're still in early access	2025-10-22 16:18:31 +11:00
Zach	9da60a9dc5	chore: migrate from tenv linter to usetesting linter (#20401 ) The tenv linter is deprecated in favor of usetesting which offers a superset of lint checks. This message is seen when running `make lint` ``` [nix-shell:~/src/coder]$ make lint <snip> WARN The linter 'tenv' is deprecated (since v1.64.0) due to: Duplicate feature in another linter. Replaced by usetesting. <snip> ``` This change swaps out the deprecated tenv linter for the usetesting linter, and configures it for linting parity. https://github.com/coder/coder/issues/20398	2025-10-21 15:10:47 -06:00
Steven Masley	86f0f39863	chore: make authz recorder opt in (#20310 ) The authz recorder is causing a lot of memory to be allocated, and is a memory leak for websocket connections. This change makes it opt-in on a per-request basis (on top of `isDev`). To get the authz headers, use `Copy as cURL` in Chrome and append the header `x-authz-checks=true`.	2025-10-21 14:15:37 +00:00
Dean Sheather	ea261a1f7c	chore: add offset-based pagination support to aibridge list endpoint (#20393 ) Necessary for the frontend to be able to paginate easily. Cursor pagination is good for fetching all events, but doesn't play very well when a pagination component gets involved. Adds support for `?offset=x` to the existing endpoint. The cursor-based pagination (`?after_id=x`) is still supported. The two pagination modes are mutually exclusive, and are documented as such. If both are supplied, the request will be rejected. Also adds a `total` property to the response that contains the full count of items matching the filter. We already have indices in place so I don't think this will impact performance (or we can revisit it before GA).	2025-10-21 11:50:00 +00:00
Dean Sheather	0652b18ebc	feat: mount pprof and metrics to /api/v2/debug for admins (#20353 ) Adds the following debug routes for people with the `debug_info:read` permission: `/api/v2/debug/pprof` for `net/http/pprof` (subroutes `/`, `/cmdline`, `/profile`, `/symbol`, `/trace`, `/*`) and `/api/v2/debug/metrics` for Prometheus metrics.	2025-10-21 03:13:11 +00:00
Callum Styan	5a18cf4c86	fix: remove unintentionally added print in test code (#20391 ) accidentally added in https://github.com/coder/coder/pull/19786 Signed-off-by: Callum Styan <callumstyan@gmail.com>	2025-10-20 18:51:15 -07:00
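
Commit ea261a1f7c (#20393) describes two mutually exclusive pagination modes on the aibridge list endpoint: `?offset=x` and `?after_id=x`, with requests that supply both being rejected. A minimal Go sketch of that validation rule follows. The names `pagination`, `errBothModes` and `parsePagination` are hypothetical and are not taken from the coder codebase; only the two query parameter names come from the commit message.

```go
package main

import (
	"errors"
	"fmt"
	"net/url"
	"strconv"
)

// pagination is a hypothetical holder for the two modes described in #20393.
type pagination struct {
	Offset  int    // offset-based mode (?offset=x)
	AfterID string // cursor-based mode (?after_id=x)
}

// errBothModes is returned when a request mixes the two modes.
var errBothModes = errors.New("offset and after_id are mutually exclusive")

// parsePagination reads the pagination parameters from a raw query string
// and rejects requests that supply both offset and after_id.
func parsePagination(rawQuery string) (pagination, error) {
	q, err := url.ParseQuery(rawQuery)
	if err != nil {
		return pagination{}, fmt.Errorf("parse query: %w", err)
	}
	hasOffset, hasCursor := q.Has("offset"), q.Has("after_id")
	if hasOffset && hasCursor {
		return pagination{}, errBothModes
	}
	var p pagination
	if hasOffset {
		n, err := strconv.Atoi(q.Get("offset"))
		if err != nil || n < 0 {
			return pagination{}, fmt.Errorf("invalid offset %q", q.Get("offset"))
		}
		p.Offset = n
	}
	if hasCursor {
		p.AfterID = q.Get("after_id")
	}
	return p, nil
}
```

Under these assumptions, `parsePagination("offset=20")` yields an offset of 20, while `parsePagination("offset=20&after_id=abc")` returns `errBothModes`. How the real handler reports the rejection (status code, error body) is not stated in the commit message and is left out here.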

2982 Commits