coder

mirror of https://github.com/coder/coder.git synced 2026-06-03 04:58:23 +00:00

Author	SHA1	Message	Date
Ethan	c650aabbef	chore: standardize on _internal_test.go for white-box tests (#25601 ) My agent added `//nolint:testpackage` to a test file on one of my PRs. Again. This PR cleans it up across the entire repo and updates the in-repo conventions so future agents stop doing it. The repo already has a precedent for white-box tests that need to touch unexported symbols: `_internal_test.go` (145+ existing files). The `testpackage` linter's default `skip-regexp` exempts that filename suffix, so the `//nolint:testpackage` directive is unnecessary in every case where someone reached for it. This PR renames 51 such files to `_internal_test.go` via `git mv` so blame and history follow, and strips the dead directive from 2 files that were already correctly named (`coderd/oauth2provider/authorize_internal_test.go`, `coderd/x/chatd/advisor_internal_test.go`). `.claude/docs/TESTING.md` now documents the rule explicitly under Test Package Naming, which is imported into the root `AGENTS.md` via `@.claude/docs/TESTING.md`. The rule: prefer `package foo_test`; if you need internal access, rename the file to `_internal_test.go` rather than adding a nolint directive.	2026-05-22 20:24:38 +10:00
Zach	ddc0e99c69	chore: remove coder_secret Terraform integration (#25512 ) Removes the coder_secret Terraform integration: the data.coder_secret consumption path through provisionerdserver → provisioner.proto → provisioner/terraform, the dynamic-parameter secret-requirement validation, and the workspace-update / resolve-autostart surfaces that depended on it. This is being done due to a product/feature direction change (see PLAT-243). User-secret CRUD (DB, REST, CLI, UI, telemetry, audit) and the agent-manifest secret-injection path are untouched. The provisionerd API is bumped from v1.17 to v1.18 rather than rolled back: v1.17 shipped in v2.33.x, so user_secrets field numbers are reserved and the changelog documents both versions. Generated with assistance from Coder Agents.	2026-05-21 09:19:29 -06:00
Ethan	ef0151601e	feat: report insufficient quota build failures in chat tools (#24956 ) ## Summary When a workspace build fails because the user is over their group quota, the chat tools currently surface the failure as a bare `"workspace build failed: insufficient quota"` string with no machine-readable error code and no visibility into the user's current usage. Agents and the UI cannot distinguish quota failures from any other Terraform error, so users see an opaque message and have no clear path to recovery. This PR tags quota failures with a typed error code at the source and propagates it through the chat tool layer so callers can react to it explicitly. Relates to CODAGT-20 ## Changes Provisioner runner - Add `InsufficientQuotaErrorCode = "INSUFFICIENT_QUOTA"` and set it explicitly at the `commitQuota` failure site via a new `failedWorkspaceBuildfCode` helper, so `provisioner_jobs.error_code` is populated only on the genuine quota path. The substring matcher used for externally produced sentinels (e.g. `"missing parameter"`, `"required template variables"`) is intentionally not extended; provider errors that happen to mention "insufficient quota" stay classified as generic build failures. SDK and API contract - Add `JobErrorCodeInsufficientQuota` and a `JobIsInsufficientQuotaErrorCode` helper to `codersdk`. - Extend the swagger `enums` tag on `ProvisionerJob.ErrorCode` to include `INSUFFICIENT_QUOTA`. - Regenerate `coderd/apidoc`, `docs/reference/api/`, and `site/src/api/typesGenerated.ts`. chattool create_workspace / start_workspace - `waitForBuild` now returns a typed `*workspaceBuildError` carrying both the message and the `JobErrorCode`, instead of a bare error string.
- New `quotaerror.go` introduces a structured `quotaErrorResult` (with `error_code`, `title`, `message`, `build_id`, and optional `quota`) and a best-effort `workspaceQuotaDetails` lookup that wraps owner authorization internally and fetches `credits_consumed` and `budget` from the database. Quota lookup failures (including authorization failures) never block the failure payload. - On quota-coded build failures, both `create_workspace` and `start_workspace` now return the structured response (with the recovery guidance inlined into `message`) instead of the bare `"insufficient quota"` string. This applies to all three failure paths: post-creation, an in-progress existing build, and a freshly triggered start build. Non-quota build failures continue to use the existing `buildToolResponse` / `newBuildError` path. - Owner authorization is wrapped only on the call sites that need it (the `CreateFn` and `StartFn` invocations and the quota-detail lookup), so idempotent fast paths (already running, already in progress, existing-workspace early returns) do not pay for an extra RBAC round-trip or fail when role lookup is transient. ## Out of scope - No changes to quota math, allowances, or bypass behavior. - No automatic retries. - No new quota-inspection tools and no changes to MCP `coder_create_workspace` (which returns immediately and never observed the build outcome here). - No frontend UI changes; those will land in a follow-up PR that consumes the new `INSUFFICIENT_QUOTA` code.	2026-05-07 15:01:58 +10:00
Zach	79735f2d45	feat: plumb user secrets through provisioner chain to terraform (#24542 ) This change passes user secrets from coderd to the Terraform process at workspace build time so the `data.coder_secret` data source in terraform-provider-coder can resolve values at plan time. Secrets traverse two proto hops: `provisionerdserver` fetches them via `ListUserSecretsWithValues` and attaches them to `AcquiredJob.WorkspaceBuild.user_secrets` on `provisionerd.proto`; `runner.go` forwards them into `PlanRequest.user_secrets` on `provisioner.proto`; the Terraform provisioner encodes each as `CODER_SECRET_ENV_<name>` or `CODER_SECRET_FILE_<hex(path)>` before invoking `terraform plan`. Only plan requests carry secrets; apply runs with `nil` because values are baked into plan state. Fetch is gated on a workspace transitioning to start. Stop and delete transitions never carry secrets, so revoking or deleting a stored secret cannot make a workspace unstoppable. DB errors on the fetch fail the job outright rather than silently continuing with an empty secret set. Note that user secrets will be stored in the workspace_builds table in provisioner_state with other Terraform state (including other sensitive data).	2026-04-27 08:26:07 -06:00
Steven Masley	e13f2a9869	chore: remove extra `stop_modules` from provisionerd proto (#21706 ) Was a duplicate of start_modules Closes https://github.com/coder/coder/issues/21206	2026-01-28 09:25:47 -06:00
Steven Masley	60b3fd0783	chore!: send modules archive over the proto messages (#21398 ) # What this does Dynamic parameters caches the `./terraform/modules` directory for parameter usage. This PR sends that archive to the provisioner when building workspaces. This allows Terraform to skip downloading modules from their registries, a step that takes seconds. # Wire protocol The wire protocol reuses the same mechanism used to download the modules `provisioner -> coder`. It splits large archives into multiple protobuf messages so they can be sent under the message size limit. # 🚨 Behavior Change (Breaking Change) 🚨 Before this PR, modules were downloaded on every workspace build, so unpinned modules always fetched the latest version. After this PR, modules are cached at template import time, and their versions are effectively pinned for all subsequent workspace builds.	2026-01-09 11:33:34 -06:00
Steven Masley	89f4d60e7b	chore: remove experiment "terraform-directory-reuse" (#21397 ) Experiment is no longer required, the new method will be released without an experiment and without a toggle Main PR is: https://github.com/coder/coder/pull/21398	2026-01-09 11:13:16 -06:00
Spike Curtis	bddb808b25	chore: arrange imports in a standard way (#21452 ) Fixes all our Go file imports to match the preferred spec that we've _mostly_ been using. For example: ``` import ( "context" "time" "github.com/prometheus/client_golang/prometheus" "golang.org/x/xerrors" "gopkg.in/natefinch/lumberjack.v2" "cdr.dev/slog/v3" "github.com/coder/coder/v2/codersdk/agentsdk" "github.com/coder/serpent" ) ``` 3 groups: standard library, 3rd party libs, Coder libs. This PR makes the change across the codebase. The PR in the stack above modifies our formatting to maintain this state of affairs, and is a separate PR so it's possible to review that one in detail.	2026-01-08 15:24:11 +04:00
Spike Curtis	49b34a716a	fix: fix slog to always use array of Fields (#21426 ) Upgrades to slog v3 which includes a small, but backward incompatible API change to the acceptable call arguments when logging. This change allows us to verify via compile time type checking that arguments are correct and won't cause a panic, as was possible in slog v1, which this replaces (v2 was tagged but never used in coder/coder). It also updates dependencies that also use slog and were updated. I've left the `aibridge` dependency as a commit SHA, under the assumption that the team there (cc @pawbana @dannykopping ) will tag and update the dependency soon and on their own schedule. Other dependencies, I pushed new tags.	2026-01-08 10:29:41 +04:00
Steven Masley	3194bcfc9e	chore: distinct operations for provisioner's 'parse', 'init', 'plan', 'apply', 'graph' (#21064 ) Provisioner steps broken into smaller granular actions. Changes: - `ExtractArchive` moved to `init` request (was in `configure`) - Writing `tfstate` moved to `plan` (was in `configure`) - Moved most plan/apply outputs to `GraphComplete`	2025-12-15 11:26:41 -06:00
Steven Masley	9149c1e9f2	chore: append template metadata to protobuf config (#20558 ) Adds some extra metadata sent to provisioners. Also adds a field `reuse_terraform_workspace` to tell the provisioner whether or not to use the caching experiment.	2025-11-12 12:46:39 -06:00
Kacper Sawicki	9edceef0bf	feat(coderd): add support for external agents to API's and provisioner (#19286 ) This pull request introduces support for external workspace management, allowing users to register and manage workspaces that are provisioned and managed outside of the Coder. Depends on: https://github.com/coder/terraform-provider-coder/pull/424 * GET /api/v2/init-script - Gets the agent initialization script * By default, it returns a script for Linux (amd64), but with query parameters (os and arch) you can get the init script for different platforms * GET /api/v2/workspaces/{workspace}/external-agent/{agent}/credentials - Gets credentials for an external agent (enterprise) * Updated queries to filter workspaces/templates by the has_external_agent field	2025-08-19 10:41:33 +02:00
Danny Kopping	0238f2926d	feat: persist AI task state in template imports & workspace builds (#18449 )	2025-06-24 10:36:37 +00:00
Steven Masley	c1341cccdd	feat: use proto streams to increase maximum module files payload (#18268 ) This PR implements protobuf streaming to handle large module files by: 1. Streaming large payloads: When module files exceed the 4MB limit, they're streamed in chunks using a new UploadFile RPC method 2. Database storage: Streamed files are stored in the database and referenced by hash for deduplication 3. Backward compatibility: Small module files continue using the existing direct payload method	2025-06-13 12:46:26 -05:00
Steven Masley	0428c5ec1c	chore: include 'everyone' group in template importing (#18257 )	2025-06-05 19:25:36 +00:00
Danny Kopping	6e967780c9	feat: track resource replacements when claiming a prebuilt workspace (#17571 ) Closes https://github.com/coder/internal/issues/369 We can't know whether a replacement (i.e. drift of terraform state leading to a resource needing to be deleted/recreated) will take place a priori; we can only detect it at `plan` time, because the provider decides whether a resource must be replaced and it cannot be inferred through static analysis of the template. This is likely to be the most common gotcha with using prebuilds, since it requires a slight template modification to use prebuilds effectively, so let's head this off before it's an issue for customers. Drift details will now be logged in the workspace build logs: ![image](https://github.com/user-attachments/assets/da1988b6-2cbe-4a79-a3c5-ea29891f3d6f) Plus a notification will be sent to template admins when this situation arises: ![image](https://github.com/user-attachments/assets/39d555b1-a262-4a3e-b529-03b9f23bf66a) A new metric - `coderd_prebuilt_workspaces_resource_replacements_total` - will also increment each time a workspace encounters replacements. We only track _that_ a resource replacement occurred, not how many. Just one is enough to ruin a prebuild, but we can't know a priori which replacement would cause this. For example, say we have 2 replacements: a `docker_container` and a `null_resource`; we don't know which one might cause an issue (or indeed if either would), so we just track the replacement. --------- Signed-off-by: Danny Kopping <dannykopping@gmail.com>	2025-05-14 14:52:22 +02:00
Steven Masley	398b999d8f	chore: pass previous values into terraform apply (#17696 ) Pass previous workspace build parameter values into the terraform `plan/apply`. Enforces monotonicity in terraform as well as `coderd`.	2025-05-12 15:32:00 -05:00
ケイラ	d0ab91c16f	fix: reduce size of terraform modules archive (#17749 )	2025-05-12 13:50:07 -06:00
Jon Ayers	a9f1a6b2a2	fix: revert fix: persist terraform modules during template import (#17665 ) (#17734 ) This reverts commit `ae3d90b057`.	2025-05-08 22:03:08 -04:00
ケイラ	ae3d90b057	fix: persist terraform modules during template import (#17665 )	2025-05-08 16:13:46 -06:00
Jon Ayers	17ddee05e5	chore: update golang to 1.24.1 (#17035 ) - Update go.mod to use Go 1.24.1 - Update GitHub Actions setup-go action to use Go 1.24.1 - Fix linting issues with golangci-lint by: - Updating to golangci-lint v1.57.1 (more compatible with Go 1.24.1) 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <claude@anthropic.com>	2025-03-26 01:56:39 -05:00
ケイラ	5b3eda6719	chore: persist template import terraform plan in postgres (#17012 )	2025-03-24 10:01:50 -06:00
Sas Swart	46e04c68e3	feat(provisioner): add support for presets to coder provisioners (#16574 ) This pull request adds support for presets to coder provisioners. If a template defines presets using a compatible version of the provider, then this PR will allow those presets to be persisted to the control plane database for use in workspace creation.	2025-02-17 13:00:44 +02:00
Danielle Maywood	a6e054a115	chore(provisionerd): use correct log levels for template provisioner logs (#16232 ) Relates to https://github.com/coder/coder/issues/14062 Previously a `logProvisionerJobLog` helper was added in https://github.com/coder/coder/pull/6508 to forward logs from the provisioner at the correct log level, but this was only used for logs produced in `buildWorkspace`. This PR uses this helper for forwarding logs produced in `runTemplateImportParse` and `runTemplateImportProvisionWithRichParameters` at the correct log level.	2025-01-23 09:27:10 +00:00
Kevin Ha	c5287910f9	feat: add workspace build timing metrics (#15771 ) This PR introduces a new Prometheus metric, `workspace_build_timing_seconds`, which specifically reports workspace build times. To reduce cardinality, this metric excludes the `workspace_name` and `workspace_owner` labels that are present on the `workspace_builds_total` metric.	2024-12-11 05:36:48 +00:00
Hugo Dutka	aa0dc2daa1	chore: track terraform modules in telemetry (#15450 ) Addresses https://github.com/coder/nexus/issues/35. This PR: - Adds a `workspace_modules` table to track modules used by the Terraform provisioner in provisioner jobs. - Adds a `module_path` column to the `workspace_resources` table, allowing to identify which module a resource originates from. - Starts pushing this new information into telemetry. For the person reviewing this PR, do not fret about the 1,500 new lines - ~1,000 of them are auto-generated.	2024-11-16 21:56:19 +01:00
Ethan	208a5beb95	fix: improve duplicate template version name error (#14572 )	2024-09-06 16:13:34 +10:00
Cian Johnston	5366f2576f	fix(provisionerd/runner): do not log entire resources (#14538 ) fix(coderd/workspaceagentsrpc): do not log entire agent fix(provisionerd/runner): do not log entire resources	2024-09-04 10:23:34 +01:00
Danny Kopping	6960d194ae	feat: add provisioning timings to understand slow build times (#14274 )	2024-08-21 14:18:58 +02:00
Marcin Tojek	b8b80fe6d2	feat: store `coder_workspace_tags` in the database (#13294 )	2024-05-20 13:30:19 +00:00
Kayla Washburn-Love	475c3650ca	feat: add support for optional external auth providers (#12021 )	2024-02-21 11:18:38 -07:00
Garrett Delfosse	5b122d108e	fix: publish workspace update on quota failure (#11559 )	2024-01-11 14:59:40 -05:00
Kyle Carberry	5596fb20b5	chore: move `/gitauth` to `/externalauth` on the frontend (#9954 ) * chore: move `/gitauth` to `/externalauth` on the frontend This actually took a lot more jank than anticipated, so I wanted to split this up before adding the ability to embed new providers. * Rename FE * Fix em' up * Fix linting error * Fix e2e tests * chore: update helm golden files	2023-09-30 14:30:01 -05:00
Kyle Carberry	8abca9bea7	chore: rename `git_auth` to `external_auth` in our schema (#9935 ) * chore: rename `git_auth` to `external_auth` in our schema We're changing Git auth to be external auth. It will support any OAuth2 or OIDC provider. To split up the larger change I want to contribute the schema changes first, and I'll add the feature itself in another PR. * Fix names * Fix outdated view * Rename some additional places * Fix sort order * Fix template versions auth route * Fix types * Fix dbauthz	2023-09-29 19:13:20 +00:00
Spike Curtis	60d5002eb6	refactor: change template archive extraction to be on provisioner (#9264 ) * refactor provisionersdk protocol Signed-off-by: Spike Curtis <spike@coder.com> * refactor provisioners to use new protocol Signed-off-by: Spike Curtis <spike@coder.com> * refactor provisionerd to use new protocol Signed-off-by: Spike Curtis <spike@coder.com> * refactor tests & proto renames * Fixes from self-review Signed-off-by: Spike Curtis <spike@coder.com> * appease fmt & link Signed-off-by: Spike Curtis <spike@coder.com> * code review fixes & e2e fixes Signed-off-by: Spike Curtis <spike@coder.com> * More fmt Signed-off-by: Spike Curtis <spike@coder.com> * Code review fixes Signed-off-by: Spike Curtis <spike@coder.com> * new gen; use uuid for session workdir Signed-off-by: Spike Curtis <spike@coder.com> * Revert nix-based gen CI task until dogfood is on nix Signed-off-by: Spike Curtis <spike@coder.com> * revert deleting dogfood Docker stuff Signed-off-by: Spike Curtis <spike@coder.com> * Revert "revert deleting dogfood Docker stuff" This reverts commit `9762158167`. --------- Signed-off-by: Spike Curtis <spike@coder.com>	2023-08-25 06:10:15 +00:00
Ammar Bandukwala	545a256b57	fix: correctly reject quota-violating builds (#9233 ) Due to a logical error in CommitQuota, all workspace Stop->Start operations were being accepted, regardless of the Quota limit. This issue only appeared after #9201, so this was a minor regression in main for about 3 days. This PR adds a test to make sure this kind of bug doesn't recur. To make the new test possible, we give the echo provisioner the ability to simulate responses to specific transitions.	2023-08-22 02:55:39 +00:00
Ammar Bandukwala	6b8102cf4c	feat(cli): add daily_cost to `coder ls` (#9200 )	2023-08-19 12:56:08 -05:00
Kyle Carberry	22e781eced	chore: add /v2 to import module path (#9072 ) * chore: add /v2 to import module path go mod requires semantic versioning with versions greater than 1.x This was a mechanical update by running: ``` go install github.com/marwan-at-work/mod/cmd/mod@latest mod upgrade ``` Migrate generated files to import /v2 * Fix gen	2023-08-18 18:55:43 +00:00
Asher	7ed17b2605	fix: add some missing workspace updates (#7790 ) * Standardize on function to get workspace channel name There were two, now there is one. * Add some missing workspace updates There are some failure cases where we do not set the type as a workspace build which causes the workspace update to never be published. * Make build failures warnings Otherwise the associated test fails due to the logger fataling on error messages.	2023-07-14 15:07:48 -08:00
Marcin Tojek	c6fcd7ee93	fix: report failed CompletedJob (#8318 )	2023-07-06 07:26:33 +00:00
Dean Sheather	98a5ae7f48	feat: add provisioner job hang detector (#7927 )	2023-06-25 13:17:00 +00:00
Marcin Tojek	4fb4c9b270	chore: add more rules to ensure logs consistency (#8104 )	2023-06-21 12:00:38 +02:00
Mathias Fredriksson	c12c9f1f4e	chore(go.mod): update cdr.dev/slog (#7994 ) * chore(mod): update cdr.dev/slog * fix: change uses of []slog.Field to []any to match new API	2023-06-13 18:17:04 +00:00
goodspark	0665a6c2f2	feat: add metric for provisioner daemons (#7858 )	2023-06-06 16:50:11 -05:00
Marcin Tojek	a7366a8b76	feat!: drop support for legacy parameters (#7663 )	2023-06-02 11:16:46 +02:00
Colin Adler	085330ad96	fix(provisionerd): only heartbeat when logs aren't being flushed (#7110 )	2023-04-13 14:02:10 -05:00
ElliotG	0069831e8d	fix: use error log when failing provisioner job (#6812 ) Co-authored-by: Colin Adler <colin1adler@gmail.com>	2023-04-05 13:30:53 -05:00
Colin Adler	a29fc7dd6f	chore: update otel to v1.14.0 (#6963 )	2023-04-03 00:31:39 -05:00
Marcin Tojek	0ba200c2a1	feat: Enable workspace debug logging (#6838 ) * feat: Enable workspace debug logging * Fix * Fix * Fix * fix * fix * Enable RBAC * unit tests * Fix * fix * fix * fix * more tests * fix: workspacebuild_test use roles * fix: swagger comment * fix: ctx.Done * fix: address PR comments * break loop	2023-03-30 16:00:33 +02:00
Kyle Carberry	df31636e72	feat: pass `access_token` to `coder_git_auth` resource (#6713 ) This allows template authors to leverage git auth to perform custom actions, like clone repositories.	2023-03-22 19:37:08 +00:00

76 Commits