coder

mirror of https://github.com/coder/coder.git synced 2026-06-03 21:18:24 +00:00

Author	SHA1	Message	Date
Spike Curtis	56eb57caf4	chore: enable agent socket by default (#22352 ) relates to #21335 Enables the agent socket by default and updates docs to strike references to having to enable it. The PRs in this stack change the MCP server that Tasks use to update their status to rely on the agent socket, rather than directly dialing Coderd with the agent token. Default disable was a reasonable default when it was only used for the experimental script ordering features, but now that we want to use it for Tasks, it should be default on.	2026-03-03 21:23:59 +04:00
Spike Curtis	393b3874ac	feat: add UpdateAppStatus to the workspace agent API (#22219 ) <!-- If you have used AI to produce some or all of this PR, please ensure you have read our [AI Contribution guidelines](https://coder.com/docs/about/contributing/AI_CONTRIBUTING) before submitting. --> part of https://github.com/coder/coder/issues/21335 This moves updating app status (used by Tasks) into the workspace agent API over dRPC. This will allow us to update the status without having to re-authenticate each time, like we would with an HTTP PATCH request. Further PRs in this stack will pipe these requests thru from the CLI MCP server to the agentsock and finally to this dRPC call to coderd.	2026-02-24 13:26:55 +04:00
Kacper Sawicki	f016d9e505	fix(coderd): add role param to agent RPC to prevent false connectivity (#22052 ) ## Summary coder-logstream-kube and other tools that use the agent token to connect to the RPC endpoint were incorrectly triggering connection monitoring, causing false connected/disconnected timestamps on the agent. This led to VSCode/JetBrains disconnections and incorrect dashboard status. ## Changes Add a `role` query parameter to `/api/v2/workspaceagents/me/rpc`: - `role=agent`: triggers connection monitoring (default for the agent SDK) - any other value (e.g. `logstream-kube`): skips connection monitoring - omitted: triggers monitoring for backward compatibility with older agents The agent SDK now sends `role=agent` by default. A new `Role` field on the `agentsdk.Client` allows non-agent callers to specify a different role. ## Required follow-up coder-logstream-kube needs to set `client.Role = "logstream-kube"` before calling `ConnectRPC20()`. Without that change, it will still send `role=agent` and trigger monitoring. Fixes #21625	2026-02-18 09:44:06 +01:00
Danielle Maywood	2de8cdf160	feat(agent): add subagent ID fields to devcontainers in manifest (#21848 ) Update the agent protobuf schema (agent/proto/agent.proto) to include: - subagent_id field in WorkspaceAgentDevcontainer message - id field in CreateSubAgentRequest message Bump the Agent API version from v2.7 to v2.8 and update all client references throughout the codebase (ConnectRPC27 -> ConnectRPC28, DRPCAgentClient27 -> DRPCAgentClient28).	2026-02-03 12:37:30 +00:00
Spike Curtis	bddb808b25	chore: arrange imports in a standard way (#21452 ) Fixes all our Go file imports to match the preferred spec that we've _mostly_ been using. For example: ``` import ( "context" "time" "github.com/prometheus/client_golang/prometheus" "golang.org/x/xerrors" "gopkg.in/natefinch/lumberjack.v2" "cdr.dev/slog/v3" "github.com/coder/coder/v2/codersdk/agentsdk" "github.com/coder/serpent" ) ``` 3 groups: standard library, 3rd partly libs, Coder libs. This PR makes the change across the codebase. The PR in the stack above modifies our formatting to maintain this state of affairs, and is a separate PR so it's possible to review that one in detail.	2026-01-08 15:24:11 +04:00
Spike Curtis	49b34a716a	fix: fix slog to always use array of Fields (#21426 ) Upgrades to slog v3 which includes a small, but backward incompatible API change to the acceptible call arguments when logging. This change allows us to verify via compile time type checking that arguments are correct and won't cause a panic, as was possible in slog v1, which this replaces (v2 was tagged but never used in coder/coder). It also updates dependencies that also use slog and were updated. I've left the `aibridge` dependency as a commit SHA, under the assumption that the team there (cc @pawbana @dannykopping ) will tag and update the dependency soon and on their own schedule. Other dependencies, I pushed new tags.	2026-01-08 10:29:41 +04:00
Zach	9d1493a13a	feat: add initial API for boundary log forwarding to coderd (#21293 ) Add the AgentAPI changes to support the feature that transmits boundary logs from workspaces to coderd via the agent API for eventual re-emission to stderr. The API handlers are stubs for now because I'm trying to land this feature from multiple smaller PRs. High level architecture: - Boundary records resource access in batches and sends proto message to agent - Agent proxies messages to coderd (captured by the API changes in this PR) - coderd re-emits logs to stderr RFC: https://www.notion.so/coderhq/Agent-Boundary-Logs-2afd579be59280f29629fc9823ac41ba	2025-12-19 10:41:39 -07:00
Spike Curtis	1354d84eb4	chore: refactor instance identity to be a SessionTokenProvider (#19566 ) Refactors Agent instance identity to be a SessionTokenProvider. Refactors the CLI to create Agent clients via a centralized function, rather than add-hoc via individual command handlers and their flags. This allows commands besides `coder agent`, but which still use the agent identity, to support instance identity authentication. Fixes #19111 by unifying all API requests to go thru the SessionTokenProvider for auth credentials.	2025-09-03 10:38:42 +04:00
Danielle Maywood	529fb5083c	feat(agent/agentcontainers): support apps for dev container agents (#18346 ) Add apps to the sub agent based on the dev container customization. The implementation also provides the following env variables for use in the devcontainer json - `CODER_WORKSPACE_AGENT_NAME` - `CODER_WORKSPACE_USER_NAME` - `CODER_WORKSPACE_NAME` - `CODER_DEPLOYMENT_URL`	2025-06-18 14:55:27 +01:00
Danielle Maywood	dd150264bc	feat(agent/agentcontainers): support displayApps from devcontainer config (#18342 ) Updates the agent injection routine to read the dev container's configuration so we can add display apps to the sub agent.	2025-06-12 23:36:23 +01:00
Mathias Fredriksson	fca99174ad	feat(agent/agentcontainers): implement sub agent injection (#18245 ) This change adds support for sub agent creation and injection into dev containers. Updates coder/internal#621	2025-06-10 12:37:54 +03:00
Danielle Maywood	b712d0b23f	feat(coderd/agentapi): implement sub agent api (#17823 ) Closes https://github.com/coder/internal/issues/619 Implement the `coderd` side of the AgentAPI for the upcoming dev-container agents work. `agent/agenttest/client.go` is left unimplemented for a future PR working to implement the agent side of this feature.	2025-05-29 12:15:47 +01:00
Danielle Maywood	61f22a59ba	feat(agent): add `ParentId` to agent manifest (#17888 ) Closes https://github.com/coder/internal/issues/648 This change introduces a new `ParentId` field to the agent's manifest. This will allow an agent to know if it is a child or not, as well as knowing who the owner is. This is part of the Dev Container Agents work	2025-05-19 16:09:56 +01:00
Steven Masley	64807e1d61	chore: apply the 4mb max limit on drpc protocol message size (#17771 ) Respect the 4mb max limit on proto messages	2025-05-13 11:24:51 -05:00
Steven Masley	37832413ba	chore: resolve internal drpc package conflict (#17770 ) Our internal drpc package name conflicts with the external one in usage. `drpc.` == external `drpcsdk.` == internal	2025-05-12 10:31:38 -05:00
Eng Zer Jun	04c33968cf	refactor: replace `golang.org/x/exp/slices` with `slices` (#16772 ) The experimental functions in `golang.org/x/exp/slices` are now available in the standard library since Go 1.21. Reference: https://go.dev/doc/go1.21#slices Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2025-03-04 00:46:49 +11:00
Mathias Fredriksson	4ba5a8a2ba	feat(agent): add connection reporting for SSH and reconnecting PTY (#16652 ) Updates #15139	2025-02-27 10:45:45 +00:00
Mathias Fredriksson	b07b33ec9d	feat: add agentapi endpoint to report connections for audit (#16507 ) This change adds a new `ReportConnection` endpoint to the `agentapi`. The protocol version was bumped previously, so it has been omitted here. This allows the agent to report connection events, for example when the user connects to the workspace via SSH or VS Code. Updates #15139	2025-02-20 14:52:01 +02:00
Vincent Vielle	bc609d0056	feat: integrate agentAPI with resources monitoring logic (#16438 ) As part of the new resources monitoring logic - more specifically for OOM & OOD Notifications , we need to update the AgentAPI , and the agents logic. This PR aims to do it, and more specifically : We are updating the AgentAPI & TailnetAPI to version 24 to add two new methods in the AgentAPI : - One method to fetch the resources monitoring configuration - One method to push the datapoints for the resources monitoring. Also, this PR adds a new logic on the agent side, with a routine running and ticking - fetching the resources usage each time , but also storing it in a FIFO like queue. Finally, this PR fixes a problem we had with RBAC logic on the resources monitoring model, applying the same logic than we have for similar entities.	2025-02-14 10:28:15 +01:00
Spike Curtis	5861e516b9	chore: add standard test logger ignoring db canceled (#15556 ) Refactors our use of `slogtest` to instantiate a "standard logger" across most of our tests. This standard logger incorporates https://github.com/coder/slog/pull/217 to also ignore database query canceled errors by default, which are a source of low-severity flakes. Any test that has set non-default `slogtest.Options` is left alone. In particular, `coderdtest` defaults to ignoring all errors. We might consider revisiting that decision now that we have better tools to target the really common flaky Error logs on shutdown.	2024-11-18 14:09:22 +04:00
Spike Curtis	40802958e9	fix: use explicit api versions for agent and tailnet (#15508 ) Bumps the Tailnet and Agent API version 2.3, and creates some extra controls and machinery around these versions. What happened is that we accidentally shipped two new API features without bumping the version. `ScriptCompleted` on the Agent API in Coder v2.16 and `RefreshResumeToken` on the Tailnet API in Coder v2.15. Since we can't easily retroactively bump the versions, we'll roll these changes into API version 2.3 along with the new WorkspaceUpdates RPC, which hasn't been released yet. That means there is some ambiguity in Coder v2.15-v2.17 about exactly what methods are supported on the Tailnet and Agent APIs. This isn't great, but hasn't caused us major issues because 1. RefreshResumeToken is considered optional, and clients just log and move on if the RPC isn't supported. 2. Agents basically never get started talking to a Coderd that is older than they are, since the agent binary is normally downloaded from Coderd at workspace start. Still it's good to get things squared away in terms of versions for SDK users and possible edge cases around client and server versions. To mitigate against this thing happening again, this PR also: 1. adds a CODEOWNERS for the API proto packages, so I'll review changes 2. defines interface types for different API versions, and has the agent explicitly use a specific version. That way, if you add a new method, and try to use it in the agent without thinking explicitly about versions, it won't compile. With the protocol controllers stuff, we've sort of already abstracted the Tailnet API such that the interface type strategy won't work, but I'll work on getting the Controller to be version aware, such that it can check the API version it's getting against the controllers it has -- in a later PR.	2024-11-15 11:16:28 +04:00
Spike Curtis	886dcbec84	chore: refactor coordination (#15343 ) Refactors the way clients of the Tailnet API (clients of the API, which include both workspace "agents" and "clients") interact with the API. Introduces the idea of abstract "controllers" for each of the RPCs in the API, and implements a Coordination controller by refactoring from `workspacesdk`. chore re: #14729	2024-11-05 13:50:10 +04:00
Danielle Maywood	ae522c558d	feat: add agent timings (#14713 ) * feat: begin impl of agent script timings * feat: add job_id and display_name to script timings * fix: increment migration number * fix: rename migrations from 251 to 254 * test: get tests compiling * fix: appease the linter * fix: get tests passing again * fix: drop column from correct table * test: add fixture for agent script timings * fix: typo * fix: use job id used in provisioner job timings * fix: increment migration number * test: behaviour of script runner * test: rewrite test * test: does exit 1 script break things? * test: rewrite test again * fix: revert change Not sure how this came to be, I do not recall manually changing these files. * fix: let code breathe * fix: wrap errors * fix: justify nolint * fix: swap require.Equal argument order * fix: add mutex operations * feat: add 'ran_on_start' and 'blocked_login' fields * fix: update testdata fixture * fix: refer to agent_id instead of job_id in timings * fix: JobID -> AgentID in dbauthz_test * fix: add 'id' to scripts, make timing refer to script id * fix: fix broken tests and convert bug * fix: update testdata fixtures * fix: update testdata fixtures again * feat: capture stage and if script timed out * fix: update migration number * test: add test for script api * fix: fake db query * fix: use UTC time * fix: ensure r.scriptComplete is not nil * fix: move err check to right after call * fix: uppercase sql * fix: use dbtime.Now() * fix: debug log on r.scriptCompleted being nil * fix: ensure correct rbac permissions * chore: remove DisplayName * fix: get tests passing * fix: remove space in sql up * docs: document ExecuteOption * fix: drop 'RETURNING' from sql * chore: remove 'display_name' from timing table * fix: testdata fixture * fix: put r.scriptCompleted call in goroutine * fix: track goroutine for test + use separate context for reporting * fix: appease linter, handle trackCommandGoroutine error * fix: resolve race condition * feat: replace timed_out column with status column * test: update testdata fixture * fix: apply suggestions from review * revert: linter changes	2024-09-24 10:51:49 +01:00
Spike Curtis	70c5c47efd	fix: stop blocking fake Agent API channel writes after context expires (#13908 )	2024-07-16 23:22:13 +04:00
Kayla Washburn-Love	b248f125e1	chore: rename notification banners to announcement banners (#13419 )	2024-05-31 10:59:28 -06:00
Kayla Washburn-Love	d8e0be6ee6	feat: add support for multiple banners (#13081 )	2024-05-08 15:40:43 -06:00
Colin Adler	e5d911462f	fix(tailnet): enforce valid agent and client addresses (#12197 ) This adds the ability for `TunnelAuth` to also authorize incoming wireguard node IPs, preventing agents from reporting anything other than their static IP generated from the agent ID.	2024-03-01 09:02:33 -06:00
Spike Curtis	b0afffbafb	feat: use v2 API for agent metadata updates (#12281 ) Switches the agent to report metadata over the v2 API. Fixes #10534	2024-02-26 09:50:19 +04:00
Spike Curtis	aa7a9f5cc4	feat: use v2 API for agent lifecycle updates (#12278 ) Agent uses the v2 API to post lifecycle updates. Part of #10534	2024-02-23 15:24:28 +04:00
Spike Curtis	4cc132cea0	feat: switch agent to use v2 API for sending logs (#12068 ) Changes the agent to use the new v2 API for sending logs, via the logSender component. We keep the PatchLogs function around, but deprecate it so that we can test the v1 endpoint.	2024-02-23 11:27:15 +04:00
Dean Sheather	2fc3064653	chore: add tests for app ID copy in app healths (#12088 )	2024-02-12 05:49:48 +00:00
Spike Curtis	1cf4b62867	feat: change agent to use v2 API for reporting stats (#12024 ) Modifies the agent to use the v2 API to report its statistics, using the `statsReporter` subcomponent.	2024-02-07 15:26:41 +04:00
Spike Curtis	1aa117b9ec	chore: rename client Listen to ConnectRPC (#11916 ) ConnectRPC seems more appropriate for this function	2024-02-01 14:44:11 +04:00
Spike Curtis	0fc177203e	feat: use agent v2 API to update app health (#11889 ) Use the Agent v2 API to update App Health	2024-01-30 11:35:12 +04:00
Spike Curtis	2599850e54	feat: use agent v2 API to post startup (#11877 ) Uses the v2 Agent API to post startup information.	2024-01-30 11:23:28 +04:00
Spike Curtis	da8bb1c198	feat: use agent v2 API to fetch manifest (#11832 ) Agent uses the v2 API to obtain the manifest, instead of the HTTP API.	2024-01-30 10:11:28 +04:00
Spike Curtis	0eff646c31	chore: move proto to sdk conversion to agentsdk (#11831 ) `agentsdk` depends on `agent/proto` because it needs to get the version to dial. Therefore, the conversion routines need to live in `agentsdk` so that we can convert to and from the Manifest. I briefly considered refactoring the agent to only reference `proto.Manifest`, but decided against it because we might have multiple protocol versions in the future, its useful to have a protocol-independent data structure.	2024-01-30 09:04:56 +04:00
Spike Curtis	13e24f21e4	feat: use Agent v2 API for Service Banner (#11806 ) Agent uses the v2 API for the service banner, rather than the v1 HTTP API. One of several for #10534	2024-01-30 07:44:47 +04:00
Spike Curtis	059e533544	feat: agent uses Tailnet v2 API for DERPMap updates (#11698 ) Switches the Agent to use Tailnet v2 API to get DERPMap updates. Subsequent PRs will do the same for the CLI (`codersdk`) and `wsproxy`.	2024-01-23 14:42:07 +04:00
Spike Curtis	f01cab9894	feat: use tailnet v2 API for coordination (#11638 ) This one is huge, and I'm sorry. The problem is that once I change `tailnet.Conn` to start doing v2 behavior, I kind of have to change it everywhere, including in CoderSDK (CLI), the agent, wsproxy, and ServerTailnet. There is still a bit more cleanup to do, and I need to add code so that when we lose connection to the Coordinator, we mark all peers as LOST, but that will be in a separate PR since this is big enough!	2024-01-22 11:07:50 +04:00
Spike Curtis	ad3fed72bc	chore: rename Coordinator to CoordinatorV1 (#11222 ) Renames the tailnet.Coordinator to represent both v1 and v2 APIs, so that we can use this interface for the main atomic pointer. Part of #10532	2023-12-15 11:38:12 +04:00
Mathias Fredriksson	4857d4bd55	feat(codersdk/agentsdk): use new agent metadata batch endpoint (#10224 ) Part of #9782	2023-10-13 17:32:28 +03:00
Mathias Fredriksson	7eeba15d16	feat(coderd): add support for sending batched agent metadata (#10223 ) Part of #9782	2023-10-13 16:37:55 +03:00
Cian Johnston	93ef696b57	refactor(agent): add agenttest.New helper function (#9812 ) * Adds agenttest.New() helper function * Makes sure agent gets closed on test cleanup * Makes sure you don't forget to set session token * Sets the agent and client logger automatically	2023-09-26 12:05:19 +01:00
Kyle Carberry	22e781eced	chore: add /v2 to import module path (#9072 ) * chore: add /v2 to import module path go mod requires semantic versioning with versions greater than 1.x This was a mechanical update by running: ``` go install github.com/marwan-at-work/mod/cmd/mod@latest mod upgrade ``` Migrate generated files to import /v2 * Fix gen	2023-08-18 18:55:43 +00:00
Kyle Carberry	bd944e0d21	chore: rename startup logs to agent logs (#8649 ) * chore: rename startup logs to agent logs This also adds a `source` property to every agent log. It should allow us to group logs and display them nicer in the UI as they stream in. * Fix migration order * Fix naming * Rename the frontend * Fix tests * Fix down migration * Match enums for workspace agent logs * Fix inserting log source * Fix migration order * Fix logs tests * Fix psql insert	2023-07-28 15:57:23 +00:00
Dean Sheather	2f0a9996e7	chore: add derpserver to wsproxy, add proxies to derpmap (#7311 )	2023-07-27 02:21:04 +10:00
Colin Adler	c8d65de4b7	test(agent): fix `TestAgent_Metadata/Once` flake (#8613 )	2023-07-20 18:49:44 +00:00
Colin Adler	c47b78c44b	chore: replace wsconncache with a single tailnet (#8176 )	2023-07-12 17:37:31 -05:00

49 Commits