coder

mirror of https://github.com/coder/coder.git synced 2026-06-02 20:48:20 +00:00

Author	SHA1	Message	Date
Atif Ali	bd5b62c976	feat: expose MCP tool annotations for tool grouping (#23195 ) ## Summary - add shared MCP annotation metadata to toolsdk tools - emit MCP tool annotations from both coderd and CLI MCP servers - cover annotation serialization in toolsdk, coderd MCP e2e, and CLI MCP tests ## Why - Coder already exposed MCP tools, but it did not populate MCP tool annotation hints (`readOnlyHint`, `destructiveHint`, `idempotentHint`, `openWorldHint`). - Hosts such as Claude Desktop use those hints to classify and group tools, so without them Coder tools can get lumped together. - This change adds a shared annotation source in `toolsdk` and has both MCP servers emit those hints through `mcp.Tool.Annotations`, avoiding drift between local and remote MCP implementations. ## Testing - Tested locally on Cladue Desktop and the tools are categorized correctly. <table> <tr> <td> Before <td> After <tr> <td> <img width="613" height="183" alt="image" src="https://github.com/user-attachments/assets/29d2e3fb-53bc-4ea7-bdb3-f10df4ef996b" /> <td> <img width="600" height="457" alt="image" src="https://github.com/user-attachments/assets/cc384036-c9a7-4db9-9400-43ad51920ff5" /> </table> Note: Done using Coder Agents, reviewed and tested by human locally	2026-03-18 10:21:45 +00:00
Spike Curtis	1a30ca1a2a	chore: use agentsocket for task status updates in MCP server (#22354 ) relates to #21335 Modifies our local MCP server used in Tasks to push task status updates over the agentsocket, rather than directly dialing Coderd. This will significantly reduce pressure on the database at scale because we can avoid expensive authentication of the agent API key. Disclosure: I used AI to generate a lot of this PR, but hand-reviewed and tweaked it.	2026-03-04 21:41:21 +04:00
Mathias Fredriksson	947b390c5a	fix: allow agent-reported final states, add SSE reconnection (#22286 ) When AgentAPI is configured, `WithTaskReporter` unconditionally overrides all self-reported states to `working`. The intent was to distrust the agent's `idle` and rely on the screen watcher, but the override also blocks `failure` and `complete`, which only the agent can produce (the screen watcher only knows `running`/`stable`). Tasks get stuck as `working` or `null` forever. Now only `idle` is overridden to `working`; `failure`, `complete`, and `working` pass through as-is. Also: - Remove misplaced unconditional `"Failed to watch screen events"` log that fired on every startup - Add SSE reconnection with exponential backoff (1s-30s) in `startWatcher` so it recovers from dropped connections instead of dying silently - Add `complete` to the `coder_report_task` tool enum, which the `coder/claude-code` registry module already instructs agents to use but was missing from the schema Refs coder/internal#1350	2026-02-24 20:28:50 +02:00
Spike Curtis	606ae897b7	chore: refactor to directly create Client in Command Handlers (#19760 ) Refactors the CLI to create the `*codersdk.Client` in the handlers. This is groundwork for changing the `rootCmd.InitClient()` to use the new `ClientOption`s. It also improves variable locality, scoping the Client to the handler. This makes misuse less likely and reduces the memory allocations to just the command being executed, rather than allocating a Client for every command regardless of whether it is executed.	2025-09-22 17:14:07 +04:00
Spike Curtis	18945a7949	chore: refactor CLI agent auth tests as unit tests (#19609 ) Fixes https://github.com/coder/internal/issues/933 Refactors CLI tests that check the `--auth` flag parsing for various public clouds into a unit test that just creates the agent Client and asserts on the type. Testing that the agent client actually authenticates correctly with these auth types is well covered by Coderd tests, so we don't need to retread that ground here, and the deleted tests were flaky on Windows.	2025-09-03 10:49:19 +04:00
Spike Curtis	1354d84eb4	chore: refactor instance identity to be a SessionTokenProvider (#19566 ) Refactors Agent instance identity to be a SessionTokenProvider. Refactors the CLI to create Agent clients via a centralized function, rather than add-hoc via individual command handlers and their flags. This allows commands besides `coder agent`, but which still use the agent identity, to support instance identity authentication. Fixes #19111 by unifying all API requests to go thru the SessionTokenProvider for auth credentials.	2025-09-03 10:38:42 +04:00
Marcin Tojek	e98dce7f99	fix: mute Claude API key warning if Bedrock in use (#18988 ) Fixes: https://github.com/coder/coder/issues/17402	2025-07-22 13:56:20 +02:00
Asher	fc7700a62f	fix: improve reliability of app statuses (#18622 ) We were discarding all "working" updates from the screen watcher because we cannot tell the difference between the agent or user changing the screen, but it makes sense to accept it as the very first update, because the agent could be working but neglected to report that fact, so you would never get an initial "working" update (it would just eventually go straight to "idle"). Also messages can start at zero, so I made a fix for that as well, although the first message will be from the LLM and we ignore those anyway, so this probably has no actual effect, but seems more technically correct. And it seems I forgot to actually update the last message ID, which also does not actually matter for user messages (since I think the SSE endpoint will not re-emit a user message it has already emitted), but seems more technically correct to check. Lastly, if we have the screen watcher, ignore the agent's self-reported state and always use "working" since it is unreliable. The idle state will eventually be caught by the watcher.	2025-06-30 12:12:20 -08:00
Asher	0a483ea2b7	feat: add idle app status (#18415 ) "Idle" is more accurate than "complete" since: 1. AgentAPI only knows if the screen is active; it has no way of knowing if the task is complete. 2. The LLM might be done with its current prompt, but that does not mean the task is complete either (it likely needs refinement). The "complete" state will be reserved for future definition. Additionally, in the case where the screen goes idle but the LLM never reported a status update, we can get an idle icon without a message, and it looks kinda janky in the UI so if there is no message I display the state text. Closes https://github.com/coder/internal/issues/699	2025-06-20 14:34:31 -08:00
Asher	4bd5609e13	feat: add status watcher to MCP server (#18320 ) This is meant to complement the existing task reporter since the LLM does not call it reliably. It also includes refactoring to use the common agent flags/env vars.	2025-06-13 12:53:43 -08:00
Kyle Carberry	bedeb4710b	fix: improve task reporting tool description (#18119 ) In my (albeit subjective) testing, this dramatically improved the reporting ability - both in frequency and accuracy.	2025-05-30 00:00:12 +00:00
Thomas Kosiewski	b551a062d7	fix: correct environment variable name for MCP app status slug (#17948 ) Fixed environment variable name for app status slug in Claude MCP configuration from `CODER_MCP_CLAUDE_APP_STATUS_SLUG` to `CODER_MCP_APP_STATUS_SLUG` to maintain consistency with other MCP environment variables. This also caused the User level Claude.md to not contain instructions to report its progress, so it did not receive status reports.	2025-05-20 19:35:19 +02:00
Thomas Kosiewski	29bce8d9e6	feat(cli): make MCP server work without user authentication (#17688 ) Part of #17649 --- # Allow MCP server to run without authentication This PR enhances the MCP server to operate without requiring authentication, making it more flexible for environments where authentication isn't available or necessary. Key changes: - Replaced `InitClient` with `TryInitClient` to allow the MCP server to start without credentials - Added graceful handling when URL or authentication is missing - Made authentication status visible in server logs - Added logic to skip user-dependent tools when no authenticated user is present - Made the `coder_report_task` tool available with just an agent token (no user token required) - Added comprehensive tests to verify operation without authentication These changes allow the MCP server to function in more environments while still using authentication when available, improving flexibility for CI/CD and other automated environments.	2025-05-07 21:53:06 +02:00
Cian Johnston	2acf0adcf2	chore(codersdk/toolsdk): improve static analyzability of toolsdk.Tools (#17562 ) * Refactors toolsdk.Tools to remove opaque `map[string]any` argument in favour of typed args structs. * Refactors toolsdk.Tools to remove opaque passing of dependencies via `context.Context` in favour of a tool dependencies struct. * Adds panic recovery and clean context middleware to all tools. * Adds `GenericTool` implementation to allow keeping `toolsdk.All` with uniform type signature while maintaining type information in handlers. * Adds stricter checks to `patchWorkspaceAgentAppStatus` handler.	2025-04-29 16:05:23 +01:00
Cian Johnston	22b932a8e0	fix(cli): fix prompt issue in mcp configure claude-code (#17599 ) * Updates default Coder prompt. * Skips the directions to report tasks if the pre-requisites are not available (agent token and app slug). * Adds the capability to override the default Coder prompt via `CODER_MCP_CLAUDE_CODER_PROMPT`.	2025-04-29 15:23:16 +01:00
Cian Johnston	2d2c9bda98	fix(cli): correct logic around CODER_MCP_APP_STATUS_SLUG (#17391 ) Past me was not smart.	2025-04-14 16:24:02 +01:00
Cian Johnston	7b0422b49b	fix(codersdk/toolsdk): fix tool schemata (#17365 ) Fixes two issues with the MCP server: - Ensures we have a non-null schema, as the following schema was making claude-code unhappy: ``` "inputSchema": { "type": "object", "properties": null }, ``` - Skip adding the coder_report_task tool if an agent client is not available. Otherwise the agent may try to report tasks and get confused.	2025-04-11 18:58:17 +01:00
Cian Johnston	1235550637	feat(codersdk): add toolsdk and replace existing mcp server tool impl (#17343 ) - Refactors existing `mcp` package to use `kylecarbs/aisdk-go` and moves to `codersdk/toolsdk` package. - Updates existing MCP server implementation to use `codersdk/toolsdk` Co-authored-by: Kyle Carberry <kyle@coder.com>	2025-04-11 10:24:45 +01:00
Cian Johnston	4aa45a5c43	fix(cli): modify `exp mcp configure` to also read claude API key from CLAUDE_API_KEY env (#17229 ) Currently you have to set `CODER_MCP_CLAUDE_API_KEY`, which can be obnoxious.	2025-04-03 09:45:17 +01:00
Cian Johnston	88bae05223	feat(cli): implement exp mcp configure claude-code command (#17195 ) Updates `~/.claude.json` and `~/.claude/CLAUDE.md` with required settings for agentic usage.	2025-04-01 20:06:42 +01:00
Cian Johnston	27d2343adf	fix(cli): exp mcp: remove unnecessary cli flag (#17190 )	2025-04-01 16:53:18 +01:00
Cian Johnston	1e11e823c9	fix(mcp): report task status correctly (#17187 )	2025-04-01 15:02:08 +01:00
Cian Johnston	057cbd4d80	feat(cli): add `coder exp mcp` command (#17066 ) Adds a `coder exp mcp` command which will start a local MCP server listening on stdio with the following capabilities: * Show logged in user (`coder whoami`) * List workspaces (`coder list`) * List templates (`coder templates list`) * Start a workspace (`coder start`) * Stop a workspace (`coder stop`) * Fetch a single workspace (no direct CLI analogue) * Execute a command inside a workspace (`coder exp rpty`) * Report the status of a task (currently a no-op, pending task support) This can be tested as follows: ``` # Start a local Coder server. ./scripts/develop.sh # Start a workspace. Currently, creating workspaces is not supported. ./scripts/coder-dev.sh create -t docker --yes # Add the MCP to your Claude config. claude mcp add coder ./scripts/coder-dev.sh exp mcp # Tell Claude to do something Coder-related. You may need to nudge it to use the tools. claude 'start a docker workspace and tell me what version of python is installed' ```	2025-03-31 18:52:09 +01:00

23 Commits