The regular network info file creation code also calls `MkdirAll`.
This wasn't picked up in manual testing, as I already had the `/net` folder from my VS Code setup.
It also wasn't picked up in automated testing, because we use an in-memory FS, which apparently creates parent directories implicitly.
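For reference, a minimal sketch of the fix (names and paths are illustrative, not the actual implementation): ensure the parent directory exists before writing, rather than assuming it does.

```go
// Sketch only: illustrative, not the actual implementation.
package netinfo

import (
	"os"
	"path/filepath"
)

func writeNetworkInfo(dir string, data []byte) error {
	// MkdirAll creates the directory and any missing parents, and is a
	// no-op if the directory already exists (which is why a pre-existing
	// /net folder masked the bug in manual testing).
	if err := os.MkdirAll(dir, 0o700); err != nil {
		return err
	}
	return os.WriteFile(filepath.Join(dir, "network-info.json"), data, 0o600)
}
```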
Closes https://github.com/coder/vscode-coder/issues/447
Closes https://github.com/coder/jetbrains-coder/issues/543
Closes https://github.com/coder/coder-jetbrains-toolbox/issues/21
This PR adds Coder Connect support to `coder ssh --stdio`.
When connecting to a workspace, if `--force-new-tunnel` is not passed, the CLI will first do a DNS lookup for `<agent>.<workspace>.<owner>.<hostname-suffix>`. If an IP address is returned, and it's within the Coder service prefix, the CLI will not create a new tailnet connection to the workspace; instead, it dials the SSH server running on port 22 of the workspace directly over TCP.
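A rough sketch of that detection logic, under assumptions (the function names and the prefix value below are invented for illustration; the real implementation lives in the CLI):

```go
// Sketch only: the service prefix below is a placeholder, not
// necessarily the real Coder Connect prefix.
package connect

import (
	"context"
	"net"
	"net/netip"
)

var coderServicePrefix = netip.MustParsePrefix("fd60:627a:a42b::/48")

// dialViaCoderConnect returns a direct TCP connection to the agent's
// SSH server if Coder Connect answers the DNS lookup, or (nil, false)
// so the caller can fall back to creating its own tailnet connection.
func dialViaCoderConnect(ctx context.Context, host string) (net.Conn, bool) {
	addrs, err := net.DefaultResolver.LookupNetIP(ctx, "ip6", host)
	if err != nil || len(addrs) == 0 || !coderServicePrefix.Contains(addrs[0]) {
		return nil, false
	}
	conn, err := (&net.Dialer{}).DialContext(ctx, "tcp",
		net.JoinHostPort(addrs[0].String(), "22"))
	if err != nil {
		return nil, false
	}
	return conn, true
}
```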
This allows IDE extensions to use the Coder Connect tunnel, without requiring any modifications to the extensions themselves.
Additionally, `using_coder_connect` is added to the `sshNetworkStats` file, which the VS Code extension (and maybe JetBrains?) will be able to read and use to indicate to the user that they are using Coder Connect.
One advantage of this approach is that running `coder ssh --stdio` on an offline workspace with Coder Connect enabled will have the CLI wait for the workspace to build and the agent to connect (and, optionally, for the startup scripts to finish) before finally connecting using the Coder Connect tunnel.
As a result, `coder ssh --stdio` has the overhead of looking up the workspace and agent, and checking if they are running. On my device, this meant `coder ssh --stdio <workspace>` was approximately a second slower than just connecting to the workspace directly using `ssh <workspace>.coder` (I would assume anyone serious about their Coder Connect usage would know to just do the latter anyway).
To ensure this doesn't come at a significant performance cost, I've also benchmarked this PR.
<details>
<summary>Benchmark</summary>
## Methodology
All tests were completed on `dev.coder.com`, with a Linux workspace running in AWS `us-west1`.
The machine running Coder Desktop (the 'client') was a Windows VM running in the same AWS region and VPC as the workspace.
To test the performance of the SSH connection specifically, a port was forwarded between the client and workspace using:
```
ssh -p 22 -L7001:localhost:7001 <host>
```
where `host` was either an alias for an SSH ProxyCommand that called `coder ssh`, or a Coder Connect hostname.
For latency, [`tcping`](https://www.elifulkerson.com/projects/tcping.php) was used against the forwarded port:
```
tcping -n 100 localhost 7001
```
For throughput, [`iperf3`](https://iperf.fr/iperf-download.php) was used:
```
iperf3 -c localhost -p 7001
```
where an `iperf3` server was running on the workspace on port 7001.
## Test Cases
### Testcase 1: `coder ssh` `ProxyCommand` that bicopies from Coder Connect
This case tests the implementation in this PR, such that we can write a config like:
```
Host codercliconnect
ProxyCommand /path/to/coder ssh --stdio workspace
```
With Coder Connect enabled, `ssh -p 22 -L7001:localhost:7001 codercliconnect` will use the Coder Connect tunnel. The results were as follows:
**Throughput, 10 tests, back to back:**
- Average throughput across all tests: 788.20 Mbits/sec
- Minimum average throughput: 731 Mbits/sec
- Maximum average throughput: 871 Mbits/sec
- Standard Deviation: 38.88 Mbits/sec
**Latency, 100 RTTs:**
- Average: 0.369ms
- Minimum: 0.290ms
- Maximum: 0.473ms
### Testcase 2: `ssh` dialing Coder Connect directly without a `ProxyCommand`
This is what we assume to be the 'best' way to use Coder Connect.
**Throughput, 10 tests, back to back:**
- Average throughput across all tests: 789.50 Mbits/sec
- Minimum average throughput: 708 Mbits/sec
- Maximum average throughput: 839 Mbits/sec
- Standard Deviation: 39.98 Mbits/sec
**Latency, 100 RTTs:**
- Average: 0.369ms
- Minimum: 0.267ms
- Maximum: 0.440ms
### Testcase 3: `coder ssh` `ProxyCommand` that creates its own Tailnet connection in-process
This is what normally happens when you run `coder ssh`.
**Throughput, 10 tests, back to back:**
- Average throughput across all tests: 610.20 Mbits/sec
- Minimum average throughput: 569 Mbits/sec
- Maximum average throughput: 664 Mbits/sec
- Standard Deviation: 27.29 Mbits/sec
**Latency, 100 RTTs:**
- Average: 0.335ms
- Minimum: 0.262ms
- Maximum: 0.452ms
## Analysis
Performing a two-tailed, unpaired t-test on the throughput of testcases 1 and 2, we find a P value of `0.9450`, so the difference between the data sets is not statistically significant: under the null hypothesis of no real difference, a difference at least this large would be expected about 94.5% of the time by chance alone.
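For reference, the unpaired (Welch) t statistic computed from the summary stats above works out to roughly:

$$
t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{s_1^2/n_1 + s_2^2/n_2}} = \frac{788.20 - 789.50}{\sqrt{38.88^2/10 + 39.98^2/10}} \approx -0.07
$$

With about 18 degrees of freedom, |t| ≈ 0.07 gives a two-tailed p of roughly 0.94, consistent with the reported `0.9450` (small rounding differences expected).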
## Conclusion
From the t-test, and by comparison to the status quo (regular `coder ssh`, which uses gvisor and is noticeably slower), I think it's safe to say any impact on throughput or latency from the `ProxyCommand` performing a bicopy against Coder Connect is negligible. Users are very unlikely to run into performance issues as a result of using Coder Connect via `coder ssh`, as implemented in this PR.
Less scientifically, I ran these same tests on my home network with my Sydney workspace, and both throughput and latency were consistent across testcases 1 and 2.
</details>
There were too many ways to configure the agentcontainers API, resulting
in inconsistent behavior or features not being enabled. This refactor
introduces a control flag for enabling or disabling the containers API.
When disabled, all implementations are no-op and explicit endpoint
behaviors are defined. When enabled, concrete implementations are used
by default but can be overridden by passing options.
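As a hypothetical illustration of the pattern (all names here are invented; the real API surface differs), the control flag picks between no-op and concrete implementations, while options override the defaults:

```go
// Sketch only: invented names illustrating the flag-plus-options pattern.
package agentcontainers

type API interface {
	List() ([]string, error)
}

type noopAPI struct{}

func (noopAPI) List() ([]string, error) { return nil, nil }

type dockerAPI struct{}

func (dockerAPI) List() ([]string, error) { return []string{"dev"}, nil }

type Option func(*config)

type config struct {
	impl API
}

// WithAPI overrides the concrete implementation, e.g. for tests.
func WithAPI(api API) Option {
	return func(c *config) { c.impl = api }
}

func New(enabled bool, opts ...Option) API {
	if !enabled {
		// Disabled: every implementation is a no-op, regardless of options.
		return noopAPI{}
	}
	c := &config{impl: dockerAPI{}} // concrete default
	for _, opt := range opts {
		opt(c)
	}
	return c.impl
}
```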
Adds `hostname-suffix` flag to `coder ssh` command for use in SSH Config ProxyCommands.
Also enforces on the Coder server that the suffix doesn't start with a dot.
part of: #16828
Fixes an issue where old SSH configs that use the `owner--workspace--agent` format would fail to work with the `coder ssh` command, since we've migrated off the `coder vscodessh` command.
- Update go.mod to use Go 1.24.1
- Update GitHub Actions setup-go action to use Go 1.24.1
- Fix linting issues with golangci-lint by:
- Updating to golangci-lint v1.57.1 (more compatible with Go 1.24.1)
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <noreply@anthropic.com>
---------
Co-authored-by: Claude <claude@anthropic.com>
This adds a flag matching `--ssh-host-prefix` from `coder config-ssh` to
`coder ssh`. By trimming a custom prefix from the argument, we can set
up wildcard-based `Host` entries in SSH config for the IDE plugins (and
eventually `coder config-ssh`).
We also replace `--` in the argument with `/`, so ownership can be
specified in wildcard-based SSH hosts like `<owner>--<workspace>`.
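A hedged sketch of that hostname handling (the flag value and helper name are illustrative):

```go
// Sketch only: illustrative parsing of a prefixed SSH hostname.
package main

import (
	"fmt"
	"strings"
)

// parseHost turns "coder.alice--devbox" into "alice/devbox", assuming
// a configured host prefix of "coder.".
func parseHost(prefix, host string) string {
	host = strings.TrimPrefix(host, prefix)
	// "--" stands in for "/" because "/" isn't usable in an SSH
	// wildcard Host pattern.
	return strings.ReplaceAll(host, "--", "/")
}

func main() {
	fmt.Println(parseHost("coder.", "coder.alice--devbox")) // alice/devbox
}
```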
Replaces #16087.
Part of https://github.com/coder/coder/issues/14986.
Related to https://github.com/coder/coder/pull/16078 and
https://github.com/coder/coder/pull/16080.
Part of bringing `coder ssh` to parity with `coder vscodessh` is
associating the log files with a particular parent process (in this
case, the ssh process that spawned the coder CLI via `ProxyCommand`).
`coder vscodessh` named log files using the parent PID, but `coder ssh` is missing this. Add the parent PID to the log file name when used in stdio mode so that the VS Code extension will be able to identify the correct log file.
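A minimal sketch of the naming (the file name format here is an assumption for illustration):

```go
// Sketch only: illustrative log file naming by parent PID.
package sshlog

import (
	"fmt"
	"os"
	"path/filepath"
)

func logFilePath(dir string) string {
	// os.Getppid() returns the PID of the ssh process that spawned the
	// CLI via ProxyCommand, which is what the extension matches on.
	return filepath.Join(dir, fmt.Sprintf("coder-ssh-%d.log", os.Getppid()))
}
```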
See also #16078.
This is the first in a series of PRs to enable `coder ssh` to replace
`coder vscodessh`.
This change adds `--network-info-dir` and `--network-info-interval`
flags to the `ssh` subcommand. These were formerly only available with
the `vscodessh` subcommand.
Subsequent PRs will add a `--ssh-host-prefix` flag to the ssh
subcommand, and adjust the log file naming to contain the parent PID.
Refactor autobuild/notify and tests to use the clock testing library.
I also rewrote some of the comments because I didn't understand them when I was looking at the package.
Currently, importing `codersdk` just to interact with the API requires importing tailscale, which causes builds to fail unless the consumer manually uses our fork.
* fix: separate signals for passive, active, and forced shutdown
`SIGTERM`: Passive shutdown; stop provisioner daemons from accepting new jobs, but wait for existing jobs to complete successfully.
`SIGINT` (old existing behavior): Notify provisioner daemons to cancel in-flight jobs, wait 5s for jobs to exit, then force quit (sketched after this list).
`SIGKILL`: Untouched from before, will force-quit.
* Revert dramatic signal changes
* Rename
* Fix shutdown behavior for provisioner daemons
* Add test for graceful shutdown
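A rough sketch of the two-stage handling described above (the job-control helpers are placeholders, not the real daemon plumbing):

```go
// Sketch only: SIGTERM drains gracefully, SIGINT cancels with a 5s cap.
package main

import (
	"os"
	"os/signal"
	"syscall"
	"time"
)

func main() {
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGTERM, syscall.SIGINT)

	switch <-sigs {
	case syscall.SIGTERM:
		stopAcceptingJobs() // passive: no new jobs, let in-flight finish
		<-jobsDone()
	case syscall.SIGINT:
		cancelInFlightJobs() // active: cancel, then force quit after 5s
		select {
		case <-jobsDone():
		case <-time.After(5 * time.Second):
		}
	}
}

// Placeholder helpers standing in for the real provisioner plumbing.
func stopAcceptingJobs()  {}
func cancelInFlightJobs() {}
func jobsDone() <-chan struct{} {
	ch := make(chan struct{})
	close(ch)
	return ch
}
```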
I noticed in my logs that sometimes `coder ssh` doesn't gracefully disconnect from the coordinator.
The cause is the `closerStack` construct we use in that function. It has two paths to start closing things down:
1. explicit `close()` which we do in `defer`
2. context cancellation, which happens if the cli function returns an error
Sometimes the SSH remote command returns an error, and this triggers context cancellation of the `closerStack`. That is fine in and of itself, but we still want the explicit `close()` to wait until everything is closed before returning, since that's where we do cleanup, including the graceful disconnect. Prior to this fix, `close()` would return immediately if another goroutine was already closing the stack. Here we add a wait until everything is done.
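A minimal sketch of the idea (illustrative, not the actual `closerStack` code): a second caller of `close()` blocks until the first finishes cleanup, instead of returning immediately.

```go
// Sketch only: wait for an in-flight close instead of returning early.
package sshstack

import (
	"io"
	"sync"
)

type closerStack struct {
	mu      sync.Mutex
	closing bool
	done    chan struct{} // closed once cleanup has fully finished
	closers []io.Closer
}

func newCloserStack() *closerStack {
	return &closerStack{done: make(chan struct{})}
}

func (s *closerStack) close() {
	s.mu.Lock()
	if s.closing {
		s.mu.Unlock()
		// Another goroutine (e.g. via context cancellation) is already
		// closing; wait for it to finish rather than returning early.
		<-s.done
		return
	}
	s.closing = true
	s.mu.Unlock()

	for i := len(s.closers) - 1; i >= 0; i-- { // LIFO, like defer
		_ = s.closers[i].Close()
	}
	close(s.done)
}
```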
Beginnings of a solution to #12297
Doesn't cover disco or definitively display whether we successfully connected to DERP, but shows some checklist diagnostics for connecting to an agent.
For this first PR, I just added it to `coder ping` to see how we like it, but could be incorporated into `coder ssh` _et al._ after a timeout.
```
$ coder ping dogfood2
p2p connection established in 147ms
pong from dogfood2 p2p via 95.217.xxx.yyy:42631 in 147ms
pong from dogfood2 p2p via 95.217.xxx.yyy:42631 in 140ms
pong from dogfood2 p2p via 95.217.xxx.yyy:42631 in 140ms
✔ preferred DERP region 999 (Council Bluffs, Iowa)
✔ sent local data to Coder networking coordinator
✔ received remote agent data from Coder networking coordinator
preferred DERP 10013 (Europe Fly.io (Paris))
endpoints: 95.217.xxx.yyy:42631, 95.217.xxx.yyy:37576, 172.17.0.1:37576, 172.20.0.10:37576
✔ Wireguard handshake 11s ago
```
Man, graceful shutdown is hard. Even after my changes, we were still hitting a graceful shutdown race: https://github.com/coder/coder/runs/18886842123
The problem was that while we attempt a graceful shutdown at the SSH layer by closing the session for writing, we were not giving it a chance to complete before continuing to tear down the stack of closers, including one that closes the netstack, and thus we dropped the TCP connection before it closed.
Re-enables TestSSH/RemoteForward_Unix_Signal and addresses the underlying race: we were not closing the remote forward on context expiry, only the session and connection.
However, there is still a more fundamental issue: we don't have the ability to ensure that TCP sessions are properly terminated before tearing down the tailnet conn. This is due to an assumption in the sockets API that the underlying IP interface is long-lived compared with the TCP socket, so closing a socket returns immediately and does not wait for the TCP termination handshake; that is handled asynchronously in the tcpip stack. This assumption does not hold for us and tailnet, since on shutdown we also tear down the tailnet connection, and this can race with the TCP termination.
Closing the remote forward explicitly should prevent forward state from accumulating, since the Close() function waits for a reply from the remote SSH server.
I've also attempted to work around the TCP/tailnet issue for `--stdio` by using `CloseWrite()` instead of `Close()`. Closing the write side half-closes the TCP connection; the server detects this and closes the other direction, which then triggers our read loop to exit only after the server has had a chance to process the close.
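A hedged sketch of that half-close pattern (simplified; the real code is wired into the stdio path):

```go
// Sketch only: half-close TCP and wait for the peer before closing.
package sshclose

import (
	"io"
	"net"
)

func gracefulClose(conn *net.TCPConn) error {
	// Send our FIN but keep the read side open.
	if err := conn.CloseWrite(); err != nil {
		return err
	}
	// Drain until the server processes the close and sends its own FIN;
	// io.Copy returns once we read EOF.
	if _, err := io.Copy(io.Discard, conn); err != nil {
		return err
	}
	return conn.Close()
}
```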
TODO in a stacked PR is to implement this logic for `vscodessh` as well.
Adds a Logger to cli Invocation and standardizes CLI commands to use it. clitest creates a test logger by default so that CLI command logs are captured in the test logs.
CLI commands that do their own log configuration are modified to add sinks to the existing logger, rather than create a new one. This ensures we still capture logs in CLI tests.
Fixes an issue where remote forwards are not correctly torn down when using OpenSSH with `coder ssh --stdio`. OpenSSH sends a disconnect signal, but then also sends SIGHUP to `coder`. Previously, we just exited when we got SIGHUP, and this raced against properly disconnecting.
Fixes https://github.com/coder/customers/issues/327
* chore: add /v2 to import module path
Go modules require a `/vN` suffix in the module path for major versions greater than 1.
This was a mechanical update by running:
```
go install github.com/marwan-at-work/mod/cmd/mod@latest
mod upgrade
```
Migrate generated files to import /v2
* Fix gen
* chore: rename startup logs to agent logs
This also adds a `source` property to every agent log. It should allow us to group logs and display them more nicely in the UI as they stream in.
* Fix migration order
* Fix naming
* Rename the frontend
* Fix tests
* Fix down migration
* Match enums for workspace agent logs
* Fix inserting log source
* Fix migration order
* Fix logs tests
* Fix psql insert
- (breaking) Protects Logger and LogBodies fields of codersdk.Client with its mutex. This addresses a data race in cli/scaletest.
- Fillets the existing cli/createworkspaces unit test and moves its testing logic into the tests under scaletest/createworkspaces.
- Adds a testutil.RaceEnabled bool const and conditionally skips previously-skipped tests under scaletest/ if the race detector is enabled (see the sketch below). This is unfortunate and sad, but I would prefer to have these tests at least running without the race detector than not running at all.
- Adds IgnoreErrors option to fake in-memory agent loggers; having the agents fail the test immediately when they encounter any sort of error isn't really helpful.
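For reference, the usual way to expose a race-detector flag like this is the build-tag idiom (a sketch; file names are illustrative):

```go
// race_enabled.go
//go:build race

package testutil

// RaceEnabled reports whether the race detector is active in this build.
const RaceEnabled = true
```

```go
// race_disabled.go
//go:build !race

package testutil

const RaceEnabled = false
```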