Addresses #14734.
This PR wires up `tunnel.go` to a `tailnet.Conn` via the new `/tailnet` endpoint, with all the necessary controllers such that a VPN connection can be started, stopped and inspected via the CoderVPN protocol.
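Roughly, the surface this exposes looks like the following sketch (names here are hypothetical; the real controller API lives in `tunnel.go` and the tailnet package):

```go
package example

import "context"

// vpnTunnel is a hypothetical sketch of what tunnel.go exposes once wired
// to a tailnet.Conn; the real controller types differ.
type vpnTunnel interface {
	// Start dials the deployment's /tailnet endpoint and brings up the
	// underlying tailnet.Conn.
	Start(ctx context.Context, accessURL, token string) error
	// Stop tears the VPN connection down.
	Stop(ctx context.Context) error
	// Status reports the current tunnel state for inspection over the
	// CoderVPN protocol.
	Status(ctx context.Context) (string, error)
}
```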
re: #14730
Adds support for the workspace updates protocol controller to also program DNS names for each agent.
Right now, we only program names like `myagent.myworkspace.me.coder.` and `myworkspace.coder.` (the latter only if there is exactly one agent in the workspace). We also want to support `myagent.myworkspace.username.coder.`, but for that we need to update the WorkspaceUpdates RPC to also send the workspace owner's username; that will come in a separate PR.
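For illustration, the scheme amounts to something like this sketch (`dnsNamesForWorkspace` is a hypothetical helper, not the controller's actual code):

```go
package main

import "fmt"

// dnsNamesForWorkspace sketches the naming scheme described above: one name
// per agent, plus a workspace-level shorthand when the workspace has exactly
// one agent. "me" is the owner-relative label; owner-username names need the
// WorkspaceUpdates RPC change first.
func dnsNamesForWorkspace(workspace string, agents []string) []string {
	names := make([]string, 0, len(agents)+1)
	for _, agent := range agents {
		names = append(names, fmt.Sprintf("%s.%s.me.coder.", agent, workspace))
	}
	if len(agents) == 1 {
		names = append(names, fmt.Sprintf("%s.coder.", workspace))
	}
	return names
}

func main() {
	fmt.Println(dnsNamesForWorkspace("myworkspace", []string{"myagent"}))
	// [myagent.myworkspace.me.coder. myworkspace.coder.]
}
```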
re: #14715
This PR introduces the Coder service prefix, `fd60:627a:a42b::/48`, and refactors our existing code to reference the Tailscale service prefix explicitly (rather than implicitly).
Removes the unused `Addresses` agent option. All clients today assume they can compute an agent's IP address from its UUID, so an agent started with a custom address would break things.
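Since a /48 prefix leaves 80 host bits, a stable per-agent address can be derived from the UUID. A minimal sketch, assuming the last 10 bytes of the UUID fill the host bits; the codebase's exact derivation may keep different bytes:

```go
package main

import (
	"fmt"
	"net/netip"

	"github.com/google/uuid"
)

// coderServicePrefix is the Coder service prefix introduced above.
var coderServicePrefix = netip.MustParsePrefix("fd60:627a:a42b::/48")

// addrFromUUID fills the 80 host bits of the /48 prefix with the last 10
// bytes of the agent UUID, yielding a stable per-agent address.
func addrFromUUID(id uuid.UUID) netip.Addr {
	var out [16]byte
	prefix := coderServicePrefix.Addr().As16()
	copy(out[:6], prefix[:6]) // 48-bit prefix
	copy(out[6:], id[6:])     // last 10 of the UUID's 16 bytes
	return netip.AddrFrom16(out)
}

func main() {
	id := uuid.MustParse("aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee")
	fmt.Println(addrFromUUID(id)) // an address inside fd60:627a:a42b::/48
}
```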
First PR to address #14244.
Adds to `coder ping` common reasons why a direct connection to the workspace agent couldn't be established:
- If the Coder deployment administrator has blocked direct connections (`CODER_BLOCK_DIRECT`).
- If the client has no STUN servers within its DERP map.
- If the client or agent appears to be behind a hard NAT, as reported by Tailscale's `netInfo.MappingVariesByDestIP`.
Also adds a warning if the client or agent has a network interface below the 'safe' MTU for tailnet. This warning is always displayed at the end of a `coder ping`.
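A sketch of the checks behind these diagnostics; the `tailcfg` fields are real, while the surrounding plumbing is hypothetical:

```go
package main

import (
	"fmt"

	"tailscale.com/tailcfg"
)

// directConnectionWarnings mirrors the diagnostics above: deployment-level
// blocking, a STUN-less DERP map, and hard-NAT detection. blockDirect would
// come from deployment config (CODER_BLOCK_DIRECT); ni from the node's
// last-reported network info.
func directConnectionWarnings(blockDirect bool, dm *tailcfg.DERPMap, ni *tailcfg.NetInfo) []string {
	var warns []string
	if blockDirect {
		warns = append(warns, "direct connections are blocked by the deployment")
	}
	if !derpMapHasSTUN(dm) {
		warns = append(warns, "no STUN servers in the DERP map; endpoint discovery is impossible")
	}
	if ni != nil && ni.MappingVariesByDestIP.EqualBool(true) {
		warns = append(warns, "appears to be behind a hard NAT")
	}
	return warns
}

func derpMapHasSTUN(dm *tailcfg.DERPMap) bool {
	for _, region := range dm.Regions {
		for _, node := range region.Nodes {
			if node.STUNPort != -1 { // -1 disables STUN on the node
				return true
			}
		}
	}
	return false
}

func main() {
	fmt.Println(directConnectionWarnings(true, &tailcfg.DERPMap{}, nil))
}
```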
I initially made this change while hacking on wgengine to also capture wireguard packets going into the magicsock, so that we could capture the initial wireguard handshake.
I don't think we should ship that additional capture logic, but... it seems generally useful to capture packets from the get-go on speedtest, so that you can see disco and pings before the TCP speedtest session starts.
When an agent receives a node, it responds with an ACK, which is relayed to the client. After the client receives the ACK, it's allowed to begin pinging.
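On the client side, that gating is essentially the following (the channel plumbing here is hypothetical):

```go
package example

import "context"

// waitForAgentACK blocks until the coordinator relays the agent's ACK for
// our node, or the context expires. Pinging before the ACK could race the
// agent programming our node into its wireguard config.
func waitForAgentACK(ctx context.Context, acks <-chan struct{}) error {
	select {
	case <-acks:
		return nil // the agent has processed our node; safe to ping
	case <-ctx.Done():
		return ctx.Err()
	}
}
```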
Beginnings of a solution to #12297
Doesn't cover disco or definitively display whether we successfully connected to DERP, but shows some checklist diagnostics for connecting to an agent.
For this first PR, I just added it to `coder ping` to see how we like it, but could be incorporated into `coder ssh` _et al._ after a timeout.
```
$ coder ping dogfood2
p2p connection established in 147ms
pong from dogfood2 p2p via 95.217.xxx.yyy:42631 in 147ms
pong from dogfood2 p2p via 95.217.xxx.yyy:42631 in 140ms
pong from dogfood2 p2p via 95.217.xxx.yyy:42631 in 140ms
✔ preferred DERP region 999 (Council Bluffs, Iowa)
✔ sent local data to Coder networking coordinator
✔ received remote agent data from Coder networking coordinator
preferred DERP 10013 (Europe Fly.io (Paris))
endpoints: 95.217.xxx.yyy:42631, 95.217.xxx.yyy:37576, 172.17.0.1:37576, 172.20.0.10:37576
✔ Wireguard handshake 11s ago
```
I noticed a possible race where tailnet.Conn can try to dial the embedded region before we've set our custom dialer that sends DERP traffic in-memory. This closes that race and adds a test case for servertailnet with no STUN and an embedded relay.
Fixes 2 related issues:
1. wsconncache had incorrect logic to test whether to send DERPMap updates, sending if the maps were equivalent, instead of if they were _not equivalent_.
2. configmaps used a buggy check to test equality between DERP maps: the protobuf contains a map field, and map entries are serialized in nondeterministic order, so equivalent maps can compare as different. Instead, we avoid comparing the protobufs and depend on the existing function that compares `tailcfg.DERPMap`. This also has the effect of reducing the number of times we convert to and from protobuf.
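To illustrate both bugs in miniature (`derpMapsEqual` stands in for the existing `tailcfg.DERPMap` comparison function):

```go
package main

import (
	"fmt"
	"reflect"

	"tailscale.com/tailcfg"
)

// Comparing marshaled protobufs is unreliable here: the DERP map proto has a
// map field for regions, and protobuf map entries serialize in
// nondeterministic order, so equivalent maps can yield different bytes.
// Comparing at the tailcfg layer sidesteps serialization entirely.
func derpMapsEqual(a, b *tailcfg.DERPMap) bool {
	// Stand-in for the existing comparison function in the codebase.
	return reflect.DeepEqual(a, b)
}

func shouldSendUpdate(current, next *tailcfg.DERPMap) bool {
	// The wsconncache bug was the inverse of this condition: it sent an
	// update when the maps WERE equivalent instead of when they were not.
	return !derpMapsEqual(current, next)
}

func main() {
	a := &tailcfg.DERPMap{Regions: map[int]*tailcfg.DERPRegion{1: {RegionID: 1}}}
	b := &tailcfg.DERPMap{Regions: map[int]*tailcfg.DERPRegion{1: {RegionID: 1}}}
	fmt.Println(shouldSendUpdate(a, b)) // false: equivalent, so no update
}
```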
Use TSMP ping for reachability, but leave Disco ping for when we call Ping() since we often use that to determine whether we have a direct connection.
Also adds unit tests to make sure Ping() returns direct connection vs DERP correctly.
Adds support to Coordination to call SetAllPeersLost() when it is closed. This ensures that when we disconnect from a Coordinator, we set all peers lost.
This covers CoderSDK (CLI client) and Agent. Next PR will cover MultiAgent (notably, `wsproxy`).
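In outline, the close path looks like this (the real wiring lives in the tailnet package; `peerSetter` here is a stand-in):

```go
package example

import "sync"

// peerSetter is a stand-in for the part of tailnet.Conn that tracks peers.
type peerSetter interface {
	SetAllPeersLost()
}

// coordination pairs a coordinator connection with the local peer state.
type coordination struct {
	closeOnce sync.Once
	peers     peerSetter
	closeConn func() error
}

// Close tears down the coordinator connection and marks every peer LOST, so
// stale peers are eventually cleaned up rather than lingering forever.
func (c *coordination) Close() (err error) {
	c.closeOnce.Do(func() {
		err = c.closeConn()
		c.peers.SetAllPeersLost()
	})
	return err
}
```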
This one is huge, and I'm sorry.
The problem is that once I change `tailnet.Conn` to start doing v2 behavior, I kind of have to change it everywhere, including in CoderSDK (CLI), the agent, wsproxy, and ServerTailnet.
There is still a bit more cleanup to do, and I need to add code so that when we lose connection to the Coordinator, we mark all peers as LOST, but that will be in a separate PR since this is big enough!
We're seeing some flaky tests related to agent connectivity - https://github.com/coder/coder/actions/runs/7286675441/job/19856270998
I'm pretty sure what happened in this one is that the client opened a connection while the wgengine was in the process of reconfiguring the wireguard device, so the fact that the peer became "active" as a result of traffic being sent was not noticed.
The test calls `AwaitReachable()` but this only tests the disco layer, so it doesn't wait for wireguard to come up.
I think we should be using TSMP for pinging and reachability, since this operates at the IP layer, and therefore requires that wireguard comes up before being successful.
This should also help with the problems we have seen where a TCP connection starts before wireguard is up and the initial round trip has to wait for the 5-second wireguard handshake retry.
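The shape of a TSMP-based wait, where `ping` stands in for whatever issues a single TSMP ping via wgengine (its real signature differs):

```go
package example

import (
	"context"
	"time"
)

// awaitReachableTSMP polls with TSMP pings until the peer answers. Because
// TSMP rides inside the wireguard tunnel (IP layer), a pong implies the
// handshake completed, unlike disco pings, which only prove the path outside
// the tunnel.
func awaitReachableTSMP(ctx context.Context, ping func(context.Context) error) error {
	ticker := time.NewTicker(50 * time.Millisecond)
	defer ticker.Stop()
	for {
		if err := ping(ctx); err == nil {
			return nil // tunnel is up end to end
		}
		select {
		case <-ticker.C:
		case <-ctx.Done():
			return ctx.Err()
		}
	}
}
```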
fixes: #11294
Fixes an issue where remote forwards are not correctly torn down when using OpenSSH with `coder ssh --stdio`. OpenSSH sends a disconnect signal, but then also sends SIGHUP to `coder`. Previously, we just exited when we got SIGHUP, and this raced against properly disconnecting.
Fixes https://github.com/coder/customers/issues/327
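The fix amounts to treating SIGHUP as the start of a graceful shutdown and waiting for the session teardown to finish, rather than exiting immediately. A simplified sketch, with `runStdioSession` as a hypothetical stand-in for the actual session loop:

```go
package main

import (
	"context"
	"os"
	"os/signal"
	"syscall"
)

func main() {
	// Previously: exit as soon as SIGHUP arrives, racing the disconnect
	// that OpenSSH also initiates over stdio.
	ctx, stop := signal.NotifyContext(context.Background(), syscall.SIGHUP, os.Interrupt)
	defer stop()

	done := make(chan struct{})
	go func() {
		defer close(done)
		runStdioSession(ctx) // hypothetical: serves the --stdio session
	}()

	<-ctx.Done() // SIGHUP (or interrupt) received: begin shutdown
	<-done       // wait for remote forwards etc. to be torn down cleanly
}

func runStdioSession(ctx context.Context) {}
```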
I've said it before, I'll say it again: you can't create a timed context before calling `t.Parallel()` and then use it after.
Fixes flakes like https://github.com/coder/coder/actions/runs/6716682414/job/18253279157
I've chosen to drop `t.Parallel()` entirely rather than create a second context after the parallel call, since the vast majority of the test time happens before the point where the parallel call was: all the tailnet setup happens before `t.Parallel()`.
Leaving a call to `t.Parallel()` in place is a bug risk: a future maintainer could accidentally use the wrong context in the latter part of the test.
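For reference, the failure mode in miniature: `t.Parallel()` pauses the test until the package's serial phase completes, while a timer created beforehand keeps draining during the pause.

```go
package example

import (
	"context"
	"testing"
	"time"
)

func TestBuggy(t *testing.T) {
	ctx, cancel := context.WithTimeout(context.Background(), 10*time.Second)
	defer cancel()

	// BUG: pauses here until the serial phase of the package ends; the 10s
	// budget keeps draining while we wait, so ctx may already be expired
	// (or nearly so) when the test resumes.
	t.Parallel()

	select {
	case <-ctx.Done():
		t.Fatal("context expired before the test body even started")
	default:
	}
	_ = ctx // ... the rest of the test uses a possibly-dead context
}
```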
* chore: add /v2 to import module path
Go modules require semantic import versioning: a module path at major version greater than 1 must carry a matching `/vN` suffix.
This was a mechanical update by running:
```
go install github.com/marwan-at-work/mod/cmd/mod@latest
mod upgrade
```
Migrate generated files to import /v2
* Fix gen
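The mechanical change itself is just the module path gaining a major-version suffix, with every internal import following; illustratively (`coderd` chosen as an arbitrary package):

```
-module github.com/coder/coder
+module github.com/coder/coder/v2

-import "github.com/coder/coder/coderd"
+import "github.com/coder/coder/v2/coderd"
```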