coder

mirror of https://github.com/coder/coder.git synced 2025-07-03 16:13:58 +00:00

Author	SHA1	Message	Date
Yevhenii Shcherbina	53e8e9c7cd	fix: reduce cost of prebuild failure (#17697 ) Relates to https://github.com/coder/coder/issues/17432 ### Part 1: Notes: - `GetPresetsAtFailureLimit` SQL query is added, which is similar to `GetPresetsBackoff`, they use same CTEs: `filtered_builds`, `time_sorted_builds`, but they are still different. - Query is executed on every loop iteration. We can consider marking specific preset as permanently failed as an optimization to avoid executing query on every loop iteration. But I decided don't do it for now. - By default `FailureHardLimit` is set to 3. - `FailureHardLimit` is configurable. Setting it to zero - means that hard limit is disabled. ### Part 2 Notes: - `PrebuildFailureLimitReached` notification is added. - Notification is sent to template admins. - Notification is sent only the first time, when hard limit is reached. But it will `log.Warn` on every loop iteration. - I introduced this enum: ```sql CREATE TYPE prebuild_status AS ENUM ( 'normal', -- Prebuilds are working as expected; this is the default, healthy state. 'hard_limited', -- Prebuilds have failed repeatedly and hit the configured hard failure limit; won't be retried anymore. 'validation_failed' -- Prebuilds failed due to a non-retryable validation error (e.g. template misconfiguration); won't be retried. ); ``` `validation_failed` not used in this PR, but I think it will be used in next one, so I wanted to save us an extra migration. - Notification looks like this: <img width="472" alt="image" src="https://github.com/user-attachments/assets/e10efea0-1790-4e7f-a65c-f94c40fced27" /> ### Latest notification views: <img width="463" alt="image" src="https://github.com/user-attachments/assets/11310c58-68d1-4075-a497-f76d854633fe" /> <img width="725" alt="image" src="https://github.com/user-attachments/assets/6bbfe21a-91ac-47c3-a9d1-21807bb0c53a" />	2025-05-21 15:16:38 -04:00
Danny Kopping	6e967780c9	feat: track resource replacements when claiming a prebuilt workspace (#17571 ) Closes https://github.com/coder/internal/issues/369 We can't know whether a replacement (i.e. drift of terraform state leading to a resource needing to be deleted/recreated) will take place apriori; we can only detect it at `plan` time, because the provider decides whether a resource must be replaced and it cannot be inferred through static analysis of the template. This is likely to be the most common gotcha with using prebuilds, since it requires a slight template modification to use prebuilds effectively, so let's head this off before it's an issue for customers. Drift details will now be logged in the workspace build logs: ![image](https://github.com/user-attachments/assets/da1988b6-2cbe-4a79-a3c5-ea29891f3d6f) Plus a notification will be sent to template admins when this situation arises: ![image](https://github.com/user-attachments/assets/39d555b1-a262-4a3e-b529-03b9f23bf66a) A new metric - `coderd_prebuilt_workspaces_resource_replacements_total` - will also increment each time a workspace encounters replacements. We only track _that_ a resource replacement occurred, not how many. Just one is enough to ruin a prebuild, but we can't know apriori which replacement would cause this. For example, say we have 2 replacements: a `docker_container` and a `null_resource`; we don't know which one might cause an issue (or indeed if either would), so we just track the replacement. --------- Signed-off-by: Danny Kopping <dannykopping@gmail.com>	2025-05-14 14:52:22 +02:00
Vincent Vielle	148dae1e9f	fix: add fallback icons for notifications (#17013 ) Related: https://github.com/coder/internal/issues/522	2025-03-28 12:21:48 +01:00
Danielle Maywood	d2419c89ac	feat: add tool to send a test notification (#16611 ) Relates to https://github.com/coder/coder/issues/16463 Adds a CLI command, and API endpoint, to trigger a test notification for administrators of a deployment.	2025-02-19 13:08:38 +00:00
Danielle Maywood	dbad69dbd9	chore: add workspace oom/ood notification templates (#16250 )	2025-02-04 19:25:18 +00:00
Danielle Maywood	f3fe3bc785	feat: notify on workspace update (#15979 ) Relates to https://github.com/coder/coder/issues/15845 When the `/workspace/<name>/builds` endpoint is hit, we check if the requested template version is different to the previously used template version. If these values differ, we can assume that the workspace has been manually updated and send the appropriate notification. Automatic updates happen in the lifecycle executor and bypasses this endpoint entirely.	2025-01-02 12:19:34 +00:00
Danielle Maywood	f0e81ab455	feat: notify on workspace creation (#15934 )	2024-12-20 13:53:10 +00:00
Danielle Maywood	095c9797c9	feat: notify users on template deprecation (#15195 ) Closes https://github.com/coder/coder/issues/15117 Notify users when a template has been deprecated.	2024-10-24 13:12:12 +01:00
Danielle Maywood	4369f2b4b5	feat: implement api for "forgot password?" flow (#14915 ) Relates to https://github.com/coder/coder/issues/14232 This implements two endpoints (names subject to change): - `/api/v2/users/otp/request` - `/api/v2/users/otp/change-password`	2024-10-04 11:53:25 +01:00
Marcin Tojek	6de59371ea	feat: notifications: report failed workspace builds (#14571 )	2024-09-18 09:11:44 +02:00
Marcin Tojek	47f2c7d683	feat: notify about manual failed builds (#14419 )	2024-08-27 14:35:28 +00:00
Marcin Tojek	c818b4ddd4	feat: add notification for suspended/activated account (#14367 ) * migrations * notify * fix * TestNotifyUserSuspended * TestNotifyUserReactivate * post merge * fix escape * TestNotificationTemplatesCanRender * links and events * notifyEnq * findUserAdmins * notifyUserStatusChanged * go build * your and admin * tests * refactor * 247 * Danny's review	2024-08-22 13:52:25 +02:00
Bruno Quaresma	6f1951e1c8	feat: add template delete notification (#14250 )	2024-08-14 14:22:43 -03:00
Danny Kopping	c6076d2d0d	chore: improve notification template tests' resilience (#14196 )	2024-08-07 11:33:26 +02:00
Marcin Tojek	6428a766a9	feat: notify when a user account is deleted (#14113 )	2024-08-02 14:56:54 +02:00
Marcin Tojek	cf1fcab514	feat: notify about created user account (#14010 )	2024-07-30 15:37:45 +02:00
Bruno Quaresma	0d9615b4fd	feat(coderd): notify when workspace is marked as dormant (#13868 )	2024-07-24 13:38:21 -03:00
Marcin Tojek	fbd1d7f9a7	feat: notify on successful autoupdate (#13903 )	2024-07-18 15:19:12 +02:00
Marcin Tojek	a5e4bf38fe	feat: notify owner about failed autobuild (#13891 )	2024-07-16 10:48:17 +02:00
Danny Kopping	bdd2caf95d	feat: implement thin vertical slice of system-generated notifications (#13537 )	2024-07-08 15:38:50 +02:00

20 Commits