balena-supervisor

mirror of https://github.com/balena-os/balena-supervisor.git synced 2025-05-31 06:41:05 +00:00

Author	SHA1	Message	Date
Felipe Lalanne	ccae1f7cb8	Rename aplication manager getStatus as getLegacyState With the move to v3 target state and the move forward to remove database ids from the supervisor, we want to ensure the ids are only used for legacy support (such as within the API). This change renames the method and sets it as deprecated	2022-03-22 19:08:02 -03:00
Felipe Lalanne	21c1c006f7	Always add status to image download report It seems that in some cases the supervisor can report an image without a `status` field leading to a cloud side 401 response. See #1905 for more details. Change-type: patch	2022-03-21 14:39:29 -03:00
Felipe Lalanne	d1956b69cc	Fix check for supervisor0 network The check for the docker network supervisor0 assumed that if the interface supervisor0 existed, then the network would exist too. However this is not true on the case of docker directory corruption, which would lead to a loop with `Error: (HTTP code 404) no such network - network supervisor0 not found`. Change-type: patch Closes: #1806	2022-02-25 19:46:59 -03:00
Felipe Lalanne	1b54ce8bfd	Ignore selinux security opts when comparing services The moby engine v20.x.y adds some selinux [security configurations](https://docs.docker.com/engine/reference/run/#security-configuration) depending on the [container configuration](https://github.com/moby/moby/blob/master/daemon/create.go#L214). This would cause the supervisor to enter a service restart loop as the current and target service configurations will never match. The supervisor now ignores selinux specific security options since those are not supported by balenaOS. Closes: #1890 Change-type: patch	2022-02-23 18:12:27 -03:00
Christina Wang	4f446103f4	Remove lockingIfNecessary in favor of updateLock.lock The functionality is pretty much the same, so we don't need the two functions in two different places. Signed-off-by: Christina Wang <christina@balena.io>	2022-02-14 22:06:18 +00:00
Felipe Lalanne	fa0e28de6d	Clean up image event reporting	2022-02-01 18:35:50 -03:00
Pagan Gazzard	ae501048f5	Ensure the `finish` event is always reported when fetching images Change-type: patch	2022-01-18 11:45:13 +00:00
Felipe Lalanne	f6692ab918	Convert target state types to io-ts for better validation This simplifies target state validation and improves validation messages. Change-type: patch	2021-12-02 15:29:37 -03:00
Felipe Lalanne	ca7c22d854	Move lib/types.ts to src/types/basic.ts	2021-12-02 15:29:37 -03:00
Felipe Lalanne	6fd516a930	Fix broken local mode after PR #1824 PR #1824 changed app update behavior to test that all images are there before moving between releases. This check always fails in local mode since local mode images are handled differently. This PR fixes local mode again by skipping the check when `localMode` is set. Change-type: patch	2021-11-17 17:54:25 -03:00
Felipe Lalanne	394377e0a1	Fix `delete-then-download` strategy The strategy has been broken for a while but it was not clear how to fix it before the changes to image management. This PR fixes application manager to remove images before downloading the new image. This will only have an effect on changing images. Closes: #1233 Change-type: patch	2021-11-16 16:40:15 -03:00
Felipe Lalanne	7aedc97ee1	Wait for images to be ready before moving between releases For download-then-kill strategy, this waits for all changing images on the target release to be available on device before killing the old services. This will prevent that multicontainer applications get to a state where some services of the new release start runnning much before others have been downloaded. When adding new services to a multicontainer app, the supervisor will now wait for other changing services to be downloaded before starting the new service. Closes: #1812 Change-type: patch	2021-11-11 14:08:36 -03:00
Felipe Lalanne	969f4225e5	Check config for networks and volumes inside Service This removes the need for the app module to know about the naming conventions for networks and volumes since those exist now within the service itself. This also fixes a small bug where the volume would be removed before the service itself had been successfully stopped. Change-type: patch	2021-10-28 10:20:53 -03:00
Felipe Lalanne	6f5f3bc2f3	Fix regression with local mode push PR #1749 introduced a bug when pushing local target state. An update to the [image name normalization](`f1bd4b8d9b/src/lib/docker-utils.ts (L81)`) failed to consider the local image name format. This results in mangling of image names in the database, i.e. the image `ubuntu:latest` is stored as `/ubuntu:latest`. This causes an exception to be returned by the dockerode `getImage('/ubuntu:latest').inspect()` call. This sends the supervisor into a crash loop and is shown on the supervisor journal logs as ``` getaddrinfo ENOTFOUND images at GetAddrInfoReqWrap.onlookup [as oncomplete] (dns.js:64:26) ``` Unfortunately if this happens on a user device, since the mangled image name is already on the database, the easiest way to fix is to remove the supervisor database and let the supervisor recreate it. Deleting the database should be side effect free. Change-type: patch	2021-08-02 11:52:07 -04:00
Felipe Lalanne	104a8006fb	Update apiSecret table to id services by name It adds a migration replacing the serviceId column by serviceName and populates serviceNames from services in the target state.	2021-07-28 09:57:38 -04:00
Felipe Lalanne	b67f94802d	Remove comparison based on image, release, and service ids Preparing for the new v3 target state, where the supervisor will make environment dependent ids optional and rely on using general UUIDs and user known identifiers for comparison. This PR moves forward in that direction by removing some of those comparisons for v2 target state. - imageId to be replaced with imageName - serviceId to be replace by serviceName - releaseId to be replaced by commit (future release_uuid) This is a backwards compatible change, meaning it doesn't completely get rid of these identifiers (which are still being used by supervisor API and for state patch), but will not depend on those identifiers for calculating steps to target state. Change-type: minor	2021-07-28 09:57:38 -04:00
Felipe Lalanne	77070712a4	Remove image manager appUpdatePollInterval listener	2021-07-28 09:57:36 -04:00
Felipe Lalanne	a1d098d8f3	Refactor image "volatile state" to use state pattern This replaces stored `volatileState` with a more declarative ImageTask API. An ImageTask stores volatile image state for operations that cannot be obtained through an engine query, such as fetching and removing an image, state that can be updated while the task is running. Image controller methods can now use the `reportEvent` method to create and update the state of a longer running task.	2021-07-28 09:56:38 -04:00
Felipe Lalanne	f1bd4b8d9b	Use tags to track supervised images in docker The image manager module now uses tags instead of docker IDs as the main way to identify docker images on the engine. That is, if the target state image has a name `imageName:tag@digest`, the supervisor will always use the given `imageName` and `tag` (which may be empty) to tag the image on the engine after fetching. This PR also adds checkups to ensure consistency is maintained between the database and the engine. Using tags allows to simplify query and removal operations, since now removing the image now means removing tags matching the image name. Before this change the supervisor relied only on information in the supervisor database, and used that to remove images by docker ID. However, the docker id is not a reliable identifier, since images retain the same id between releases or between services in the same release. List of squashed commits - Remove custom type NormalizedImageInfo - Remove dependency on docker-toolbelt - Use tags to traack supervised images in docker - Ensure tag removal occurs in sequence - Only save database image after download confirmed Relates-to: #1616 #1579 Change-type: patch	2021-07-26 09:52:25 -04:00
Felipe Lalanne	e04e64763f	Improve testing for supervisor composition modules This PR cleans up testing for supervisor compose modules. It also fixes broken tests for application manager and removes a lot of dependencies for those tests on DB and other unnecessary mocks. There are probably a lot of cases that tests are missing but this should make writing new tests a lot easier. This PR also creates a new mock dockerode (mockerode) module that should make it easier to test operations that interact with the engine. All references to the old mock-dockerode have not yet been removed but that should come soon in another PR List of squashed commits: - Add tests for network create/remove - Move compose service tests to test/src/compose and reorganize test descriptions - Add support for image creation to mockerode - Add additional tests for compose volumes - Update mockerode so unimplemented fake methods throw. This is to ensure tests using mockerode fail if an unimplemented method is used - Update tests for volume-manager with mockerode - Update tests for compose/images - Simplify tests using mockerode - Clean up compose/app tests - Create application manager tests Change-type: minor	2021-07-05 17:50:52 -04:00
Felipe Lalanne	2fa0d3dc43	Fix supervisor using wrong source for deltas This fixes a specific issue when the supervisor cannot find the right source for deltas (e.g. after the DB gets deleted), where legacy behavior was to look for any image in the app. Change-type: patch Relates-to: #1729	2021-06-25 16:24:51 -04:00
Pagan Gazzard	ee4d919fca	Improve target state typings Change-type: patch	2021-06-08 13:45:44 +01:00
Miguel Casqueira	ab4fb454e0	Refactor debug log when unmanaged volume is found Change-type: patch Signed-off-by: Miguel Casqueira <miguel@balena.io>	2021-06-02 13:07:24 -04:00
Felipe Lalanne	5197a1330d	Show warning instead of exception for invalid network config A previous PR (#1656) fixed validation for network ipam config, checking that both network and subnet are defined for each ipam config entry (as described in the docker documentation). After that PR, the validations throws an exception if the network target state is incorrect, but this turns out to be the wrong approach, because that exception is also triggered when querying target state. This isn't a problem in normal operation, but it is in local mode, because local mode queries the old target state before sending a new one. Since the query fails, the CLI can never push the new target state. This PR replaces the exception with a warning on the logs, since a misconfigured network won't cause any engine failures, it will just prevent containers to communicate through the provided network. A future improvement should move this validation to an earlier point in the process, so the target state can get rejected before it even gets to a point it can be used. Relates-to: #1693 Change-type: patch	2021-05-06 16:27:40 -04:00
Miguel Casqueira	8b0c2347d8	Patch awaiting response when checking if supervisor0 network exists Change-type: patch Signed-off-by: Miguel Casqueira <miguel@balena.io>	2021-05-06 14:41:32 +00:00
quentinGllmt	1408fd7bcb	Fix parsing driver_opts from compose to docker network creation Change-type: patch Signed-off-by: quentinGllmt <quentin@quentingllmt.fr>	2021-05-06 16:50:11 +02:00
Christina Wang	4a2ac557ef	Remove mz, mkdirp, body-parser dependencies 'mz' can be safely replaced with fs.promises and util.promisify for faster native methods. 'mkdirp' after Node v8 uses native fs.mkdir, thus is redundant. 'body-parser' is deprecated and contained within express v4.x. Closes: #1567 Change-type: patch Signed-off-by: Christina Wang <christina@balena.io>	2021-04-28 07:20:15 +09:00
Felipe Lalanne	95fb568aae	Bump dockerode types to 2.5.34 This commit updates dockerode types to the latest 2.x version, removing the need for custom composer types for network. This commit also modifies network tests to use the new types Change-type: minor	2021-04-27 13:00:56 -04:00
Felipe Lalanne	fd06c06092	Update supervisor to typescript 4 Change-type: patch	2021-04-19 15:18:21 +00:00
Felipe Lalanne	fdb37191e7	Fix broken IPAM network validation Network validaton was failing to identify a bad IPAM network configuration leading to supervisor failures (see #1618) Change-type: patch Closes: #1618	2021-04-09 17:49:09 -04:00
Christina Wang	31effed426	Prevent unintended image removal when calling purge endpoints to remove volumes Using safeStateClone within doPurge to applyIntermediateTarget after successful volume purge has led to various type deficiencies being revealed in common.js. Add several inline types in common.js to satisfy the type checker (credit: Page <page@balena.io>). Delete common.d.ts since it's not required and might mistakenly mask true I/O types of functions in common.js. Closes: #1611 Change-type: patch Signed-off-by: Christina Wang <christina@balena.io>	2021-04-05 12:10:09 +00:00
Miguel Casqueira	ecbe9ee9f9	Patch list volumes to always return an array Change-type: patch Closes: #1636 Signed-off-by: Miguel Casqueira <miguel@balena.io>	2021-04-01 20:31:09 -04:00
Matthew McGinn	f9a157c9ec	typos: seperate -> separate mainly to get the docs one, but figured i could hit them all Change-type: patch Signed-off-by: Matthew McGinn <matthew@balena.io>	2021-03-17 14:27:53 -04:00
Miguel Casqueira	183ea88a2a	Infer legacy Volumes that do not have the supervised label Change-type: patch Closes: #1604 Signed-off-by: Miguel Casqueira <miguel@balena.io>	2021-03-15 19:46:53 -04:00
Miguel Casqueira	c602014617	Patch killServicesUsingApi to not get stuck in noop loop Change-type: patch Closes: #1594 Signed-off-by: Miguel Casqueira <miguel@balena.io>	2021-02-16 18:33:50 -05:00
Robert Günzler	f009d3a3e9	Fix gpu label support The device request object was created with untouched fields left unset. When comparing state to determine if a transition is required this would result in a mismatch between: { Driver: '', Count: 1, DeviceIDs: null, Capabilities: [Array], Options: null } and { Count: 1, Capabilities: [Array], } Which in turn resulted in the target service being continously restarted. The fix is to instantiate the object in full. Connects-to: https://github.com/balena-io/balena-supervisor/issues/1449 Connects-to: ae646a07ec6a6c96f7cb91f1d37898a94dbab47a Change-type: patch Signed-off-by: Robert Günzler <robertg@balena.io>	2021-02-09 11:27:03 +01:00
Rich Bayliss	bc9bdd1094	validation: Ensure commit lookup has a bound value Change-type: patch Signed-off-by: Rich Bayliss <rich@balena.io>	2020-11-11 11:01:20 +00:00
Cameron Diver	f08316dc57	Allow storing commits against their appIds This paves the way for running multiple applications and storing information related to the application against the application itself. A couple of hacks have been added to v1 and v2 endpoints to maintain compatability but these should eventually be removed with the addition of a v3 api. Change-type: minor Signed-off-by: Cameron Diver <cameron@balena.io>	2020-11-10 10:50:08 +00:00
Felipe Lalanne	01477e41b8	Mount docker socket under `/host/run` for services Currently, when the label `io.balena.features.balena-socket` is set, the balena engine socket is mounted under `/run/balena-engine.sock`. This causes a problem when using systemd inside the container, since this service remounts `/run` and `/run/lock` as tmpfs, causing the socket to become unavailable. Making a mount of the socket into `/host/run` solves this issue. This is the same approach taken with DBUS. Change-type: patch Signed-off-by: Felipe Lalanne <felipe@balena.io> Connects-to: #1494	2020-10-29 15:54:31 -03:00
Thomas Manning	2c83864f22	Change log source from docker to journalctl Change-type: minor Signed-off-by: Thomas Manning <thomasm@balena.io>	2020-10-28 16:09:42 +10:00
Miguel Casqueira	77333f1e11	Fixed evaluating if updates are needed to reach target state Closes: #1476 Change-type: patch Signed-off-by: Miguel Casqueira <miguel@balena.io>	2020-10-26 14:54:04 -04:00
Miguel Casqueira	edf23871d9	Improved log message when networks do not match Change-type: patch Signed-off-by: Miguel Casqueira <miguel@balena.io>	2020-10-19 12:01:50 -04:00
Felipe Lalanne	4795c336d0	Handle delete of multiple images with same dockerImageId A docker-compose.yml with the following structure ``` version: '2.1' services: app_1: build: ./noisy-1 image: noisy1 app_2: build: ./noisy-1 image: noisy1 app_3: build: ./noisy-1 image: noisy1 ``` Will lead to the supervisor creating multiple image database entries with the same dockerId (this is because of how the engine handles this particular case). This case is not handled by the removal process leading to image pile up and increased disk usage. Change-type: patch Signed-off-by: Felipe Lalanne <felipe@balena.io> Connects-to: #1434	2020-10-16 14:06:10 -04:00
Thomas Manning	1eeff698ac	Add features label `io.balena.features.journal-logs` Change-type: patch Signed-off-by: Thomas Manning <thomasm@balena.io>	2020-10-12 15:37:35 +10:00
Matthew McGinn	8e65466f2d	version: drop SUPERVISOR_VERSION env var In order to make supervisor upgrades more transparent, lets move away from this env var since it requires a container restart any time the supervisor is upgraded. We should ultimately move towards providing the supervisors set of capabilities, but that can come later Connects-to: #1447 Change-type: major Signed-off-by: Matthew McGinn <matthew@balena.io>	2020-09-29 11:22:30 -04:00
Rich Bayliss	c08de8701e	api: Implement scoped Supervisor API keys Each service, when requesting access to the Supervisor API, will now get an individual key which can be scoped to specific resources. In this iteration the default scope will be to the application that the service belongs to. We also have a `global` scope which is used by the cloud API when in managed mode. Change-type: patch Signed-off-by: Rich Bayliss <rich@balena.io>	2020-09-17 11:25:56 +00:00
Rich Bayliss	96c68166a1	application-manager: Convert to a singleton Change-type: patch Signed-off-by: Rich Bayliss <rich@balena.io> Signed-off-by: Cameron Diver <cameron@balena.io>	2020-09-14 11:23:36 +01:00
Pagan Gazzard	379730a9e1	Update typed-error to 3.x Update typed-error from 2.0.0 to 3.2.1 Change-type: patch	2020-08-19 10:07:54 +01:00
Cameron Diver	0e8d92e08a	Make service-manager module a singleton Change-type: patch Signed-off-by: Cameron Diver <cameron@balena.io>	2020-06-17 14:56:57 +00:00
Cameron Diver	adaad786af	Make volume-manager module a singleton Change-type: patch Signed-off-by: Cameron Diver <cameron@balena.io>	2020-06-17 14:56:57 +00:00

1 2 3 4 5 ...

316 Commits