Commit Graph

4676 Commits

Author SHA1 Message Date
Christina Ying Wang
fb1bd33ab6 Refine update locking interface
* Remove Supervisor lockfile cleanup SIGTERM listener
* Modify lockfile.getLocksTaken to read files from the filesystem
* Remove in-memory tracking of locks taken in favor of filesystem
* Require both `(resin-)updates.lock` to be locked with `nobody` UID
  for service to count as locked by the Supervisor

Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
10f294cf8e Add takeLock to state funnel
A takeLock step should be generated before any of the following steps:
* kill
* start
* stop
* updateMetadata
* restart
* handover

ALL services in an app will be locked for any of the above actions,
unless the action is generated through Supervisor API's
`POST /v2/applications/:appId/(start|stop|restart)-service` endpoints,
in which case only the target service will be locked.

A lock will be taken for a service before it starts by creating the
directory in /tmp before the Engine creates it through bind mounts.

Also, the commit simplifies the generation of service kill
steps from network/volume changes or removals.

Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
cf8d8cedd7 Simplify lock interface to prep for adding takeLock to state funnel
This commit changes a few things:

* Pass `force` to `takeLock` step directly. This allows us to remove
the `lockFn` used by app manager's action executors, setting takeLock
as the main interface to interact with the update lock module. Note
that this commit by itself will not pass tests, as no update locking
occurs where it once did. This will be amended in the next commit.

* Remove locking functions from doRestart & doPurge, as this is
the only area where skipLock is required.

* Remove `skipLock` interface, as it's redundant with the functionality
of `force`. The only time `skipLock` is true is in doRestart/doPurge,
as those API methods are already run within a lock function. We removed
the lock function which removes the need for skipLock, and in the next
commit we'll add locking as a composition step to replace the
functionality removed here.

* Remove some methods not in use, such as app manager's `stopAll`.

Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
2f728ee43e Change lock directory in tests to tmpfs
This prevents leftover lockfiles from interfering with tests
in between test runs.

Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
af6359f7ae Take lock before updating service metadata
Change-type: minor
Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
e6df78a22b Implement takeLock composition step + tests
This commit only implements the action that a takeLock step
results in. It does not add takeLock step generation logic
to the state funnel yet.

Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
f2843e1382 Add update lock release functionality to state funnel
releaseLock is a step that will be inferred if there are services
in target state, and if some of those services have locks taken by
the Supervisor.

The releaseLock composition step calls the method of the same name
in the updateLock module, which takes the exclusive process lock before
disposing all Supervisor lockfiles in the target appId.

This is half of the update lock incorporation into the state funnel, as
we also need to introduce a takeLock step which triggers during crucial
stages of device state transition.

Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
7cfc42e197 Separate rwlock functionality from update-lock for clarity
Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
d18a740a40 Add methods for easier checking of lockfile existence
Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
Christina Ying Wang
b9a6a6b685 Improve types & remove some lodash from state engine
Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-04-04 14:07:47 -07:00
flowzone-app[bot]
4596149d0e
v16.1.10 2024-03-28 16:51:52 +00:00
Florin Sarbu
6ce8da3f45
Merge pull request #2258 from DynamicDevices/shreya/revpi-connect-4-support
Add revpi-connect-4 to RPi variants
2024-03-28 18:50:57 +02:00
Shreya Patel
b5dbef82d7 Add revpi-connect-4 to RPi variants
We need the supervisor to be able to manage config.txt changes for the
RevPi Connect 4.

Change-type: patch
Signed-off-by: Shreya Patel <shreya@dynamicdevices.co.uk>
2024-03-27 11:55:15 +00:00
flowzone-app[bot]
14e91779f4
v16.1.9 2024-03-25 18:36:46 +00:00
Page-
b5e098b249
Merge pull request #2254 from balena-os/report-failure-log-error
Log the full error on device state report failure as it is more useful
2024-03-25 18:36:02 +00:00
Pagan Gazzard
20e57f7f16 Log the full error on device state report failure as it is more useful
The message can be an empty string or similarly unhelpful, therefore
logging the entire error means that we will have whatever the message
may be along with the stack trace and other info that will be helpful
even when the message is not

Change-type: patch
2024-03-25 15:17:09 -03:00
flowzone-app[bot]
073373192c
v16.1.8 2024-03-25 18:11:11 +00:00
flowzone-app[bot]
9b89b9ead5
Merge pull request #2255 from balena-os/es-version-es2022
Set @balena/es-version to es2022 to match tsconfig.json
2024-03-25 18:10:17 +00:00
Pagan Gazzard
6b0500cdbc Set @balena/es-version to es2022 to match tsconfig.json
Change-type: patch
2024-03-25 16:56:27 +00:00
flowzone-app[bot]
1e2d9a71f9
v16.1.7 2024-03-25 15:25:29 +00:00
Page-
3c1763c6a8
Merge pull request #2253 from balena-os/increase-auto-select-family-timeout
Increase the timeout for auto select family to 5000ms to avoid issues
2024-03-25 15:24:27 +00:00
Pagan Gazzard
5cd37e73ac Increase the timeout for auto select family to 5000ms to avoid issues
On slower networks the default of 250ms can cause problems as all
attempts will fail rather than only the ones for interfaces that do not
actually work correctly. Increasing this timeout to 5000ms will help to
avoid these issues

Change-type: patch
2024-03-25 15:05:13 +00:00
flowzone-app[bot]
7dd0323c2b
v16.1.6 2024-03-18 21:37:36 +00:00
flowzone-app[bot]
77e596cc13
Merge pull request #2252 from balena-os/pin-iptables-to-legacy
Pin iptables to 1.8.9 (legacy)
2024-03-18 21:36:20 +00:00
Christina Ying Wang
3d881347e7 Pin iptables to 1.8.9 (legacy)
With Alpine 3.19, iptables gets bumped to 1.8.10 which uses nftables.
The host OS still uses iptables 1.8.7 (legacy), and we should
use legacy as well until the OS uses nftables.

See: https://balena.zulipchat.com/#narrow/stream/345889-balena-io.2Fos/topic/iptables.20host.20vs.2E.20nftables.20Supervisor
Change-type: patch
Signed-off-by: Christina Ying Wang <christina@balena.io>
2024-03-18 14:15:24 -07:00
flowzone-app[bot]
8b173918ea
v16.1.5 2024-03-12 13:33:10 +00:00
flowzone-app[bot]
d5708ee4ed
Merge pull request #2250 from balena-os/fatrw
Update fatrw to v0.2.21
2024-03-12 13:31:45 +00:00
Felipe Lalanne
772702793c Update fatrw to v0.2.21
Update fatrw from 0.2.9 to 0.2.21

Change-type: patch
2024-03-08 11:43:54 -03:00
flowzone-app[bot]
17edef2836
v16.1.4 2024-03-08 14:10:23 +00:00
flowzone-app[bot]
8e7c4365b0
Merge pull request #2249 from balena-os/balena-request
Update balena-register-device and its deps
2024-03-08 14:09:36 +00:00
Felipe Lalanne
790f6b71e4 Update balena-register-device and its deps
This bumps the related dependencies

- balena-register-device
- balena-request
- balena-auth

Change-type: patch
2024-03-07 14:15:38 -03:00
flowzone-app[bot]
e2de416801
v16.1.3 2024-03-07 16:10:27 +00:00
flowzone-app[bot]
f4a42d4e3d
Merge pull request #2248 from balena-os/node-20
Update Node to v20
2024-03-07 16:09:40 +00:00
Felipe Lalanne
08727ed2b5 Remove dependency on @balena/happy-eyeballs
Node 20 now implements the happy eyeballs algorithm as part of its core
`net` module, with the [autoSelectFamily](https://nodejs.org/docs/latest-v20.x/api/net.html#netgetdefaultautoselectfamily) option of `socket.connect`. This option defaults to `true`, meaning that a separate
implementation of happy eyeballs is no longer needed.

Change-type: patch
2024-03-06 15:16:33 -03:00
Felipe Lalanne
b77dba2046 Update Node to v20
This updates the supervisor runtime to latest Node LTS version. There
are no breaking changes related to this bump.

Change-type: patch
2024-03-06 12:29:54 -03:00
flowzone-app[bot]
7bb85d7aa1
v16.1.2 2024-03-06 14:06:49 +00:00
flowzone-app[bot]
ff36ccf4a0
Merge pull request #2247 from balena-os/typescript-5
Update to typescript v5
2024-03-06 14:06:01 +00:00
Felipe Lalanne
6217546894 Update typescript to v5
This also updates code to use the default import syntax instead of
`import * as` when the imported module exposes a default. This is needed
with the latest typescript version.

Change-type: patch
2024-03-05 15:33:56 -03:00
Felipe Lalanne
3c546aa199 Add sync directory to lint sources 2024-03-05 15:33:50 -03:00
flowzone-app[bot]
59137f2baf
v16.1.1 2024-03-04 18:16:14 +00:00
flowzone-app[bot]
161f74a620
Merge pull request #2246 from balena-os/balena-lint
Update @balena/lint to v7
2024-03-04 18:15:27 +00:00
Felipe Lalanne
988a1c9e9a Update @balena/lint to v7
This updates balena lint to the latest version to enable eslint support
and unblock Typescript updates. This is a huge number of changes as the
linting rules are much more strict now, requiring modifiying most files
in the codebase. This commit also bumps the test dependency `rewire` as
that was interfering with the update of balena-lint

Change-type: patch
2024-03-01 18:27:30 -03:00
flowzone-app[bot]
8750951521
v16.1.0 2024-02-27 19:30:56 +00:00
flowzone-app[bot]
d75776131d
Merge pull request #2244 from balena-os/duplicate-overlays
Add support for repeated overlays
2024-02-27 19:29:23 +00:00
Felipe Lalanne
bda1bac04c Add support for repeated overlays
RPI firmware configuration allows repeating overlays to define
configurations on multiple devices. For instance, for configuring
multiple `ads` devices, `config.txt` needs to be setup this way

```
dtoverlay=ads1115,addr=0x48
dtoverlay=ads1115,addr=0x49
```

Before this change, the supervisor would interpret both lines as
belonging to the same overlay, preventing users from configuring multiple
devices, and leading to a loop when trying to apply configurations with
repeated overlays coming from the cloud side.

Change-type: minor
2024-02-27 14:52:41 -03:00
flowzone-app[bot]
d49b5e333f
v16.0.4 2024-02-27 15:06:27 +00:00
flowzone-app[bot]
716ca95e44
Merge pull request #2245 from balena-os/fix-rsync-deltas
Fix support for rsync deltas
2024-02-27 15:05:36 +00:00
Felipe Lalanne
24e222045a Fix support for rsync deltas
Rsync (v2) deltas have been broken since [Supervisor v14](460c3ba0aa). While considered legacy,
they are still used by a few customers with devices running OS < 2.47.1.
This should fix v2 delta support for those devices until we can
completely remove rsync deltas from the supervisor

Change-type: patch
2024-02-27 11:09:04 -03:00
flowzone-app[bot]
4daf6192b8
v16.0.3 2024-02-21 21:03:26 +00:00
flowzone-app[bot]
b1bb7934b0
Merge pull request #2243 from balena-os/config-txt-patch-default-dtparam-handling
Patch default dtparam handling in config.txt
2024-02-21 21:02:18 +00:00