corda

mirror of https://github.com/corda/corda.git synced 2025-03-29 07:06:14 +00:00

Author	SHA1	Message	Date
Dan Newton	f6b5737277	ENT-5196 handle errors during flow initialisation (#6378 ) ## Summary This change deals with multiple issues: * Errors that occur during flow initialisation. * Errors that occur when handling the outcome of an existing flow error. * Failures to rollback and close a database transaction when an error occurs in `TransitionExecutorImpl`. * Removal of create and commit transaction actions around retrying a flow. ## Errors that occur during flow initialisation Flow initialisation has been moved into the try/catch that exists inside `FlowStateMachineImpl.run`. This means if an error is thrown all the way out of `initialiseFlow` (which should rarely happen) it will be caught and move into a flow's standard error handling path. The flow should then properly terminate. `Event.Error` was changed to make the choice to rollback be optional. Errors during flow initialisation cause the flow to not have a open database transaction. Therefore there is no need to rollback. ## Errors that occur when handling the outcome of an existing flow error When an error occurs a flow goes to the flow hospital and is given an outcome event to address the original error. If the transition that was processing the error outcome event (`StartErrorPropagation` and `RetryFlowFromSafePoint`) has an error then the flow aborts and nothing happens. This means that the flow is left in a runnable state. To resolve this, we now retry the original error outcome event whenever another error occurs doing so. This is done by adding a new staff member that looks for `ErrorStateTransitionException` thrown in the error code path of `TransitionExecutorImpl`. It then takes the last outcome for that flow and schedules it to run again. This scheduling runs with a backoff. This means that a flow will continually retry the original error outcome event until it completes it successfully. ## Failures to rollback and close a database transaction when an error occurs in `TransitionExecutorImpl` Rolling back and closing the database transaction inside of `TransitionExecutorImpl` is now done inside individual try/catch blocks as this should not prevent the flow from continuing. ## Removal of create and commit transaction actions around retrying a flow The database commit that occurs after retrying a flow can fail which required some custom code just for that event to prevent inconsistent behaviour. The transaction was only needed for reading checkpoints from the database, therefore the transaction was moved into `retryFlowFromSafePoint` instead and the commit removed. If we need to commit data inside of `retryFlowFromSafePoint` in the future, a commit should be added directly to `retryFlowFromSafePoint`. The commit should occur before the flow is started on a new fiber.	2020-06-30 11:50:42 +01:00
Tamas Veingartner	18cd24c2b4	NOTICK Ignore pagination large volume test as its completion is environment dependent and might cause issues in slower environments (#6420 )	2020-06-30 10:12:31 +01:00
Denis Rekalov	e75dc9e415	Merge branch 'release/os/4.5' into denis/CORDA-3856-amqp-header-os-4.6	2020-06-29 13:13:48 +01:00
Denis Rekalov	7686ca2944	Merge branch 'release/os/4.4' into denis/CORDA-3856-amqp-header-os-4.5	2020-06-29 12:35:05 +01:00
pnemeth	6fc9f2eacf	EG-2854 Failing Test: net.corda.node.internal.NodeStartupCliTest.--nodeconf using absolute path will not be changed (#6393 ) * EG-2854 Failing Test: net.corda.node.internal.NodeStartupCliTest.--nodeconf using absolute path will not be changed * fix bug EG-2854 * PR comment	2020-06-29 09:40:12 +01:00
Denis Rekalov	3f03de6fbd	CORDA-3856: Add Artemis plugin for validating AMQP message header and type (#6407 )	2020-06-29 09:23:29 +01:00
Dan Newton	516a4bf3a1	Merge pull request #6403 from corda/NOTICK-rfowler-merge-OS4.5-OS4.6-20200626 NOTICK rfowler merge os4.5 os4.6 20200626	2020-06-29 08:40:48 +01:00
Dan Newton	796e92b512	CORDA-3720 Extract locking of InnerState out of SMM (#6289 ) The state machines state is held within `InnerState` which lived inside the SMM. `InnerState` has been extracted out of the SMM to allow the SMM to be refactored in the future. Smaller classes can now be made that focus on a single goal as the locking of the state can be accessed from external classes. To achieve this, pass the `InnerState` into the class and request a lock if needed. The locking of `InnerState` has been made a property of the `InnerState` itself. It has a `lock` field that allows locks to be taken out when needed. An inline `withLock` function has been added to tidy up the code and not harm performance. Some classes have been made internal to prevent invalid usage of purely node internal classes. As part of this change, flow timeouts have been extracted out into `FlowTimeoutScheduler`.	2020-06-26 12:48:15 +01:00
Ryan Fowler	31a1ee8803	Merge branch 'release/os/4.5' into NOTICK-rfowler-merge-OS4.5-OS4.6-20200626	2020-06-26 12:14:56 +01:00
Ryan Fowler	881d8d687c	Merge branch 'release/os/4.4' into NOTICK-rfowler-merge-OS4.4-OS4.5-20200626	2020-06-26 09:52:16 +01:00
Ryan Fowler	ef582900cf	NOTICK Expand the regex to match what we already do in ENT (#6400 )	2020-06-25 17:26:55 +01:00
Kyriakos Tharrouniatis	6ec2910f15	NOTICK Revert node_flow_exceptions type length back to 256 (#6395 )	2020-06-25 09:40:58 +01:00
Chris Rankin	6485a025c7	ENT-5430: Increase test coverage when serializing Optional fields. (#6387 )	2020-06-22 16:51:40 +01:00
Waldemar Zurowski	1a4efbac7f	Merge branch 'release/os/4.5' into merge-45-to-46	2020-06-19 19:48:08 +01:00
pnemeth	d6cab0e131	EG-1557 - Configuration data from "include" section ignored while command line contains the path to config file without leading ./ (#6354 ) Configuration data from "include" section ignored while command line contains the path to config file without leading ./	2020-06-19 10:32:55 +01:00
alicer3	e77f7a7546	center console message for registration (#6191 )	2020-06-19 09:49:07 +01:00
LankyDan	56d0bbc036	CORDA-3841 Check `isAnyCheckpointPersisted` in `startFlowInternal` (#6351 ) Only hit the database if `StateMachineState.isAnyCheckpointPersisted` returns true. Otherwise, there will be no checkpoint to retrieve from the database anyway. This can prevent errors due to a transient loss of connection to the database. Update tests after merging to 4.6	2020-06-18 16:15:15 +01:00
LankyDan	e8b17ff7b9	Merge branch 'release/os/4.5' into dan/os-4.5-to-4.6-merge-2020-06-18 # Conflicts: # node/src/main/kotlin/net/corda/node/services/persistence/DBCheckpointStorage.kt # node/src/main/kotlin/net/corda/node/services/statemachine/SingleThreadedStateMachineManager.kt	2020-06-18 15:50:46 +01:00
Christian Sailer	4091fdc8b1	ENT-5264 Synchronise schema on the command line (#6353 ) * Decouple DatabaseConfig and CordaPersistence etc. * Add schema sync to schema migration + test * Add command line parameters for synchronising schema	2020-06-18 11:38:46 +01:00
Chris Rankin	1ef62870bb	Merge commit 'fe617818895edab334d80c5e8de2b38f39e67af6' into chrisr3-os44-merge	2020-06-17 18:54:54 +01:00
Chris Rankin	d0c0a1d9ba	ENT-5430: Fix deserialisation of commands containing generic types. (#6359 )	2020-06-17 17:28:26 +01:00
James Higgs	24b0240d82	EG-2654 - Ensure stack traces are printed to the logs in error reporting (#6345 ) * EG-2654 Ensure stack trace is printed to the logs in error reporting * EG-2654 - Add a test case for exception logging	2020-06-17 14:32:12 +01:00
Dan Newton	7ab6a8f600	CORDA-3841 Check `isAnyCheckpointPersisted` in `startFlowInternal` (#6351 ) Only hit the database if `StateMachineState.isAnyCheckpointPersisted` returns true. Otherwise, there will be no checkpoint to retrieve from the database anyway. This can prevent errors due to a transient loss of connection to the database.	2020-06-16 09:22:26 +01:00
Tamas Veingartner	26d4bfb89f	CORDA-3578 add corda prefix to conf file names as original issue was … (#6322 ) * CORDA-3578 add corda prefix to conf file names as original issue was having non-corda reference.conf files on classpath causes DriverDSLImp failure it is better to have this naming convention and avoid further conflicts of conf files. * fixed weird duplicates * revert renaming changes for web-reference.conf and loadtest-reference.conf	2020-06-16 09:15:51 +01:00
Christian Sailer	836dd559e8	ENT-5316 split schema migration * ENT-5273 Split schema migration into separate core and app schema migration, with separate command line flags	2020-06-15 15:52:31 +01:00
Christian Sailer	2c26f4db5d	Merge remote-tracking branch 'origin/release/os/4.6' into christians/update-fb-2020-06-12	2020-06-15 09:02:55 +01:00
Christian Sailer	f1126226a8	Fix config tests (remove tx isolation level from config files)	2020-06-12 20:54:36 +01:00
Christian Sailer	35c661b9f6	Merge pull request #6341 from corda/chrisr3-45-merge NOTICK: Merge from OS 4.5 up to ef00fa1.	2020-06-12 16:42:51 +01:00
Stefano Franz	64f0011a62	Make Checkpoint classes data classes (#6342 ) * Make Checkpoint classes data classes * tidy up null-checks for array equality	2020-06-12 16:35:32 +01:00
Christian Sailer	d00dc42b18	Merge remote-tracking branch 'origin/release/os/4.6' into christians/update-fb-2020-06-12	2020-06-12 14:51:43 +01:00
Chris Rankin	3f67e314c0	Merge commit 'ef00fa1388db37e155ab8cfed3763c14801f8aa9' into chrisr3-45-merge	2020-06-12 13:14:44 +01:00
James Higgs	6e349f298e	NOTICK - Ignore a potentially dodgy test (#6336 )	2020-06-11 16:47:48 +01:00
James Higgs	ab023d0b07	Merge branch 'release/os/4.5' into jamesh/os-4.5-4.6-merge-11062020	2020-06-11 09:40:39 +01:00
James Higgs	58af87c988	EG-2225 - Create log directory in correct place with verbose flag set (#6321 ) * Ensure logs directory is written to correct location * Remove a superfluous set of log path property * Add a unit test to catch bad log paths * Address detekt issues	2020-06-10 10:46:57 +01:00
James Higgs	8b7275eb97	EG-2564 - Move printed error to logger (#6323 )	2020-06-10 10:45:50 +01:00
Schife	fb184839f4	Merge branch 'release/os/4.5' of https://github.com/corda/corda into razvan-os-4.5-to-4.6-merge	2020-06-05 07:55:48 +01:00
nikinagy	caf5482244	ENT-4064, ENT-4608 - checking for unsigned cordapps in prod mode and checking the PV (#6291 ) * checking for unsigned cordapps in prod mode and shutting down if it the contracts are not signed * inheriting from CordaRuntimeException instead of Exception * had the same tests twice, removed the shutdown for minimumplatformversion, modified some of the tests * shut down when we get invalid platformversion and modified the test according to this * making the message more accurate	2020-06-04 16:24:49 +01:00
Denis Rekalov	45614cf29e	Merge pull request #6266 from corda/denis/CORDA-3805-custom-migration-scripts CORDA-3805: cut dependency from PersistentIdentityService for custom migration scripts	2020-06-04 14:20:26 +01:00
Dan Newton	f0d2c9fe71	NOTICK Correct `StatemachineGeneralErrorHandlingtest` (#6306 )	2020-06-04 08:04:41 +01:00
Christian Sailer	98f62f60f1	Merge branch 'release/os/4.6' into christians/update-feat-20200502	2020-06-02 18:15:32 +01:00
Christian Sailer	2cb89897b4	Merge branch 'release/os/4.6' into christians/update-feat-20200502	2020-06-02 17:55:14 +01:00
Kyriakos Tharrouniatis	b8b462f68e	NOTICK Fix schema validation error for flow parameters (#6304 ) Adding 'corda-blob' type, fixed 'Schema-validation: wrong column type'	2020-06-02 17:52:50 +01:00
williamvigorr3	0554c98d18	NOTICK Restrict to shell commands (#6303 ) Remove shell command for pausing flows	2020-06-02 16:04:10 +01:00
James Higgs	04ddb267fd	[EG-2225] Prevent extra directories being created when relative base directories are specified (#6282 ) Don't create an extra directory if a relative base path specified	2020-06-02 15:26:40 +01:00
Tamas Veingartner	9c4a76d367	CORDA-3176 Inefficient query generated on vault queries with custom p… (#6241 ) * CORDA-3176 Inefficient query generated on vault queries with custom paging a check added to avoid self joins local postgres db tests were executed on large volume test data (50k states) to ensure that the DB optimizes the suspicious query and so it does not cuase performance issues. A test added to check that a pagination query on large volume of sorted data is completed in a reasonable time * foreach detekt fix	2020-06-02 10:33:59 +01:00
Christian Sailer	0ed6307577	Merge remote-tracking branch 'origin/release/os/4.6' into christians/update-feat-20200502	2020-06-02 09:03:11 +01:00
Denis Rekalov	98af7f10f9	CORDA-3805: Make custom migration scripts independent on the latest PersistentIdentityService schema	2020-06-01 16:22:21 +01:00
Rick Parker	9f2bd1dcae	Merge pull request #6295 from corda/feature/checkpoint_table_improvements CORDA-3432 Feature/checkpoint table improvements	2020-06-01 11:31:13 +01:00
Kyriakos Tharrouniatis	4507b55857	CORDA-3725 Fix SQL deadlocks coming from soft locking states (#6287 ) Adding to the query -explicitly- the flow's locked states resolved the SQL Deadlocks in SQL server. The SQL Deadlocks would come up from `softLockRelease` when only the `lockId` was passed in as argument. In that case the query optimizer would use `lock_id_idx(lock_id, state_status)` to search and update entries in `VAULT_STATES` table. However, all rest of the queries would follow the opposite direction meaning they would use PK's `index(output_index, transaction_id)` but they would also update the `lock_id` column and therefore the `lock_id_idx` as well, because `lock_id` is a part of it. That was causing a circular locking among the different transactions (SQL processes) within the database. To resolve this, whenever a flow attempts to reserve soft locks using their flow id (remember the flow id is always the flow id for the very first flow in the flow stack), we then save these states to the fiber. Then, upon releasing soft locks the fiber passes that set to the release soft locks query. That way the database query optimizer will use the primary key index of VAULT_STATES table, instead of lock_id_idx in order to search rows to update. That way the query will be aligned with the rest of the queries that are following that route as well (i.e. making use of the primary key), and therefore its locking order of resources within the database will be aligned with the rest queries' locking orders (solving SQL deadlocks). * Fixed SQL deadlocks caused from softLockRelease, by saving locked states per fiber; NodeVaultService.softLockRelease query then uses VAULT_STATES PK index instead of lock_id_idx Speed up SQL server by breaking down queries with > 16 elements in their IN clause, into sub queries with 16 elements max in their IN clauses * Allow softLockedStates to remove states * Add to softLockedStates only states soft locked under our flow id * Fix softLockRelease not to take into account flowStateMachineImpl.softLockedStates when using lockId != ourFlowId * Moved CriteriaBuilder.executeUpdate at the bottom of the file	2020-05-29 12:35:05 +01:00
Denis Rekalov	499c09e77b	Merge pull request #6285 from corda/denis/CORDA-3818-sync-identity-service CORDA-3818: Synchronize OS implementation of PublicKeyToOwningIdentityCache with CE	2020-05-28 14:21:04 +01:00

... 3 4 5 6 7 ...

3039 Commits