Tech white paper: add diagrams, a few more minor edits in response to feedback from Tim Swanson, Mark Oldfield and Richard Brown.

2025-06-01 23:20:54 +00:00 · 2016-11-25 12:51:53 +01:00 · 2016-11-25 12:51:53 +01:00 · a268e05dbb
commit a268e05dbb
parent 79537898df
9 changed files with 74 additions and 26 deletions
--- a/docs/source/_static/corda-technical-whitepaper.pdf
+++ b/docs/source/_static/corda-technical-whitepaper.pdf
--- a/docs/source/whitepaper/Ref.bib
+++ b/docs/source/whitepaper/Ref.bib
@ -346,3 +346,21 @@ publisher = {USENIX Association},
 publisher = {USENIX Association},
 address = {Berkeley, CA, USA},
 }
+
+@misc{TheDAOHack,
+    author = {David Siegel},
+    howpublished = {\url{http://www.coindesk.com/understanding-dao-hack-journalists/}},
+    year = {2016}
+}
+
+@misc{BitcoinEnergy,
+    author = {Christopher Malmo},
+    howpublished = {\url{http://motherboard.vice.com/read/bitcoin-is-unsustainable}},
+    year = {2015}
+}
+
+@misc{Swanson,
+    author = {Tim Swanson},
+    howpublished = {\url{http://tabbforum.com/opinions/settlement-risks-involving-public-blockchains}},
+    year = {2016}
+}
--- a/docs/source/whitepaper/corda-introductory-whitepaper.tex
+++ b/docs/source/whitepaper/corda-introductory-whitepaper.tex
@ -24,11 +24,6 @@
 \begin{document}

 \maketitle 
-%\epigraphfontsize{\small\itshape}
-
-%\renewcommand{\abstractname}{An introduction}
-%\textit{Confidential: Pre-Publication Final Draft For R3 Distributed Ledger Group Steering Committee}
-

 \begin{abstract}

--- a/docs/source/whitepaper/corda-technical-whitepaper.tex
+++ b/docs/source/whitepaper/corda-technical-whitepaper.tex
@ -235,8 +235,10 @@ either reliably stored the message or processed it completely. Connections betwe
 needed: there is no assumption of constant connectivity. An ideal network would be entirely flat with high quality
 connectivity between all nodes, but Corda recognises that this is not always compatible with common network
 setups and thus the message routing component of a node can be separated from the rest and run outside the firewall.
-In this way nodes that do not have duplex connectivity can still take part in the network as first class citizens.
-Additionally a single node may have multiple advertised IP addresses.
+Being outside the firewall or in the firewall's `de-militarised zone' (DMZ) is required to ensure that nodes can
+connect to anyone on the network, and be connected to in turn. In this way a node can be split into multiple
+sub-services that do not have duplex connectivity yet can still take part in the network as first class citizens.
+Additionally, a single node may have multiple advertised IP addresses.

 The reference implementation provides this functionality using the Apache Artemis message broker, through which it
 obtains journalling, load balancing, flow control, high availability clustering, streaming of messages too large to fit
@ -331,6 +333,11 @@ a payment is not considered acceptable.
 Flows are named using reverse DNS notation and several are defined by the base protocol. Note that the framework is
 not required to implement the wire protocols, it is just a development aid.

+\begin{figure}[H]
+\includegraphics[scale=0.16, center]{trading-flow}
+\caption{A diagram showing the two party trading flow with notarisation}
+\end{figure}
+
 \subsection{Data visibility and dependency resolution}

 When a transaction is presented to a node as part of a flow it may need to be checked. Simply sending you
@ -352,9 +359,9 @@ involve many round-trips and thus take some time to fully complete. How quickly
 is thus difficult to characterise: it depends heavily on usage and distance between nodes. Whilst nodes could
 pre-push transactions in anticipation of them being fetched anyway, such optimisations are left for future work.

-A more important consequence is that in the absence of additional privacy measures it is difficult to reason
-about who may get to see transaction data. We can say it's definitely better than a system that uses global
-broadcast, but how much better is hard to characterise. This uncertainty is mitigated by several factors.
+Whilst this system is simpler than creating rigid data partitions and clearly provides better privacy than global
+broadcast, in the absence of additional privacy measures it is nonetheless still difficult to reason about who
+may get to see transaction data. This uncertainty is mitigated by several factors.

 \paragraph{Small-subgraph transactions.}Some uses of the ledger do not involve widely circulated asset states.
 For example, two institutions that wish to keep their view of a particular deal synchronised but who are making
@ -378,6 +385,11 @@ be an issue.

 \subsection{Transaction structure}

+States are the atomic unit of information in Corda. They are never altered: they are either current (`unspent') or
+consumed (`spent') and hence no longer valid.  Transactions consume zero or more states (inputs) and create zero or more
+new states (outputs). Because states cannot exist outside of the transactions that created them, any state whether consumed
+or not can be identified by the identifier of the creating transaction and the index of the state in the outputs list.
+
 Transactions consist of the following components:

 \begin{labeling}{Input references}
@ -415,6 +427,11 @@ the transaction will not be valid unless every key listed in every command has a
 structures are themselves opaque. In this way algorithmic agility is retained: new signature algorithms can be deployed
 without adjusting the code of the smart contracts themselves.

+\begin{figure}[H]
+\includegraphics[width=\textwidth]{cash}
+\caption{An example of a cash issuance transaction}
+\end{figure}
+
 \subsection{Composite keys}\label{sec:composite-keys}

 The term ``public key'' in the description above actually refers to a \emph{composite key}. Composite keys are trees in
@ -424,6 +441,10 @@ determined by walking the tree bottom-up, summing the weights of the keys that h
 against the threshold. By using weights and thresholds a variety of conditions can be encoded, including boolean
 formulas with AND and OR.

+\begin{figure}[H]
+\includegraphics[width=\textwidth]{composite-keys}
+\end{figure}
+
 Composite keys are useful in multiple scenarios. For example, assets can be placed under the control of a 2-of-2
 composite key where one leaf key is owned by a user, and the other by an independent risk analysis system. The
 risk analysis system refuses to sign if the transaction seems suspicious, like if too much value has been
@ -461,7 +482,7 @@ way to Bitcoin's \texttt{nLockTime} transaction field, which specifies a \emph{h
 Timestamps are checked and enforced by notary services. As the participants in a notary service will themselves
 not have precisely aligned clocks, whether a transaction is considered valid or not at the moment it is submitted
 to a notary may be unpredictable if submission occurs right on a boundary of the given window. However, from the
-perspective of all other observers the notaries signature is decisive: if the signature is present, the transaction
+perspective of all other observers the notary's signature is decisive: if the signature is present, the transaction
 is assumed to have occurred within that time.

 \paragraph{Reference clocks.}In order to allow for relatively tight time windows to be used when transactions are fully
@ -530,7 +551,7 @@ or by requiring the data to be signed by some trusted third party.

 Decentralised ledger systems often differ in their underlying political ideology as well as their technical
 choices. The Ethereum project originally promised ``unstoppable apps'' which would implement ``code as law''. After
-a prominent smart contract was hacked, an argument took place over whether what had occurred could be described
+a prominent smart contract was hacked\cite{TheDAOHack}, an argument took place over whether what had occurred could be described
 as a hack at all given the lack of any non-code specification of what the program was meant to do. The disagreement
 eventually led to a split in the community.

@ -642,6 +663,15 @@ straightforward. Typically an oracle will be presented with the Merkle branches
 contains the data, and the timestamp field, and nothing else. The resulting signature contains flag bits indicating which
 parts of the structure were presented for signing to avoid a single signature covering more than expected.

+\begin{figure}[H]
+\includegraphics[width=\textwidth]{tearoffs1}
+\end{figure}
+
+\begin{figure}[H]
+\includegraphics[width=\textwidth]{tearoffs2}
+\end{figure}
+
+
 % TODO: The flag bits are unused in the current reference implementation.

 There are a couple of reasons to take this more indirect approach. One is to keep a single signature checking
@ -784,6 +814,11 @@ can be rewritten. If a group of trading institutions wish to implement a checked
 can use an encumbrance (see \cref{sec:encumbrances}) to prevent an obligation being changed during certain hours,
 as determined by the clocks of the notaries (see \cref{sec:timestamps}).

+\begin{figure}[H]
+\includegraphics[width=\textwidth]{state-class-hierarchy}
+\caption{Class hierarchy diagram showing the relationships between different state types}
+\end{figure}
+
 \subsection{Market infrastructure}

 Trade is the lifeblood of the economy. A distributed ledger needs to provide a vibrant platform on which trading may
@ -850,7 +885,7 @@ a reward of newly issued bitcoins, an unrecognised block represents a loss and a
 a profit.

 Bitcoin uses proof-of-work because it has a design goal of allowing an unlimited number of identityless parties to join
-and leave the network at will, whilst simultaneously making it hard to execute sybil attacks (attacks in which one party
+and leave the network at will, whilst simultaneously making it hard to execute Sybil attacks (attacks in which one party
 creates multiple identities to gain undue influence over the network). This is an appropriate design to use for a peer to
 peer network formed of volunteers who can't/won't commit to any long term relationships up front, and in which identity
 verification is not done. Using proof-of-work then leads naturally to a requirement to quantise the timeline into chunks,
@ -867,14 +902,14 @@ its multiple unfortunate downsides:

 \begin{itemize}
 \item Energy consumption is excessively high for such a simple task, being comparable at the time of writing to the
-consumption of an entire town. At a time when humanity needs to use less energy rather than more this is ecologically
-undesirable.
+electricity consumption of an entire city\cite{BitcoinEnergy}. At a time when humanity needs to use less energy
+rather than more this is ecologically undesirable.
 \item High energy consumption forces concentration of mining power in regions with cheap or free electricity. This results
 in unpredictable geopolitical complexities that many users would rather do without.
 \item Identityless participants mean all transactions must be broadcast to all network nodes, as there's no reliable
 way to know who the miners are. This worsens privacy.
 \item The algorithm does not provide finality, only a probabilistic approximation, which is a poor fit for existing
-business and legal assumptions.
+business and legal assumptions.\cite{Swanson}
 \item It is theoretically possible for large numbers of miners or even all miners to drop out simultaneously without
 any protocol commitments being violated.
 \end{itemize}
@ -1365,7 +1400,7 @@ human consumption at all.

 Once a group is created other nodes can be invited to join it by using an invitation flow. Membership can be either
 read only or read/write. To add a node as read-only, the certificate i.e. pubkey alone is sent. To add a node as
-read/write the cert and private key are sent. A future elaboration on the design may support giving each member a
+read/write the certificate and private key are sent. A future elaboration on the design may support giving each member a
 separate private key which would allow tracing who added transactions to a group, but this is left for future work.
 In either case the node records in its local database which other nodes it has invited to the group once they accept
 the invitation.
@ -1506,8 +1541,8 @@ opcode-at-a-time accounting turns out to be insufficient.
 A further complexity comes from the need to constrain memory usage. The sandbox imposes a quota on bytes \emph{allocated}
 rather than bytes \emph{retained} in order to simplify the implementation. This strategy is unnecessarily harsh on smart
 contracts that churn large quantities of garbage yet have relatively small peak heap sizes and, again, it may be that
-in practice a more sophisticated strategy that integrates with the GC is required in order to set quotas to a usefully
-generic level.
+in practice a more sophisticated strategy that integrates with the garbage collector is required in order to set
+quotas to a usefully generic level.

 Control over \texttt{Object.hashCode()} takes the form of new JNI calls that allow the JVM's thread local random number
 generator to be reseeded before execution begins. The seed is derived from the hash of the transaction being verified.
--- a/docs/source/whitepaper/images/composite-keys.png
+++ b/docs/source/whitepaper/images/composite-keys.png
--- a/docs/source/whitepaper/images/state-class-hierarchy.png
+++ b/docs/source/whitepaper/images/state-class-hierarchy.png
--- a/docs/source/whitepaper/images/tearoffs1.png
+++ b/docs/source/whitepaper/images/tearoffs1.png
--- a/docs/source/whitepaper/images/tearoffs2.png
+++ b/docs/source/whitepaper/images/tearoffs2.png
--- a/docs/source/whitepaper/images/trading-flow.png
+++ b/docs/source/whitepaper/images/trading-flow.png