tahoe-lafs/docs/historical/peer-selection.txt

When a file is uploaded, the encoded shares are sent to other peers. But to
which ones? Likewise, when we want to download a file, which peers should we
ask for shares? The "peer selection" algorithm is used to make these choices.

During the first tahoe meeting, (actualy on the drive back home), we designed
the now-abandoned "tahoe1" algorithm, which involved a "cabal" for each file,
where the peers involved would check up on each other to make sure the data
was still available. The big limitation was the expense of tracking which
nodes were parts of which cabals.

When we release 0.2.0, we used the "tahoe3" algorithm (see
peer-selection-tahoe3.txt), but in v0.6 (ticket #132) we switched back to
"tahoe2" (see the peer-selection-tahoe2.txt, and the PEER SELECTION section
of docs/architecture.txt), which uses a permuted peerid list and packs the
shares into the first 10 or so members of this list. (It is named "tahoe2"
because it was designed before "tahoe3" was.)

In the future, we might move to an algorithm known as "denver airport", which
uses Chord-like routing to minimize the number of active connections.

Different peer selection algorithms result in different properties:
 * how well do we handle nodes leaving or joining the mesh (differences in the
   peer list)?
 * how many connections do we need to keep open?
 * how many nodes must we speak to when uploading a file?
 * if a file is unrecoverable, how long will it take for us to discover this
   fact?
 * how expensive is a file-checking operation?
 * how well can we accomodate changes to encoding parameters?
move historical docs from wiki pages into the source tree, clearly marked as historical 2008-06-03 01:38:32 +00:00			`When a file is uploaded, the encoded shares are sent to other peers. But to`
			`which ones? Likewise, when we want to download a file, which peers should we`
			`ask for shares? The "peer selection" algorithm is used to make these choices.`

			`During the first tahoe meeting, (actualy on the drive back home), we designed`
			`the now-abandoned "tahoe1" algorithm, which involved a "cabal" for each file,`
			`where the peers involved would check up on each other to make sure the data`
			`was still available. The big limitation was the expense of tracking which`
			`nodes were parts of which cabals.`

docs/historical: move 'tahoe2' from wiki into source tree 2008-06-03 01:43:31 +00:00			`When we release 0.2.0, we used the "tahoe3" algorithm (see`
			`peer-selection-tahoe3.txt), but in v0.6 (ticket #132) we switched back to`
			`"tahoe2" (see the peer-selection-tahoe2.txt, and the PEER SELECTION section`
			`of docs/architecture.txt), which uses a permuted peerid list and packs the`
			`shares into the first 10 or so members of this list. (It is named "tahoe2"`
			`because it was designed before "tahoe3" was.)`
move historical docs from wiki pages into the source tree, clearly marked as historical 2008-06-03 01:38:32 +00:00
			`In the future, we might move to an algorithm known as "denver airport", which`
			`uses Chord-like routing to minimize the number of active connections.`

			`Different peer selection algorithms result in different properties:`
			`* how well do we handle nodes leaving or joining the mesh (differences in the`
			`peer list)?`
			`* how many connections do we need to keep open?`
			`* how many nodes must we speak to when uploading a file?`
			`* if a file is unrecoverable, how long will it take for us to discover this`
			`fact?`
			`* how expensive is a file-checking operation?`
			`* how well can we accomodate changes to encoding parameters?`