tahoe-lafs/roadmap.txt

109 lines
3.8 KiB
Plaintext
Raw Normal View History

2006-11-30 17:16:19 -07:00
2006-12-03 19:55:05 -07:00
['*' means complete]
Connection Management:
*v1: foolscap, no relay, live=connected-to-introducer, broadcast updates, fully connected topology
*v2: configurable IP address -- http://allmydata.org/trac/tahoe/ticket/22
2007-05-01 20:33:22 -07:00
v3: live != connected-to-introducer, connect on demand
v4: decentralized introduction -- http://allmydata.org/trac/tahoe/ticket/68
v5: relay?
2006-11-30 17:16:19 -07:00
File Encoding:
2007-01-19 02:17:24 -07:00
*v1: single-segment, no merkle trees
2007-04-03 10:14:34 -07:00
*v2: multiple-segment (LFE)
2007-04-19 10:44:37 -07:00
*v3: merkle tree to verify each share
*v4: merkle tree to verify each segment
2007-06-29 10:43:27 -07:00
*v5: merkle tree on plaintext and crypttext: incremental validation
2007-06-06 09:49:42 -07:00
v6: only retrieve the minimal number of hashes instead of all of them
2006-11-30 17:16:19 -07:00
Share Encoding:
2007-01-19 02:17:24 -07:00
*v1: fake it (replication)
*v2: PyRS
2007-03-28 00:05:16 -07:00
*v2.5: ICodec-based codecs, but still using replication
*v3: C-based Reed-Solomon
2007-01-19 02:17:24 -07:00
URI:
*v1: really big
2007-06-29 10:43:27 -07:00
*v2: store URI Extension with shares
*v3: derive storage index from readkey
2007-06-29 10:43:27 -07:00
v4: perhaps derive more information from version and filesize, to remove
codec_name, codec_params, tail_codec_params, needed_shares,
total_shares, segment_size from the URI Extension
Upload Peer Selection:
2006-12-03 19:51:33 -07:00
*v1: permuted peer list, consistent hash
*v2: permute peers by verifierid and arrange around ring, intermixed with
shareids on the same range, each share goes to the
next-clockwise-available peer
v3: reliability/goodness-point counting?
v4: denver airport (chord)?
Download Peer Selection:
*v1: ask all peers
v2: permute peers and shareids as in upload, ask next-clockwise peers first
(the "A" list), if necessary ask the ones after them, etc.
v3: denver airport?
2006-11-30 17:16:19 -07:00
Directory/Filesystem Maintenance:
*v1: vdrive-based tree of MutableDirectoryNodes, persisted to vdrive's disk
2006-12-04 04:43:33 -07:00
no accounts
*v2: single-host dirnodes, one tree per user, plus one global mutable space
*v3: distributed storage for dirnodes
v4: maintain file manifest, delete on remove
v5: figure out accounts, users, quotas, snapshots, versioning, etc
2006-11-30 17:16:19 -07:00
2007-04-03 10:32:34 -07:00
Checker/Repairer:
2006-12-04 04:43:33 -07:00
*v1: none
v1.5: maintain file manifest
2006-11-30 17:16:19 -07:00
v2: centralized checker, repair agent
v3: nodes also check their own files
2007-04-03 10:32:34 -07:00
Storage:
2007-04-19 10:44:37 -07:00
*v1: no deletion, one directory per verifierid, no owners of shares,
2006-11-30 17:16:19 -07:00
leases never expire
*v2: multiple shares per verifierid [zooko]
2007-07-03 17:40:09 -07:00
*v3: disk space limits on storage servers -- ticket #34
2007-06-06 09:49:42 -07:00
v4: deletion
v5: leases expire, delete expired data on demand, multiple owners per share
2006-11-30 17:16:19 -07:00
UI:
2006-12-04 04:43:33 -07:00
*v1: readonly webish (nevow, URLs are filepaths)
2006-12-04 19:56:22 -07:00
*v2: read/write webish, mkdir, del (files)
2007-06-29 10:43:27 -07:00
*v2.5: del (directories)
*v3: CLI tool.
v4: FUSE (linux) -- http://allmydata.org/trac/tahoe/ticket/36
v5: WebDAV
2006-11-30 17:16:19 -07:00
2007-04-10 10:33:38 -07:00
Operations/Deployment/Doc/Free Software/Community:
- move this file into the wiki ?
2006-11-30 17:16:19 -07:00
back pocket ideas:
when nodes are unable to reach storage servers, make a note of it, inform
verifier/checker eventually. verifier/checker then puts server under
observation or otherwise looks for differences between their self-reported
availability and the experiences of others
2006-11-30 17:16:19 -07:00
store filetable URI in the first 10 peers that appear after your own nodeid
each entry has a sequence number, maybe a timestamp
on recovery, find the newest
multiple categories of leases:
1: committed leases -- we will not delete these in any case, but will instead
tell an uploader that we are full
1a: active leases
1b: in-progress leases (partially filled, not closed, pb connection is
currently open)
2: uncommitted leases -- we will delete these in order to make room for new
lease requests
2a: interrupted leases (partially filled, not closed, pb connection is
currently not open, but they might come back)
2b: expired leases
(I'm not sure about the precedence of these last two. Probably deleting
expired leases instead of deleting interrupted leases would be okay.)
2006-12-03 19:55:05 -07:00
2006-11-30 17:16:19 -07:00
big questions:
convergence?
peer list maintenance: lots of entries
2006-12-03 19:55:05 -07:00