tahoe-lafs

mirror of https://github.com/tahoe-lafs/tahoe-lafs.git synced 2024-12-28 00:38:52 +00:00

Author	SHA1	Message	Date
david-sarah	973f0afdd3	Change direct accesses to an_uri.storage_index to calls to .get_storage_index() (fixes #948 )	2010-02-21 18:45:04 -08:00
Zooko O'Whielacronx	3e4342ecb3	immutable: downloader accepts notifications of buckets even if those notifications arrive after he has begun downloading shares. This can be useful if one of the ones that he has already begun downloading fails. See #287 for discussion. This fixes part of #287 which part was a regression caused by #928, namely this fixes fail-over in case a share is corrupted (or the server returns an error or disconnects). This does not fix the related issue mentioned in #287 if a server hangs and doesn't reply to requests for blocks.	2010-01-31 22:16:10 -08:00
david-sarah	37a242e01a	Improvements to test_hung_server, and fix for status updates in download.py	2010-01-29 22:43:03 -08:00
Zooko O'Whielacronx	d62428c1e6	immutable: fix bug in tests, change line-endings to unix style, add comment	2010-01-29 10:42:37 -08:00
david-sarah	baa11a0ad4	New tests for #928	2010-01-29 04:38:45 -08:00
Zooko O'Whielacronx	2bd9dfa5bd	immutable: download from the first servers which provide at least K buckets instead of waiting for all servers to reply This should put an end to the phenomenon I've been seeing that a single hung server can cause all downloads on a grid to hang. Also it should speed up all downloads by (a) not-waiting for responses to queries that it doesn't need, and (b) downloading shares from the servers which answered the initial query the fastest. Also, do not count how many buckets you've gotten when deciding whether the download has enough shares or not -- instead count how many buckets to unique shares that you've gotten. This appears to improve a slightly weird behavior in the current download code in which receiving >= K different buckets all to the same sharenumber would make it think it had enough to download the file when in fact it hadn't. This patch needs tests before it is actually ready for trunk.	2010-01-27 15:34:17 -08:00
david-sarah	6057bc02cc	Prevent mutable objects from being retrieved from an immutable directory, and associated forward-compatibility improvements.	2010-01-26 22:44:30 -08:00
Brian Warner	731d15e56f	hush pyflakes-0.4.0 warnings: remove trivial unused variables. For #900 .	2010-01-14 14:15:29 -08:00
Brian Warner	d888bf3377	Clean up log.err calls, for one of the issues in #889 . allmydata.util.log.err() either takes a Failure as the first positional argument, or takes no positional arguments and must be invoked in an exception handler. Fixed its signature to match both foolscap.logging.log.err and twisted.python.log.err . Included a brief unit test.	2010-01-11 17:33:43 -08:00
Brian Warner	bacb6fe5aa	tidy up DeadReferenceError handling, ignore them in add_lease calls Stop checking separately for ConnectionDone/ConnectionLost, since those have been folded into DeadReferenceError since foolscap-0.3.1 . Write rrefutil.trap_deadref() in terms of rrefutil.trap_and_discard() to improve code coverage.	2010-01-11 16:07:23 -08:00
Brian Warner	db19b62702	immutable/checker.py: oops, forgot some imports. Also hush pyflakes.	2009-12-29 15:39:09 -08:00
Brian Warner	794e32738f	checker: don't let failures in add-lease affect checker results. Closes #875 . Mutable servermap updates and the immutable checker, when run with add_lease=True, send both the do-you-have-block and add-lease commands in parallel, to avoid an extra round trip time. Many older servers have problems with add-lease and raise various exceptions, which don't generally matter. The client-side code was catching+ignoring some of them, but unrecognized exceptions were passed through to the DYHB code, concealing the DYHB results from the checker, making it think the server had no shares. The fix is to separate the code paths. Both commands are sent at the same time, but the errback path from add-lease is handled separately. Known exceptions are ignored, the others (both unknown-remote and all-local) are logged (log.WEIRD, which will trigger an Incident), but neither will affect the DYHB results. The add-lease message is sent first, and we know that the server handles them synchronously. So when the checker is done, we can be sure that all the add-lease messages have been retired. This makes life easier for unit tests.	2009-12-29 15:01:08 -08:00
Brian Warner	96834da0a2	Simplify immutable download API: use just filenode.read(consumer, offset, size) * remove Downloader.download_to_data/download_to_filename/download_to_filehandle * remove download.Data/FileName/FileHandle targets * remove filenode.download/download_to_data/download_to_filename methods * leave Downloader.download (the whole Downloader will go away eventually) * add util.consumer.MemoryConsumer/download_to_data, for convenience (this is mostly used by unit tests, but it gets used by enough non-test code to warrant putting it in allmydata.util) * update tests * removes about 180 lines of code. Yay negative code days! Overall plan is to rewrite immutable/download.py and leave filenode.read() as the sole read-side API.	2009-12-01 17:53:30 -05:00
david-sarah	c4d38ad4c5	make status of finished operations consistently "Finished"	2009-11-20 22:15:43 -08:00
Brian Warner	0cf320c2ab	interface name cleanups: IFileNode, IImmutableFileNode, IMutableFileNode The proper hierarchy is: IFilesystemNode +IFileNode ++IMutableFileNode ++IImmutableFileNode +IDirectoryNode Also expand test_client.py (NodeMaker) to hit all IFilesystemNode types.	2009-11-19 23:52:55 -08:00
Brian Warner	d2badbea78	class name cleanups: s/FileNode/ImmutableFileNode/ also fix test/bench_dirnode.py for recent dirnode changes	2009-11-19 23:22:39 -08:00
Brian Warner	e046744f40	make get_size/get_current_size consistent for all IFilesystemNode classes * stop caching most_recent_size in dirnode, rely upon backing filenode for it * start caching most_recent_size in MutableFileNode * return None when you don't know, not "?" * only render None as "?" in the web "more info" page * add get_size/get_current_size to UnknownNode	2009-11-18 11:16:24 -08:00
Brian Warner	131e05b155	clean up uri-vs-cap terminology, emphasize cap instances instead of URI strings * "cap" means a python instance which encapsulates a filecap/dircap (uri.py) * "uri" means a string with a "URI:" prefix * FileNode instances are created with (and retain) a cap instance, and generate uri strings on demand * .get_cap/get_readcap/get_verifycap/get_repaircap return cap instances * .get_uri/get_readonly_uri return uri strings * add filenode.download_to_filename() for control.py, should find a better way * use MutableFileNode.init_from_cap, not .init_from_uri * directory URI instances: use get_filenode_cap, not get_filenode_uri * update/cleanup bench_dirnode.py to match, add Makefile target to run it	2009-11-11 14:26:19 -08:00
Brian Warner	f4aa418086	Verifier: check the full crypttext-hash tree on each share. Removed .todos from the last few test_repairer tests that were waiting on this.	2009-10-05 15:18:49 -07:00
Brian Warner	504c767d03	Verifier: check the full block-hash-tree on each share Removed the .todo from two test_repairer tests that check this. The only remaining .todos are on the three crypttext-hash-tree tests.	2009-10-05 14:48:44 -07:00
Brian Warner	e8f56af5a7	Verifier: check the full share-hash chain on each share Removed the .todo from two test_repairer tests that check this.	2009-10-05 14:34:43 -07:00
Brian Warner	be95129833	immutable/checker.py: rearrange code a little bit, make it easier to follow	2009-10-05 13:02:52 -07:00
Brian Warner	19d336513c	immutable/download.py: wrap to 80cols, no functional changes	2009-10-05 12:25:42 -07:00
Brian Warner	5283d4c19e	de-Service-ify Helper, pass in storage_broker and secret_holder directly. This makes it more obvious that the Helper currently generates leases with the Helper's own secrets, rather than getting values from the client, which is arguably a bug that will likely be resolved with the Accounting project.	2009-08-15 13:17:37 -07:00
Brian Warner	4a4a4f9520	immutable.Downloader: pass StorageBroker to constructor, stop being a Service child of the client, access with client.downloader instead of client.getServiceNamed("downloader"). The single "Downloader" instance is scheduled for demolition anyways, to be replaced by individual filenode.download calls.	2009-08-15 12:25:43 -07:00
Brian Warner	0d5dc51617	Overhaul IFilesystemNode handling, to simplify tests and use POLA internally. * stop using IURI as an adapter * pass cap strings around instead of URI instances * move filenode/dirnode creation duties from Client to new NodeMaker class * move other Client duties to KeyGenerator, SecretHolder, History classes * stop passing Client reference to dirnode/filenode constructors - pass less-powerful references instead, like StorageBroker or Uploader * always create DirectoryNodes by wrapping a filenode (mutable for now) * remove some specialized mock classes from unit tests Detailed list of changes (done one at a time, then merged together) always pass a string to create_node_from_uri(), not an IURI instance always pass a string to IFilesystemNode constructors, not an IURI instance stop using IURI() as an adapter, switch on cap prefix in create_node_from_uri() client.py: move SecretHolder code out to a separate class test_web.py: hush pyflakes client.py: move NodeMaker functionality out into a separate object LiteralFileNode: stop storing a Client reference immutable Checker: remove Client reference, it only needs a SecretHolder immutable Upload: remove Client reference, leave SecretHolder and StorageBroker immutable Repairer: replace Client reference with StorageBroker and SecretHolder immutable FileNode: remove Client reference mutable.Publish: stop passing Client mutable.ServermapUpdater: get StorageBroker in constructor, not by peeking into Client reference MutableChecker: reference StorageBroker and History directly, not through Client mutable.FileNode: removed unused indirection to checker classes mutable.FileNode: remove Client reference client.py: move RSA key generation into a separate class, so it can be passed to the nodemaker move create_mutable_file() into NodeMaker test_dirnode.py: stop using FakeClient mockups, use NoNetworkGrid instead. This simplifies the code, but takes longer to run (17s instead of 6s). 
This should come down later when other cleanups make it possible to use simpler (non-RSA) fake mutable files for dirnode tests. test_mutable.py: clean up basedir names client.py: move create_empty_dirnode() into NodeMaker dirnode.py: get rid of DirectoryNode.create remove DirectoryNode.init_from_uri, refactor NodeMaker for customization, simplify test_web's mock Client to match stop passing Client to DirectoryNode, make DirectoryNode.create_with_mutablefile the normal DirectoryNode constructor, start removing client from NodeMaker remove Client from NodeMaker move helper status into History, pass History to web.Status instead of Client test_mutable.py: fix minor typo	2009-08-15 04:28:46 -07:00
Brian Warner	1192b61dfe	upload: fix #758 recursion-loop in peer-selection when servers report errors. The bug was in the code that handles a third-or-later pass, and was previously untested.	2009-07-17 00:07:09 -05:00
Zooko O'Whielacronx	22d390acbb	immutable: base32-encode the keys to generate cache filenames that will work on all platforms	2009-07-08 08:26:33 -07:00
Zooko O'Whielacronx	c0d1e7deae	directories: make initialization of the download cache lazy. If you open up a directory containing thousands of files, it currently computes the cache filename and checks for the cache file on disk immediately for each immutable file in that directory. With this patch, it delays those steps until you try to do something with an immutable file that could use the cache.	2009-07-07 17:40:40 -07:00
Brian Warner	8fca155a66	repairer.py: wrap to 80cols. No code changes.	2009-06-30 17:00:47 -07:00
Brian Warner	bd6ecc9f44	Split out NoSharesError, stop adding attributes to NotEnoughSharesError, change humanize_failure to include the original exception string, update tests, behave better if humanize_failure fails.	2009-06-24 19:17:07 -07:00
Brian Warner	8df15e9f30	big rework of introducer client: change local API, split division of responsibilities better, remove old-code testing, improve error logging	2009-06-22 19:10:47 -07:00
Brian Warner	711c09bc5d	clean up storage_broker interface: should fix #732	2009-06-21 16:51:19 -07:00
Brian Warner	a6caae9b5d	immutable/download: instrument do-you-have-block responses to investigate #732	2009-06-20 21:12:09 -07:00
Brian Warner	b1290633b8	more storage_broker refactoring: downloader gets a broker instead of a client, use Client.get_storage_broker() accessor instead of direct attribute access.	2009-06-01 19:25:11 -07:00
Brian Warner	4177a3616b	remove plaintext-hashing code from the helper interface, to close #722 and deny the Helper the ability to mount a partial-information-guessing attack. This will probably break compatibility between new clients and very old (pre-1.0) helpers.	2009-06-01 15:49:16 -07:00
Brian Warner	c516361fd2	start to factor server-connection-management into a distinct 'StorageServerFarmBroker' object, separate from the client and the introducer. This is the starting point for #467 : static server selection	2009-06-01 14:06:04 -07:00
Brian Warner	de83526acd	immutable/encode.py: tolerate immediate _remove_shareholder by copying the landlord list before iterating over it. This can probably only happen in unit tests, but cleaning it up makes certain test failures easier to analyze.	2009-05-22 11:44:24 -07:00
Brian Warner	1863aee0aa	switch to using RemoteException instead of 'wrapped' RemoteReferences. Should fix #653 , the rref-EQ problem	2009-05-21 17:46:32 -07:00
Brian Warner	c9803d5217	switch all foolscap imports to use foolscap.api or foolscap.logging	2009-05-21 17:38:23 -07:00
Brian Warner	79437baade	immutable WriteBucketProxy: use pipeline to speed up uploads by overlapping roundtrips, for #392	2009-05-18 16:44:22 -07:00
Brian Warner	67571eb033	add more information to NotEnoughSharesError, split out new exceptions for no-servers and no-source-of-ueb-hash	2009-03-03 19:37:15 -07:00
Brian Warner	400c04c19a	immutable checker add-lease: catch remote IndexError here too	2009-02-27 01:17:24 -07:00
Brian Warner	f95e9b5964	immutable/checker.py: trap ShareVersionIncompatible too. Also, use f.check instead of examining the value returned by f.trap, because the latter appears to squash exception types down into their base classes (i.e. since ShareVersionIncompatible is a subclass of LayoutInvalid, f.trap(Failure(ShareVersionIncompatible)) == LayoutInvalid). All this resulted in 'incompatible' shares being misclassified as 'corrupt'.	2009-02-23 22:14:05 -07:00
Brian Warner	9af9d8ae35	immutable/layout.py: wrap to 80 cols, no functional changes	2009-02-23 18:58:37 -07:00
Brian Warner	ef53da2b12	break storage.py into smaller pieces in storage/*.py . No behavioral changes.	2009-02-18 14:46:55 -07:00
Brian Warner	a0c5f92cbd	immutable/layout: minor change to repr name	2009-02-18 14:46:48 -07:00
Brian Warner	bce4a5385b	add --add-lease to 'tahoe check', 'tahoe deep-check', and webapi.	2009-02-17 19:32:43 -07:00
Brian Warner	fde2289e7b	CLI #590 : convert 'tahoe deep-check' to streaming form, improve display, add tests	2009-02-17 17:15:11 -07:00
Zooko O'Whielacronx	d7dbd6675e	immutable repairer: fix DownUpConnector so that it satisfies short reads that were requested after the last write and before the close. This is probably the cause of the very rare "loss of progress" bug. This is tested by unit tests. A recent patch changed this to errback instead of losing progress, and now this patch is changing it again to return a short read instead of errbacking. Returning a short read is what the uploader (in encode.py) is expecting, when it is reading the last block of the ciphertext, which might be shorter than the other blocks.	2009-02-12 17:04:47 -07:00
Zooko O'Whielacronx	bdb992467c	immutable repairer: add an assertion that a certain value in this tricky function is always what I think it is	2009-02-12 16:31:32 -07:00
Zooko O'Whielacronx	76d7cc4404	immutable repairer: errback any pending readers of DownUpConnector when it runs out of bytes, and test that fact	2009-02-11 20:11:29 -07:00
Zooko O'Whielacronx	7eb260a9cf	versioning: include an "appname" in the application version string in the versioning protocol, and make that appname be controlled by setup.py. It is currently hardcoded in setup.py to be 'allmydata-tahoe'. Ticket #556 is to make it configurable by a runtime command-line argument to setup.py: "--appname=foo", but I suddenly wondered if we really wanted that and at the same time realized that we don't need that for tahoe-1.3.0 release, so this patch just hardcodes it in setup.py. setup.py inspects a file named 'src/allmydata/_appname.py' and asserts that it contains the string "__appname__ = 'allmydata-tahoe'", and creates it if it isn't already present. src/allmydata/__init__.py imports _appname and reads __appname__ from it. The rest of the Python code imports allmydata and inspects "allmydata.__appname__", although in practice every use reads "allmydata.__full_version__" instead, where "allmydata.__full_version__" is created in src/allmydata/__init__.py to be: __full_version__ = __appname__ + '-' + str(__version__). All the code that emits an "application version string" when describing what version of a protocol it supports (introducer server, storage server, upload helper), or when describing itself in general (introducer client), uses allmydata.__full_version__. This fixes ticket #556 at least well enough for tahoe-1.3.0 release.	2009-02-11 17:18:16 -07:00
Zooko O'Whielacronx	ef1bfdd2bf	immutable: repairer: add a simple test to exercise the "leftover" code path, fix the bug (and rename the variable "leftover" to "extra")	2009-02-10 12:12:45 -07:00
Zooko O'Whielacronx	75e4e67ed7	immutable: tighten preconditions -- you can write empty strings or read zero bytes, and add the first simple unit test of DownUpConnector	2009-02-10 00:56:47 -07:00
Zooko O'Whielacronx	c59940852b	immutable: defensive programming: assert that the encrypted readable gave you no more than the number of bytes you asked for (There is a bug in the current DownUpConnector which can cause it to give more bytes than you asked for on one request, and then less on the next, effectively shifting some of the bytes to an earlier request, but I think this bug never gets triggered in practice.)	2009-02-09 23:46:05 -07:00
Brian Warner	a9a3b509df	upload: add a think-of-the-compatibility note to UploadResults	2009-02-09 14:50:04 -07:00
Brian Warner	a5ab6c060d	helper #609 : uploading client should ignore old helper's UploadResults, which were in a different format	2009-02-09 14:45:43 -07:00
Brian Warner	38ee95fec4	immutable/checker: wrap comments to 80cols, my laptop does not have a wide screen. No functional changes.	2009-02-07 14:04:39 -07:00
Brian Warner	d8b3505cf5	filenode: add get_repair_cap(), which uses the read-write filecap for mutable files, and the verifycap for immutable files	2009-01-22 21:38:36 -07:00
Brian Warner	aa50c30aa2	download: tiny cleanup of history code	2009-01-14 16:41:51 -07:00
Brian Warner	10268a4f7f	upload: move upload history into History object	2009-01-14 16:41:06 -07:00
Brian Warner	3920e6d1e7	immutable/download.py move recent-downloads history out of Downloader and into a separate class. upload/etc will follow soon.	2009-01-14 16:14:24 -07:00
Brian Warner	cc50e2f4aa	upload: use WriteBucketProxy_v2 when uploading a large file (with shares larger than 4GiB). This finally closes #346 . I think we can now handle immutable files up to 48EiB.	2009-01-12 20:14:42 -07:00
Brian Warner	bf56e2bb51	deep-check-and-repair: improve results and their HTML representation	2009-01-12 18:56:19 -07:00
Brian Warner	fe362c0021	hush pyflakes by removing unused imports	2009-01-12 15:41:20 -07:00
Zooko O'Whielacronx	25063688b4	immutable repairer This implements an immutable repairer by marrying a CiphertextDownloader to a CHKUploader. It extends the IDownloadTarget interface so that the downloader can provide some metadata that the uploader requires. The processing is incremental -- it uploads the first segments before it finishes downloading the whole file. This is necessary so that you can repair large files without running out of RAM or using a temporary file on the repairer. It requires only a verifycap, not a readcap. That is: it doesn't need or use the decryption key, only the integrity check codes. There are several tests marked TODO and several instances of XXX in the source code. I intend to open tickets to document further improvements to functionality and testing, but the current version is probably good enough for Tahoe-1.3.0.	2009-01-12 11:00:22 -07:00
Zooko O'Whielacronx	b496eba072	trivial: minor changes to in-line comments -- mark plaintext-hash-tree as obsolete	2009-01-10 14:56:01 -07:00
Zooko O'Whielacronx	6e3396fb88	immutable: redefine the "sharemap" member of the upload results to be a map from shnum to set of serverids It used to be a map from shnum to a string saying "placed this share on XYZ server". The new definition is more in keeping with the "sharemap" object that results from immutable file checking and repair, and it is more useful to the repairer, which is a consumer of immutable upload results.	2009-01-10 11:46:23 -07:00
Zooko O'Whielacronx	ef60e85ec6	naming: finish renaming "CheckerResults" to "CheckResults"	2009-01-09 18:00:52 -07:00
Brian Warner	f8de336039	immutable/checker: include a summary (with 'Healthy' or 'Not Healthy' and a count of shares) in the checker results	2009-01-08 20:01:45 -07:00
Zooko O'Whielacronx	ade6a4fa74	immutable: add a monitor API to CiphertextDownloader with which to tell it to stop its work	2009-01-08 14:42:15 -07:00
Zooko O'Whielacronx	157e365d2b	naming: Rename a few things which I touched or changed in the recent patch to download-without-decrypting. Rename "downloadable" to "target". Rename "u" to "v" in FileDownloader.__init__(). Rename "_uri" to "_verifycap" in FileDownloader. Rename "_downloadable" to "_target" in FileDownloader. Rename "FileDownloader" to "CiphertextDownloader".	2009-01-08 12:13:07 -07:00
Zooko O'Whielacronx	600196f571	immutable: refactor download to do only download-and-decode, not decryption. FileDownloader takes a verify cap and produces ciphertext, instead of taking a read cap and producing plaintext. FileDownloader does all integrity checking including the mandatory ciphertext hash tree and the optional ciphertext flat hash, rather than expecting its target to do some of that checking. Rename immutable.download.Output to immutable.download.DecryptingOutput. An instance of DecryptingOutput can be passed to FileDownloader to use as the latter's target. Text pushed to the DecryptingOutput is decrypted and then pushed to its target. DecryptingOutput satisfies the IConsumer interface, and if its target also satisfies IConsumer, then it forwards any pause/unpause signals to its producer (which is the FileDownloader). This patch also changes some logging code to use the new logging mixin class. Check integrity of a segment and decrypt the segment one block-sized buffer at a time instead of copying the buffers together into one segment-sized buffer (reduces peak memory usage, I think, and is probably a tad faster/less CPU, depending on your encoding parameters). Refactor FileDownloader so that processing of segments and of tail-segment share as much code as possible. FileDownloader and FileNode take caps as instances of URI (Python objects), not as strings.	2009-01-08 11:53:49 -07:00
Zooko O'Whielacronx	ecabcc674c	immutable: Make more parts of download use logging mixins and know what their "parent msg id" is.	2009-01-08 11:25:30 -07:00
Zooko O'Whielacronx	2a443cd049	trivial: M-x whitespace-cleanup on src/immutable/download.py	2009-01-08 10:49:01 -07:00
Zooko O'Whielacronx	7d15928faa	immutable: ValidatedExtendedURIProxy computes and stores the tail data size as a convenience to its caller. The "tail data size" is how many of the bytes of the tail segment are data (as opposed to padding).	2009-01-08 10:41:39 -07:00
Zooko O'Whielacronx	83b97ee79f	immutable: fix error in validation of ciphertext hash tree and add test for that code pyflakes pointed out to me that I had committed some code that is untested, since it uses an undefined name. This patch exercises that code -- the validation of the ciphertext hash tree -- by corrupting some of the share files in a very specific way, and also fixes the bug.	2009-01-07 23:40:12 -07:00
Zooko O'Whielacronx	6011f4522f	immutable: do not catch arbitrary exceptions/failures from the attempt to get a crypttext hash tree -- catch only ServerFailure, IntegrityCheckReject, LayoutInvalid, ShareVersionIncompatible, and DeadReferenceError Once again I inserted a bug into the code, and once again it was hidden by something catching arbitrary exception/failure and assuming that it means the server failed to provide valid data.	2009-01-07 22:25:51 -07:00
Zooko O'Whielacronx	e598ca2f3f	download: make sure you really get all the crypttext hashes We were not making sure that we really got all the crypttext hashes during download. If a server were to return less than the complete set of crypttext hashes, then our subsequent attempt to verify the correctness of the ciphertext would fail. (And it wouldn't be obvious without very careful debugging why it had failed.) This patch makes it so that you keep trying to get ciphertext hashes until you have a full set or you run out of servers to ask.	2009-01-07 20:26:38 -07:00
Zooko O'Whielacronx	d5a6eed407	trivial: fix redefinition of name "log" in imports (pyflakes)	2009-01-06 22:08:29 -07:00
Zooko O'Whielacronx	c85f75bb08	immutable: refactor uploader to do just encoding-and-uploading, not encryption This makes Uploader take an EncryptedUploadable object instead of an Uploadable object. I also changed it to return a verify cap instead of a tuple of the bits of data that one finds in a verify cap. This will facilitate hooking together an Uploader and a Downloader to make a Repairer. Also move offloaded.py into src/allmydata/immutable/.	2009-01-06 21:48:22 -07:00
Zooko O'Whielacronx	81add135dc	trivial: whitespace and docstring tidyups	2009-01-06 21:41:04 -07:00
Zooko O'Whielacronx	5e6f90a015	rename "checker results" to "check results", because it is more parallel to "check-and-repair results"	2009-01-06 13:37:03 -07:00
Zooko O'Whielacronx	c35a6ee3a2	trivial: fix a bunch of pyflakes complaints	2009-01-06 08:00:54 -07:00
Zooko O'Whielacronx	6a12f316a4	immutable: new checker and verifier New checker and verifier use the new download class. They are robust against various sorts of failures or corruption. They return detailed results explaining what they learned about your immutable files. Some grotesque sorts of corruption are not properly handled yet, and those ones are marked as TODO or commented-out in the unit tests. There is also a repairer module in this patch with the beginnings of a repairer in it. That repairer is mostly just the interface to the outside world -- the core operation of actually reconstructing the missing data blocks and uploading them is not in there yet. This patch also refactors the unit tests in test_immutable so that the handling of each kind of corruption is reported as passing or failing separately, can be separately TODO'ified, etc. The unit tests are also improved in various ways to require more of the code under test or to stop requiring unreasonable things of it. :-)	2009-01-05 18:28:18 -07:00
Zooko O'Whielacronx	206ab2b44d	immutable: handle another form of share corruption with LayoutInvalid exception instead of AssertionError	2009-01-05 17:46:45 -07:00
Zooko O'Whielacronx	c84bb795f3	trivial: remove unused import (pyflakes)	2009-01-05 17:31:20 -07:00
Zooko O'Whielacronx	f4fab23bf6	immutable: raise a LayoutInvalid exception instead of an AssertionError if the share is corrupted so that the sharehashtree is the wrong size	2009-01-05 14:01:14 -07:00
Zooko O'Whielacronx	98b28c1d5e	immutable: stop reading past the end of the sharefile in the process of optimizing download -- Tahoe storage servers < 1.3.0 return an error if you read past the end of the share file	2009-01-05 13:40:57 -07:00
Zooko O'Whielacronx	8a840469c3	immutable: tidy up the notification of waiters for ReadBucketProxy	2009-01-05 13:35:22 -07:00
Zooko O'Whielacronx	778167c2b1	immutable: refactor downloader to be more reusable for checker/verifier/repairer (and better). The code for validating the share hash tree and the block hash tree has been rewritten to make sure it handles all cases, to share metadata about the file (such as the share hash tree, block hash trees, and UEB) among different share downloads, and not to require hashes to be stored on the server unnecessarily, such as the roots of the block hash trees (not needed since they are also the leaves of the share hash tree), and the root of the share hash tree (not needed since it is also included in the UEB). It also passes the latest tests including handling corrupted shares well. ValidatedReadBucketProxy takes a share_hash_tree argument to its constructor, which is a reference to a share hash tree shared by all ValidatedReadBucketProxies for that immutable file download. ValidatedReadBucketProxy requires the block_size and share_size to be provided in its constructor, and it then uses those to compute the offsets and lengths of blocks when it needs them, instead of reading those values out of the share. The user of ValidatedReadBucketProxy therefore has to have first used a ValidatedExtendedURIProxy to compute those two values from the validated contents of the URI. This pleasingly simplifies safety analysis: the client knows which span of bytes corresponds to a given block from the validated URI data, rather than from the unvalidated data stored on the storage server. It also simplifies unit testing of verifier/repairer, because now it doesn't care about the contents of the "share size" and "block size" fields in the share. It does not relieve the need for share data v2 layout, because we still need to store and retrieve the offsets of the fields which come after the share data, therefore we still need to use share data v2 with its 8-byte fields if we want to store share data larger than about 2^32. 
Specify which subset of the block hashes and share hashes you need while downloading a particular share. In the future this will hopefully be used to fetch only a subset, for network efficiency, but currently all of them are fetched, regardless of which subset you specify. ReadBucketProxy hides the question of whether it has "started" or not (sent a request to the server to get metadata) from its user. Download is optimized to do as few roundtrips and as few requests as possible, hopefully speeding up download a bit.	2009-01-05 09:51:45 -07:00
Zooko O'Whielacronx	8f5cc24948	trivial: remove unused import (pyflakes)	2009-01-03 12:22:15 -07:00
Zooko O'Whielacronx	5954ab456d	immutable: fix test for truncated reads of URI extension block size	2009-01-03 11:44:27 -07:00
Zooko O'Whielacronx	54787771c3	immutable: fix detection of truncated shares to take into account the fieldsize -- either 4 or 8	2009-01-02 18:57:45 -07:00
Zooko O'Whielacronx	21e0ff97f2	immutable: raise LayoutInvalid instead of struct.error when a share is truncated To fix this error from the Windows buildslave: [ERROR]: allmydata.test.test_immutable.Test.test_download_from_only_3_remaining_shares Traceback (most recent call last): File "C:\Documents and Settings\buildslave\windows-native-tahoe\windows\build\src\allmydata\immutable\download.py", line 135, in _bad raise NotEnoughSharesError("ran out of peers, last error was %s" % (f,)) allmydata.interfaces.NotEnoughSharesError: ran out of peers, last error was [Failure instance: Traceback: <class 'struct.error'>: unpack requires a string argument of length 4 c:\documents and settings\buildslave\windows-native-tahoe\windows\build\support\lib\site-packages\foolscap-0.3.2-py2.5.egg\foolscap\call.py:667:_done c:\documents and settings\buildslave\windows-native-tahoe\windows\build\support\lib\site-packages\foolscap-0.3.2-py2.5.egg\foolscap\call.py:53:complete c:\Python25\lib\site-packages\twisted\internet\defer.py:239:callback c:\Python25\lib\site-packages\twisted\internet\defer.py:304:_startRunCallbacks --- <exception caught here> --- c:\Python25\lib\site-packages\twisted\internet\defer.py:317:_runCallbacks C:\Documents and Settings\buildslave\windows-native-tahoe\windows\build\src\allmydata\immutable\layout.py:374:_got_length C:\Python25\lib\struct.py:87:unpack ] ===============================================================================	2009-01-02 18:48:06 -07:00
Zooko O'Whielacronx	e26cec2502	immutable: add more detailed tests of download, including testing the count of how many reads different sorts of downloads take	2009-01-02 16:54:59 -07:00
Zooko O'Whielacronx	cc70c163ba	trivial: a few improvements to in-line doc and code, and renaming of test/test_immutable_checker.py to test/test_immutable.py That file currently tests checker and verifier and repairer, and will soon also test downloader.	2009-01-02 16:49:41 -07:00
Zooko O'Whielacronx	a52b5542e9	immutable: fix name change from BadOrMissingShareHash to BadOrMissingHash One of the instances of the name accidentally didn't get changed, and pyflakes noticed. The new downloader/checker/verifier/repairer unit tests would also have noticed, but those tests haven't been rolled into a patch and applied to this repo yet...	2009-01-02 13:27:09 -07:00
Zooko O'Whielacronx	c72be1c553	trivial: remove unused import -- thanks, pyflakes	2009-01-02 13:21:28 -07:00
Zooko O'Whielacronx	d8c9c3dc99	immutable: download.py: Raise the appropriate type of exception to indicate the cause of failure, e.g. BadOrMissingHash, ServerFailure, IntegrityCheckReject (which is a supertype of BadOrMissingHash). This helps users (such as verifier/repairer) catch certain classes of reasons for "why did this download not work". The tests of verifier/repairer test this code and rely on this code.	2009-01-02 12:58:58 -07:00
Zooko O'Whielacronx	fa5c1d8326	immutable: ReadBucketProxy defines classes of exception: LayoutInvalid and its two subtypes RidiculouslyLargeURIExtensionBlock and ShareVersionIncompatible. This helps users (such as verifier/repairer) catch certain classes of reasons for "why did this download not work". This code gets exercised by the verifier/repairer unit tests, which corrupt the shares on disk in order to trigger problems like these.	2009-01-02 12:15:54 -07:00
Zooko O'Whielacronx	0ee027c180	immutable: ValidatedExtendedURIProxy computes and stores block_size and share_size for the convenience of its users	2009-01-02 11:43:17 -07:00
Zooko O'Whielacronx	0687f692b0	trivial: "M-x whitespace-cleanup" on immutable/layout.py	2008-12-31 15:07:02 -07:00
Zooko O'Whielacronx	c54783f5e1	immutable: don't catch all exception when downloading, catch only DeadReferenceError and IntegrityCheckReject	2008-12-21 17:41:35 -07:00
Zooko O'Whielacronx	ad58f8b693	immutable: invent download.BadOrMissingHashError which is raised if either hashtree.BadHashError, hashtree.NotEnoughHashesError, and which is a subclass of IntegrityCheckReject	2008-12-21 17:41:30 -07:00
Zooko O'Whielacronx	8b7ce325d7	immutable, checker, and tests: improve docstrings, assertions, tests No functional changes, but remove unused code, improve or fix docstrings, etc.	2008-12-21 15:07:52 -07:00
Zooko O'Whielacronx	ec86563326	immutable: when downloading an immutable file, use primary shares if they are available Primary shares require no erasure decoding so the more primary shares you have, the less CPU is used.	2008-12-20 07:14:56 -07:00
Zooko O'Whielacronx	a71a68b31e	trivial: remove unused import (thanks, pyflakes)	2008-12-19 13:46:29 -07:00
Zooko O'Whielacronx	471e1f1b9b	try to tidy up uri-as-string vs. uri-as-object I get confused about whether a given argument or return value is a uri-as-string or uri-as-object. This patch adds a lot of assertions that it is one or the other, and also changes CheckerResults to take objects not strings. In the future, I hope that we generally use Python objects except when importing into or exporting from the Python interpreter e.g. over the wire, the UI, or a stored file.	2008-12-19 08:39:24 -07:00
Zooko O'Whielacronx	7b285ebcb1	immutable: remove the last bits of code (only test code or unused code) which did something with plaintext hashes or plaintext hash trees	2008-12-19 08:18:07 -07:00
Zooko O'Whielacronx	d67a3fe4b1	immutable: use new logging mixins to simplify logging	2008-12-16 18:04:50 -07:00
Zooko O'Whielacronx	d511941136	immutable: refactor ReadBucketProxy a little	2008-12-16 17:53:25 -07:00
Zooko O'Whielacronx	db566db31a	immutable: remove unused code to produce plaintext hashes	2008-12-09 16:45:46 -07:00
Zooko O'Whielacronx	c3edae5158	finish renaming 'subshare' to 'block' in immutable/encode.py and in docs/	2008-12-09 16:33:18 -07:00
Zooko O'Whielacronx	c456ff8591	rename "get_verifier()" to "get_verify_cap()"	2008-12-08 12:44:11 -07:00
Zooko O'Whielacronx	60bbc46a53	minor: fix unused imports -- thanks, pyflakes	2008-12-05 13:07:23 -07:00
Zooko O'Whielacronx	b315619d6b	download: refactor handling of URI Extension Block and crypttext hash tree, simplify things Refactor into a class the logic of asking each server in turn until one of them gives an answer that validates. It is called ValidatedThingObtainer. Refactor the downloading and verification of the URI Extension Block into a class named ValidatedExtendedURIProxy. The new logic of validating UEBs is minimalist: it doesn't require the UEB to contain any unnecessary information, but of course it still accepts such information for backwards compatibility (so that this new download code is able to download files uploaded with old, and for that matter with current, upload code). The new logic of validating UEBs follows the practice of doing all validation up front. This practice advises one to isolate the validation of incoming data into one place, so that all of the rest of the code can assume only valid data. If any redundant information is present in the UEB+URI, the new code cross-checks and asserts that it is all fully consistent. This closes some issues where the uploader could have uploaded inconsistent redundant data, which would probably have caused the old downloader to simply reject that download after getting a Python exception, but perhaps could have caused greater harm to the old downloader. I removed the notion of selecting an erasure codec from codec.py based on the string that was passed in the UEB. Currently "crs" is the only such string that works, so "_assert(codec_name == 'crs')" is simpler and more explicit. This is also in keeping with the "validate up front" strategy -- now if someone sets a different string than "crs" in their UEB, the downloader will reject the download in the "validate this UEB" function instead of in a separate "select the codec instance" function. I removed the code to check plaintext hashes and plaintext Merkle Trees. Uploaders do not produce this information any more (since it potentially exposes confidential information about the file), and the unit tests for it were disabled. The downloader before this patch would check that plaintext hash or plaintext merkle tree if they were present, but not complain if they were absent. The new downloader in this patch complains if they are present and doesn't check them. (We might in the future re-introduce such hashes over the plaintext, but encrypt the hashes which are stored in the UEB to preserve confidentiality. This would be a double-check on the correctness of our own source code -- the current Merkle Tree over the ciphertext is already sufficient to guarantee the integrity of the download unless there is a bug in our Merkle Tree or AES implementation.) This patch increases the lines-of-code count by 8 (from 17,770 to 17,778), and reduces the uncovered-by-tests lines-of-code count by 24 (from 1408 to 1384). Those numbers would be more meaningful if we omitted src/allmydata/util/ from the test-coverage statistics.	2008-12-05 08:17:54 -07:00
Brian Warner	3e25efc010	upload: when using a Helper, insist that it provide protocols/helper/v1 . Related to #538 .	2008-11-21 20:29:32 -07:00
Brian Warner	0fab511be5	upload: don't use servers which can't support the share size we need. This ought to avoid #439 problems. Some day we'll have a storage server which advertises support for a larger share size. No tests yet.	2008-11-21 20:28:12 -07:00
Brian Warner	bf06492a90	#538 : fetch version and attach to the rref. Make IntroducerClient demand v1 support.	2008-11-21 20:07:27 -07:00
Brian Warner	7932fadb5e	webapi: add 'summary' string to checker results JSON	2008-11-18 18:28:26 -07:00
Brian Warner	dfa2408157	checker: add is_recoverable() to checker results, make our stub immutable-verifier not throw an exception on unrecoverable files, add tests	2008-11-06 22:35:47 -07:00
Brian Warner	6fa41e738b	immutable: tolerate filenode.read() with a size= that's too big, rather than hanging	2008-11-04 15:29:19 -07:00
Brian Warner	ba019bfd3a	#527 : expire the cached files that are used to support Range: headers, every hour, when the file is unused and older than an hour	2008-10-30 13:39:09 -07:00
Brian Warner	b1db6d9ff2	web: add 'Repair' button to checker results when they indicate unhealthiness. Also add the object's uri to the CheckerResults instance.	2008-10-29 18:09:17 -07:00
Brian Warner	b1ca238176	#527 : respond to GETs with early ranges quickly, without waiting for the whole file to download. Fixes the alacrity problems with the earlier code. Still needs cache expiration.	2008-10-28 17:56:18 -07:00
Brian Warner	37e3d8e47c	#527 : support HTTP 'Range:' requests, using a cachefile. Adds filenode.read(consumer, offset, size) method. Still needs: cache expiration, reduced alacrity.	2008-10-28 13:41:04 -07:00
Brian Warner	914655c52b	interfaces.py: promote immutable.encode.NotEnoughSharesError.. it isn't just for immutable files any more	2008-10-27 13:34:49 -07:00
Brian Warner	1566a9474c	immutable/filenode.py: add TODO note about the #514 monitor to check(), rather than going through the checker/verifier code and adding it, since Zooko is currently working on that code	2008-10-22 01:42:37 -07:00
Brian Warner	977c6ac510	more #514 : pass a Monitor to all checker operations, make mutable-checker honor the cancel flag	2008-10-22 01:38:18 -07:00
Zooko O'Whielacronx	8a6d1e5da6	repairer: test all different kinds of corruption that can happen to share files on disk	2008-10-14 16:09:20 -07:00
Brian Warner	7031a69bee	storage: introduce v2 immutable shares, with 8-byte offsets fields, to remove two of the three size limitations in #346 . This code handles v2 shares but does not generate them. We'll make a release with this v2-tolerance, wait a while, then make a second release that actually generates v2 shares, to avoid compatibility problems.	2008-10-09 18:13:27 -07:00
Brian Warner	288d55825c	storage: split WriteBucketProxy and ReadBucketProxy out into immutable/layout.py . No behavioral changes.	2008-10-09 17:08:00 -07:00
Brian Warner	d90a3ed7f8	test_system: add test coverage for immutable download.ConsumerAdapter, remove debug messages	2008-10-06 15:50:37 -07:00
Brian Warner	bc237b3956	ftp server: initial implementation. Still needs unit tests, custom Twisted patches. For #512	2008-10-06 12:52:36 -07:00
Brian Warner	9c505e49c2	stop using 'as' as an identifier: as with 'with', 'as' has become a reserved word in python 2.6	2008-10-02 17:27:49 -07:00
Zooko O'Whielacronx	39f305e44f	setup: remove a few minimal unit tests from test_filenode which have been obviated by much better tests in test_mutable and test_system	2008-09-25 09:15:44 -07:00
Zooko O'Whielacronx	ff08bab0c4	immutable: remove unused imports (thanks, pyflakes)	2008-09-23 12:26:10 -07:00
Zooko O'Whielacronx	fa302544fa	immutable: refactor immutable filenodes and comparison thereof * the two kinds of immutable filenode now have a common base class * they store only an instance of their URI, not both an instance and a string * they delegate comparison to that instance	2008-09-23 11:52:49 -07:00
Brian Warner	f570ad7ba5	disallow deep-check on non-directories, simplifies the code a bit	2008-09-10 13:44:58 -07:00
Brian Warner	80d8f3e862	hush pyflakes	2008-09-09 19:50:17 -07:00
Brian Warner	1d2d6a35a6	checker results: add output=JSON to webapi, add tests, clean up APIs to make the internal ones use binary strings (nodeid, storage index) and the web/JSON ones use base32-encoded strings. The immutable verifier is still incomplete (it returns imaginary healthy results).	2008-09-09 19:45:17 -07:00
Brian Warner	04513e3ac5	immutable verifier: provide some dummy results so deep-check works, make the tests ignore these results until we finish it off	2008-09-09 18:08:27 -07:00
Brian Warner	f895e39d48	checker results: more tests, more results. immutable verifier tests are disabled until they emit more complete results	2008-09-09 17:15:46 -07:00
Brian Warner	90b934eb71	checker: add tests, add stub for immutable check_and_repair	2008-09-09 16:34:49 -07:00
Brian Warner	af2231563e	immutable/checker: make log() tolerate the format= form	2008-09-07 20:03:08 -07:00
Brian Warner	3408d552cd	checker: overhaul checker results, split check/check_and_repair into separate methods, improve web displays	2008-09-07 12:44:56 -07:00
Brian Warner	a94af879ff	logging: add 'unique-message-ids' (or 'umids') to each WEIRD-or-higher log.msg call, to make it easier to correlate log message with source code	2008-08-25 18:57:59 -07:00
Brian Warner	735aa895b9	logging cleanups: lower DeadReferenceError from WEIRD (which provokes Incidents) to merely UNUSUAL, don't pre-format Failures in others	2008-08-25 17:51:55 -07:00

262 Commits