mirror of
https://github.com/tahoe-lafs/tahoe-lafs.git
synced 2025-01-11 15:32:39 +00:00
8143183e39
fixes #1225
163 lines
5.7 KiB
ReStructuredText
163 lines
5.7 KiB
ReStructuredText
============================================
|
|
Performance costs for some common operations
|
|
============================================
|
|
|
|
1. `Publishing an A-byte immutable file`_
|
|
2. `Publishing an A-byte mutable file`_
|
|
3. `Downloading B bytes of an A-byte immutable file`_
|
|
4. `Downloading B bytes of an A-byte mutable file`_
|
|
5. `Modifying B bytes of an A-byte mutable file`_
|
|
6. `Inserting/Removing B bytes in an A-byte mutable file`_
|
|
7. `Adding an entry to an A-entry directory`_
|
|
8. `Listing an A entry directory`_
|
|
9. `Performing a file-check on an A-byte file`_
|
|
10. `Performing a file-verify on an A-byte file`_
|
|
11. `Repairing an A-byte file (mutable or immutable)`_
|
|
|
|
Publishing an ``A``-byte immutable file
|
|
=======================================
|
|
|
|
network: A
|
|
|
|
memory footprint: N/k*128KiB
|
|
|
|
notes: An immutable file upload requires an additional I/O pass over the entire
|
|
source file before the upload process can start, since convergent
|
|
encryption derives the encryption key in part from the contents of the
|
|
source file.
|
|
|
|
Publishing an ``A``-byte mutable file
|
|
=====================================
|
|
|
|
network: A
|
|
|
|
memory footprint: N/k*A
|
|
|
|
cpu: O(A) + a large constant for RSA keypair generation
|
|
|
|
notes: Tahoe-LAFS generates a new RSA keypair for each mutable file that it
|
|
publishes to a grid. This takes up to 1 or 2 seconds on a typical desktop PC.
|
|
|
|
Part of the process of encrypting, encoding, and uploading a mutable file to a
|
|
Tahoe-LAFS grid requires that the entire file be in memory at once. For larger
|
|
files, this may cause Tahoe-LAFS to have an unacceptably large memory footprint
|
|
(at least when uploading a mutable file).
|
|
|
|
Downloading ``B`` bytes of an ``A``-byte immutable file
|
|
=======================================================
|
|
|
|
network: B
|
|
|
|
memory footprint: 128KiB
|
|
|
|
notes: When Tahoe-LAFS 1.8.0 or later is asked to read an arbitrary range
|
|
of an immutable file, only the 128-KiB segments that overlap the
|
|
requested range will be downloaded.
|
|
|
|
(Earlier versions would download from the beginning of the file up
|
|
until the end of the requested range, and then continue to download
|
|
the rest of the file even after the request was satisfied.)
|
|
|
|
Downloading ``B`` bytes of an ``A``-byte mutable file
|
|
=====================================================
|
|
|
|
network: A
|
|
|
|
memory footprint: A
|
|
|
|
notes: As currently implemented, mutable files must be downloaded in
|
|
their entirety before any part of them can be read. We are
|
|
exploring fixes for this; see ticket #393 for more information.
|
|
|
|
Modifying ``B`` bytes of an ``A``-byte mutable file
|
|
===================================================
|
|
|
|
network: A
|
|
|
|
memory footprint: N/k*A
|
|
|
|
notes: If you upload a changed version of a mutable file that you
|
|
earlier put onto your grid with, say, 'tahoe put --mutable',
|
|
Tahoe-LAFS will replace the old file with the new file on the
|
|
grid, rather than attempting to modify only those portions of the
|
|
file that have changed. Modifying a file in this manner is
|
|
essentially uploading the file over again, except that it re-uses
|
|
the existing RSA keypair instead of generating a new one.
|
|
|
|
Inserting/Removing ``B`` bytes in an ``A``-byte mutable file
|
|
============================================================
|
|
|
|
network: A
|
|
|
|
memory footprint: N/k*A
|
|
|
|
notes: Modifying any part of a mutable file in Tahoe-LAFS requires that
|
|
the entire file be downloaded, modified, held in memory while it is
|
|
encrypted and encoded, and then re-uploaded. A future version of the
|
|
mutable file layout ("LDMF") may provide efficient inserts and
|
|
deletes. Note that this sort of modification is mostly used internally
|
|
for directories, and isn't something that the WUI, CLI, or other
|
|
interfaces will do -- instead, they will simply overwrite the file to
|
|
be modified, as described in "Modifying B bytes of an A-byte mutable
|
|
file".
|
|
|
|
Adding an entry to an ``A``-entry directory
|
|
===========================================
|
|
|
|
network: O(A)
|
|
|
|
memory footprint: N/k*A
|
|
|
|
notes: In Tahoe-LAFS, directories are implemented as specialized mutable
|
|
files. So adding an entry to a directory is essentially adding B
|
|
(actually, 300-330) bytes somewhere in an existing mutable file.
|
|
|
|
Listing an ``A`` entry directory
|
|
================================
|
|
|
|
network: O(A)
|
|
|
|
memory footprint: N/k*A
|
|
|
|
notes: Listing a directory requires that the mutable file storing the
|
|
directory be downloaded from the grid. So listing an A entry
|
|
directory requires downloading a (roughly) 330 * A byte mutable
|
|
file, since each directory entry is about 300-330 bytes in size.
|
|
|
|
Performing a file-check on an ``A``-byte file
|
|
=============================================
|
|
|
|
network: O(S), where S is the number of servers on your grid
|
|
|
|
memory footprint: negligible
|
|
|
|
notes: To check a file, Tahoe-LAFS queries all the servers that it knows
|
|
about. Note that neither of these values directly depend on the size
|
|
of the file. This is relatively inexpensive, compared to the verify
|
|
and repair operations.
|
|
|
|
Performing a file-verify on an ``A``-byte file
|
|
==============================================
|
|
|
|
network: N/k*A
|
|
|
|
memory footprint: N/k*128KiB
|
|
|
|
notes: To verify a file, Tahoe-LAFS downloads all of the ciphertext
|
|
shares that were originally uploaded to the grid and integrity
|
|
checks them. This is, for well-behaved grids, likely to be more
|
|
expensive than downloading an A-byte file, since only a fraction
|
|
of these shares are necessary to recover the file.
|
|
|
|
Repairing an ``A``-byte file (mutable or immutable)
|
|
===================================================
|
|
|
|
network: variable; up to around O(A)
|
|
|
|
memory footprint: from 128KiB to (1+N/k)*128KiB
|
|
|
|
notes: To repair a file, Tahoe-LAFS downloads the file, and generates/uploads
|
|
missing shares in the same way as when it initially uploads the file.
|
|
So, depending on how many shares are missing, this can be about as
|
|
expensive as initially uploading the file in the first place.
|