48 Commits

Author SHA1 Message Date
Brian Warner
be11bfe812 munin: add tahoe_diskleft plugin, update spacetime/diskwatcher.tac to support it 2008-08-28 13:32:36 -07:00
Brian Warner
482b3e0939 munin/tahoe_doomsday: change the graph title, 'time predictor' is more accurate than 'space predictor' 2008-08-27 14:30:13 -07:00
Brian Warner
ba6e525952 munin/tahoe_diskusage: clip the graph at zero, to prevent transient negative excursions (such as when a lot of old logfiles are deleted from a storage server's disk) from scaling the graph into unusability 2008-08-27 12:35:43 -07:00
Brian Warner
743ca91bcd munin diskusage/doomsday: oops, fix labels, everything was reported in the 1hr column 2008-08-11 13:34:31 -07:00
Brian Warner
8828fe8769 munin/tahoe_overhead: don't emit non-sensicial numbers 2008-08-07 14:40:08 -07:00
Brian Warner
f7ad0d2f6f munin: add tahoe_overhead plugin, to measure effectiveness of GC and deleting data from inactive accounts 2008-08-07 13:39:25 -07:00
Brian Warner
39db60cc2b diskwatcher.tac: include total-bytes-used 2008-08-07 13:12:14 -07:00
Brian Warner
e19564df59 misc/spacetime: add munin plugins, add everything to .deb 2008-08-06 23:00:03 -07:00
Brian Warner
315a89f9d8 munin/tahoe_spacetime: show 2wk data even if 4wk data is unavailable 2008-07-28 12:42:33 -07:00
Brian Warner
870fb68620 munin/tahoe_server_latency: add percentile to graph title 2008-07-16 16:08:17 -07:00
Brian Warner
af0648c0f8 munin/serverops: improve graph title 2008-07-16 12:36:42 -07:00
Brian Warner
390c5c6ca5 munin/tahoe_stats: add server operations per second (allocate/get/writev/readv 2008-07-14 13:25:35 -07:00
Brian Warner
0f231e2e66 munin/tahoe_stats: rearrange config a bit, no functional changes 2008-07-14 13:25:01 -07:00
Brian Warner
6525742b37 munin/tahoe_server_operations: fix comments 2008-07-14 13:18:47 -07:00
Brian Warner
a72dfcc00d munin: add plugin for storage-server operations per second 2008-07-14 13:18:11 -07:00
Brian Warner
b2f062ab82 misc/munin: add server_latency plugin 2008-07-14 12:29:19 -07:00
Brian Warner
f303670265 misc/munin: rename plugins to make munin happy: use underscores rather than hyphens, remove .py suffix 2008-07-11 15:21:13 -07:00
Brian Warner
503ce1cbd7 remove obsolete tahoe-sharesperfile munin plugin 2008-07-11 15:17:33 -07:00
Brian Warner
38252b6df1 remove tahoe-dirnodes.py munin plugin, it is way obsolete 2008-07-11 15:14:28 -07:00
Brian Warner
c23f675095 misc/munin/tahoe_spacetime.py: add a munin plugin to estimate how much time remains before you run out of space 2008-05-28 11:33:54 -07:00
Brian Warner
74df0b58b5 munin: add tahoe_cpu_watcher.py, to track the data from misc/cpu-watcher.tac 2008-05-07 13:19:08 -07:00
Brian Warner
d2a2b8a4ae munin/tahoe_estimate_files.py: tool to estimate the total number of slots (mutable and immutable combined) in the grid, from a small sample 2008-04-24 11:28:35 -07:00
robk-tahoe
5f5ed6b08d munin stats: fix typo in mutable file stats 2008-04-23 15:50:29 -07:00
robk-tahoe
f74042c75e munin stats: add mutable/uploader traffic graphs
this adds munin graphs to present data already published by nodes to
the stats_gatherer, namely mutable files published/retrieved, and 
immutable files uploaded, and the bytes thereof
2008-04-23 14:39:41 -07:00
Brian Warner
239c950810 munin/tahoe-stats.py: category is 'counters' not 'stats', use DERIVE 2008-04-23 14:03:23 -07:00
Brian Warner
500934b72f helper stats: fix the /helper_status page, the recent conflict merging missed some uses. Added tests, updated the munin plugins to match 2008-04-14 13:18:53 -07:00
robk-tahoe
766deaa9b6 stats_gatherer: reconcile helper stats gathering
I'd implemented stats gathering hooks in the helper a while back.
Brian did the same without reference to my changes.  This reconciles
those two changes, encompassing all the stats in both changes,
implemented through the stats_provider interface.

this also provide templates for all 10 helper graphs in the 
tahoe-stats munin plugin.
2008-04-10 17:25:44 -07:00
robk-tahoe
d8f3521a02 munin: added a series of munin graphs to report upload helper state 2008-03-25 18:30:46 -07:00
Brian Warner
fb51a60993 munin plugins: add 'graph_category tahoe' 2008-03-28 16:18:30 -07:00
Brian Warner
5da9f42cf8 add munin/tahoe-rootdir-space 2008-03-28 16:18:09 -07:00
Brian Warner
da6ab9f522 munin/tahoe-introstats.py: put this graph in the 'tahoe' category 2008-03-28 14:51:37 -07:00
Brian Warner
6b416fc28c helper: add another munin plugin 2008-03-27 16:50:30 -07:00
Brian Warner
2c96a32633 helper: add more stats to webapi, at /helper_status 2008-03-27 16:46:08 -07:00
zandr
f4d51ea366 tahoe-nodememory.py: change category to Tahoe 2008-03-25 18:42:11 -07:00
zandr
5245140a1c Copy amd-nodememory munin plugin over to tahoe and point at correct pidfile 2008-03-25 17:50:04 -07:00
Brian Warner
490f815e8f add a munin plugin to display introducer stats 2008-03-25 13:15:52 -07:00
robk-tahoe
3641b7924b munin stats: don't suppress series with no current data
having changed tahoe-stats to not report data series if there was no recent
data recorded for a node, I wound up making it hide the data series.  this
change causes it to report all data series for which stats exist in the
'config' phase, so that they show up, but only report actual data if the
stats are recent, so that they show up as missing if the node is not
reporting stats currently
2008-02-01 18:44:57 -07:00
robk-tahoe
b80cfeb186 munin stats: suppress output of data more that 5min old
if a node fails to report stats, the natural thing to do in re munin is to 
supress the data for that data series.  the previous tahoe-stats would output
whatever data was present in the stats_gatherer's stats.pickle, regardless of
how old.

this change means that if the gatherer hasn't received data within the last
5 min, then no data is reported to munin for that node.
2008-01-31 21:04:23 -07:00
robk-tahoe
e910a64e7d munin stats: change truncated nodeid to 4 char (not 8) 2008-01-31 19:21:17 -07:00
robk-tahoe
7b9f3207d0 stats: add a simple stats gathering system
We have a desire to collect runtime statistics from multiple nodes primarily
for server monitoring purposes.   This implements a simple implementation of
such a system, as a skeleton to build more sophistication upon.

Each client now looks for a 'stats_gatherer.furl' config file.  If it has
been configured to use a stats gatherer, then it instantiates internally
a StatsProvider.  This is a central place for code which wishes to offer
stats up for monitoring to report them to, either by calling 
stats_provider.count('stat.name', value) to increment a counter, or by
registering a class as a stats producer with sp.register_producer(obj).

The StatsProvider connects to the StatsGatherer server and provides its
provider upon startup.  The StatsGatherer is then responsible for polling
the attached providers periodically to retrieve the data provided.
The provider queries each registered producer when the gatherer queries
the provider.  Both the internal 'counters' and the queried 'stats' are
then reported to the gatherer.

This provides a simple gatherer app, (c.f. make stats-gatherer-run)
which prints its furl and listens for incoming connections.  Once a
minute, the gatherer polls all connected providers, and writes the
retrieved data into a pickle file.

Also included is a munin plugin which knows how to read the gatherer's
stats.pickle and output data munin can interpret.  this plugin, 
tahoe-stats.py can be symlinked as multiple different names within
munin's 'plugins' directory, and inspects argv to determine which
data to display, doing a lookup in a table within that file.
It looks in the environment for 'statsfile' to determine the path to
the gatherer's stats.pickle.  An example plugins-conf.d file is
provided.
2008-01-30 20:11:07 -07:00
Brian Warner
475a58fcc5 munin: add a sample file for /etc/munin/plugin-conf.d/ 2007-10-22 15:08:39 -07:00
Brian Warner
0aba63ff17 munin: oops, fix that trailing-newline fix, the previous one mangled the config 2007-10-18 12:49:22 -07:00
Brian Warner
b339c76560 munin: remove extra blank line from config output 2007-10-18 12:39:26 -07:00
Brian Warner
33a02e19f1 munin: update shareholder plugins to match current storage layout 2007-10-18 12:08:58 -07:00
Brian Warner
bd49bafb23 munin/tahoe-sharesperfile: don't truncate to an int, emit a float instead 2007-08-23 16:31:15 -07:00
Brian Warner
b25446d514 munin/tahoe-storagespace.py: use /bin/du instead of our python version, since
it a) is faster and b) will include filesystem overhead that is a real cost.
2007-08-09 18:26:57 -07:00
Brian Warner
cb890cb9e4 munin plugins: fix labels 2007-07-05 14:38:31 -07:00
Brian Warner
8b2f0ef44c add some munin plugins 2007-07-05 13:38:15 -07:00