Summarizing output of multiple servers and threads.

fio.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

From: Mark Smithers <smithersathpe@gmail.com>
To: fio@vger.kernel.org
Subject: Summarizing output of multiple servers and threads.
Date: Mon, 27 Nov 2023 16:09:44 +0000	[thread overview]
Message-ID: <CAM_M6akYhjNzHr=Ky7=XisZCneBRvTCOFtGU0ZeOWEB2mLWL2A@mail.gmail.com> (raw)

 When benchmarking my arrays I need to have multiple volumes presented
to multiple servers in order to get to saturation point. For example,
I may have 10 x 2TiB volumes presented to 3 servers for a total of 30
volumes. I need multiple servers as I run out of CPU power or I start
to block with queuing on the server HBAs This will then deliver
multiple millions of IOs if required.

Currently I use VDBench for testing, but I want to switch to using a
tool that is being maintained, plus there are several features in FIO
I would like to explore.

Because I use multiple servers, I run FIO in the client server mode
which seems fine. However, what I really miss is the results being
summarized into one file that tells me the results
My biggest struggle now is not in job creation but handling the results.

In VDBench, I get the following output for a single job/thread against
a single volume (so one file per thread or disk)

                interval        i/o   MB/sec   bytes   read     resp
  read    write     read    write     resp  queue  cpu%  cpu%
                               rate  1024**2     i/o    pct     time
  resp     resp      max      max   stddev  depth sys+u   sys
22:51:42.008           1   138447.1  1081.62    8192  29.94    1.762
 0.542    2.283    29.00    45.76    4.551  244.1  32.2   2.9
22:51:52.005           2   102852.8   803.54    8192  30.00    2.481
 0.427    3.362    24.45    46.83    5.956  255.1  34.0   4.5
22:52:02.006           3   103044.4   805.03    8192  30.10    2.475
 0.422    3.359    15.07    43.07    5.934  255.1  34.0   4.5
22:52:12.006           4   101983.1   796.74    8192  29.90    2.501
 0.423    3.388    15.56    48.09    6.067  255.1  33.9   4.5
22:52:22.006           5   102079.5   797.50    8192  30.05    2.500
 0.418    3.395    20.43    41.22    6.030  255.1  33.9   4.4
22:52:32.008           6   102795.2   803.09    8192  30.04    2.482
 0.426    3.364    20.89    48.04    5.973  255.1  34.0   4.5
22:52:42.007           7   102271.3   798.99    8192  29.93    2.495
 0.424    3.379    22.18    48.33    5.989  255.1  33.9   4.4
22:52:42.023     avg_2-7   102504.4   800.82    8192  30.00    2.489
 0.423    3.374    24.45    48.33    5.991  255.1  34.0   4.5

Which can result in a summary file like this of all the jobs/threads
that were being run.

                interval        i/o   MB/sec   bytes   read     resp
  read    write     read    write     resp  queue  cpu%  cpu%
                               rate  1024**2     i/o    pct     time
  resp     resp      max      max   stddev  depth sys+u   sys
22:51:42.036           1   849767.7  6638.81    8192  29.98    1.724
 0.535    2.233    31.37    46.26    4.510 1466.4  10.7   2.3
22:51:52.016           2   624072.8  4875.57    8192  29.97    2.455
 0.420    3.327    25.40    46.94    5.934 1531.8  11.8   3.4
22:52:02.017           3   623773.3  4873.23    8192  30.02    2.455
 0.416    3.330    19.12    43.11    5.921 1532.0  11.8   3.4
22:52:12.017           4   617948.9  4827.73    8192  29.95    2.478
 0.416    3.360    19.97    48.09    6.047 1532.0  11.8   3.4
22:52:22.017           5   619355.4  4838.71    8192  29.98    2.475
 0.412    3.357    21.62    45.62    6.005 1532.0  11.8   3.4
22:52:32.019           6   622502.9  4863.30    8192  29.95    2.461
 0.419    3.334    22.10    52.66    5.960 1532.3  11.8   3.4
22:52:42.018           7   620592.2  4848.38    8192  29.99    2.469
 0.416    3.347    22.76    48.33    5.966 1531.9  11.8   3.4
22:52:42.035     avg_2-7   621374.3  4854.49    8192  29.98    2.465
 0.416    3.343    25.40    52.66    5.972 1532.0  11.8   3.4

As you can see, this makes using this data really very simple for post
processing - I can just pick up the summary file, extract the "avg"
line per job and plot the results as required.

I'm yet to find in FIO if this summarization is possible. I keep
getting a lot of files which are taking a long time to post-process
into meaningful data (I need the average number of IOPs and latency
per disk/thread/run). I understand the output of FIO is not the same
as VDBench, but if I could get the combined IOPS in one file, the
latency in another and the bandwidth in a third, that would be usable.

Anyone have any pointers on how to implement this?

Many Thanks

Mark

                 reply	other threads:[~2023-11-27 16:09 UTC|newest]

Thread overview: [no followups] expand[flat|nested]  mbox.gz  Atom feed

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAM_M6akYhjNzHr=Ky7=XisZCneBRvTCOFtGU0ZeOWEB2mLWL2A@mail.gmail.com' \
    --to=smithersathpe@gmail.com \
    --cc=fio@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).