<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

<head>

  <meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">

</head>

<body bgcolor="#ffffff" text="#000000">

Hi,<br>

<br>

I really appreciate the information from Raghavendra regarding how the

performance translators affect MySQL's integrity.&nbsp; This week I spent

some hours coarsely testing various performance options, and I would be

interested in validating whether I'm getting typical results as well as

learning ways to improve from here.&nbsp; Perhaps my experience would be

useful for others.<br>

<br>

I'm running 2.0rc1 with the 2.6.27 kernel.&nbsp; I have a 2-node cluster.&nbsp;

GlusterFS runs on both nodes, and MySQL runs on the active node.&nbsp; If

the active node fails or is put on standby, MySQL fires up on the other

node.&nbsp; Unlike MySQL Replication with its slave lag, I know my data

changes are durable in the event of a server failure.&nbsp; Most people use

DRBD for this, but I'm hoping to enjoy GlusterFS's benefits of handling

split-brain situations at the file level instead of the volume level,

future scalability avenues, and general ease of use.&nbsp; Hopefully DRBD

doesn't have unmatchable performance advantages I'm overlooking.<br>

<br>

I've been running two tests.&nbsp; They aren't necessarily realistic usage,

but I'm just looking for the big settings that affect performance by a

factor of two or more.&nbsp; My database files are about 1GB.&nbsp; The first

test is "time mysqldump --no-data" which simply prints out the schema.&nbsp;

The second test is "time mysqldump | gzip &gt; /glusterfs/export.gz"

which exports the entire database, compresses it, and saves it onto the

GlusterFS filesystem.&nbsp; The 450MB of exported SQL statements compress to

75MB.<br>

<br>

I'm going to report my testing in order, because the changes were

cumulative.&nbsp; I used server-side io-threads from the start.&nbsp; Before I

started recording the speed, I discovered that running in single

process mode was dramatically faster.&nbsp; At that time, I also configured

read-subvolume to use the local server.&nbsp; At this point I started

measuring:<br>

<ul>

  <li>Printing schema: 18s</li>

  <li>Compressed export: 2m45s</li>

</ul>

For a benchmark, I moved MySQL's datafiles to the local ext3 disk (but

kept writing the export to GlusterFS).&nbsp; It was 10-100X faster!<br>

<ul>

  <li>Printing schema: 0.2s</li>

  <li>Compressed export: 28s</li>

</ul>

There was no appreciable changes from installing fuse-2.7.4glfs11,

using Booster, or running blockdev to increase readahead from 256 to

16384.<br>

<br>

Adding the io-cache client-side translator didn't affect printing the

schema but cut the export in half:<br>

<ul>

  <li>Compressed export: 1m10s<br>

  </li>

</ul>

Going off on a tangent, I shut down the remote node.&nbsp; This increased

the performance by an order of magnitude:<br>

<ul>

  <li>Printing schema: 2s</li>

  <li>Compressed export: 24s<br>

  </li>

</ul>

I resumed testing with both servers running.&nbsp; Switching the I/O

scheduler to deadline had no appreciable affect.&nbsp; Neither did adding

client-side io-threads, or server-side write-behind.&nbsp; Surprisingly, I

found that changing read-subvolume to the remove server had only a

minor penalty.<br>

<br>

Then I noticed that the remote server was listed first in the volfile,

which means that it gets used for the lock server.&nbsp; Swapping the order

in the volfile on one server seemed to cause split-brain errors -- does

the order need to be the same on both servers?&nbsp; When I changed both

servers' volfiles to use the active MySQL server as the lock server,

there was a dramatic performance increase, to roughly around the 2s/24s

speed I saw with one server down.&nbsp; (I lost the exact stats.)<br>

<br>

In summary, running in single process mode, client-side io-cache, and a

local lock file were the changes that made a significant difference.<br>

<br>

<br>

Since I'm only going to have one server writing to the filesystem at a

time, I could mount it read-only (or not at all) on the other server.&nbsp;

Would that mean I could safely set data-lock-server-count=0 and

entry-lock-server-count=0 because I can be confident that there won't

be any conflicting writes?&nbsp; I don't want to take unnecessary risks, but

it seems like unnecessary overhead for my use case.<br>

<br>

Are there any additional recommended performance changes?&nbsp; Would

server-side AFR change things?&nbsp; Printing the schema still runs 10X

faster when the database is on a local ext3 filesystem.<br>

<br>

Thank you,<br>

David<br>

<pre class="moz-signature" cols="72">-- 

David Sickmiller

</pre>

</body>

</html>