<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    <blockquote type="cite">

      <div class="moz-cite-prefix">

        On 01/16/2013 02:56 PM, Dan Bretherton wrote:<br>

      </div>

      <blockquote cite="mid:50F6BF89.40507@reading.ac.uk" type="cite">On

        01/15/2013 08:17 PM, <a class="moz-txt-link-abbreviated" href="mailto:gluster-users-request@gluster.org">gluster-users-request@gluster.org</a> wrote: <br>

        <blockquote type="cite">Date: Tue, 15 Jan 2013 15:17:00 -0500

          <br>

          From: Jeff Darcy <a class="moz-txt-link-rfc2396E" href="mailto:jdarcy@redhat.com">&lt;jdarcy@redhat.com&gt;</a>

          <br>

          To: <a class="moz-txt-link-abbreviated" href="mailto:gluster-users@gluster.org">gluster-users@gluster.org</a>

          <br>

          Subject: Re: [Gluster-users] Targeted fix-layout?

          <br>

          Message-ID: <a class="moz-txt-link-rfc2396E" href="mailto:50F5B93C.5040802@redhat.com">&lt;50F5B93C.5040802@redhat.com&gt;</a>

          <br>

          Content-Type: text/plain; charset=ISO-8859-1; format=flowed

          <br>

          <br>

          On 01/15/2013 01:10 PM, Dan Bretherton wrote:

          <br>

          <blockquote type="cite">I am running a fix-layout operation on

            a volume after seeing errors mentioning

            <br>

            "anomalies" and "holes" in the logs.&nbsp; There is a particular

            directory that is

            <br>

            giving trouble and I would like to be able to run the layout

            fix on that

            <br>

            first.&nbsp; Users are experiencing various I/O errors including

            "invalid argument"

            <br>

            and "Unknown error 526", but after running for a week the

            volume wide

            <br>

            fix-layout doesn't seem to have reached this particular

            directory yet.

            <br>

            Fix-layout takes a long time because there are millions of

            files in the volume

            <br>

            and the CPU load is consistently very high on all the

            servers while it is

            <br>

            running, sometimes over 20. Therefore I really need to find

            a way to target

            <br>

            particular directories or speed up the volume wide

            fix-layout.

            <br>

          </blockquote>

          You should be able to do the following command on a client to

          fix the layout

          <br>

          for just one directory (it's the same xattr used by the

          rebalance command).

          <br>

          <br>

          &nbsp;&nbsp;&nbsp;&nbsp;setfattr -n distribute.fix.layout -v "anything"

          /bad/directory

          <br>

          <blockquote type="cite">I have no idea what caused these

            errors but it could be related to the previous

            <br>

            fix-layout operation, which I started following the addition

            of a new pair of

            <br>

            bricks, not having completed successfully.&nbsp; The problem is

            that the rebalance

            <br>

            operation on one or more servers often fails before

            completing and there is no

            <br>

            way (that I know of) to restart or resume the process on one

            server.&nbsp; Every

            <br>

            time this happens I stop the fix-layout and start it again,

            but it has never

            <br>

            completed successfully on every server despite sometimes

            running for several

            <br>

            weeks.

            <br>

            <br>

            One other possible cause I can think of is my recent policy

            of using XFS for

            <br>

            new bricks instead of ext4.&nbsp; The reason I think this might

            be causing the

            <br>

            problem is that none of the other volumes have any XFS

            bricks yet and they

            <br>

            aren't experiencing any I/O errors.&nbsp; Are there any special

            mount options

            <br>

            required for XFS, and is there any reason why a volume

            shouldn't contain a

            <br>

            mixture of ext4 and XFS bricks?

            <br>

          </blockquote>

          It doesn't seem like that should be a problem, but maybe

          someone else knows

          <br>

          something about ext4/XFS differences that could shed some

          light.

          <br>

        </blockquote>

        Thanks Jeff, I'll give that a try. <br>

        <br>

        Should the xattr name be trusted.distribute.fix.layout by the

        way? When I try with distribute.fix.layout I get the error

        "Operation not supported". <br>

        <br>

        -Dan. <br>

      </blockquote>

      <br>

      <pre>On 01/16/2013 09:56 AM, Dan Bretherton wrote:

&gt;<i> Should the xattr name be trusted.distribute.fix.layout by the way? When 

</i>&gt;<i> I try with distribute.fix.layout I get the error "Operation not supported".

</i>

I just re-examined and re-ran the code, and distribute.fix.layout does

seem to be correct.  You're doing this on the client side, right?  The

other thing to check would be versions, since I hardly ever run a

version that's more than a day old and that often means I'm using

features that haven't made it into a release yet.  I think that one has

been there for a while, though.

</pre>

    </blockquote>

    Thanks for checking the code for me.&nbsp; I wasn't doing it on the

    client - thanks for pointing out my mistake.&nbsp; I have tested he

    targeted fix layout on a test volume and verified that there weren't

    any detrimental effects, but I don't know how to reproduce the

    layout errors I am seeing in the production volume in order find out

    if the targeted layout fix actually works.&nbsp; Unfortunately I haven't

    been able to try it on the production volume because the owner

    doesn't want to risk it.&nbsp; He is also concerned about performance

    deteriorating any more, given that the volume wide layout fix is

    still going and still slowing things down a lot.<br>

    <br>

    I had to extend another volume a couple of weeks ago and the layout

    fix for that one is now running at the same time.&nbsp; One server now

    has a load of over 70 most of the time (mostly glusterfsd), but none

    of the others seem to be particularly busy.&nbsp; I restarted the server

    in question but the CPU load quickly went up to 70 again.&nbsp; I can't

    see any particular reason why this one server should be so badly

    affected by the layout fixing processes.&nbsp; It isn't a particularly

    big server, with only five 3TB bricks involved in the two volumes

    that were extended.&nbsp; I can't help thinking that this is the reason

    why the volume layout fixes are taking such a long time, even though

    the rebalance processes run on all the servers simultaneously.&nbsp; Can

    anyone suggest a way to troubleshoot this problem?&nbsp; The rebalance

    logs don't show anything unusual but glustershd.log has a lot of

    metadata split-brain warnings.&nbsp;&nbsp; The brick logs are full of scary

    looking warnings but none flagged 'E' or 'C'.&nbsp; The trouble is that I

    see messages like these on all the servers, and I can find nothing

    unusual about the server with a CPU load of 70.<br>

    <br>

    -Dan.<br>

    <br>

  </body>

</html>