<div dir="ltr"><div class="gmail_extra"><div>Hello Pranith,<br><br>Thanks for your reply. I&#39;m using 3.5.2. <br><br>Is it possible that Windows doesn&#39;t release the files after a write happens? <br><br></div>Because the self-heal often never occurs. Just this morning we discovered that when a web server read from the other node, some files that had been changed days ago still had content from before the edit.<br>

<br></div><div class="gmail_extra">How can I ensure that everything syncs reliably and consistently when mounting from SMB? Is Samba VFS more reliable in this respect?<br><br></div><div class="gmail_extra">Tiemen<br></div>

<div class="gmail_extra"><br><div class="gmail_quote">On 7 August 2014 03:14, Pranith Kumar Karampuri <span dir="ltr">&lt;<a href="mailto:pkarampu@redhat.com" target="_blank">pkarampu@redhat.com</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

  
    
  
  <div text="#000000" bgcolor="#FFFFFF">
    hi Tiemen,<br>
    From the logs you have pasted, it doesn&#39;t seem there are any
    split-brains. It is just performing self-heals. What version of
    glusterfs are you using? Self-heals sometimes don&#39;t happen if the
    data operations from mount are in progress because it tries to give
    that more priority. Missing files should be created once the
    self-heal completes on the parent directory of those files.<br>
    <br>
    Pranith<div><div class="h5"><br>
    <br>
    <div>On 08/07/2014 01:40 AM, Tiemen Ruiten
      wrote:<br>
    </div>
    </div></div><blockquote type="cite"><div><div class="h5">
      
      <div>Sorry, I seem to have messed up the
        subject. <br>
        <br>
        I should add, I&#39;m mounting these volumes through GlusterFS FUSE,
        not the Samba VFS plugin.<br>
         <br>
        On 06-08-14 21:47, Tiemen Ruiten wrote:<br>
      </div>
      <blockquote type="cite">
        <div dir="ltr">
          <div>
            <div>
              <div>Hello,<br>
                <br>
                I&#39;m running into some serious problems with Gluster +
                CTDB and Samba. What I have:<br>
                <br>
              </div>
              A two node replicated gluster cluster set up to share
              volumes using Samba setup according to this guide: <a href="https://download.gluster.org/pub/gluster/glusterfs/doc/Gluster_CTDB_setup.v1.pdf" target="_blank">https://download.gluster.org/pub/gluster/glusterfs/doc/Gluster_CTDB_setup.v1.pdf</a><br>


              <br>
            </div>
            When we edit or copy files into the volume via SMB (from a
            Windows client accessing through a samba file share) this
            inevitably leads to a split-brain scenario. For example:<br>
            <br>
            gluster&gt; volume heal fl-webroot info<br>
            Brick ankh.int.rdmedia.com:/export/glu/web/flash/webroot/<br>
            &lt;gfid:0b162618-e46f-4921-92d0-c0fdb5290bf5&gt;<br>
            &lt;gfid:a259de7d-69fc-47bd-90e7-06a33b3e6cc8&gt;<br>
            Number of entries: 2<br>
            <br>
            Brick morpork.int.rdmedia.com:/export/glu/web/flash/webroot/<br>
            /LandingPage_Saturn_Production/images<br>
            /LandingPage_Saturn_Production<br>
            /LandingPage_Saturn_Production/Services/v2<br>
            /LandingPage_Saturn_Production/images/country/be<br>
            /LandingPage_Saturn_Production/bin<br>
            /LandingPage_Saturn_Production/Services<br>
            /LandingPage_Saturn_Production/images/generic<br>
            /LandingPage_Saturn_Production/aspnet_client/system_web<br>
            /LandingPage_Saturn_Production/images/country<br>
            /LandingPage_Saturn_Production/Scripts<br>
            /LandingPage_Saturn_Production/aspnet_client<br>
            /LandingPage_Saturn_Production/images/country/fr<br>
            Number of entries: 12<br>
            <br>
            gluster&gt; volume heal fl-webroot info<br>
            Brick ankh.int.rdmedia.com:/export/glu/web/flash/webroot/<br>
            &lt;gfid:0b162618-e46f-4921-92d0-c0fdb5290bf5&gt;<br>
            &lt;gfid:a259de7d-69fc-47bd-90e7-06a33b3e6cc8&gt;<br>
            Number of entries: 2<br>
            <br>
            Brick morpork.int.rdmedia.com:/export/glu/web/flash/webroot/<br>
            /LandingPage_Saturn_Production/images<br>
            /LandingPage_Saturn_Production<br>
            /LandingPage_Saturn_Production/Services/v2<br>
            /LandingPage_Saturn_Production/images/country/be<br>
            /LandingPage_Saturn_Production/bin<br>
            /LandingPage_Saturn_Production/Services<br>
            /LandingPage_Saturn_Production/images/generic<br>
            /LandingPage_Saturn_Production/aspnet_client/system_web<br>
            /LandingPage_Saturn_Production/images/country<br>
            /LandingPage_Saturn_Production/Scripts<br>
            /LandingPage_Saturn_Production/aspnet_client<br>
            /LandingPage_Saturn_Production/images/country/fr<br>
            <br>
            <br>
            <br>
          </div>
          <div>Sometimes self-heal works, sometimes it doesn&#39;t:<br>
            <br>
            [2014-08-06 19:32:17.986790] E
            [afr-self-heal-common.c:2868:afr_log_self_heal_completion_status]
            0-fl-webroot-replicate-0:  entry self heal  failed,   on
            /LandingPage_Saturn_Production/Services/v2<br>
            [2014-08-06 19:32:18.008330] W
            [client-rpc-fops.c:2772:client3_3_lookup_cbk]
            0-fl-webroot-client-0: remote operation failed: No such file
            or directory. Path:
            &lt;gfid:a89d7a07-2e3d-41ee-adcc-cb2fba3d2282&gt;
            (a89d7a07-2e3d-41ee-adcc-cb2fba3d2282)<br>
            [2014-08-06 19:32:18.024057] I
            [afr-self-heal-common.c:2868:afr_log_self_heal_completion_status]
            0-fl-webroot-replicate-0:  gfid or missing entry self heal 
            is started, metadata self heal  is successfully completed,
            backgroung data self heal  is successfully completed,  data
            self heal from fl-webroot-client-1  to sinks 
            fl-webroot-client-0, with 0 bytes on fl-webroot-client-0,
            168 bytes on fl-webroot-client-1,  data - Pending matrix:  [
            [ 0 0 ] [ 1 0 ] ]  metadata self heal from source
            fl-webroot-client-1 to fl-webroot-client-0,  metadata -
            Pending matrix:  [ [ 0 0 ] [ 2 0 ] ], on
            /LandingPage_Saturn_Production/Services/v2/PartnerApiService.asmx<br>
            <br>
          </div>
          <div><b>More seriously, some files are simply missing on one
              of the nodes without any error in the logs or notice when
              running gluster volume heal $volume info.</b><br>
          </div>
          <div><br>
          </div>
          <div>Of course I can provide any log file necessary.<br clear="all">
          </div>
          <div>
            <div>
              <div>
                <div></div></div></div></div></div></blockquote></div></div></blockquote></div></blockquote></div><br></div></div>