<html>
  <head>
    <meta content="text/html; charset=UTF-8" http-equiv="Content-Type">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    <tt>Brick disconnected with ping-time out:</tt><tt><br>
    </tt><tt><br>
    </tt><tt>Here is the log message</tt><tt><br>
    </tt><tt>[2014-05-19 04:29:38.133266] I [MSGID: 100030]
      [glusterfsd.c:1998:main] 0-/build/install/sbin/glusterfsd: Started
      running /build/install/sbi    n/glusterfsd version 3.5qa2 (args:
      /build/install/sbin/glusterfsd -s build.gluster.org --volfile-id
      /snaps/patchy_snap1/3f2ae3fbb4a74587b1a9   
      1013f07d327f.build.gluster.org.var-run-gluster-snaps-3f2ae3fbb4a74587b1a91013f07d327f-brick3
      -p /var/lib/glusterd/snaps/patchy_snap1/3f2ae3f   
      bb4a74587b1a91013f07d327f/run/build.gluster.org-var-run-gluster-snaps-3f2ae3fbb4a74587b1a91013f07d327f-brick3.pid
      -S /var/run/51fe50a6faf0aa    e006c815da946caf3a.socket
      --brick-name
      /var/run/gluster/snaps/3f2ae3fbb4a74587b1a91013f07d327f/brick3 -l
      /build/install/var/log/glusterfs/br   
      icks/var-run-gluster-snaps-3f2ae3fbb4a74587b1a91013f07d327f-brick3.log
      --xlator-option
      *-posix.glusterd-uuid=494ef3cd-15fc-4c8c-8751-2d441ba    7b4b0
      --brick-port 49164 --xlator-option
      3f2ae3fbb4a74587b1a91013f07d327f-server.listen-port=49164)                                        
    </tt><tt><br>
    </tt><tt>  2 [2014-05-19 04:29:38.141118] I
      [rpc-clnt.c:988:rpc_clnt_connection_init] 0-glusterfs: defaulting
      ping-timeout to 30secs</tt><tt><br>
    </tt><font color="#cc0000"><tt>  3 [2014-05-19 04:30:09.139521] C
        [rpc-clnt-ping.c:105:rpc_clnt_ping_timer_expired] 0-glusterfs:
        server 10.3.129.13:24007 has not responded in the last 30
        seconds, disconnecting.</tt></font><tt><br>
    </tt><tt> </tt><tt><br>
    </tt><tt><br>
      <br>
      Patch '<a class="moz-txt-link-freetext" href="http://review.gluster.org/#/c/7753/">http://review.gluster.org/#/c/7753/</a>' will fix the problem,
      where ping-timer will be disabled by default for all the rpc
      connection except for glusterd-glusterd (set to 30sec) and
      client-glusterd (set to 42sec).<br>
      <br>
      <br>
    </tt><tt>Thanks,</tt><tt><br>
    </tt><tt>Vijay</tt><tt><br>
    </tt><br>
    <br>
    <div class="moz-cite-prefix">On Monday 19 May 2014 11:56 AM, Pranith
      Kumar Karampuri wrote:<br>
    </div>
    <blockquote
      cite="mid:2008110912.7635348.1400480797734.JavaMail.zimbra@redhat.com"
      type="cite">
      <pre wrap="">The latest build failure also has the same issue:
Download it from here:
<a class="moz-txt-link-freetext" href="http://build.gluster.org:443/logs/glusterfs-logs-20140518%3a22%3a27%3a31.tgz">http://build.gluster.org:443/logs/glusterfs-logs-20140518%3a22%3a27%3a31.tgz</a>

Pranith

----- Original Message -----
</pre>
      <blockquote type="cite">
        <pre wrap="">From: "Vijaikumar M" <a class="moz-txt-link-rfc2396E" href="mailto:vmallika@redhat.com">&lt;vmallika@redhat.com&gt;</a>
To: "Joseph Fernandes" <a class="moz-txt-link-rfc2396E" href="mailto:josferna@redhat.com">&lt;josferna@redhat.com&gt;</a>
Cc: "Pranith Kumar Karampuri" <a class="moz-txt-link-rfc2396E" href="mailto:pkarampu@redhat.com">&lt;pkarampu@redhat.com&gt;</a>, "Gluster Devel" <a class="moz-txt-link-rfc2396E" href="mailto:gluster-devel@gluster.org">&lt;gluster-devel@gluster.org&gt;</a>
Sent: Monday, 19 May, 2014 11:41:28 AM
Subject: Re: Spurious failures because of nfs and snapshots

Hi Joseph,

In the log mentioned below, it say ping-time is set to default value
30sec.I think issue is different.
Can you please point me to the logs where you where able to re-create
the problem.

Thanks,
Vijay



On Monday 19 May 2014 09:39 AM, Pranith Kumar Karampuri wrote:
</pre>
        <blockquote type="cite">
          <pre wrap="">hi Vijai, Joseph,
     In 2 of the last 3 build failures,
     <a class="moz-txt-link-freetext" href="http://build.gluster.org/job/regression/4479/console">http://build.gluster.org/job/regression/4479/console</a>,
     <a class="moz-txt-link-freetext" href="http://build.gluster.org/job/regression/4478/console">http://build.gluster.org/job/regression/4478/console</a> this
     test(tests/bugs/bug-1090042.t) failed. Do you guys think it is better
     to revert this test until the fix is available? Please send a patch
     to revert the test case if you guys feel so. You can re-submit it
     along with the fix to the bug mentioned by Joseph.

Pranith.

----- Original Message -----
</pre>
          <blockquote type="cite">
            <pre wrap="">From: "Joseph Fernandes" <a class="moz-txt-link-rfc2396E" href="mailto:josferna@redhat.com">&lt;josferna@redhat.com&gt;</a>
To: "Pranith Kumar Karampuri" <a class="moz-txt-link-rfc2396E" href="mailto:pkarampu@redhat.com">&lt;pkarampu@redhat.com&gt;</a>
Cc: "Gluster Devel" <a class="moz-txt-link-rfc2396E" href="mailto:gluster-devel@gluster.org">&lt;gluster-devel@gluster.org&gt;</a>
Sent: Friday, 16 May, 2014 5:13:57 PM
Subject: Re: Spurious failures because of nfs and snapshots


Hi All,

tests/bugs/bug-1090042.t :

I was able to reproduce the issue i.e when this test is done in a loop

for i in {1..135} ; do  ./bugs/bug-1090042.t

When checked the logs
[2014-05-16 10:49:49.003978] I [rpc-clnt.c:973:rpc_clnt_connection_init]
0-management: setting frame-timeout to 600
[2014-05-16 10:49:49.004035] I [rpc-clnt.c:988:rpc_clnt_connection_init]
0-management: defaulting ping-timeout to 30secs
[2014-05-16 10:49:49.004303] I [rpc-clnt.c:973:rpc_clnt_connection_init]
0-management: setting frame-timeout to 600
[2014-05-16 10:49:49.004340] I [rpc-clnt.c:988:rpc_clnt_connection_init]
0-management: defaulting ping-timeout to 30secs

The issue is with ping-timeout and is tracked under the bug

<a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/show_bug.cgi?id=1096729">https://bugzilla.redhat.com/show_bug.cgi?id=1096729</a>


The workaround is mentioned in
<a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/show_bug.cgi?id=1096729#c8">https://bugzilla.redhat.com/show_bug.cgi?id=1096729#c8</a>


Regards,
Joe

----- Original Message -----
From: "Pranith Kumar Karampuri" <a class="moz-txt-link-rfc2396E" href="mailto:pkarampu@redhat.com">&lt;pkarampu@redhat.com&gt;</a>
To: "Gluster Devel" <a class="moz-txt-link-rfc2396E" href="mailto:gluster-devel@gluster.org">&lt;gluster-devel@gluster.org&gt;</a>
Cc: "Joseph Fernandes" <a class="moz-txt-link-rfc2396E" href="mailto:josferna@redhat.com">&lt;josferna@redhat.com&gt;</a>
Sent: Friday, May 16, 2014 6:19:54 AM
Subject: Spurious failures because of nfs and snapshots

hi,
     In the latest build I fired for review.gluster.com/7766
     (<a class="moz-txt-link-freetext" href="http://build.gluster.org/job/regression/4443/console">http://build.gluster.org/job/regression/4443/console</a>) failed because
     of
     spurious failure. The script doesn't wait for nfs export to be
     available. I fixed that, but interestingly I found quite a few
     scripts
     with same problem. Some of the scripts are relying on 'sleep 5' which
     also could lead to spurious failures if the export is not available
     in 5
     seconds. We found that waiting for 20 seconds is better, but 'sleep
     20'
     would unnecessarily delay the build execution. So if you guys are
     going
     to write any scripts which has to do nfs mounts, please do it the
     following way:

EXPECT_WITHIN 20 "1" is_nfs_export_available;
TEST mount -t nfs -o vers=3 $H0:/$V0 $N0;

Please review <a class="moz-txt-link-freetext" href="http://review.gluster.com/7773">http://review.gluster.com/7773</a> :-)

I saw one more spurious failure in a snapshot related script
tests/bugs/bug-1090042.t on the next build fired by Niels.
Joesph (CCed) is debugging it. He agreed to reply what he finds and share
it
with us so that we won't introduce similar bugs in future.

I encourage you guys to share what you fix to prevent spurious failures in
future.

Thanks
Pranith

</pre>
          </blockquote>
        </blockquote>
        <pre wrap="">

</pre>
      </blockquote>
    </blockquote>
    <br>
  </body>
</html>