<html>
<head>
<meta content="text/html; charset=UTF-8" http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">These logs show different results. The
results you reported and pasted earlier included, "<span
style="font-size:11.0pt;font-family:"Calibri","sans-serif"">[2013-07-09
00:59:04.706390] I [afr-common.c:3856:afr_local_init]
0-firewall-scripts-replicate-0: no subvolumes up"</span>, which
would produce the "Transport endpoint not connected" error you
reported at first. These results look normal and should have
produced the behavior I described.<br>
<br>
42 is The Answer to Life, The Universe, and Everything.<br>
<br>
Re-establishing FDs and locks is an expensive operation. The
ping-timeout is long because it should not happen, but if there is
temporary network congestion you'd (normally) rather have your
volume remain up and pause than have to re-establish everything.
Typically, unless you expect your servers to crash often, leaving
ping-timeout at the default is best. YMMV and it's configurable in
case you know what you're doing and why.<br>
<br>
<br>
On 07/13/2013 04:58 PM, Greg Scott wrote:<br>
</div>
<blockquote
cite="mid:6a53ef17514c449190bbd0b1c529c0bb@mail2013.infrasupport.local"
type="cite">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta name="Generator" content="Microsoft Word 14 (filtered
medium)">
<style><!--
/* Font Definitions */
@font-face
        {font-family:Wingdings;
        panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
        {font-family:Wingdings;
        panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Tahoma;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";
        color:black;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
        {mso-style-priority:34;
        margin-top:0in;
        margin-right:0in;
        margin-bottom:0in;
        margin-left:.5in;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";
        color:black;}
span.EmailStyle17
        {mso-style-type:personal-reply;
        font-family:"Calibri","sans-serif";
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page WordSection1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
        {page:WordSection1;}
/* List Definitions */
@list l0
        {mso-list-id:96562339;
        mso-list-type:hybrid;
        mso-list-template-ids:-1187977828 -907670208 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;}
@list l0:level1
        {mso-level-start-at:0;
        mso-level-number-format:bullet;
        mso-level-text:-;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:"Calibri","sans-serif";
        mso-fareast-font-family:Calibri;
        mso-bidi-font-family:"Times New Roman";}
@list l0:level2
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:"Courier New";}
@list l0:level3
        {mso-level-number-format:bullet;
        mso-level-text:;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:Wingdings;}
@list l0:level4
        {mso-level-number-format:bullet;
        mso-level-text:;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:Symbol;}
@list l0:level5
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:"Courier New";}
@list l0:level6
        {mso-level-number-format:bullet;
        mso-level-text:;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:Wingdings;}
@list l0:level7
        {mso-level-number-format:bullet;
        mso-level-text:;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:Symbol;}
@list l0:level8
        {mso-level-number-format:bullet;
        mso-level-text:o;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:"Courier New";}
@list l0:level9
        {mso-level-number-format:bullet;
        mso-level-text:;
        mso-level-tab-stop:none;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:Wingdings;}
ol
        {margin-bottom:0in;}
ul
        {margin-bottom:0in;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
<div class="WordSection1">
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Log
files sent privately to Joe. If others from the community
want to look at them, I’m OK with posting them here. I
don’t think they have anything confidential. Now that I
know about that 42 second timeout, the behavior makes more
sense. Why 42? What’s special about 42? Is there a way
I adjust that down for my application to, say, 1 or 2
seconds?<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoListParagraph"
style="text-indent:-.25in;mso-list:l0 level1 lfo1"><!--[if !supportLists]--><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><span
style="mso-list:Ignore">-<span style="font:7.0pt
"Times New Roman"">
</span></span></span><!--[endif]--><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Greg<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<div>
<div style="border:none;border-top:solid #B5C4DF
1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span
style="font-size:10.0pt;font-family:"Tahoma","sans-serif";color:windowtext">From:</span></b><span
style="font-size:10.0pt;font-family:"Tahoma","sans-serif";color:windowtext">
Joe Julian [<a class="moz-txt-link-freetext" href="mailto:joe@julianfamily.org">mailto:joe@julianfamily.org</a>]
<br>
<b>Sent:</b> Saturday, July 13, 2013 4:28 PM<br>
<b>To:</b> Greg Scott; '<a class="moz-txt-link-abbreviated" href="mailto:gluster-users@gluster.org">gluster-users@gluster.org</a>'<br>
<b>Subject:</b> Re: [Gluster-users] One node goes
offline, the other node can't see the replicated volume
anymore<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Huh.. this was in my sent folder... let's
try again.<br>
<br>
There's something missing from this picture. The logs show
that the client is connecting to both servers, but it only
shows the disconnection from one and claims that it's not
connected to any bricks after that.<br>
<br>
Here's the data I'd like to have you generate:<br>
<br>
unmount the clients<br>
gluster volume set firewall-scripts
diagnostics.client-log-level DEBUG<br>
gluster volume set firewall-scripts
diagnostics.brick-log-level DEBUG<br>
systemctl stop glusterd.service<br>
truncate the client, glusterd, and server logs<br>
systemctl start glusterd<br>
mount /firewall-scripts<br>
Do your iptables disconnect<br>
telnet $this_host_ip 24007 # report whether or not it
establishes a connection<br>
<span
style="font-size:11.0pt;font-family:"Calibri","sans-serif"">ls
/firewall-scripts<br>
wait 42 seconds</span><br>
<span
style="font-size:11.0pt;font-family:"Calibri","sans-serif"">ls
/firewall-scripts<br>
Remove the iptables rule<br>
ls /firewall-scripts<br>
tar up the logs and email them to me.<br>
<br>
You can reset the log-level:<br>
</span><br>
gluster volume reset firewall-scripts
diagnostics.client-log-level<br>
gluster volume reset firewall-scripts
diagnostics.brick-log-level<br>
<br>
lastly, do you have a loopback interface (lo) on 127.0.0.1 and
is localhost defined in /etc/hosts?<o:p></o:p></p>
</div>
</blockquote>
<br>
</body>
</html>