<div dir="ltr">Hi Keith,<br><br>Work is being done on AFR to make it stable. Please wait for a stable release.<br><br><div class="gmail_quote">On Tue, Sep 23, 2008 at 6:41 PM, Keith Freedman <span dir="ltr">&lt;<a href="mailto:freedman@freeformit.com">freedman@freeformit.com</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">as a followup.. &nbsp;I have shutdown the &quot;broken&quot; one in the pair since<br>
it kept crashing.<br>
the working one is running on it&#39;s own but gluster dies every 10 mins or so.<br>
seems 1.4pre5 doesn&#39;t like being an AFR client all on it&#39;s own?<br>
<br>
I&#39;m going to see if it works with only itself as the AFR subvolumes list<br>
<br>
<br>
<br>
<br>
2008-09-23 07:24:00 E [afr.c:3434:afr_statfs_cbk] home: (child=home2)<br>
op_ret=-1 op_errno=107(Transport endpoint is not connected)<br>
2008-09-23 07:24:03 E [afr.c:3434:afr_statfs_cbk] home: (child=home2)<br>
op_ret=-1 op_errno=107(Transport endpoint is not connected)<br>
2008-09-23 07:24:28 E [afr.c:4759:afr_create_cbk] home:<br>
(path=/glusterfile/tmp/1222179868.H882395P21565.HOSTNAME child=home2)<br>
op_ret=-1 op_errno=107(Transport endpoint is not connected)<br>
<div class="Ih2E3d">pending frames:<br>
<br>
Signal received: 11<br>
configuration details:argp 1<br>
backtrace 1<br>
dlfcn 1<br>
fdatasync 1<br>
libpthread 1<br>
llistxattr 1<br>
setfsid 1<br>
spinlock 1<br>
epoll.h 1<br>
xattr.h 1<br>
tv_nsec 1<br>
package-string: glusterfs 1.4.0pre5<br>
</div>/lib64/libc.so.6[0x300d0322a0]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_incver_internal_incver_cbk+0x38)[0xe5250c]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/protocol/client.so(client_incver+0xb9)[0xa29072]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_incver_internal_lock_cbk+0x6d8)[0xe52d75]<br>
/usr/local/lib/libglusterfs.so.0[0x125c5b]<br>
/usr/local/lib/libglusterfs.so.0(mop_lock_impl+0x103)[0x12a6aa]<br>
/usr/local/lib/libglusterfs.so.0(default_lock+0x126)[0x125d88]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_incver_internal_fd+0x33a)[0xe530d1]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_close+0x26d)[0xe5c64d]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/mount/fuse.so[0x7299fa8]<br>
/lib64/libfuse.so.2[0x10824b2]<br>
/usr/local/lib/glusterfs/1.4.0pre5/xlator/mount/fuse.so[0x729cc35]<br>
/lib64/libpthread.so.0[0x300dc0729a]<br>
/lib64/libc.so.6(clone+0x6d)[0x300d0e439d]<br>
---------<br>
<div><div></div><div class="Wj3C7c"><br>
<br>
At 07:09 AM 9/23/2008, Keith Freedman wrote:<br>
&gt;I had a pair of servers running 1.4pre5 in AFR.<br>
&gt;they&#39;ve been running fine for over a week, and suddenly today one of<br>
&gt;them had decided it just will crash anytime it tries to AFR a file.<br>
&gt;<br>
&gt;strange is, it seems to get updates form the other server.<br>
&gt;it&#39;s not up long enough to do any thorough testing, but when I do<br>
&gt;this from the &quot;good&quot; server:<br>
&gt;echo `hostname` `date` &gt; /gluster/shared/file<br>
&gt;I can read the correct hostname and date from the &quot;bad&quot; server, but<br>
&gt;when I do the same thing on the &quot;bad&quot; server, it crashes instantly.<br>
&gt;<br>
&gt;running FC9 with default fuse:<br>
&gt;fuse-2.7.4-8_10.fc9.i386<br>
&gt;<br>
&gt;I&#39;m going to re-install fuse thinking that perhaps something got<br>
&gt;corrupted, but it&#39;s odd it happened while the servers been goign just<br>
&gt;fine for days.<br>
&gt;<br>
&gt;I turned on debugging and here&#39;s what it&#39;s producing<br>
&gt;where the log ends is where the server crashed while I was tailing<br>
&gt;the logfile:<br>
&gt;2008-09-23 06:56:31 D [inode.c:311:__inode_retire] fuse/inode:<br>
&gt;retiring inode(0) lru=21/0 active=21 purge=29<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:437:fuse_lookup] glusterfs-fuse:<br>
&gt;223: LOOKUP /uservideo/public_html/Guests/Images/Misc/.htaccess<br>
&gt;2008-09-23 06:56:31 D [inode.c:443:__inode_create] fuse/inode: create inode(0)<br>
&gt;2008-09-23 06:56:31 D [inode.c:268:__inode_activate] fuse/inode:<br>
&gt;activating inode(0), lru=21/0 active=22 purge=29<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]<br>
&gt;glusterfs-fuse: 222: FLUSH() ERR =&gt; 0<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1599:fuse_release]<br>
&gt;glusterfs-fuse: 224: CLOSE 0x8fecf58<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:562:fuse_getattr]<br>
&gt;glusterfs-fuse: 225: FGETATTR 20971566<br>
&gt;(/user2/public_html/shopping/var/run/classes/kernel/Profiler.php/0x8fece28)<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:496:fuse_attr_cbk]<br>
&gt;glusterfs-fuse: 225: FSTAT()<br>
&gt;/user2/public_html/shopping/var/run/classes/kernel/Profiler.php =&gt; 20971566<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:398:fuse_entry_cbk]<br>
&gt;glusterfs-fuse: 223: LOOKUP()<br>
&gt;/uservideo/public_html/Guests/Images/Misc/.htaccess =&gt; -1 (No such<br>
&gt;file or directory)<br>
&gt;2008-09-23 06:56:31 D [inode.c:311:__inode_retire] fuse/inode:<br>
&gt;retiring inode(0) lru=21/0 active=21 purge=30<br>
&gt;2008-09-23 06:56:31 D [inode.c:268:__inode_activate] fuse/inode:<br>
&gt;activating inode(11010602), lru=20/0 active=22 purge=30<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1429:fuse_open] glusterfs-fuse:<br>
&gt;226: OPEN /uservideo/public_html/Guests/Images/Misc/userLogo.jpg<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]<br>
&gt;glusterfs-fuse: 224: CLOSE() ERR =&gt; 0<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1572:fuse_flush] glusterfs-fuse:<br>
&gt;227: FLUSH 0x8fece28<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:603:fuse_fd_cbk] glusterfs-fuse:<br>
&gt;226: OPEN() /uservideo/public_html/Guests/Images/Misc/userLogo.jpg<br>
&gt;=&gt; 0x8fecd50<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1487:fuse_readv] glusterfs-fuse:<br>
&gt;228: READ (0x8fecd50, size=4096, offset=0)<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]<br>
&gt;glusterfs-fuse: 227: FLUSH() ERR =&gt; 0<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1455:fuse_readv_cbk]<br>
&gt;glusterfs-fuse: 228: READ =&gt; 3513/4096,0/3513<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1599:fuse_release]<br>
&gt;glusterfs-fuse: 229: CLOSE 0x8fece28<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:437:fuse_lookup] glusterfs-fuse:<br>
&gt;230: LOOKUP /user2/public_html/shopping/var/run/classes/kernel/Database.php<br>
&gt;2008-09-23 06:56:31 D [inode.c:443:__inode_create] fuse/inode: create inode(0)<br>
&gt;2008-09-23 06:56:31 D [inode.c:268:__inode_activate] fuse/inode:<br>
&gt;activating inode(0), lru=20/0 active=23 purge=30<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1572:fuse_flush] glusterfs-fuse:<br>
&gt;231: FLUSH 0x8fecd50<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]<br>
&gt;glusterfs-fuse: 229: CLOSE() ERR =&gt; 0<br>
&gt;2008-09-23 06:56:31 D [inode.c:287:__inode_passivate] fuse/inode:<br>
&gt;passivating inode(20971566) lru=21/0 active=22 purge=30<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:370:fuse_entry_cbk]<br>
&gt;glusterfs-fuse: 230: LOOKUP()<br>
&gt;/user2/public_html/shopping/var/run/classes/kernel/Database.php =&gt; 20971567<br>
&gt;2008-09-23 06:56:31 D [inode.c:287:__inode_passivate] fuse/inode:<br>
&gt;passivating inode(20971567) lru=22/0 active=21 purge=30<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]<br>
&gt;glusterfs-fuse: 231: FLUSH() ERR =&gt; 0<br>
&gt;2008-09-23 06:56:31 D [fuse-bridge.c:1599:fuse_release]<br>
&gt;glusterfs-fuse: 232: CLOSE 0x8fecd50<br>
&gt;2008-09-23 06:56:31 D [fuse-b<br>
&gt;<br>
&gt;here&#39;s some more from when the server rebooted<br>
&gt;+-----<br>
&gt;2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for &#39;home1&#39;<br>
&gt;2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt<br>
&gt;to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/storage/posix.so<br>
&gt;2008-09-23 07:04:39 D [spec.y:219:section_type] parser:<br>
&gt;Type:home1:storage/posix<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:home1:directory:/gluster/home<br>
&gt;2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for<br>
&gt;&#39;posix-locks-home1&#39;<br>
&gt;2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt<br>
&gt;to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/features/posix-locks.so<br>
&gt;2008-09-23 07:04:39 D [xlator.c:318:xlator_set_type]<br>
&gt;posix-locks-home1: dlsym(notify) on<br>
&gt;/usr/local/lib/glusterfs/1.4.0pre5/xlator/features/posix-locks.so:<br>
&gt;undefined symbol: notify -- neglecting<br>
&gt;2008-09-23 07:04:39 D [spec.y:219:section_type] parser:<br>
&gt;Type:posix-locks-home1:features/posix-locks<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:posix-locks-home1:mandatory:on<br>
&gt;2008-09-23 07:04:39 D [spec.y:352:section_sub] parser:<br>
&gt;child:posix-locks-home1-&gt;home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for &#39;home2&#39;<br>
&gt;2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt<br>
&gt;to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/protocol/client.so<br>
&gt;2008-09-23 07:04:39 D [spec.y:219:section_type] parser:<br>
&gt;Type:home2:protocol/client<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:home2:transport-type:tcp/client<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:home2:remote-host:<a href="http://72.36.173.218" target="_blank">72.36.173.218</a><br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:home2:remote-subvolume:posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:home2:transport-timeout:10<br>
&gt;2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:home2<br>
&gt;2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for &#39;server&#39;<br>
&gt;2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt<br>
&gt;to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/protocol/server.so<br>
&gt;2008-09-23 07:04:39 D [spec.y:219:section_type] parser:<br>
&gt;Type:server:protocol/server<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:server:transport-type:tcp/server<br>
&gt;2008-09-23 07:04:39 D [spec.y:352:section_sub] parser:<br>
&gt;child:server-&gt;posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:server:auth.addr.posix-locks-home1.allow:<a href="http://72.36.173.218" target="_blank">72.36.173.218</a>,<a href="http://127.0.0.1" target="_blank">127.0.0.1</a><br>
&gt;2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:server<br>
&gt;2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for &#39;home&#39;<br>
&gt;2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt<br>
&gt;to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so<br>
&gt;2008-09-23 07:04:39 D [xlator.c:324:xlator_set_type] home: strict<br>
&gt;option validation is not enforced -- neglecting<br>
&gt;2008-09-23 07:04:39 D [spec.y:219:section_type] parser: Type:home:cluster/afr<br>
&gt;2008-09-23 07:04:39 D [spec.y:285:section_option] parser:<br>
&gt;Option:home:read-subvolume:posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:352:section_sub] parser:<br>
&gt;child:home-&gt;posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [spec.y:352:section_sub] parser: child:home-&gt;home2<br>
&gt;2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:home<br>
&gt;2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt<br>
&gt;to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/mount/fuse.so<br>
&gt;2008-09-23 07:04:39 D [xlator.c:324:xlator_set_type] fuse: strict<br>
&gt;option validation is not enforced -- neglecting<br>
&gt;2008-09-23 07:04:39 D [glusterfs.c:771:main] glusterfs: running in pid 1145<br>
&gt;2008-09-23 07:04:39 D [fuse-options.c:140:fuse_options_validate]<br>
&gt;fuse-options: using mount-point = /home<br>
&gt;2008-09-23 07:04:39 D [fuse-options.c:147:fuse_options_validate]<br>
&gt;fuse-options: using attr-timeout = 1<br>
&gt;2008-09-23 07:04:39 D [fuse-options.c:159:fuse_options_validate]<br>
&gt;fuse-options: using entry-timeout = 1<br>
&gt;2008-09-23 07:04:39 D [fuse-options.c:171:fuse_options_validate]<br>
&gt;fuse-options: using direct-io-mode = 1<br>
&gt;2008-09-23 07:04:39 D [client-protocol.c:4383:init] home2: setting<br>
&gt;transport-timeout to 10<br>
&gt;2008-09-23 07:04:39 D [transport.c:104:transport_load] transport:<br>
&gt;attempt to load file /usr/local/lib/glusterfs/1.4.0pre5/transport/socket.so<br>
&gt;2008-09-23 07:04:39 D [client-protocol.c:4427:init] home2: defaulting<br>
&gt;limits.transaction-size to 268435456<br>
&gt;2008-09-23 07:04:39 D [afr.c:6397:init] home: self-heal is enabled (default)<br>
&gt;2008-09-23 07:04:39 D [afr.c:6421:init] home: config: reads will be<br>
&gt;done on posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [afr.c:6309:notify] home: GF_EVENT_CHILD_UP<br>
&gt;from posix-locks-home1<br>
&gt;2008-09-23 07:04:39 D [afr.c:6241:afr_check_xattr_cbk] home:<br>
&gt;&#39;posix-locks-home1&#39; supports Extended attribute<br>
&gt;2008-09-23 07:04:39 D [inode.c:928:inode_table_new] fuse: creating<br>
&gt;new inode table with lru_limit=0<br>
&gt;2008-09-23 07:04:39 D [inode.c:443:__inode_create] fuse/inode: create inode(0)<br>
&gt;2008-09-23 07:04:39 D [client-protocol.c:4653:notify] home2: got<br>
&gt;GF_EVENT_PARENT_UP, attempting connect on transport<br>
&gt;2008-09-23 07:04:39 D [transport.c:104:transport_load] transport:<br>
&gt;attempt to load file /usr/local/lib/glusterfs/1.4.0pre5/transport/socket.so<br>
&gt;2008-09-23 07:04:39 E [name.c:344:af_inet_server_get_local_sockaddr]<br>
&gt;server: getaddrinfo failed (Name or service not known)<br>
&gt;2008-09-23 07:04:39 W [common-utils.c:158:gf_print_bytes] glusterfs:<br>
&gt;Total data (in bytes): transfered (0), received (0)<br>
&gt;pending frames:<br>
&gt;<br>
&gt;Signal received: 11<br>
&gt;configuration details:argp 1<br>
&gt;backtrace 1<br>
&gt;dlfcn 1<br>
&gt;fdatasync 1<br>
&gt;libpthread 1<br>
&gt;llistxattr 1<br>
&gt;setfsid 1<br>
&gt;spinlock 1<br>
&gt;epoll.h 1<br>
&gt;xattr.h 1<br>
&gt;tv_nsec 1<br>
&gt;package-string: glusterfs 1.4.0pre5<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt;_______________________________________________<br>
&gt;Gluster-users mailing list<br>
&gt;<a href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a><br>
&gt;<a href="http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users" target="_blank">http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users</a><br>
<br>
<br>
_______________________________________________<br>
Gluster-users mailing list<br>
<a href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a><br>
<a href="http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users" target="_blank">http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users</a><br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br>Raghavendra G<br><br>A centipede was happy quite, until a toad in fun,<br>Said, &quot;Prey, which leg comes after which?&quot;,<br>This raised his doubts to such a pitch,<br>
He fell flat into the ditch,<br>Not knowing how to run.<br>-Anonymous<br>
</div>