<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

  <head>

    <meta content="text/html; charset=UTF-8" http-equiv="Content-Type">

  </head>

  <body text="#000000" bgcolor="#ffffff">

    On 24/09/2010 05:10, Craig Carl wrote:

    <blockquote

      cite="mid:1067438425.151185.1285301439213.JavaMail.root@mailbox1"

      type="cite">

      <style type="text/css">p { margin: 0; }</style>

      <div style="font-family: Times New Roman; font-size: 12pt; color:

        rgb(0, 0, 0);">Ed - <br>

        Â Â  If I understand it looks like you are recommending a method

        for implementing an asynchronous replication solution as a

        possible alternative to the current synchronous replication

        method? <span></span><br>

      </div>

    </blockquote>

    <br>

    I think there are two main use cases which benefit:<br>

    <br>

    1) Master/Master, epecially where the client is itself one of the

    bricks.Â  Eg recently there have been several threads on poor

    performance using gluster as the backing store for a web server.Â 

    Here a common situation might be that we have a backing store

    holding say two web applications, each frontend server generally

    only serves one of the two applications and so we want to avoid

    network accesses in the common case that files typically have

    affinity for being used by just one of the servers.<br>

    <br>

    2) Achieving effectively the benefit of a large writeback cache, yet

    without compromising coherency, in the face of larger RTT times

    between bricks.Â  This could be anything from a 100mbit IP link

    between heavily accessed servers, to a WAN.Â  <br>

    <br>

    Optimistic locking is basically a way of optimising for the case

    where a single brick at a time tends to access a subset of files.Â 

    It does absolutely nothing for the situation that you have more than

    one brick competing for access to the same file (I think it's

    obvious that the latter situation is hard to improve anyway)<br>

    <br>

    So really optimistic locking is a performance improvement in any

    situation where:<br>

    <br>

    - One server accesses a file more than once in a row, before any

    other server requests access (doesn't matter whether its a read or

    write)<br>

    - The above also implies that we will get maximum benefit in the

    case where there is relatively large RTT times between servers (this

    would include even gigabit for the case of a heavily used server

    though)<br>

    - We can also infer that this optimisation benefits us most if we

    can tweak our applications to have some kind of affinity to prefer a

    given server for a given subset of files (often this is very easily

    done for a whole class of applications, eg webservers point A

    records to specific servers, mailservers trivially route users to

    their preferred storage server, geographic clustering tends to take

    care of itself if the client isn't in a rocket ship, etc)<br>

    <br>

    OK, so that's "optimistic locking" and the reason why it would be

    nice to have it.Â  Traditionally this is done using a shared lock

    server (single point of failure).Â  However, my suggestion was to

    read up on the algorithms in the publications list, which show how

    it's possible to implement a fault tolerant, shared nothing, lock

    server (cool!).Â  Now we have a lock server in the style of gluster,

    where there is no single point of failure!<br>

    <br>

    So I think really it's two feature requests:<br>

    <br>

    1) Can you please implement optimistic locking optimisations using a

    lock server<br>

    2) Can you please make the lock server fault tolerant, fully

    distributed, shared nothing, eg using a Paxos derivative<br>

    <br>

    Cheers<br>

    <br>

    Ed W<br>

  </body>

</html>