[ovs-dev] ovsdb active backup deployment

Andy Zhou azhou at ovn.org
Tue Jul 26 19:48:07 UTC 2016


On Tue, Jul 26, 2016 at 11:59 AM, Russell Bryant <russell at ovn.org> wrote:

>
>
> On Tue, Jul 26, 2016 at 2:41 PM, Andy Zhou <azhou at ovn.org> wrote:
>
>>
>>
>> On Tue, Jul 26, 2016 at 5:37 AM, Russell Bryant <russell at ovn.org> wrote:
>>
>>>
>>>
>>> On Mon, Jul 25, 2016 at 8:15 PM, Andy Zhou <azhou at ovn.org> wrote:
>>>
>>>> Hi, Rayn and Russell,
>>>>
>>>
>>> Can we move this discussion to the ovs dev mailing list?  Feel free to
>>> just add it in a reply if you'd like.
>>>
>> Done.
>>
>>>
>>>
>>>> I am wondering how we can actually use the active/backup feature that
>>>> is now part of
>>>> OVSDB to increase OVN availability.
>>>>
>>>
>>> TO be clear, I haven't actually tried this yet.  I'm only speaking about
>>> how I think it should work.
>>>
>>>
>>>> Specifically:
>>>>
>>>> 1. When the active OVSDB server failed, should the back up server take
>>>> over, and allow write transactions? One simpler possibility is to allow
>>>> read only access to the backup serve.
>>>>
>>>
>>> The  backup server needs to take over.  It's OK if that requires
>>> intervention by an HA manager like Pacemaker.  If we can't make the passive
>>> server take over, I'd say the solution is incomplete.
>>>
>>
>> O.K. make sense.
>>
>> One possible issue with backup server taking over is "split head".  In
>> case due to network error, backup server becomes disconnected from the
>> active
>> server, then we may have both server thinking they are active server
>> now.  Does Pacemaker help with solving this issue.
>>
>
> It can, yes.  I would expect Pacemaker to explicitly configure a node to
> be either the active or passive node.
>
Manual switching is more straight forward. I agree.

>
>>>
>>>> 2. When a crashed active OVSDB server recovers, should it become the
>>>> new backup, or it should switch back.
>>>>
>>>
>>> Becoming the new backup is fine.  Again, this can be orchestrated by an
>>> HA manager (Pacemaker).
>>>
>> I am not familiar with pacemaker. Can I assume it can provide a correct
>> --sync-from argument (pointing to backup server) when relaunch OVSDB
>> server?
>>
>
> Yes.  I'd have to consult with some Pacemaker experts on exactly what the
> implementation would look like, but roughly:
>
> Pacemaker manages services using "OCF Resource Agents", which are just
> scripts with a defined set of inputs and outputs for service management.  I
> would imagine a Pacemaker cluster being told it must have exactly 1 active
> and 1 passive OVSDB service.  When the passive OVSDB service is started, it
> would include the "sync-from" argument based on where the active OVSDB
> service is currently running.
>
> We really need to prototype this and document it.  I'm guessing too much.
> Pacemaker is frequently used to manage active/passive HA, though.
>
> Sounds reasonable,  I will work on ovsdb internal changes to support
manual switching, using appctl commands. Then looking into prototyping with
HA systems.  I have not used pacemaker in the past, so it may take some
time to ramp up.

>
>>>
>>>> Ben said one of you, or both may have worked with similar active-backup
>>>> systems before, so I am very interested in your inputs.
>>>>
>>>> Thanks,
>>>>
>>>> Andy
>>>>
>>>
>>>
>>>
>>> --
>>> Russell Bryant
>>>
>>
>>
> --
> Russell Bryant
>



More information about the dev mailing list