[ovs-dev] ovn-controller CPU use and incremental processing dependencies

Sat Jun 9 00:21:17 UTC 2018

On Thu, Jun 07, 2018 at 04:51:34PM -0700, Ben Pfaff wrote:
> On Thu, Jun 07, 2018 at 04:32:10PM -0700, Han Zhou wrote:
> > On Thu, Jun 7, 2018 at 11:15 AM, Ben Pfaff <blp at ovn.org> wrote:
> > >
> > > To review the problem we're trying to solve, ovn-controller uses too
> > > much CPU in many circumstances because it recomputes everything on every
> > > iteration on the main loop.  That includes when it wakes up for any
> > > reason, e.g. in response to a timer, or to an "ovs-appctl" command, or
> > > to respond to an "echo request" probing whether an OpenFlow or OVSDB
> > > connection is still up.  In even a medium-sized network, this is a lot
> > > of work and can easily lead to ovn-controller using 100% of a CPU
> > > without doing significant valuable work.
> > >
> > > The solution, as you and others have noticed, is to recompute only when
> > > an input has changed.  This can be done on at least three different
> > > granularities:
> > >
> > > 1. Whole system.  Recompute everything if any input changed.
> > >
> > > 2. Coarsely granular.  Recompute intermediate data and output data,
> > coarsely
> > >    defined, if their respective inputs changed.
> > >
> > > 3. Finely granular.  Recompute individual rows (etc.) of intermediate or
> > output
> > >    data if their individual inputs changed.
> > >
> > > The 2016 patches from Ryan Moats were mostly #1 with some #2, and, Han,
> > > as I see it your patch series is mostly #2 with some #3.
> > >
> > > Ryan Moats' series was eventually reverted because it was not correct and
> > could
> > > not be confidently fixed or maintained.  In turn, this was because it was
> > too
> > > difficult to figure out each calculation's actual dependencies.  This was
> > in
> > > fact difficult even for #1, let alone #2.
> > >
> > > Your new patch series uses a much more systematic approach to
> > > dependencies, since it uses a formal incremental processing engine to
> > > propagate changes through a DAG of computations.  However, assuming that
> > > it is correct as is, I am concerned that it will eventually suffer from
> > > the same problem.
> > >
> > > The way I see it, the incremental computation systems that have been
> > proposed
> > > so far are analogous to a Makefile that explicitly lists all of the .h
> > files
> > > that each .c file includes (directly or indirectly).  If you've ever
> > maintained
> > > a Makefile that works this way, you know that it's difficult to get it
> > right at
> > > the start and that it is essentially impossible to keep it correct as a
> > project
> > > evolves.  That's why every modern build system computes header file
> > > dependencies automatically as a side effect of compilation.  Similarly, I
> > don't
> > > see a realistic way that we can keep these dependencies correct as
> > > ovn-controller evolves.
> > >
> > > So, I've been spending some time thinking about how we can come up with
> > the
> > > ovn--controller equivalent of automatic dependency computation.  My
> > current
> > > thinking is that we need to adopt a more functional programming attitude
> > here.
> > > That is, any given part of the ovn-controller computation should only be
> > passed
> > > as input the data that it actually needs, and as output the data
> > structures
> > > that it produces.  This will give us the opportunity to make sure that the
> > > input and the dependencies are in sync.  Ideally, we'd be able to tie
> > those
> > > together in the code, so that passing a piece of data into a computation
> > > function automatically makes that a dependency from the engine's
> > perspective.
> > 
> > Agree. While I don't think it is totally unrealistic, but it is indeed my
> > biggest concern how to ensure the dependencies are correctly maintained in
> > the longer term. It was a manual and tedious process for me (although it
> > helped me to get better understanding of the whole implementation, as a
> > personal benefit) to figure out the dependencies, by looking at the
> > parameters passed to those xxx_run() functions, and also check carefully
> > the global variables, and specially, the OVSDB tables they might be using.
> > If I miss anything, or in the future some contributor miss updating the
> > dependency graph when they should, it will result in problem. It would be
> > great if we have a mechanism to self-describe the dependency from code and
> > then deduce the dependencies automatically (or it might be good enough to
> > still adding dependency manually, but relying on detecting and warning for
> > missing dependencies during make). I have no concrete idea of the
> > implementation though, any detailed suggestion would be helpful.
> 
> I've been playing with some ideas about that this week.  I hope to have
> something to show soon.

I posted a series that takes some first steps:
        https://patchwork.ozlabs.org/project/openvswitch/list/?series=49281