[ovs-discuss] OVS 2.3.2 hung - no arp with lacp bond

Joe Stringer joe at ovn.org
Fri Nov 18 18:59:14 UTC 2016


On 17 November 2016 at 23:25, Varun <ez2517 at gmail.com> wrote:
> Hello
>
> I observed an issue with OVS 2.3.2 on CentOS 6.6 KVM , kernel
> 2.6.32-504.el6,   with 25 tenant VMs where it stopped responding to ARPs all
> of a sudden. There are 2 OVS bonds in balance-tcp mode in this
> configuration.  OVS service was restarted to resolve this state .
>
> Below logs were repeated many times over
>
> 2016-10-01T12:37:32.480Z|1786606|poll_loop|INFO|wakeup due to 0-ms timeout
> at lib/seq.c:179 (97% CPU usage)
>
> 2016-10-01T23:59:02.480Z|1810465|poll_loop|INFO|wakeup due to 0-ms timeout
> at ofproto/ofproto-dpif.c:1503 (99% CPU usage)
>
> 2016-10-02T00:33:32.481Z|1811683|poll_loop|INFO|wakeup due to [POLLIN] on fd
> 10 (<->/var/run/openvswitch/db.sock) at lib/stream-fd-unix.c:124 (97% CPU
> usage)
>
>
> Is this related to revalidator thread  or probably due to high traffic
> volume ?   Having said that , there has no increase in traffic volume to the
> tenant VMs in the days leading up to the OVS hung state or after.
> Hypervisor cpu and memory usage has been at normal levels as well.
>
> Has anyone else observed similar issues ? Any insight or comments would be
> really appreciated.

For what it's worth, I believe that OVS 2.3.3 fixed some issues
related to bonding - for example:
https://github.com/openvswitch/ovs/commit/c2e761f5b15291b56eb0c2f2311ef218ae0653c6


More information about the discuss mailing list