[ovs-discuss] Question about RAFT cluster status output

Winson Wang windson.wang at gmail.com
Tue Apr 21 21:29:15 UTC 2020


On Tue, Apr 21, 2020 at 2:19 PM Han Zhou <hzhou at ovn.org> wrote:

>
>
> On Tue, Apr 21, 2020 at 2:12 PM Winson Wang <windson.wang at gmail.com>
> wrote:
> >
> > Hi Han
> >
> > I have question about the Connections output in my RAFT cluster.
> > Connections: ->0000 ->3c2d <-29ce <-3c2d
> > Should the "0000"  be 29ce?
> >
> Yes, you are right. This seems to be a bug. Do you know how to reproduce
> this?
>

I am seeing this with my 646 node k8s cluster with ovn cni.
To reproduce it,  I think need trigger SB to cpu busy state such as restart
all the ovn-controller clients or NB change which
trigger SB generate large flow count in short time such as 200K flows.

I  run one backgroup script to check the raft node role every 10 seconds.
here is some logout for the role change during the stress time.
........
SB role changed from follower to candidate on 13:21:06
SB role changed from candidate to leader on 13:21:16
SB role changed from leader to follower on 13:22:13
SB role changed from follower to candidate on 13:46:54
SB role changed from candidate to follower on 13:47:05
....


>
> > cluster leader output:
> > ovs-appctl -t /var/run/openvswitch/ovnsb_db.ctl cluster/status
> OVN_Southbound
> > bb7d
> > Name: OVN_Southbound
> > Cluster ID: c316 (c316d7c4-6a72-4124-aa62-657b7c50c5c6)
> > Server ID: bb7d (bb7d4188-d0d5-4e5e-b0a3-fbbbaa849418)
> > Address: tcp:10.0.2.153:6644
> > Status: cluster member
> > Role: leader
> > Term: 41
> > Leader: self
> > Vote: self
> >
> > Election timer: 8000
> > Log: [8899, 8946]
> > Entries not yet committed: 0
> > Entries not yet applied: 0
> > Connections: ->0000 ->3c2d <-29ce <-3c2d
> > Servers:
> >     bb7d (bb7d at tcp:10.0.2.153:6644) (self) next_index=8920
> match_index=8945
> >     29ce (29ce at tcp:10.0.2.151:6644) next_index=8946 match_index=8945
> >     3c2d (3c2d at tcp:10.0.2.152:6644) next_index=8946 match_index=8945
> >
> > Name: OVN_Southbound
> > Cluster ID: c316 (c316d7c4-6a72-4124-aa62-657b7c50c5c6)
> > Server ID: 3c2d (3c2d4666-f10d-49dc-a1b4-dec50720a79f)
> > Address: tcp:10.0.2.152:6644
> > Status: cluster member
> > Role: follower
> > Term: 41
> > Leader: bb7d
> > Vote: unknown
> >
> > Election timer: 8000
> > Log: [8898, 8946]
> > Entries not yet committed: 0
> > Entries not yet applied: 0
> > Connections: ->0000 <-29ce <-bb7d ->bb7d
> > Servers:
> >     bb7d (bb7d at tcp:10.0.2.153:6644)
> >     29ce (29ce at tcp:10.0.2.151:6644)
> >     3c2d (3c2d at tcp:10.0.2.152:6644) (self)
> >
> > 29ce
> > Name: OVN_Southbound
> > Cluster ID: c316 (c316d7c4-6a72-4124-aa62-657b7c50c5c6)
> > Server ID: 29ce (29ce6194-ec71-4c8f-ba70-1953568ed4cc)
> > Address: tcp:10.0.2.151:6644
> > Status: cluster member
> > Role: follower
> > Term: 41
> > Leader: bb7d
> > Vote: bb7d
> >
> > Election timer: 8000
> > Log: [8875, 8929]
> > Entries not yet committed: 0
> > Entries not yet applied: 0
> > Connections: <-3c2d ->3c2d <-bb7d ->bb7d
> > Servers:
> >     bb7d (bb7d at tcp:10.0.2.153:6644)
> >     29ce (29ce at tcp:10.0.2.151:6644) (self)
> >     3c2d (3c2d at tcp:10.0.2.152:6644)
> >
> > --
> > Winson
>


-- 
Winson
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.openvswitch.org/pipermail/ovs-discuss/attachments/20200421/8224fedf/attachment.html>


More information about the discuss mailing list