[ovs-discuss] insmod/rmmod openvswitch.ko frequently, trigger linux kernel crash.

tigergan(甘涛) tigergan at tencent.com
Sat May 5 05:23:07 UTC 2018


Hi,
insmod/rmmod openvswitch.ko frequently, after a few tries, kernel crash(BUG: unable to handle kernel paging request at ffffffffa0641290):
[  320.986901] openvswitch: Open vSwitch switching datapath 2.9.0
[  320.987064] openvswitch: LISP tunneling driver
[  320.987067] openvswitch: STT tunneling driver
[  327.030573] openvswitch: Open vSwitch switching datapath 2.9.0
[  327.030688] openvswitch: LISP tunneling driver
[  327.030691] openvswitch: STT tunneling driver
[  333.075281] openvswitch: Open vSwitch switching datapath 2.9.0
[  333.075393] openvswitch: LISP tunneling driver
[  333.075396] openvswitch: STT tunneling driver
[  336.107467] BUG: unable to handle kernel paging request at ffffffffa0641290
[  336.107676] IP: [<ffffffff81098538>] run_timer_softirq+0x108/0x340
[  336.107822] PGD 19bd067 PUD 19be063 PMD 103635f067 PTE 0
[  336.108142] Oops: 0002 [#1] SMP
[  336.108334] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat xt_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter udp_tunnel gre nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_conntrack_ipv6 nf_nat_ipv6 nf_defrag_ipv6 nf_nat nf_conntrack iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi intel_powerclamp coretemp intel_rapl iosf_mbi kvm_intel kvm irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel lrw iTCO_wdt gf128mul glue_helper ablk_helper cryptd liquidio(OE) iTCO_vendor_support sb_edac i2c_i801 edac_core sg pcspkr lpc_ich mei_me mei shpchp acpi_power_meter acpi_pad nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs libcrc32c sd_mod crc_t10dif crct10dif_generic crct10dif_pclmul
[  336.113086]  crct10dif_common crc32c_intel ahci igb libahci ptp pps_core libata i2c_algo_bit i2c_core megaraid_sas dca fjes dm_mirror dm_region_hash dm_log dm_mod [last unloaded: openvswitch]
[  336.114306] CPU: 13 PID: 0 Comm: swapper/13 Tainted: G           OE  ------------   3.10.0-514.el7.x86_64 #1
[  336.114398] Hardware name: Huawei RH1288 V3/BC11HGSC0, BIOS 3.22 05/16/2016
[  336.114662] task: ffff8810f9e50fb0 ti: ffff8810f9e58000 task.ti: ffff8810f9e58000
[  336.115116] RIP: 0010:[<ffffffff81098538>]  [<ffffffff81098538>] run_timer_softirq+0x108/0x340
[  336.115625] RSP: 0018:ffff88203f243e68  EFLAGS: 00010086
[  336.115880] RAX: ffff8810f94a43c8 RBX: ffff8810f94a4000 RCX: ffffffffa0641290
[  336.116139] RDX: ffff8810f94a43a0 RSI: ffff88203f243e90 RDI: ffff8810f94a4000
[  336.116400] RBP: ffff88203f243ed0 R08: 0000004e35057840 R09: ffff88203f243d98
[  336.116669] R10: 0000000000000002 R11: ffff88203f243da0 R12: 000000000000003a
[  336.116936] R13: 0000004e34fd52b6 R14: ffffffff819b3088 R15: 0000000000000001
[  336.117198] FS:  0000000000000000(0000) GS:ffff88203f240000(0000) knlGS:0000000000000000
[  336.117636] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  336.117894] CR2: ffffffffa0641290 CR3: 00000000019ba000 CR4: 00000000001407e0
[  336.118158] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  336.118419] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  336.118678] Stack:
[  336.118933]  ffff8810f94a5c28 ffff8810f94a5828 ffff8810f94a5428 ffff8810f94a5028
[  336.119619]  ffff88203f243ea8 ffff88103eacd908 ffffffffa0641290 7ba4b2baa7f09803
[  336.120303]  0000000000000001 0000000000000001 0000004e34fd52b6 ffffffff819b3088
[  336.121000] Call Trace:
[  336.121243]  <IRQ>
[  336.121312]
[  336.121619]  [<ffffffff8108f0ff>] __do_softirq+0xef/0x280
[  336.121854]  [<ffffffff8169805c>] call_softirq+0x1c/0x30
[  336.122087]  [<ffffffff8102d365>] do_softirq+0x65/0xa0
[  336.122321]  [<ffffffff8108f495>] irq_exit+0x115/0x120
[  336.122558]  [<ffffffff81698cd5>] smp_apic_timer_interrupt+0x45/0x60
[  336.122793]  [<ffffffff8169721d>] apic_timer_interrupt+0x6d/0x80
[  336.123048]  <EOI>
[  336.123116]
[  336.123427]  [<ffffffff81514052>] ? cpuidle_enter_state+0x52/0xc0
[  336.123687]  [<ffffffff81514199>] cpuidle_idle_call+0xd9/0x210
[  336.123954]  [<ffffffff8103516e>] arch_cpu_idle+0xe/0x30
[  336.124215]  [<ffffffff810e7c95>] cpu_startup_entry+0x245/0x290
[  336.124474]  [<ffffffff8104f12a>] start_secondary+0x1ba/0x230
[  336.124730] Code: 49 63 d4 48 83 43 10 01 48 8d 75 c0 48 c1 e2 04 48 01 da 48 8b 4a 28 48 8d 42 28 48 89 71 08 48 89 4d c0 48 8b 4a 30 48 89 4d c8 <48> 89 31 48 89 42 28 48 89 40 08 0f 1f 44 00 00 4c 8b 7d c0 48
[  336.129453] RIP  [<ffffffff81098538>] run_timer_softirq+0x108/0x340
[  336.129771]  RSP <ffff88203f243e68>
[  336.130023] CR2: ffffffffa0641290

# uname -a
Linux S1 3.10.0-514.el7.x86_64 #1 SMP Tue Nov 22 16:42:41 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
[root at S1 ~]#
# cat /etc/centos-release
CentOS Linux release 7.3.1611 (Core)

# modinfo openvswitch.ko
filename:       /home/test/OVS/openvswitch-2.9.0/datapath/linux/openvswitch.ko
alias:          net-pf-16-proto-16-family-ovs_packet
alias:          net-pf-16-proto-16-family-ovs_flow
alias:          net-pf-16-proto-16-family-ovs_vport
alias:          net-pf-16-proto-16-family-ovs_datapath
version:        2.9.0
license:        GPL
description:    Open vSwitch switching datapath
rhelversion:    7.3
srcversion:     2F9B4D6556D9F3EB76C929F
depends:        nf_conntrack,nf_nat,nf_defrag_ipv6,udp_tunnel,libcrc32c,nf_nat_ipv6,nf_nat_ipv4
vermagic:       3.10.0-514.el7.x86_64 SMP mod_unload modversions

the test case:
# for((i=0;i<1000;i++)); do echo "-------insmod $i--------"; insmod openvswitch.ko ; sleep 3; echo "---------rmmod $i-------";rmmod openvswitch; sleep 3; done

Let me know if you need more information. We are curious to know if it is a
know bug, or if it is something new.
Thanks a lot

________________________________
tigergan(甘涛)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.openvswitch.org/pipermail/ovs-discuss/attachments/20180505/359b6e8b/attachment-0001.html>


More information about the discuss mailing list