Kernel bug affecting fd.io vpp on certain hardware: linux-4.15.0-88-generic (Ubuntu 18.04 LTS)


Dave Barach
 

See https://bugzilla.kernel.org/show_bug.cgi?id=206133. Doesn’t happen on all hardware. Also reported on series 5 Linux kernels.

 

FWIW... Dave

 

Feb 20 11:27:42 vppg2 kernel: [ 1386.487195] general protection fault: 0000 [#1] SMP NOPTI

Feb 20 11:27:42 vppg2 kernel: [ 1386.492600] Modules linked in: uio_pci_generic uio intel_rapl x86_pkg_temp_thermal intel_powerc\

lamp coretemp kvm_intel kvm irqbypass intel_cstate intel_rapl_perf mac_hid tpm_infineon shpchp sch_fq_codel ib_iser rdma_cm iw_c\

m ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid4\

56 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclm\

ul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel ixgbe igb aes_x86_64 crypto_simd ahci glue_helper i2c_algo_bit dca cryptd p\

tp mdio libahci i2c_ismt pps_core

Feb 20 11:27:42 vppg2 kernel: [ 1386.548572] CPU: 1 PID: 1801 Comm: vpp Not tainted 4.15.0-88-generic #88-Ubuntu

Feb 20 11:27:42 vppg2 kernel: [ 1386.555882] Hardware name: Default string Default string/Default string, BIOS 5.13 01/04/2018

Feb 20 11:27:42 vppg2 kernel: [ 1386.564408] RIP: 0010:remove_files.isra.1+0x24/0x70

Feb 20 11:27:42 vppg2 kernel: [ 1386.569289] RSP: 0018:ffffa179c1b4fc00 EFLAGS: 00010206

Feb 20 11:27:42 vppg2 kernel: [ 1386.574514] RAX: 7b728bc142638d2b RBX: ffff95ce6d3d78c0 RCX: 0000000000000000

Feb 20 11:27:42 vppg2 kernel: [ 1386.581648] RDX: ffff95ce6d3e1488 RSI: ffff95ce6d3d78c0 RDI: ffff95ce60984000

Feb 20 11:27:42 vppg2 kernel: [ 1386.588780] RBP: ffffa179c1b4fc18 R08: 0000000000000000 R09: ffff95ce609841b0

Feb 20 11:27:42 vppg2 kernel: [ 1386.595912] R10: 0000000000000000 R11: 0000000000000000 R12: ffff95ce60984000

Feb 20 11:27:42 vppg2 kernel: [ 1386.603047] R13: ffff95ce6d3e1488 R14: 0000000000000000 R15: 0000000000000060

Feb 20 11:27:42 vppg2 kernel: [ 1386.610179] FS:  00007ffff7fdb7c0(0000) GS:ffff95ce7fc40000(0000) knlGS:0000000000000000

Feb 20 11:27:42 vppg2 kernel: [ 1386.618267] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033

Feb 20 11:27:42 vppg2 kernel: [ 1386.624011] CR2: 0000555555991000 CR3: 00000004669ca000 CR4: 00000000003406e0

Feb 20 11:27:42 vppg2 kernel: [ 1386.631143] Call Trace:

Feb 20 11:27:42 vppg2 kernel: [ 1386.633600]  sysfs_remove_group+0x44/0x90

Feb 20 11:27:42 vppg2 kernel: [ 1386.637617]  sysfs_remove_groups+0x2e/0x50

Feb 20 11:27:42 vppg2 kernel: [ 1386.641717]  device_remove_attrs+0x47/0x80

Feb 20 11:27:42 vppg2 kernel: [ 1386.645815]  device_del+0x161/0x3b0

Feb 20 11:27:42 vppg2 kernel: [ 1386.649308]  cdev_device_del+0x1a/0x40

Feb 20 11:27:42 vppg2 kernel: [ 1386.653062]  posix_clock_unregister+0x26/0x50

Feb 20 11:27:42 vppg2 kernel: [ 1386.657422]  ptp_clock_unregister+0x72/0x80 [ptp]

Feb 20 11:27:42 vppg2 kernel: [ 1386.662134]  igb_ptp_stop+0x23/0x50 [igb]

Feb 20 11:27:42 vppg2 kernel: [ 1386.666152]  igb_remove+0x4b/0x170 [igb]

Feb 20 11:27:42 vppg2 kernel: [ 1386.670085]  pci_device_remove+0x3e/0xb0

Feb 20 11:27:42 vppg2 kernel: [ 1386.674008]  device_release_driver_internal+0x13a/0x220

Feb 20 11:27:42 vppg2 kernel: [ 1386.679233]  device_release_driver+0x12/0x20

Feb 20 11:27:42 vppg2 kernel: [ 1386.683507]  unbind_store+0x87/0x150

Feb 20 11:27:42 vppg2 kernel: [ 1386.687087]  drv_attr_store+0x27/0x40

Feb 20 11:27:42 vppg2 kernel: [ 1386.690751]  sysfs_kf_write+0x3c/0x50

Feb 20 11:27:42 vppg2 kernel: [ 1386.694418]  kernfs_fop_write+0x125/0x1a0

Feb 20 11:27:42 vppg2 kernel: [ 1386.698430]  __vfs_write+0x1b/0x40

Feb 20 11:27:42 vppg2 kernel: [ 1386.701835]  vfs_write+0xb1/0x1a0

Feb 20 11:27:42 vppg2 kernel: [ 1386.705157]  SyS_write+0x5c/0xe0

Feb 20 11:27:42 vppg2 kernel: [ 1386.708391]  do_syscall_64+0x73/0x130

Feb 20 11:27:42 vppg2 kernel: [ 1386.712057]  entry_SYSCALL_64_after_hwframe+0x3d/0xa2

Feb 20 11:27:42 vppg2 kernel: [ 1386.717106] RIP: 0033:0x7ffff6738281

Feb 20 11:27:42 vppg2 kernel: [ 1386.720686] RSP: 002b:00007fffb5a279c8 EFLAGS: 00003246 ORIG_RAX: 0000000000000001

Feb 20 11:27:42 vppg2 kernel: [ 1386.728253] RAX: ffffffffffffffda RBX: 00007fffb66b5670 RCX: 00007ffff6738281

Feb 20 11:27:42 vppg2 kernel: [ 1386.735385] RDX: 000000000000000c RSI: 00007fffb66b5670 RDI: 000000000000000b

Feb 20 11:27:42 vppg2 kernel: [ 1386.742517] RBP: 000000000000000b R08: 0000000000000010 R09: 0000000000000078

Feb 20 11:27:42 vppg2 kernel: [ 1386.749649] R10: 000000000000000e R11: 0000000000003246 R12: 00007fffb66b3e20

Feb 20 11:27:42 vppg2 kernel: [ 1386.756783] R13: 0000555555990a63 R14: 00007fffb66b3e20 R15: 00007fffb5a27b20

Feb 20 11:27:42 vppg2 kernel: [ 1386.763917] Code: 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 85 f6 48 89 e5 41 55 41 54 49 89 d5\

53 49 89 fc 74 24 48 8b 06 48 89 f3 48 85 c0 74 19 <48> 8b 30 31 d2 48 83 c3 08 4c 89 e7 e8 bb d4 ff ff 48 8b 03 48

Feb 20 11:27:42 vppg2 kernel: [ 1386.782776] RIP: remove_files.isra.1+0x24/0x70 RSP: ffffa179c1b4fc00

Feb 20 11:27:42 vppg2 kernel: [ 1386.789163] ---[ end trace d5bfb19fdf227b22 ]---