CSIT failing perf tests for week 47 (11/14 – 11/20)


Viliam Luc -X (vluc - PANTHEON TECH SRO at Cisco) <vluc@...>
 

=====SUMMARY=====

New issues - 4

Unfixed issues - 14

Fixed issues - 0

 

===NEW ISSUES===

1) [H] 2n-aws: All tests fail to start VPP

   rca: Module uio_pci_generic not found in directory /lib/modules/5.4.0-1009-aws

   test: all

   frequency: always

   testbed: 2n-aws

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-weekly-master-2n-aws/73/log.html.gz#s1-s1-s1-s1-s1-t1

 

2) [H] 2n-aws: All tests fail to initialize interface

   rca: '/sys/bus/pci/devices/0000:3b:00.0/virtfn0/driver/unbind': No such file or directory

   test: all

   frequency: always

   testbed: 2n-clx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-weekly-master-2n-clx/168/log.html.gz#s1-s1-s1-s1-s1-s1-s1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-icx/48/log.html.gz#s1-s1-s1-s1-s1

 

3) [H] 2n-icx: NFV density tests break VPP, which then fails to start (re-opened)

   rca:

   test: all

   frequency: all

   testbed: 2n-icx, 3n-icx, 2n-clx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-weekly-master-2n-icx/47/log.html.gz#s1-s1-s1-s1-s1-s1-s1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-icx/48/log.html.gz#s1-s1-s1-s5-s8-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1881

 

4) [H] 2n-clx: dpdk tests failing to unbind PCI device

   rca: DRV_VFIO_PCI

   test: all DPDK

   frequency: always

   testbed: 2n-clx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-clx/167/log.html.gz#s1-s1-s1-s2-s16-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-weekly-master-2n-aws/73/log.html.gz#s1-s1-s1-s1-s1-t4

 

===OUTSTANDING UNFIXED===

5) [M] 3n-snr: All hwasync wireguard tests failing when trying to verify device

   rca: Failed to bind PCI device 0000:f4:00.0 to c4xxx on host 10.30.51.93

   test: hwasync wireguard

   frequency: always

   testbed: 3n-snr

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-snr/45/log.html.gz#s1-s1-s1-s3-s1

 

TICKET: https://jira.fd.io/browse/CSIT-1883

 

6) [M] 1n-aws: TRex NDR PDR ALL IP4 scale and L2 scale tests failing with 50% packet loss

   rca:

   test: ip4scale2m

   frequency: all

   testbed: 1n-aws

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-trex-perf-ndrpdr-weekly-master-1n-aws/8/log.html.gz#s1-s1-s1-s1-s2-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1876

NOTE: The root cause may be the shared environment in the AWS cloud.

 

7) [H] 2n-clx, 2n-zn2: all RDMA tests failing with cli_inband clear runtime command

   rca:

   test: RDMA with CX556A NIC

   frequency: all

   testbed: 2n-clx, 2n-zn2

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-2n-clx/1212/log.html.gz#s1-s1-s1-s1-s1-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-2n-zn2/639/log.html.gz#s1-s1-s1-s1-s1-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-clx/167/log.html.gz#s1-s1-s1-s2-s5-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1882

 

8) [M] 3n-tsh: VM tests failing to boot VM

   rca:

   test: 3n-tsh: sporadic VM vhost

   frequency: all

   testbed: 3n-tsh

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-tsh/710/log.html.gz#s1-s1-s1-s7-s2-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1877

NOTE: The 3n-alt testbed was fixed; 3n-tsh is still failing.

 

9) [M] 3n-snr: 25Ge Interface goes down randomly

   rca:

   test: all

   frequency: sporadic

   testbed: 3n-snr

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-snr/45/log.html.gz#s1-s1-s1-s3-s12-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1871

NOTE: Sometimes 'TwentyFiveGigabitEthernetec/0/0' goes down and all subsequent tests fail.

 

10) [H] 2n-clx: half of the packets lost on PDR tests (re-opened)

   rca:

   test: e810Cq ip4base, ip6base

   frequency: sporadic

   testbed: 2n-clx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-clx/167/log.html.gz#s1-s1-s1-s2-s8-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1864

 

11) [M] 3n-alt, 3n-snr: testpmd tests fail with no traffic

   rca:

   test: testpmd

   frequency: all

   testbed: 3n-alt, 3n-snr

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-dpdk-perf-mrr-weekly-master-3n-alt/33/log.html.gz#s1-s1-s1-s1-t2

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-dpdk-perf-report-iterative-2210-3n-snr/6/log.html.gz#s1-s1-s1-s1-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1848

NOTE: 3n-alt was fixed. Only 3n-tsh is still failing.

 

12) [L] 2n-dnv: sporadic 1518B tput tests failing to establish required sessions

   rca:

   test: 1518B tput

   frequency: sporadic

   testbed: 2n-dnv

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-2n-dnv/1264/log.html.gz#s1-s1-s1-s1-s7-t4

 

TICKET: https://jira.fd.io/browse/CSIT-1850

 

13) [H] 3n-icx, 3n-skx, 3n-snr: all 1518B AVF crypto tests failed with no traffic, all IMIX AVF crypto tests failed with excessive packet loss

   rca:

   test: all AVF crypto

   frequency: sporadic

   testbed: 3n-skx, 3n-icx, 3n-snr

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-icx/148/log.html.gz#s1-s1-s1-s1-s4-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-snr/32/log.html.gz#s1-s1-s1-s1-s4-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-3n-icx/43/log.html.gz#s1-s1-s1-s1-s4-t1

                                          

TICKET: https://jira.fd.io/browse/CSIT-1827

 

14) [L] all testbeds: AF-XDP - NDR tests failing from time to time

   rca:

   test: af-xdp multicore tests

   frequency: low

   testbed: 2n-clx, 2n-skx, 2n-tx2, 2n-icx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-skx/202/log.html.gz#s1-s1-s1-s2-s4-t3

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-clx/152/log.html.gz#s1-s1-s1-s5-s12-t3

 

TICKET: https://jira.fd.io/browse/CSIT-1802

NOTE: This is mainly observed in iterative and coverage. The frequency is very low, roughly 1 out of 100.

 

15) [M] 3n-tsh, 3n-alt, 2n-clx testbeds (Taishan, Altra, Cascade Lake): NDR tests failing from time to time

   rca:

   test: Crypto, Ip4, L2, Srv6, Vm Vhost (all packet sizes, all core configurations affected)

   frequency: medium

   testbed: 3n-tsh, 3n-alt, 2n-clx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-icx/47/log.html.gz#s1-s1-s1-s2-s37-t1

                                          

TICKET: https://jira.fd.io/browse/CSIT-1804

 

16) [L] TRex STL runtime error

   rca: VPP code - X557 speed_capability set to 1GE instead of 10GE

   test: sporadic

   frequency: all

   testbed: 2n-dnv and 3n-dnv

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-2n-dnv/1264/log.html.gz#s1-s1-s1-s1-s3-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-dnv/1274/log.html.gz#s1-s1-s1-s2-s1-t1

 

TODO: VPP to fix speed_capability.

TICKET: https://jira.fd.io/browse/VPP-2010

 

17) [L] failed creating AVF interface

   rca: issue in Intel FVL driver

   test: multicore AVF

   frequency: sporadic

   testbed: all testbeds

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-2n-zn2/639/log.html.gz#s1-s1-s1-s2-s18-t3

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-icx/152/log.html.gz#s1-s1-s1-s5-s1-t2

 

NOTE: A long-standing issue without a final permanent fix.

TICKET: multicore AVF tests are failing when trying to create interface, https://jira.fd.io/browse/CSIT-1782

 

18) [L] Not all DET44 sessions have been established: 4128767 != 4128768

   rca: unknown

   test: nat44det udp 4m and 16m (64k and 1m are ok)

   frequency: very sporadic. It failed in 1 out of 8 runs.

   testbed: 2n-zn2, 2n-skx, 2n-icx, 2n-clx

   example: https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-2n-icx/160/log.html.gz#s1-s1-s1-s2-s35-t1

                  https://s3-logs.fd.io/vex-yul-rot-jenkins-1/csit-vpp-perf-ndrpdr-weekly-master-2n-clx/164/log.html.gz#s1-s1-s1-s2-s54-t1

 

TICKET: https://jira.fd.io/browse/CSIT-1795

 

===OUTSTANDING FIXED===

 

===FIXED ISSUES===

 

Best regards,

Viliam Luc
