Issue #9983 explains what is wrong with the GR helper mode.
To unblock the CI that fails almost all the time on the ospf_gr_topo1
test, remove the commands and disable the test. Also add a reminder to
completely remove the helper mode if no one fixes the code in a month.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
This can't really be run as part of CI; it's intended as a helper
instead, to use manually after poking around in the c-ares binding code.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
common_cli.c disables logging by default so stdio is usable as a vty
without log messages getting strewn in between. This is the right thing
for most tests, but not all; sometimes we do want log messages.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
949aaea5 removed debugs from all topotests, but this test relies on the
debug logs so it constantly fails now.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
When our CI test system is under high load, expecting bfd to
converge in under 2 seconds is not going to happen. Modify the test
suites to just ensure that things reconverge.
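As a rough sketch (not the literal patch), the convergence check can lean on the standard topotest polling helpers; the router name, peer address and retry counts are placeholders:
```
import json
from functools import partial
from lib import topotest
from lib.topogen import get_topogen

def _bfd_peer_up(router, peer="192.168.1.2"):
    # True once the BFD session to `peer` reports "up" in bfdd's JSON output.
    peers = json.loads(router.vtysh_cmd("show bfd peers json"))
    return any(p.get("peer") == peer and p.get("status") == "up" for p in peers)

tgen = get_topogen()
# Poll generously instead of demanding sub-2-second convergence.
success, _ = topotest.run_and_expect(
    partial(_bfd_peer_up, tgen.gears["r1"]), True, count=60, wait=1
)
assert success, "BFD session never converged"
```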
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Debugs take up a significant amount of cpu time and also
increase the disk space needed to store results. Reduce test
overhead by removing the debugs. Hopefully this helps
alleviate some of the overloading that we are seeing in
our CI systems.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Under load, the test system looks for upstream state only
once, immediately after starting the 2 streams of S,G data
flowing. Give the system some time to process the streams
and ensure that the upstream state actually shows up within
a small window.
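A minimal sketch of the kind of retry this implies, assuming the usual topotest helpers; the router and group are placeholders:
```
import json
from functools import partial
from lib import topotest
from lib.topogen import get_topogen

def _upstream_present(router, group="229.1.1.1"):
    # True once pimd shows upstream state for the group.
    output = json.loads(router.vtysh_cmd("show ip pim upstream json"))
    return group in output

tgen = get_topogen()
# Retry for a short window instead of sampling the upstream state exactly once.
success, _ = topotest.run_and_expect(
    partial(_upstream_present, tgen.gears["r1"]), True, count=30, wait=1
)
assert success, "S,G upstream state never showed up"
```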
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The test_ldp_pseudowires_after_link_down test
shuts a link down and then blindly waits 5 seconds
before assuming the test system is in a sane
state. Remove the sleep(5) and actually look for
the changed state for the route 2.2.2.2 that the
pseudowire actually depends on.
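Something along these lines replaces the sleep(5); the router name and the "installed" check are illustrative, the real test keys on the changed 2.2.2.2 state the pseudowire depends on:
```
import json
from functools import partial
from lib import topotest
from lib.topogen import get_topogen

def _route_2_2_2_2_ready(router):
    # True once zebra shows the post-link-down 2.2.2.2/32 route as installed.
    output = json.loads(router.vtysh_cmd("show ip route 2.2.2.2/32 json"))
    return any(r.get("installed") for r in output.get("2.2.2.2/32", []))

tgen = get_topogen()
success, _ = topotest.run_and_expect(
    partial(_route_2_2_2_2_ready, tgen.gears["r1"]), True, count=40, wait=0.5
)
assert success, "2.2.2.2/32 never reconverged after the link went down"
```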
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The test does this:
a) shut link down
b) test for ospf convergence
c) ensure the route is installed
On a heavily loaded system, c) is not guaranteed
to happen quickly. Give the system 10 extra seconds
to ensure it happens.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The route replace test was doing this sequence of events:
a) Create nhg
b) Install route w/ sharpd
c) Ensure it worked
d) Modify nhg
e) Ensure the route replace via the updated group worked
The problem is that the sharp code is doing this:
    /* Only send via ID if nhgroup has been successfully installed */
    if (nhgid && sharp_nhgroup_id_is_installed(nhgid)) {
        SET_FLAG(api.message, ZAPI_MESSAGE_NHG);
        api.nhgid = nhgid;
    } else {
        for (ALL_NEXTHOPS_PTR(nhg, nh)) {
            api_nh = &api.nexthops[i];
            zapi_nexthop_from_nexthop(api_nh, nh);
            i++;
        }
        api.nexthop_num = i;
    }
The created nhg has not been successfully installed (or at least
sharpd has not read the results yet) when it gets the command
to install the routes. As such it passes down the individual
nexthops instead, and the route replace is never going to work.
Modify the code to add a bit of sleep to allow sharpd to
get notified when the system is under load. At this point
there is no way to query sharpd about whether or not it
thinks its nhg is installed properly. This test is failing
all over the place for a bunch of people; let's get this
fixed so people can keep running.
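The workaround boils down to something like the following in the topotest; the nexthop group name, addresses and sleep length are placeholders, not the exact patch:
```
from time import sleep
from lib.topogen import get_topogen

tgen = get_topogen()
r1 = tgen.gears["r1"]
# Configure the nexthop group that the route replace will use.
r1.vtysh_cmd("configure terminal\nnexthop-group replace\n nexthop 192.168.1.1\n")
# Give sharpd a moment to hear back from zebra that the NHG is installed;
# there is currently no CLI to ask sharpd whether it considers it installed.
sleep(5)
# Only now install the route through the NHG so the later replace exercises it.
r1.vtysh_cmd("sharp install routes 3.3.3.1 nexthop-group replace 1")
```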
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The test_nexthop_groups function is failing occasionally
because the test executes 4 `sharp install routes` commands
in succession. When I dumped the rib on a failed test run,
only 2 of the 4 routes were in the rib, and the two that
were present were the last 2 installed.
The sharp daemon sets up an event process where it
installs routes `automatically`. If the previous run is
not finished, entering a new command to install routes
will prevent the previous batch from ever completing.
It is assumed that the user doesn't do stupid stuff here.
In this case I am just adding a small sleep between each
installation to let the test proceed.
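Illustratively, the change amounts to spacing out the installs; the prefixes, group name and sleep length are placeholders:
```
from time import sleep
from lib.topogen import get_topogen

tgen = get_topogen()
r1 = tgen.gears["r1"]
# Pause between the back-to-back installs so sharpd finishes one batch
# before the next command restarts its install event.
for start in ("2.2.2.1", "3.3.3.1", "4.4.4.1", "5.5.5.1"):
    r1.vtysh_cmd("sharp install routes {} nexthop-group A 1".format(start))
    sleep(1)
```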
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The isis_topo1 test has two functions that, immediately
after ensuring the routes are in isis, test whether they
are in the rib. Under system load I am seeing this test
fail because the routes are still queued in zebra. Modify
the zebra check to look for the proper isis route results
for up to 10 seconds.
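Roughly, the zebra-side check becomes a 10-second poll; the expected JSON would come from the test's existing reference data, and the file path and router here are illustrative:
```
import json
from functools import partial
from lib import topotest
from lib.topogen import get_topogen

def _isis_routes_in_rib(router, expected):
    # None once the RIB contains the expected ISIS routes, else the json diff.
    output = json.loads(router.vtysh_cmd("show ip route isis json"))
    return topotest.json_cmp(output, expected)

tgen = get_topogen()
expected = json.loads(open("r1/zebra_route.json").read())  # illustrative path
# Look for the proper results for ~10 seconds instead of asserting immediately.
_, result = topotest.run_and_expect(
    partial(_isis_routes_in_rib, tgen.gears["r1"], expected), None, count=20, wait=0.5
)
assert result is None, "ISIS routes still not in the RIB after 10 seconds"
```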
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Currently, we have a lot of checks in the CLI and NB layer to prevent
incompatible IS-types of circuits and areas. All these checks become
completely meaningless when the interface is moved between VRFs. If the
area IS-type is different in the new VRF, the previously performed
checks mean nothing and we still end up with an incorrect circuit IS
type. To actually prevent an incorrect IS type, all checks must be done
in the processing code.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
This code has two issues:
a) The loop that tests for successful installation re-installs
the route on every iteration. A system under load will
have trouble ensuring the route is installed, and the
repeated attempts do not help.
b) The nexthop group installation was always failing,
but this was never noticed (because of the issue fixed
in the previous commit), so the test was always passing
when it should never have passed.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The test checks installation of seg6 routes with this
loop:
    for up to 5 times:
        sharp install seg6 route
        show ip route and check whether it is installed
The problem is that if the system is under heavy
load the installation may not have happened yet
and by immediately reinstalling the same route
the same thing could happen again.
Modify the code to pull the route installation
outside of the loop and to increase the number of attempts
to 10 in case there is very heavy system load.
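Structurally the fix looks like this; the seg6 install command, prefix and addresses are placeholders standing in for what the test already uses:
```
import json
from functools import partial
from lib import topotest
from lib.topogen import get_topogen

def _seg6_route_installed(router, prefix="1::1/128"):
    # True once zebra reports the seg6 route as installed.
    output = json.loads(router.vtysh_cmd("show ipv6 route {} json".format(prefix)))
    return any(r.get("installed") for r in output.get(prefix, []))

tgen = get_topogen()
r1 = tgen.gears["r1"]
# Install the seg6 route exactly once, outside of any retry loop...
r1.vtysh_cmd("sharp install seg6-routes 1::1 nexthop-seg6 2001::2 encap a::b 1")
# ...then poll up to 10 times for it to land in the RIB.
success, _ = topotest.run_and_expect(
    partial(_seg6_route_installed, r1), True, count=10, wait=1
)
assert success, "seg6 route never became installed"
```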
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The check_ping function's inner `_check` function was asserting while
being passed to the topotest.run_and_expect() functionality, causing
it to not run the full range of pings if one of them failed.
So effectively it was properly detecting pass / failure but
only allowing for 1 iteration if it was going to fail.
Modify the code to not assert and to act like all the other
run_and_expect users.
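A sketch of the reworked helper, assuming the usual topotest imports; the retry counts and ping options are illustrative:
```
import functools
from lib import topotest
from lib.topogen import get_topogen

def check_ping(name, dest_addr, expect_connected):
    def _check(name, dest_addr, match):
        # Return None on the expected outcome, or a string otherwise, so
        # run_and_expect can keep retrying instead of dying on an assert.
        tgen = get_topogen()
        output = tgen.gears[name].run("ping {} -c 1 -w 1".format(dest_addr))
        if match not in output:
            return "ping fail"
        return None

    match = ", 0% packet loss" if expect_connected else ", 100% packet loss"
    test_func = functools.partial(_check, name, dest_addr, match)
    _, result = topotest.run_and_expect(test_func, None, count=10, wait=1)
    assert result is None, "ping from {} to {} did not behave as expected".format(
        name, dest_addr
    )
```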
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
isis_tlvs.c would fail at multiple places if incorrect
TLVs were received in unpack_item_ext_subtlvs(),
causing stream assertion violations.
Signed-off-by: Juraj Vijtiuk <juraj.vijtiuk@sartura.hr>
The bgp_l3vpn_to_direct test is failing sometimes because
the 2.2.2.2 route is disappearing. What is happening?
The log file for the failed test run shows us this:
build 15-Oct-2021 07:26:12 scripts/adjacencies.py:8 WAIT:r4:ping 2.2.2.2 -c 1: 0. packet loss:wait:PE->P2 (loopback) ping:60:0.5:
build 15-Oct-2021 07:26:12 Fri Oct 15 14:26:12 2021 (#9) scripts/adjacencies.py:8 COMMAND:r4:ping 2.2.2.2 -c 1: 0. packet loss:wait:PE->P2 (loopback) ping:
build 15-Oct-2021 07:26:12 COMMAND OUTPUT:PING 2.2.2.2 (2.2.2.2) 56(84) bytes of data.
build 15-Oct-2021 07:26:12 64 bytes from 2.2.2.2: icmp_seq=1 ttl=64 time=0.143 ms
build 15-Oct-2021 07:26:12
build 15-Oct-2021 07:26:12 --- 2.2.2.2 ping statistics ---
build 15-Oct-2021 07:26:12 1 packets transmitted, 1 received, 0% packet loss, time 0ms
build 15-Oct-2021 07:26:12 rtt min/avg/max/mdev = 0.143/0.143/0.143/0.000 ms:
build 15-Oct-2021 07:26:12 Done after 1 loops, time=0.024507761001586914, Found= 0% packet loss
build 15-Oct-2021 07:26:12 Fri Oct 15 14:26:12 2021 (#9) scripts/adjacencies.py:9 COMMAND:r4:ping 2.2.2.2 -c 1: 0. packet loss:pass:PE->P2 (loopback) ping +0.02 secs:
build 15-Oct-2021 07:26:12 2021-10-15 14:26:12,446 WARNING: topolog.r4: LinuxNamespace(r4): proc failed: rc 2 pid 28826
build 15-Oct-2021 07:26:12 args: /usr/bin/nsenter -a -t 27444 -F --wd=/tmp/topotests/bgp_l3vpn_to_bgp_direct.test_bgp_l3vpn_to_bgp_direct/r4 /bin/bash -c ping 2.2.2.2 -c 1
build 15-Oct-2021 07:26:12 stdout: connect: Network is unreachable:
build 15-Oct-2021 07:26:17 COMMAND OUTPUT:connect: Network is unreachable:
build 15-Oct-2021 07:26:17 R:9 r4 PE->P2 (loopback) ping +0.02 secs 0 1
So the 2.2.2.2 route is coming/going and is failing on these test lines:
luCommand(
    "r1", "ping 2.2.2.2 -c 1", " 0. packet loss", "wait", "PE->P2 (loopback) ping", 60
)
luCommand(
    "r3", "ping 2.2.2.2 -c 1", " 0. packet loss", "wait", "PE->P2 (loopback) ping", 60
)
luCommand(
    "r4", "ping 2.2.2.2 -c 1", " 0. packet loss", "wait", "PE->P2 (loopback) ping", 60
)
So the 2.2.2.2 routes on r1, r3 and r4 are received via ospf, but are
modified by some other process to add labels (probably ldp, since
it is running too). The 2nd ping to 2.2.2.2 is failing because
the 2.2.2.2 route on r4 is being replaced. As an example here
is `ip monitor all` on r4 during boot up. Please note timestamps
are not necessarily representative of what we will see on the
loaded ci system.
[2021-10-15T15:46:52.261456] [NEXTHOP]id 27 via 10.0.2.2 dev r4-eth0 scope link proto zebra
[2021-10-15T15:46:52.261490] [ROUTE]2.2.2.2 nhid 27 via 10.0.2.2 dev r4-eth0 proto ospf metric 20
<snip>
[2021-10-15T15:46:53.556405] [NEXTHOP]Deleted id 27 via 10.0.2.2 dev r4-eth0 scope link proto zebra
<snip>
[2021-10-15T15:46:53.566575] [NEXTHOP]id 32 via 10.0.2.2 dev r4-eth0 scope link proto zebra
[2021-10-15T15:46:53.566585] [ROUTE]2.2.2.2 nhid 32 via 10.0.2.2 dev r4-eth0 proto ospf metric 20
For a small amount of time the route was *gone*. I believe the upstream
CI system hits that window sometimes, causing the test to fail.
This patch attempts to ensure that the 2.2.2.2 route is learned
appropriately (thus slowing the test down) before it moves on to
the ping. I suspect the long term answer might be to add a check to
the scripts/adjacencies.py script so that the test does not
continue until the appropriate label is in place, but for now I want
to make the test a bit more prescriptive in what it is looking
for here.
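Illustratively, the adjacencies script gains waits along these lines before the pings; the regexp and check are simplified, the real change is more prescriptive about the labeled route:
```
# Wait until the kernel RIB on each router actually holds 2.2.2.2 via OSPF
# before the ping steps run, so the brief window where the route is being
# replaced does not fail the test.
for rtr in ("r1", "r3", "r4"):
    luCommand(
        rtr, "ip route show 2.2.2.2", "proto ospf", "wait",
        "2.2.2.2 (re)installed on " + rtr, 60
    )
```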
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Recent commit 83f325901a accidentally turned a 1 second wait
between retries into a 10 second wait. 10 seconds is too long.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The test doesn't wait long enough when it checks the routers after
restart. On slower systems, it frequently failed as it ran out
of time.
Signed-off-by: Martin Winter <mwinter@opensourcerouting.org>
When our ci test system is under high load, expecting bfd to converge
in under 2 seconds is not going to happen. Modify the test suites
to just ensure that things converge. If we need actual functional
testing of bfd response times, the topotests are not an appropriate
place to do it; alternatively we would need to modify the test system
to gather data on how long convergence takes after the tests are run.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
During a local CI run, bgp_ecmp_topo3 was failing
to properly notice the fast-convergence command
issued before the interface is shut down. As
such there exists a race condition where under
high load the zebra process can actually shut
an interface down before we have properly ensured
that fast convergence is on for ibgp.
Modify the test in two ways:
1) Ensure that the previous section verifies that we have
properly converged after bringing the interfaces back up,
instead of assuming that we have done so.
2) After issuing the fast-convergence command, ensure that
bgp has fully processed it and is ready to treat the
interface down events as triggers for shutting down the
ibgp session.
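One way to express check 2), as a rough sketch; the ASN and router name are placeholders, and checking the running config is only a stand-in for however the test confirms bgpd has processed the knob:
```
from functools import partial
from lib import topotest
from lib.topogen import get_topogen

def _fast_convergence_applied(router):
    # True once bgpd's running config reflects the knob we just set.
    return "bgp fast-convergence" in router.vtysh_cmd("show running-config")

tgen = get_topogen()
r2 = tgen.gears["r2"]
r2.vtysh_cmd("configure terminal\nrouter bgp 64512\nbgp fast-convergence\n")
# Do not shut the interface until bgpd has really processed the command.
success, _ = topotest.run_and_expect(
    partial(_fast_convergence_applied, r2), True, count=10, wait=1
)
assert success, "bgpd never showed fast-convergence in its running config"
```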
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
On a local CI run, test_ldp_topo1.py showed a failure to converge
on r3. r3 has 2 neighbors but only 1 was up when we got to
further steps in the test suite.
Modify the neighbor checking to `know` how many neighbors
should be operational and to keep looking for them until
they are up and running.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Previously, when a valgrind memleak was discovered, it would cause a
catastrophic pytest failure. Now it correctly fails the current pytest
as intended.
As a result of this fix --valgrind-memleaks now works in distributed
pytest mode as well.
Signed-off-by: Christian Hopps <chopps@labn.net>
Revert the accidental enabling of the optional memleak tests that came
with the large micronet changeset.
Signed-off-by: Christian Hopps <chopps@labn.net>
The nexthop group code is installing routes and nexthop groups
and immediately expecting zebra to have processed them. As a
result, when the CI system is under intense load, the nexthop
group might not have been processed yet.
Add a bit of code to allow the test to give FRR some time
to finish the work before declaring it not working.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When the CI system is heavily loaded, we might see the following failures:
```
test failed at "test_config_timing/test_static_timing": assert 20.083204 <= 19.487716
```
Currently we allow each step to run 2 times slower than the initial
measurement. Let's allow them to run 3 times slower.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
On the first step, the test creates 10000 static routes. It passes 10000
to `get_ip_networks` and it generates 10000 /22 routes.
On the fourth step, the test tries to remove 5000 of the previously
created routes. It passes 5000 to `get_ip_networks`, and here the
problem starts. Instead of generating 5000 /22 routes, it generates
5000 /21 routes. The whole step becomes a no-op, and we constantly see
the following logs:
```
% Refusing to remove a non-existent route
```
To consistently generate the same routes, `get_ip_networks` must always
use the same prefix length.
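A sketch of what "always the same prefix length" can look like; the extra base_count parameter is hypothetical and simply pins the subnet size to the originally configured batch (10000 routes off a /8 super prefix gives /22s):
```
import ipaddress

def get_ip_networks(super_prefix, base_count, count):
    # Derive the prefix length from the fixed base_count, not from `count`,
    # so removing 5000 routes later yields the same /22s that were added.
    prefixlen_diff = (base_count - 1).bit_length()
    return tuple(super_prefix.subnets(prefixlen_diff))[0:count]

# Both calls now return /22 networks starting at 10.0.0.0/22:
nets_add = get_ip_networks(ipaddress.ip_network("10.0.0.0/8"), 10000, 10000)
nets_del = get_ip_networks(ipaddress.ip_network("10.0.0.0/8"), 10000, 5000)
assert nets_add[0] == nets_del[0]
```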
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>