mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-11-02 11:01:16 +00:00

Author	SHA1	Message	Date
Quentin Young	712e40c409	Merge pull request #11831 from anlancs/fix/cleanup-default zebra: remove unnecessary check for default vrf	2023-07-18 15:15:39 +00:00
Donatas Abraitis	ef87237121	Merge pull request #14033 from donaldsharp/zebra_same_route Zebra same route	2023-07-18 10:37:15 +03:00
Donald Sharp	788cf6e892	Merge pull request #14025 from guoguojia2021/guozhongfeng_alibaba zebra: The command ipv6 nht xxx not work	2023-07-17 14:27:56 -04:00
Donald Sharp	af80201876	zebra: Further handle route replace semantics When an upper level protocol is installing a route X that needs to be route replaced and at the same time the same or another protocol installs a different route that depends on route X for nexthop resolution can leave us with a state where the route is not accepted because zebra is still really early in the route replace semantics ( route X is still on the work Queue to be processed ) then the dependent route would not be installed. This came up in the bgp_default_originate test cases frequently. Further extendd the ROUTE_ENTR_ROUTE_REPLACING flag to cover this case as well. This has come up because the early route processing queueing that was implemented late last year. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-17 10:00:32 -04:00
guozhongfeng	1193611f8e	zebra: The command ipv6 nht xxx not work If the command is ipv6 nht protocol route-map rmap, this parameter should use AFI_IP6 Signed-off-by: guozhongfeng <guozhongfeng.gzf@alibaba-inc.com>	2023-07-16 17:52:31 +08:00
anlan_cs	a99521a26f	zebra: Fix wrong vrf change procedure Currently the vrf change procedure for the deleted interface is after its deletion, it causes problem for upper daemons. Here is the problem of `bgp`: After deletion of one irrelevant interface in the same vrf, its `ifindex` is set to 0. And then, the vrf change procedure will send "ZEBRA_INTERFACE_DOWN" to `bgpd`. Normally, `bgp_nht_ifp_table_handle()` should igore this message for no correlation. However, it wrongly matched `ifindex` of 0, and removed the related routes for the down `bnc`. Adjust the location of the vrf change procedure to fix this issue. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2023-07-13 15:25:31 +08:00
anlan_cs	f8d94e8a62	zebra: remove unnecessary check for default vrf The default vrf is generally non-NULL, except when shutdown. So, most of the time it is not necessary to check if it is NULL, we should remove the useless checks for it. Searched them with exact match: ``` grep -rI "zebra_vrf_lookup_by_id(VRF_DEFAULT)" \| wc -l 31 ``` Signed-off-by: anlan_cs <vic.lan@pica8.com>	2023-07-12 17:00:27 +08:00
Russ White	916feb7acc	Merge pull request #13885 from donaldsharp/tests_need_to_be_stricter Tests need to be stricter	2023-07-11 11:49:38 -04:00
Russ White	89aba318f7	Merge pull request #13876 from LabNConsulting/mjs/nhrp_resolving Allow NHRP routes to validate incoming nexthops	2023-07-11 11:48:16 -04:00
Donald Sharp	c8971388a9	Merge pull request #13958 from opensourcerouting/fix/coverity Coverity fixes	2023-07-11 11:26:47 -04:00
Russ White	f0f2c7be41	Merge pull request #13964 from pguibert6WIND/mpls_again zebra: fix mpls config on ifaces created post frr	2023-07-11 10:12:04 -04:00
anlan_cs	5581a7fc08	zebra: adjust one debug info Adjust one debug info, separate the ip address from it. Just like it is processed in `redistribute_update()`. Before: ``` 34:1375.75.75.75/32: Redist del: re 0x55c1112067e0 (0:static), new re 0x55c1112de7c0 (0:static) ``` After: ``` (34:13):75.75.75.75/32: Redist del: re 0x55c1112067e0 (0:static), new re 0x55c1112de7c0 (0:static) ``` Signed-off-by: anlan_cs <vic.lan@pica8.com>	2023-07-11 13:36:09 +08:00
Mark Stapp	bb58cad150	zebra: use NHRP routes as valid in nexthop check Treat NHRP-installed routes as valid, as if they were CONNECTED routes, when checking candidate routes' nexthops for validity. This allows use of NHRP by an IGP, for example, that doesn't normally want recursive nexthop resolution. Signed-off-by: Mark Stapp <mjs@labn.net>	2023-07-10 16:43:53 -04:00
Donatas Abraitis	4bd04364ad	zebra: Guard printing an error by checking if VRF is not NULL Check if vrf_lookup_by_id() didn't return a NULL before dereferencing in flor_err(). Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-07-10 22:37:35 +03:00
Donatas Abraitis	f5fee8dd54	zebra: Check if ifp is not NULL in zebra_if_update_ctx() Use the same logic as zebra_if_netconf_update_ctx(). Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-07-10 22:37:33 +03:00
Donatas Abraitis	803375ac69	zebra: Do not check ifp for NULL It's already checked at the bottom of the function. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-07-10 22:36:59 +03:00
Donald Sharp	f4c29914b5	zebra: Lookup up nlsock * one time in call tree Code is looking up the nlsock to generate the batch messages and then looking it up again to get the response. Let's just look it up one time. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-10 09:06:40 -04:00
Philippe Guibert	71b0b0d3b3	zebra: fix mpls config on ifaces created post frr The mpls configuration does not work when an interface is created after having applied the frr configuration. The below scenario illustrates: > root@dut:~# modprobe mpls > root@dut:~# zebra & > [..] > dut(config)# interface ifacenotcreated > dut(config-if)# mpls enable > dut(config-if)# Ctrl-D > root@dut:~# ip li show ifacenotcreated > Device "ifacenotcreated" does not exist. > root@dut:~# ip li add ifacenotcreated type dummy > 0 Fix this by forcing the mpls flag when the interface is detected. > root@dut:~# cat /proc/sys/net/mpls/conf/ifacenotcreat/input > 1 Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2023-07-09 21:57:01 +02:00
Carmine Scarpitta	7f2dec4f09	zebra: Fix crash when `dplane_fpm_nl` fails to process received routes When `dplane_fpm_nl` receives a route, it allocates memory for a dplane context and calls `netlink_route_change_read_unicast_internal` without initializing the `intf_extra_list` contained in the dplane context. If `netlink_route_change_read_unicast_internal` is not able to process the route, we call `dplane_ctx_fini` to free the dplane context. This causes a crash because `dplane_ctx_fini` attempts to access the intf_extra_list which is not initialized. To solve this issue, we can call `dplane_ctx_route_init`to initialize the dplane route context properly, just after the dplane context allocation. (gdb) bt #0 0x0000555dd5ceae80 in dplane_intf_extra_list_pop (h=0x7fae1c007e68) at ../zebra/zebra_dplane.c:427 #1 dplane_ctx_free_internal (ctx=0x7fae1c0074b0) at ../zebra/zebra_dplane.c:724 #2 0x0000555dd5cebc99 in dplane_ctx_free (pctx=0x7fae2aa88c98) at ../zebra/zebra_dplane.c:869 #3 dplane_ctx_free (pctx=0x7fae2aa88c98, pctx@entry=0x7fae2aa78c28) at ../zebra/zebra_dplane.c:855 #4 dplane_ctx_fini (pctx=pctx@entry=0x7fae2aa88c98) at ../zebra/zebra_dplane.c:890 #5 0x00007fae31e93f29 in fpm_read (t=) at ../zebra/dplane_fpm_nl.c:605 #6 0x00007fae325191dd in thread_call (thread=thread@entry=0x7fae2aa98da0) at ../lib/thread.c:2006 #7 0x00007fae324c42b8 in fpt_run (arg=0x555dd74777c0) at ../lib/frr_pthread.c:309 #8 0x00007fae32405ea7 in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0 #9 0x00007fae32325a2f in clone () from /lib/x86_64-linux-gnu/libc.so.6 Fixes: #13754 Signed-off-by: Carmine Scarpitta <carmine.scarpitta@uniroma2.it>	2023-07-07 10:59:28 +02:00
Carmine Scarpitta	745a0fcbb2	zebra: Abstract `dplane_ctx_route_init` to init route without copying The function `dplane_ctx_route_init` initializes a dplane route context from the route object passed as an argument. Let's abstract this function to allow initializing the dplane route context without actually copying a route object. This allows us to use this function for initializing a dplane route context when we don't have any route to copy in it. Signed-off-by: Carmine Scarpitta <carmine.scarpitta@uniroma2.it>	2023-07-07 10:59:28 +02:00
Russ White	1e9e82e803	Merge pull request #13396 from donaldsharp/interface_is_interface move interface ( LINK and ADDR ) events to the dplane	2023-07-06 08:31:16 -04:00
Donatas Abraitis	2ec7477a26	Merge pull request #13808 from anlancs/fix/zebra-kernel-route-reserved zebra: fix wrong nexthop check for kernel routes	2023-07-06 09:01:21 +03:00
Donald Sharp	605df8d44f	zebra: Use zebra dplane for RTM link and addr a) Move the reads of link and address information into the dplane b) Move the startup read of data into the dplane as well. c) Break up startup reading of the linux kernel data into multiple phases. As that we have implied ordering of data that must be read first and if the dplane has taken over some data reading then we must delay initial read-in of other data. Fixes: #13288 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 13:03:14 -04:00
Donald Sharp	a014450441	zebra: Add code to get/set interface to pass up from dplane 1) Add a bunch of get/set functions and associated data structure in zebra_dplane to allow the setting and retrieval of interface netlink data up into the master pthread. 2) Add a bit of code to breakup startup into stages. This is because FRR currently has a mix of dplane and non dplane interactions and the code needs to be paused before continuing on. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 13:03:14 -04:00
Donald Sharp	487a96a35f	zebra: Remove duplicate function for netlink interface changes Turns out FRR has 2 functions one specifically for startup and one for normal day to day operations. There were only a couple of minor differences from what I could tell, and where they were different the after startup functionality should have been updated too. I cannot figure out why we have 2. Non-startup handling of bonds appears to be incorrect so let's fix that. Additionally the speed was not properly being set in non-startup situations. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 13:03:14 -04:00
Donald Sharp	bc0bac5524	zebra: Remove unused add variable Function was not using the add variable. Remove it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 11:49:36 -04:00
Donald Sharp	cd7324dfa6	zebra: Remove unused dplane_intf_delete There is no need for this functionality and it is not used. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 11:49:36 -04:00
Donald Sharp	c3c9683f99	zebra: Move protodown_r_bit to a better spot Since we are moving some code handling out of the dataplane and into zebra proper, lets move the protodown r bit as well. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 11:49:36 -04:00
Donald Sharp	6a3ae11c9b	zebra: Rename vrf_lookup_by_tableid to zebra_vrf_lookup.. Rename the vrf_lookup_by_id function to zebra_vrf_lookup_by_id and move to zebra_vrf.c where it nominally belongs, as that we need zebra specific data to find this vrf_id and as such it does not belong in vrf.c Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-05 11:49:36 -04:00
Mark Stapp	d6caf0dbd7	Merge pull request #13875 from donaldsharp/static_dplane_issues zebra: Static routes async notification do not need this test	2023-07-05 08:27:23 -04:00
Donatas Abraitis	9a0bb7bcd1	Merge pull request #13333 from donaldsharp/vrf_bitmap_cleanup *: Rearrange vrf_bitmap_X api to reduce memory footprint	2023-07-04 22:11:11 +03:00
anlan_cs	098519caf8	zebra: fix wrong nexthop check for kernel routes When changing one interface's vrf, the kernel routes are wrongly kept in old vrf. Finally, the forwarding table in that old vrf can't forward traffic correctly for those residual entries. Follow these steps to make this problem happen: ( Firstly, "x1" interface of default vrf is with address of "6.6.6.6/24". ) ``` anlan# ip route add 4.4.4.0/24 via 6.6.6.8 dev x1 anlan# ip link add vrf1 type vrf table 1 anlan# ip link set vrf1 up anlan# ip link set x1 master vrf1 ``` Then check `show ip route`, the route of "4.4.4.0/24" is still selected in default vrf. If the interface goes down, the kernel routes will be reevaluated. Those kernel routes with active interface of nexthop can be kept no change, it is a fast path. Otherwise, it enters into slow path to do careful examination on this nexthop. After the interface's vrf had been changed into new vrf, the down message of this interface came. It means the interface is not in old vrf although it still exists during that checking, so the kernel routes should be dropped after this nexthop matching against a default route in slow path. But, in current code they are wrongly kept in fast path for not checking vrf. So, modified the checking active nexthop with vrf comparision for the interface during reevaluation. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2023-07-02 10:30:09 +08:00
anlan_cs	caf896d6ef	zebra: Remove unnecessary condition check for kernel routes There are relaxed nexthop requirements for kernel routes because we trust kernel routes. Two minor changes for kernel routes: 1. `if_is_up()` is one of the necessary conditions for `if_is_operative()`. Here, we can remove this unnecessary check for clarity. 2. Since `nexthop_active()` doesn't distinguish whether it is kernel route, modified the corresponding comment in it. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2023-07-02 10:30:09 +08:00
Donatas Abraitis	64510b9467	zebra: Dump route details when deleting a route Just more details what's going on when deleting a route. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-06-29 17:39:45 +03:00
Donald Sharp	d0123a9012	zebra: Static routes async notification do not need this test When using asic_offload with an asynchronous notification the rib_route_match_ctx function is testing for distance and tag being correct against the re. Normal route notification for static routes is this(well really all routes): a) zebra dplane generates a ctx to send to the dplane for route install b) dplane installs it in the kernel c) if the dplane_fpm_nl.c module is being used it installs it. d) The context's success code is set to it worked and passes the context back up to zebra for processing. e) Zebra master receives this and checks the distance and tag are correct for static routes and accepts the route and marks it installed. If the operator is using a wait for install mechansim where the dplane is asynchronously sending the result back up at a future time and it is using the dplane_fpm_nl.c code where it uses the rt_netlink.c route parsing code, then there is no way to set distance as that we do not pass distance to the kernel. As such static routes were never being properly handled since the re and context would not match and the route would still be marked as queued. Modify the code such that the asynchronous path notification for static routes ignores the distance and tag's as that there is no way to test for this data from that path at this point in time. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-06-29 09:35:00 -04:00
Mark Stapp	59b8965aa6	Merge pull request #13861 from opensourcerouting/fix/memory_leak_zserv zebra: Free Zebra client resources	2023-06-28 08:18:11 -04:00
Donatas Abraitis	97072d144e	zebra: Free Zebra client resources Memory leaks started flowing: ``` AddressSanitizer Topotests Part 0: 15 KB -> 283 KB AddressSanitizer Topotests Part 1: 1 KB -> 495 KB AddressSanitizer Topotests Part 2: 13 KB -> 478 KB AddressSanitizer Topotests Part 3: 39 KB -> 213 KB AddressSanitizer Topotests Part 4: 30 KB -> 836 KB AddressSanitizer Topotests Part 5: 0 bytes -> 356 KB AddressSanitizer Topotests Part 6: 86 KB -> 783 KB AddressSanitizer Topotests Part 7: 0 bytes -> 354 KB AddressSanitizer Topotests Part 8: 0 bytes -> 62 KB AddressSanitizer Topotests Part 9: 408 KB -> 518 KB ``` ``` Direct leak of 3584 byte(s) in 1 object(s) allocated from: #0 0x7f1957b02d28 in __interceptor_calloc (/usr/lib/x86_64-linux-gnu/libasan.so.4+0xded28) #1 0x559895c55df0 in qcalloc lib/memory.c:105 #2 0x559895bc1cdf in zserv_client_create zebra/zserv.c:743 #3 0x559895bc1cdf in zserv_accept zebra/zserv.c:880 #4 0x559895cf3438 in event_call lib/event.c:1995 #5 0x559895c3901c in frr_run lib/libfrr.c:1213 #6 0x559895a698f1 in main zebra/main.c:472 #7 0x7f195635ec86 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x21c86) ``` Fixes `b20acd0` ("bgpd: Use synchronous way to get labels from Zebra") Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-06-27 22:48:39 +03:00
Russ White	1f08a055a8	Merge pull request #13852 from mjstapp/fix_opq_cov_msg zebra: clean up coverity warning in opaque api	2023-06-27 11:28:31 -04:00
Chirag Shah	a7d77ee58b	zebra: fix evpn rmac nh list cmp function EVPN RMAC (Router MAC) nexthop list compare function needs to return all values so the list element can be compared and added/deleted properly. Ticket:#3486989 Testing Done: Originate EVPN Type-5 route with PIP IP and MAC as remote nexthops. Change the PIP IP address which triggers nexthop change. Before fix: When PIP IP changes RMAC is deleted from remote VTEPs. TORS1# show evpn next-hops vni 4001 \| include 00:02:00:00:00:2d 27.0.0.11 00:02:00:00:00:2d TORS1# show evpn rmac vni 4001 \| include 00:02:00:00:00:2d 00:02:00:00:00:2d 27.0.0.11 ----- Remote VTEP change nexthop IP to 172.16.16.16 ----- TORS1# show evpn next-hops vni 4001 \| include 00:02:00:00:00:2d 172.16.16.16 00:02:00:00:00:2d TORS1# show evpn rmac vni 4001 \| include 00:02:00:00:00:2d TORS1# After fix: RMAC is retained as its nexthop list is not empty, thus it is not deleted from remote VTEPs. TORS1# show evpn rmac vni 4001 \| include 00:02:00:00:00:2d 00:02:00:00:00:2d 172.16.16.16 Log: 2023/06/27 00:50:36.833474 ZEBRA: [XREH0-ZYMH6] L3VNI 4001 Remote VTEP change(27.0.0.11 -> 172.16.16.16) for RMAC 00:02:00:00:00:2d Signed-off-by: Chirag Shah <chirag@nvidia.com>	2023-06-26 17:59:16 -07:00
Donald Sharp	161972c9fe	: Rearrange vrf_bitmap_X api to reduce memory footprint When running all daemons with config for most of them, FRR has sharpd@janelle:~/frr$ vtysh -c "show debug hashtable" \| grep "VRF BIT HASH" \| wc -l 3570 3570 hashes for bitmaps associated with the vrf. This is a very large number of hashes. Let's do two things: a) Reduce the created size of the actually created hashes to 2 instead of 32. b) Delay generation of the hash until* a set operation happens. As that no hash directly implies a unset value if/when checked. This reduces the number of hashes to 61 in my setup for normal operation. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-06-26 14:59:21 -04:00
Mark Stapp	0ee56dd332	zebra: clean up coverity warning in opaque api Seems a bit fussy of coverity, but ... don't NULL a variable unnecessarily. Signed-off-by: Mark Stapp <mjs@labn.net>	2023-06-26 13:19:23 -04:00
Mark Stapp	de1a9ce0a7	zebra: support notifications for opaque ZAPI messages Allow zapi clients to register to be notified when a server for an opaque message type is present. Zebra maintains these notification registrations in the same data structures that it uses for opaque message handling. Signed-off-by: Mark Stapp <mjs@labn.net>	2023-06-23 08:57:37 -04:00
Mark Stapp	ef8e3ac02c	lib, zebra: include source client zapi info in opaque messages Include the sending zapi client info (proto, instance, and session id) in each opaque zapi message. Add opaque 'init' apis for clients who want to encode their opaque data inline, into the zclient's internal stream buffer. Use these init apis in the TE/link-state lib code, instead of hand-coding the zapi opaque header info. Signed-off-by: Mark Stapp <mjs@labn.net>	2023-06-23 08:27:42 -04:00
Donatas Abraitis	3cbc7150bb	Merge pull request #13545 from idryzhov/remove-bond-slave zebra: remove ZEBRA_IF_BOND_SLAVE interface type	2023-06-23 11:01:19 +03:00
Donatas Abraitis	52dde8747b	zebra: Ignore non GR-aware zclient handling for BGP This is for synchronous client (label/table manager) - aka session_id == 1. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-06-20 20:50:40 +03:00
Donatas Abraitis	20c2c8787a	zebra: Show session id when printing an error when the client disconnects Before: ``` 2023/06/18 22:00:42 ZEBRA: [VXKFG-8SJRV][EC 4043309121] Client 'bgp' encountered an error and is shutting down. 2023/06/18 22:00:42 ZEBRA: [VXKFG-8SJRV][EC 4043309121] Client 'bgp' encountered an error and is shutting down. ``` After: ``` 2023/06/18 22:06:44 ZEBRA: [N5M5Y-J5BPG][EC 4043309121] Client 'bgp' (session id 0) encountered an error and is shutting down. 2023/06/18 22:06:44 ZEBRA: [N5M5Y-J5BPG][EC 4043309121] Client 'bgp' (session id 1) encountered an error and is shutting down. ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-06-20 20:50:40 +03:00
Russ White	40502902f4	Merge pull request #13394 from mjstapp/fix_zebra_mpls_config zebra: clarify interface-level mpls config	2023-06-20 09:10:53 -04:00
Donald Sharp	f89d090230	Merge pull request #13755 from LabNConsulting/ziemba/zebra-dplane-priority zebra: bugfix dplane priority sorting	2023-06-13 10:36:57 -04:00
Mark Stapp	a32d40a676	zebra: clarify interface-level mpls config We have both interface-level configuration to enable mpls, and runtime mpls status. They need to be distinct. Signed-off-by: Mark Stapp <mjs@labn.net>	2023-06-12 16:41:27 -04:00
Mark Stapp	4112baec9f	pbrd, zebra: fix zapi and netlink rule encoding In pbrd, don't encode a rule without a table. There are cases where the zapi encoding was incorrect because the 4-octet table id was missing. In zebra, mask off the ECN bits in the TOS byte when encoding an iprule to match netlink's expectation. Signed-off-by: Mark Stapp <mjs@labn.net>	2023-06-12 16:39:26 -04:00

1 2 3 4 5 ...

5461 Commits