Move a few things into the places they actually belong, and reduce the
number of places we have `#ifdef HAVE_RTADV`. Just overall code
prettification.
... I had actually done this quite a while ago while doing some other
random hacking and thought it more useful to not be sitting on it on my
disk...
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
The parent node of "vrf" MUST be non-NULL, so the check is unnecessary and
misleading. Keeping it implies a NULL-parent branch that can never happen,
so remove it.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
The kernel supports vni filtering on an l3vxlan (l3vni) device,
similar to vlan filtering on a bridge device.
To receive these netlink notifications, FRR has to register
for the new netlink RTNLGRP_TUNNEL message.
This group has to be joined via an additional
socket option because its id is beyond the group bitmap size.
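A minimal sketch of how a group beyond the 32-bit bitmap is joined (the wrapper name is illustrative; RTNLGRP_TUNNEL comes from linux/rtnetlink.h):
```c
#include <sys/socket.h>
#include <linux/netlink.h>
#include <linux/rtnetlink.h>

/* Groups above 31 cannot be expressed in sockaddr_nl.nl_groups at
 * bind() time, so they have to be joined with a socket option. */
static int netlink_join_group(int fd, unsigned int group)
{
	return setsockopt(fd, SOL_NETLINK, NETLINK_ADD_MEMBERSHIP,
			  &group, sizeof(group));
}

/* e.g. netlink_join_group(fd, RTNLGRP_TUNNEL); */
```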
kernel patches:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/
linux.git/commit/?h=v5.18-rc7&id=7b8135f4df98b155b23754b6065c157861e268f1
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/
linux.git/commit/?h=v5.18-rc7&id=f9c4bb0b245cee35ef66f75bf409c9573d934cf9
Ticket:#3073812
Testing Done:
Signed-off-by: Chirag Shah <chirag@nvidia.com>
Currently, `zif->es_info.esi` is set even in a few cases in
`zebra_evpn_local_es_update()` where it is unnecessary.
Delay setting `zif->es_info.esi` and remove the annoying rollback
(i.e. unsetting `zif->es_info.esi`) on the failure path.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
The global vrf in zebra is always non-NULL. In general, it is bound to
the default vrf by `zebra_vrf_init()`, and at other times bound to some
specific vrf; either way it is non-NULL.
So remove all redundant checks on the return value of
`zebra_vrf_get_evpn()`.
Additionally, remove the unnecessary check for `zvrf` in
`zebra_vxlan_cleanup_tables()`.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
RFC 7471 Section 4.2.7:
It is possible for min delay and max delay to be the same value.
Prior to this change, the code required min < avg < max. This
change allows min == avg and avg == max.
test case:
interface eth-rt1
link-params
delay 8000 min 8000 max 8000
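A minimal sketch of the relaxed bounds check described above (the function name is illustrative, not the actual link-params parser):
```c
#include <stdbool.h>
#include <stdint.h>

/* RFC 7471 permits min == avg and avg == max, so only reject a
 * configuration that breaks the ordering min <= avg <= max. */
static bool link_delay_bounds_valid(uint32_t min, uint32_t avg, uint32_t max)
{
	return min <= avg && avg <= max;
}
```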
Signed-off-by: G. Paul Ziemba <paulz@labn.net>
Firstly, calls to `hash_get()` with a NULL `alloc_func` are left
*unchanged*; only the cases with a non-NULL `alloc_func` are touched.
Since `hash_get()` with a non-NULL `alloc_func` cannot fail, the return
value can safely be ignored: it is never NULL. In that case, remove the
unnecessary NULL check on the return value and cast the call to `void`.
Importantly, two patterns with a non-NULL `alloc_func` are also left
*unchanged*:
1) Use `assert(<returned_data> == <searching_data>)` to
ensure the call created a new node rather than finding an existing one.
Refer to `isis_vertex_queue_insert()` of isisd; there
are many examples of this case in isisd.
2) Use `<returned_data> != <searching_data>` to detect that an
existing node was found, then free <searching_data>.
Refer to `aspath_intern()` of bgpd; there are many
examples of this case in bgpd.
Here, <returned_data> is the value returned from `hash_get()`,
and <searching_data> is the data being put into the
hash table.
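A sketch of these patterns, using FRR's `hash_get()` and `hash_alloc_intern()` (the surrounding caller is illustrative only):
```c
#include <assert.h>
#include <stdlib.h>
#include "lib/hash.h"

static void hash_get_patterns(struct hash *table, void *data)
{
	void *ret;

	/* Non-NULL alloc_func cannot fail and never returns NULL, so the
	 * result may simply be ignored with an explicit (void) cast. */
	(void)hash_get(table, data, hash_alloc_intern);

	/* Pattern 1: the caller requires a freshly created node
	 * (cf. isis_vertex_queue_insert() in isisd). */
	ret = hash_get(table, data, hash_alloc_intern);
	assert(ret == data);

	/* Pattern 2: interning (cf. aspath_intern() in bgpd) - if an
	 * existing node was found, free the temporary search copy. */
	ret = hash_get(table, data, hash_alloc_intern);
	if (ret != data)
		free(data);	/* placeholder for the real free routine */
}
```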
Signed-off-by: anlan_cs <vic.lan@pica8.com>
Don't rely on the OS interface name length definition and use the FRR
definition instead.
Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>
There's a common pattern of "get VRF context for CLI node" here, which
first got a helper macro in zebra that then permeated into pimd.
Unfortunately the pimd copy wasn't quite adjusted correctly and thus
caused two coverity warnings (CID 1517453, CID 1517454).
Fix the PIM one, and clean up by providing a common base macro in
`lib/vty.h`.
Also rename the macros (add `_VRF`) to make more clear what they do.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
1. Add a family field to the existing ZEBRA_IPMR_ROUTE_STATS message
to get both ipv4 and ipv6 traffic stats between pim and zebra.
2. Modify the debug to print both v4/v6 prefixes.
pimd: pim6d: Modify pim_zlookup_sg_statistics to get ipv6 stats
Modify the pim_zlookup_sg_statistics api to
get ipv4/ipv6 stats from zebra, making the api
common.
Signed-off-by: Mobashshera Rasool <mrasool@vmware.com>
Modify the structure mcast_route_data to store the ipv4/ipv6
address and lastused multicast information from the kernel.
Adjust the related APIs to parse ipv4/ipv6 information.
Signed-off-by: Mobashshera Rasool <mrasool@vmware.com>
Change this API call to use a `struct ipaddr`, which encodes the type of
IP address along with it. (And rename the command, removing the `IPV4`
from its name.)
Also add a comment explaining that this function call is going to be
obsolete in the long run since pimd needs to move to proper MRIB NHT.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
Add initial zebra tracepoint support infrastructure
as well as add a frr_zebra:netlink_interface
callback.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
If you are in a situation where you have multiple addresses on an
interface, zebra creates one connected route for them.
The issue is that the rib entry is not created if addresses were
added before the interface was running.
We add the address to a running interface in a typical flow.
Therefore, we handle the route & rib creation within a single ADD event.
In the opposite case, we create the route entries without activating them.
These are considered to be active since ZEBRA_IFC_DOWN is not set.
On the following interface UP, we ignore the same ADDR_ADD as it overlaps
with the existing prefixes -> rib is never created.
The minimal reproducible setup:
-----------------------------------------
ip link add name dummy0 type dummy
ip addr flush dev dummy0
ip link set dummy0 down
ip addr add 192.168.1.7/24 dev dummy0
ip addr add 192.168.1.8/24 dev dummy0
ip link set dummy0 up
vtysh -c 'show ip route' | grep dummy0
Signed-off-by: Volodymyr Huti <v.huti@vyos.io>
Operators are seeing:
Mar 28 07:19:37 kingpin zebra[418]: [TZANK-DEMSE] netlink_nexthop_msg_encode: nhg_id 68 (zebra): proto-based nexthops only, ignoring
Mar 28 07:19:37 kingpin zebra[418]: [TZANK-DEMSE] netlink_nexthop_msg_encode: nhg_id 68 (zebra): proto-based nexthops only, ignoring
Mar 28 07:19:37 kingpin zebra[418]: [YXPF5-B2CE0] netlink_route_multipath_msg_encode: RTM_DELROUTE 2804:4d48:4000::/42 vrf 0(254)
Mar 28 07:19:37 kingpin zebra[418]: [YXPF5-B2CE0] netlink_route_multipath_msg_encode: RTM_NEWROUTE 2804:4d48:4000::/42 vrf 0(254)
Mar 28 07:19:37 kingpin zebra[418]: [TVM3E-A8ZAG] _netlink_route_build_singlepath: (single-path): 2804:4d48:4000::/42 nexthop via fe80::b6fb:e4ff:fe26:c5d5 if 2 vrf default(0)
Mar 28 07:19:37 kingpin zebra[418]: [HYEHE-CQZ9G] nl_batch_send: netlink-dp (NS 0), batch size=140, msg cnt=2
Mar 28 07:19:37 kingpin zebra[418]: [P2XBZ-RAFQ5][EC 4043309074] Failed to install Nexthop ID (68) into the kernel
This happens when `zebra nexthop proto only` is turned on.
Effectively zebra intentionally does not install the nexthop group,
but the dplane notification handler in zebra_nhg.c assumes the install
failed and prints an error message. Since skipping the install was
intentional, let's recognize that and not report it as a failure.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Currently if an end user has something like this:
Routing entry for 192.168.212.1/32
Known via "kernel", distance 0, metric 100, best
Last update 00:07:50 ago
* directly connected, ens5
Codes: K - kernel route, C - connected, S - static, R - RIP,
O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP,
T - Table, v - VNC, V - VNC-Direct, A - Babel, F - PBR,
f - OpenFabric,
> - selected route, * - FIB route, q - queued, r - rejected, b - backup
t - trapped, o - offload failure
K>* 0.0.0.0/0 [0/100] via 192.168.212.1, ens5, src 192.168.212.19, 00:00:15
C>* 192.168.212.0/27 is directly connected, ens5, 00:07:50
K>* 192.168.212.1/32 [0/100] is directly connected, ens5, 00:07:50
and FRR does a link flap, zebra re-evaluates the routes and rejects the
default route:
2022/04/09 16:38:20 ZEBRA: [NZNZ4-7P54Y] default(0:254):0.0.0.0/0: Processing rn 0x56224dbb5b00
2022/04/09 16:38:20 ZEBRA: [ZJVZ4-XEGPF] default(0:254):0.0.0.0/0: Examine re 0x56224dbddc20 (kernel) status: Changed Installed flags: Selected dist 0 metric 100
2022/04/09 16:38:20 ZEBRA: [GG8QH-195KE] nexthop_active_update: re 0x56224dbddc20 nhe 0x56224dbdd950 (7), curr_nhe 0x56224dedb550
2022/04/09 16:38:20 ZEBRA: [T9JWA-N8HM5] nexthop_active_check: re 0x56224dbddc20, nexthop 192.168.212.1, via ens5
2022/04/09 16:38:20 ZEBRA: [M7EN1-55BTH] nexthop_active: Route Type kernel has not turned on recursion
2022/04/09 16:38:20 ZEBRA: [HJ48M-MB610] nexthop_active_check: Unable to find active nexthop
2022/04/09 16:38:20 ZEBRA: [JPJF4-TGCY5] default(0:254):0.0.0.0/0: After processing: old_selected 0x56224dbddc20 new_selected 0x0 old_fib 0x56224dbddc20 new_fib 0x0
So the 192.168.212.1 route is matched for the nexthop, but it is not connected
and zebra treats that as a problem. Modify the code such that if a system route
resolves through another system route, the resolution is accepted.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
This bug should only really affect kernel routes. To reproduce:
a) Have multiple connected routes that point to the same prefix
swp8 up default 169.254.0.250/30
swp9 up default 169.254.0.250/30
b) Have a kernel route that uses one of those connected routes
7.6.2.8 via 169.254.0.249 dev swp8 proto static
(But have it choose a non-selected connected nexthop)
c) Introduce an event that causes the rib table to be reprocessed,
say an unrelated interface going up / down
This causes the route to be lost with this message:
2022/03/28 21:21:53 ZEBRA: [YXCJP-0WZWV] netlink_nexthop_msg_encode: ID (3454): 169.254.0.249, via swp8(1383) vrf default(0)
2022/03/28 21:21:53 ZEBRA: [YF2E6-J60JH] nexthop_active: 169.254.0.249, via swp8 given ifindex does not match nexthops ifindex found found: directly connected, swp9
Effectively the nexthop that zebra is choosing would not be the one
that the kernel route has chosen, so FRR removes the route:
022/03/28 21:21:53 ZEBRA: [NM15X-X83N9] rib_process: (0:254):7.6.2.8/32: rn 0x56042e632e90, removing re 0x56042e6316e0
2022/03/28 21:21:53 ZEBRA: [Y53JX-CBC5H] rib_unlink: (0:254):7.6.2.8/32: rn 0x56042e632e90, re 0x56042e6316e0
2022/03/28 21:21:53 ZEBRA: [KT8QQ-45WQ0] rib_gc_dest: (0:?):7.6.2.8/32: removing dest from table
What is happening?
Zebra is not checking whether any of the other connected routes
has the appropriate ifindex; it just blindly rejects the route.
So when nexthop resolution matches a connected route and the
dest->selected nexthop ifindex does not match, let's sort
through the rest of the nexthops and see if any of them match,
and if so keep the route.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
This has already been a requirement for Solaris, it is still a
requirement for some of the autoconf feature checks to work correctly,
and it will be a requirement for `-fms-extensions`.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
Since there are two kinds of ESI (Type-0 and Type-3), the warnings
should distinguish between the two cases.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
It's confusing for a user to see 'Tx RA failed' in the logs when
they've enabled RAs (either through interface config or BGP unnumbered)
on an interface that can't send them. Let's avoid sending RAs on
interfaces that are bridge_slaves or don't have a link-local address,
since those are two of the most common reasons for RA Tx failures.
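A rough sketch of the kind of guard this adds; the helper names here are hypothetical, not the actual zebra functions:
```c
#include <stdbool.h>
#include "lib/if.h"

/* hypothetical helpers, not real FRR symbols */
static bool if_is_bridge_member(const struct interface *ifp);
static bool if_has_ipv6_link_local(const struct interface *ifp);

/* Hypothetical guard evaluated before transmitting an RA. */
static bool rtadv_can_tx(const struct interface *ifp)
{
	/* Bridge members and interfaces without an IPv6 link-local address
	 * are the two most common sources of 'Tx RA failed' noise. */
	if (if_is_bridge_member(ifp))
		return false;
	if (!if_has_ipv6_link_local(ifp))
		return false;
	return true;
}
```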
Signed-off-by: Trey Aspelund <taspelund@nvidia.com>
The vrf bound to `l2vni/zevpn` wrongly depends on the order
in which the vxlan interface and svi interface are configured.
If the vxlan interface is set with a vlanid first, and the svi interface
is then placed into a vrf, the vxlan interface correctly inherits the `vrf`
from the svi. But with the reverse sequence (i.e. svi first, then vxlan),
the vxlan interface can't get the correct `vrf`, because the handling of
`ZEBRA_VXLIF_VLAN_CHANGE` mistakenly missed inheriting the `vrf`.
```
host# do show evpn vni 101
VNI: 101
Type: L2
Tenant VRF: vrf1
```
So update `vrf` ("Tenant VRF") of l2vni in `zebra_vxlan_if_update()`.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
Since `NDA_VLAN` is no longer manually defined in the header file,
the check for `NDA_VLAN` should be removed.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
Like `zvni_map_to_svi_ns()` for `ns_walk_func()`, just use `assert`
instead of an unnecessary check.
The parameters passed to these `ns_walk_func()` callbacks, e.g. `in_param`
and others, must not be NULL. So use `assert` to ensure these parameters
and remove the unnecessary checks.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
When a client disconnects, we need to check & remove NHT entries for
other SAFIs too. Otherwise we crash later trying to access stale data.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
There exist code paths in the linux kernel where a dump command
will be interrupted (I am not sure I understand what this really
means) and the data sent back from the kernel is wrong or incomplete.
At this point in time I am not 100% certain what should be done, but
let's start noticing that this has happened so we can formulate a plan,
or at least allow the end operator to know bad stuff is afoot at the Circle K.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
In the FreeBSD code if you delete the interface
and it has no configuration, the ifp pointer will
be deleted from the system *but* zebra continues
to dereference the just freed pointer.
==58624== Invalid read of size 1
==58624== at 0x48539F3: strlcpy (in /usr/local/libexec/valgrind/vgpreload_memcheck-amd64-freebsd.so)
==58624== by 0x2B0565: ifreq_set_name (ioctl.c:48)
==58624== by 0x2B0565: if_get_flags (ioctl.c:416)
==58624== by 0x2B2D9E: ifan_read (kernel_socket.c:455)
==58624== by 0x2B2D9E: kernel_read (kernel_socket.c:1403)
==58624== by 0x499F46E: thread_call (thread.c:2002)
==58624== by 0x495D2B7: frr_run (libfrr.c:1196)
==58624== by 0x2B40B8: main (main.c:471)
==58624== Address 0x6baa7f0 is 64 bytes inside a block of size 432 free'd
==58624== at 0x484ECDC: free (in /usr/local/libexec/valgrind/vgpreload_memcheck-amd64-freebsd.so)
==58624== by 0x4953A64: if_delete (if.c:283)
==58624== by 0x2A93C1: if_delete_update (interface.c:874)
==58624== by 0x2B2DF3: ifan_read (kernel_socket.c:453)
==58624== by 0x2B2DF3: kernel_read (kernel_socket.c:1403)
==58624== by 0x499F46E: thread_call (thread.c:2002)
==58624== by 0x495D2B7: frr_run (libfrr.c:1196)
==58624== by 0x2B40B8: main (main.c:471)
==58624== Block was alloc'd at
==58624== at 0x4851381: calloc (in /usr/local/libexec/valgrind/vgpreload_memcheck-amd64-freebsd.so)
==58624== by 0x496A022: qcalloc (memory.c:116)
==58624== by 0x49546BC: if_new (if.c:164)
==58624== by 0x49546BC: if_create_name (if.c:218)
==58624== by 0x49546BC: if_get_by_name (if.c:603)
==58624== by 0x2B1295: ifm_read (kernel_socket.c:628)
==58624== by 0x2A7FB6: interface_list (if_sysctl.c:129)
==58624== by 0x2E99C8: zebra_ns_enable (zebra_ns.c:127)
==58624== by 0x2E99C8: zebra_ns_init (zebra_ns.c:214)
==58624== by 0x2B3FF2: main (main.c:401)
==58624==
Zebra needs to pass back whether or not the ifp pointer
was freed when if_delete_update is called and it should
then check in ifan_read as well as ifm_read that the
ifp pointer is still valid for use.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add new debug output to show the string of the message type that
is currently unhandled:
2022-03-24 18:30:15.284 [DEBG] zebra: [V3NSB-BPKBD] Kernel:
2022-03-24 18:30:15.284 [DEBG] zebra: [HDTM1-ENZNM] Kernel: message seq 792
2022-03-24 18:30:15.284 [DEBG] zebra: [MJD4M-0AAAR] Kernel: pid 594488, rtm_addrs {DST,GENMASK}
2022-03-24 18:30:15.285 [DEBG] zebra: [GRDRZ-0N92S] Unprocessed RTM_type: RTM_NEWMADDR(d)
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When running zebra w/ valgrind, it was noticed that zebra
was passing a bunch of uninitialized data to the kernel:
==38194== Syscall param ioctl(generic) points to uninitialised byte(s)
==38194== at 0x4CDF88A: ioctl (in /lib/libc.so.7)
==38194== by 0x49A4031: vrf_ioctl (vrf.c:860)
==38194== by 0x2AFE29: vrf_if_ioctl (ioctl.c:91)
==38194== by 0x2AFF39: if_get_mtu (ioctl.c:161)
==38194== by 0x2B12C3: ifm_read (kernel_socket.c:653)
==38194== by 0x2A7F76: interface_list (if_sysctl.c:129)
==38194== by 0x2E9958: zebra_ns_enable (zebra_ns.c:127)
==38194== by 0x2E9958: zebra_ns_init (zebra_ns.c:214)
==38194== by 0x2B3F82: main (main.c:401)
==38194== Address 0x7fc000967 is on thread 1's stack
==38194== in frame #3, created by if_get_mtu (ioctl.c:155)
==38194==
==38194== Syscall param ioctl(generic) points to uninitialised byte(s)
==38194== at 0x4CDF88A: ioctl (in /lib/libc.so.7)
==38194== by 0x49A4031: vrf_ioctl (vrf.c:860)
==38194== by 0x2AFE29: vrf_if_ioctl (ioctl.c:91)
==38194== by 0x2AFED9: if_get_metric (ioctl.c:143)
==38194== by 0x2B12CB: ifm_read (kernel_socket.c:655)
==38194== by 0x2A7F76: interface_list (if_sysctl.c:129)
==38194== by 0x2E9958: zebra_ns_enable (zebra_ns.c:127)
==38194== by 0x2E9958: zebra_ns_init (zebra_ns.c:214)
==38194== by 0x2B3F82: main (main.c:401)
==38194== Address 0x7fc000967 is on thread 1's stack
==38194== in frame #3, created by if_get_metric (ioctl.c:137)
==38194==
==38194== Syscall param ioctl(generic) points to uninitialised byte(s)
==38194== at 0x4CDF88A: ioctl (in /lib/libc.so.7)
==38194== by 0x49A4031: vrf_ioctl (vrf.c:860)
==38194== by 0x2AFE29: vrf_if_ioctl (ioctl.c:91)
==38194== by 0x2B052D: if_get_flags (ioctl.c:419)
==38194== by 0x2B1CF1: ifam_read (kernel_socket.c:930)
==38194== by 0x2A7F57: interface_list (if_sysctl.c:132)
==38194== by 0x2E9958: zebra_ns_enable (zebra_ns.c:127)
==38194== by 0x2E9958: zebra_ns_init (zebra_ns.c:214)
==38194== by 0x2B3F82: main (main.c:401)
==38194== Address 0x7fc000707 is on thread 1's stack
==38194== in frame #3, created by if_get_flags (ioctl.c:411)
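A minimal sketch of the likely shape of the fix, assuming the request structures are simply zero-initialized before being handed to ioctl() (illustrative, not the exact zebra change):
```c
#include <string.h>
#include <sys/ioctl.h>
#include <net/if.h>

static int if_get_mtu_example(int fd, const char *name, int *mtu)
{
	struct ifreq ifreq;

	/* Zero the whole structure so the kernel never sees stack
	 * garbage in the bytes this code does not explicitly set. */
	memset(&ifreq, 0, sizeof(ifreq));
	strncpy(ifreq.ifr_name, name, sizeof(ifreq.ifr_name) - 1);

	if (ioctl(fd, SIOCGIFMTU, &ifreq) < 0)
		return -1;
	*mtu = ifreq.ifr_mtu;
	return 0;
}
```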
Valgrind is no longer reporting these issues.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When `zebra_evpn_mac_svi_add()` finds a mac via
`zebra_evpn_mac_lookup()` and the found mac does not have the
svi flag, it goes on to create an appropriate mac, which ends up
calling `zebra_evpn_mac_lookup()` a second time. Looking up twice
is redundant.
As an optimization, make sure the lookup happens only once.
Modify `zebra_evpn_mac_gw_macip_add()` to check the `macp`
parameter passed by the caller, so it can tell whether a
lookup is really needed or not.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
When issuing an RTM_DELETE operation and the kernel tells
us that the route is already deleted, let's not complain
about the situation:
2022/03/19 02:40:34 ZEBRA: [EC 100663303] kernel_rtm: 2a10:cc42:1d51::/48: rtm_write() unexpectedly returned -4 for command RTM_DELETE
I can recreate this issue on freebsd by doing this:
a) create a route using sharpd
b) shutdown the nexthop's interface
c) remove the route using sharpd
This would also be true of pretty much any routing protocol's behavior.
Let's just not complain about the situation if an RTM_DELETE
operation is issued and FRR is told that the route does not
exist to delete.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Since `RB_INSERT()` is only called after the element was not found in the RB
tree, it MUST succeed and return zero. Checking the return value of
`RB_INSERT()` is therefore redundant; just remove those checks.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
Commit: 5d41413833 added 3 new dplane ops:
DPLANE_OP_INTF_INSTALL
DPLANE_OP_INTF_UPDATE
DPLANE_OP_INTF_DELETE
The build system does not build lua so zebra_script.c
was not updated. Update of course!
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When encoding a response to the upper level protocol the
prefixlen is not something that needs to be part of the
switch statement for handling of a prefix.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Currently the nexthop tracking code only sends back to the requestor
what it was asked to match against. When the nexthop tracking
code was simplified in b8210849b8 to no longer need both an import check
and a nexthop check for bgpd, it was not
noticed that a longer prefix could match, yet still be reported
as a match, because FRR was not sending up both the resolved
route prefix and the route FRR was asked to match against.
This change causes the nexthop tracking code to pass
back up the requested route that was matched (so that the calling
protocol can figure out which one it is being told about)
as well as the actual prefix that it was matched to.
Fixes: #10766
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
`zvni_map_to_svi_ns()` is used to find and return one specific interface
based on the passed SVI attributes, so the two parameters `in_param` and
`p_ifp` must not be NULL.
Passing a NULL `p_ifp` makes no sense, so the check `if (p_ifp)` is
unnecessary.
Use `assert` to ensure the two parameters, and remove that unnecessary check.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
Upon 'no advertise-all-vni', clean up the l2vni from
its tenant-vrf's l3vni list, instead of from the passed
zvrf->l3vni, which will not be present in the case
of the default instance.
Reviewed By:
Testing Done:
Before Fix:
----------
TORC12(config-router-af)# advertise-all-vni
TORC12(config-router-af)# end
TORC12# show evpn vni 4001
VNI: 4001
Type: L3
Tenant VRF: vrf1
Vxlan-Intf: vni4001
State: Up
Router MAC: 44:38:39:ff:ff:01
L2 VNIs: 134217728 0 1000 1002 <-----
After Fix:
----------
TORC12# show evpn vni 4001
VNI: 4001
Type: L3
Tenant VRF: vrf1
Vxlan-Intf: vni4001
State: Up
Router MAC: 44:38:39:ff:ff:01
L2 VNIs: 1000 1002
Signed-off-by: Chirag Shah <chirag@nvidia.com>
The RMAC now keeps a list of nexthops to track
its existence; remove the (old way) host prefix
mapping.
Ticket: #2798406
Reviewed By:
Testing Done:
TORS1# show evpn rmac vni 4001 mac 44:38:39:ff:ff:01
MAC: 44:38:39:ff:ff:01
Remote VTEP: 36.0.0.11
Refcount: 0
Prefixes:
Signed-off-by: Chirag Shah <chirag@nvidia.com>
Keep the list of remote-vteps/nexthops in
rmac db.
Problem:
In a CLAG deployment there might be a situation
where the CLAG secondary sends an individual ip as nexthop
along with the anycast mac as RMAC. This combination
is updated in zebra's rmac cache.
Upon recovery, the clag secondary sends a withdrawal
of the incorrect rmac and nexthop mapping.
The RMAC entry's mapping to the nh is not cleaned up properly
in the zebra rmac cache.
Fix:
The zebra rmac db needs to maintain a list of nexthops.
When a bgp withdrawal for an rmac-to-nexthop mapping
is received, remove the old nexthop from the rmac's nh
list, and if a host reference still remains for
the RMAC, fall back to the one nexthop remaining in
the list.
At most two nexthops are expected to map to an RMAC
(in a clag deployment).
Ticket: 2798406
Reviewed By:
Testing Done:
CLAG primary and secondary have advertise-pip enabled
advertise type-5 route (default route) with
individual IP as nh and individual svi mac as rmac.
- disable advertise pip on both clag devices, this
results in advertisement of routes with anycast ip as nh
and anycast mac as rmac.
- disable peerlink on clag primary, this triggers
clag secondary to (transitory) send bgp update with
individual ip as nh and anycast mac as rmac.
- At the remote vtep:
Check that zebra's rmac cache/nh mapping is correct
and points to the anycast rmac and anycast ip as the nh of the
clag system.
Signed-off-by: Chirag Shah <chirag@nvidia.com>
Cleanup the logs in the netlink code for setting
protodown on/off to be more useful to a user parsing them
after an issue.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Use the SET/UNSET/CHECK/COND macros for flag bitfields
where appropriate throughout the protodown code base.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Ensure we include the old reason when we are updating the reason
code for an evpn-mh bond member. Now that this is a common API
it could include things external to EVPN in this reason code
bitfield (ex: vrrp).
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Make the static netlink protodown function that checks
whether FRR's bit is the only protodown reason bit set more
easily readable to someone not familiar with the code.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Simplify the code for printing the reason codes via the
show command. Just remove the trailing comma
before printing.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Cleanup the logs in the api for setting protodown on/off
that zapi and others use. Make them more useful to a user parsing
them after an issue.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Avoid initialization in dplane_ctx_intf_init() so
the compiler can warn us about using uninitialized data.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
When we are processing a bond member's protodown we get from
the dataplane, check to make sure we haven't already queued
up a set. If we have, it's likely this is just a notification
we get from the kernel after we set protodown and before we have
processed the result in our dplane pthread.
This change is needed now that we set protodown via the dplane
pthread.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
When setting the protodown reason use the update api
where we can directly update the entire reason bitfield
since we have to set more than one.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Extern the api for setting the protodown reason code
bitfield directly. Some places may want to completely update the
bitfield with more than one reason at a time.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Only clear protodown reason on shutdown/sweep, retain protodown
state.
This is to retain traditional and expected behavior with daemons
like vrrpd setting protodown. They expect it to be set on shutdown
and retained on bring up to prevent traffic from being dropped.
We must clean up our reason code though to prevent us from blocking
others.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Add functionality to clear any reason code set on shutdown
of zebra when we are freeing the interface, in case a bad
client didn't tell us to clear it when it shut down.
Also, in case of a crash or failure to do the above, clear reason
on startup if it is set.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Add enums for set/unset of protodown state so the main thread
knows an update is already queued without actually marking it
as complete.
This makes the logic conform a bit more with other parts of the code
where we queue dplane updates and do not update our internal structs until
the success callback is received.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Add a command for users to set protodown via frr.conf in
case our default conflicts with another application
they are using.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Add support for setting the protodown reason code.
829eb208e8
These patches handle all our netlink code for setting the reason.
For protodown reason we only set `frr` as the reason externally
but internally we have more descriptive reasoning available via
`show interface IFNAME`. The kernel only provides a bitwidth of 32
that all userspace programs have to share so this makes the most sense.
Since this is new functionality, it needs to be added to the dplane
pthread instead. So these patches also move the protodown setting we
were doing before into the dplane pthread. For this, we abstract it a
bit more to make it a general interface LINK update dplane API. This
API can be expanded to support general link creation/updating when/if
someone ever adds that code.
We also provide a more common entrypoint for evpn-mh and for zapi clients
like vrrpd. They both call common code now to set our internal flags
for protodown and protodown reason.
Also add debugging code for dumping netlink packets with
protodown/protodown_reason.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
The constraints on host routes are too strict in the current code:
host routes with the same destination address and nexthop address are forbidden
even when they cross VRFs.
Currently host routes with different destination and nexthop addresses can cross
VRFs, which is fine. But host routes with the same addresses are forbidden from
crossing VRFs, which is wrong.
Since different VRFs can have the same addresses, leaking a specific host route
whose nexthop address equals its destination address into other VRFs is a
normal case.
This commit relaxes that constraint: host routes with the same destination
address and nexthop address are forbidden only when they do not cross VRFs.
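A rough sketch of the relaxed condition (the helper name is hypothetical; only the shape of the check is the point):
```c
#include <stdbool.h>
#include "lib/prefix.h"
#include "lib/nexthop.h"

/* hypothetical helper, not a real FRR symbol */
static bool prefix_addr_equals_nexthop(const struct prefix *p,
				       const struct nexthop *nh);

/* A host route whose destination equals its nexthop is only rejected
 * when both live in the same VRF; leaking such a route into another
 * VRF is a legitimate configuration. */
static bool host_route_forbidden(const struct prefix *p,
				 const struct nexthop *nh, vrf_id_t re_vrf_id)
{
	bool is_host = (p->family == AF_INET &&
			p->prefixlen == IPV4_MAX_BITLEN) ||
		       (p->family == AF_INET6 &&
			p->prefixlen == IPV6_MAX_BITLEN);

	return is_host && prefix_addr_equals_nexthop(p, nh) &&
	       nh->vrf_id == re_vrf_id;
}
```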
Signed-off-by: anlan_cs <vic.lan@pica8.com>
When an interface goes down, it signals any related NHGs to
re-validate themselves. During zebra shutdown, ensure we remove
any NHGs we've installed.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
In `zebra_evpn_neigh_gw_macip_add()`, it sets `mac->flags` to "ZEBRA_MAC_DEF_GW"
for "advertise-default-gw" mode. But this set is redundant because this "mac"
is already set by `zebra_evpn_mac_gw_macip_add()`.
So remove this redundant assignment.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
In the loop, the local variable `ip` is always set even if the check condition
is not satisfied.
Avoid the redundant assignment by moving it to just after the check condition
is satisfied, so `ip` is set only when it is actually needed.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
A zebra crash is seen during shutdown (frr restart).
During shutdown, remote neigh and remote mac cleanup
is triggered first, followed by the per-vni cleanup of all neighs
(including local) and macs.
The crash occurs when a remote mac is cleaned up first
while a reference to it remains in a local neigh.
When the local neigh attempts to remove itself from its associated
mac's neigh_list, it accesses freed memory and crashes.
The fix: during mac deletion, if its neigh_list is non-empty
then retain the MAC in AUTO state.
This can arise when the MAC and neigh pair are in different states
(remote/local); otherwise the order of cleanup operations
is neighs followed by macs.
The auto mac will be cleaned up later when all of the vni's neighs and macs
are cleaned up.
Ticket:CM-29826
Reviewed By:CCR-10369
Testing Done:
Configure evpn symmetric config where
MAC is in remote state and neigh is in local state.
Perform frr restart then crash is not seen.
Signed-off-by: Chirag Shah <chirag@nvidia.com>
With recent changes to interface up mechanics in if_netlink.c
FRR was receiving as many as 4 up events for an interface
on ifdown/ifup events. This was causing timing issues
in FRR based upon some fun timings. Remove this from
happening.
Ticket: CM-31623
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
On a vxlan interface flap (protodown/up) event,
non-ptm operative interfaces do not come up,
as the protodown-up event does not trigger the "if_up()"
handler.
Ticket:CM-30477
Reviewed By:CCR-10681
Testing Done:
Validated interface flaps, ip link down, ifdown
and protodown followed by an UP event. All vxlan interfaces
come up in bgpd post flap.
Signed-off-by: Chirag Shah <chirag@nvidia.com>
FRR needs to handle the protodown event for vxlan
interfaces.
In an MLAG scenario, one switch of the pair can put a
vxlan port into protodown state, followed by a
tunnel-ip change from the anycast IP to the individual IP.
In the absence of protodown handling, evpn ends up
advertising locally learned EVPN (MAC-IP) routes
with the individual IP as nexthop.
This leads to locally learned entries being overwritten
as remote on the MLAG pair.
Ticket:CM-24545
Reviewed By:CCR-10310
Testing Done:
In an EVPN deployment, restart one of the MLAG
daemons, which puts the vxlan interfaces into protodown state.
FRR treats protodown as oper down for vxlan interfaces;
VNI down cleans up/withdraws locally learned routes.
On the following vxlan device UP event, locally learned
routes are re-advertised.
Signed-off-by: Chirag Shah <chirag@nvidia.com>
When an end operator is doing cross vrf imports in bgp:
router bgp 3239 vrf FOO
address-family ipv4 uni
import vrf BAR
!
and zebra has this configuration:
vrf FOO
ip protocol bgp route-map EVA
!
The current code in zebra_nhg.c was looking up the vrf of the
nexthop and attempting to apply the ip protocol route-map.
For most people the nexthop vrf and the re vrf are one and the
same so they never see a problem.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Upon restart zebra reads in the kernel state. Under linux
there is a mechanism to read the route and convert the protocol
to the correct internal FRR protocol to allow the zebra graceful
restart efforts to work properly.
Under *BSD I do not see a mechanism to convey the original FRR
protocol into the kernel and thus back out of it. Thus when
zebra crashes (or restarts) the routes read back in are kernel
routes and are effectively lost to the system and FRR cannot
remove them properly. Why? Because FRR sees kernel routes
as routes that it should not own and in general the admin
distance for those routes will be a better one than the
admin distance from a routing protocol. This is even
worse because when the graceful restart timer pops and rib_sweep
is run, FRR becomes out of sync with the state of the kernel forwarding
on *BSD.
On restart, notice that the route is a self route whose
originating protocol there is no way to know. In this case
let's set the protocol to ZEBRA_ROUTE_STATIC and set the admin
distance to 255.
This way when an upper level protocol reinstalls its route
the general zebra graceful restart code still works. The
high admin distance allows the code to just work in a way
that is graceful( HA! )
The drawback here is that the route shows up as a static
route for the time the system is doing it's work. FRR
could introduce *another* route type but this seems like
a bad idea and the STATIC route type is loosely analogous
to the type of route it has become.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
FRR will crash when the re->type is a ZEBRA_ROUTE_ALL and it
is inserted into the meta-queue. Let's just put some basic
code in place to prevent a crash from happening. No routing
protocol should be using ZEBRA_ROUTE_ALL as a value but
bugs do happen. Let's just accept the weird route type
gracefully and move on.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
There exist some interface types that are slow on startup
to fully register their link speed, especially those that
are working with an asic backend. The speed_update timer
associated with each interface would keep trying if the
system returned a MAX_UINT32 as the speed, which under linux
means either unknown or no speed at all.
Since some interface types are slow on startup let's modify
FRR to try for at most 4 minutes and give up trying on those
interfaces where we never get any useful data.
Why 4 minutes? I wanted to balance the time associated with
slow interfaces coming up with those that will never give us
a value. So I chose 4 minutes as a good ballpark of time
to keep trying.
Why not track all those interfaces and just not attempt to
do the speed lookup? I would prefer to not keep track of these
as I do not know all the interface types, nor do I wish
to keep programming as new ones come in.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Look up linked interface in the correct netns, otherwise, either a wrong
interface or NULL would be used.
For example, enable VRF netns backend, and:
ip netns add ns1
ip link add link eth0 link1 type macvlan
ip link set link1 netns ns1 up
Zebra will crash in zebra_vxlan_macvlan_up because zif->link is NULL.
Signed-off-by: Xiao Liang <shaw.leon@gmail.com>
An end operator is reporting that they are receiving buffer overruns
when attempting to read from the kernel receive socket. It is
possible to adjust this size to more modern levels, especially
for when the system is under load. Modify the code base
so that *BSD operators can use the zebra `-s XXX` option
to specify a read buffer size.
Additionally set the default receive buffer size on *BSD
to 128k instead of 8k so that FRR does not run into
this issue again.
Fixes: #10666
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Use the dataplane to query and read interface NETCONF data;
add netconf-oriented data to the dplane context object, and
add accessors for it. Add handler for incoming update
processing.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
Allow self-produced xxxNETCONF netlink messages through the BPF
filter we use. Just like address-configuration actions, we'll
process NETCONF changes in one path, whether the changes were
generated by zebra or by something else in the host OS.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
a) We'll need to pass the info up via some dataplane control method
(This way bsd and linux can both be zebra agnostic of each other)
b) We'll need to modify `struct interface *` to track this data
and when it changes to notify upper level protocols about it.
c) Work is needed to dump the entire mpls state at the start
so we can gather interface state. This should be done
after interface data gathering from the kernel.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
Description:
===========
Change is intended to fix the NHT resolution logic:
while recursively resolving a nexthop, keep looking for a valid/usable route
in the rib instead of stopping at the first/most-specific route.
Consider the following set of events taking place on R1:
R1(config)# ip route 2.2.2.0/24 ens192
R1# sharp watch nexthop 2.2.2.32 connected
R1# show ip nht
2.2.2.32(Connected)
resolved via static
is directly connected, ens192
Client list: sharp(fd 33)
-2.2.2.32 NHT is resolved over the above valid static route.
R1# sharp install routes 2.2.2.32 nexthop 2.2.2.32 1
R1# 2.2.2.32(Connected)
resolved via static
is directly connected, ens192
Client list: sharp(fd 33)
-.32/32 comes which is going to resolve through itself, but since this is an invalid route,
it will be marked as inactive and will not affect the NHT.
R1# sharp install routes 2.2.2.31 nexthop 2.2.2.32 1
R1# 2.2.2.32(Connected)
unresolved(Connected)
Client list: sharp(fd 50)
-Now a .31/32 comes which will resolve over .32 route, but as per the current logic,
this will trigger the NHT check, in turn making the NHT unresolved.
-With fix, NHT should stay in resolved state as long as the valid static or connected route stays installed
Fix:
====
-While resolving nexthops, walk up the tree from the most-specific match
without any ZEBRA_NHT_CONNECTED check.
Co-authored-by: Vishal Dhingra <vdhingra@vmware.com>
Co-authored-by: Kantesh Mundaragi <kmundaragi@vmware.com>
Signed-off-by: Iqra Siddiqui <imujeebsiddi@vmware.com>
Two minor changes:
1) Change the return type of `zebra_evpn_mac_gw_macip_add()` to `void`.
2) Since `zebra_evpn_mac_gw_macip_add()` already `assert`s the returned
`mac`, checking its return value makes no sense. It is also more reasonable
to set `mac->flags` inside `zebra_evpn_mac_gw_macip_add()`, so
just move that flag setting into `zebra_evpn_mac_gw_macip_add()`.
Signed-off-by: anlan_cs <vic.lan@pica8.com>
The recently-added hashtable of nlsock objects needs to be
thread-safe: it's accessed from the main and dplane pthreads.
Add a mutex for it, use wrapper apis when accessing it. Add
a per-OS init/terminate api so we can do init that's not
per-vrf or per-namespace.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
NetDEF CI has been whining about multiline string style.
Make the strings single-line and call it a day.
Signed-off-by: Trey Aspelund <taspelund@nvidia.com>
Making multiple ioctl calls on the same ifreq will result in
ifreq being overwritten with garbage data. In if_get_flags,
keep the flags field safe from another possible ioctl
call before applying it.
Modified code as per code review done by Donald Sharp.
Signed-off-by: Bijan <bijanebrahimi@riseup.net>
Currently when the kernel sends netlink messages to FRR
the buffers to receive this data are of fixed length.
The kernel, with certain configurations, will send
netlink messages that are larger than this fixed length.
This leads to situations where, on startup, zebra gets
really confused about the state of the kernel. Effectively
the current algorithm is this:
read up to buffer in size
while (data to parse)
get netlink message header, look at size
parse if you can
The problem is that there is a 32k buffer we read into.
We get the first message that is, say, 1k in size, and
subtract that 1k, leaving 31k left to parse. We then
get the next header and notice that the length
of the message is 33k, which is obviously larger
than what we read in. FRR has no recovery mechanism,
nor is there a way to know, a priori, the maximum
size the kernel will send us.
Modify FRR to look at the kernel message and see if the
buffer is large enough, if not, make it large enough to
read in the message.
This code has to be per netlink socket because of the usage
of pthreads. So add to `struct nlsock` the buffer and current
buffer length. Growing it as necessary.
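A rough sketch of the general approach, using a recv() peek with MSG_TRUNC to learn how large the pending message really is before reading it (illustrative only; the real change lives in the `struct nlsock` handling in kernel_netlink.c):
```c
#include <stdlib.h>
#include <sys/types.h>
#include <sys/socket.h>

/* Illustrative per-socket receive buffer, grown on demand. */
struct nl_buf {
	void *buf;
	size_t buflen;
};

static ssize_t nl_recv_grow(int fd, struct nl_buf *nl)
{
	/* With MSG_PEEK | MSG_TRUNC the return value is the full length
	 * of the pending message, even if it does not fit in the buffer. */
	ssize_t need = recv(fd, nl->buf, nl->buflen, MSG_PEEK | MSG_TRUNC);

	if (need < 0)
		return need;
	if ((size_t)need > nl->buflen) {
		void *tmp = realloc(nl->buf, need);

		if (!tmp)
			return -1;
		nl->buf = tmp;
		nl->buflen = need;
	}
	return recv(fd, nl->buf, nl->buflen, 0);
}
```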
Fixes: #10404
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Store the fd that corresponds to the appropriate `struct nlsock` and pass
that around in the dplane context instead of the pointer to the nlsock.
Modify the kernel_netlink.c code to store in a hash the `struct nlsock`
with the socket fd as the key.
Why do this? The dataplane context is used to pass around the `struct nlsock`
but the zebra code has a bug where the receive buffer for netlink
messages from the kernel is not big enough. So we need to dynamically
grow the receive buffer per socket, instead of having a non-dynamic buffer
that we read into. By passing around the fd we can look up the `struct nlsock`
that will soon have the associated buffer and not have to worry about `const`
issues that will arise.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Store and use the sequence number instead of using what is in
the `struct nlsock`. Future commits are going away from storing
the `struct nlsock` and the copy of the nlsock was guaranteeing
unique sequence numbers per message. So let's store the
sequence number to use instead.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Using memcmp is wrong because struct ipaddr may contain uninitialized
padding bytes that should not be compared.
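A sketch of a field-aware comparison, assuming FRR's `struct ipaddr` with its `ipa_type` discriminator (illustrative; not necessarily the exact committed fix):
```c
#include <stdbool.h>
#include "lib/ipaddr.h"
#include "lib/prefix.h"

/* Compare the type first, then only the address bytes that are
 * meaningful for that type, so uninitialized union padding in
 * struct ipaddr is never examined. */
static bool ipaddr_same(const struct ipaddr *a, const struct ipaddr *b)
{
	if (a->ipa_type != b->ipa_type)
		return false;

	switch (a->ipa_type) {
	case IPADDR_V4:
		return IPV4_ADDR_SAME(&a->ipaddr_v4, &b->ipaddr_v4);
	case IPADDR_V6:
		return IPV6_ADDR_SAME(&a->ipaddr_v6, &b->ipaddr_v6);
	default:
		return true;	/* IPADDR_NONE: nothing else to compare */
	}
}
```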
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
When using wait-for-install there exist situations where
zebra will issue several route change operations to the kernel
but end up in a state we shouldn't be in,
due to extra data being received. Example:
a) zebra receives a route change from bgp, installs it and sends the
route to the kernel.
b) zebra receives a route deletion from bgp, removes the
struct route entry and then sends to the kernel a deletion.
c) zebra receives an asynchronous notification that (a) succeeded
but we treat this as a new route.
This is the ships in the night problem. In this case if we receive
notification from the kernel about a route that we know nothing
about and we are not in startup and we are doing asic offload
then we can ignore this update.
Ticket: #2563300
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The ctx->zd_is_update is being set in various
spots based upon the same value that we are
passing into dplane_ctx_ns_init. Let's just
consolidate all this into dplane_ctx_ns_init
so that the zd_is_update value is set at the
same time that we increment the sequence numbers
to use.
As a note for future me's reading this: the sequence
number used for the seq number passed to the
kernel works because each context gets a copy of the
appropriate nlsock to use. Since it's a copy
at a point in time, we know we have a unique sequence
number value.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When nl_batch_read_resp gets a full-on failure (-1) or an implicit
ack (0) from the kernel for a batch, let's immediately
mark everything in the batch pass/fail as needed, instead
of having it marked elsewhere.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
I'm seeing this crash in various forms:
Program terminated with signal SIGSEGV, Segmentation fault.
50 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.
[Current thread is 1 (Thread 0x7f418efbc7c0 (LWP 3580253))]
(gdb) bt
(gdb) f 4
267 (*func)(hb, arg);
(gdb) p hb
$1 = (struct hash_bucket *) 0x558cdaafb250
(gdb) p *hb
$2 = {len = 0, next = 0x0, key = 0, data = 0x0}
(gdb)
I've also seen a crash where data is 0x03.
My suspicion is that hash_iterate is calling zebra_nhg_sweep_entry which
does delete the particular entry we are looking at as well as possibly other
entries when the ref count for those entries gets set to 0 as well.
Then we have this loop in hash_iterate.c:
for (i = 0; i < hash->size; i++)
for (hb = hash->index[i]; hb; hb = hbnext) {
/* get pointer to next hash bucket here, in case (*func)
* decides to delete hb by calling hash_release
*/
hbnext = hb->next;
(*func)(hb, arg);
}
Suppose in the previous loop hbnext is set to hb->next and we call
zebra_nhg_sweep_entry. This deletes the previous entry and also
happens to cause the hbnext entry to be deleted as well, because of nhg
refcounts. At this point in time the memory pointed to by hbnext is
not owned by the pthread anymore, and we can end up in a state where
it's overwritten by another pthread in zebra with data for other incoming events.
What to do? Let's change the sweep function to a hash_walk and have
it stop iterating and start over if there is a possible double
delete operation.
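A rough sketch of the reworked sweep, using FRR's `hash_walk()` and its HASHWALK_ABORT convention (the zebra-specific pieces are illustrative):
```c
#include <stdbool.h>
#include "lib/hash.h"

/* illustrative: true if the entry was released (possibly with others) */
static bool nhg_sweep_entry_if_unused(void *data);

static int nhg_sweep_walker(struct hash_bucket *bucket, void *arg)
{
	bool *swept = arg;

	/* Releasing an entry can recursively free other buckets via nhg
	 * refcounts, so abort the walk and let the caller start over. */
	if (nhg_sweep_entry_if_unused(bucket->data)) {
		*swept = true;
		return HASHWALK_ABORT;
	}
	return HASHWALK_CONTINUE;
}

static void nhg_sweep(struct hash *nhgs)
{
	bool swept;

	do {
		swept = false;
		hash_walk(nhgs, nhg_sweep_walker, &swept);
	} while (swept);
}
```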
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add to `show zebra` whether or not RA is compiled into FRR
and whether or not BGP is using RFC 5549 at the moment.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The dataplane pthread only processes a limited set of incoming
netlink notifications: only register for that set of events,
reducing duplicate incoming netlink messages.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
Current code treats all metaqueues as lists of route_node structures.
However, some queues contain other structures that need to be cleaned up
differently. Casting the elements of those queues to struct route_node
and dereferencing them leads to a crash. The crash may be seen when
executing bgp_multi_vrf_topo2.
Fix the code by using the proper list element types.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
The name 'opaque' is a little general - call the route_entry
struct 're_opaque' to make it more specific.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
RA packets are pretty chatty, and when there is a warning from
a misconfiguration on the network, the log file gets filled
up with warnings. Modify the code in rtadv.c to only spit
out the warning in these cases at most every 6 hours.
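A minimal sketch of the kind of rate limiting described (the function name is illustrative; `monotime()` is FRR's monotonic clock helper):
```c
#include <stdbool.h>
#include <time.h>
#include "lib/monotime.h"

#define RTADV_WARN_INTERVAL (6 * 60 * 60)	/* at most one warning / 6h */

static bool rtadv_warning_allowed(void)
{
	static time_t last_warning;
	time_t now = monotime(NULL);

	/* Suppress the warning if one was already logged recently. */
	if (last_warning && now - last_warning < RTADV_WARN_INTERVAL)
		return false;
	last_warning = now;
	return true;
}
```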
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
EVPN route add should be queued to preserve the config order.
In particular, against deletion in rib_delete().
Signed-off-by: Xiao Liang <shaw.leon@gmail.com>
When an operator has an FRR-based route installed into the
FIB and a better route comes in from the system, there
is code in the data plane to schedule the batching
and continue processing. But in this case we are done,
so we can just return.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
vrf_disable is always called before
vrf_delete. The rnh_table and rnh_table_multicast tables
are already deleted as part of vrf_disable. No need
to do it again.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
VRF name should not be printed in the config since 574445ec. The update
was done for NB config output but I missed it for regular vty output.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Add a thread_ignore_late_timer(struct thread *thread) function
that allows thread.c to ignore when timers are late to the party.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Pass in the route_node that is under consideration
into route_notify_internal to allow calling functions
to reduce stack size as well as looking up data.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The dest_pfx was pretty much only ever used for
debug output and FRR already knows the rn. So
use that instead.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The dest_p and src_p values were only ever used for
debugs and %pFX, when we already have the rn.
There is no need to do this lookup.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
FRR should give parameter names in prototypes instead of leaving
them out in the .h file. This just cleans up that
problem for redistribute.h.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The function zsend_redistribute_route uses the prefix and
source prefix. Just pass in the route_node instead.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
FRR is passing around a bunch of data that is encapsulated
within the route node. Let's just pass that around instead.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
FRR is passing around a bunch of data that is encapsulated
within the route node. Let's just pass that around instead.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
If you have this setup:
router ospf 3
redistribute sharp
!
and then install:
sharp install route 4.5.6.7 nexthop 192.168.100.1 1
sharp install route 4.5.6.8 nexthop 192.168.100.1 1 instance 3
sharp install route 4.5.6.9 nexthop 192.168.100.1 1 instance 4
The .8 and .9 routes are auto redistributed into ospf instance 3:
eva# show ip ospf data
OSPF Instance: 3
OSPF Router with ID (192.168.122.1)
AS External Link States
Link ID ADV Router Age Seq# CkSum Route
4.5.6.7 192.168.122.1 13 0x80000001 0x477c E2 4.5.6.7/32 [0x0]
4.5.6.8 192.168.122.1 5 0x80000001 0x3d85 E2 4.5.6.8/32 [0x0]
4.5.6.9 192.168.122.1 5 0x80000001 0x338e E2 4.5.6.9/32 [0x0]
This cannot be correct behavior. When redistributing in the absence
of an instance number, the default instance of 0 should be used and its
route should be the only one redistributed. Here is the correct behavior:
eva# show ip route
Codes: K - kernel route, C - connected, S - static, R - RIP,
O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP,
T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP,
F - PBR, f - OpenFabric,
> - selected route, * - FIB route, q - queued, r - rejected, b - backup
t - trapped, o - offload failure
K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:00:28
D>* 4.5.6.7/32 [150/0] via 192.168.100.1, virbr1, weight 1, 00:00:02
D[3]>* 4.5.6.8/32 [150/0] via 192.168.100.1, virbr1, weight 1, 00:00:02
D[4]>* 4.5.6.9/32 [150/0] via 192.168.100.1, virbr1, weight 1, 00:00:02
C>* 192.168.100.0/24 is directly connected, virbr1, 00:00:28
C>* 192.168.110.0/24 is directly connected, virbr2, 00:00:28
C>* 192.168.119.0/24 is directly connected, enp39s0, 00:00:28
C>* 192.168.122.0/24 is directly connected, virbr0, 00:00:28
eva# show ip ospf data
OSPF Instance: 3
OSPF Router with ID (192.168.122.1)
AS External Link States
Link ID ADV Router Age Seq# CkSum Route
4.5.6.7 192.168.122.1 6 0x80000001 0x477c E2 4.5.6.7/32 [0x0]
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
FRR allows redistribution to a client with a specific
instance in mind. The code was not allowing you to figure
out what instance was being looked at. So let's clarify this
in the debugs.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Update ospfd and ospf6d to send opaque route attributes to
zebra. Those attributes are stored in the RIB and can be viewed
using the "show ip[v6] route" commands (other than that, they are
completely ignored by zebra).
Example:
```
debian# show ip route 192.168.1.0/24
Routing entry for 192.168.1.0/24
Known via "ospf", distance 110, metric 20, best
Last update 01:57:08 ago
* 10.0.1.2, via eth-rt2, weight 1
OSPF path type : External-2
OSPF tag : 0
debian#
debian# show ip route 192.168.1.0/24 json
{
"192.168.1.0\/24":[
{
"prefix":"192.168.1.0\/24",
"prefixLen":24,
"protocol":"ospf",
"vrfId":0,
"vrfName":"default",
"selected":true,
[snip]
"ospfPathType":"External-2",
"ospfTag":"0"
}
]
}
```
Signed-off-by: Renato Westphal <renato@opensourcerouting.org>
Topology:
IXIA-----(ens192)FRR(ens224)------iXIA
Configuration:
1. Create 8 sub-interfaces on ens192 under Default VRF and configure 8
EBGP session between FRR and IXIA.
2. Create 1000 sub-interfaces on ens224 under Default VRF and configure
1000 EBGP session between FRR and IXIA.
3. 2M prefixes distributed from Left side Ixia each with 8 ECMP path.
4. So in total, there are 2M prefixes * 8 ECMP = 16M prefixes entries
in RIB and FIB.
Issue:
Shut ens192 and ens224, this is taking 1hr 15 mins to clean up the routes.
Root Cause:
In the case of route deletion, if the particular route node has an
nht count = 0, we still go to the parent and do an nht evaluation,
which is not needed.
Fix:
If the deleted route node has an nht count > 0, then do an nht
evaluation on the parent node.
Shut ens192 and ens224, it is taking 1 min to clean up the routes
with the fix.
Signed-off-by: Sarita Patra <saritap@vmware.com>
Used for graceful-restart mostly.
Especially for bgp_show_neighbor_graceful_restart_capability_per_afi_safi()
Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>
This function is guaranteed by its `assert` to return a correct value, so
checking its return value should be removed.
Signed-off-by: anlan_cs <anlan_cs@tom.com>
Currently, it is possible to rename the default VRF either by passing
`-o` option to zebra or by creating a file in `/var/run/netns` and
binding it to `/proc/self/ns/net`.
In both cases, only zebra knows about the rename and other daemons learn
about it only after they connect to zebra. This is a problem, because
daemons may read their config before they connect to zebra. To handle
this rename after the config is read, we have some special code in every
single daemon, which is not very bad but not desirable in my opinion.
But things are getting worse when we need to handle this in northbound
layer as we have to manually rewrite the config nodes. This approach is
already hacky, but still works as every daemon handles its own NB
structures. But it is completely incompatible with the central
management daemon architecture we are aiming for, as mgmtd doesn't even
have a connection with zebra to learn from it. And it shouldn't have it,
because operational state changes should never affect configuration.
To solve the problem and simplify the code, I propose to expand the `-o`
option to all daemons. By using the startup option, we let daemons know
about the rename before they read their configs so we don't need any
special code to deal with it. There's an easy way to pass the option to
all daemons by using `frr_global_options` variable.
Unfortunately, the second way of renaming by creating a file in
`/var/run/netns` is incompatible with the new mgmtd architecture.
Theoretically, we could force daemons to read their configs only after
they connect to zebra, but it means adding even more code to handle a
very specific use-case. And anyway this won't work for mgmtd as it
doesn't have a connection with zebra. So I had to remove this option.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Add optional NHG ID output to `show ip route` dumps. We have
this in json output already as nexthopGroupID but nice
to have the option in a normal dump as well. Not including in main
output for now to avoid breaking screen scrapers.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
On startup we create a thread timer event to do a rib sweep
of the system. On shutdown we never stopped this timer and
as such we have a situation where a thread event could be run
on shutdown after the data for it has been freed. Here is the
crash I am seeing:
(gdb) bt
(gdb)
Save the thread data in zebra_router and stop the thread so we don't
accidentally do work on shutdown we don't mean to. In this case
it happened in our topotests with some severe system load.
Essentially we happened to kill the zebra daemon just as the
graceful_restart timer popped here.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
In some cases, zebra may install a nexthop-group id that is
different from the id of the nhe struct attached to a
route-entry. This happens for a singleton recursive nexthop,
for example, where a route is installed with the resolving
nexthop's id.
The installed value is the most useful value - that corresponds
to information in the kernel on linux/netlink platforms that
support nhgs. Display both values if they differ in ascii
output, and include both values in the json form.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
Since f60a1188 we store a pointer to the VRF in the interface structure.
There's no need anymore to store a separate vrf_id field.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
During zebra shutdown, we clear out the LSP workqueue. The LSPs
will be uninstalled and freed during the shutdown process, so
just ignore any LSPs that happen to be on the workqueue.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
At some scale we eventually run out of room displaying v4/v6 route
totals for `show zebra client summ`:
janelle# show zebra client summ
Name Connect Time Last Read Last Write IPv4 Routes IPv6 Routes
--------------------------------------------------------------------------------
bgp 04w0d18h 00:00:19 00:01:2411729127/4052681 2037786/903094
This total over ran the space in just a little over a week of uptime.
Expand to have a bit more room.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The dplane_ctx_get_pbr_ipset_entry function only
failed when the caller did not pass in a valid
usable pointer. Change the code to assert on
a pointer not being passed in and remove the
bool return
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The only time this function ever failed is when
the developer does not pass in a usable pointer
to place the data in. Change it to an assert
to signify to the end developer that is what
we want and then remove all the if checks
for failure
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The function call dplane_ctx_get_pbr_ipset only
returns false when the calling function fails to
pass in a valid ipset pointer. This should
be an assertion issue since it's a programming
issue as opposed to an actual run time issue.
Change the function signature to not return
a bool for success/fail, making this a compile-time decision.
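For illustration only (hypothetical names, not the actual dplane API), the pattern across these changes looks roughly like this:
```
/* Sketch: hypothetical context type, not the real zebra dplane code. */
#include <assert.h>

struct ctx { int ipset_kind; };

/* Before: returned false when the caller passed a NULL destination.
 * After: a NULL destination is a programming error, so assert on it
 * and return nothing; callers no longer check a bool result. */
static void ctx_get_ipset_kind(const struct ctx *ctx, int *kind)
{
	assert(ctx && kind);	/* caller bug, not a runtime condition */
	*kind = ctx->ipset_kind;
}
```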
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
We should always treat the VRF interface as a loopback. Currently, this
is not the case, because in some old pre-VRF code we use if_is_loopback
instead of if_is_loopback_or_vrf. To avoid any future problems, the
proposal is to rename if_is_loopback_or_vrf to if_is_loopback and use it
everywhere. if_is_loopback is renamed to if_is_loopback_exact in case
it's ever needed, but currently it's not used anywhere.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
During shutdown, when table_manager_disable is called for the default
VRF, its vrf_id is already set to VRF_UNKNOWN, so the expression is true
and the table manager memory is not freed. Change the expression to
compare the VRF name instead of the id. The check in table_manager_enable
is changed for consistency.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Free the LSP workqueue later during shutdown, so that zebra
has enough time to clean up and uninstall any LSPs.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
42d4b30e introduced per-VRF table manager.
Table manager is allocated when the VRF is created, but it is freed when
the VRF is disabled. When this VRF is re-enabled, zebra ends up with
table manager being NULL pointer and it crashes on any dereference.
Table manager should be freed when the VRF is deleted, not when it's
disabled.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
We don't receive interface down/delete notifications from kernel when a
netns is deleted. Therefore we have to manually replicate the necessary
actions, otherwise interfaces are kept in the system with stale pointers
to the deleted netns.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Problem:
An L2-VNI SVI down followed by the L2-VNI's vxlan device
deletion leads to a stale entry in the L3VNI's
L2-VNI list.
Solution:
When the L2-VNI's associated SVI is down, the default vrf
becomes the new tenant vrf.
Remove the L2-VNI from the L3VNI's l2vni list, as the
L3VNI/VRF is no longer valid in the absence of the associated
SVI.
When the SVI is up, re-add the L2-VNI into the associated VRF's
L3VNI.
The above remove/add from the L3VNI's L2VNI list is
already done when the vxlan device or L2-VNI is flapped; we just
need to handle when the SVI is flapped.
Ticket:#2817127
Reviewed By:
Testing Done:
After deleting the SVI followed by L2-VNI deletion,
the L3VNI's L2-VNI list deletes the L2-VNI (no stale entry).
After adding back the SVI/L2-VNI, the L3VNI list adds back the
L2-VNI and it is associated with the right tenant VRF.
Signed-off-by: Chirag Shah <chirag@nvidia.com>
In rtadv_timer(), the zvrf's socket is always used to send RA
packets. In vrf-lite mode this is right, since the default vrf is
used to send the RA packets. But in netns mode, a socket is used
in each netns. So the issue only happens in netns mode, because
the zvrf's socket may not be in the same netns as the interface's
netns. In order to be compatible with both vrf-lite and netns mode,
the fix uses if_lookup_by_index() to check whether interfaces
can use the zvrf's socket.
Signed-off-by: LEI BAO <bali.baolei@cn.ibm.com>
Before 42d4b30e, table_manager_enable was called only once and the hook
was also registered once. After the change, the hook is registered per
each VRF that is created in the system. This is wrong.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Currently the NEXTHOP_TYPE_IPV4 and NEXTHOP_TYPE_IPV6 are
not sending up the resolved ifindex for the route. This
is causing upper level protocols that have something like
this:
route-map FOO permit 10
match interface swp13
!
router ospf
redistribute static
!
ip route 4.5.6.7/32 10.10.10.10
where 10.10.10.10 resolves to interface swp13. The route-map
will never match in this case.
Since FRR has the resolved nexthop interface, FRR might as
well send it up to be selected on by the upper level protocol
as needed.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
It appears that without that change, there were no notifications
sent to bgp daemon, after flowspec operations have been sent to
zebra.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
The ipset entry needs to know which address family it applies to.
The family is present in the original ipset structure but was not
passed as an attribute in the dataplane ipset_info structure. Add
it.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
When injecting an ipset entry into the zebra dataplane context, the
ipset name is stored in a separate structure. This will permit the
flowspec plugin to be able to know which ipset has to be appended with
relevant ipset entry.
The problem was that the zebra dataplane object related to ipset entries
was made up of a union of the ipset structure and the ipset info
structure. This implied that the two structures occupied the same
memory, and when extracting the stored data, the data was incomplete.
Fix this by replacing the union with a plain struct.
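As a rough illustration of the fix (field names here are made up, not the real dplane context layout): with a union the two objects share storage, so filling in one clobbers the other, while a struct keeps both intact.
```
/* Sketch only: illustrative fields, not the actual zebra dplane types. */
struct ipset { char name[32]; int type; };
struct ipset_info { char ipset_name[32]; int family; };

/* Before (sketch): both objects overlapped in memory, so writing the
 * entry info corrupted the ipset data read back later. */
union dplane_ipset_union {
	struct ipset set;
	struct ipset_info info;
};

/* After (sketch): a plain struct keeps the two objects side by side. */
struct dplane_ipset_struct {
	struct ipset set;
	struct ipset_info info;
};
```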
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
When the netns is deleted, we should always clear the vrf->ns_ctxt
pointer. Currently, it is not cleared when there are interfaces in the
netns at the time of deletion.
If the netns is re-created, zebra crashes because it tries to use the
stale pointer.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
if_lookup_by_index_all_vrf doesn't work correctly with netns VRF backend
as the same index may be used in multiple netns simultaneously.
In both cases where it's used, we know the VRF in which we need to look
for the interface.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
The kernel can return to us nested attributes for BRIDGE RTM_NEWNEIGH
attributes. Just ensure that we can parse and read them.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
With the addition of resilient hashing for nexthops, the
parsing of nexthops requires telling the decoder functions
that there may be nested attributes. This was found by
code inspection of iproute2/ipnexthop.c when trying to
understand resilient hashing as well as statistics
gathering for nexthops that are / will be in upstream
kernels in the near future.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add actual recent nexthop.h file from kernel
and fix up resulting fallout because FRR's
original nexthop.h did not match upstream
linux kernel.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When GRE information could not be retrieved because the GRE interface
has been deleted, a GRE_UPDATE message may be sent to NHRP. In that
case, the gre values are reset. The tunnel destination value was
missing from this reset; it had been omitted.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
There is a bit of an impedance mismatch in the sequence of events here.
Depending on the dplane behavior, the `ROUTE_ENTRY_SELECTED` bit will be
inconsistent for rib_process_result().
With an asynchronous dataplane:
0. rib_process() is called
1. rib_install_kernel() is called, dplane action is queued
2. rib_install_kernel() returns
3. rib_process() sets the SELECTED bit appropriately, returns
4. dplane is done, triggers rib_process_result()
5. SELECTED bit is seen in "after" state
(5a. NHT code looks at the SELECTED bit, works correctly.)
With a synchronous dataplane:
0. rib_process() is called
1. rib_install_kernel() is called, dplane action is executed
2. dplane (should) trigger rib_process_result()
3. SELECTED bit is seen in "before" state
(3a. NHT code looks at the SELECTED bit, fails.)
4. rib_install_kernel() returns
5. rib_process() sets the SELECTED bit appropriately, too late.
Essentially, poking the dataplane is a sequencing point where control is
handed over to the dplane. Control may or may not return immediately.
Doing /anything/ after triggering the dataplane is a recipe for odd race
conditions.
(FWIW, I'm not sure rib_process_result() is called correctly in the
synchronous case, but that's a separate problem.)
Unfortunately, this change might have some unforeseen side effects. I
haven't dug through the code to see if anything breaks. There
/shouldn't/ be anything looking at the SELECTED bit here, but who knows.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
Do not return pointer to the newly created thread from various thread_add
functions. This should prevent developers from storing a thread pointer
into some variable without letting the lib know that the pointer is
stored. When the lib doesn't know that the pointer is stored, it doesn't
prevent rescheduling and it can lead to hard to find bugs. If someone
wants to store the pointer, they should pass a double pointer as the last
argument.
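As a rough sketch of the new calling convention (simplified types; the exact lib/thread.h signature may differ), the caller hands over the location of its pointer instead of stashing the return value:
```
/* Sketch with simplified types; the real lib/thread.h API differs in detail. */
#include <stddef.h>

struct thread;
struct thread_master;

/* After the change the add functions return void; a caller that wants to
 * keep the pointer passes its address as the last argument, so the lib
 * both stores it and knows it is stored (no silent double-scheduling). */
void thread_add_timer(struct thread_master *m,
		      int (*func)(struct thread *), void *arg,
		      long timer_secs, struct thread **ref);

static struct thread *t_sweep;	/* reference managed by the lib */

static void schedule_sweep(struct thread_master *m, int (*cb)(struct thread *))
{
	/* The lib sees t_sweep and will not double-schedule the event. */
	thread_add_timer(m, cb, NULL, 60, &t_sweep);
}
```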
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
rib_update() was mallocing memory and then attempting to schedule,
and if the schedule failed (it was already going to be run)
FRR would then free the memory. Fix this memory usage pattern.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
It allows FRR to read the interface config even when the necessary VRFs
are not yet created and interfaces are in "wrong" VRFs. Currently, such
config is rejected.
For VRF-lite backend, we don't care at all about the VRF of the inactive
interface. When the interface is created in the OS and becomes active,
we always use its actual VRF instead of the configured one. So there's
no need to reject the config.
For netns backend, we may have multiple interfaces with the same name in
different VRFs. So we care about the VRF of inactive interfaces. And we
must allow preconfiguring the interface in a VRF even before it is
moved to the corresponding netns. From now on, we allow creating
multiple configs for the same interface name in different VRFs, and
the necessary config is applied once the OS interface is moved to the
corresponding netns.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
When something is used only from zebra and part of its description is
"should be called from zebra only" then it belongs to zebra, not lib.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
When an ES is deleted and re-added bgpd can start sending MAC-IP sync updates
before the dataplane and zebra have setup the VLAN membership for the ES. Such
MAC entries are not installed in the dataplane till the ES-EVI is created.
Ticket: #2668488
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
In the window immediately after an ES deletion bgpd can send MAC-IP updates
using that ES. Zebra needs to ignore these updates to prevent creation
of stale entries.
Ticket: #2668488
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
This addresses deletion of ES interfaces that were not completely
configured.
Ticket: #2668488
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
When PTM sends a "cbl status" message it specifies the interface name
but not the VRF name. It is fine for VRF-lite, but doesn't work for
netns because it's possible to have multiple interfaces with the same
name. Be more restrictive in this case and return an error instead of
randomly using the interface with the specified name.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
With netns VRF backend, we may have multiple interfaces with the same
name. Currently, the function output is not deterministic in this case,
it returns the first interface that it finds in the list. Be more
explicit and tell the user that we need the VRF name.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
```
exit1-debian-9# show ip route 172.16.16.1/32
Routing entry for 172.16.16.1/32
Known via "bgp", distance 20, metric 0, best
Last update 00:00:28 ago
* 192.168.0.2, via eth1, weight 1
AS-Path : 65003
Communities : first 65001:2 65001:3
Large-Communities: 65001:1:1 65001:1:2 65001:1:3
Selection reason : First path received
```
Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>
Currently, the ll_type is set only in `netlink_interface` which is
executed only during startup. If the interface is created when FRR
is already running, the type is not stored.
Fixes #1164.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
When a client tells zebra that GR mode is being turned
on, it also passes down the time zebra should hold
onto the routes. Display this time in the output
of the `show zebra client` command as well.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When issuing the `show zebra client` command data about
Graceful Restart state is printed twice.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
On startup, zebra would dump interface information from the kernel in 3
steps without a lock: step 1, get interface information; step 2, get
interface ipv4 addresses; step 3, get interface ipv6 addresses.
If any interface gets added after step 1, but before step 2/3, zebra
would get extra interface addresses in step 2/3 for interfaces that
have not been added to zebra in step 1. Returning an error in the
referenced interface lookup would cause the startup interface
retrieval to be incomplete.
Signed-off-by: Yuan Yuan <yyuanam@amazon.com>
FRR should only ever use the appropriate THREAD_ON/THREAD_OFF
semantics. This is especially true for the functions we
end up calling the thread for.
Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>
There's a helper function to check whether the interface is loopback or
VRF - if_is_loopback_or_vrf. Let's use it whenever we need to check that.
There's no functional change in this commit.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Pass down the safi for when we need address
resolution. At this point in time we are
hard coding the safi to SAFI_UNICAST.
Future commits will take advantage of this.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
PIM is going to need to be able to send down the address it is
trying to resolve in the multicast rib. We need a way to signal
this to the end developer. Start the conversion by adding the
ability to have a safi. But only allow SAFI_UNICAST at the moment.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The entirety of the import checking no longer needs to be
in zebra, as no one is calling it. Remove the code.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
These are no longer really needed. The client just needs
to call nexthop resolution instead.
So let's remove the zapi types.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
There were two identical blocks of code run at init time that
requested info about AF_BRIDGE - don't see any reason to do that
twice, so remove one block.
Signed-off-by: Mark Stapp <mstapp@nvidia.com>
Because the vrf backend may be based on namespaces, each vrf can
use table identifiers in the [16-(2^32-1)] range for daemons that
request them. Extend the table manager to be hosted per vrf.
That possibility is disabled when the vrf backend is vrf-lite.
In that case, all vrf contexts use the same table manager instance.
Add a configuration command to be able to configure the desired
range of tables to use. This is a solution that permits giving
chunks to the bgp daemon when it works with bgp flowspec entries and
wants to use specific iptables that do not override vrf tables.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
When using bgp evpn rt5 setup, after BGP configuration has been
loaded, if the user attempts to detach and reattach the bridged
vxlan interface from the bridge, then BGP loses its BGP EVPN
contexts, and a refresh of BGP configuration is necessary to
maintain consistency between linux configuration and BGP EVPN
contexts (RIB). The following command can lead to inconsistency:
ip netns exec cust1 ip link set dev vxlan1000 nomaster
ip netns exec cust1 ip link set dev vxlan1000 master br1000
Following these commands, the BGP l2vpn evpn RIB is empty, and the way
to solve this until now has been to reconfigure EVPN like this:
vrf cust1
no vni 1000
vni 1000
exit-vrf
Actually, the link information is correctly handled. In fact,
at the time of the link event, the lower link status of the bridge
interface was not yet up, thus preventing BGP EVPN contexts
from being established. In fact, when a bridge interface does not
have any slave interface, the link status of the bridge interface
is down. That change of status comes a bit after, and is not
detected by slave interfaces, as this event is not intercepted.
This commit intercepts the bridge link up event, and triggers
a check on slaved vxlan interfaces.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
When running a bgp evpn rt5 setup, the Rmac sent in BGP updates
stands for the MAC address of the bridge interface. After the frr
configuration has been loaded, the Rmac address is not refreshed.
This issue can be easily reproduced by executing some commands:
ip netns exec cust1 ip link set dev br1000 address 2e:ab:45:aa:bb:cc
Actually, the BGP EVPN contexts are kept unchanged.
This commit proposes to fix this by intercepting the MAC address
change and refreshing the vxlan interfaces attached to the bridge
interface that changed its MAC address.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
We should not be using a `default` case with an enumerated type.
Doing so prevents the developer adding new cases from finding, just
by compiling, the places they need to fix.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Move the handler for incoming interface address events
to a neutral source file - it's not netlink-specific and
shouldn't have been in a netlink file.
Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>
Read incoming interface address change notifications in the
dplane pthread; enqueue the events to the main pthread
for processing. This is netlink-only for now - the bsd
kernel socket path remains unchanged.
Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>
Add new apis for dplane interface address handling, based on
the existing api. The existing api is basically split in two:
the first part processes an incoming netlink message in the
dplane pthread, creating a dplane context with info about
the event. The second part runs in the main pthread and uses
the context data to update an interface or connected object.
Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>
Add a new netlink socket for events coming in from the host OS
to the dataplane system for processing. Rename the existing
outbound dplane socket.
Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>
Description: Currently, IPv4 routes with IPv6 link-local nexthops are
not properly installed in FPM.
The reason is that the netlink decoding truncates the ipv6 LL address
to a 4-byte ipv4 address.
Ex: fe80:: is directly converted to ipv4, resulting in 254.128.0.0
as the nexthop for the routes below:
show ip route
Codes: K - kernel route, C - connected, S - static, R - RIP,
O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP,
T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP,
F - PBR, f - OpenFabric,
> - selected route, * - FIB route, q - queued, r - rejected, b - backup
B>* 2.1.0.0/16 [200/0] via fe80::268a:7ff:fed0:d40, Ethernet0, weight 1,
02:22:26
B>* 5.1.0.0/16 [200/0] via fe80::268a:7ff:fed0:d40, Ethernet0, weight 1,
02:22:26
B>* 10.1.0.2/32 [200/0] via fe80::268a:7ff:fed0:d40, Ethernet0, weight
1, 02:22:26
Hence this fix converts the ipv6-LL address to the ipv4-LL address
(169.254.0.1) before sending it to FPM. This is in line with how these
types of routes are currently programmed into the kernel.
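A minimal standalone sketch of that conversion (not the actual FPM encoder code):
```
#include <arpa/inet.h>
#include <netinet/in.h>
#include <sys/socket.h>

/* Sketch only: if the nexthop is an IPv6 link-local address, substitute
 * the IPv4 link-local 169.254.0.1, mirroring how such routes are
 * programmed into the kernel. */
static void fpm_fix_v4_nexthop(const struct in6_addr *nh6, struct in_addr *nh4)
{
	if (IN6_IS_ADDR_LINKLOCAL(nh6))
		inet_pton(AF_INET, "169.254.0.1", nh4);
}
```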
Signed-off-by: Nikhil Kelapure <nikhil.kelapure@broadcom.com>
The current implementation doesn't copy nexthop_srv6. This causes
unexpected behavior when receiving SID information and the nexthop
isn't onlink.
Signed-off-by: Ryoga Saito <contact@proelbtn.com>
Problem:
When IP1:M1 (local) moved to IP1:M2 (remote-VTEP) bgpd continues to
advertise IP1:M1.
Fix:
Local path del is sent to bgp if the neigh was {local-active||peer-active}.
So path del needs to be called before the sync flags (including peer-active)
are cleared.
Ticket: #2706744
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
When we hand-set the router-id but have chosen a router-id
that is already the `winner`, there is no point in updating anyone
with this data.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
At startup there exists a time frame where we might not know
a particular vrf's router id. When zebra gets a request for
it let's not just blindly send whatever we have. Let's be
a bit smart and only respond with one if we have one.
The upper level protocol can wait for it to have one.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
vrf_name_to_id() returned VRF_DEFAULT when the vrf name was
unknown, hiding errors. Per community recommendation, vrf_name_to_id()
is now removed and the few callers now use vrf_lookup_by_name()
directly.
Signed-off-by: G. Paul Ziemba <paulz@labn.net>
When running a bgp evpn rt5 setup with the vrf namespace backend, once
the BGP configuration is loaded, some refreshes, like the config change
of a vxlan interface, are not taken into account. As a consequence, the
BGP l2vpn evpn entries are empty. This can happen by recreating the
vxlan interface as follows:
ip netns exec cust1 ip li del vxlan1000
ip link add vxlan1000 type vxlan id 1000 dev loopback0 local 10.209.36.1 learning
ip link set dev vxlan1000 mtu 9000
ip link set dev vxlan1000 netns cust1
ip netns exec cust1 bash
ip link set dev vxlan1000 up
ip link set dev vxlan1000 master br1000
Actually, changing the learning attribute requires recreating the
interface, and this change then requires manually reloading the frr
configuration.
The update mechanism in zebra about vxlan interface updates is
already put in place, but it does not work well with namespace
based vrf backend. The function zl3vni_from_svi() is then
modified to parse all the interfaces of each namespace.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
Description:
Change is intended for fixing the following issues related to vrf route leaking:
Routes with special nexthops i.e. blackhole/sink routes when imported,
are not programmed into the FIB and corresponding nexthop is set as 'inactive',
nexthop interface as 'unknown'.
While importing/leaking routes between VRFs, in the case of a special
nexthop (ipv4/ipv6), once bgp announces the route(s) to zebra, the
nexthop type is incorrectly set as
NEXTHOP_TYPE_IPV6_IFINDEX/NEXTHOP_TYPE_IFINDEX,
i.e. directly connected, even though we are not able to resolve through
an interface. This leads to nexthop_active_check marking the nexthop
!NEXTHOP_FLAG_ACTIVE.
Unable to find any active nexthop(s), the route is not programmed into
the FIB.
Whenever BGP leaks routes, set the correct nexthop type, so that route gets resolved
and correctly programmed into the FIB, in the imported vrf.
Co-authored-by: Kantesh Mundaragi <kmundaragi@vmware.com>
Signed-off-by: Iqra Siddiqui <imujeebsiddi@vmware.com>
Insist on the fact that zclient neighbor state flags are
mapped over netlink state flags. List all the defines
currently known on kernel, and create a netlink API to
convert netlink values to zclient values. The function is
simplified as it is a 1-1 match.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
As NHRP expects some notification of neighboring entries on GRE
interface, when a new interface notification is encountered, the
exact neighbor state flag is found. Previously, the flag passed
to the upper layer was forced to NDM_STATE which is REACHABLE,
as can be seen on below trace:
2021/08/25 10:58:39 NHRP: [QQ0NK-1H449] Netlink: new-neigh 102.1.1.1 dev gre1 lladdr 10.125.0.2 nud 0x2 cache used 1 type 5
When passing the real value, NHRP received another value, like STALE.
2021/08/25 11:28:44 NHRP: [QQ0NK-1H449] Netlink: new-neigh 102.1.1.1 dev gre1 lladdr 10.125.0.2 nud 0x4 cache used 0 type 5
This flag is important for NHRP, as it permits monitoring the link
layer of NHRP entries.
Fixes: d603c0774e ("nhrp, zebra, lib: enforce usage of zapi_neigh_ip structure")
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
"[no] netns NAME" commands are part of the lib, but they are actually
zebra-only:
- they are using vrf_netns_handler_create and its description clearly
says that it "should be called from zebra only"
- vtysh sends these commands only to zebra
- only zebra outputs the netns related config
- zebra notifies other daemons about netns attachment
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
There is a possibility that the same line can be matched as a command in
some node and its parent node. In this case, when reading the config,
this line is always executed as a command of the child node.
For example, with the following config:
```
router ospf
network 193.168.0.0/16 area 0
!
mpls ldp
discovery hello interval 111
!
```
Line `mpls ldp` is processed as command `mpls ldp-sync` inside the
`router ospf` node. This leads to a complete loss of `mpls ldp` node
configuration.
To eliminate this issue and all possible similar issues, let's print an
explicit "exit" at the end of every node config.
This commit also changes indentation for a couple of existing exit
commands so that all existing commands are on the same level as their
corresponding node-entering commands.
Fixes #9206.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
For some reason commit #ef524230a6baa decided
to remove enums and switch to uint16_t, which
is not the right thing to do. Put it back.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
zebra_mpls_transit_lsp() may be called with an empty nexthop, e.g.:
"no mpls lsp (16-1048575)".
So just remove this "gate_str" check. If "gate" is absent from the
command, "gtype" is set to NEXTHOP_TYPE_BLACKHOLE for subsequent
processing.
Signed-off-by: anlan_cs <anlan_cs@tom.com>
When NHRP registers with zebra to receive link layer events related to
gre interfaces, it is also interested in receiving RTM_GETNEIGH
messages.
Fixes ("b3b751046495") nhrpd: link layer registration to notifications
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
In zebra_evpn_proc_remote_nh if we do not pass in a long
enough stream, the stream reads will fail. Ensure that
we have enough data.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Handle TYPE_IFINDEX nexthops more consistently in a few places;
be more specific about a few integer return values that were
being treated as booleans.
Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>
When calling rib_add_multipath_nhe ensure that we have
well aligned return codes that mean something so that
interested parties can properly handle the situation.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When receiving a route via zapi, if the route is rejected
there exists a code path where we would not free the corresponding
re that was created.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The command `debug zebra kernel msgdump` is netlink specific.
There is no point at all in allowing this to be configured on
non-netlink platforms.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
There were a bunch of places where we converted the
route node to a prefix string via srcdest_rnode2str when
we should have been using %pRN in zebra_rib.c. Just
convert the ones that should use it.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When we are calling rib_process and the route_node
in question has no dest, there is no work to do here
at all. As such we should just return before
attempting to do any other work. This is just a tiny bit
of simplification being done.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
There exists a call path where the nhlfe_alloc can return NULL
for blackhole nexthops. In this case we were still trying
to save the nhlfe pointer causing a crash when we attempted
to add it to a self-contained list.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Do not use the `default` case when switching over an enumerated
type. This allows the code to fail to compile when we add a
new enumeration. Thus allowing us developers to know all
the places in the code we'll need to touch.
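A small standalone illustration of why this helps (made-up enum, not an FRR type): with no `default`, compilers that warn on unhandled enum values point at every switch that needs updating when a new value is added.
```
enum op_sketch { OP_ADD, OP_DELETE, OP_UPDATE };

/* No default case: if someone later adds OP_REPLACE to the enum,
 * -Wswitch style warnings flag this switch as incomplete, showing
 * the developer exactly where code needs to be touched. */
static const char *op_name(enum op_sketch op)
{
	switch (op) {
	case OP_ADD:
		return "add";
	case OP_DELETE:
		return "delete";
	case OP_UPDATE:
		return "update";
	}
	return "unknown";	/* unreachable for valid enum values */
}
```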
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
1. This check is absolutely useless. Nothing keeps user from deleting
the address right after this check.
2. This check prevents zebra from correctly reading the user config with
"set src" because of a race with interface startup (see #4249).
3. NO OPERATIONAL DATA USAGE ON VALIDATION STAGE.
Fixes #7319.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
v4 and v6 host/reference prefixes need to be set up separately for
[RMAC, VTEP] entries, as the VTEP is always normalized to a v4 addr.
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
The only difference in daemons' interface node definition is the config
write function. No need to define the node in every daemon, just pass
the callback as an argument to a library function and define the node
there.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
There exists some rare situations where fpm will attempt
to send a route update with no valid nexthops. In that
case an assert would be hit. This is not good for
trying to keep your routing daemons up and running
when we can safely just recover the situation.
Fixes#7588
Signed-off-by: batmancn <batmanustc@gmail.com>
<fixed commit message, and used zlog_err>
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Currently 'show evpn rmac vni .. mac .. json' includes fields for
localSequence and remoteSequence, which are misleading since they
aren't applicable to MACs in the IP-VRF MAC table (RMAC).
This removes the localSequence + remoteSequence fields from the output.
Signed-off-by: Trey Aspelund <taspelund@nvidia.com>
like the other automake variables, setting `xyz_LDFLAGS` causes
`AM_LDFLAGS` to be ignored for `xyz`. For some reason I had in my mind
that automake doesn't do this for LDFLAGS, but... it does. (Which is
consistent with `_CFLAGS` and co.)
So, all the libraries and modules have been ignoring `AM_LDFLAGS` (which
includes `SAN_FLAGS` too). Set up new `LIB_LDFLAGS` and
`MODULE_LDFLAGS` to handle all of this correctly (and move these bits to
a central location.)
Fixes: #9034
Fixes: 0c4285d77e ("build: properly split CFLAGS from AC_CFLAGS")
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
When an ip address on a bsd interface is considered
an alias, let's mark the connected prefix we generate as
SECONDARY.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When a port is removed from its last access vlan, the linux kernel
won't send any vlan info in the netlink message, which might cause
evpn mh to not withdraw EAD-EVI routes.
Signed-off-by: Gord Chen <gord_chen@edge-core.com>
Current code was allowing redistribution of kernel routes from
the non-default non vrf tables once FRR was already up and running.
In the case where we add `redistribute kernel` in an upper level
protocol we never consider the non-default vrf or non-vrf tables
so it is never accepted.
In the case where a kernel route is added after `redistribute kernel`
is already in place we were never looking at the fact that the
route was in a non-default non-vrf table. This code fixes
that issue.
Fixes: #9073
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Move remote VTEP updates from immediate, inline processing
in their ZAPI message handlers to the main workqueue.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Enqueue incoming vxlan remote macip updates on the main
workqueue, instead of performing the updates immediately,
in-line.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Add workqueue subqueue for EVPN/VxLAN updates; migrate the
evpn route and remote ES processing from their ZAPI handlers
to the workqueue.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
At some point we broke the nhe->ifp pointer such
that it was pointing to an interface even in group/recursive
instances.
Add checks here so that we only set the ifp
pointer if it is a fully resolved singleton NHE.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
In the reachability code we auto pass back the fully resolved
nexthops. Modify the ZEBRA_IPV4_NEXTHOP_LOOKUP_MRIB code
to do the exact same thing so that the zclient_lookup_nexthop
code does not need to recursively look for the data that
zebra already has.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Basically, this is handled by the JSON-C library. I've compiled with
the latest release of json-c and it works well.
Didn't test with various distribution versions, but this change is
kinda dependent on the json-c lib version the distro has.
Before:
```
"192.168.100.1\/32":[
{
"prefix":"192.168.100.1\/32",
```
After:
```
"192.168.100.1/32":[
{
"prefix":"192.168.100.1/32",
```
Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>
There are a few places in the code where we use PREFIX_COPY(_IPV4/IPV6)
macro to copy a prefix. Let's always use prefix_copy function for this.
This should fix CID 1482142 and 1504610.
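A simplified sketch of the difference (types here are illustrative, not the real lib/prefix.h definitions): a single copy function cannot pick the wrong per-family macro variant, and static analysis sees one well-defined copy.
```
#include <string.h>

/* Sketch only: simplified stand-in for the real prefix structure. */
struct prefix_sketch {
	unsigned char family;
	unsigned char prefixlen;
	unsigned char addr[16];	/* big enough for v4 or v6 */
};

static void prefix_copy_sketch(struct prefix_sketch *dst,
			       const struct prefix_sketch *src)
{
	memcpy(dst, src, sizeof(*dst));
}
```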
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
When sending nexthop information, we do not need to reset the
last_write_cmd since that is taken care of in the send routine.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Include the complete set of primary and backup nexthops from
the resolving route for a pseudowire. Add accessors for that
info. Modify the logic that creates the fib set of pw nexthops
so that only installed, labelled nexthops are included.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Modify the pseudowire reachability logic so that it returns
success if there is at least one installed labelled nexthop for
the route resolving the pw destination. We also check for
valid backup nexthops if necessary, in case there's been a
switchover event.
Only OpenBSD requires that _all_ nexthops be labelled, so we
have a more strict version of the logic also.
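Roughly, the check becomes the following (illustrative types; the real nexthop structures and flags differ):
```
/* Sketch: succeed if at least one nexthop resolving the PW destination
 * is installed and carries a label; the stricter OpenBSD variant would
 * require this of every nexthop. */
struct nh_sketch { int installed; int has_label; struct nh_sketch *next; };

static int pw_reachable_sketch(const struct nh_sketch *nhlist)
{
	const struct nh_sketch *nh;

	for (nh = nhlist; nh; nh = nh->next) {
		if (nh->installed && nh->has_label)
			return 1;
	}
	return 0;
}
```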
Signed-off-by: Mark Stapp <mjs@voltanet.io>
When processing bulk messages we need more space to handle more
mroutes. In this case we are doubling the stream size from
16k -> 32k, which should roughly double the number of mroutes
we can handle in one go.
Additionally, if we cannot parse the passed message into
the stream to pass up to pimd, then gracefully stop processing.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add a show command so we can easily get info on
which interfaces are turned on per vrf and in
which list.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Rework RA handling for vrf-lite scenarios.
Before, we were using a single FD for polling
across multiple zvrf's. This would cause us to hit this
assert() in some bgp unnumbered and vrrp configs:
```
/*
* What happens if we have a thread already
* created for this event?
*/
if (thread_array[fd])
assert(!"Thread already scheduled for file descriptor");
```
We were scheduling a thread_read on the same FD for every zvrf.
With vrf-lite, RAs and ARPs are not vrf-bound, so we can just use one
rtadv instance to manage them for all VRFs. We will choose the default
VRF for this.
This patch removes the rtadv_sock altogether for zrouter and moves the
functionality this represented to the default VRF. All RAs will be
handled in the default VRF under vrf-lite configs with only one poll
thread started for it.
This patch also extends how we track subscribed interfaces (s or msec)
to use an actual sorted list by interface names rather than just a
counter. With multiple daemons turning interfaces on/off, these counters
can get very wrong during ifup/down events. Making them a sorted list
prevents this from happening by preventing duplicates.
With netns-vrf's nothing should change other than the interface list.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
FPM sends VNI to the data plane with the EVPN prefix. For pure type-5 EVPN
route, nexthop interface of EVPN prefix is L3VNI SVI. Thus, we encode L3VNI
corresponding to the nexthop vrf with rtmsg for this prefix.
For an EVPN type-5 route with a gateway IP overlay index, we support
asymmetric IRB. Thus, the nexthop interface is the L2VNI SVI. So, instead of fetching
vrf VNI, fetch VNI corresponding to the nexthop SVI and encode it in the rtmsg
for EVPN prefix.
Signed-off-by: Ameya Dharkar <adharkar@vmware.com>
The SVI ifindex for the L2VNI is required in BGP to perform EVPN type-5
to type-2 recursive resolution using the gateway IP overlay index.
Program this svi_ifindex in struct zebra_vni_t as well as in struct bgpevpn.
Changes include:
1. Add svi_if field to struct zebra_evpn_t
2. Add svi_ifindex field to struct bgpevpn
3. When SVI (bridge or VLAN) is bound to a VxLAN interface, store it in the
zebra_evpn_t structure.
4. Add this SVI ifindex to ZEBRA_VNI_ADD
5. Store svi_ifindex in struct bgpevpn
Signed-off-by: Ameya Dharkar <adharkar@vmware.com>
When the VRF node is exited using "exit" or "quit", there's still a VRF
pointer stored in the vty context. If you try to configure some router
related command, it will be applied to the previous VRF instead of the
default VRF. For example:
```
(config)# vrf test
(config-vrf)# ip router-id 1.1.1.1
(config-vrf)# do show run
...
!
vrf test
ip router-id 1.1.1.1
exit-vrf
!
...
(config-vrf)# exit
(config)# ip router-id 2.2.2.2
(config)# do show run
...
!
vrf test
ip router-id 2.2.2.2
exit-vrf
!
...
```
`vrf-exit` works correctly, because it stores a pointer to the default
VRF into the vty context (but weirdly keeping the VRF_NODE instead of
changing it to CONFIG_NODE).
Instead of relying on the behavior of exit function, always use the
default VRF when in CONFIG_NODE.
Another problem is missing `VTY_CHECK_CONTEXT`. If someone deletes the
VRF in which node the user enters the command, then zebra applies the
command to the default VRF instead of throwing an error.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
https://github.com/FRRouting/frr/pull/5865#discussion_r597670225
As this comment says, ZEBRA_FLAG_XXX should not have been used
to communicate SRv6 route information. A simple nexthop flag would
have been sufficient for SRv6 information, and I fixed the whole
thing that way.
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
An FRRouting operator can install seg6 routes via ZAPI,
but a linux kernel operator can also install seg6 routes
via netlink directly (i.e. iproute2).
This commit makes zebra parse non-frr seg6 route
configuration via netlink and audit zebra's RIB.
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
With this patch, zclients can install seg6 routes when
they set the "nh_seg6_segs" property on struct nexthop
and set ZEBRA_FLAG_SEG6_ROUTE on the zapi_route's flags.
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
This commit is a part of the #5853 work that adds new CLIs to
configure the SRv6 locator and its show commands.
The following CLIs are added in this commit:
vtysh -c 'conf te' \
-c 'segment-routing' \
-c ' srv6' \
-c ' locators' \
-c ' locator LOC1' \
-c ' prefix A::/64'
- "show segment-routing srv6 sid [json]"
- "show segment-routing srv6 locator [json]"
- "show segment-routing srv6 locator NAME detail [json]"
- "show runnning-config" (make it to print srv6 configuration)
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
This commit is a part of the #5853 work that adds new ZAPIs to
configure the SRv6 locator, which manages chunk prefixes of
SRv6 SID IPv6 addresses for each routing protocol daemon.
NEW-ZAPIs:
* ZEBRA_SRV6_LOCATOR_ADD
* ZEBRA_SRV6_LOCATOR_DELETE
* ZEBRA_SRV6_MANAGER_CONNECT
* ZEBRA_SRV6_MANAGER_GET_LOCATOR_CHUNK
* ZEBRA_SRV6_MANAGER_RELEASE_LOCATOR_CHUNK
A zclient can connect to zebra's srv6-manager with the
ZEBRA_SRV6_MANAGER_CONNECT api, like a label-manager.
Then the zclient uses ZEBRA_SRV6_MANAGER_GET_LOCATOR_CHUNK to
allocate a dedicated locator chunk for its routing protocol.
Zebra only handles the prefix reservation and distributes
the ownership of the locator chunks to zclients.
Then, the zclient installs the SRv6 function with the
ZEBRA_ROUTE_ADD api using the nh_seg6local_* fields.
This feature is already implemented by another PR(#7680).
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
This commit is a part of #5853 that adds new cmd-nodes for SRv6 configuration.
This commit just adds the cmd-nodes and node-moving CLIs; the actual SRv6
config commands aren't added. (They are added in a later commit of this branch.)
new cli nodes:
* SRv6
* SRv6-locators
* SRv6-locator
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
An FRRouting operator can install seg6local routes via ZAPI,
but a linux kernel operator can also install seg6local routes
via netlink directly (i.e. iproute2).
This commit makes zebra parse non-frr seg6local
route configuration via netlink and audit zebra's RIB.
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
With this patch, zclients can install seg6local routes when
they set the nh_seg6local_{action,ctx} properties on struct nexthop
and set ZEBRA_FLAG_SEG6LOCAL_ROUTE on the zapi_route's flags.
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
This includes community and large-community data.
```
exit1-debian-9# show ip route 172.16.16.1/32
Routing entry for 172.16.16.1/32
Known via "bgp", distance 20, metric 0, best
Last update 00:00:23 ago
* 192.168.0.2, via eth1, weight 1
AS-Path : 65030
Communities : 65001:1 65001:2 65001:3 65001:4 65001:5 65001:6
Large-Communities: 65001:123:1 65001:123:2
```
Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>
Track 'down' state of connected addresses with a new flag. We
may have multiple addresses on an interface that share a prefix;
in those cases, we need to determine when the first address
is valid, to install a connected route, and similarly detect
when the last address goes 'down', to remove the connected
route.
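A simplified sketch of the bookkeeping (not the real connected/interface structures): the connected route is installed when the first address sharing the prefix becomes usable, and removed when the last one goes down.
```
/* Sketch only: walk the addresses sharing one prefix and report whether
 * any of them is still up; install the connected route on a 0 -> 1
 * transition and remove it on a 1 -> 0 transition. */
struct caddr_sketch { int down; struct caddr_sketch *next; };

static int prefix_has_usable_addr(const struct caddr_sketch *list)
{
	const struct caddr_sketch *a;

	for (a = list; a; a = a->next) {
		if (!a->down)
			return 1;
	}
	return 0;
}
```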
Signed-off-by: Mark Stapp <mjs@voltanet.io>
if_netlink.c created its own nested parsing #define which
is identical to netlink_parse_rtattr_nested. Consolidate
on one instead of having this duality.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
In order to parse the netlink message into the
`struct rtattr *tb[size]` it is assumed that the buffer is
memset to 0 before the parsing. As such if you attempt
to read a value that was not returned in the message
you will not crash when you test for it.
The code has places where we memset it and places where we don't.
This *will* lead to crashes when the kernel changes. In
our parsing routines, let's have them do the memset instead of having
to remember to do it before passing the buffer in to the parser.
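A minimal sketch of a parsing helper with the memset folded in (based on the usual rtattr walk; the details of zebra's own netlink_parse_rtattr may differ):
```
#include <string.h>
#include <linux/rtnetlink.h>

/* Zero the table first, then fill in whatever attributes the kernel
 * actually sent; absent attributes stay NULL instead of holding stale
 * pointers from the caller's stack. */
static void parse_rtattr_sketch(struct rtattr **tb, int max,
				struct rtattr *rta, int len)
{
	memset(tb, 0, sizeof(struct rtattr *) * (max + 1));
	for (; RTA_OK(rta, len); rta = RTA_NEXT(rta, len)) {
		if (rta->rta_type <= max)
			tb[rta->rta_type] = rta;
	}
}
```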
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When clagd is stopped on secondary device,
all vxlan interfaces (vnis) are kept in protodown state.
FRR treats protodown vxlan interfaces (vnis) as interface down
and sends vni delete to bgpd.
In the event of clagd down, SVIs are flapping as underlying
bridge is going through churn.
When FRR receives an SVI up notification, do not trigger an event to
bgpd if the vnis are operationally down.
Ticket:#2600210 CM-22929
Reviewed By:CCR-11544
Testing Done:
Performed CLAG stop/start on the secondary device; all vxlan devices
remained in protodown. Along with this, validated that the vnis are
cleaned up and added back in bgpd.
Signed-off-by: Chirag Shah <chirag@nvidia.com>
Description:
Added a new show command ("show ip zebra route dump") to dump all routes
with detailed information including nexthops, flags, status, etc.
This helps with debugging and is added to support_bundle_command.conf.
Defined this command as a hidden command.
Signed-off-by: Rajesh Girada <rgirada@vmware.com>
When creating a large number of vrf's we are creating a fairly
large number of hash tables per vrf. Reduce memory usage on
startup as well as let us identify the table these things come
from.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
We are creating 2 hash tables per vni in zebra. Once we start to
scale the number of vni's we start to see some serious memory
usage in zebra. Let's reduce the memory usage at startup
for scale of vni's.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Current code has an inconsistent behavior with redistribute routes.
Suppose you have a kernel route that is being read w/ a distance
of 255:
eva# show ip route kernel
Codes: K - kernel route, C - connected, S - static, R - RIP,
O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP,
T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP,
F - PBR, f - OpenFabric,
> - selected route, * - FIB route, q - queued, r - rejected, b - backup
t - trapped, o - offload failure
K>* 0.0.0.0/0 [0/100] via 192.168.161.1, enp39s0, 00:06:39
K>* 4.4.4.4/32 [255/8192] via 192.168.161.1, enp39s0, 00:01:26
eva#
If you have redistribution already turned on for kernel routes
you will be notified of the 4.4.4.4/32 route. If you turn
on kernel route redistribution watching after the 4.4.4.4/32 route
has been read by zebra you will never learn of it.
There is no need to look for infinite distance in the redistribution
code. Either we are selected or not. In other words non kernel routes
with an 255 distance are never installed so the checks were pointless.
So let's just remove the distance checking and tell interested parties
about the 255 kernel route if it exists.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Currently FRR reads the kernel for interface state and FRR
creates a connected route per address on an interface. If
you are in a situation where you have multiple addresses
on an interface just create 1 connected route for them:
sharpd@eva:/tmp/topotests$ vtysh -c "show int dummy302"
Interface dummy302 is up, line protocol is up
Link ups: 0 last: (never)
Link downs: 0 last: (never)
vrf: default
index 3279 metric 0 mtu 1500 speed 0
flags: <UP,BROADCAST,RUNNING,NOARP>
Type: Ethernet
HWaddr: aa:4a:ed:95:9f:18
inet 10.4.1.1/24
inet 10.4.1.2/24 secondary
inet 10.4.1.3/24 secondary
inet 10.4.1.4/24 secondary
inet 10.4.1.5/24 secondary
inet6 fe80::a84a:edff:fe95:9f18/64
Interface Type Other
Interface Slave Type None
protodown: off
sharpd@eva:/tmp/topotests$ vtysh -c "show ip route connected"
Codes: K - kernel route, C - connected, S - static, R - RIP,
O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP,
T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP,
F - PBR, f - OpenFabric,
> - selected route, * - FIB route, q - queued, r - rejected, b - backup
t - trapped, o - offload failure
C>* 10.4.1.0/24 is directly connected, dummy302, 00:10:03
C>* 192.168.161.0/24 is directly connected, enp39s0, 00:10:03
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Since _rnode_zlog was wrapping zlog(), these messages weren't getting an
unique ID assigned through the xref mechanism. Replace macro with a
small extension that prints (almost) the same thing.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
Initially the reading of the speed of an interface happened
upon interface creation and happened until the speed of a link
settled down to a single value. The speed of an interface
can also change as that a new optic can be inserted that
changes the speed, in which case FRR would see an interface
down (optic removal) and then an interface up (optic insertion).
In this case FRR would not treat this as an event that changed
the speed. Let's expand the checking a bit more.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
- gre keys are collected and stored locally.
- when gre source set is requested, and the link interface
configured is different, the gre information collected is
pushed in the query, namely source ip or gre keys if present.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
preserve mtu upon interface flapping and tunnel source change.
Signed-off-by: Reuben Dowle <reuben.dowle@4rf.com>
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
This action is initiated by nhrp and has been stubbed when
moving to zebra. Now, a netlink request is forged to set
the link interface of a gre interface if that gre interface
does not already have a link interface.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
zebra is able to get information about gre tunnels.
A zebra_gre file is created to handle hooks, but is not yet used.
Also, a debug zebra gre command is added to provide gre traces.
The zebra_gre file is used for complementary actions that may be needed.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
When zebra has the vrf backend mapped to namespaces, the polling
of interfaces fixes up all interface linkages. This
was not done on non-default namespaces; do it for the other namespaces.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
There are cases where either link information is not present at
interface creation or the link information changed. Handle this
situation.
Signed-off-by: Philippe.Guibert <philippe.guibert@6wind.com>
zebra dd link
a) `debug zebra kernel` turns off `debug zebra kernel msgdump....`
this is odd and bad
b) `debug zebra kernel msgdump send` turns off receive and vice versa
this is counter intuitive as well
c) `no zebra kernel msgdump ...` turns off all kernel level debugging
we should only turn off msgdump specific debugs
d) `no debug zebra kernel` turns off all kernel level debugging
we should leave msgdump on.
e) Fix `show run` and show debug output
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Encoding a signed int as unsigned is bad practice; since we want to do
it here, let's at least be explicit about it.
Signed-off-by: Quentin Young <qlyoung@nvidia.com>
Use unsigned value for all RA requests to Zebra
- encoding signed int as unsigned is bad practice
- RA interval is never, and should never be, negative
Signed-off-by: Quentin Young <qlyoung@nvidia.com>
This is always a 16 bit unsigned value.
- signed int is the wrong type to use
- encoding a signed int as a uint32 is bad practice
- decoding a signed int encoded as a uint32 into a uint16 is bad
practice
Signed-off-by: Quentin Young <qlyoung@nvidia.com>
We're firing an event debug log for zebra_redistribute_add, but not one
for zebra_redistribute_delete. Let's make it symmetric.
Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>
`config.h` has all the defines from autoconf, which may include things
that switch behavior of other included headers (e.g. _GNU_SOURCE
enabling prototypes for additional functions.)
So, the first include in any `.c` file must be either `config.h` (with
the appropriate guard) or `zebra.h` (which includes `config.h` first
thing.)
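In other words, every `.c` file begins with one of these two forms (standard autoconf guard; shown here only as an illustration):
```
/* Option 1: config.h behind its guard, before anything else. */
#ifdef HAVE_CONFIG_H
#include "config.h"
#endif

/* Option 2: zebra.h, which pulls in config.h first thing. */
#include <zebra.h>
```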
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
Properly handle refcounting of Proto-owned NHGs when
zebra is operating under graceful restart and retain
conditions.
We have an extra refcnt of 1 we keep for proto-owned NHGs to
indicate the upper level proto has created and owns it.
When we are reading these in from the kernel, we need to set them
to 1 as appropriate. Without this, we fail in the assert() during
zebra_nhg_proto_add() after the owning daemon resends the NHG
and the refcnts are off by one.
Also add in the same logic we use for routes when sweeping with
respect to uptimes.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Add uptime for use with NHEs to keep track of how
long we have had this NHE in our rib without an update.
This is treated exactly the same as the re->uptime for
routes. When we get an update for a route, we reset the
uptime.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Add a PROTO_OWNED macro for code readability when checking
ID bounds for whether a NHG is proto owned.
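A sketch of what such a macro can look like (the boundary constant below is an assumption, not zebra's actual value):
```
/* Sketch only: IDs at or above an assumed protocol-range boundary are
 * treated as owned by an upper-level protocol rather than by zebra. */
#define NHG_ID_PROTO_LOWER	(1u << 28)	/* assumed boundary */
#define PROTO_OWNED(id)		((id) >= NHG_ID_PROTO_LOWER)
```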
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Handle SR-TE policy changes in the LSP async notification
handler, as we do in the normal LSP dplane results handler.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
When capturing backup nexthops with recursive resolution,
ensure that inner labels from the recursive nexthop are
included in each backup (as they are with the resolving
primary nexthops).
Signed-off-by: Mark Stapp <mjs@voltanet.io>
`CFLAGS` is a "user variable", not intended to be controlled by
configure itself. Let's put all the "important" stuff in AC_CFLAGS and
only leave debug/optimization controls in CFLAGS.
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
... by referencing all autogenerated headers relative to the root
directory. (90% of the changes here is `version.h`.)
Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
Use the main zebra workqueue for daemon-owned NHGs, in addition
to processing kernel-owned NHGs. The zapi message processing
creates a temporary object that's enqueued to the workqueue,
then processed/installed as part of the workqueue processing.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Do not add a new route type, and consider 0 as a value meaning
that zebra should be the owner.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
zapi_nbr structure is renamed to zapi_neigh_ip.
Initially used to set a neighbor ip entry for gre interfaces, this
structure is used to get events from the zebra layer to nhrp layer.
The ndm state has been added, as it is needed on both sides.
The zebra dplane layer is slightly modified.
Also, to clarify what ZEBRA_NEIGH_ADD/DEL means, a rename is done:
it is now called ZEBRA_NEIGH_IP_ADD/DEL, and it signifies that this
zapi interface permits setting link operations by associating ip
addresses to link addresses.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
The first change in this commit is the processing of the VRF termination.
When we terminate the VRF, we should not delete the underlying interfaces,
because there may be pointers to them in the northbound configuration. We
should move them to the default VRF instead.
Because of the first change, the VRF interface itself is also not deleted
when deleting the VRF. It should be handled in netlink_link_change. This
is done by the second change.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
Most of these are many, many years out of date. All of them vary
randomly in quality. They show up by default in packages where they
aren't really useful now that we use integrated config. Remove them.
The useful ones have been moved to the docs.
Signed-off-by: Quentin Young <qlyoung@nvidia.com>
Instead of directly configuring the neighbor table after reading from
the zapi interface, a zebra dplane context is prepared to host the
interface and
the family where the neighbor table is updated. Also, some other fields
are hosted: app_probes, ucast_probes, and mcast_probes. More information
on those fields can be found on ip-ntable configuration.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
EVPN neighbor operations were already done in the zebra dataplane
framework. Now that NHRP is able to use zebra to perform neighbor IP
operations (by programming link IP operations), handle this operation
under dataplane framework:
- assign two new operations NEIGH_IP_INSTALL and NEIGH_IP_DELETE; this
is reserved for GRE like interfaces:
example: ip neigh add A.B.C.D lladdr E.F.G.H
- use 'struct ipaddr' to store and encode the link ip address
- reuse dplane_neigh_info, and create an union with mac address
- reuse the protocol type and use it for neighbor operations; this
permits storing the daemon originating this neighbor operation.
a new route type is created: ZEBRA_ROUTE_NEIGH.
- the netlink level functions will handle a pointer, and a type; the
type indicates the family of the pointer: AF_INET or AF_INET6 if the
link type is an ip address, mac address otherwise.
- to keep backward compatibility with old queries, as no extension was
done, an option NEIGH_NO_EXTENSION has been put in place
- also, 2 new state flags are used: NUD_PERMANENT and NUD_FAILED.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
A neighbor table api in zebra is added. A netlink api is created for
that. The handler is called from the api defined in the previous commit.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
When netlink_neigh_update() is called, the link registration was
failing, due to bad request length.
Also, the query was failing if NDA_DST was an ipv6 address.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
A zebra api is extended to offer the ability to add or remove a
neighbor entry from a daemon. This extension also makes it possible to
add a neigh entry not only between IPs and macs, but also between IPs
and NBMA IPs.
This API supports configuring ipv6/ipv4 entries with ipv4/ipv6 lladdr.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
zebra implements a zebra API for configuring link-layer information. That
can be an ARP entry (for IPv4) or an IPv6 neighbor discovery entry. It can
also be an IPv4/IPv6 entry associated with an underlay IPv4 address, as
used on GRE point-to-multipoint interfaces.
This API will also be used for monitoring: a hash list (the VRF bitmap) is
instantiated in zebra, and each client interested in those entries in a
specific VRF will listen for the following messages: entries added,
entries removed, or who-has messages.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
Optionally hide route changes that only involve backup nexthop
activation/deactivation. The goal is to avoid route churn during
backup nexthop switchover events, before the resolving routes
re-converge. A UI config enables this 'hiding' behavior.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Description:
After an FRR restart, routes are not redistributed when routes are added
first and the 'redistribute static' command is issued afterwards.
During the FRR restart, vrf_id is unknown,
so irrespective of redistribution, we set the redistribute vrf bitmap.
Later, when we add a route and then issue the 'redistribute' command,
we check the redistribute vrf bitmap and return CMD_WARNING;
zebra_redistribute_add also checks the redistribute vrf bitmap and returns.
Instead of checking the redistribute vrf bitmap, always set it.
Co-authored-by: Santosh P K <sapk@vmware.com>
Co-authored-by: Kantesh Mundaragi <kmundaragi@vmware.com>
Signed-off-by: Abhinay Ramesh <rabhinay@vmware.com>
When certain events occur (e.g. connected route changes)
zebra examines LSPs to see if they might have been affected. For
LSPs with backup nhlfes, skip this immediate processing and
wait for the owning protocol daemon to react.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
This commit introduces the implementation for the north-bound
callbacks for the zebra-specific route-map match and set clauses.
Signed-off-by: NaveenThanikachalam <nthanikachal@vmware.com>
Signed-off-by: Sarita Patra <saritap@vmware.com>
This is to fix the crash reproduced by the following steps:
* ip link add red type vrf table 1
Creates VRF.
* vtysh -c "conf" -c "vrf red"
Creates VRF NB node and marks VRF as configured.
* ip route 1.1.1.0/24 2.2.2.2 vrf red
* no ip route 1.1.1.0/24 2.2.2.2 vrf red
(or similar l3vni set/unset in zebra)
Marks VRF as NOT configured.
* ip link del red
VRF is deleted, because it is marked as not configured, but NB node
stays.
Subsequent attempt to configure something in the VRF leads to a crash
because of the stale pointer in NB layer.
Fixes #8357.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
EVPN nexthops are installed as remote neighs by zebra. This was earlier
done only via VRF IPvX uni routes imported from EVPN routes.
With EVPN-MH these VRF routes now reference an L3NHG which is set up based
on the EAD and doesn't include the RMAC. To work around that, BGP now
consolidates and maintains EVPN nexthops which are then sent to zebra.
zebra sets up these nexthops as L3-VNI nh entries using a dummy type-1
route as reference.
Ticket: CM-31398
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
This one also needed a bit of shuffling around, but MTYPE_RE is the only
one left used across file boundaries now.
Signed-off-by: David Lamparter <equinox@diac24.net>
Back when I put this together in 2015, ISO C11 was still reasonably new
and we couldn't require it just yet. Without ISO C11, there is no
"good" way (only bad hacks) to require a semicolon after a macro that
ends with a function definition. And if you added one anyway, you'd get
"spurious semicolon" warnings on some compilers...
With C11, `_Static_assert()` at the end of a macro will make it so that
the semicolon is properly required, consumed, and not warned about.
Consistently requiring semicolons after "file-level" macros matches
Linux kernel coding style and helps some editors against mis-syntax'ing
these macros.
Signed-off-by: David Lamparter <equinox@diac24.net>
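An illustrative, non-FRR example of the C11 trick described above: ending a file-level macro with `_Static_assert()` makes the trailing semicolon required and consumed, with no spurious-semicolon warning:
```
/* Hypothetical macro, for illustration only. */
#define DEFINE_ANSWER_FN(name, value)                                          \
	static int name(void)                                                  \
	{                                                                      \
		return (value);                                                \
	}                                                                      \
	_Static_assert(1, "please add a trailing semicolon")

DEFINE_ANSWER_FN(answer, 42);	/* the semicolon is now mandatory */
```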
The point of the `-std=gnu99` was to override a `-std=c99` that may be
coming in from net-snmp. However, we want C11, not C99.
Signed-off-by: David Lamparter <equinox@diac24.net>
Add a control and api for the use of backup nexthops in
recursive resolution. With 'no', we won't try to use installed
backup nexthops when resolving a recursive route.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Zebra routing tables are not controlled by the user and can not be
created/deleted manually. The current NB create/destroy callbacks are
incorrectly implemented because instead of creating/deleting the RIB
they only check for its existence. The YANG model should reflect
the real situation.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
There are places in the code where function nb_running_get_entry is used
with abort_if_not_found set to true during the config validation stage.
This is incorrect because when used in transactional CLI, the running
entry won't be set until the apply stage, and such usage leads to a crash.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
As was done for iptable contexts, a zebra dplane context is
created for each ipset/ipset-entry event. The zebra_dplane_ctx job is
then enqueued and processed by a separate thread. As with the
zebra_pbr_iptable context, the ipset and ipset-entry contexts are
encapsulated into a union of structures in zebra_dplane_ctx.
There is one specificity: when storing the ipset_entry structure, there
was a back-pointer to the ipset structure that is necessary
to get some complementary information before calling the hook. The
proposal is to use an ipset_entry_info structure next to the ipset_entry
in the zebra_dplane context. That information is used for ipset_entry
processing; the ipset name and the ipset type are the only fields
necessary.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
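A rough sketch of the encapsulation described above, with hypothetical struct and field names (the real zebra_pbr_ipset* definitions live in zebra/zebra_pbr.h and carry more fields):
```
#include <stdint.h>
#include "zebra/zebra_pbr.h"	/* struct zebra_pbr_ipset, zebra_pbr_ipset_entry */

/* Hypothetical companion info carried instead of the old backpointer. */
struct ipset_entry_info_sketch {
	char ipset_name[64];	/* owning ipset's name */
	uint32_t ipset_type;	/* owning ipset's type */
};

/* Hypothetical union inside the dplane context. */
struct dplane_ipset_ctx_sketch {
	union {
		struct zebra_pbr_ipset ipset;	/* ipset add/delete */
		struct {
			struct zebra_pbr_ipset_entry entry;
			struct ipset_entry_info_sketch info;
		} ipset_entry;			/* ipset-entry add/delete */
	} u;
};
```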
The iptable processing was not handled in the remote dataplane and was
directly processed by the thread in charge of zapi calls. Now that call
can be handled in the separate zebra_dplane thread: once a
zebra_dplane_ctx is allocated for iptable handling, the hook call is
performed later. Subsequently, a return code may be sent to the zclient
interface if any problem occurs when calling the hook.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
This was caused by uninitialized netlink attrs in the bond-member
netlink parse API.
PS: It was caught by the upstream topotests on ARM8 (passed everywhere
else).
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
This is needed as the kernel currently doesn't allow a mac replace if the
dst changes from an L2NHG to a single VTEP and vice versa.
Ticket: CM-31561
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When a ES-bond is in bypass state MACs learnt on it are linked to the
access port instead of the ES. When LACP converges on the bond it moves
out of bypass and the MACs previously learnt on it are flushed to force
a re-learn on new traffic.
Ticket: CM-31326
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When an ES-bond comes out of bypass FRR needs to flush the local MACs learnt
while the bond was in bypass. To do that efficiently local MACs are linked
to the dest-access port. This only happens if the access-port is in
LACP-bypass or if it is non-ES.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
Feature overview:
=================
A 802.3ad bond can be setup to allow lacp-bypass. This is done to enable
servers to pxe boot without a LACP license i.e. allows the bond to go oper
up (with a single link) without LACP converging.
If an ES-bond is oper-up in an "LACP-bypass" state MH treats it as a non-ES
bond. This involves the following special handling -
1. If the bond is in a bypass-state the associated ES is placed in a
bypass state.
2. If an ES is in a bypass state -
a. DF election is disabled (i.e. assumed DF)
b. SPH filter is not installed.
3. MACs learnt via the host bond are advertised with a zero ESI.
When the ES moves out of "bypass" the MACs are moved from a zero-ESI to
the correct non-zero id. This is treated as a local station move.
Implementation:
===============
When (a) an ES is detached from a hostbond or (b) an ES-bond goes into
LACP bypass, zebra deletes all the local macs (with that ES as destination)
in the kernel and its local db. BGP re-sends any imported MAC-IP routes
that may exist with this ES destination as remote routes, i.e. zebra can
end up programming a MAC that was previously local as remote, pointing
to a VTEP-ECMP group.
When an ES is attached to a hostbond or an ES-bond goes
LACP-up (out of bypass), zebra again deletes all the local macs in the
kernel and its local db. At this point BGP resends any imported MAC-IP
routes that may exist with this ES destination as sync routes, i.e.
zebra can end up programming a MAC that was previously remote
as local, pointing to an access port.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
VNI configuration is done without the NB layer in the default VRF. This
leads to the following problems:
```
vtysh -c "conf" -c "vni 1"
vtysh -c "conf" -c "vrf default" -c "no vni"
```
Second command does nothing, because the NB node is not created by the
first command.
```
vtysh -c "conf" -c "vrf default" -c "vni 1"
vtysh -c "conf" -c "no vni 1"
```
Second command doesn't delete the NB node created by the first command.
Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
This is causing problems with VM moves, i.e. the transition from remote
neigh to local neigh. This transition involves changing the NUD state from
NUD_NOARP to NUD_STALE, and the weak override flag prevents changing
the state from connected (REACHABLE, NOARP, PERMANENT) to STALE.
PS: Weak-override was originally used to prevent race conditions where
FRR can end up making a REACHABLE neigh STALE. We may need to revisit
and address that case at a later point.
Ticket: CM-30273
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
Start reorg of zebra nexthop-resolution so that we can use the
resolution logic for nexthop-groups as well as routes. Change
the signature of the core nexthop_active() api so that it does
not require a route-entry or route-node. Move some of the logic
around so that nexthop-specific logic is in nexthop_active(),
while route-oriented logic is in nexthop_active_check().
Signed-off-by: Mark Stapp <mjs@voltanet.io>
For MH the SVI MAC is advertised to prevent flooding of ARP replies.
But because of a bug the SVI MAC was being added to the zebra database
but not sent to bgpd for advertising.
Ticket: CM-33329
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
As a part of FRR shutdown, interfaces are force-flushed (in an arbitrary
order). Interfaces are already down at that point, i.e. resources like the
SVI MAC have already been released. Attempting to clean it up again
as a part of the force-flush was resulting in access of freed memory -
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
==26457== Thread 1:
==26457== Invalid read of size 8
==26457== at 0x1AE6B0: zebra_evpn_acc_bd_svi_set (zebra_evpn_mh.c:606)
==26457== by 0x1B1460: zebra_evpn_if_cleanup (zebra_evpn_mh.c:1040)
==26457== by 0x13CA69: if_zebra_delete_hook (interface.c:244)
==26457== by 0x48A0E34: hook_call_if_del (if.c:59)
==26457== by 0x48A0E34: if_delete_retain (if.c:290)
==26457== by 0x48A2F94: if_delete (if.c:313)
==26457== by 0x48A3169: if_terminate (if.c:1217)
==26457== by 0x48E0024: vrf_delete (vrf.c:254)
==26457== by 0x48E0024: vrf_delete (vrf.c:225)
==26457== by 0x48E02FE: vrf_terminate (vrf.c:551)
==26457== by 0x1442E1: sigint (main.c:203)
==26457== by 0x1442E1: sigint (main.c:141)
==26457== by 0x48CF862: quagga_sigevent_process (sigevent.c:103)
==26457== by 0x48DD324: thread_fetch (thread.c:1404)
==26457== by 0x48A926A: frr_run (libfrr.c:1122)
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
(gdb) bt
(gdb) fr 5
1037 zebra/zebra_evpn_mh.c: No such file or directory.
(gdb) p zif->ifp->name
$2 = "vlan131", '\000' <repeats 12 times>
(gdb) p zif->link->info
$5 = (void *) 0x1
(gdb) p/x zif->ifp->flags
$7 = 0x1002
(gdb)
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Ticket: CM-32435
Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>
A zebra crash is seen while cleaning up an EVPN interface
during a shutdown event.
EVPN interface cleanup is called from the vrf_delete callback.
(gdb) frame 4
(is_up=false, br_zif=0x0, vlan_zif=0x557f31fb36f0) at zebra/zebra_evpn_mh.c:614
614 zebra/zebra_evpn_mh.c: No such file or directory.
(gdb) p tmp_br_zif
$1 = (struct zebra_if *) 0x0
(gdb) p vlan_zif->link
$2 = (struct interface *) 0x557f31fb2d40
(gdb) p vlan_zif->link->info
$3 = (void *) 0x0
(gdb) p zebra_if->ifp->name
No symbol "zebra_if" in current context.
(gdb) p vlan_zif->ifp->name
$4 = "peerlink-3.4094\000\000\000\000"
Ticket:CM-32435
Reviewed By:CCR-10957
Testing Done:
Signed-off-by: Chirag Shah <chirag@nvidia.com>
Added support for advertising SVI MAC if EVPN-MH is enabled.
In the case of EVPN MH, ARP replies from an attached server can be sent to
the ES peer. To prevent flooding of the reply, the SVI MAC needs to be
advertised by default.
Note:
advertise-svi-ip could have been used as an alternate way to advertise
SVI MAC. However that config cannot be turned on if SVI IPs are
re-used (which is done to avoid wasting IP addresses in a subnet).
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
SVI IP is being advertised unconditionally i.e. even if disabled (and
that is the default config). This can be problematic when the SVI address
is re-used across racks.
Added the user config condition in all the relevant places where the
SVI advertisement is triggered.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When looking up the conversion from kernel protocol to
internal protocol family, make sure we use the correct
AF_INET (what the kernel uses) instead of AFI_IP (which
is what FRR uses).
Otherwise routes from OSPF will show up from the kernel as OSPF6 instead
of OSPF, which will cause mayhem.
Ticket: CM-33306
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
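A hedged illustration of the numeric mixup, assuming Linux AF_* values and FRR's AFI_* enum (the real conversion lives in zebra's netlink code; the function and enum names below are placeholders):
```
#include <sys/socket.h>		/* AF_INET = 2, AF_INET6 = 10 on Linux */

/* FRR-internal identifiers: AFI_IP = 1, AFI_IP6 = 2.  Note that AFI_IP6
 * happens to equal AF_INET numerically, which is how IPv4 OSPF routes can
 * be mistaken for OSPF6 when the wrong family constant is compared. */
enum { ROUTE_OSPF_SKETCH, ROUTE_OSPF6_SKETCH };	/* stand-ins for ZEBRA_ROUTE_* */

static int kernel_ospf_type_sketch(int kernel_family)
{
	/* Correct: compare against the kernel's AF_* values...           */
	if (kernel_family == AF_INET6)
		return ROUTE_OSPF6_SKETCH;
	/* ...comparing against AFI_IP6 (== 2 == AF_INET) here would have
	 * tagged every IPv4 OSPF route as OSPF6, as described above.     */
	return ROUTE_OSPF_SKETCH;
}
```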
Neither tabs nor newlines are acceptable in syslog messages. They also
break line-based parsing of file logs.
Signed-off-by: David Lamparter <equinox@diac24.net>
The old VXLAN function for local MAC deletion was still in
existence and being called from the VXLAN code, whilst the new
generic function was not being called at all. Resolve this so
the generic function matches the old function and is called
exclusively.
Signed-off-by: Pat Ruddy <pat@voltanet.io>
Move the pbr hash creation to be after the update release
and dplane install. Now that rules are installed in a separate
dplane pthread, we can have scenarios where an interface is
flapping and we install/remove rules fast enough that we
could issue what we think is an update for an identical rule and
end up releasing the rule right after we created it and sent it to
the dplane. This solves the problem of receiving duplicate rules
during interface flapping.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Disallow resolution to nexthops that are marked duplicate.
When we are resolving to an ecmp group, it's possible this
group has duplicates.
I found this when I hit a bug where we can have groups resolving
to each other and cause the resolved->next->next pointer to increase
exponentially. With sufficiently large ecmp, zebra will grind to a halt.
Like so:
```
D> 4.4.4.14/32 [150/0] via 1.1.1.1 (recursive), weight 1, 00:00:02
* via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02
via 4.4.4.1 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.2 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.3 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.4 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.5 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.6 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.7 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.8 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.9 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.10 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.11 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.12 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.13 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.15 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 4.4.4.16 (recursive), weight 1, 00:00:02
via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
via 1.1.1.1, dummy1, weight 1, 00:00:02
D> 4.4.4.15/32 [150/0] via 1.1.1.1 (recursive), weight 1, 00:00:09
* via 1.1.1.1, dummy1 onlink, weight 1, 00:00:09
via 4.4.4.1 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.2 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.3 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.4 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.5 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.6 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.7 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.8 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.9 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.10 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.11 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.12 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.13 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.14 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 4.4.4.16 (recursive), weight 1, 00:00:09
via 1.1.1.1, dummy1 onlink, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
via 1.1.1.1, dummy1, weight 1, 00:00:09
D> 4.4.4.16/32 [150/0] via 1.1.1.1 (recursive), weight 1, 00:00:19
* via 1.1.1.1, dummy1 onlink, weight 1, 00:00:19
via 4.4.4.1 (recursive), weight 1, 00:00:19
via 1.1.1.1, dummy1, weight 1, 00:00:19
via 4.4.4.2 (recursive), weight 1, 00:00:19
...............
................
and on...
```
You can repro the above via:
```
kernel routes:
1.1.1.1 dev dummy1 scope link
4.4.4.0/24 via 1.1.1.1 dev dummy1
==============================
config:
nexthop-group doof
nexthop 1.1.1.1
nexthop 4.4.4.1
nexthop 4.4.4.10
nexthop 4.4.4.11
nexthop 4.4.4.12
nexthop 4.4.4.13
nexthop 4.4.4.14
nexthop 4.4.4.15
nexthop 4.4.4.16
nexthop 4.4.4.2
nexthop 4.4.4.3
nexthop 4.4.4.4
nexthop 4.4.4.5
nexthop 4.4.4.6
nexthop 4.4.4.7
nexthop 4.4.4.8
nexthop 4.4.4.9
!
===========================
Then use sharpd to install 4.4.4.16 -> 4.4.4.1 pointing to that nexthop
group in descending order.
```
With these changes, the growing ecmp above is prevented by disallowing
duplicates in the resolution decision. These nexthops are not
installed anyway, so why should we be resolving to them?
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Description: When we get a new vrf add and a vrf with the same name, but a
different vrf-id, already exists in the database, we should treat the vrf
add as an update.
This happens mostly when there is a lot of vrf and other configuration being
replayed. There may be a stale vrf delete followed by a new vrf add. This
can cause a timing race condition where the vrf delete could be missed and
the subsequently arriving add of the same vrf would get rejected, instead of
treating the last arrived vrf add as an update.
Treat a vrf add for an existing vrf as an update:
Implicitly disable this VRF to clean up routes and other functions as part of vrf disable.
Update vrf_id for the vrf and update the vrf_id tree.
Re-enable the VRF so that all routes are freshly installed.
The above 3 steps are mandatory, since with a config reload it can happen that
stale routes installed in the vrf-1 table contain routes from the older
vrf-0 table, which might have been deleted due to vrf-0 missing in the new
configuration.
Signed-off-by: sudhanshukumar22 <sudhanshu.kumar@broadcom.com>
valgrind is reporting:
2448137-==2448137== Thread 5 zebra_apic:
2448137-==2448137== Syscall param writev(vector[...]) points to uninitialised byte(s)
2448137:==2448137== at 0x4D6FDDD: __writev (writev.c:26)
2448137-==2448137== by 0x4D6FDDD: writev (writev.c:24)
2448137-==2448137== by 0x48A35F5: buffer_flush_available (buffer.c:431)
2448137-==2448137== by 0x48A3504: buffer_flush_all (buffer.c:237)
2448137-==2448137== by 0x495948: zserv_write (zserv.c:263)
2448137-==2448137== by 0x4904B7E: thread_call (thread.c:1681)
2448137-==2448137== by 0x48BD3E5: fpt_run (frr_pthread.c:308)
2448137-==2448137== by 0x4C61EA6: start_thread (pthread_create.c:477)
2448137-==2448137== by 0x4D78DEE: clone (clone.S:95)
2448137-==2448137== Address 0x720c3ce is 62 bytes inside a block of size 4,120 alloc'd
2448137:==2448137== at 0x483877F: malloc (vg_replace_malloc.c:307)
2448137-==2448137== by 0x48D2977: qmalloc (memory.c:110)
2448137-==2448137== by 0x48A30E3: buffer_add (buffer.c:135)
2448137-==2448137== by 0x48A30E3: buffer_put (buffer.c:161)
2448137-==2448137== by 0x49591B: zserv_write (zserv.c:256)
2448137-==2448137== by 0x4904B7E: thread_call (thread.c:1681)
2448137-==2448137== by 0x48BD3E5: fpt_run (frr_pthread.c:308)
2448137-==2448137== by 0x4C61EA6: start_thread (pthread_create.c:477)
2448137-==2448137== by 0x4D78DEE: clone (clone.S:95)
2448137-==2448137== Uninitialised value was created by a stack allocation
2448137:==2448137== at 0x43E490: zserv_encode_vrf (zapi_msg.c:103)
Effectively we are sending `struct vrf_data` without ensuring
data has been properly initialized.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Send the results of daemons' nhg updates asynchronously,
after the update has actually completed. Capture additional
info about the source daemon in order to locate the correct
zapi session. Simplify the result types considered by the
zebra_nhg module.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
The raw zapi apis to encode and decode NHGs don't need to be
public; also add a little more validity-checking.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
When calling fpm_nl_enqueue we should expect a fit-or-not
return value on the outgoing stream. It is not necessary
to check it here because the while loop where we are checking this
has already ensured that the data being written will fit.
CID -> 1499854
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Setting `zebra route-map delay-timer 0` completely turns off any
route-map processing in zebra, which is completely wrong. A timer
of 0 means `do it now`.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
If we are running with a delayed timer to handle route-map changes
in zebra and another route-map change is made via the CLI, push
out the timer instead of leaving it unmodified. This will
allow a large set of route-maps to be read in by
the system without ending up in a state where new route-map
changes are being read in and the timer pops in
the middle of it.
Additionally, convert to use THREAD_OFF, preventing a possible
use-after-free as well as aligning the thread API usage
with what we consider correct.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
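A minimal sketch of the "push out the timer" pattern, assuming FRR's thread API of that era (the timer variable and callback names are illustrative):
```
#include "lib/thread.h"		/* struct thread, THREAD_OFF, thread_add_timer */

static struct thread *t_rmap_update;		/* hypothetical pending timer */
static int rmap_update_cb(struct thread *t);	/* does the actual rerun */

static void rmap_config_changed(struct thread_master *master, long delay)
{
	/* Cancel any pending pop first (also avoids a use-after-free on a
	 * stale pointer), then re-arm so the timer fires `delay` seconds
	 * after the *latest* route-map change. */
	THREAD_OFF(t_rmap_update);
	thread_add_timer(master, rmap_update_cb, NULL, delay, &t_rmap_update);
}
```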
Currently, when a route-map changes, the code schedules a rerun of all
routes in the particular table. So if you modify route-map `FOO` of
`ip protocol XX route-map FOO`, all routes will be rechecked. This is
extremely expensive.
Modify zebra to only update the routes associated with the route-map. So
if we have 800k bgp routes and 50 ospf routes and we are route-mapping
the ospf routes, we'll only look at 50 routes.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When we need to cause a reprocessing of data, the code currently
marks all routes as needing to be looked at. Modify the
rib_update_table code to allow us to specify a specific route
type we want to reprocess. At this point none
of the code behaves differently; this is just setup
for a future code change.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
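A rough sketch of what such a type-filtered table walk could look like, using zebra's usual route-table iteration helpers (the actual signature added by this change may differ):
```
#include "lib/table.h"		/* route_top/route_next */
#include "zebra/rib.h"		/* struct route_entry, RNODE_FOREACH_RE, rib_queue_add */

/* Hypothetical variant: only requeue entries of the given route type;
 * ZEBRA_ROUTE_ALL keeps the old "touch everything" behaviour. */
static void rib_update_table_by_type_sketch(struct route_table *table, int rtype)
{
	struct route_node *rn;
	struct route_entry *re;

	for (rn = route_top(table); rn; rn = route_next(rn)) {
		RNODE_FOREACH_RE (rn, re) {
			if (rtype != ZEBRA_ROUTE_ALL && re->type != rtype)
				continue;	/* skip unrelated protocols */
			rib_queue_add(rn);	/* schedule this node for rework */
			break;
		}
	}
}
```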
Use nl_pid from the netlink socket used for programming the kernel
(netlink_dplane) in netlink route messages sent by the 'fpm' module.
This makes 'fpm' consistent with 'dplane_fpm_nl' which already
behaves this way, and allows FPM server implementations to determine
route origin via nlmsg_pid.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
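A minimal sketch of the idea, assuming a hypothetical helper (the real change plumbs the dplane socket's pid through the fpm module's netlink encoding):
```
#include <stdint.h>
#include <linux/netlink.h>	/* struct nlmsghdr */

/* Hypothetical helper: stamp an outgoing FPM netlink message with the pid
 * of zebra's kernel-programming (dplane) socket so an FPM server can
 * attribute the route's origin via nlmsg_pid. */
static void fpm_stamp_origin_sketch(struct nlmsghdr *nlh, uint32_t dplane_nl_pid)
{
	nlh->nlmsg_pid = dplane_nl_pid;
}
```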
Create a function that can dump the mac->flags in human readable
output and convert all debugs to use it.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
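A hedged sketch of such a helper; the flag bit names and values below are placeholders, not zebra's real MAC flags:
```
#include <stdint.h>
#include <stdio.h>

/* Placeholder flag bits, for illustration only. */
#define MACF_LOCAL_SKETCH	0x01
#define MACF_REMOTE_SKETCH	0x02
#define MACF_STICKY_SKETCH	0x04

/* Render the flags word as text once, then reuse it in every debug. */
static const char *mac_flags_str_sketch(uint32_t flags, char *buf, size_t len)
{
	snprintf(buf, len, "%s%s%s",
		 (flags & MACF_LOCAL_SKETCH) ? "LOC " : "",
		 (flags & MACF_REMOTE_SKETCH) ? "REM " : "",
		 (flags & MACF_STICKY_SKETCH) ? "STICKY " : "");
	return buf;
}
```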
The re->flags and re->status in debugs were being dumped as hex values.
I can never quickly decode this. Here is an idea. Let's let FRR do
it for me.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
In the case where a route's nexthops cannot be resolved as part
of route processing, immediately notify the upper-level protocol
that their routes failed to install, if they are interested in
being informed about this issue.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The zebra route-map delay timer value is a global value,
not a per-vrf setting. As such we should only print it
out one time.
We are seeing this:
zebra route-map delay-timer 33
exit-vrf
zebra route-map delay-timer 33
When we have 2 VRFs configured.
Fix the code to only write it out for the default VRF.
Ticket: CM-32888
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
When checking if there is a "hole" behind the current reservation
marker, the calculation of whether the hole is big enough to satisfy
the requested chunk is off by 1. This could result in returning a label
which has already been allocated.
Signed-off-by: Pat Ruddy <pat@voltanet.io>
If the requested chunk size was less than 16 then a chunk
within the reserved block would be returned. Make sure that
we never return labels that are below MPLS_LABEL_UNRESERVED_MIN.
Signed-off-by: Pat Ruddy <pat@voltanet.io>
When dplane_fpm_nl is used the "Please add this protocol(n) to proper
rt_netlink.c handling" debug message is emitted for any route of type
kernel or connected.
This severely reduces performance of dplane_fpm_nl when large numbers
of these routes are present in the RIB.
The messages are not observed when using the original fpm module since
this uses a custom function, netlink_proto_from_route_type().
zebra2proto() now returns RTPROT_KERNEL for ZEBRA_ROUTE_CONNECT and
ZEBRA_ROUTE_KERNEL. This should only impact dplane_fpm_nl's use of
the common netlink routines, since these routes are generally ignored
via the RSYSTEM_ROUTE() check.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
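A hedged sketch of the mapping change; only the relevant cases are shown and the real zebra2proto() in rt_netlink.c covers every route type:
```
#include <linux/rtnetlink.h>	/* RTPROT_KERNEL, RTPROT_ZEBRA */
#include "lib/route_types.h"	/* ZEBRA_ROUTE_CONNECT, ZEBRA_ROUTE_KERNEL */

static int zebra2proto_sketch(int proto)
{
	switch (proto) {
	case ZEBRA_ROUTE_CONNECT:
	case ZEBRA_ROUTE_KERNEL:
		return RTPROT_KERNEL;	/* instead of the noisy default case */
	default:
		return RTPROT_ZEBRA;
	}
}
```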
fpm_nl_process() now ensures that the dataplane thread is rescheduled
if it hits the work limit while processing its incoming work queue.
This would probably already occur due to some other event, such as
fpm_process_queue() enqueuing completed work to the output queue,
however it does no harm to add this explicit reschedule.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
If the dataplane thread hits the work limit while processing the
output queue for any given provider, we now explicitly reschedule
the thread.
Otherwise, if the number of items in the output queue is greater than
the work limit, draining of that output queue is dependent on new
dataplane work.
Routes which are not drained from the output queue are stuck with
the 'q' flag, so this is a similar issue to that observed in
164d8e8608.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
zebra maintains a pseudo interface to hang user config off of after
the interface is deleted in the kernel. If a user tried to configure
an ES against such an interface, zebra would crash with the following
call stack -
at zebra/zebra_evpn_mh.c:2095
sysmac=sysmac@entry=0x55cfbadd3160) at zebra/zebra_evpn_mh.c:2258
at zebra/zebra_evpn_mh.c:3222
argv=<optimized out>, es_lid_str=<optimized out>, es_lid=1, no=0x0, vty=0x55cfbaf4c7b0)
at zebra/zebra_evpn_mh.c:3222
argv=<optimized out>) at ./zebra/zebra_evpn_mh_clippy.c:202
vty=vty@entry=0x55cfbaf4c7b0, cmd=cmd@entry=0x0, filter=FILTER_RELAXED)
at lib/command.c:1073
Ticket: CM-31702
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
If a local-MAC or local-neigh is not active locally it is not sent to BGP.
At this point, if BGP receives a remote route it accepts it and installs it
in zebra. Zebra was rejecting BGP's update if it had a higher-seq local
(inactive) entry. This would result in bgp and zebra falling out of sync.
In some cases zebra would delete the local-inactive entries after some time
(as a part of the dplane/kernel garbage collection). This would leave zebra
with missing remote entries (which were still present in bgpd).
This change allows lower-seq BGP updates to overwrite zebra's local entry if
that entry happens to be local-inactive.
Note: This logic was already in use for sync-mac-ip updates. Extended the
same logic to remote-mac-ip updates.
Ticket: CM-31626
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When a VNI was deleted as a part of FRR/zebra shutdown, the zevpn entry
was being freed without removing its reference in the access-vlan
entry (i.e. without clearing the VLAN->VNI mapping) used by MH.
Ticket: CM-31197
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
If a netlink/dp notification is rxed for a neigh without the peer-sync
flag FRR re-installs the entry with the right flags. This change is
needed to handle cases where the dataplane and FRR may fall out of
sync because of neigh learning on the network ports (i.e. via
the VxLAN).
Ticket: CM-30693
The problem was found during VM mobility "torture" tests where 100s
of extended VM moves were done.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
If a remote MAC update is received from BGP with a lower sequence number
than the local one, zebra ignores the MAC update. This typically happens if
there is a race condition (where updates are in flight from zebra to BGP).
There was a bug in zebra because of which the dest ES was being updated
before this check. This left the local MAC pointing to a remote ES.
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Relevant Dumps:
===============
root@leaf21:mgmt:~# net show evpn mac vni 101101 mac 00:93:00:00:00:01
MAC: 00:93:00:00:00:01
ESI: 03:00:00:00:77:01:03:00:00:0d
Intf: - VLAN: 101
Sync-info: neigh#: 1 peer-proxy
Local Seq: 3 Remote Seq: 0
Neighbors:
21.1.13.1 Active
root@leaf21:mgmt:~# net sho evpn es
Type: L local, R remote, N non-DF
ESI Type ES-IF VTEPs
03:00:00:00:77:01:02:00:00:0c R - 6.0.0.10,6.0.0.11
03:00:00:00:77:01:03:00:00:0d R - 6.0.0.10,6.0.0.11,6.0.0.12
03:00:00:00:77:01:04:00:00:0e R - 6.0.0.10,6.0.0.11,6.0.0.12,6.0.0.13
03:00:00:00:77:02:02:00:00:16 LR bondP2-H2 6.0.0.15
03:00:00:00:77:02:03:00:00:17 LR bondP2-H3 6.0.0.15,6.0.0.16
03:00:00:00:77:02:04:00:00:18 LR bondP2-H4 6.0.0.15,6.0.0.16,6.0.0.17
root@leaf21:mgmt:~#
Relevant logs:
===============
2020/07/29 15:41:27.110846 ZEBRA: Recv MACIP ADD VNI 101101 MAC 00:93:00:00:00:01 IP 21.1.13.1 flags 0x0 seq 2 VTEP 0.0.0.0 ESI 03:00:00:00:77:01:03:00:00:0d from bgp
2020/07/29 15:41:27.110867 ZEBRA: Ignore remote MACIP ADD VNI 101101 MAC 00:93:00:00:00:01 IP 21.1.13.1 as existing MAC has higher seq 3 flags 0x401
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Ticket: CM-30273
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
With EVPN-MH, Type-2 routes are also used for MAC-IP syncing between
ES peers so a change was done to only treat REACHABLE local neigh
entries as local-active and advertise them as Type-2 routes i.e. STALE
neigh entries are no longer advertised as Type-2s.
This however exposed some unexpected problems with MLAG where a
secondary reboot followed by a primary reboot left a lot of neighs
in STALE state (on the primary) resulting in them not being
advertised. And remote routed traffic to those hosts being
blackholed in a sym-IRB setup.
This commit is a workaround to fix the regression (it doesn't fix
the underlying problems with entries not becoming REACHABLE, which
may be a day-1 problem). The workaround is to continue advertising
STALE neighbors if EVPN-MH is not enabled.
Ticket: CM-30303
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
zebra was crashing when the command was run on a non-existent VNI.
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
root@torm-12:mgmt:~# net show evpn es-evi vni 16777215
VNI 16777215 doesn't exist
root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 detail
VNI 16777215 doesn't exist
root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 json
[
]
root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 detail json
[
]
root@torm-12:mgmt:~#
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Ticket: CM-30232
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
In rib_handle_nhg_replace, do not use `new` as a parameter name, to
allow compilation of C++ code that includes zebra headers.
Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>
The way a couple of clauses were placed in a loop meant that
some info might not be collected - re-order things just a bit.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Derive the rule family from src if available, otherwise from
dst if available, otherwise assume IPv4. We only support
IPv4/IPv6 currently, so if we can't tell from the src/dst
it must be IPv4 and likely a dsfield match.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
Maintain the count of contexts which have been processed in a local
variable, and perform a single atomic update after we have consumed
all queued contexts.
Generally this results in at least one less atomic operation per
context.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
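A small generic sketch of the pattern, not the actual fpm code:
```
#include <stdatomic.h>
#include <stdint.h>

static _Atomic uint64_t ctx_processed;	/* shared counter read elsewhere */

static void process_batch_sketch(int queued)
{
	uint64_t done = 0;

	while (queued-- > 0) {
		/* ... handle one dplane context ... */
		done++;			/* cheap thread-local increment */
	}
	/* One atomic publish for the whole batch instead of one per context. */
	atomic_fetch_add_explicit(&ctx_processed, done, memory_order_relaxed);
}
```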
Don't use an atomic operation to determine whether fpm_process_queue()
needs to be re-scheduled. Instead we can simply use a local variable
to determine if we stopped processing because we ran out of buffers.
In the case where we would have re-scheduled due to new context objects
in the queue (enqueued after we stopped processing), fpm_nl_process()
will schedule us (or will have done already).
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
Maintain the peak ctxqueue length in a local variable, and perform
a single atomic update after processing all contexts.
Generally this results in at least one less atomic operation per
context.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
Reduce code in the critical sections of fpm_nl_process() and
fpm_process_queue() to the bare minimum - basically only enqueue
and dequeue operations on the shared ctxqueue.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
We don't need to use the 'force' flag when processing the
resolve-via-default clis for ip and ipv6: we can just do normal
nht processing.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
After removal of L3VNI config, the VNI should become an L2VNI if a VxLAN
interface is present for the VNI. This case is not handled in the code.
Changes:
1. After unconfiguring L3VNI, create an L2VNI if VxLAN interface is present
for the VNI.
2. Trigger an update to BGP.
3. Read MAC and ARP entries from kernel.
This PR fixes the issue only for route type-2, 3 and 5. This PR does not address
states regarding route type-1, 4 and multicast group for VxLAN interface.
Signed-off-by: Ameya Dharkar <adharkar@vmware.com>
When a new ES is created it is held in a non-DF state for 3 seconds,
as specified by RFC 7432. This gives the switch time to import
the Type-4 routes from the peers, and the peers time to receive the new
Type-4 route.
root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"|grep DF
DF status: non-df
DF delay: 00:00:01
DF preference: 50000
root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"|grep DF
DF status: df
DF preference: 50000
root@torm-11:mgmt:~#
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When all the uplinks go down the VTEP is disconnected from the
VxLAN overlay and this was handled by proto-downing the ES bonds. When
the uplinks come up again we need to re-enable the ES bonds but that
needs to be done after a delay to allow the EVPN network to converge.
And that is done by firing off the startup-delay timer on first
uplink-up.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
1. When a bond is associated with an ES we may need to re-sync
the dplane protodown state (which may be stale or set by some other
app).
2. Also change the uplink state display to avoid confusion with
protodown reason code (both used to show uplink-up).
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
protodown state is a combination of the dplane and zebra states.
protodown reason is maintained exclusively by zebra. Display this
information on two separate lines to make that ownership clearer.
Also display n/a for bonds as the dplane doesn't support protodowning
the bond device.
Sample output -
==============
root@torm-11:mgmt:~# vtysh -c "show interface hostbond1"|grep -i protodown
protodown: off (n/a)
protodown reasons: (uplinks-down)
root@torm-11:mgmt:~# vtysh -c "show interface swp5"|grep -i protodown
protodown: on
protodown reasons: (uplinks-down)
root@torm-11:mgmt:~#
PS: Cosmetic changes only, no functional change.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
The code for this was already there but was not kicking in because of a
zebra local reason-code dup check. Even if the reason-code is the same,
if the dplane and zebra disagree about the protodown state zebra will
need to re-program the dplane.
Fixed a couple of spelling errors in the protodown logs to make greps
easy.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
Use the new nested NDA_FDB_EXT_ATTRS attribute to control per-fdb
notifications.
PS: The attributes were updated as a part of the kernel upstreaming,
hence the change.
Signed-off-by: Nikolay Aleksandrov <nikolay@cumulusnetworks.com>
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
New work enqueued to the dplane_fpm_nl provider is initially de-queued
and re-enqueued, in fpm_nl_process(), to be processed by the provider's
own thread.
After performing this initial de-queue/enqueue we return to
dplane_thread_loop() and check the dplane_fpm_nl output queue for any
work which has been completed.
Since this work is being processed in another thread it is very likely
that there will be some (or all) work still outstanding at this point.
The dataplane thread finishes up any other tasks and then waits until
it is next scheduled. In the meantime the dplane_fpm_nl thread is
processing its work queue until completion.
The issue arises here as the dataplane thread is not explicitly
re-scheduled once dplane_fpm_nl has drained its work queue and
populated its output queue with completed work.
This completed work can sit in the output queue for an indeterminate
period of time, depending upon when the dataplane thread is next
scheduled for other work. If the RIB has reached a stable state then
this could be a significant period of time. During this period zebra
marks these routes as queued, even though they have actually been
processed by all dataplane providers.
An un-related RIB change which triggers a FIB update will result in
the dataplane thread being scheduled and this completed work then
being processed. At this point the routes will then no longer be
marked as queued by zebra. However this new FIB update might itself
then fall victim to the same scenario!
We can observe the above behaviour in these detailed dplane logs.
11:24:47 zebra[7282]: dplane: incoming new work counter: 2
11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel'
11:24:47 zebra[7282]: dplane provider 'Kernel': processing
11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9
11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9
11:24:47 zebra[7282]: dplane dequeues 2 completed work from provider Kernel
11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl'
11:24:47 zebra[7282]: dplane dequeues 1 completed work from provider dplane_fpm_nl
11:24:47 zebra[7282]: dplane has 1 completed, 0 errors, for zebra main
2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good.
1 completed context was de-queued, so there is outstanding work.
11:24:58 zebra[7282]: dplane: incoming new work counter: 2
11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel'
11:24:58 zebra[7282]: dplane provider 'Kernel': processing
11:24:58 zebra[7282]: ID (193) Dplane nexthop update ctx 0x55c429b6fed0 op NH_INSTALL
11:24:58 zebra[7282]: 0:5.5.5.5/32 Dplane route update ctx 0x55c429b79690 op ROUTE_INSTALL
11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider Kernel
11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl'
11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl
11:24:58 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main
A further 2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good.
2 completed contexts were de-queued, which sounds good as that is what we en-queued.
However, there is an outstanding context from earlier, so there is still outstanding
work.
Indeed the new 5.5.5.5/32 route is marked as queued:
O>q 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:01:19
This remains the case until we trigger a FIB update by installation of the
(eg.) 10.10.10.10/32 route:
11:26:41 zebra[7282]: dplane: incoming new work counter: 2
11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel'
11:26:41 zebra[7282]: dplane provider 'Kernel': processing
11:26:41 zebra[7282]: ID (195) Dplane nexthop update ctx 0x55c429b78ce0 op NH_INSTALL
11:26:41 zebra[7282]: 0:10.10.10.10/32 Dplane route update ctx 0x55c429b7a040 op ROUTE_INSTALL
11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider Kernel
11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl'
11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl
11:26:41 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main
11:26:41 zebra[7282]: zebra2proto: Please add this protocol(2) to proper rt_netlink.c handling
11:26:41 zebra[7282]: Nexthop dplane ctx 0x55c429b6fed0, op NH_INSTALL, nexthop ID (193), result SUCCESS
11:26:41 zebra[7282]: default(0:254):5.5.5.5/32 Processing dplane result ctx 0x55c429b79690, op ROUTE_INSTALL result SUCCESS
We observe the same 2 enqueues and 2 dequeues as before, which again suggests
that there is outstanding work.
As expected, the 5.5.5.5/32 route is no longer marked as queued:
O>* 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:02:06
But the 10.10.10.10/32 route is, as we have not yet processed the completed
context:
C>q 10.10.10.10/32 is directly connected, lo, 00:26:05
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
Returns the current number of (completed) contexts in the provider's
output queue (dp_ctx_out_q), allowing access to this data from the
provider itself.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
The following functions, which are part of the label-manager
implementation, aren't called from outside of their file. And all of the
label-manager code lives in zebra/label_manager.c at this time. So these
functions should be unexposed.
Functions:
- create_label_chunk
- assign_label_chunk
- delete_label_chunk
- release_label_chunk
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
In the case where the namespace pointer is already available, feed it at
vrf creation. This prevents a crash if the netlink parsing has already
begun and vrf-lite is not enabled yet.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
The following functions use writen to dispatch a message
into the socket, while other functions use zserv_send_message.
This commit does a tiny unification of zapi's socket messaging.
Funcs:
- zsend_assign_label_chunk_response()
- zsend_label_manager_connect_response()
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
The `show ip nht` and `show ipv6 nht` commands were broken.
This is because the recent commit 0154d8ce45
assumed that p must not be NULL, and this is not the case.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add a bit of code to allow bgp to send the AS-Path associated with
the route being installed to zebra so it can be displayed and
used as part of the `show ip route A` command in zebra.
eva# show ip route 20.0.0.0/11
Routing entry for 20.0.0.0/11
Known via "bgp", distance 20, metric 0, best
Last update 00:00:00 ago
* 192.168.161.1, via enp39s0, weight 1
AS-Path: 60000 64539 15096 6939 8075
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Just gather the opaque data into the route entry. Later
commits will display this data for end users as well as
send it down.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add the current queue depths for each plugin to the
'show dplane providers' output. Maintain the out-bound queue
max counter properly, that was being ignored.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Zebra accumulates route-entry objects and then processes them
as a group. If that rib processing is delayed, because the
dataplane/fib programming has built up a queue e.g., zebra can
hold multiple deleted route objects in memory. At scale, this can
be a problem. Delete unneeded route entries promptly, if they
can't contribute to rib processing.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
Don't attempt to walk data structures while not connected so we can
save some CPU usage when FPM server is offline.
Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>
Instead of checking for next group reset, always do it and skip sending
if next hop group support is disabled.
Also remove unused `*_complete` variables.
Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>
A MAC entry cannot be deleted while a neigh is referencing it. It seems
there is some race condition where this may be happening. The log is
to help identify those cases.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
It is now 4 bits of type and 28 bits of value -
1. type=0 is for L3 NHG
2. type=1 is for L2 NH
3. type=2 is for L2 NHG
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
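A hedged illustration of that split; the macro names are made up, but the arithmetic matches the fdb nexthop IDs shown in the next commit (e.g. 268435461 = 0x10000005 decodes as type 1, value 5):
```
#include <stdint.h>

/* Hypothetical type values for the top 4 bits. */
#define NHG_ID_TYPE_L3		0u	/* L3 NHG */
#define NHG_ID_TYPE_L2_NH	1u	/* L2 NH  */
#define NHG_ID_TYPE_L2_NHG	2u	/* L2 NHG */

#define NHG_ID_MAKE(type, val)	(((uint32_t)(type) << 28) | ((val) & 0x0fffffffu))
#define NHG_ID_TYPE(id)		((uint32_t)(id) >> 28)
#define NHG_ID_VAL(id)		((uint32_t)(id) & 0x0fffffffu)
```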
This is an optimization to reduce the number of L2 nexthops. An
L2 or fdb nexthop simply provides the dataplane with a nexthop IP -
torm-12:mgmt:~# ip nexthop
id 268435461 via 27.0.0.20 scope link fdb
id 268435463 via 27.0.0.20 scope link fdb
id 268435465 via 27.0.0.20 scope link fdb
So there is no need to allocate a nexthop per-ES/per-VTEP. There
can be 100+ ESs per-VTEP so this change cuts the scale down by a
factor of 100.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When a local ES flaps there are two modes in which the local
MACs are failed over -
1. Fast failover - A backup NHG (ES-peer group) is programmed in the
dataplane per-access port. When a local ES flaps the MAC entries
are left unaltered i.e. pointing to the down access port. And the
dataplane redirects traffic destined to the oper-down access port
via the backup NHG.
2. Slow failover - This mode needs to be turned on for dataplanes
not capable of redirecting traffic. In this mode local MAC entries
on a down local ES are re-programmed to point to the ES-peers'
NHG. And vice-versa i.e. when the ES comes up the MAC entries
are re-programmed with the access port as dest.
Fast failover is on by default. Slow failover can be enabled via the
following config -
evpn mh redirect-off
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
As a part of extended MM handling, a MAC can be updated from local
to remote while being referenced by SYNC neighs (this is really a
temporary/small window). During this window, if the MAC transitions
back to local again we need to reinforce the previous SYNC flags
(based on the sync-neigh count), as subsequent SYNC updates to the
MAC will be de-duped and ignored.
Ticket: CM-29636
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When a local mac is deleted by the dataplane zebra can re-install it
if the MAC is a SYNC MAC (learned from ES peers). The "local_inactive"
bit must be set as a part of the re-install to prevent zebra turning
around and advertising the MAC as locally active.
Also fixed up some debug logs in the slow-fail path to include the VNI.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
NHG and DST (VTEP-IP) are mutually exclusive attributes. If DST is
present the kernel ignores NHG.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
NHG is activated i.e. programmed in the dataplane only if there
are active-VTEPs associated with it. When a NHG is de-activated
all the remote-mac entries associated with it need to be removed
before the NHG is removed.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
The lookup for non-default VRFs was always using a tableId; if not
provided, we were defaulting to RT_TABLE_MAIN. This is fine for the
default VRF but not for others. As a result, the command was silently
failing for non-default VRFs unless we also specified the correct tableId.
Fix this by only performing the lookup using the tableId if it is
provided; else use zebra_vrf_table.
Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>
A couple of NHG messages we were logging as errors are a bit spammy
in use cases where you routinely add/remove interfaces (VM-heavy
deployments). It's not really an error a user cares about and is
more for a developer to know what went wrong after the fact, so
it makes more sense for these to be under a debug rather than
an error, since seeing them does not implicitly mean an error during
those use cases.
Signed-off-by: Stephen Worley <sworley@nvidia.com>
During times of network trauma and when we are at large network scale
the process_remote_macip_add function can issue a zlog_warn for
a common occurrence. Modify the code to be a debug statement.
This behavior is now the same as the process_remote_macip_del function.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add an api that allows a caller in the zebra main pthread to
process the queue of pending dplane updates. The caller supplies
a function to call to test each pending context. Selected
contexts are dequeued, and freed without being processed.
Signed-off-by: Mark Stapp <mjs@voltanet.io>
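A rough sketch of what the shape of such an API could be; the names and signature here are assumptions, not the actual zebra_dplane interface:
```
#include <stdbool.h>

struct zebra_dplane_ctx;	/* opaque dplane context */

/* Caller-supplied predicate: return true if this pending update should be
 * dequeued and freed without being sent to the dataplane. */
typedef bool (*dplane_ctx_drop_test_t)(const struct zebra_dplane_ctx *ctx,
				       void *arg);

/* Hypothetical entry point, called from the zebra main pthread; returns
 * how many pending contexts were removed from the queue. */
int dplane_drop_pending_sketch(dplane_ctx_drop_test_t test, void *arg);
```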
There are two fixes in this commit -
1. Prevent implicit deletion of (*,G) entries during (S,G) cleanup.
This is done by creating a dummy reference on all (*,G) entries.
This is needed for a hash-walk based table cleanup.
2. Free up the SG hash table when the VRF is deleted.
Ticket: CM-30151
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
Earlier type-3 ESI was the only format supported for evpn-mh. Updated the
CLI to allow a 10-byte type-0 ESI.
Both type-0 and type-3 ESIs are statically configured; just in two different
ways -
1. type-0 is configured as a complete 10-byte string
2. type-3 is configured as a 6-byte es-sys-mac and a 3-byte
local-discriminator.
Sample config -
!
interface hostbond1
evpn mh es-id 00:44:38:39:ff:ff:01:00:00:01
!
This is a CLI-only change and has no functional impact.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
Add routines to walk the LSP table and generate FPM updates for all
entries. A walk of the LSP table is triggered when (re-)connecting
to an FPM.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
Export netlink_lsp_msg_encoder() and use it to encode and send netlink
messages concerning LSP updates to connected FPMs.
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
Dataplane/kernel prints the NHG and NH ids as decimal. Zebra
was printing it as hex (to display type vs. val). This became a
debugging hassle hence normalizing the format.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
DAD is not supported currently with EVPN-MH so we turn it off internally
when the first ES config is detected.
PS: Note that when all local ESs are deleted DAD will stay off and
will need to be cleared via a daemon restart.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
The function was originally implemented for the zebra data plane FPM plugin,
but other code places could use it.
Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>
The return from sockunion2hostprefix tells us whether the conversion
succeeded or not. There are places in the code where we
always assume that it just `works`; since it can fail,
notice the failure and try to do the right thing.
Please note that failure of this function for most cases
of sockunion2hostprefix is highly unlikely, as the
sockunion was already created and tested elsewhere;
it's just that this function can fail.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
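A small sketch of the defensive pattern being applied; the helper name is illustrative:
```
#include <stdbool.h>
#include "lib/sockunion.h"	/* union sockunion, sockunion2hostprefix */
#include "lib/prefix.h"		/* struct prefix */
#include "lib/log.h"		/* zlog_warn */

/* Check the conversion result instead of assuming it always works. */
static bool su_to_prefix_sketch(const union sockunion *su, struct prefix *p)
{
	if (!sockunion2hostprefix(su, p)) {
		zlog_warn("%s: unable to convert sockunion to prefix",
			  __func__);
		return false;	/* caller bails out instead of using garbage */
	}
	return true;
}
```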
Add a command that allows FRR to know it's being used with
an underlying asic offload, from the linux kernel perspective.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
The linux kernel is getting RTM_F_OFFLOAD_FAILED for kernel routes
that have failed to offload. Write the code
to receive these notifications from the linux kernel
and store that data for display about the routes.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>