mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-06-05 07:52:11 +00:00

Author	SHA1	Message	Date
Donald Sharp	725d9f5945	zebra: No need for a rib_delete before a rib_add In kernel_socket.c, the code is deleting and then adding the route back in on a change operation. This just translates too two re's, one for deletion and one for addition. The deletion will just be ignored. Let's not do the extra deletion. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-17 16:04:50 -04:00
Donald Sharp	07fd1f7e94	zebra: use rib_add_multipath in rt_netlink.c The new route code path was using a combination of both rib_add() and rib_add_multipath() let's clean it up some to use rib_add_multipath() Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-17 16:04:50 -04:00
Donald Sharp	b0385873fa	zebra: Create a zebra_rib_route_entry_new function and use it Abstract the creation of the route_entry and use it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-17 16:04:50 -04:00
Donald Sharp	d7ac4c4d88	zebra: Introduce early route processing on the MetaQ Currently if an operator does this operation: sharpd@eva ~/frr8> sudo ip nexthop add id 5000 via 192.168.119.44 dev enp39s0 ; sudo ip route add 10.0.0.1 nhid 5000 2022/06/30 08:52:40 ZEBRA: [ZHQK5-J9M1R] proto2zebra: Please add this protocol(0) to proper rt_netlink.c handling 2022/06/30 08:52:40 ZEBRA: [PS16P-365FK][EC 4043309076] Zebra failed to find the nexthop hash entry for id=5000 in a route entry sharpd@eva ~/frr8> vtysh -c "show ip route 10.0.0.1" Routing entry for 0.0.0.0/0 Known via "kernel", distance 0, metric 100, best Last update 00:01:58 ago * 192.168.119.1, via enp39s0 The route is dropped by zebra with no warnings. This is not good, but unlikely to happen at this point in time. In order to fix this issue route processing from inputs needs to happen after nexthop group processing from inputs. This was not possible because nexthop groups are placed on the metaQ. As such the above nexthop group creation is placed on the metaQ for processing in META_QUEUE_NHG. Then the route is read in and processed immediately. The nexthop group is not found ( not processed yet!) and the route is dropped in zebra. Modify the code to have early route processing of validity on the MetaQ. This preserves the order of operations. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-17 16:04:50 -04:00
Donald Sharp	53216dff6e	zebra: Convert label processing to Meta-Q Convert label processing that comes from zapi messages into being handled by the meta-Q. This is because early route processing is going to be moved to the meta-Q as well and we will have a chicken and egg problem without moving this code to be processed by the meta-Q. Ordering of messages from ospf as an example: 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:48] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:48] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:48] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:48] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:62] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:43] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:61] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_ROUTE_ADD:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_MPLS_LABELS_REPLACE:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_MPLS_LABELS_REPLACE:0:66] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_MPLS_LABELS_REPLACE:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_MPLS_LABELS_REPLACE:0:47] comes from socket [36] 2022/08/09 08:55:52.740 ZEBRA: [YXG8K-BCYMV] zebra message[ZEBRA_MPLS_LABELS_REPLACE:0:47] comes from socket [36] The ZEBRA_MPLS_LABELS_REPLACE immediately turn around and attempt to replace nexthop labels on routes that were added. If the route add is placed on the metaQ, it will not exist yet and as such the label replace will fail. Modify the zebra code to take the label operations and place them on the metaQ as well. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-17 10:44:33 -04:00
Donald Sharp	c8e1ce6278	Merge pull request #11824 from sigeryang/master zebra: fix deadcodes in tc dplane & netlink	2022-08-17 09:59:37 -04:00
Siger Yang	4c9b85ac7a	zebra: fix ctab calculation typo in tc netlink Signed-off-by: Siger Yang <siger.yang@outlook.com>	2022-08-17 19:10:07 +08:00
Siger Yang	ec195c6603	zebra: use default NS directly in tc dplane Signed-off-by: Siger Yang <siger.yang@outlook.com>	2022-08-17 19:10:01 +08:00
Stephen Worley	17e6ba2828	zebra: add TC handlers in script code Add TC handlers in script code and move non-handled code together. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-08-16 15:56:35 -04:00
Stephen Worley	d30d63f4f7	Merge pull request #11694 from sigeryang/master zebra: add basic traffic control API	2022-08-16 11:21:04 -04:00
Donald Sharp	b545e0f99b	Merge pull request #11813 from anlancs/fix/minor-11 zebra: correct one comment about ethtool ioctl	2022-08-16 09:46:53 -04:00
Russ White	261e70c66a	Merge pull request #11804 from donaldsharp/aug_coverity_update Aug coverity update	2022-08-15 18:10:03 -04:00
Donald Sharp	e1a39919e1	Merge pull request #11771 from anlancs/fix/minor-zebra-mh-comment zebra: correct one comment for evpn-mh	2022-08-15 17:43:15 -04:00
Donald Sharp	50a38e23fd	zebra: Remove unused assignment in zebra_dplane.c Coverity spotted 3 places where `int ret = XXX` was being used and FRR was immediately assigning a different value. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-15 11:20:59 -04:00
Donald Sharp	86c4fdfac8	zebra: Fix crash in shutdown w/ pw thread still running I am seeing the zebra_pw_install_retry timer thread crashing on shutdown The shutdown of the timer is only in an if () { ... } else if Let's just always shut it down. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-11 16:31:28 -04:00
Siger Yang	c8e718ce1f	zebra: add empty placeholders for tc via BSD socket This commit adds unimplemented updating functions for traffic control via BSD socket. Signed-off-by: Siger Yang <siger.yang@outlook.com>	2022-08-11 02:32:50 +08:00
Siger Yang	449a30edf6	zebra: add tc netlink and dplane ops This commit implements necessary netlink encoders for traffic control including QDISC, TCLASS and TFILTER, and adds basic dplane operations. Co-authored-by: Stephen Worley <sworley@nvidia.com> Signed-off-by: Siger Yang <siger.yang@outlook.com>	2022-08-11 02:32:43 +08:00
Donald Sharp	1548fbbc44	zebra: Remove unused return codes in zebra_mpls.c There are some return codes for functions that are not really used. Clean them up. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-10 07:15:31 -04:00
Donald Sharp	a310ebc114	zebra: Combine meta_queue_free and meta_queue_vrf_free functions These functions essentially do the same thing. Combine them for the goodness of mankind. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-10 07:14:43 -04:00
Donald Sharp	57a5524578	zebra: System routes should be processed the same time as kernel For whatever reason. ZEBRA_ROUTE_SYSTEM routes were being processed last. Since a system route is just another kernel route type. Let's just switch it to be processed the same time as kernel routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-10 07:14:43 -04:00
Donald Sharp	014e732d3e	zebra: Let's use enum for META Queue indexes Convert the meta queue values to an enum and use them. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-10 07:14:43 -04:00
Donald Sharp	4bbbd1f6b3	zebra: Explicitly call out the correct queue name There were more than a few places where the NHG meta queue was not being explicitly called out. Let's be consistent and use the same nomenclature as much as possible when talking about metaQ's. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-10 07:14:43 -04:00
anlan_cs	d2fb26ef9f	zebra: correct one comment about ethtool ioctl `get_iflink_speed()` uses ioctl to get speed, not ip address. Additionally adjust format for another one comment line. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-08-09 01:55:31 -04:00
anlan_cs	209813e94d	zebra: correct one comment for evpn-mh These moved mac addresses are actually in active status during moving phase, just correct comment. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-08-09 00:29:16 -04:00
Donald Sharp	db1c2223fd	zebra: Don't install connected routes multiple times into FRR When moving an interface between vrf's we do not need to install the connected routes multiple times. eva# show ip route vrf all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure VRF BLUE: C>* 4.5.6.7/32 is directly connected, dummy7, 00:00:10 VRF default: K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:00:10 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:00:10 eva# exit sharpd@eva ~/f/t/topotests (multiple_connected_installs)> sudo ip link add GREEN type vrf table 11000 sharpd@eva ~/f/t/topotests (multiple_connected_installs)> sudo ip link set GREEN up sharpd@eva ~/f/t/topotests (multiple_connected_installs)> sudo ip link set dummy7 master GREEN sharpd@eva ~/f/t/topotests (multiple_connected_installs)> vtysh Hello, this is FRRouting (version 8.4-dev). Copyright 1996-2005 Kunihiro Ishiguro, et al. eva# show ip route vrf all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure VRF GREEN: C>* 4.5.6.7/32 is directly connected, dummy7, 00:00:05 VRF default: K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:01:03 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:01:03 eva# exit sharpd@eva ~/f/t/topotests (multiple_connected_installs)> sudo ip link set dummy7 nomaster sharpd@eva ~/f/t/topotests (multiple_connected_installs)> sudo vtysh -c "show ip route vrf all" Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure VRF default: K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:03:22 C>* 4.5.6.7/32 is directly connected, dummy7, 00:00:08 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:03:22 sharpd@eva ~/f/t/topotests (multiple_connected_installs)> @ 11 0:-* 5h50m 0.06 24x1.9GHz 31.4G26% 426G70% 2022-08-08 13:49:24 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-08 13:51:24 -04:00
Donald Sharp	39ffa8e8e8	zebra: Add a `mpls enable` interface node command Allow individual interfaces to turn on/off the mpls subsystem for it in linux. sharpd@eva:~/frr9$ sudo sysctl -a \| grep enp39s0 \| grep mpls net.mpls.conf.enp39s0.input = 0 sharpd@eva:~/frr9$ vtysh -c "conf" -c "int enp39s0" -c "mpls enable" sharpd@eva:~/frr9$ sudo sysctl -a \| grep enp39s0 \| grep mpls net.mpls.conf.enp39s0.input = 1 sharpd@eva:~/frr9$ vtysh -c "conf" -c "int enp39s0" -c "no mpls enable" sharpd@eva:~/frr9$ sudo sysctl -a \| grep enp39s0 \| grep mpls net.mpls.conf.enp39s0.input = 0 sharpd@eva:~/frr9$ Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-08 09:15:22 -04:00
Donald Sharp	c87f5c2392	zebra: Notice when an interface is turned on w/ mpls and enable mpls subsystem Currently when FRR starts up it queries the kernel to see if mpls is turned on. If not FRR does not enable zebra's mpls subsection. If at a later time mpls is turned on, let's notice that an interface now is enabled for mpls( thus implying that all the bits and bobs in the kernel are now setup properly ). a) convert mpls_enabled to a bool b) abstract a new function zebra_mpls_turned_on and call it when FRR notices that an interface now has mpls enabled. c) mpls_processq_init cannot fail, so actually notice that and don't have special code to detect a failure. New results: sharpd@eva ~> vtysh -c "show zebra" OS Linux(5.10.0-12-amd64) ECMP Maximum 128 v4 Forwarding On v6 Forwarding On MPLS Off EVPN Off Kernel socket buffer size 90000000 VRF l3mdev Available ASIC offload Unavailable RA Compiled in RFC 5549 BGP is not using Kernel NHG Available v4 All LinkDown Routes Off v4 Default LinkDown Routes Off v6 All LinkDown Routes Off v6 Default LinkDown Routes Off v4 All MC Forwarding On v4 Default MC Forwarding Off v6 All MC Forwarding On v6 Default MC Forwarding Off Route Route Neighbor LSP LSP VRF Installs Removals Updates Installs Removals default 26 7 0 0 0 <turn on mpls_iptunnel and mpls_router modules in the kernel and then do this>: sharpd@eva ~> sudo sysctl -w net.mpls.conf.enp39s0.input=1 [sudo] password for sharpd: net.mpls.conf.enp39s0.input = 1 sharpd@eva ~> vtysh -c "show zebra" OS Linux(5.10.0-12-amd64) ECMP Maximum 128 v4 Forwarding On v6 Forwarding On MPLS On EVPN Off Kernel socket buffer size 90000000 VRF l3mdev Available ASIC offload Unavailable RA Compiled in RFC 5549 BGP is not using Kernel NHG Available v4 All LinkDown Routes Off v4 Default LinkDown Routes Off v6 All LinkDown Routes Off v6 Default LinkDown Routes Off v4 All MC Forwarding On v4 Default MC Forwarding Off v6 All MC Forwarding On v6 Default MC Forwarding Off Route Route Neighbor LSP LSP VRF Installs Removals Updates Installs Removals default 26 7 0 0 0 sharpd@eva ~> I am doing this work because FRR keeps having operators not know about how to properly use mpls. Let's make FRR behave a bit better in this weird edge case. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-08 09:15:22 -04:00
Donald Sharp	e63831f133	zebra: Add IF_ZEBRA_DATA_X define There are 2 defines IF_ZEBRA_MULTICAST_X and IF_ZEBRA_SHUTDOWN_X macros that do the same thing. Combine into one. Future commits will use the IF_ZEBRA_DATA_X macro as well. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-08 09:15:22 -04:00
Donald Sharp	a69b10c1e6	zebra: Cleanup unguarded debug Left over debug from earlier commits Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-08 09:15:22 -04:00
Donald Sharp	0a5f9773a8	zebra: zrouter.in_shutdown is an atomic variable So let's treat the variable like it is atomic and properly load it when we need to look at it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-05 07:51:27 -04:00
Donald Sharp	d5795103bc	zebra: Fix memory leaks and use after frees in nhg's on shutdown Fixup both memory leaks as well as use after free's in nhg's on shutdown. This approach is effectively just iterating through all the hash items and directly just freeing the memory instead of handling ref counts or cross references. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-05 07:51:27 -04:00
Donald Sharp	34a67a7d1e	zebra: When saving nhg for later stop processing Commit `35729f38fa` introduced the idea of holding a nexthop group for a small amount of time before removing it from the system. When this code was introduced the nexthop group entry was saved and a timer started, except instead of stopping processing at that point in time, zebra was continuing on and deleting nexthop group entries that that entry depended on as well. This should not be done until the timer pops. Fixes: #11596 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-05 07:51:27 -04:00
Stephen Worley	098a55c5f8	Merge pull request #11713 from anlancs/fix/evpn-mh-bond-redirect zebra: fix bond down for evpn-mh	2022-08-03 10:11:08 -04:00
anlan_cs	df6c198269	zebra: fix bond down for evpn-mh The test case is with `redirect-off` in evpn multi-homing environment: ``` evpn mh redirect-off ``` After the environment is setup, do the following steps: 1) Let one member of ES learn one mac: ``` 2e:52:bb:bb:2f:46 dev ae1 vlan 100 master bridge0 static ``` Now everything is ok and the mac can be synced to other ES peers. 2) Shutdown bond1. At this time, zebra will get three netlink messages, not one as current code expected. Like: ``` e4:f0:04:89:b6:46 dev vxlan10030 vlan 30 master bridge0 static <-A e4:f0:04:89:b6:46 dev vxlan10030 nhid 536870913 self extern_learn <-B e4:f0:04:89:b6:46 dev vxlan10030 vlan 30 self <-C ``` With A), zebra will wrongly remove this mac again: ``` ZEBRA: dpAdd remote MAC e4:f0:04:89:b6:46 VID 30 ZEBRA: Add/update remote MAC e4:f0:04:89:b6:46 intf vxlan10030(26) VNI 10030 flags 0xa01 - del local ZEBRA: Send MACIP Del f None MAC e4:f0:04:89:b6:46 IP (null) seq 0 L2-VNI 10030 ESI - to bgp ``` With C), zebra will wrongly add this mac again: ``` ZEBRA: Rx RTM_NEWNEIGH AF_BRIDGE IF 26 VLAN 30 st 0x2 fl 0x2 MAC e4:f0:04:89:b6:46 nhg 0 ZEBRA: dpAdd remote MAC e4:f0:04:89:b6:46 VID 30 ``` zebra should skip the two messages with `vid`. Otherwise, it will send many wrong messages to bgpd, and the logic is wrong. `nhg/dst` is in 2nd message without `vid`, it is useful to call `zebra_evpn_add_update_local_mac()`. But it will fail with "could not find EVPN" warning for no `vid`, can't call `zebra_evpn_add_update_local_mac()`: With B): ``` ZEBRA: Rx RTM_NEWNEIGH AF_BRIDGE IF 26 st 0x2 fl 0x12 MAC e4:f0:04:89:b6:46 nhg 536870913 ZEBRA: dpAdd local-nw-MAC e4:f0:04:89:b6:46 VID 0 ZEBRA: Add/Update MAC e4:f0:04:89:b6:46 intf ae1(18) VID 0, could not find EVPN ``` Here, we can get `vid` from vxlan interface instead of from netlink message. In summary, `zebra_vxlan_dp_network_mac_add()` will process the three messages wrongly expecting only one messsage, so its logic is wrong. Just skip the two unuseful messages with `vid`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-08-03 02:47:16 -04:00
Donald Sharp	a17eb04734	zebra: Remove usage of newline in zlog_X message Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-29 18:31:58 -04:00
Donald Sharp	1cf42d6a00	zebra: Fix lost memory on lsp free When cleaning up memory associated with a lsp the nhlfe is lost in some cases. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-26 12:41:11 -04:00
Donatas Abraitis	eed5b70f94	Merge pull request #11657 from donaldsharp/why_timer convert thread_cancel to THREAD_OFF	2022-07-22 08:26:08 +03:00
Donald Sharp	146bcb9b92	zebra: Convert thread_cancel to THREAD_OFF Just convert all uses of thread_cancel to THREAD_OFF Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-21 08:30:50 -04:00
Donald Sharp	cb1991af8c	*: frr_with_mutex change to follow our standard convert: frr_with_mutex(..) to: frr_with_mutex (..) To make all our code agree with what clang-format is going to produce Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-20 15:50:32 -04:00
Donald Sharp	e05506b8dd	zebra: Add some more data to rtadv socket failures The creation of the rtadv socket can fail but there is very very little data associated with this event to let the operator know something has gone terribly wrong. Please note if this socket fails to create or fails the setsockopt's rtadv is basically just really really messed up. I am not sure what can be done here. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-19 13:57:56 -04:00
Russ White	b40a74fc2d	Merge pull request #11613 from anlancs/fix/minor-6 zebra: correctly display one debug flag about ES Peer	2022-07-19 11:16:54 -04:00
Jafar Al-Gharaibeh	13d7231039	Merge pull request #11635 from opensourcerouting/fix/memory_leak_for_mpls zebra: Cleanup the memory from the hash for MPLS stuff	2022-07-18 12:42:36 -05:00
Donald Sharp	5f5b29862f	Merge pull request #11626 from opensourcerouting/fix/avoid_buffer_overflow zebra: Avoid buffer overflow using netlink_parse_rtattr_nested()	2022-07-18 09:20:11 -04:00
Donatas Abraitis	912b6a5b5c	zebra: Cleanup the memory from the hash for MPLS stuff ==1595641== 280 (80 direct, 200 indirect) bytes in 1 blocks are definitely lost in loss record 30 of 38 ==1595641== at 0x483AB65: calloc (vg_replace_malloc.c:760) ==1595641== by 0x493C89C: qcalloc (memory.c:116) ==1595641== by 0x1E8426: lsp_alloc (zebra_mpls.c:1116) ==1595641== by 0x49147F1: hash_get (hash.c:162) ==1595641== by 0x1EC880: mpls_lsp_install (zebra_mpls.c:3192) ==1595641== by 0x1C51BB: zread_vrf_label (zapi_msg.c:3197) ==1595641== by 0x1C6F11: zserv_handle_commands (zapi_msg.c:3863) ==1595641== by 0x24D0F4: zserv_process_messages (zserv.c:523) ==1595641== by 0x498F4CC: thread_call (thread.c:2002) ==1595641== by 0x49253A2: frr_run (libfrr.c:1198) ==1595641== by 0x1A28BA: main (main.c:475) ==1595641== ==1595641== 1,400 (400 direct, 1,000 indirect) bytes in 5 blocks are definitely lost in loss record 35 of 38 ==1595641== at 0x483AB65: calloc (vg_replace_malloc.c:760) ==1595641== by 0x493C89C: qcalloc (memory.c:116) ==1595641== by 0x1E8426: lsp_alloc (zebra_mpls.c:1116) ==1595641== by 0x49147F1: hash_get (hash.c:162) ==1595641== by 0x1EBD7C: mpls_zapi_labels_process (zebra_mpls.c:2915) ==1595641== by 0x1C35D9: zread_mpls_labels_add (zapi_msg.c:2513) ==1595641== by 0x1C6F11: zserv_handle_commands (zapi_msg.c:3863) ==1595641== by 0x24D0F4: zserv_process_messages (zserv.c:523) ==1595641== by 0x498F4CC: thread_call (thread.c:2002) ==1595641== by 0x49253A2: frr_run (libfrr.c:1198) ==1595641== by 0x1A28BA: main (main.c:475) Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-07-18 14:26:29 +03:00
Donatas Abraitis	ce39ca16dd	zebra: Avoid buffer overflow using netlink_parse_rtattr_nested() memset(tb, 0, sizeof(struct rtattr ) (max + 1)); in netlink_parse_rtattr() seems a good candidate to buffer overflow. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-07-17 22:31:48 +03:00
Donald Sharp	2e4e3ba10b	zebra: Delete the malloced memory under `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-16 19:01:12 -04:00
Donald Sharp	9d1fec4c7e	zebra: When deleting nexthop group entries ensure the thread is off Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-16 19:00:43 -04:00
anlan_cs	0ef63eb5cc	zebra: correctly display one debug flag about ES Peer Since the `mac->flags` with `ZEBRA_MAC_ES_PEER_ACTIVE` is about ES Peer, it should be displayed as `PEER Active`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-07-15 19:24:24 -04:00
Christian Hopps	11c9ab3202	zebra: free neighbor state before exit to avoid memleaks Signed-off-by: Christian Hopps <chopps@labn.net>	2022-07-14 12:20:01 -04:00
anlan_cs	9faf4e426c	zebra: remove redundant flags assignment The assignment of neigh flags on ES peers is set twice. Just clean it. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-07-12 14:18:18 -04:00
David Lamparter	ca8a395da8	zebra: debug decode RTA_EXPIRES and RTA_MFC_STATS Just adding two more attributes to decode and show nicely in netlink msgdump debug output. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-07-07 11:36:44 +02:00
David Lamparter	e1cd4bb178	zebra: fix remaining MR RTM_GETROUTE oddities The IPv6 version needs rtm_src_len and rtm_dst_len filled in due to strict validation. IPv4 also has this requirement, but zebra is running in non-strict mode there so the kernel accepts it... Also the table ID hack is IPv4 only. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-07-07 11:36:40 +02:00
David Lamparter	c6a89c8ef5	zebra: IIF/OIF are not used in MR RTM_GETROUTE The multicast routing RTM_GETROUTE command does not use IIF/OIF attributes, and the IPv6 version will refuse them with an error due to being new netlink API and thus using strict validation. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-07-06 11:29:56 +02:00
David Lamparter	31071fe357	zebra: netlink MR is a rtmsg, not ndmsg These two structs happen to be the same size and have the family field in the same spot, but the correct one to use here is rtmsg not ndmsg. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-07-06 11:23:06 +02:00
David Lamparter	afeb8524e2	zebra: correctly ignore multicast nl msgs zebra does not care about _notifications_ from the kernel regarding multicast routing; we only use the MR netlink API to request stats from the kernel by actively sending a RTM_GETROUTE. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-07-06 11:19:05 +02:00
Russ White	1dc900fe27	Merge pull request #11502 from donaldsharp/zebra_dplane_fini zebra: make rib_process_dplane_results own ctx freeing	2022-07-05 09:49:02 -04:00
David Lamparter	87101cd399	Merge pull request #11507 from donaldsharp/setsockopt_changes	2022-07-04 10:38:25 +02:00
Donatas Abraitis	484cc1bd97	Merge pull request #11514 from donaldsharp/zebra_odds_and_ends Zebra odds and ends	2022-07-04 08:11:29 +03:00
Donald Sharp	7a96ccc7b4	zebra: Add a subqueue2str function to give more useful data in debugs New output example: 2022-07-03 09:40:29.310 [DEBG] zebra: [JF0K0-DVHWH] rib_meta_queue_add: (0:254):4.5.6.8/32: queued rn 0x55937f586ee0 into sub-queue Kernel Routes 2022-07-03 09:40:29.321 [DEBG] zebra: [HH6N2-PDCJS] default(0:254):4.5.6.8/32 rn 0x55937f586ee0 dequeued from sub-queue Kernel Routes Let's make it a bit more human readable. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-03 09:41:31 -04:00
Rafael Zalamena	26e95efa4d	zebra: handle FreeBSD routing socket ENOBUFS This is a slightly modified version of Hiroki Sato's version: `9ca79c941f` Handle the `ENOBUFS` on a OS basis since it could have been implemented differently (OpenBSD for an example uses `RTM_DESYNC`). Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2022-07-01 10:21:41 -03:00
Donald Sharp	b09388f0ea	zebra: Add more cases to proto2zebra for understanding kernel routes Just some missing ones. Make zebra stop complaining, was getting some messages from proto2zebra when doing testing, let's clean that up from happening. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-01 07:59:53 -04:00
Donald Sharp	88b0baa648	zebra: move allow_delete to zrouter.allow_delete Instead of having global allow_delete move it to where it belongs in the zrouter data structure. Additionally show this data in `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-01 07:59:53 -04:00
Donald Sharp	c3dcd24bc2	zebra: Notice to end operator when a failure happens When reading a multipath route and we detect an encoding error from the kernel( yeah I don't think so either ), let's tell the operator what happened to that route. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-07-01 07:59:53 -04:00
Donald Sharp	cc408c062b	zebra: Realign SOL_NETLINK to warn when FRR does not have it There exists a possibility that an end operator has choosen to compile FRR on an extremely old KERNEL that does not support the SOL_NETLINK sockopt call. If so let's note it for them instead of stuff silently not working. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-30 08:03:02 -04:00
Donald Sharp	fe953d7cde	zebra: Correct implication of SOL_NETLINK NETLINK_ADD_MEMBERSHIP usage The usage of SOL_NETLINK for adding memberships of interest is 1 group per call. The netink_socket function implied that the call could be a bitfield of values. This is not correct at all. This will trip someone else up in the future when a new value is needed. Let's get it right `now` before it becomes a problem. Let's also add a bit of extra code to give operator a better understanding of what went wrong when a kernel does not support the option. Finally as a point of future reference should FRR just switch over to a loop to add the required loops instead of having this bastardized approach of some going in one way and some going in another way? Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-30 07:57:55 -04:00
Donald Sharp	f00b37e710	zebra: make rib_process_dplane_results own ctx freeing The rib_process_dplane_results function was having each sub function handler process the results and then free the ctx. Lot's of functionality that needs to remember to free the context. Let's just free it in the main loop. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-29 15:24:20 -04:00
mobash-rasool	d841284b70	Merge pull request #11454 from routingrocks/evpn_clag_frrlogs Zebra EVPN Debug: Fixing log flooding when disabling MLAG leaf config…	2022-06-28 17:32:10 +05:30
Russ White	5295200043	Merge pull request #11474 from donaldsharp/dp-dpdk Dp dpdk	2022-06-28 07:33:22 -04:00
Donald Sharp	385d37ab31	zebra: Add ability for netconf dplane to handle global values Add the ability for the netconf dplane code to handle the global NETCONFA_IFINDEX_DEFAULT and NETCONF_IFINDEX_ALL values. Then store our interested values when we get them from the kernel as well as being able to display them to the end operator. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-27 16:30:20 -04:00
Donald Sharp	d53dc9bd81	zebra: Pass afi received for netconf updates When Zebra receives the netconf update an afi is passed let's seperate that out and track the v4/v6 specific data to save and store appropriately. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-27 15:26:13 -04:00
Donald Sharp	4c84aa5ebd	zebra: mc_forwarding was being sent but not retrieved across dataplane The mc_forwarding status for an interface was being sent but not properly retrieved on the zebra master side of the dplane. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-27 15:04:21 -04:00
Anuradha Karuppiah	7f0416b368	zebra: PBR dpdk programming 1. Offload PBR rule add/del 2. Query DPDK flow stats and display per-PBR entry Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 08:08:02 -04:00
Anuradha Karuppiah	a66d624661	zebra: setup the zebra interface to dpdk port map table 1. Create mappping table between ifIndex and dpdk-port-id 2. Start the DPDK port Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	67f5a23240	zebra: initialize hw via DPDK Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	fd03f1d4b7	configure, zebra: include DPDK headers and shared libs in the dp-dpdk build Signed-off-by: Donald Sharp <sharpd@nvidia.com> -> Moved new capabilities needed to under HAVE_DPDK Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	36c3b29675	zebra: infastructure for the new dataplane plugin Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	0c57fcc731	zebra: add ipc_lock, read_search and sys_rawio to zebra's privileges These are needed for dpdk:rte_eal_init. Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	9898473fb4	zebra: pass PBR expanded actions to the dataplane These attributes are needed for dpdk dataplane programming Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	59f47eb010	zebra: expand pbr rule action for dataplane programming PBR rules are installed as match, action rules in most dataplanes. This requires the action to be resolved via a GW. And the GW to be subsequently resolved to {SMAC, DMAC}. Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Anuradha Karuppiah	4cf4fad153	zebra: add support for maintaining local neigh entries Currently specific local neighbors (attached to SVIs) are maintatined in an EVPN specific database. There is a need to maintain L3 neighbors for other purposes including MAC resolution for PBR nexthops. Signed-off-by: Donald Sharp <sharpd@nvidia.com> Cleanup compile and fix crash Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2022-06-27 07:56:55 -04:00
Donatas Abraitis	cc6c75300d	Merge pull request #11429 from donaldsharp/interface_funkiness zebra: Fix rtadv startup when config read in is before interface up	2022-06-24 23:07:45 +03:00
Jafar Al-Gharaibeh	443795a864	Merge pull request #11469 from donaldsharp/fdev2 zebra: netlink rtm tunnel msg parsing	2022-06-24 10:05:56 -05:00
Russ White	a9adefc22f	Merge pull request #11464 from donaldsharp/linkdown Linkdown	2022-06-24 10:15:55 -04:00
Donald Sharp	4cd26be0f3	Merge pull request #11258 from anlancs/fix/zebra-keep-nb-check zebra: move the checks for l3vni	2022-06-24 09:46:12 -04:00
Chirag Shah	acc8e68720	zebra: netlink rtm tunnel msg parsing 'bridge vni add vni <id> dev <vxlan device>' generates new RTM_NEWTUNNEL and RTM_DELTUNNEL to add or remove vni to l3vxlan device. Register new RTNLGRP_TUNNEL group to receive new netlink notification. Callback for the new RTM_xxxTUNNEL. kernel patches: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/ linux.git/commit/?h=v5.18-rc7&id=7b8135f4df98b155b23754b6065c157861e268f1 https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/ linux.git/commit/?h=v5.18-rc7&id=f9c4bb0b245cee35ef66f75bf409c9573d934cf9 Ticket:#3073812 Testing Done: Signed-off-by: Donald Sharp <sharpd@nvidia.com> Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-06-24 07:33:34 -04:00
Donald Sharp	7937058b94	zebra: Fix rtadv startup when config read in is before interface up When a interface is configured with this: int eva ipv6 nd ra-interval 5 no ipv6 nd suppress-ra ! And then subsuquently the interface is created and brought up, FRR would both error on joining the RA multicast address and never properly work in this state. Delay the startup of the join and start of the Router Advertisements until after the ifindex has actually been found. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-24 07:18:41 -04:00
Donald Sharp	8a8fd10a47	Merge pull request #11453 from ribarroetavena/master zebra: rtnetlink: flow attr per gateway attr in multipath updates	2022-06-23 13:55:51 -04:00
Donald Sharp	fc3de981be	zebra: Allow kernel routes to stick around better on interface state changes Currently kernel routes on system bring up would be `auto-accepted`, then if an interface went down all kernel and system routes would be re-evaluated. There exists situations where a kernel route can exist but the interface itself is not exactly in a state that is ready to create a connected route yet. As such when any interface goes down in the system all kernel/system routes would be re-evaluated and then since that interfaces connected route is not in the table yet the route is matching against a default route( or not at all ) and is being dropped. Modify the code such that kernel or system routes just look for interface being in a good state (up or operative) and accept it. Broken code: eva# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:05:08 K>* 1.2.3.5/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.6/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.7/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.8/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.9/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.10/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.11/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.12/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.13/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.14/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.16/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 1.2.3.17/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 C>* 4.5.6.99/32 is directly connected, dummy9, 00:05:08 K>* 4.9.10.11/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:05:08 K>* 10.11.12.13/32 [0/0] via 192.168.119.1, enp39s0, 00:05:08 C>* 192.168.10.0/24 is directly connected, dummy99, 00:05:08 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:05:08 <shutdown a non-related interface> eva# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:05:28 C>* 4.5.6.99/32 is directly connected, dummy9, 00:05:28 K>* 10.11.12.13/32 [0/0] via 192.168.119.1, enp39s0, 00:05:28 C>* 192.168.10.0/24 is directly connected, dummy99, 00:05:28 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:05:28 Working code: eva# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:00:04 K>* 1.2.3.5/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.6/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.7/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.8/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.9/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.10/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.11/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.12/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.13/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.14/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.16/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 1.2.3.17/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 C>* 4.5.6.99/32 is directly connected, dummy9, 00:00:04 K>* 4.9.10.11/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:04 K>* 10.11.12.13/32 [0/0] via 192.168.119.1, enp39s0, 00:00:04 C>* 192.168.10.0/24 is directly connected, dummy99, 00:00:04 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:00:04 <shutdown a non-related interface> eva# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:00:15 K>* 1.2.3.5/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.6/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.7/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.8/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.9/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.10/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.11/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.12/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.13/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.14/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.16/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 1.2.3.17/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 C>* 4.5.6.99/32 is directly connected, dummy9, 00:00:15 K>* 4.9.10.11/32 [0/0] via 172.22.0.44, br-23e378ed7fd2 linkdown, 00:00:15 K>* 10.11.12.13/32 [0/0] via 192.168.119.1, enp39s0, 00:00:15 C>* 192.168.10.0/24 is directly connected, dummy99, 00:00:15 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:00:15 eva# Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-23 12:22:30 -04:00
Donald Sharp	3689905d32	zebra: Add interface sysctl ignore on linkdown status Add the ability to decode the ignore on linkdown nexthop status for an interface. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-23 11:23:37 -04:00
Donald Sharp	c704cb44a9	lib, zebra: Notice when a nexthop is set linkdown When a nexthop is set RTNH_F_LINKDOWN, start noticing that this flag is set. Allow FRR to know about this flag but at this point do not do anything with it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-23 11:23:37 -04:00
Donald Sharp	2b9dc841b7	zebra: Fix bug in netconf handling where dplane would drop the change When reading a on the fly change of an interested netconf netlink message. The ifindex and ns_id for the context was being set for the sub structure but not for the main context data structure and zebra_if_dplane_result was dropping the result on the floor because it was expecting the ns_id and the interface id to be in a different spot. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-23 11:23:08 -04:00
Ricardo	63eaefa86c	zebra: rtnetlink: flow attr per gateway attr in multipath updates Signed-off-by: Ricardo <rbarroetavena@anura.com.ar>	2022-06-23 12:05:26 -03:00
Russ White	98b3ab772e	Merge pull request #10629 from leonshaw/fix/mp-evpn-nh lib, zebra, bgpd: Move route EVPN flag to nexthop	2022-06-23 07:00:33 -04:00
anlan_cs	41a8b88ce4	zebra: move the check for l3vni The two checks for l3vni have been already done in `lib_vrf_zebra_l3vni_id_modify()` as it should be. And it is improper that the two checks are put after `zebra_vxlan_handle_vni_transition()`, which will do real things. My original fix is to remove them. But NB module can't guarantee many changes, so we'd better keep them in `zebra_vxlan_process_vrf_vni_cmd()` in APPLY stage for safe. Just move them in front of `zebra_vxlan_handle_vni_transition()`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-06-23 01:33:45 -04:00
Donatas Abraitis	0ba27f1cb2	Merge pull request #11427 from anlancs/fix/minor-2 zebra: remove redundant calling hook for fpm	2022-06-22 11:39:13 +03:00
rvaratharaj	4bf66f436e	Zebra EVPN Debug: Fixing log flooding when disabling MLAG leaf configuration When disabling MLAG leaf configuration with EVPN, logs are getting flooded for each VNI, This is the result of each Type-2 packets. Ideally, this should be under log debugging, not a warning. Testing: UT Signed-off-by: Rajesh Varatharaj <rvaratharaj@nvidia.com>	2022-06-21 18:18:14 -07:00
Donatas Abraitis	01ceb8b23c	Merge pull request #11417 from donaldsharp/nhg_timer Nhg timer	2022-06-21 18:43:45 +03:00
Donatas Abraitis	4ed0abcb55	Merge pull request #11423 from donaldsharp/lgtm_fixes Lgtm fixes	2022-06-19 22:33:01 +03:00
anlan_cs	f1f4a65288	zebra: remove redundant calling hook for fpm Since the calling hook for old fpm is done in `rib_uninstall_kernel()` inside, this calling place outside should be redundant. Just remove it. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-06-17 17:42:12 -04:00
Donald Sharp	8dfcf20fa0	Merge pull request #11419 from anlancs/fix/minor-1 zebra, lib: minor changes	2022-06-16 16:35:38 -04:00
Donald Sharp	87472c6999	zebra: ret is always -1 or 0 at this point so remove the if test Remove the if test and cleanup the code to better align. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-16 16:31:35 -04:00
Donald Sharp	c9af62e314	zebra: Add a configurable knob `zebra nexthop-group keep (1-3600)` Allow end operator to set how long a nexthop-group is kept around in the system after it is no-longer being used. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-16 14:47:19 -04:00
Donald Sharp	35729f38fa	zebra: Add a timer to nexthop group deletion Before deleting nexthop groups, that are installed, from the system, start a timer and hold the nexthop group for that time. Suppose you have this scenario a) create a static route with 1 x ecmp creates a nhg with 1 x ecmp b) create a static route with 2 x ecmp creates a nhg with 2 x ecmp deletes a's nhg c) create a static route with 3 x ecmp creates a nhg with 3 x ecmp deletes b's nhg d) create a different route with 1 x ecmp creates another 1 x ecmp ( since a's ecmp was deleted ) e) create a different route with 2 x ecmp creates another 2 x ecmp ( since b's ecmp was deleted ) If you don't delete the nhg, start a timer, the nhg's used in steps a and b can be reused for steps d and e. This reduces overhead work with zebra <-> kernel interactions and improves the speed of the system. So modify the code to note that an installed nexthop group should be kept around a bit and hopefully reused. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-16 14:47:19 -04:00
Donald Sharp	382858d015	zebra: Move where zebra marks a nhg as uninstalled in fib Currently the code is marking the nhg as uninstalled but not causing that to flood up to the dependent nhgs: nhg 3 is a group of 1/2 1 -> interface A 2 -> interface B Suppose A goes down, old code would mark nhg 1 as !VALID and !INSTALLED. Suppose B then goes down, old code would mark nhg 2 as !VALID and !INSTALLED But would not mark nhg 3 as !VALID and !INSTALLED (sort of assuming that it would just be cleaned up by NHG refcounts ). I would prefer that the code is pedantic about nhg 3 actually being removed from the system. This code moves the setting of !INSTALLED into zebra_nhg.c where it really belongs. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-16 14:47:19 -04:00
Donald Sharp	5772319ef1	zebra: Document some data structures better I keep getting confused about nhg_depends and nhg_dependents. So take a second and write them down for the next person. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-16 14:47:19 -04:00
Donatas Abraitis	7ea104c111	Merge pull request #11415 from donaldsharp/interface_duplication_linux Interface duplication linux	2022-06-16 21:19:14 +03:00
anlan_cs	e3f05a8a1a	zebra: adjust one variable name Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-06-16 10:15:12 -04:00
Russ White	93ab9a2c0b	Merge pull request #11409 from donaldsharp/zebra_nhg_debug Zebra nhg debug	2022-06-16 10:13:20 -04:00
Donald Sharp	8b9b1d6043	zebra: On linux let interface data come in through netlink messaging Consolidate on linux to using the netlink api for gathering all data about a interface. Leave this interface alone in the meantime for other OS's. This also has the side effect of reducing the amount of work being done on linux in that FRR was handling shut/no shut events 2 times. Once for the ioctl question asked and once for the netlink message received. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-15 10:34:30 -04:00
Donald Sharp	7c4910cead	zebra: Attempt to make ioctl.c have a bit more useful log messges While examining the code, it was noticed that there was a chance to improve the log output in some cases to give a fuller understanding of what went wrong where. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-15 10:34:30 -04:00
Donald Sharp	d9db1a4092	zebra: stream_dup cannot fail If stream_dup was unable to actually allocate memory then FRR would crash instead. So let's remove the check for null since it is not needed. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-15 10:24:01 -04:00
Donald Sharp	68d188be7a	zebra: Convert debugs to use %pNG The nexthop group debugs were using %u to just display the id. I found this very hard to figure out what was going on. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-14 20:25:56 -04:00
Donald Sharp	cc75cbea1b	zebra: Add `%pNG` to zebra print routines Add `%pNG` so that a nexthop group can be displayed in debugs/logs such that it can provide useful information. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-06-14 20:25:56 -04:00
Donald Sharp	f90391998c	Merge pull request #11229 from anlancs/fix/zebra-nb-remove-checknode zebra: remove one unnecessary check for l3vni nb	2022-06-14 13:58:16 -04:00
Donatas Abraitis	70dd370f5a	*: Use autocomplete for route-maps under commands that require it For example: ``` donatas-laptop# show bgp ipv4 unicast neighbors 127.0.0.2 advertised-routes route-map ? RMAP_NAME Name of the route map testas2 testas donatas-laptop(config)# router bgp donatas-laptop(config-router)# address-family ipv4 donatas-laptop(config-router-af)# redistribute connected route-map ? RMAP_NAME Pointer to route-map entries testas2 testas donatas-laptop(config-router-af)# network 192.168.0.0/23 route-map ? RMAP_NAME Name of the route map testas2 testas ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-06-13 21:00:51 +03:00
Xiao Liang	5609e70fb8	lib, zebra, bgpd: Move route EVPN flag to nexthop Multipath route may have mixed nexthops of EVPN and IP unicast. Move EVPN flag to nexthop to support such cases. Signed-off-by: Xiao Liang <shaw.leon@gmail.com>	2022-06-10 17:12:48 +08:00
Chirag Shah	e5b1de8a11	zebra: add error check condition to sock option Adding error checking condition which was missed in PR-11216. *** CID 1517953: Error handling issues (CHECKED_RETURN) /zebra/kernel_netlink.c: 313 in netlink_socket() 307 memset(&snl, 0, sizeof(snl)); 308 snl.nl_family = AF_NETLINK; 309 snl.nl_groups = groups; 310 311 #if defined SOL_NETLINK 312 if (ext_groups) >>> CID 1517953: Error handling issues (CHECKED_RETURN) >>> Calling "setsockopt(sock, 270, 1, &ext_groups, 8U)" without checking return value. This library function may fail and return an error code. 313 setsockopt(sock, SOL_NETLINK, NETLINK_ADD_MEMBERSHIP, 314 &ext_groups, sizeof(ext_groups)); 315 #endif 316 317 /* Bind the socket to the netlink structure for anything. / 318 ret = bind(sock, (struct sockaddr )&snl, sizeof(snl)); Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-05-31 13:50:48 -07:00
Trey Aspelund	56599dd9c8	zebra: Update advertise-svi-ip MACIPs w/ new MAC When the kernel was sending an RTM_NEWLINK updating the MAC of a known SVI, Type-2 routes created by advertise-svi-ip were not getting updated with the new address. This adds removal of any old Type-2 routes (with old MAC) and creation of new Type-2 routes (with new MAC) into RTM_NEWLINK processing. Fixes: #11174 Signed-off-by: Trey Aspelund <taspelund@nvidia.com>	2022-05-28 03:30:05 -04:00
Philippe Guibert	c9250e28e8	zebra: avoid pbr iptable added twice when used with flowspec The usage of zebra dplane makes the job asyncronous which implies that a given job will try to add an iptable, while the second job will not know that its iptable is the same as the former one. The below exabgp rules stand for two bgp flowspec rules sent to the bgp device: flow { route {match { source 185.228.172.73/32; destination 0.0.0.0/0; source-port >=49156&<=49159; }then {redirect 213.242.114.113;}} route {match { source 185.228.172.73/32; destination 0.0.0.0/0; source-port >=49160&<=49163; }then {redirect 213.242.114.113;}} } This rule creates a single iptable, but in fact, the same iptable name is appended twice. This results in duplicated entries in the iptables context. This also results in contexts not flushed, when BGP session or 'flush' operation is performed. iptables-save: [..] -A PREROUTING -m set --match-set match0x55baf4c25cb0 src,src -g match0x55baf4c25cb0 -A PREROUTING -m set --match-set match0x55baf4c25cb0 src,src -g match0x55baf4c25cb0 -A match0x55baf4c25cb0 -j MARK --set-xmark 0x100/0xffffffff -A match0x55baf4c25cb0 -j ACCEPT -A match0x55baf4c25cb0 -j MARK --set-xmark 0x100/0xffffffff -A match0x55baf4c25cb0 -j ACCEPT [..] This commit addresses this issue, by checking that an iptable context is not already being processed. A flag is added in the original iptable context, and a check is done if the iptable context is not already being processed for install or uinstall. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2022-05-25 14:26:28 +02:00
Donatas Abraitis	4febdb6b9a	Merge pull request #10836 from anlancs/bgpd-mh-delay-esi zebra: delay setting esi in zebra_evpn_local_es_update()	2022-05-23 07:49:08 +02:00
David Lamparter	7ca9c407ed	zebra: clean up rtadv integration Move a few things into places they actually belong, and reduce the number of places we have `#ifdev HAVE_RTADV`. Just overall code prettification. ... I had actually done this quite a while ago while doing some other random hacking and thought it more useful to not be sitting on it on my disk... Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-05-21 14:14:01 +02:00
anlan_cs	c331ef1665	zebra: remove one unnecessary check for l3vni nb The parent node of "vrf" MUST be non-NULL, so the check is unnecessary and misleading. Otherwise, there will be a branch of NULL parent node, it makes no sense, remove it. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-20 03:11:27 -04:00
Sri Mohana Singamsetty	bde51e807f	Merge pull request #11216 from chiragshah6/fdev2 zebra: netlink registry of rtm tunnel notification	2022-05-19 10:28:25 -07:00
Sri Mohana Singamsetty	0e6e6bc36e	Merge pull request #11222 from donaldsharp/bgp_zebra_stuff Bgp zebra stuff	2022-05-19 09:41:41 -07:00
Sri Mohana Singamsetty	595ebf525b	Merge pull request #11210 from anlancs/fix/zebra-leak-vtp zebra: fix missing delete vtep during vni transition	2022-05-19 09:35:27 -07:00
Donald Sharp	1b3cf91b0c	zebra: Fix newline in log message Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-05-18 14:42:22 -04:00
Russ White	c1e2a1eae3	Merge pull request #11205 from chiragshah6/fdev1 zebra: new netlink parse utility for rta used to send nhg msg	2022-05-18 11:13:22 -04:00
Chirag Shah	42ed3bd77f	zebra: add netlink tunnel msg to dump routine This patch parses vxlan vnifilter rtm tunnel message which contains vni mapping to vxlan device. The new notifications are RTM_NEWTUNNEL, RTM_DELTUNNEL, and RTM_GETTUNNEL. https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/ linux.git/commit/?h=v5.18-rc7&id=7b8135f4df98b155b23754b6065c157861e268f1 Testing Done: 2022/05/18 00:34:25 ZEBRA: netlink_recv_msg: << netlink message dump [recv] 2022/05/18 00:34:25 ZEBRA: nlmsghdr [len=36 type=(120) NEWTUNNEL flags=(0x0000) {} seq=0 pid=0] 2022/05/18 00:34:25 ZEBRA: tnlm [family=(7) AF_BRIDGE ifindex=46 2022/05/18 00:34:25 ZEBRA: vni_start 4001, vni_end 0 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-05-18 07:56:44 -07:00
Chirag Shah	47e2eb270d	zebra: netlink registry rtm tunnel notif The kernel supports l3vxlan device to have (l3vni) vni filter similar to vlan filtering on bridge device. To receive netlink notification, FRR to register for new netlink RTNLGRP_TUNNEL message. This message required to register via additional socket option as it's beyond bitmap size. kernel patches: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/ linux.git/commit/?h=v5.18-rc7&id=7b8135f4df98b155b23754b6065c157861e268f1 https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/ linux.git/commit/?h=v5.18-rc7&id=f9c4bb0b245cee35ef66f75bf409c9573d934cf9 Ticket:#3073812 Testing Done: Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-05-18 07:56:35 -07:00
Mark Stapp	6ca1b0f44e	Merge pull request #11192 from cyberstorm-mauritius/zebra_netlink zebra: Add startup message and display netlink buffer size.	2022-05-17 08:13:23 -04:00
Chirag Shah	f8f3e484d4	zebra: new netlink parse utility for rta Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-05-16 10:45:14 -07:00
Chirag Shah	865c12e1a7	zebra: add protocol name to nexthop dump Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-05-16 08:40:19 -07:00
anlan_cs	0dfc0dd974	zebra: delay setting esi in zebra_evpn_local_es_update() Currently, `zif->es_info.esi` is always set even for a few unnecessary cases in `zebra_evpn_local_es_update()`. Delay setting `zif->es_info.esi` and remove the annoying rollback (i.e. unset `zif->es_info.esi`) operation on failure case. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-16 09:40:49 -04:00
anlan_cs	2fe5a02ea4	zebra: fix missing delete vtep during vni transition All `vtep`s in dplane should be deleted/uninstalled during vni transition. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-16 09:30:28 -04:00
Donald Sharp	950e7e6660	Merge pull request #11207 from anlancs/fix/zebra-remove-check-l3vni zebra: remove unncecessary check for l3vni	2022-05-16 08:02:58 -04:00
Rafael Zalamena	854dea850c	Merge pull request #11199 from donaldsharp/nexthop_dump zebra: Add encap and group type decoding to nexthop dump	2022-05-16 08:09:54 -03:00
anlan_cs	0717f2d83c	zebra: remove unncecessary check for l3vni Since `l3vni` created by `zl3vni_add()` is always valid, remove the check for it. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-16 05:40:15 -04:00
Donatas Abraitis	8f5e706a2f	Merge pull request #11201 from donaldsharp/unused_in_netlink_compiles Remove some unused functions in zebra	2022-05-16 09:57:30 +03:00
anlan_cs	81157cbd10	zebra: remove unnecessary check for "zevpn_vrf" The global vrf in zebra is always non-NULL. In general, it is bound to default vrf by `zebra_vrf_init()`, at other times bound to some specific vrf. Anyway, non-NULL. So remove all redundant checkings for the returned value of `zebra_vrf_get_evpn()`. Additionally, remove the unnecessary check for `zvrf` in `zebra_vxlan_cleanup_tables()`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-13 23:31:52 -04:00
Donald Sharp	20ceb5475d	zebra: Remove unused function `route_entry_copy_nexthops` This function is no longer used. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-05-13 16:11:09 -04:00
Donald Sharp	388907d53c	zebra: Remove unused functions in netlink compiles When compiling with netlink, Remove the usage of these functions. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-05-13 15:58:33 -04:00
Donald Sharp	c30c607027	zebra: Add encap and group type decoding to nexthop dump Add the ability to give data about the nexthop group type and encap type so that it is human readable. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-05-13 10:37:30 -04:00
Donald Sharp	f205a2309c	Merge pull request #11177 from opensourcerouting/fix/memset_memcpy *: memcpy/memset zeroing	2022-05-13 07:40:58 -04:00
Loganaden Velvindron	0c99696f30	zebra: Add startup message and display netlink buffer size. Add startup message and display netlink buffer size. Signed-off-by: Loganaden Velvindron <logan@cyberstorm.mu>	2022-05-13 14:58:18 +04:00
Donatas Abraitis	4d5a0ff391	Merge pull request #11186 from anlancs/fix/bgpd-comment-should-es bgpd,zebra: correct one debug log for evpn-mh	2022-05-12 11:32:25 +03:00
anlan_cs	b0b9a2fe52	bgpd,zebra: correct one debug log for evpn-mh Correct one debug log in evpn-mh. BTW, correct one misspelled word in comment. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-12 02:19:51 -04:00
Donatas Abraitis	6006b807b1	*: Properly use memset() when zeroing Wrong: memset(&a, 0, sizeof(struct ...)); Good: memset(&a, 0, sizeof(a)); Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-05-11 14:08:47 +03:00
Mark Stapp	00358e444e	Merge pull request #11155 from LabNConsulting/ziemba/link-delay-min-max zebra bugfix interface link-param: allow delay min <= avg <= max (was: min<avg<max)	2022-05-10 11:31:52 -04:00
Igor Ryzhov	2a3807c3ce	Merge pull request #11163 from opensourcerouting/fix/same_type_casting *: Avoid casting to the same type as on the left	2022-05-10 00:16:30 +03:00
Donatas Abraitis	8998807f69	*: Avoid casting to the same type as on the left Just not necessary. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-05-08 16:07:42 +03:00
Donatas Abraitis	432ee88c21	zebra, ospf6d: Do not check if NULL for XCALLOC() Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-05-08 15:43:21 +03:00
G. Paul Ziemba	d029fe275c	zebra/interface.c: allow link-param delay min <= avg <= max RFC 7471 Section 4.2.7: It is possible for min delay and max delay to be the same value. Prior to this change, the code required min < avg < max. This change allows min == avg and avg == max. test case: interface eth-rt1 link-params delay 8000 min 8000 max 8000 Signed-off-by: G. Paul Ziemba <paulz@labn.net>	2022-05-06 14:48:31 -07:00
Donatas Abraitis	50f1f2e724	Merge pull request #11059 from anlancs/fix/bgpd-evnp-wrong-check-hashget bgpd: fix memory leak for evpn	2022-05-04 21:19:51 +03:00
anlan_cs	8e3aae66ce	: remove the checking returned value for hash_get() Firstly, keep no change* for `hash_get()` with NULL `alloc_func`. Only focus on cases with non-NULL `alloc_func` of `hash_get()`. Since `hash_get()` with non-NULL `alloc_func` parameter shall not fail, just ignore the returned value of it. The returned value must not be NULL. So in this case, remove the unnecessary checking NULL or not for the returned value and add `void` in front of it. Importantly, also keep no change for the two cases with non-NULL `alloc_func` - 1) Use `assert(<returned_data> == <searching_data>)` to ensure it is a created node, not a found node. Refer to `isis_vertex_queue_insert()` of isisd, there are many examples of this case in isid. 2) Use `<returned_data> != <searching_data>` to judge it is a found node, then free <searching_data>. Refer to `aspath_intern()` of bgpd, there are many examples of this case in bgpd. Here, <returned_data> is the returned value from `hash_get()`, and <searching_data> is the data, which is to be put into hash table. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-05-03 00:41:48 +08:00
Rafael Zalamena	3682bd90f3	*: use FRR interface name definition everywhere Don't rely on the OS interface name length definition and use the FRR definition instead. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2022-05-02 13:00:12 -03:00
David Lamparter	46a3bfa695	Merge pull request #10988 from AbhishekNR/ipv6_mroute_cli	2022-04-29 10:23:37 +02:00
mobash-rasool	c4aa8aa669	Merge pull request #11114 from opensourcerouting/vrf-declvar-macros lib, zebra, pimd: clean up/fix VRF DECLVAR macros	2022-04-29 13:53:08 +05:30
David Lamparter	0cbed9511a	lib, zebra, pimd: clean up/fix VRF DECLVAR macros There's a common pattern of "get VRF context for CLI node" here, which first got a helper macro in zebra that then permeated into pimd. Unfortunately the pimd copy wasn't quite adjusted correctly and thus caused two coverity warnings (CID 1517453, CID 1517454). Fix the PIM one, and clean up by providing a common base macro in `lib/vty.h`. Also rename the macros (add `_VRF`) to make more clear what they do. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-04-28 11:09:26 +02:00
Abhishek N R	1d06e3547a	zebra: Removed show_ipv6_mroute cli from zebra_vty.c Signed-off-by: Abhishek N R <abnr@vmware.com>	2022-04-28 01:43:19 -07:00
Mobashshera Rasool	51f4fd9810	zebra, pimd: Add a field family in the message ZEBRA_IPMR_ROUTE_STATS 1. Adding a field family in the existing ZEBRA_IPMR_ROUTE_STATS to get the ipv4 as well as ipv6 trafic stats between pim and zebra. 2. Modify the debug to print both v4/v6 prefixes pimd: pim6d: Modify pim_zlookup_sg_statistics to get ipv6 stats Modify the pim_zlookup_sg_statistics api to get ipv4/ipv6 stats from zebra. Making the api common. Signed-off-by: Mobashshera Rasool <mrasool@vmware.com>	2022-04-28 01:10:49 -07:00
Mobashshera Rasool	4d3b4b1851	zebra: Modify base code to get ipv6 stats from kernel Modify the structure mcast_route_data to store ipv4/ipv6 addr and lastused multicast information from kernel. Adjust the related APIs to parse ipv4/ipv6 informations. Signed-off-by: Mobashshera Rasool <mrasool@vmware.com>	2022-04-28 01:10:49 -07:00
David Lamparter	34ee41c6c9	zebra, pimd: add AF param on NEXTHOP_LOOKUP_MRIB By changing this API call to use a `struct ipaddr`, which encodes the type of IP address with it. (And rename/remove the `IPV4` from the command name.) Also add a comment explaining that this function call is going to be obsolete in the long run since pimd needs to move to proper MRIB NHT. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-04-26 16:15:00 +02:00
David Lamparter	425fd200c9	zebra: add rib_match_ipv6_multicast variant ... for IPv6, analogous to v4. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-04-26 16:15:00 +02:00
Donald Sharp	0bba3bd873	zebra: Name variable better in zebra_trace.h Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:49:36 -04:00
Donald Sharp	1239b60c06	zebra: Add tracepoint for netlink_rule_change Add a tracepoint for the netlink_rule_change function. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:43:47 -04:00
Donald Sharp	3cee213500	zebra: Add tracepoint for netlink_route_change_read_unicast Add a tracepoint to zebra for the netlink_route_change_read_unicast functionality. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:43:47 -04:00
Donald Sharp	14ed061501	zebra: Add netlink_interface_addr tracepoint Add a tracepoint for netlink_interface_addr. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:43:47 -04:00
Donald Sharp	1d80c20919	zebra: Add netlink_nexthop_change tracepoint Add a tracepoint for the netlink_nexthop_change function. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:43:47 -04:00
Donald Sharp	097ef2afd1	zebra: Add netlink_request_intf_addr tracepoint Add a tracepoint for the netlink_request_intf_addr function. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:43:47 -04:00
Donald Sharp	d42e61420a	zebra: Add initial zebra tracepoint support Add initial zebra tracepoint support infrastructure as well as add a frr_zebra:netlink_interface callback. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-20 09:39:47 -04:00
Donald Sharp	a71e190d44	Merge pull request #10961 from opensourcerouting/build-ms-ext build: enable `-fms-extensions`	2022-04-20 07:51:45 -04:00
Donatas Abraitis	3d3c38b1d4	Merge pull request #11051 from donaldsharp/speell_more Speell more	2022-04-20 11:04:14 +03:00
mobash-rasool	1815b8f335	Merge pull request #11045 from anlancs/fix/bgpd-cleanup-8-remove zebra: cleanup duplicated "extern"s for evpn-mh	2022-04-20 13:28:17 +05:30
Volodymyr Huti	7fb9825cf7	zebra: set ZEBRA_IFC_DOWN on connected routes for inactive interfaces If you are in a situation where you have multiple addresses on an interface, zebra creates one connected route for them. The issue is that the rib entry is not created if addresses were added before the interface was running. We add the address to a running interface in a typical flow. Therefore, we handle the route & rib creation within a single ADD event. In the opposite case, we create the route entries without activating them. These are considered to be active since ZEBRA_IFC_DOWN is not set. On the following interface UP, we ignore the same ADDR_ADD as it overlaps with the existing prefixes -> rib is never created. The minimal reproducible setup: ----------------------------------------- ip link add name dummy0 type dummy ip addr flush dev dummy0 ip link set dummy0 down ip addr add 192.168.1.7/24 dev dummy0 ip addr add 192.168.1.8/24 dev dummy0 ip link set dummy0 up vtysh -c 'show ip route' \| grep dummy0 Signed-off-by: Volodymyr Huti <v.huti@vyos.io>	2022-04-19 22:53:57 +03:00
mobash-rasool	16b5065b47	Merge pull request #10908 from donaldsharp/proto_only_error zebra: When `zebra nexthop proto only` limit errors	2022-04-19 21:27:29 +05:30
Donald Sharp	4667220e3a	*: Fix spelling of accidently Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-19 08:31:30 -04:00
Donald Sharp	f526739897	*: Fix spelling of accomodate Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-19 08:29:58 -04:00
Donald Sharp	3819e4ced7	*: Fix spelling of inteface Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-19 08:21:31 -04:00
Donatas Abraitis	f5327fc339	Merge pull request #11012 from anlancs/bgpd-mh-simplify-condition zebra: simplify one check for evpn-mh	2022-04-19 13:04:43 +03:00
anlan_cs	4e5bda347c	zebra: cleanup duplicated "extern"s for evpn-mh There are some duplicated `extern`s in this header file, just remove them. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-04-19 05:20:10 -04:00
Russ White	1258cfcd8c	Merge pull request #11001 from donaldsharp/system_route_recursion zebra: Allow system routes to recurse through themselves	2022-04-18 09:47:47 -04:00
Donald Sharp	1cadfaf213	zebra: When `zebra nexthop proto only` limit errors Operators are seeing: Mar 28 07:19:37 kingpin zebra[418]: [TZANK-DEMSE] netlink_nexthop_msg_encode: nhg_id 68 (zebra): proto-based nexthops only, ignoring Mar 28 07:19:37 kingpin zebra[418]: [TZANK-DEMSE] netlink_nexthop_msg_encode: nhg_id 68 (zebra): proto-based nexthops only, ignoring Mar 28 07:19:37 kingpin zebra[418]: [YXPF5-B2CE0] netlink_route_multipath_msg_encode: RTM_DELROUTE 2804:4d48:4000::/42 vrf 0(254) Mar 28 07:19:37 kingpin zebra[418]: [YXPF5-B2CE0] netlink_route_multipath_msg_encode: RTM_NEWROUTE 2804:4d48:4000::/42 vrf 0(254) Mar 28 07:19:37 kingpin zebra[418]: [TVM3E-A8ZAG] _netlink_route_build_singlepath: (single-path): 2804:4d48:4000::/42 nexthop via fe80::b6fb:e4ff:fe26:c5d5 if 2 vrf default(0) Mar 28 07:19:37 kingpin zebra[418]: [HYEHE-CQZ9G] nl_batch_send: netlink-dp (NS 0), batch size=140, msg cnt=2 Mar 28 07:19:37 kingpin zebra[418]: [P2XBZ-RAFQ5][EC 4043309074] Failed to install Nexthop ID (68) into the kernel When `zebra nexthop proto only` is turned on. Effectively zebra intentionally does not do the nexthop group installation and the dplane notification in zebra_nhg.c just assumes it was a failure and prints an error message. Since this act was intentional, let's just notice that it was intentional and not report the message as a failure. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-18 09:41:38 -04:00
Donatas Abraitis	cd876f8a78	Merge pull request #10935 from anlancs/zebra-mh-esi-warning zebra: adjust the warnings for ESI of evpn-mh	2022-04-13 15:45:07 +03:00
anlan_cs	9a8fc8f88d	zebra: simplify one check for evpn-mh An simplification for one check in `zebra_evpn_mh_uplink_oper_flags_update()`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-04-12 01:26:54 -04:00
Donald Sharp	c9e4abf81f	zebra: Allow system routes to recurse through themselves Currently if a end user has something like this: Routing entry for 192.168.212.1/32 Known via "kernel", distance 0, metric 100, best Last update 00:07:50 ago * directly connected, ens5 Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.212.1, ens5, src 192.168.212.19, 00:00:15 C>* 192.168.212.0/27 is directly connected, ens5, 00:07:50 K>* 192.168.212.1/32 [0/100] is directly connected, ens5, 00:07:50 And FRR does a link flap, it refigures the route and rejects the default route: 2022/04/09 16:38:20 ZEBRA: [NZNZ4-7P54Y] default(0:254):0.0.0.0/0: Processing rn 0x56224dbb5b00 2022/04/09 16:38:20 ZEBRA: [ZJVZ4-XEGPF] default(0:254):0.0.0.0/0: Examine re 0x56224dbddc20 (kernel) status: Changed Installed flags: Selected dist 0 metric 100 2022/04/09 16:38:20 ZEBRA: [GG8QH-195KE] nexthop_active_update: re 0x56224dbddc20 nhe 0x56224dbdd950 (7), curr_nhe 0x56224dedb550 2022/04/09 16:38:20 ZEBRA: [T9JWA-N8HM5] nexthop_active_check: re 0x56224dbddc20, nexthop 192.168.212.1, via ens5 2022/04/09 16:38:20 ZEBRA: [M7EN1-55BTH] nexthop_active: Route Type kernel has not turned on recursion 2022/04/09 16:38:20 ZEBRA: [HJ48M-MB610] nexthop_active_check: Unable to find active nexthop 2022/04/09 16:38:20 ZEBRA: [JPJF4-TGCY5] default(0:254):0.0.0.0/0: After processing: old_selected 0x56224dbddc20 new_selected 0x0 old_fib 0x56224dbddc20 new_fib 0x0 So the 192.168.212.1 route is matched for the nexthop but it is not connected and zebra treats it as a problem. Modify the code such that if a system route matches through another system route, then it should work imo. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-09 13:17:14 -04:00
Donald Sharp	48dc861028	zebra: Allow multiple connected routes to be choosen for kernel routes This bug should only really affect kernel routes. To reproduce: a) Have multiple connected routes that point to the same prefix swp8 up default 169.254.0.250/30 swp9 up default 169.254.0.250/30 b) Have a kernel route that uses one of those connected routes 7.6.2.8 via 169.254.0.249 dev swp8 proto static (But have it choose a non-selected connected nexthop) c) Introduce an event that causes the rib table to be reprocessed, say a unrelated interface going up / down This causes the route to be lost with this message: 2022/03/28 21:21:53 ZEBRA: [YXCJP-0WZWV] netlink_nexthop_msg_encode: ID (3454): 169.254.0.249, via swp8(1383) vrf default(0) 2022/03/28 21:21:53 ZEBRA: [YF2E6-J60JH] nexthop_active: 169.254.0.249, via swp8 given ifindex does not match nexthops ifindex found found: directly connected, swp9 Effectively the nexthop that zebra is choosing would not be the one that the kernel route has choosen and FRR removes the route: 022/03/28 21:21:53 ZEBRA: [NM15X-X83N9] rib_process: (0:254):7.6.2.8/32: rn 0x56042e632e90, removing re 0x56042e6316e0 2022/03/28 21:21:53 ZEBRA: [Y53JX-CBC5H] rib_unlink: (0:254):7.6.2.8/32: rn 0x56042e632e90, re 0x56042e6316e0 2022/03/28 21:21:53 ZEBRA: [KT8QQ-45WQ0] rib_gc_dest: (0:?):7.6.2.8/32: removing dest from table What is happening? Zebra is not looking at all connected routes and if any of them would have the appropriate ifindex and just blindly rejecting the route. So when nexthop resolution happens and it matches a connected route and the dest->selected nexthop ifindex does not match, let's sort through the rest of them and see if any of them match and if so let's keep the route. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-08 08:15:20 -04:00
Donald Sharp	2c38c8ad35	Merge pull request #10928 from anlancs/zebra-cleanup-1 zebra: use "assert" instead of unnecessary check	2022-04-05 09:49:00 -04:00
Russ White	977405eeac	Merge pull request #10938 from anlancs/fix-zebra-vxlan-change-vrfid zebra: fix missing vrf change of l2vni on vxlan interface	2022-04-05 08:55:42 -04:00
David Lamparter	5b4f4e626f	build: first header must be zebra.h or config.h This has already been a requirement for Solaris, it is still a requirement for some of the autoconf feature checks to work correctly, and it will be a requirement for `-fms-extensions`. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-04-04 18:33:10 +02:00
Donald Sharp	07b12758be	pimd, zebra: Fix spelling of fowarding Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-02 07:46:19 -04:00
Donald Sharp	17be83bf99	*: Fix spelling of Gracefull Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-04-02 07:46:19 -04:00
anlan_cs	21311bc8a0	zebra: add whitespace after "%%" for prompt Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-04-01 03:27:20 -04:00
anlan_cs	2e39ebbb09	zebra: adjust the warnings for ESI of evpn-mh Since there are two kinds of ESI (Type-0 and Type-3), the warnings should distinguish between the two cases. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-04-01 03:00:11 -04:00
Trey Aspelund	436a6a3e51	zebra: don't send RAs w/o LLv6 or on bridge-ports It's confusing for a user to see 'Tx RA failed' in the logs when they've enabled RAs (either through interface config or BGP unnumbered) on an interface that can't send them. Let's avoid sending RAs on interfaces that are bridge_slaves or don't have a link-local address, since they are the two of the most common reasons for RA Tx failures. Signed-off-by: Trey Aspelund <taspelund@nvidia.com>	2022-03-31 16:38:37 +00:00
anlan_cs	c4992a2f71	zebra: fix missing vrf change of l2vni on vxlan interface The bounded vrf of `l2vni/zevpn` have wrong relation with the order in which vxlan interface and svi interface are set. If set vxlan interface with vlanid first, then set svi interface with vrf, it is ok that vxlan interface will get correct `vrf` inherited from svi. But reverse the set sequence (i.e. set svi first, then vxlan), vxlan interface can't get correct `vrf`, becasue the handling of `ZEBRA_VXLIF_VLAN_CHANGE` missed inheritting `vrf` by mistake. ``` host# do show evpn vni 101 VNI: 101 Type: L2 Tenant VRF: vrf1 ``` So update `vrf` ("Tenant VRF") of l2vni in `zebra_vxlan_if_update()`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-31 02:51:26 -04:00
anlan_cs	2be18df4dc	zebra: remove unnecessary check for parsing macfdb Since `NDA_VLAN` is no longer mannually defined in header file, the check for `NDA_VLAN` should be removed. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-30 05:50:21 -04:00
Donatas Abraitis	11fc3db305	Merge pull request #10902 from bobuhiro11/fix_zebra_srv6_func_bits zebra: fix doc and default value of "func-bits" for SRv6	2022-03-30 10:20:36 +03:00
anlan_cs	44a84850a9	zebra: use "assert" instead of unnecessary check Like `zvni_map_to_svi_ns()` for `ns_walk_func()`, just use "assert" instead of unnecessary check. Since these parameters for `ns_walk_func()`, e.g. `in_param` and others, must not be NULL. So use `assert` to ensure the these parameters, and remove those unnecessary checks. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-30 03:19:28 -04:00
Donald Sharp	80e39114b5	Merge pull request #10897 from opensourcerouting/safi-nht zebra,staticd,*: SAFI_MULTICAST NHT groundwork	2022-03-28 08:23:36 -04:00
Nobuhiro MIKI	fbd01eaa41	zebra: output optional param "func-bits" for SRv6 Signed-off-by: Nobuhiro MIKI <nmiki@yahoo-corp.jp>	2022-03-28 17:37:45 +09:00
David Lamparter	7d08e1e31c	zebra: add a few `const` in RNH code Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-03-27 14:57:22 +02:00
David Lamparter	6c90403bb1	zebra: `show ip nht mrib` Prints the SAFI_MULTICAST NHT state in zebra. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-03-27 14:57:18 +02:00
David Lamparter	e9ac2861e5	zebra: register NHT nexthops with proper SAFI Just a small puzzle piece missing in zebra SAFI NHT support. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-03-27 14:51:00 +02:00
David Lamparter	bc9b1cbfae	zebra: check other SAFIs when removing gone client When a client disconnects, we need to check & remove NHT entries for other SAFIs too. Otherwise we crash later trying to access stale data. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-03-27 14:51:00 +02:00
Donald Sharp	2f71996a68	zebra: Note when the netlink DUMP command is interrupted There exists code paths in the linux kernel where a dump command will be interrupted( I am not sure I understand what this really means ) and the data sent back from the kernel is wrong or incomplete. At this point in time I am not 100% certain what should be done, but let's start noticing that this has happened so we can formulate a plan or allow the end operator to know bad stuff is a foot at the circle K. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-25 19:08:14 -04:00
Donald Sharp	a5f711a11a	Merge pull request #10862 from anlancs/zebra-mh-svi-add zebra: optimization on the mac addition for evpn-mh	2022-03-25 10:09:59 -04:00
David Lamparter	619a6623cb	Merge pull request #10867 from donaldsharp/ifp_use_after_free	2022-03-25 06:55:37 +01:00
David Lamparter	f908faed4a	Merge pull request #10866 from donaldsharp/freebsd_unknown_type2str	2022-03-25 04:20:19 +01:00
Donald Sharp	d0438da6b0	zebra: Fix use after deletion event in freebsd In the FreeBSD code if you delete the interface and it has no configuration, the ifp pointer will be deleted from the system but zebra continues to dereference the just freed pointer. ==58624== Invalid read of size 1 ==58624== at 0x48539F3: strlcpy (in /usr/local/libexec/valgrind/vgpreload_memcheck-amd64-freebsd.so) ==58624== by 0x2B0565: ifreq_set_name (ioctl.c:48) ==58624== by 0x2B0565: if_get_flags (ioctl.c:416) ==58624== by 0x2B2D9E: ifan_read (kernel_socket.c:455) ==58624== by 0x2B2D9E: kernel_read (kernel_socket.c:1403) ==58624== by 0x499F46E: thread_call (thread.c:2002) ==58624== by 0x495D2B7: frr_run (libfrr.c:1196) ==58624== by 0x2B40B8: main (main.c:471) ==58624== Address 0x6baa7f0 is 64 bytes inside a block of size 432 free'd ==58624== at 0x484ECDC: free (in /usr/local/libexec/valgrind/vgpreload_memcheck-amd64-freebsd.so) ==58624== by 0x4953A64: if_delete (if.c:283) ==58624== by 0x2A93C1: if_delete_update (interface.c:874) ==58624== by 0x2B2DF3: ifan_read (kernel_socket.c:453) ==58624== by 0x2B2DF3: kernel_read (kernel_socket.c:1403) ==58624== by 0x499F46E: thread_call (thread.c:2002) ==58624== by 0x495D2B7: frr_run (libfrr.c:1196) ==58624== by 0x2B40B8: main (main.c:471) ==58624== Block was alloc'd at ==58624== at 0x4851381: calloc (in /usr/local/libexec/valgrind/vgpreload_memcheck-amd64-freebsd.so) ==58624== by 0x496A022: qcalloc (memory.c:116) ==58624== by 0x49546BC: if_new (if.c:164) ==58624== by 0x49546BC: if_create_name (if.c:218) ==58624== by 0x49546BC: if_get_by_name (if.c:603) ==58624== by 0x2B1295: ifm_read (kernel_socket.c:628) ==58624== by 0x2A7FB6: interface_list (if_sysctl.c:129) ==58624== by 0x2E99C8: zebra_ns_enable (zebra_ns.c:127) ==58624== by 0x2E99C8: zebra_ns_init (zebra_ns.c:214) ==58624== by 0x2B3FF2: main (main.c:401) ==58624== Zebra needs to pass back whether or not the ifp pointer was freed when if_delete_update is called and it should then check in ifan_read as well as ifm_read that the ifp pointer is still valid for use. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-24 20:48:24 -04:00
Donald Sharp	0081ab91e6	zebra: When handling unprocessed messages from kernel print usable string Add new debug output to show the string of the message type that is currently unhandled: 2022-03-24 18:30:15.284 [DEBG] zebra: [V3NSB-BPKBD] Kernel: 2022-03-24 18:30:15.284 [DEBG] zebra: [HDTM1-ENZNM] Kernel: message seq 792 2022-03-24 18:30:15.284 [DEBG] zebra: [MJD4M-0AAAR] Kernel: pid 594488, rtm_addrs {DST,GENMASK} 2022-03-24 18:30:15.285 [DEBG] zebra: [GRDRZ-0N92S] Unprocessed RTM_type: RTM_NEWMADDR(d) Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-24 20:07:00 -04:00
Donald Sharp	ceacdc7216	zebra: Don't send uninited data to kernel on FreeBSD When running zebra w/ valgrind, it was noticed that there was a bunch of passing uninitialized data to the kernel: ==38194== Syscall param ioctl(generic) points to uninitialised byte(s) ==38194== at 0x4CDF88A: ioctl (in /lib/libc.so.7) ==38194== by 0x49A4031: vrf_ioctl (vrf.c:860) ==38194== by 0x2AFE29: vrf_if_ioctl (ioctl.c:91) ==38194== by 0x2AFF39: if_get_mtu (ioctl.c:161) ==38194== by 0x2B12C3: ifm_read (kernel_socket.c:653) ==38194== by 0x2A7F76: interface_list (if_sysctl.c:129) ==38194== by 0x2E9958: zebra_ns_enable (zebra_ns.c:127) ==38194== by 0x2E9958: zebra_ns_init (zebra_ns.c:214) ==38194== by 0x2B3F82: main (main.c:401) ==38194== Address 0x7fc000967 is on thread 1's stack ==38194== in frame #3, created by if_get_mtu (ioctl.c:155) ==38194== ==38194== Syscall param ioctl(generic) points to uninitialised byte(s) ==38194== at 0x4CDF88A: ioctl (in /lib/libc.so.7) ==38194== by 0x49A4031: vrf_ioctl (vrf.c:860) ==38194== by 0x2AFE29: vrf_if_ioctl (ioctl.c:91) ==38194== by 0x2AFED9: if_get_metric (ioctl.c:143) ==38194== by 0x2B12CB: ifm_read (kernel_socket.c:655) ==38194== by 0x2A7F76: interface_list (if_sysctl.c:129) ==38194== by 0x2E9958: zebra_ns_enable (zebra_ns.c:127) ==38194== by 0x2E9958: zebra_ns_init (zebra_ns.c:214) ==38194== by 0x2B3F82: main (main.c:401) ==38194== Address 0x7fc000967 is on thread 1's stack ==38194== in frame #3, created by if_get_metric (ioctl.c:137) ==38194== ==38194== Syscall param ioctl(generic) points to uninitialised byte(s) ==38194== at 0x4CDF88A: ioctl (in /lib/libc.so.7) ==38194== by 0x49A4031: vrf_ioctl (vrf.c:860) ==38194== by 0x2AFE29: vrf_if_ioctl (ioctl.c:91) ==38194== by 0x2B052D: if_get_flags (ioctl.c:419) ==38194== by 0x2B1CF1: ifam_read (kernel_socket.c:930) ==38194== by 0x2A7F57: interface_list (if_sysctl.c:132) ==38194== by 0x2E9958: zebra_ns_enable (zebra_ns.c:127) ==38194== by 0x2E9958: zebra_ns_init (zebra_ns.c:214) ==38194== by 0x2B3F82: main (main.c:401) ==38194== Address 0x7fc000707 is on thread 1's stack ==38194== in frame #3, created by if_get_flags (ioctl.c:411) Valgrind is no longer reporting these issues. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-24 12:57:01 -04:00
anlan_cs	b83d220aa9	zebra: optimization on the mac addition for evpn-mh When `zebra_evpn_mac_svi_add()` adds one found mac by `zebra_evpn_mac_lookup()` and the found mac is without svi flag, then call `zebra_evpn_mac_svi_add()` to create one appropriate mac, but it will call `zebra_evpn_mac_lookup()` the second time. So lookup twice, the procedure is redundant. Just an optimization for it, make sure only lookup once. Modify `zebra_evpn_mac_gw_macip_add()` to check the `macp` parameter passed by caller, so it can distinguish whether really need lookup or not. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-24 22:31:50 +08:00
Sri Mohana Singamsetty	116f0dd905	Merge pull request #10726 from chiragshah6/fdev2 zebra: evpn revamp l3vni routermac db	2022-03-22 22:05:47 -07:00
Donatas Abraitis	ecf0ea4b00	Merge pull request #9953 from donaldsharp/system_route_replace zebra: Better handle replacing our route by a system route	2022-03-20 23:25:52 +02:00
Donald Sharp	0399a608e0	Merge pull request #10830 from anlancs/zebra-rb-remove zebra, bgpd: remove check returning value of RB_INSERT()	2022-03-20 14:32:49 -04:00
Donald Sharp	f2f2a16af4	zebra: Do not complain if deletion fails When issuing a RTM_DELETE operation and the kernel tells us that the route is already deleted, let's not complain about the situation: 2022/03/19 02:40:34 ZEBRA: [EC 100663303] kernel_rtm: 2a10:cc42:1d51::/48: rtm_write() unexpectedly returned -4 for command RTM_DELETE I can recreate this issue on freebsd by doing this: a) create a route using sharpd b) shutdown the nexthop's interface c) remove the route using sharpd This would also be true of pretty much any routing protocol's behavior. Let's just not complain about the situation if a RTM_DELETE operation is issued and FRR is told that the route does not exist to delete. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-19 07:44:54 -04:00
anlan_cs	2a778afe9d	zebra: remove check returning value of RB_INSERT() Since the `RB_INSERT()` is called after not found in RB tree, it MUST be ok and and return zero. The check of returning value of `RB_INSERT()` is redundant, just remove them. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-19 13:45:14 +08:00
Jafar Al-Gharaibeh	d6d0d718b0	Merge pull request #10806 from donaldsharp/dplane_fixup_for_lua zebra: Fixup lua with new dplane ops	2022-03-16 16:38:01 -05:00
Donald Sharp	60cd8d3b14	Merge pull request #10790 from anlancs/zebra-adjust-flag zebra: minor changes on "zebra_evpn_mac_gw_macip_add" function	2022-03-16 16:25:24 -04:00
Donald Sharp	105271792d	zebra: Fixup lua with new dplane ops Commit: `5d41413833` added 3 new dplane ops: DPLANE_OP_INTF_INSTALL DPLANE_OP_INTF_UPDATE DPLANE_OP_INTF_DELETE The build system does not build lua so zebra_script.c was not updated. Update of course! Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-16 15:10:54 -04:00
Russ White	5d97021ba3	Merge pull request #10427 from sworleys/Protodown-Reason-Upstream Add Support for Setting Protodown Reason Code	2022-03-15 19:58:16 -04:00
Sri Mohana Singamsetty	3d58538a75	Merge pull request #10770 from chiragshah6/evpn_dev3 zebra: evpn disable remove l2vni from l3vni list	2022-03-15 12:32:22 -07:00
Donald Sharp	2ea93eb023	Merge pull request #10580 from leonshaw/fix/link-ns zebra: Lookup linked interface in link netns	2022-03-15 15:23:18 -04:00
Donald Sharp	052b0eee2a	Merge pull request #10693 from anlancs/bgpd-add-check-ns zebra: use "assert" instead of unnecessary check	2022-03-15 08:27:44 -04:00
Donald Sharp	6c72dd869e	Merge pull request #10725 from opensourcerouting/zebra-fpm-crash-fix zebra: don't enqueue data with FPM socket closed	2022-03-14 08:27:10 -04:00
Donatas Abraitis	a9321141fc	Merge pull request #10731 from donaldsharp/multipath_output_in_zebra zebra: Multipath output	2022-03-14 13:38:08 +02:00
Rafael Zalamena	3b1caddd34	zebra: don't enqueue data with FPM socket closed It will trigger an assert while trying to schedule the next write. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2022-03-14 07:14:36 -03:00
Donald Sharp	7547d5288e	Merge pull request #10704 from anlancs/zebra-remove-check zebra: Remove unnecessary check	2022-03-13 10:17:13 -04:00
Donald Sharp	b74f72c1fb	zebra: prefixlen is not afi/safi dependant in encoding nexthops When encoding a response to the upper level protocol the prefixlen is not something that needs to be part of the switch statement for handling of a prefix. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-12 11:18:45 -05:00
Donald Sharp	06e4e90132	*: When matching against a nexthop send and process what it matched against Currently the nexthop tracking code is only sending to the requestor what it was requested to match against. When the nexthop tracking code was simplified to not need an import check and a nexthop check in `b8210849b8` for bgpd. It was not noticed that a longer prefix could match but it would be seen as a match because FRR was not sending up both the resolved route prefix and the route FRR was asked to match against. This code change causes the nexthop tracking code to pass back up the matched requested route (so that the calling protocol can figure out which one it is being told about ) as well as the actual prefix that was matched to. Fixes: #10766 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-12 11:18:45 -05:00
Donald Sharp	5c7861fe35	zebra: Remove unused ZEBRA_NHT_EXACT_MATCH This usage was removed in an earlier bit of code do some final cleanup Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-12 08:27:22 -05:00
anlan_cs	df44783078	zebra: use "assert" instead of unnecessary check Since `zvni_map_to_svi_ns()` is used to find and return one specific interface based on passed attributes of SVI, so the two parameters `in_param` and `p_ifp` must not be NULL. Passing NULL `p_ifp` makes no sense, so the check `if (p_ifp)` is unnecessary. So use `assert` to ensure the two parameters, and remove that unnecessary check. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-12 12:44:47 +08:00
Donald Sharp	04442fdbea	zebra: Add ECMP supported to `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-11 14:25:23 -05:00
Sri Mohana Singamsetty	1b387a2894	Merge pull request #10711 from anlancs/zebra-remove-flag zebra: remove unnecessary assignment	2022-03-11 11:08:51 -08:00
Chirag Shah	b27be4dbd7	zebra: evpn disable remove l2vni from l3vni list Upon 'no advertise-all-vni', cleanup l2vni from its tenant-vrf's l3vni list, instead of passed zvrf->l3vni which will not be present in case of default instance. Reviewed By: Testing Done: Before Fix: ---------- TORC12(config-router-af)# advertise-all-vni TORC12(config-router-af)# end TORC12# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 Vxlan-Intf: vni4001 State: Up Router MAC: 44:38:39:ff:ff:01 L2 VNIs: 134217728 0 1000 1002 <----- After Fix: ---------- TORC12# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 Vxlan-Intf: vni4001 State: Up Router MAC: 44:38:39:ff:ff:01 L2 VNIs: 1000 1002 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-10 19:59:33 -08:00
Chirag Shah	3d43b95ce1	zebra: cleanup host prefix from rmac Ticket:#2798406 Testing Done: Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-10 17:27:15 -08:00
Chirag Shah	4a8e182a66	zebra: print rmac nexthop list Ticket:#2798406 Reviewed By: Testing Done: Before change: -------------- TORS1# show evpn rmac vni 4001 mac 44:38:39:ff:ff:01 MAC: 44:38:39:ff:ff:01 Remote VTEP: 36.0.0.11 Refcount: 1 Prefixes: [1]:[00:00:00:00:00:00:00:00:00:00]:[::]/352 TORS1# TORS1# show evpn rmac vni 4001 mac 44:38:39:ff:ff:01 json { "routerMac":"44:38:39:ff:ff:01", "vtepIp":"36.0.0.11", "refCount":1, "localSequence":0, "remoteSequence":0, "prefixList":[ "[1]:[00:00:00:00:00:00:00:00:00:00]:[::]\/352" ] } After change: ------------- TORS1# show evpn rmac vni 4001 mac 44:38:39:ff:ff:01 MAC: 44:38:39:ff:ff:01 Remote VTEP: 36.0.0.11 Refcount: 0 Prefixes: TORS1# TORS1# show evpn rmac vni 4001 mac 44:38:39:ff:ff:01 json { "routerMac":"44:38:39:ff:ff:01", "vtepIp":"36.0.0.11", "nexthops":[ "36.0.0.11" ] } Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-10 17:27:15 -08:00
Chirag Shah	ae9e6beaea	zebra: remove host prefix mapping in rmac RMAC keeping list of nexthops to keep track of its existiance, remove the (old way) host prefix mapping. Ticket: #2798406 Reviewed By: Testing Done: TORS1# show evpn rmac vni 4001 mac 44:38:39:ff:ff:01 MAC: 44:38:39:ff:ff:01 Remote VTEP: 36.0.0.11 Refcount: 0 Prefixes: Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-10 17:27:15 -08:00
Chirag Shah	db88997872	zebra: maintain list of nhs in rmac db Keep the list of remote-vteps/nexthops in rmac db. Problem: In CLAG deployment there might be a situation where CLAG secondary sends individual ip as nexthop along with anycast mac as RMAC. This combination is updated in zebra's rmac cache. Upon recovery at clag secondary sends withdrawal of the incorrect rmac and nexthop mapping. The RMAC entry mapping to nh is not cleaned up properly in the zebra rmac cache. Fix: Zebra rmac db needs to maintain a list of nexthops. When a bgp withdrawal for rmac to nexthop mapping is received, remove the old nexthop from the rmac's nh list and if the host reference still remains for the RMAC,fall back to the nexthop one remaining in the list. At most you expect two nexthops mapped to RMAC (in clag deployment). Ticket: 2798406 Reviewed By: Testing Done: CLAG primary and secondary have advertise-pip enabled advertise type-5 route (default route) with individual IP as nh and individual svi mac as rmac. - disable advertise pip on both clag devices, this results in advertisement of routes with anycast ip as nh and anycast mac as rmac. - disable peerlink on clag primary, this triggers clag secondary to (transitory) send bgp update with individual ip as nh and anycast mac as rmac. - At the remote vtep: Check the zebra's rmac cache/nh mapping correctly and points to anycast rmac and anycast ip as nh of the clag system. Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-10 17:27:15 -08:00
Stephen Worley	47c1d76a6c	zebra: cleanup protodown netlink logs Cleanup the logs in the netlink code for setting protodown on/off to be more useful to a user parsing them after an issue. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:45 -05:00
Stephen Worley	7140b00cb0	zebra: use SET/UNSET/CHECK/COND in protodown code Use the SET/UNSET/CHECK/COND macros for flag bifields where appropriate throught the protodown code base. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	26a64ff9ca	zebra: include old reason in evpn-mh bond update Ensure we include the old reason when we are updating the reason code for a evpn-mh bond member. Now that this is a common API it could include things external to EVPN in this reason code bitfield (ex: vrrp). Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	ed7a1622c3	zebra: make netlink protodown func more readable Make the netlink protodown static function for checking if the only bit set for protodown reason is FRR's more easily readable to someone not familiar with the code. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	00d57e6dd5	zebra: clear dplane flags on failure for protodown Make sure we clear our dplane flags for SET/UNSET on failure so that we try again. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	a3ea8493a8	zebra: simplify reason code printing in show Simplify the code for printing the reason codes via show command. Just remove the trailing comma last before printing. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	3db9f2db2b	zebra: cleanup protodown api logs Cleanup the logs in the api for setting protodown on/off that zapi and others use. Make them more useful to a user parsing them after an issue. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	ddfea8a233	zebra: wrap macro zif argument in parans Missing parenthesis for macro ZIF argument. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	b1da028e45	zebra: avoid initialization in ctx_intf_init Avoid initialization in dplane_ctx_intf_init() so the compiler can warn us about using unintialized data. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	4b82b95488	zebra: evpn-mh bonds protodown check for set When we are processing a bond member's protodown we get from the dataplane, check to make sure we haven't already queued up a set. If we have, it's likely this is just a notification we get from the kernel after we set protodown and before we have processed the result in our dplane pthread. This change is needed now that we set protodown via the dplane pthread. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	cb5b31f5d0	zebra: evpn-mh use protodown update reason api When setting the protodown reason use the update api where we can directly update the entire reason bitfield since we have to set more than one. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	321c80d6b2	zebra: extern setting protodown reason directly Extern the api for setting the protodown reason code bitfield directly. Some places may want to completely update the bitfield with more than one reason at a time. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	ab465d24bd	zebra: only clear pd_reason on shutdown/sweep Only clear protodown reason on shutdown/sweep, retain protodown state. This is to retain traditional and expected behavior with daemons like vrrpd setting protodown. They expet it to be set on shutdown and retained on bring up to prevent traffic from being dropped. We must cleanup our reason code though to prevent us from blocking others. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	97c7263373	zebra: add boilerplate protodown updates for *bsd Add boilerplate for someone to come and add protdown updates for bsd platforms if it ever exists. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	e4d87b5894	zebra: remove old protodown dplane path Remove the old protodown dplane path install on the main thread. This is now dead code. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	d89b300829	zebra: use a macro for check protodown Add a helper macro for checking if interface is protodown, typing this out is annoying. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:44 -05:00
Stephen Worley	0dcd8506f2	zebra: clear protodown_rc on shutdown and sweep Add functionality to clear any reason code set on shutdown of zebra when we are freeing the interface, in case a bad client didn't tell us to clear it when the shutdown. Also, in case of a crash or failure to do the above, clear reason on startup if it is set. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 18:02:42 -05:00
Stephen Worley	71ef5cbb95	zebra: add enum set/unset states for queing Add enums for set/unset of prodown state to handle the mainthread knowing an update is already queued without actually marking it as complete. This is to make the logic confirm a bit more with other parts of the code where we queue dplane updates and not update our internal structs until success callback is received. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 17:52:44 -05:00
Stephen Worley	c40e1b1cfb	zebra: add command for setting protodown bit Add command for use to set protodown via frr.conf in the case our default conflicts with another application they are using. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 17:52:44 -05:00
Stephen Worley	5d41413833	zebra: add support for protodown reason code Add support for setting the protodown reason code. `829eb208e8` These patches handle all our netlink code for setting the reason. For protodown reason we only set `frr` as the reason externally but internally we have more descriptive reasoning available via `show interface IFNAME`. The kernel only provides a bitwidth of 32 that all userspace programs have to share so this makes the most sense. Since this is new functionality, it needs to be added to the dplane pthread instead. So these patches, also move the protodown setting we were doing before into the dplane pthread. For this, we abstract it a bit more to make it a general interface LINK update dplane API. This API can be expanded to support gernal link creation/updating when/if someone ever adds that code. We also move a more common entrypoint for evpn-mh and from zapi clients like vrrpd. They both call common code now to set our internal flags for protodown and protodown reason. Also add debugging code for dumping netlink packets with protodown/protodown_reason. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2022-03-09 17:52:44 -05:00
Russ White	82aca4ae4f	Merge pull request #10662 from chiragshah6/evpn_dev1 zebra: netlink protodown event handling for vxlan device	2022-03-09 16:37:30 -05:00
Mark Stapp	bc5302b1b1	Merge pull request #10635 from anlancs/staticd-cross zebra: let same host route cross VRF	2022-03-09 11:05:59 -05:00
David Lamparter	e3c54a9383	Merge pull request #10079 from mjstapp/fix_intf_del_nhgs	2022-03-09 10:15:00 +01:00
anlan_cs	3f04f9cf24	zebra: let /32 host route with same IP cross VRF Contraints of host routes are too strict in current code: Host routes with same destination address and nexthop address are forbidden even when cross VRFs. Currently host routes with different destination and nexthop address can cross VRFs, it is ok. But host routes with same addresses are forbidden to cross VRFs, it is wrong. Since different VRFs can have the same addresses, leak specific host route with the same nexthop address ( it means destination address is same to nexthop address ) to other VRFs is a normal case. This commit relaxes that contraints. Host routes with same destination address and nexthop address are forbidden only when not cross VRFs. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-09 07:22:11 +08:00
Mark Stapp	2472d3e876	zebra: shutdown doesn't uninstall zebra's NHGs When an interface goes down, it signals any related NHGs to re-validate themselves. During zebra shutdown, ensure we remove any NHGs we've installed. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-03-08 16:16:55 -05:00
Russ White	5e412c5e73	Merge pull request #10722 from chiragshah6/evpn_dev3 zebra: fix crash in evpn neigh cleanup all	2022-03-08 11:00:29 -05:00
anlan_cs	c2fd85a854	zebra: remove unnecessary assignment In `zebra_evpn_neigh_gw_macip_add()`, it sets `mac->flags` to "ZEBRA_MAC_DEF_GW" for "advertise-default-gw" mode. But this set is redundant because this "mac" is already set by `zebra_evpn_mac_gw_macip_add()`. So remove this redundant assignment. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-08 22:58:22 +08:00
David Lamparter	378260fb65	zebra: remove unused variable clang complains "variable 'curr_length' set but not used". Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-03-07 17:37:27 +01:00
anlan_cs	38eda16a24	zebra: Delay the usage of one variable until need In the loop, local variable `ip` is always set even if the check condition is not satisfied. Avoid the redundant set, move this set exactly after the check condition is satisfied. Set `ip` only if the check condition is met, otherwise needn't. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-03-05 06:57:35 +08:00
Chirag Shah	d5fdae8f45	zebra: fix crash in evpn neigh cleanup all zebra crash is seen during shutdown (frr restart). During shutdown, remote neigh and remote mac clean up is triggered first, followed by per vni all neigh (including local) and macs cleanup is triggered. The crash occurs when a remote mac is cleaned up first and its reference is remained in local neigh. When local neigh attempt removes itself from its associated mac's neigh_list it triggers inaccessible memory crash. The fix is during mac deletion if its neigh_list is non-empty then retain the MAC in AUTO state. This can arise when MAC and neigh duo are in different state (remote/local). Otherwise, the order of cleanup operation is neighs followed by macs. The auto mac will be cleaned up when per vni all neighs and macs are cleaned up. Ticket:CM-29826 Reviewed By:CCR-10369 Testing Done: Configure evpn symmetric config where MAC is in remote state and neigh is in local state. Perform frr restart then crash is not seen. Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-03 14:59:56 -08:00
Donald Sharp	8b48cdb913	zebra: Prevent installation of connected multiple times With recent changes to interface up mechanics in if_netlink.c FRR was receiving as many as 4 up events for an interface on ifdown/ifup events. This was causing timing issues in FRR based upon some fun timings. Remove this from happening. Ticket: CM-31623 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-03-02 18:34:32 -08:00
Chirag Shah	d78fa57195	zebra: protodown-up event trigger interface up Vxlan interfaces flap (protodown/up) event, non ptm operative interfaces do not come up as protodown up event do not trigger "if_up()" event. Ticket:CM-30477 Reviewed By:CCR-10681 Testing Done: validated interfaces flaps, ip link down, ifdown and protodown followed by UP event. all Vxlan interfaces come up in bgpd post flap. Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-02 18:16:56 -08:00
Chirag Shah	2d04bd98ac	zebra: handle protodown netlink for vxlan device Frr need to handle protocol down event for vxlan interface. In MLAG scenario, one of the pair switch can put vxlan port to protodown state, followed by tunnel-ip change from anycast IP to individual IP. In absence of protodown handling, evpn end up advertising locally learn EVPN (MAC-IP) routes with individual IP as nexthop. This leads an issue of overwriting locally learn entries as remote on MLAG pair. Ticket:CM-24545 Reviewed By:CCR-10310 Testing Done: In EVPN deployment, restart one of the MLAG daemon, which puts vxlan interfaces in protodown state. FRR treats protodown as oper down for vxlan interfaces. VNI down cleans up/withdraws locally learn routes. Followed by vxlan device UP event, re-advertise locally learn routes. Signed-off-by: Chirag Shah <chirag@nvidia.com>	2022-03-02 17:40:44 -08:00
David Lamparter	2821405a69	Merge pull request #10640 from donaldsharp/thread_timers	2022-03-01 11:45:36 +01:00
Jafar Al-Gharaibeh	868efb9e9f	Merge pull request #10672 from donaldsharp/bsd_zebra_graceful_restart_cleanup Bsd zebra graceful restart cleanup	2022-02-28 14:57:35 -06:00
Donald Sharp	45dafca86c	zebra: Use the routes vrf not the vrf of the nexthop for route-map application When a end operator is doing cross vrf imports in bgp: router bgp 3239 vrf FOO address-family ipv4 uni import vrf BAR ! and zebra has this configuration: vrf FOO ip protocol bgp route-map EVA ! The current code in zebra_nhg.c was looking up the vrf of the nexthop and attempting to apply the ip protocol route-map. For most people the nexthop vrf and the re vrf are one and the same so they never see a problem. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-28 13:08:01 -05:00
David Lamparter	b9db469fe9	Merge pull request #10667 from donaldsharp/bufsize	2022-02-28 15:56:51 +01:00
Donald Sharp	73d3197c73	zebra: Get zebra graceful restart working when restarting on BSD Upon restart zebra reads in the kernel state. Under linux there is a mechanism to read the route and convert the protocol to the correct internal FRR protocol to allow the zebra graceful restart efforts to work properly. Under BSD I do not see a mechanism to convey the original FRR protocol into the kernel and thus back out of it. Thus when zebra crashes ( or restarts ) the routes read back in are kernel routes and are effectively lost to the system and FRR cannot remove them properly. Why? Because FRR see's kernel routes as routes that it should not own and in general the admin distance for those routes will be a better one than the admin distance from a routing protocol. This is even worse because when the graceful restart timer pops and rib_sweep is run, FRR becomes out of sync with the state of the kernel forwarding on BSD. On restart, notice that the route is a self route that there is no way to know it's originating protocol. In this case let's set the protocol to ZEBRA_ROUTE_STATIC and set the admin distance to 255. This way when an upper level protocol reinstalls it's route the general zebra graceful restart code still works. The high admin distance allows the code to just work in a way that is graceful( HA! ) The drawback here is that the route shows up as a static route for the time the system is doing it's work. FRR could introduce another* route type but this seems like a bad idea and the STATIC route type is loosely analagous to the type of route it has become. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-28 09:50:35 -05:00
Donald Sharp	16d91fce15	zebra: Prevent crash if ZEBRA_ROUTE_ALL is used for a route type FRR will crash when the re->type is a ZEBRA_ROUTE_ALL and it is inserted into the meta-queue. Let's just put some basic code in place to prevent a crash from happening. No routing protocol should be using ZEBRA_ROUTE_ALL as a value but bugs do happen. Let's just accept the weird route type gracefully and move on. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-28 09:50:35 -05:00
Donald Sharp	fbc83b9a10	zebra: Limit speed lookup to at most 4 minutes There exists some interface types that are slow on startup to fully register their link speed. Especially those that are working with an asic backend. The speed_update timer associated with each interface would keep trying if the system returned a MAX_UINT32 as the speed. This speed means both unknown or there is none under linux. Since some interface types are slow on startup let's modify FRR to try for at most 4 minutes and give up trying on those interfaces where we never get any useful data. Why 4 minutes? I wanted to balance the time associated with slow interfaces coming up with those that will never give us a value. So I choose 4 minutes as a good ballpark of time to keep trying Why not track all those interfaces and just not attempt to do the speed lookup? I would prefer to not keep track of these as that I do not know all the interface types, nor do I wish to keep programming as new ones come in. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-28 06:39:07 -05:00
Xiao Liang	86bad1cb6e	zebra: Lookup linked interface in link netns Look up linked interface in the correct netns, otherwise, either a wrong interface or NULL would be used. For example, enable VRF netns backend, and: ip netns add ns1 ip link add link eth0 link1 type macvlan ip link set link1 netns ns1 up Zebra will crash in zebra_vxlan_macvlan_up because zif->link is NULL. Signed-off-by: Xiao Liang <shaw.leon@gmail.com>	2022-02-28 12:12:15 +08:00
Donald Sharp	9fb83b5506	zebra: Allow BSD to specify a receive buffer size End operator is reporting that they are receiving buffer overruns when attempting to read from the kernel receive socket. It is possible to adjust this size to more modern levels especially for when the system is under load. Modify the code base so that BSD operators can use the zebra `-s XXX` option to specify a read buffer. Additionally setup the default receive buffer size on *BSD to be 128k instead of the 8k so that FRR does not run into this issue again. Fixes: #10666 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-27 07:47:58 -05:00
Donald Sharp	ae45a63022	Merge pull request #10669 from anlancs/bgpd-line *: Add necessary new line for output of vty_out()	2022-02-27 07:43:28 -05:00
anlan_cs	4d4c404bf6	*: Add necessary new line for output of vty_out() Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-02-27 10:59:19 +08:00
Mark Stapp	cd787a8a45	zebra: use dataplane to read interface NETCONF info Use the dataplane to query and read interface NETCONF data; add netconf-oriented data to the dplane context object, and add accessors for it. Add handler for incoming update processing. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 10:18:32 -05:00
Mark Stapp	728f2017ae	zebra: add dplane type for NETCONF data Add a new dplane op for interface NETCONF data; add the new enum value to several switch statements. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 09:53:02 -05:00
Mark Stapp	d4bcd88d8a	zebra: avoid default clause in FPM switch Avoid default clause in a switch in the FPM module that handles dplane op codes - include all the codes. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 09:53:02 -05:00
Mark Stapp	9f3f1486c8	zebra: add xxxNETCONF messages to the netlink BPF filter Allow self-produced xxxNETCONF netlink messages through the BPF filter we use. Just like address-configuration actions, we'll process NETCONF changes in one path, whether the changes were generated by zebra or by something else in the host OS. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 09:53:02 -05:00
Mark Stapp	777f96503e	zebra: add netlink debug dump for netconf messages Add the RTM_NETCONF messages to the detailed netlink message dump module. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 09:53:02 -05:00
Mark Stapp	b6beb70047	zebra: include mpls enabled status in interface output Add mpls status to the zebra interface struct; include mpls status in show interface output. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 09:53:02 -05:00
Donald Sharp	ebb61fcaf5	zebra: Start of work to get data about mpls from kernel a) We'll need to pass the info up via some dataplane control method (This way bsd and linux can both be zebra agnostic of each other) b) We'll need to modify `struct interface *` to track this data and when it changes to notify upper level protocols about it. c) Work is needed to dump the entire mpls state at the start so we can gather interface state. This should be done after interface data gathering from the kernel. Signed-off-by: Donald Sharp <sharpd@nvidia.com> Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-25 09:53:02 -05:00
Christian Hopps	7bf63db79b	Merge pull request #10632 from donaldsharp/thread_return_null *: Change thread->func to return void instead of int	2022-02-24 01:43:48 -05:00
Donald Sharp	cc9f21da22	*: Change thread->func to return void instead of int The int return value is never used. Modify the code base to just return a void instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-23 19:56:04 -05:00
vdhingra	1b1e934fac	zebra: Nexthop tracking, route resolution recursive lookup Description: =========== Change is intended for fixing the NHT resolution logic. While recursively resolving nexthop, keep looking for a valid/useable route in the rib, by not stopping at the first/most-specific route in the rib. Consider the following set of events taking place on R1: R1(config)# ip route 2.2.2.0/24 ens192 R1# sharp watch nexthop 2.2.2.32 connected R1# show ip nht 2.2.2.32(Connected) resolved via static is directly connected, ens192 Client list: sharp(fd 33) -2.2.2.32 NHT is resolved over the above valid static route. R1# sharp install routes 2.2.2.32 nexthop 2.2.2.32 1 R1# 2.2.2.32(Connected) resolved via static is directly connected, ens192 Client list: sharp(fd 33) -.32/32 comes which is going to resolve through itself, but since this is an invalid route, it will be marked as inactive and will not affect the NHT. R1# sharp install routes 2.2.2.31 nexthop 2.2.2.32 1 R1# 2.2.2.32(Connected) unresolved(Connected) Client list: sharp(fd 50) -Now a .31/32 comes which will resolve over .32 route, but as per the current logic, this will trigger the NHT check, in turn making the NHT unresolved. -With fix, NHT should stay in resolved state as long as the valid static or connected route stays installed Fix: ==== -While resolving nexthops, walk up the tree from the most-specific match, walk up the tree without any ZEBRA_NHT_CONNECTED check. Co-authored-by: Vishal Dhingra <vdhingra@vmware.com> Co-authored-by: Kantesh Mundaragi <kmundaragi@vmware.com> Signed-off-by: Iqra Siddiqui <imujeebsiddi@vmware.com>	2022-02-22 09:28:00 -08:00
anlan_cs	5ff58d0a47	zebra: minor changes on "zebra_evpn_mac_gw_macip_add" function Two minor changes: 1) Change `zebra_evpn_mac_gw_macip_add()` 's return type to `void`. 2) Since `zebra_evpn_mac_gw_macip_add()` has already `assert` the returned `mac`, the check of its return value makes no sense. And keep setting `mac->flags` inside `zebra_evpn_mac_gw_macip_add()` is more reasonable. So just move the setting `mac->flags` inside `zebra_evpn_mac_gw_macip_add()`. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-02-18 22:49:47 -05:00
Donald Sharp	7f6ff7a3d3	Merge pull request #10557 from alexk99/zebra-fpm-multihop-weight Zebra FPM: don't lose next hop weights while exporting via FPM	2022-02-17 09:41:52 -05:00
Russ White	c131015905	Merge pull request #10547 from donaldsharp/10458 zebra: Keep the interface flags safe on multiple ioctl calls	2022-02-16 19:20:47 -05:00
Jafar Al-Gharaibeh	76d8e1a4a7	Merge pull request #10561 from mjstapp/nlsock_hash_lock zebra: make netlink object hash threadsafe	2022-02-16 13:11:21 -06:00
Donald Sharp	b9d95135a8	zebra: Fix spelling mistake Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-14 12:56:44 -05:00
Mark Stapp	348698095d	zebra: make netlink object hash threadsafe The recently-added hashtable of nlsock objects needs to be thread-safe: it's accessed from the main and dplane pthreads. Add a mutex for it, use wrapper apis when accessing it. Add a per-OS init/terminate api so we can do init that's not per-vrf or per-namespace. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-11 17:03:26 -05:00
Trey Aspelund	e54cd97838	zebra: cleanup multiline strings in debug_nl.c NetDEF CI has been whining about multiline string style. Make the strings single-line and call it a day. Signed-off-by: Trey Aspelund <taspelund@nvidia.com>	2022-02-10 21:37:45 +00:00
Trey Aspelund	95fe32880f	zebra: add netlink debugs for ip rules Adds functions to parse + decode netlink rules. Adds RTM_NEWRULE + RTM_DELRULE to "debug zebra kernel". Signed-off-by: Trey Aspelund <taspelund@nvidia.com>	2022-02-10 21:36:34 +00:00
kiselev99@gmail.com	eca3256db8	zebra: FPM next hop weights Don't lose next hop weights while exporting via FPM Signed-off-by: Alex Kiselev <alex@bisonrouter.com>	2022-02-10 19:16:33 +03:00
Rafael Zalamena	70d79c359b	Merge pull request #10537 from mjstapp/fix_dplane_strdup zebra: use frr mem apis in dplane	2022-02-10 10:24:22 -03:00
Bijan	16dca7cec5	zebra: Keep the interface flags safe on multiple ioctl calls Trying to call multiple ioctl calls on ifreq will result in overwriting ifreq with garbage data. On if_get_flags call, try to keep the flags field safe from another possible ioctl call before applying the flags field. Modified code as per Code Review, done by Donald Sharp. Signed-off-by: Bijan <bijanebrahimi@riseup.net>	2022-02-09 10:07:47 -05:00
Donald Sharp	2cf7651f0b	zebra: Make netlink buffer reads resizeable when needed Currently when the kernel sends netlink messages to FRR the buffers to receive this data is of fixed length. The kernel, with certain configurations, will send netlink messages that are larger than this fixed length. This leads to situations where, on startup, zebra gets really confused about the state of the kernel. Effectively the current algorithm is this: read up to buffer in size while (data to parse) get netlink message header, look at size parse if you can The problem is that there is a 32k buffer we read. We get the first message that is say 1k in size, subtract that 1k to 31k left to parse. We then get the next header and notice that the length of the message is 33k. Which is obviously larger than what we read in. FRR has no recover mechanism nor is there a way to know, a priori, what the maximum size the kernel will send us. Modify FRR to look at the kernel message and see if the buffer is large enough, if not, make it large enough to read in the message. This code has to be per netlink socket because of the usage of pthreads. So add to `struct nlsock` the buffer and current buffer length. Growing it as necessary. Fixes: #10404 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-08 17:28:19 -05:00
Donald Sharp	d4000d7ba3	zebra: Remove `struct nlsock` from dataplane information and use `int fd` Store the fd that corresponds to the appropriate `struct nlsock` and pass that around in the dplane context instead of the pointer to the nlsock. Modify the kernel_netlink.c code to store in a hash the `struct nlsock` with the socket fd as the key. Why do this? The dataplane context is used to pass around the `struct nlsock` but the zebra code has a bug where the received buffer for kernel netlink messages from the kernel is not big enough. So we need to dynamically grow the receive buffer per socket, instead of having a non-dynamic buffer that we read into. By passing around the fd we can look up the `struct nlsock` that will soon have the associated buffer and not have to worry about `const` issues that will arise. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-08 17:28:19 -05:00
Donald Sharp	3670f5047c	zebra: Store the sequence number to use as part of the dp_info Store and use the sequence number instead of using what is in the `struct nlsock`. Future commits are going away from storing the `struct nlsock` and the copy of the nlsock was guaranteeing unique sequence numbers per message. So let's store the sequence number to use instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-08 17:28:19 -05:00
Mark Stapp	b6b6e59c6e	zebra: use frr mem apis Replace a couple of strdup/free with XSTRDUP/XFREE. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-08 15:57:57 -05:00
Russ White	1a8a7016a6	Merge pull request #9066 from donaldsharp/ships_in_the_night zebra: Fix ships in the night issue	2022-02-08 14:41:01 -05:00
Igor Ryzhov	60cda04dda	*: use ipaddr_cmp instead of memcmp Using memcmp is wrong because struct ipaddr may contain unitialized padding bytes that should not be compared. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2022-02-08 20:31:34 +03:00
Russ White	e735c8073c	Merge pull request #9649 from proelbtn/add-support-for-end-dt4 add support for SRv6 IPv4 L3VPN	2022-02-08 08:30:02 -05:00
Donald Sharp	ce649b9d11	zebra: Abstract nhg deletion to reduce code duplication Reduce code duplication when we are cleaning up nexthop groups. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-07 16:10:36 -05:00
Donald Sharp	c6eee91f66	zebra: Fix ships in the night issue When using wait for install there exists situations where zebra will issue several route change operations to the kernel but end up in a state where we shouldn't be at the end due to extra data being received. Example: a) zebra receives from bgp a route change, installs sends the route to the kernel. b) zebra receives a route deletion from bgp, removes the struct route entry and then sends to the kernel a deletion. c) zebra receives an asynchronous notification that (a) succeeded but we treat this as a new route. This is the ships in the night problem. In this case if we receive notification from the kernel about a route that we know nothing about and we are not in startup and we are doing asic offload then we can ignore this update. Ticket: #2563300 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-07 16:10:03 -05:00
Donald Sharp	81ef8a69ae	zebra: Use AF_UNSPEC instead of setting to 0 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-07 13:22:41 -05:00
anlan_cs	97511d01af	zebra: Remove unnecessary check Since `assert` is already done, just remove these unnecessary check. Signed-off-by: anlan_cs <vic.lan@pica8.com>	2022-02-06 20:28:31 -05:00
Jafar Al-Gharaibeh	4333379fca	Merge pull request #9926 from donaldsharp/update_issues zebra: Fix v6 route replace failure turned into success	2022-02-04 19:40:55 -06:00
Jafar Al-Gharaibeh	2da1428ab2	Merge pull request #10501 from donaldsharp/more_zebra_show More zebra show	2022-02-04 15:13:45 -06:00
Donald Sharp	c8453cd77e	zebra: Fix v6 route replace failure turned into success Currently when we have a route replace operation for v6 routes with a new nexthop group the order of kernel installation is this: a) New nexthop group insertion seq 1 b) Route delete operation seq 3 c) Route insertion operation seq 2 Currently the code in nl_batch_read_resp is attempting to handle this situation by skipping the delete operation. BUT it is enqueuing the context into the zebra dplane queue before we read the response. Since we create the ctx with an implied success, success is being reported to the upper level dplane and the zebra rib thinks the route has been properly handled. This is showing up in the zebra_seg6_route test code because the test code is installing a seg6 route w/ sharpd and it is failing to install because the route's nexthop is rejected: First installation: 2021/10/29 09:28:10.218 ZEBRA: [JGWSB-SMNVE] dplane: incoming new work counter: 2 2021/10/29 09:28:10.218 ZEBRA: [Q52A7-211QJ] dplane enqueues 2 new work to provider 'Kernel' 2021/10/29 09:28:10.218 ZEBRA: [JVY1P-93VFY] dplane provider 'Kernel': processing 2021/10/29 09:28:10.218 ZEBRA: [TX9N0-9JKDF] ID (9) Dplane nexthop update ctx 0x56125390a820 op NH_INSTALL 2021/10/29 09:28:10.218 ZEBRA: [PM9ZJ-07RCP] 0:1::1/128 Dplane route update ctx 0x56125390add0 op ROUTE_INSTALL 2021/10/29 09:28:10.218 ZEBRA: [TJ327-ET8HE] netlink_send_msg: >> netlink message dump [sent] 2021/10/29 09:28:10.218 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=104 type=(104) NEWNEXTHOP flags=(0x0501) {REQUEST,DUMP,(ROOT\|REPLACE\|CAPPED),(ATOMIC\|CREATE)} seq=9 pid=3539131282] 2021/10/29 09:28:10.218 ZEBRA: [WCX94-SW894] nhm [family=(10) AF_INET6 scope=(0) UNIVERSE protocol=(11) ZEBRA flags=0x00000000 {}] 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(1) ID] 2021/10/29 09:28:10.218 ZEBRA: [Z4E9C-GD9EP] 9 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=20 (payload=16) type=(6) GATEWAY] 2021/10/29 09:28:10.218 ZEBRA: [STTSM-27M81] 2001::1 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(5) OIF] 2021/10/29 09:28:10.218 ZEBRA: [JR4EA-BKPTA] 6 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=6 (payload=2) type=(7) ENCAP_TYPE] 2021/10/29 09:28:10.218 ZEBRA: [JR4EA-BKPTA] 5 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=36 (payload=32) type=(32776) UNKNOWN] 2021/10/29 09:28:10.218 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=64 type=(24) NEWROUTE flags=(0x0401) {REQUEST,(ATOMIC\|CREATE)} seq=10 pid=3539131282] 2021/10/29 09:28:10.218 ZEBRA: [GCEGC-W8YBF] rtmsg [family=(10) AF_INET6 dstlen=128 srclen=0 tos=0 table=254 protocol=(194) UNKNOWN scope=(0) UNIVERSE type=(1) UNICAST flags=0x0000 {}] 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=20 (payload=16) type=(1) DST] 2021/10/29 09:28:10.218 ZEBRA: [STTSM-27M81] 1::1 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(6) PRIORITY] 2021/10/29 09:28:10.218 ZEBRA: [Z4E9C-GD9EP] 20 2021/10/29 09:28:10.218 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(30) NH_ID] 2021/10/29 09:28:10.218 ZEBRA: [Z4E9C-GD9EP] 9 2021/10/29 09:28:10.218 ZEBRA: [V8KNF-8EXH8] netlink_recv_msg: << netlink message dump [recv] 2021/10/29 09:28:10.218 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=76 type=(2) ERROR flags=(0x0300) {DUMP,(ROOT\|REPLACE\|CAPPED),(MATCH\|EXCLUDE\|ACK_TLVS)} seq=9 pid=3539131282] 2021/10/29 09:28:10.218 ZEBRA: [KWP1C-6CSXF] nlmsgerr [error=(-22) Invalid argument] 2021/10/29 09:28:10.218 ZEBRA: [HSYZM-HV7HF] Extended Error: Gateway can not be a local address 2021/10/29 09:28:10.218 ZEBRA: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWNEXTHOP(104), seq=9, pid=3539131282 2021/10/29 09:28:10.218 ZEBRA: [V8KNF-8EXH8] netlink_recv_msg: << netlink message dump [recv] 2021/10/29 09:28:10.218 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=68 type=(2) ERROR flags=(0x0300) {DUMP,(ROOT\|REPLACE\|CAPPED),(MATCH\|EXCLUDE\|ACK_TLVS)} seq=10 pid=3539131282] 2021/10/29 09:28:10.218 ZEBRA: [KWP1C-6CSXF] nlmsgerr [error=(-22) Invalid argument] 2021/10/29 09:28:10.218 ZEBRA: [HSYZM-HV7HF] Extended Error: Nexthop id does not exist 2021/10/29 09:28:10.218 ZEBRA: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWROUTE(24), seq=10, pid=3539131282 2021/10/29 09:28:10.218 ZEBRA: [VCDW6-A7ZF1] dplane dequeues 2 completed work from provider Kernel 2021/10/29 09:28:10.218 ZEBRA: [JTWAB-1MH4Y] dplane has 2 completed, 0 errors, for zebra main 2021/10/29 09:28:10.218 ZEBRA: [J7K9Z-9M7DT] Nexthop dplane ctx 0x56125390a820, op NH_INSTALL, nexthop ID (9), result FAILURE 2021/10/29 09:28:10.218 ZEBRA: [P2XBZ-RAFQ5][EC 4043309074] Failed to install Nexthop ID (9) into the kernel 2021/10/29 09:28:10.218 ZEBRA: [RMK34-61HV5] default(0:254):1::1/128 Processing dplane result ctx 0x56125390add0, op ROUTE_INSTALL result FAILURE Note the last line `op ROUTE_INSTALL result FAILURE` because we are attempting to use a a gw nexthop that is local. This is the result. Then the test code was installing the route again: 2021/10/29 09:30:00.493 ZEBRA: [JGWSB-SMNVE] dplane: incoming new work counter: 2 2021/10/29 09:30:00.493 ZEBRA: [Q52A7-211QJ] dplane enqueues 2 new work to provider 'Kernel' 2021/10/29 09:30:00.493 ZEBRA: [JVY1P-93VFY] dplane provider 'Kernel': processing 2021/10/29 09:30:00.493 ZEBRA: [TX9N0-9JKDF] ID (9) Dplane nexthop update ctx 0x561253916a00 op NH_INSTALL 2021/10/29 09:30:00.493 ZEBRA: [PM9ZJ-07RCP] 0:1::1/128 Dplane route update ctx 0x561253915f40 op ROUTE_UPDATE 2021/10/29 09:30:00.493 ZEBRA: [TJ327-ET8HE] netlink_send_msg: >> netlink message dump [sent] 2021/10/29 09:30:00.493 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=104 type=(104) NEWNEXTHOP flags=(0x0501) {REQUEST,DUMP,(ROOT\|REPLACE\|CAPPED),(ATOMIC\|CREATE)} seq=11 pid=3539131282] 2021/10/29 09:30:00.493 ZEBRA: [WCX94-SW894] nhm [family=(10) AF_INET6 scope=(0) UNIVERSE protocol=(11) ZEBRA flags=0x00000000 {}] 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(1) ID] 2021/10/29 09:30:00.493 ZEBRA: [Z4E9C-GD9EP] 9 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=20 (payload=16) type=(6) GATEWAY] 2021/10/29 09:30:00.493 ZEBRA: [STTSM-27M81] 2001::1 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(5) OIF] 2021/10/29 09:30:00.493 ZEBRA: [JR4EA-BKPTA] 6 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=6 (payload=2) type=(7) ENCAP_TYPE] 2021/10/29 09:30:00.493 ZEBRA: [JR4EA-BKPTA] 5 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=36 (payload=32) type=(32776) UNKNOWN] 2021/10/29 09:30:00.493 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=56 type=(25) DELROUTE flags=(0x0401) {REQUEST,(ATOMIC\|CREATE)} seq=13 pid=3539131282] 2021/10/29 09:30:00.493 ZEBRA: [GCEGC-W8YBF] rtmsg [family=(10) AF_INET6 dstlen=128 srclen=0 tos=0 table=254 protocol=(194) UNKNOWN scope=(0) UNIVERSE type=(0) UNSPEC flags=0x0000 {}] 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=20 (payload=16) type=(1) DST] 2021/10/29 09:30:00.493 ZEBRA: [STTSM-27M81] 1::1 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(6) PRIORITY] 2021/10/29 09:30:00.493 ZEBRA: [Z4E9C-GD9EP] 20 2021/10/29 09:30:00.493 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=64 type=(24) NEWROUTE flags=(0x0401) {REQUEST,(ATOMIC\|CREATE)} seq=12 pid=3539131282] 2021/10/29 09:30:00.493 ZEBRA: [GCEGC-W8YBF] rtmsg [family=(10) AF_INET6 dstlen=128 srclen=0 tos=0 table=254 protocol=(194) UNKNOWN scope=(0) UNIVERSE type=(1) UNICAST flags=0x0000 {}] 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=20 (payload=16) type=(1) DST] 2021/10/29 09:30:00.493 ZEBRA: [STTSM-27M81] 1::1 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(6) PRIORITY] 2021/10/29 09:30:00.493 ZEBRA: [Z4E9C-GD9EP] 20 2021/10/29 09:30:00.493 ZEBRA: [KFBSR-XYJV1] rta [len=8 (payload=4) type=(30) NH_ID] 2021/10/29 09:30:00.493 ZEBRA: [Z4E9C-GD9EP] 9 2021/10/29 09:30:00.493 ZEBRA: [V8KNF-8EXH8] netlink_recv_msg: << netlink message dump [recv] 2021/10/29 09:30:00.493 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=76 type=(2) ERROR flags=(0x0300) {DUMP,(ROOT\|REPLACE\|CAPPED),(MATCH\|EXCLUDE\|ACK_TLVS)} seq=11 pid=3539131282] 2021/10/29 09:30:00.493 ZEBRA: [KWP1C-6CSXF] nlmsgerr [error=(-22) Invalid argument] 2021/10/29 09:30:00.493 ZEBRA: [HSYZM-HV7HF] Extended Error: Gateway can not be a local address 2021/10/29 09:30:00.493 ZEBRA: [WVJCK-PPMGD][EC 4043309093] netlink-dp (NS 0) error: Invalid argument, type=RTM_NEWNEXTHOP(104), seq=11, pid=3539131282 2021/10/29 09:30:00.493 ZEBRA: [V8KNF-8EXH8] netlink_recv_msg: << netlink message dump [recv] 2021/10/29 09:30:00.493 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=36 type=(2) ERROR flags=(0x0100) {DUMP,(ROOT\|REPLACE\|CAPPED)} seq=13 pid=3539131282] 2021/10/29 09:30:00.493 ZEBRA: [KWP1C-6CSXF] nlmsgerr [error=(-3) No such process] 2021/10/29 09:30:00.493 ZEBRA: [V8KNF-8EXH8] netlink_recv_msg: << netlink message dump [recv] 2021/10/29 09:30:00.493 ZEBRA: [JAS4D-NCWGP] nlmsghdr [len=68 type=(2) ERROR flags=(0x0300) {DUMP,(ROOT\|REPLACE\|CAPPED),(MATCH\|EXCLUDE\|ACK_TLVS)} seq=12 pid=3539131282] 2021/10/29 09:30:00.493 ZEBRA: [KWP1C-6CSXF] nlmsgerr [error=(-22) Invalid argument] 2021/10/29 09:30:00.493 ZEBRA: [VCDW6-A7ZF1] dplane dequeues 2 completed work from provider Kernel 2021/10/29 09:30:00.493 ZEBRA: [JTWAB-1MH4Y] dplane has 2 completed, 0 errors, for zebra main 2021/10/29 09:30:00.493 ZEBRA: [J7K9Z-9M7DT] Nexthop dplane ctx 0x561253916a00, op NH_INSTALL, nexthop ID (9), result FAILURE 2021/10/29 09:30:00.493 ZEBRA: [P2XBZ-RAFQ5][EC 4043309074] Failed to install Nexthop ID (9) into the kernel 2021/10/29 09:30:00.493 ZEBRA: [RMK34-61HV5] default(0:254):1::1/128 Processing dplane result ctx 0x561253915f40, op ROUTE_UPDATE result SUCCESS Note that this time we do these three operations a) nexthop installation seq 11 b) route delete seq 13 c) route add seq 12 Note the last line, we report the install as a success but it clearly failed from the seq=12 decode. When we look at the v6 rib it thinks it is installed: unet> r1 show ipv6 route Codes: K - kernel route, C - connected, S - static, R - RIPng, O - OSPFv3, I - IS-IS, B - BGP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure D>* 1::1/128 [150/0] via 2001::1, dum0, seg6local unspec unknown(seg6local_context2str), seg6 a::, weight 1, 00:00:17 So let's modify nl_batch_read_resp to not dequeue/enqueue the context until we are sure we have the right one. This fixes the test code to do the right thing on the second installation. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 15:33:58 -05:00
Donald Sharp	e3ee55d4bd	zebra: set zd_is_update in 1 spot The ctx->zd_is_update is being set in various spots based upon the same value that we are passing into dplane_ctx_ns_init. Let's just consolidate all this into the dplane_ctx_ns_init so that the zd_is_udpate value is set at the same time that we increment the sequence numbers to use. As a note for future me's reading this. The sequence number choosen for the seq number passed to the kernel is that each context gets a copy of the appropriate nlsock to use. Since it's a copy at a point in time, we know we have a unique sequence number value. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 15:33:58 -05:00
Donald Sharp	00249e255e	zebra: When we get an implicit or ack or full failure mark status When nl_batch_read_resp gets a full on failure -1 or an implicit ack 0 from the kernel for a batch of code. Let's immediately mark all of those in the batch pass/fail as needed. Instead of having them marked else where. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 15:33:58 -05:00
Jafar Al-Gharaibeh	40ec6ef9e0	Merge pull request #10161 from donaldsharp/hash_crash zebra: Fix improper usage of hash_iterate that caused crashes	2022-02-04 14:18:03 -06:00
Donald Sharp	07b9ebca65	zebra: Ensure zebra_nhg_sweep_table accounts for double deletes I'm seeing this crash in various forms: Program terminated with signal SIGSEGV, Segmentation fault. 50 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory. [Current thread is 1 (Thread 0x7f418efbc7c0 (LWP 3580253))] (gdb) bt (gdb) f 4 267 (func)(hb, arg); (gdb) p hb $1 = (struct hash_bucket ) 0x558cdaafb250 (gdb) p hb $2 = {len = 0, next = 0x0, key = 0, data = 0x0} (gdb) I've also seen a crash where data is 0x03. My suspicion is that hash_iterate is calling zebra_nhg_sweep_entry which does delete the particular entry we are looking at as well as possibly other entries when the ref count for those entries gets set to 0 as well. Then we have this loop in hash_iterate.c: for (i = 0; i < hash->size; i++) for (hb = hash->index[i]; hb; hb = hbnext) { / get pointer to next hash bucket here, in case (func) decides to delete hb by calling hash_release / hbnext = hb->next; (func)(hb, arg); } Suppose in the previous loop hbnext is set to hb->next and we call zebra_nhg_sweep_entry. This deletes the previous entry and also happens to cause the hbnext entry to be deleted as well, because of nhg refcounts. At this point in time the memory pointed to by hbnext is not owned by the pthread anymore and we can end up on a state where it's overwritten by another pthread in zebra with data for other incoming events. What to do? Let's change the sweep function to a hash_walk and have it stop iterating and to start over if there is a possible double delete operation. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 12:05:38 -05:00
Russ White	ab68283cee	Merge pull request #10401 from donaldsharp/donot_agree zebra: Make Router Advertisement warnings show up once every 6 hours	2022-02-04 10:55:00 -05:00
Donatas Abraitis	66a59f8743	Merge pull request #10469 from mjstapp/fix_dplane_netlink_groups zebra: reduce incoming netlink messages for dplane thread	2022-02-04 17:51:31 +02:00
Donald Sharp	530c9fc4f5	zebra: Convert some `show zebra` output to a table Make the output a bit easier to interpret and use by converting to usage of a table. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	954e1a2bc9	zebra: Add knowledge about RA and RFC 5549 to `show zebra` Add to `show zebra` whether or not RA is compiled into FRR and whether or not BGP is using RFC 5549 at the moment. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	281686819d	zebra: Add evpn status to `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	1777ba2ac4	zebra: Add os and version to `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	090ee85656	zebra: Add kernel nexthop group support to `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	1a97e35eb8	zebra: Add MPLS status to `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	9783de6faf	zebra: Add if v4/v6 forwarding is turned on/off to `show zebra` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	dd42779ff9	zebra: Add to `show zebra` the type of vrf devices being used Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Donald Sharp	88fd4cb8ca	zebra: Add ability to know when FRR is not asic offloaded Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-04 10:29:38 -05:00
Mark Stapp	3d1ff4bfdb	Merge pull request #10409 from idryzhov/zebra-mq-clean-crash zebra: fix cleanup of meta queues on vrf disable	2022-02-02 08:35:00 -05:00
Mark Stapp	ceab66b7f4	zebra: reduce incoming netlink messages for dplane thread The dataplane pthread only processes a limited set of incoming netlink notifications: only register for that set of events, reducing duplicate incoming netlink messages. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-02-01 13:43:51 -05:00
Igor Ryzhov	0ef6eacc95	zebra: fix cleanup of meta queues on vrf disable Current code treats all metaqueues as lists of route_node structures. However, some queues contain other structures that need to be cleaned up differently. Casting the elements of those queues to struct route_node and dereferencing them leads to a crash. The crash may be seen when executing bgp_multi_vrf_topo2. Fix the code by using the proper list element types. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2022-02-01 18:20:30 +03:00
Igor Ryzhov	461a8d7aba	Merge pull request #10443 from mjstapp/zebra_re_opaque zebra: name the route_entry opaque struct more specifically	2022-02-01 12:19:11 +03:00
Mark Stapp	b86c1f4fcc	zebra: name the route_entry opaque struct more specifically The name 'opaque' is a little general - call the route_entry struct 're_opaque' to make it more specific. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2022-01-31 08:50:50 -05:00
Donald Sharp	637f95bf2d	zebra: Make Router Advertisement warnings show up once every 6 hours RA packets are pretty chatty and when there is a warning from a missconfiguration on the network, the log file gets filed up with warnings. Modify the code in rtadv.c to only spit out the warning in these cases at most every 6 hours. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-28 11:07:01 -05:00
Xiao Liang	8244ba34aa	zebra: Fix EVPN route nexthop config order EVPN route add should be queued to preserve the config order. In particular, against deletion in rib_delete(). Signed-off-by: Xiao Liang <shaw.leon@gmail.com>	2022-01-28 20:51:10 +08:00
Donald Sharp	6b390b3c7b	zebra: Better handle replacing our route by a system route When a operator has a FRR based route installed into the FIB and a better route comes in from the system. There is code in the data plane to schedule the batching and continue processing. But in this case we are done so we can just return Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-26 10:26:46 -05:00
Donald Sharp	0955f8757b	zebra: Don't double delete the table we are cleaning up vrf_disable is always called first before vrf_delete. The rnh_table and rnh_table_multicast tables are already deleted as part of vrf_disable. No need to do it again. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-26 08:21:03 -05:00
Russ White	e48b2fea63	Merge pull request #10411 from idryzhov/if-config-vrf-name *: do not print vrf name for interface config when using vrf-lite	2022-01-25 11:34:59 -05:00
Russ White	bbf1101240	Merge pull request #10412 from idryzhov/zebra-vrf-delete zebra: fix vrf deletion	2022-01-24 07:33:53 -05:00
Igor Ryzhov	788a036fdb	*: do not print vrf name for interface config when using vrf-lite VRF name should not be printed in the config since `574445ec`. The update was done for NB config output but I missed it for regular vty output. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2022-01-24 14:44:05 +03:00
Donatas Abraitis	6b968475ed	Merge pull request #10406 from idryzhov/zebra-opaque-memleak zebra: fix opaque data memleak	2022-01-24 09:38:54 +02:00
Russ White	2d9e10d095	Merge pull request #10318 from donaldsharp/redistribution OSPF Redistribution	2022-01-23 22:30:24 -05:00
Igor Ryzhov	e4c5b3ba06	zebra: fix vrf deletion VRF deletion code must be called after the corresponding interface deletion code. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2022-01-24 01:51:10 +03:00
Igor Ryzhov	dc00940b66	zebra: fix opaque data memleak Opaque data should be freed together with route entry in case of errors. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2022-01-23 15:39:04 +03:00
Donald Sharp	e8b3a2f74b	lib, zebra: Add ability to tell thread system to ignore late timers Add a thread_ignore_late_timer(struct thread *thread) function that allows thread.c to ignore when timers are late to the party. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-20 11:58:48 -05:00
Russ White	4ec148523c	Merge pull request #10355 from opensourcerouting/noisy-startup lib, zebra: make startup less noisy	2022-01-18 11:30:13 -05:00
Russ White	05786ac774	Merge pull request #9644 from opensourcerouting/ospf-opaque-attrs OSPF opaque route attributes	2022-01-18 09:08:38 -05:00
Donald Sharp	db80a7e2f5	zebra: Add table and instance data to debugs for redistribute_delete Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:41 -05:00
Donald Sharp	659ec5e9c2	zebra: Cleanup temp variable usage in debugs for %pFX Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:41 -05:00
Donald Sharp	5ef3568f22	zebra: Use %pRN instead of %pFX whenver possible Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:41 -05:00
Donald Sharp	a7704e1b98	zebra: Modify route_notify_internal to use a route_node Pass in the route_node that is under consideration into route_notify_internal to allow calling functions to reduce stack size as well as looking up data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:41 -05:00
Donald Sharp	69a2d597d2	zebra: Cleanup %pFX to %pRN in rib_process_result The dest_pfx was pretty much only ever used for debug output and FRR already knows the rn. So use that instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:41 -05:00
Donald Sharp	085e8f8f6b	zebra: Reduce unneeded lookup in rib_process the lookup of the src_p and dest_p is not needed since the values are never used. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	898b4a6d3b	zebra: Reduce lookups in rib_process_dplane_notify the dest_p and src_p values were only ever used for debugs and %pFX, when we already have the rn. There is no need to do this lookup Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	a1742ba8af	zebra: Fix redistribute.h up to our standards FRR must give variable names instead of not defining them in the .h file. This just cleans up this problem for redistribute.h Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	84faa42066	zebra: Modify zsend_redistribute_route to receive struct route_node The function zsend_redistribute_route uses the prefix and source prefix. Just pass in the route_node instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	c8671e9f9b	zebra: Pass rn to zebra_redistribute_check instead FRR is using struct prefix and afi to be passed around. When all that is needed is the route node. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	ee78ed680b	zebra: Convert redistribute_update to use a route_node FRR is passing around a bunch of data that is encapsulated within the route node. Let's just pass that around instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com> Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	b3a9fca150	zebra: Convert redistribute_delete to use a route_node FRR is passing around a bunch of data that is encapsulated within the route node. Let's just pass that around instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	deb39cca21	zebra: Do not allow instance redistribution to happen no matter what If you have this setup: router ospf 3 redistribute sharp ! and then install: sharp install route 4.5.6.7 nexthop 192.168.100.1 1 sharp install route 4.5.6.8 nexthop 192.168.100.1 1 instance 3 sharp install route 4.5.6.9 nexthop 192.168.100.1 1 instance 4 The .8 and .9 routes are auto redistributed into ospf instance 3: eva# show ip ospf data OSPF Instance: 3 OSPF Router with ID (192.168.122.1) AS External Link States Link ID ADV Router Age Seq# CkSum Route 4.5.6.7 192.168.122.1 13 0x80000001 0x477c E2 4.5.6.7/32 [0x0] 4.5.6.8 192.168.122.1 5 0x80000001 0x3d85 E2 4.5.6.8/32 [0x0] 4.5.6.9 192.168.122.1 5 0x80000001 0x338e E2 4.5.6.9/32 [0x0] This cannot be correct behavior. When redistributing in the absense of an instance number the default instance of 0 should be used and should be the only route redistributed. Here is the correct behavior: eva# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.119.1, enp39s0, 00:00:28 D>* 4.5.6.7/32 [150/0] via 192.168.100.1, virbr1, weight 1, 00:00:02 D[3]>* 4.5.6.8/32 [150/0] via 192.168.100.1, virbr1, weight 1, 00:00:02 D[4]>* 4.5.6.9/32 [150/0] via 192.168.100.1, virbr1, weight 1, 00:00:02 C>* 192.168.100.0/24 is directly connected, virbr1, 00:00:28 C>* 192.168.110.0/24 is directly connected, virbr2, 00:00:28 C>* 192.168.119.0/24 is directly connected, enp39s0, 00:00:28 C>* 192.168.122.0/24 is directly connected, virbr0, 00:00:28 eva# show ip ospf data OSPF Instance: 3 OSPF Router with ID (192.168.122.1) AS External Link States Link ID ADV Router Age Seq# CkSum Route 4.5.6.7 192.168.122.1 6 0x80000001 0x477c E2 4.5.6.7/32 [0x0] Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:39:40 -05:00
Donald Sharp	1945fb7a07	zebra: Add hint to what instance we are looking at FRR allows redistribution to a client with a specific instance in mind. The code was not allowing you to figure out what instance was being looked at. So let's clarify this in the debugs. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-01-18 08:36:26 -05:00
Rafael Zalamena	4e4c027803	Merge pull request #10183 from idryzhov/rework-vrf-rename *: rework renaming the default VRF	2022-01-17 08:45:12 -03:00
David Lamparter	17a4c65576	zebra: remove netlink buffer size log message ... really not much point in printing this. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2022-01-17 09:46:19 +01:00
Renato Westphal	5b5d66c431	lib, ospfd, ospf6d, zebra: add OSPF opaque route attributes Update ospfd and ospf6d to send opaque route attributes to zebra. Those attributes are stored in the RIB and can be viewed using the "show ip[v6] route" commands (other than that, they are completely ignored by zebra). Example: ``` debian# show ip route 192.168.1.0/24 Routing entry for 192.168.1.0/24 Known via "ospf", distance 110, metric 20, best Last update 01:57:08 ago * 10.0.1.2, via eth-rt2, weight 1 OSPF path type : External-2 OSPF tag : 0 debian# debian# show ip route 192.168.1.0/24 json { "192.168.1.0\/24":[ { "prefix":"192.168.1.0\/24", "prefixLen":24, "protocol":"ospf", "vrfId":0, "vrfName":"default", "selected":true, [snip] "ospfPathType":"External-2", "ospfTag":"0" } ] } ``` Signed-off-by: Renato Westphal <renato@opensourcerouting.org>	2022-01-15 17:22:27 +01:00
Sarita Patra	f99f1ff50a	zebra: Fix for route node having no tracking NHT Topology: IXIA-----(ens192)FRR(ens224)------iXIA Configuration: 1. Create 8 sub-interfaces on ens192 under Default VRF and configure 8 EBGP session between FRR and IXIA. 2. Create 1000 sub-interfaces on ens224 under Default VRF and configure 1000 EBGP session between FRR and IXIA. 3. 2M prefixes distributed from Left side Ixia each with 8 ECMP path. 4. So in total, there are 2M prefixes * 8 ECMP = 16M prefixes entries in RIB and FIB. Issue: Shut ens192 and ens224, this is taking 1hr 15 mins to clean up the routes. Root Cause: In the case of route deletion, if the particular route node is having nht count = 0, we are going to the parent and doing nht evaluation, which is not needed. Fix: If the deleted the route node is having nht count > 0, then do a nht evaluation on the parent node. Shut ens192 and ens224, it is taking 1 min to clean up the routes with the fix. Signed-off-by: Sarita Patra <saritap@vmware.com>	2022-01-13 14:15:05 -05:00
Donatas Abraitis	df8d723c5f	*: Add FOREACH_AFI_SAFI_NSF(afi, safi) macro to reduce nesting Used for graceful-restart mostly. Especially for bgp_show_neighbor_graceful_restart_capability_per_afi_safi() Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2022-01-13 14:29:54 +02:00
anlan_cs	5498a9f13d	bfdd: correct one spelling error of comment Signed-off-by: anlan_cs <anlan_cs@tom.com>	2021-12-31 05:32:49 -05:00
Igor Ryzhov	63011ec1c5	Merge pull request #10256 from anlancs/cleanup-zebra_evpn_mac_add zebra: cleanup checking zebra_evpn_mac_add function's return value	2021-12-23 13:10:18 +03:00
anlan_cs	07361b8fdf	zebra: cleanup checking zebra_evpn_mac_add function's return value This function is sure to return correct value by "assert", so the checking its return value should be removed. Signed-off-by: anlan_cs <anlan_cs@tom.com>	2021-12-22 21:13:26 -05:00
Igor Ryzhov	ac2cb9bf94	*: rework renaming the default VRF Currently, it is possible to rename the default VRF either by passing `-o` option to zebra or by creating a file in `/var/run/netns` and binding it to `/proc/self/ns/net`. In both cases, only zebra knows about the rename and other daemons learn about it only after they connect to zebra. This is a problem, because daemons may read their config before they connect to zebra. To handle this rename after the config is read, we have some special code in every single daemon, which is not very bad but not desirable in my opinion. But things are getting worse when we need to handle this in northbound layer as we have to manually rewrite the config nodes. This approach is already hacky, but still works as every daemon handles its own NB structures. But it is completely incompatible with the central management daemon architecture we are aiming for, as mgmtd doesn't even have a connection with zebra to learn from it. And it shouldn't have it, because operational state changes should never affect configuration. To solve the problem and simplify the code, I propose to expand the `-o` option to all daemons. By using the startup option, we let daemons know about the rename before they read their configs so we don't need any special code to deal with it. There's an easy way to pass the option to all daemons by using `frr_global_options` variable. Unfortunately, the second way of renaming by creating a file in `/var/run/netns` is incompatible with the new mgmtd architecture. Theoretically, we could force daemons to read their configs only after they connect to zebra, but it means adding even more code to handle a very specific use-case. And anyway this won't work for mgmtd as it doesn't have a connection with zebra. So I had to remove this option. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-12-21 22:09:29 +03:00
Lou Berger	4b7d297d4b	Merge pull request #9750 from mjstapp/zebra_installed_nhg_id zebra: add installed nexthop-group id value	2021-12-21 10:31:04 -05:00
Donatas Abraitis	20044d8090	Merge pull request #10245 from anlancs/fix-spell-error zebra: correct one spell error	2021-12-20 15:14:53 +02:00
anlan_cs	b816de6213	zebra: correct one spell error Signed-off-by: anlan_cs <anlan_cs@tom.com>	2021-12-19 20:47:01 -05:00
Donatas Abraitis	e69ae079b7	Merge pull request #9899 from Drumato/zebra-srv6-locator-detail-json-support zebra: Add support for json output in srv6 locator detail command	2021-12-14 09:11:36 +02:00
Stephen Worley	f6f16b6073	zebra: add optional NHG ID output to `show ip ro` Add optional NHG ID output to `show ip route` dumps. We have this in json output already as nexthopGroupID but nice to have the option in a normal dump as well. Not including in main output for now to avoid breaking screen scrapers. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-12-02 11:28:37 -05:00
Donald Sharp	c3343a755f	zebra: Prevent thread usage of data after it being freed On startup we create a thread timer event to do a rib sweep of the system. On shutdown we never stopped this timer and as such we have a situation where a thread event could be run on shutdown after the data for it has been freed. Here is the crash I am seeing: (gdb) bt (gdb) Save the thread data in zebra_router and stop the thread so we don't accidently do work on shutdown we don't mean to. In this case it happened in our topotests with some severe system load. Essentially we happened to kill the zebra daemon just as the graceful_restart timer popped here. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-11-29 15:51:45 -05:00
Yamato Sugawara	559f4b2f2a	zebra: Add support for json output in srv6 locator detail command Signed-off-by: Yamato Sugawara <yamato.sugawara@linecorp.com>	2021-11-28 23:53:41 +00:00
Igor Ryzhov	cb3fa0a612	Merge pull request #10124 from ton31337/feature/vty_json	2021-11-29 02:11:29 +03:00
Donatas Abraitis	c48349e346	*: Remove redundand braces for single statement blocks Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-27 11:20:59 +02:00
Donatas Abraitis	962af8a8cd	zebra: Convert vty_out to vty_json for JSON Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-25 17:49:46 +02:00
Donatas Abraitis	746a6eda2f	*: Remove unused variables Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-25 17:35:55 +02:00
Donatas Abraitis	44991dc163	zebra: Replace prefix2str for JSON to %pFX Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-25 17:28:12 +02:00
Mark Stapp	002930f7bb	zebra: add installed nexthop-group id value In some cases, zebra may install a nexthop-group id that is different from the id of the nhe struct attached to a route-entry. This happens for a singleton recursive nexthop, for example, where a route is installed with the resolving nexthop's id. The installed value is the most useful value - that corresponds to information in the kernel on linux/netlink platforms that support nhgs. Display both values if they differ in ascii output, and include both values in the json form. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2021-11-23 11:23:26 -05:00
Igor Ryzhov	096f7609f9	*: cleanup ifp->vrf_id Since `f60a1188` we store a pointer to the VRF in the interface structure. There's no need anymore to store a separate vrf_id field. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-11-22 20:47:23 +03:00
Donald Sharp	9d5a61264a	Merge pull request #10076 from idryzhov/if-is-loopback-or-vrf *: unify if_is_loopback/if_is_loopback_or_vrf	2021-11-22 12:02:21 -05:00
Ryoga Saito	7eab60a793	zebra: add support for End.DT4 This patch enables zebra to insert End.DT4 nexthop into linux kernel. Signed-off-by: Ryoga Saito <ryoga.saito@linecorp.com>	2021-11-22 23:32:30 +09:00
Igor Ryzhov	0609190219	Merge pull request #10074 from opensourcerouting/assorted-20211116 lib/vtysh/ospf6d: assorted small bits	2021-11-19 15:43:10 +03:00
Igor Ryzhov	3c52293809	Merge pull request #10092 from ton31337/feature/replace_json_object_string_add_to_json_object_string_addf_for_inet_ntop *: inet_ntop for JSON output	2021-11-18 22:19:40 +03:00
Russ White	ef148de26d	Merge pull request #9706 from donaldsharp/zebra_client_summ_more_room zebra: Expand v4/v6 route space	2021-11-18 12:19:44 -05:00
Donatas Abraitis	4e9a98636f	*: Remove unused variables Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-18 18:45:41 +02:00
Donatas Abraitis	08edf9c6af	zebra: Replace inet_ntop to %pI4/6 for JSON outputs Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-18 18:45:41 +02:00
Igor Ryzhov	31ffc82f65	Merge pull request #10080 from mjstapp/fix_lsp_workqueue zebra: ignore workqueue delete callbacks during shutdown	2021-11-18 18:47:35 +03:00
Mark Stapp	b0d10d93e2	zebra: during shutdown, don't process LSPs on the lsp workqueue During zebra shutdown, we clear out the LSP workqueue. The LSPs will be uninstalled and freed during the shutdown process, so just ignore any LSPs that happen to be on the workqueue. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2021-11-18 07:35:35 -05:00
Mark Stapp	695b279ae3	zebra: free LSP workqueue early, revert PR 10050 this reverts commit `dd9538c5f3`, which tried to clear the LSP workqueue late during shutdown. Signed-off-by: Mark Stapp <mstapp@nvidia.com>	2021-11-18 07:35:35 -05:00
Donald Sharp	f2ada31cba	zebra: Expand v4/v6 route space At some scale we eventually run out of room displaying v4/v6 route totals for `show zebra client summ`: janelle# show zebra client summ Name Connect Time Last Read Last Write IPv4 Routes IPv6 Routes -------------------------------------------------------------------------------- bgp 04w0d18h 00:00:19 00:01:2411729127/4052681 2037786/903094 This total over ran the space in just a little over a week of uptime. Expand to have a bit more room. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-11-17 07:47:28 -05:00

... 6 7 8 9 10 ...

5443 Commits