mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-07-13 05:05:35 +00:00

Author	SHA1	Message	Date
Donatas Abraitis	c49042b407	Merge pull request #7638 from donaldsharp/reduce_warn zebra: Reduce warn -> debug	2020-12-03 08:17:59 +02:00
Donald Sharp	0fb4ab0388	Merge pull request #6950 from opensourcerouting/bfd-distributed-v3 bfdd: distributed BFD	2020-12-02 20:50:47 -05:00
Donald Sharp	af8a77d636	Merge pull request #7644 from mjstapp/dplane_cleaner zebra: add an api to process/clean the pending dplane queue	2020-12-02 09:01:44 -05:00
Donald Sharp	fe76cf322e	Merge pull request #7646 from volta-networks/fix_show_route_summary zebra: fix show ip route vrf X summary	2020-12-02 08:59:54 -05:00
Mark Stapp	b238167a9b	Merge pull request #7645 from sworleys/NHG-IFP-Error2Log zebra: make a couple NHG errors debugs	2020-12-01 17:17:59 -05:00
Rafael Zalamena	de5fa92042	Merge pull request #7617 from deastoe/dplane-fpm-lsp zebra: dplane FPM LSP support	2020-12-01 16:01:09 -03:00
Stephen Worley	8c74d904d4	zebra: remove unused EC_ZEBRA_IF_LOOKUP_FAILED EC_ZEBRA_IF_LOOKUP_FAILED is no longer being used, remove it. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2020-12-01 13:05:36 -05:00
Anuradha Karuppiah	46bf266c1c	zebra: debug logs to detect incorrect mac deletions A MAC entry cannot be deleted while a neigh is referencing it. It seems there is some race condition where this may be happening. The log is to help identify those cases. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:28 -08:00
Anuradha Karuppiah	4f9bb78eca	zebra: change the L2 NHG id format to co-exist with the L3NHG ids It is now 4bits of type and 28bits of value - 1. type=0 is for L3 NHG 2. type=1 is for L2 NH 3. type=2 is for L2 NHG Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:28 -08:00
Anuradha Karuppiah	5de10c3705	zebra: allocate one nexthop id per-VTEP instead of one per-ES-VTEP This is an optimization to reduce the number of L2 nexthops. A l2 or fdb nexthop simply provides the dataplane with a nexthop ip- torm-12:mgmt:~# ip nexthop id 268435461 via 27.0.0.20 scope link fdb id 268435463 via 27.0.0.20 scope link fdb id 268435465 via 27.0.0.20 scope link fdb So there is no need to allocate a nexthop per-ES/per-VTEP. There can be 100+ ESs per-VTEP so this change cuts the scale down by a factor of 100. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:28 -08:00
Anuradha Karuppiah	15400f95b7	zebra: support for slow-failover of local MACs on an ES When a local ES flaps there are two modes in which the local MACs are failed over - 1. Fast failover - A backup NHG (ES-peer group) is programmed in the dataplane per-access port. When a local ES flaps the MAC entries are left unaltered i.e. pointing to the down access port. And the dataplane redirects traffic destined to the oper-down access port via the backup NHG. 2. Slow failover - This mode needs to be turned on to allow dataplanes not capable of re-directing traffic. In this mode local MAC entries on a down local ES are re-programmed to point to the ES-peers' NHG. And vice-versa i.e. when the ES comes up the MAC entries are re-programmed with the access port as dest. Fast failover is on by default. Slow failover can be enabled via the following config - evpn mh redirect-off Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:26 -08:00
Anuradha Karuppiah	69711b3f83	zebra: on local mac add from the dplane a re-install maybe need as static As a part of extended MM handling, a MAC can be updated from local to remote while being referenced by SYNC neighs (this is really a temporary/small window). During this window, if the MAC transitions back to local again, we need to reinforce the previous SYNC flags (based on the sync-neigh count) as subsequent SYNC updates to the MAC will be de-duped and ignored. Ticket: CM-29636 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	1a4f9efd54	zebra: set inactive bit when zebra re-installs the MAC on dplane del When a local mac is deleted by the dataplane zebra can re-install it if the MAC is a SYNC MAC (learned from ES peers). The "local_inactive" bit must be set as a part of the re-install to prevent zebra turning around and advertising the MAC as locally active. Also fixed up some debug logs in the slow-fail path to include the VNI. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	80e19eb71f	zebra: skip NDA_DST attr if NHG is present NHG and DST (VTEP-IP) are mutually exclusive attributes. If DST is present the kernel ignores NHG. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	de86cc5bb1	zebra: free up the L2 NHG bitmap as a part of shutdown Fix for a shutdown time memory leak found during review. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	f3722826a4	zebra: remove FDB entries before de-activating a L2-NHG NHG is activated i.e. programmed in the dataplane only if there are active-VTEPs associated with it. When a NHG is de-activated all the remote-mac entries associated with it need to be removed before the NHG is removed. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Patrick Ruddy	0091461961	Merge pull request #7483 from AnuradhaKaruppiah/evpn-mh-dad bgpd, zebra: Keep DAD disabled if EVPN MH is turned on	2020-12-01 17:37:32 +00:00
Emanuele Di Pascale	265ac74a87	zebra: fix show ip route vrf X summary The lookup for non default VRFs was always using a tableId; if not provided, we were defaulting to RT_TABLE_MAIN. This is fine for the default VRF but not for others. As a result, the command was silently failing for non-default VRFs unless we also specified the correct tableId. Fix this by only performing the lookup using the tableId if it is provided; else use zebra_vrf_table. Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>	2020-12-01 18:34:05 +01:00
Stephen Worley	306720345a	zebra: make a couple NHG errors debugs A couple of NHG messages we were logging as errors are a bit spammy in use cases where you routinely add/remove interfaces (VM-heavy deployments). It's not really an error a user cares about; it is more for a developer to know what went wrong after the fact. It makes more sense for these to be under a debug rather than an error, since seeing them does not necessarily mean an error occurred in those use cases. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2020-12-01 12:04:30 -05:00
Donald Sharp	34c9b28ba8	zebra: Reduce warn -> debug During times of network trauma and when we are at large network scale the process_remote_macip_add function can issue a zlog_warn for a common occurrence. Modify the code to be a debug statement. This behavior is the same now as the process_remote_macip_del function Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-30 19:37:53 -05:00
Mark Stapp	aa21da071c	zebra: add an api to process/clean the pending dplane queue Add an api that allows a caller in the zebra main pthread to process the queue of pending dplane updates. The caller supplies a function to call to test each pending context. Selected contexts are dequeued, and freed without being processed. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-11-30 16:42:18 -05:00
Anuradha Karuppiah	0c16fb7262	zebra: fix crash seen on VxLAN SG table cleanup done as a part of vrf disable There are two fixes in this commit - 1. Prevent implicit deletion of (*,G) entries during (S,G) cleanup. This is done by creating a dummy reference on all (*,G) entries. This is needed for a hash-walk based table cleanup. 2. Free up the SG hash table when the VRF is deleted. Ticket: CM-30151 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-30 12:50:38 -08:00
Anuradha Karuppiah	325d694b93	zebra: support for type-0 ESI Earlier type-3 ESI was the only format supported for evpn-mh. Updated the CLI to allow a 10-byte type-0 ESI. Both type-0 and type-3 ESIs are statically configured; just in two different ways - 1. type-0 is configured as a complete 10-byte string 2. type-3 is configured as a 6-byte es-sys-mac and a 3-byte local-discriminator. Sample config - ! interface hostbond1 evpn mh es-id 00:44:38:39:ff:ff:01:00:00:01 ! This is a CLI-only change and has no functional impact. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-30 12:36:41 -08:00
Mark Stapp	a20e6c32a2	zebra: free dplane ctx after pw update Free the dplane contexts used for pseudowire updates; we were leaking these. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-11-30 10:02:40 -05:00
Duncan Eastoe	f9bf1ecc38	zebra: dplane FPM LSP table walk Add routines to walk the LSP table and generate FPM updates for all entries. A walk of the LSP table is triggered when (re-)connecting to an FPM. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-11-30 12:13:43 +00:00
Duncan Eastoe	b300c8bbcf	zebra: dplane FPM handle LSP install/update/delete Export netlink_lsp_msg_encoder() and use it to encode and send netlink messages concerning LSP updates to connected FPMs. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-11-27 16:32:01 +00:00
Anuradha Karuppiah	dfa3d3d70a	zebra: change the nhg format from hex to dec for easy match up with the dp Dataplane/kernel prints the NHG and NH ids as decimal. Zebra was printing it as hex (to display type vs. val). This became a debugging hassle hence normalizing the format. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-24 11:06:08 -08:00
Anuradha Karuppiah	b2ee2b71f4	zebra: Keep DAD disabled if EVPN MH is turned on DAD is not supported currently with EVPN-MH so we turn it off internally when the first ES config is detected. PS: Note that when all local ESs are deleted DAD will stay off and will need to be cleared via a daemon restart. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-24 10:20:32 -08:00
Rafael Zalamena	91804f630c	lib: add new stream function to reorganize buffer The function was originally implemented for the zebra data plane FPM plugin, but other places in the code could use it. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-11-24 07:54:07 -03:00
Donatas Abraitis	53a85efa51	Merge pull request #7554 from donaldsharp/sockunion2hostprefix_watch_returns bgpd, lib, nhrpd, zebra: verify return of sockunion2hostprefix	2020-11-19 11:26:02 +02:00
Mark Stapp	84c709bc6e	Merge pull request #7555 from idryzhov/cppcheck-fixes fix a couple of issues found by cppcheck	2020-11-18 14:29:25 -05:00
Igor Ryzhov	b0efbc16e4	zebra: fix writing to pointer instead of value Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-11-18 19:05:30 +03:00
Donald Sharp	0154d8ce45	bgpd, lib, nhrpd, zebra: verify return of sockunion2hostprefix The return from sockunion2hostprefix tells us if the conversion succeeded or not. There are places in the code where we always assume that it just `works`. Since it can fail, notice the failure and try to do the right thing. Please note that failure of this function is highly unlikely in most cases, as the sockunion was already created and tested elsewhere; it's just that this function can fail. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-18 11:04:27 -05:00
Mark Stapp	926bc58f78	Merge pull request #7478 from donaldsharp/buffer Buffer	2020-11-18 08:30:47 -05:00
Russ White	7dce3c57c2	Merge pull request #7518 from donaldsharp/asic_offload_more Asic offload more	2020-11-17 07:27:41 -05:00
Russ White	2bd9d50ca1	Merge pull request #7523 from donaldsharp/route_map_object_t *: Remove route_map_object_t from the system	2020-11-17 07:16:12 -05:00
Mark Stapp	55e74ca925	zebra: use smaller stream buffer for zapi route notifications The owner-notification zapi message is small; use a small buffer for it. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-11-15 14:50:17 -05:00
Donald Sharp	f7a9d0120d	zebra: Add offload and trap counts to summary command for json output Add offload and trap route counts to the json output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-15 10:19:25 -05:00
Donald Sharp	e4876266e4	zebra: Add `--asic-offload` command Add a command that allows FRR to know it's being used with an underlying asic offload, from the linux kernel perspective. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-15 10:19:25 -05:00
Donald Sharp	0d32fbee6d	lib, zebra: Add ability to read kernel notice of Offload Failed The linux kernel is getting RTM_F_OFFLOAD_FAILED for kernel routes that have failed to offload. Write the code to receive these notifications from the linux kernel and store that data for display about the routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-15 10:12:50 -05:00
Donald Sharp	fd303a4ba1	zebra: deny when route map is specified but does not exist yet If we have `ip protocol <proto> route-map FOO` and FOO has not been defined in any way shape fashion or form, we should deny the match instead of permitting it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-13 21:11:48 -05:00
Donald Sharp	1782514fb9	*: Remove route_map_object_t from the system The route_map_object_t was being used to track what protocol we were being called against. But each protocol was only ever calling itself. So we had a variable that was only ever being passed in from route_map_apply that had to be carried against and everyone was testing if that variable was for their own stack. Clean up this route_map_object_t from the entire system. We should speed some stuff up. Yes I know not a bunch but this will add up. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-13 19:35:20 -05:00
Donald Sharp	6d12b20703	zebra: Allow `set src X` to work on startup If a route-map in zebra has `set src X` and the interface that X is on has not been configured yet, we are rejecting the command outright. This is a problem especially on boot up (which is where I found this issue), in that interfaces can and will be slow on startup and config can easily be read in before the interface has an ip address. Let's modify zebra to just warn the user that we may have a problem and let the chips fall where they may. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-13 16:12:26 -05:00
Santosh P K	9b936c5c36	Merge pull request #4770 from kssoman/fib Advertise FIB installed routes to bgp peers	2020-11-12 18:59:24 +05:30
Anuradha Karuppiah	60e372e9cb	zebra: Set NUD_NOARP on sticky MAC entries in addition to NTF_STICKY (ndm_state & NUD_NOARP) - prevents the entry from expiring (ndm_flags & NTF_STICKY) - prevents station moves on the entry Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-06 17:21:12 -08:00
Soman K S	77b38a4a7d	bgpd: Advertise FIB installed routes to bgp peers (Part 1) Issue: The bgp routes learnt from peers which are not installed in the kernel are advertised to peers. This can cause routers to send traffic to these destinations only to get dropped. The fix is to provide a configurable option "bgp suppress-fib-pending". When the option is enabled, bgp will advertise routes only if these are successfully installed in the kernel. Fix (Part1) : * Added message ZEBRA_ROUTE_NOTIFY_REQUEST used by client to request FIB install status for routes * Added AFI/SAFI to ZAPI messages * Modified the functions zapi_route_notify_decode(), zsend_route_notify_owner() and route_notify_internal() to include AFI, SAFI as parameters Signed-off-by: kssoman <somanks@gmail.com>	2020-11-06 08:39:28 +05:30
Donald Sharp	9ea714e143	zebra: Rework code to make SA happy Clan SA was saying: ./zebra/zebra_vty_clippy.c: In function ‘show_route’: zebra/zebra_vty.c:1775:4: warning: ‘zvrf’ may be used uninitialized in this function [-Wmaybe-uninitialized] do_show_ip_route_all(vty, zvrf, afi, !!fib, !!json, tag, ^ I do not see a way that zvrf could ever be uninited in the code path but rearrange the code a tiny bit to make it happier. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-04 11:48:49 -05:00
Mark Stapp	5917df094a	zebra: add optional extra data about routes' interfaces Add extra data about the interfaces used in route updates' nexthops - some consumers of route updates may want additional data, but dataplane plugins running in the dplane pthread cannot safely access the normal zebra data structures. Capturing this info is optional - a plugin must request it (via an api). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-30 10:51:54 -04:00
Mark Stapp	93ca501b61	Merge pull request #7418 from donaldsharp/manuall *: spelling fixes	2020-10-30 08:16:46 -04:00
Donald Sharp	cd8cae5489	Merge pull request #7415 from mjstapp/fix_sa_strlen ospfd, zebra: Fix SA warnings	2020-10-30 07:21:45 -04:00
Jafar Al-Gharaibeh	b131b5f539	Merge pull request #7414 from donaldsharp/32bitflags zebra: Consolidate on 32 bits as the flag size for route flags	2020-10-29 21:47:15 -05:00
Donald Sharp	02c671af40	*: Correct spelling stuff Pretty obvious. WE R SPELL GOOD Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 16:16:00 -04:00
Mark Stapp	904e9b0570	zebra: clean up zebra_protodown_rc_str() Clean up api SA warning, use 'const', and replace snprintf+ pointer math with strlcat. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-29 12:03:25 -04:00
Donald Sharp	acde7f6b8e	zebra: Consolidate on 32 bits as the flag size for route flags When we get a route for installation via any method we should consolidate on 32 bits as the flag size, since we actually have more than 8 bits of data to pass around. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 09:13:59 -04:00
Donald Sharp	82144f532b	zebra: Don't do expensive string manip if not in debug Modify the code to not load up a string that is only used in debugging unless we are debugging. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 09:00:43 -04:00
Russ White	763a60663c	Merge pull request #7371 from AnuradhaKaruppiah/mh-uplink-tracking-1 evpn-mh: uplink tracking and startup delay	2020-10-28 12:13:57 -04:00
Donald Sharp	4d8fa81fbe	Merge pull request #7352 from mjstapp/fix_rt_netlink_indent zebra: fix strange indentation	2020-10-27 20:07:15 -04:00
Anuradha Karuppiah	c36e442c4b	zebra: uplink tracking and startup delay for EVPN-MH Local ethernet segments are held in a protodown or error-disabled state if access to the VxLAN overlay is not ready - 1. When FRR comes up the local-ESs/access-port are kept protodown for the startup-delay duration. During this time the underlay and EVPN routes via it are expected to converge. 2. When all the uplinks/core-links attached to the underlay go down the access-ports are similarly protodowned. The ES-bond protodown state is propagated to each ES-bond member and programmed in the dataplane/kernel (per-bond-member). Configuring uplinks - vtysh -c "conf t" vtysh -c "interface swp4" vtysh -c "evpn mh uplink" Configuring startup delay - vtysh -c "conf t" vtysh -c "evpn mh startup-delay 100" >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> EVPN protodown display - ======================== root@torm-11:mgmt:~# vtysh -c "show evpn" L2 VNIs: 10 L3 VNIs: 3 Advertise gateway mac-ip: No Advertise svi mac-ip: No Duplicate address detection: Disable Detection max-moves 5, time 180 EVPN MH: mac-holdtime: 60s, neigh-holdtime: 60s startup-delay: 180s, start-delay-timer: 00:01:14 <<<<<<<<<<<< uplink-cfg-cnt: 4, uplink-active-cnt: 4 protodown: startup-delay <<<<<<<<<<<<<<<<<<<<<<< >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ES-bond protodown display - =========================== root@torm-11:mgmt:~# vtysh -c "show interface hostbond1" Interface hostbond1 is up, line protocol is down Link ups: 0 last: (never) Link downs: 1 last: 2020/04/26 20:38:03.53 PTM status: disabled vrf: default OS Description: Local Node/s torm-11 and Ports swp5 <==> Remote Node/s hostd-11 and Ports swp1 index 58 metric 0 mtu 9152 speed 4294967295 flags: <UP,BROADCAST,MULTICAST> Type: Ethernet HWaddr: 00:02:00:00:00:35 Interface Type bond Master interface: bridge EVPN-MH: ES id 1 ES sysmac 00:00:00:00:01:11 protodown: off rc: startup-delay <<<<<<<<<<<<<<<<< >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ES-bond member protodown display - ================================== root@torm-11:mgmt:~# vtysh -c "show interface swp5" Interface swp5 is up, line protocol is down Link ups: 0 last: (never) Link downs: 3 last: 2020/04/26 20:38:03.52 PTM status: disabled vrf: default index 7 metric 0 mtu 9152 speed 10000 flags: <UP,BROADCAST,MULTICAST> Type: Ethernet HWaddr: 00:02:00:00:00:35 Interface Type Other Master interface: hostbond1 protodown: on rc: startup-delay <<<<<<<<<<<<<<<< root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-27 09:34:09 -07:00
Patrick Ruddy	dd51171227	Merge pull request #7158 from AnuradhaKaruppiah/mh-df-election evpn-mh: support for DF election	2020-10-27 16:09:45 +00:00
Mark Stapp	bdd085a874	zebra: fix strange indentation Fix some odd indentation in rt_netlink.c - merge damage, maybe? Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-27 12:03:41 -04:00
Mark Stapp	aa9d75efaf	Merge pull request #7381 from sworleys/NHG-Show-Proto-Filter zebra: add type specifier to show nexthop-group	2020-10-27 11:33:00 -04:00
Donald Sharp	f1dbb1c7e1	zebra: Add uptime to `show evpn mac vni ... detail` Add the uptime a mac entry has been in the system. New Output: eva# show evpn mac vni all detail VNI 1000 #MACs (local and remote) 16 MAC: 4e:2d:f3:75:ff:db ESI: 03:44:38:39:ff:ff:01:00:00:02 Intf: hostbond2(10) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 7a:a4:f2:30:dd:5d ESI: 03:44:38:39:ff:ff:01:00:00:01 Intf: hostbond1(9) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 66:9e:d7:3a:f1:f1 Remote VTEP: 192.168.100.18 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:26 Neighbors: 45.0.0.5 Active fe80::649e:d7ff:fe3a:f1f1 Active MAC: 26:f1:bd:5f:e1:77 Remote ES: 03:44:38:39:ff:ff:02:00:00:02 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:23 Neighbors: No Neighbors MAC: 16:80:eb:c4:43:6d ESI: 03:44:38:39:ff:ff:01:00:00:01 Intf: hostbond1(9) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 00:00:00:00:00:22 Remote ES: 03:44:38:39:ff:ff:02:00:00:02 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:26 Neighbors: No Neighbors Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 16:47:07 -04:00
Donald Sharp	a05111ba3d	zebra: Add uptime to `show evpn arp-cache vni .. detail` Add uptime data to `show evpn arp-cache vni ... detail` command. Effectively when we create a neighbor entry store the time it was created. When we modify the neighbor entry store the time it was modified. Display under detail output and json output. New output: eva# show evpn arp-cache vni all detail VNI 1000 #ARP (IPv4 and IPv6, local and remote) 8 IP: 45.0.0.5 Type: remote State: active Uptime: 00:01:59 MAC: 0a:fd:87:ca:7c:00 Sync-info: - Remote VTEP: 192.168.100.18 Local Seq: 0 Remote Seq: 0 IP: fe80::8fd:87ff:feca:7c00 Type: remote State: active Uptime: 00:01:59 MAC: 0a:fd:87:ca:7c:00 Sync-info: - Remote VTEP: 192.168.100.18 Local Seq: 0 Remote Seq: 0 IP: fe80::14e5:c2ff:fe50:fa59 Type: local State: active Uptime: 00:02:04 MAC: 16:e5:c2:50:fa:59 Sync-info: - Local Seq: 0 Remote Seq: 0 IP: 45.0.0.3 Type: remote State: active Uptime: 00:02:02 MAC: 0e:50:e8:cf:6b:eb Sync-info: - Remote VTEP: 192.168.100.16 Local Seq: 0 Remote Seq: 0 IP: 45.0.0.2 Type: local State: active Uptime: 00:02:05 MAC: 16:e5:c2:50:fa:59 Sync-info: - Local Seq: 0 Remote Seq: 0 IP: fe80::c50:e8ff:fecf:6beb Type: remote State: active Uptime: 00:02:02 MAC: 0e:50:e8:cf:6b:eb Sync-info: - Remote VTEP: 192.168.100.16 Local Seq: 0 Remote Seq: 0 IP: 45.0.0.4 Type: remote State: active Uptime: 00:01:55 MAC: 02:ad:5f:d8:da:80 Sync-info: - Remote VTEP: 192.168.100.17 Local Seq: 0 Remote Seq: 0 IP: fe80::ad:5fff:fed8:da80 Type: remote State: active Uptime: 00:01:55 MAC: 02:ad:5f:d8:da:80 Sync-info: - Remote VTEP: 192.168.100.17 Local Seq: 0 Remote Seq: 0 eva# Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 16:47:07 -04:00
Stephen Worley	a8ad9a89ea	zebra,doc: add type specifier to show nexthop-group Add a type specifier to the `show nexthop-group` command so we can easily filter by type when using proto created nexthop groups. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-26 15:55:02 -04:00
Anuradha Karuppiah	2747f6f786	zebra: cleanup inet_ntoa usage in zebra_evpn_mh.c logs Replaced inet_ntoa with %pI4 in the zebra debugs logs. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:43:05 -07:00
Anuradha Karuppiah	acffa256ba	zebra: add json output for zebra ES, ES-EVI and access vlan dumps 1. ES root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01 json" |python -m json.tool { "accessPort": "hostbond1", "dfPreference": 50000, "esi": "03:44:38:39:ff:ff:01:00:00:01", "flags": [ "local", "remote", "readyForBgp", "bridgePort", "operUp", "nexthopGroupActive" ], "macCount": 10, "nexthopGroup": 536870913, "vniCount": 10, "vteps": [ { "dfAlgorithm": "preference", "dfPreference": 32767, "nexthopId": 268435460, "vtep": "27.0.0.16" }, { "dfAlgorithm": "preference", "dfPreference": 32767, "nexthopId": 268435463, "vtep": "27.0.0.17" } ] } >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 2. ES-EVI - root@torm-11:mgmt:~# vtysh -c "show evpn es-evi vni 1001 detail json" |python -m json.tool [ { "esi": "03:44:38:39:ff:ff:01:00:00:01", "flags": [ "local", "readyForBgp" ], "vni": 1001 }, { "esi": "03:44:38:39:ff:ff:01:00:00:02", "flags": [ "local", "readyForBgp" ], "vni": 1001 }, ] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 3. access-vlan root@torm-11:mgmt:~# vtysh -c "show evpn access-vlan 1001 json" |python -m json.tool { "memberIfCount": 4, "members": [ { "ifName": "hostbond4" }, { "ifName": "hostbond1" }, { "ifName": "hostbond2" }, { "ifName": "hostbond3" } ], "vlan": 1001, "vni": 1001, "vxlanIf": "vx-1001" } root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:21 -07:00
Anuradha Karuppiah	72f2674a95	zebra: handle local-es bridge port association A local ES can be added or removed to a bridge after it is created. When it becomes a bridge port member the dataplane attributes need to be programmed. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:21 -07:00
Anuradha Karuppiah	28e80a037f	zebra: changes for programming SPH, non-DF and backup NHG br-port attrs split horizon filter, non-DF block filter and backup nexthop group are passed as bridge port attributes to the dataplane. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:19 -07:00
Anuradha Karuppiah	c60522f702	zebra: dplane APIs for programming evpn-mh access port attributes This includes - 1. non-DF block filter 2. List of es-peers that need to be blocked per-access port (for split horizon filtering) 3. Backup nexthop group to failover local-es via the VxLAN overlay Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:32:51 -07:00
Anuradha Karuppiah	1103c5c6cd	zebra: changes to run DF election 1. DF preference is configurable per-ES ! interface hostbond1 evpn mh es-df-pref 100 >>>>>>>>>>> evpn mh es-id 1 evpn mh es-sys-mac 00:00:00:00:01:11 ! 2. This parameter is sent to BGP and advertised via the ESR. 3. The peer-ESs' DF params are sent to zebra (by BGP) and used for running the DF election. 4. If the local VTEP becomes non-DF on an ES a block filter is programmed in the dataplane to drop de-capsulated BUM packets destined to that ES. Sample output ============= >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> torm-11# sh evpn es Type: L local, R remote, N non-DF ESI Type ES-IF VTEPs 03:00:00:00:00:01:11:00:00:01 LRN hostbond1 27.0.0.16 03:00:00:00:00:01:22:00:00:02 LR hostbond2 27.0.0.16 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> torm-11# sh evpn es 03:00:00:00:00:01:11:00:00:01 ESI: 03:00:00:00:00:01:11:00:00:01 Type: Local,Remote Interface: hostbond1 State: up Ready for BGP: yes VNI Count: 10 MAC Count: 2 DF: status: non-df preference: 100 >>>>>>>> Nexthop group: 0x2000001 VTEPs: 27.0.0.16 df_alg: preference df_pref: 32767 nh: 0x100000d >>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:32:49 -07:00
Donald Sharp	b467b4b462	zebra: Fix prefix2str buf and some invalid data output in zebra_mpls.c There are several places where prefix2str was used to convert a prefix but they were debug guarded and the buffer was used for flog_err/warn. This would lead to corrupt data being output in the failure cases if debugs were not turned on. Modify the code in zebra_mpls.c to not use prefix2str Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 09:38:33 -04:00
Donald Sharp	2919eea86a	zebra: Replace some prefix2str with %pFX We are loading a buffer with the prefix2str results then using it in the debugs throughout functions. Replace with just using %pFX and remove the buffer. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 09:38:29 -04:00
Patrick Ruddy	d7bd0c043c	Merge pull request #7217 from AnuradhaKaruppiah/fix-es-del-regression zebra: fix double clearing of zif->es_info.es	2020-10-26 10:12:54 +00:00
Mark Stapp	874e77acce	Merge pull request #7374 from sworleys/Revert-Revert-NHG-Dependents zebra: Fix the NHG dependents relationship	2020-10-24 16:49:09 -04:00
Mark Stapp	33fa4b14db	Merge pull request #7382 from sworleys/Fix-Msg-Buff zebra: fix unitialized msg header reading at startup	2020-10-23 18:05:04 -04:00
Quentin Young	939bd6ac52	Merge pull request #6788 from mjstapp/thread_cancel_off *: unify thread/task cancel apis	2020-10-23 15:02:50 -04:00
Stephen Worley	9d06e1219a	zebra: fix unitialized msg header reading at startup Fixes the valgrind error we were seeing on startup due to initializing the msg header struct: ``` ==2534283== Thread 3 zebra_dplane: ==2534283== Syscall param recvmsg(msg) points to uninitialised byte(s) ==2534283== at 0x4D616DD: recvmsg (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x43107C: netlink_recv_msg (kernel_netlink.c:744) ==2534283== by 0x4330E4: nl_batch_read_resp (kernel_netlink.c:1070) ==2534283== by 0x431D12: nl_batch_send (kernel_netlink.c:1201) ==2534283== by 0x431E8B: kernel_update_multi (kernel_netlink.c:1369) ==2534283== by 0x46019B: kernel_dplane_process_func (zebra_dplane.c:3979) ==2534283== by 0x45EB7F: dplane_thread_loop (zebra_dplane.c:4368) ==2534283== by 0x493F5CC: thread_call (thread.c:1585) ==2534283== by 0x48D3450: fpt_run (frr_pthread.c:303) ==2534283== by 0x48D3D41: frr_pthread_inner (frr_pthread.c:156) ==2534283== by 0x4D56431: start_thread (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x4E709D2: clone (in /usr/lib64/libc-2.31.so) ==2534283== Address 0x85cd850 is on thread 3's stack ==2534283== in frame #2, created by nl_batch_read_resp (kernel_netlink.c:1051) ==2534283== ==2534283== Syscall param recvmsg(msg.msg_control) points to unaddressable byte(s) ==2534283== at 0x4D616DD: recvmsg (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x43107C: netlink_recv_msg (kernel_netlink.c:744) ==2534283== by 0x4330E4: nl_batch_read_resp (kernel_netlink.c:1070) ==2534283== by 0x431D12: nl_batch_send (kernel_netlink.c:1201) ==2534283== by 0x431E8B: kernel_update_multi (kernel_netlink.c:1369) ==2534283== by 0x46019B: kernel_dplane_process_func (zebra_dplane.c:3979) ==2534283== by 0x45EB7F: dplane_thread_loop (zebra_dplane.c:4368) ==2534283== by 0x493F5CC: thread_call (thread.c:1585) ==2534283== by 0x48D3450: fpt_run (frr_pthread.c:303) ==2534283== by 0x48D3D41: frr_pthread_inner (frr_pthread.c:156) ==2534283== by 0x4D56431: start_thread (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x4E709D2: clone (in /usr/lib64/libc-2.31.so) ==2534283== Address 0xa0 is not stack'd, malloc'd or (recently) free'd ==2534283== ``` Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-23 14:57:29 -04:00
Mark Stapp	5047884528	*: unify thread/event cancel macros Replace all lib/thread cancel macros, use thread_cancel() everywhere. Only the THREAD_OFF macro and thread_cancel() api are supported. Also adjust thread_cancel_async() to NULL caller's pointer (if present). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 12:16:52 -04:00
Mark Stapp	1e4fa7f46c	Merge pull request #7364 from donaldsharp/zebra_nhg_keep zebra: Do not delete nhg's when retain_mode is engaged	2020-10-23 10:28:31 -04:00
Mark Stapp	b3d6bc6ef0	* : update signature of thread_cancel api Change thread_cancel to take a ** to an event, NULL-check before dereferencing, and NULL the caller's pointer. Update many callers to use the new signature. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 08:59:34 -04:00
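A minimal sketch of the cancel-by-double-pointer pattern this commit describes, assuming a simplified event type. `struct demo_event`, `demo_event_new()` and `demo_event_cancel()` are hypothetical names for illustration only, not the lib/thread API:

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for an event object; not FRR's struct thread. */
struct demo_event {
	int id;
};

/* Number of events actually cancelled, kept only so the sketch is observable. */
static int demo_cancelled;

/*
 * Cancel the event that *evp points at. The NULL checks make a repeated
 * cancel harmless, and clearing *evp leaves the caller with no stale pointer.
 */
static void demo_event_cancel(struct demo_event **evp)
{
	if (evp == NULL || *evp == NULL)
		return;

	free(*evp);
	*evp = NULL;
	demo_cancelled++;
}

/* Allocate a demo event; returns NULL on allocation failure. */
static struct demo_event *demo_event_new(int id)
{
	struct demo_event *ev = calloc(1, sizeof(*ev));

	if (ev != NULL)
		ev->id = id;
	return ev;
}
```

Because the callee clears the caller's pointer, calling `demo_event_cancel(&ev)` twice in a row is safe in this sketch: the second call sees NULL and returns.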
Stephen Worley	7fa239f165	zebra: disable dependent backpointers for backup nexthops Because the backup nexthop groups currently are more like pseudo-NHEs (they don't have IDs and are not inserted into the ID table or hashed), they can't really have this depends/dependents relationship yet in both directions. Some work needs to be done there to make them more like first class citizens like "normal" NHGs to enable this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-22 18:10:44 -04:00
Stephen Worley	8459128259	Revert "Revert "zebra: fix NHE dependents backpointer relationship"" This reverts commit `a682deea0f`.	2020-10-22 18:09:44 -04:00
Mark Stapp	9bcef951be	zebra: replace inet_ntoa Stop using inet_ntoa - use %pI4 or inet_ntop instead Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-22 13:37:25 -04:00
Donald Sharp	b1b07ef5a6	zebra: Do not delete nhg's when retain_mode is engaged When `-r` is specified to zebra, on shutdown we should not remove any routes from the fib. This was a problem with nhg's on shutdown due to their ref-count behavior. Introduce a methodology where on shutdown we don't mess with the nexthop groups in the kernel. That way on next startup things will be ok. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-22 08:02:33 -04:00
Donatas Abraitis	2dbe669bdf	:* Convert prefix2str to %pFX Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-10-22 09:07:41 +03:00
Stephen Worley	a682deea0f	Revert "zebra: fix NHE dependents backpointer relationship" This reverts commit `f9f9466e04`.	2020-10-20 17:11:35 -04:00
Donald Sharp	203098301c	Merge pull request #7348 from mjstapp/fix_router_id_lists zebra: clean up all router id lists	2020-10-20 15:53:52 -04:00
Donatas Abraitis	9072f5c89a	Merge pull request #7311 from donaldsharp/table_lock_count Abstract rn->lock accessing and cleanup usage to %pFX and %pRN	2020-10-20 16:04:15 +03:00
Mark Stapp	cdc09a4b04	zebra: clean up all router id lists Clean up the ipv6 router-id lists associated with a zvrf - these were being leaked. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-20 08:48:12 -04:00
Mark Stapp	658084c229	Merge pull request #7289 from sworleys/NHG-Crash-Start zebra: a couple NHG fixes	2020-10-20 08:41:36 -04:00
Stephen Worley	dc1c436278	zebra: add alias for "show ip/ipv6 ro" Add an alias so people can still type `show ip ro`. It became ambiguous in a recent release. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-19 14:08:18 -04:00
Stephen Worley	f9f9466e04	zebra: fix NHE dependents backpointer relationship Apparently the dependents backpointer trees for singletons got broken at some point and we never noticed. There is not really any code making use of this right now, so that is not surprising, but let's go ahead and fix it for zebra and proto NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-19 12:52:39 -04:00
Anuradha Karuppiah	ab06b03315	zebra: fix double clearing of zif->es_info.es This problem was accidentally introduced as a part of another fixup - [ commit `e378f5020d` (anuradhak/mh-misc-fixes, mh-misc-fixes) Author: Anuradha Karuppiah <anuradhak@cumulusnetworks.com> Date: Tue Sep 15 16:50:14 2020 -0700 zebra: fix use of freed es during zebra shutdown ] zif->es_info.es is cleared as a part of zebra_evpn_es_local_info_clear so it cannot be passed around as a pointer from zebra_evpn_local_es_update/del. Because of this bug removing ES from an interface resulted in a zebra crash. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-19 09:36:44 -07:00
Donald Sharp	c85b63238a	Merge pull request #7333 from mjstapp/fix_multi_connected zebra: support multiple connected subnets on an interface	2020-10-18 08:29:19 -04:00
Donald Sharp	c10e14e96d	*: Create/Use accessor functions for lock count Create appropriate accessor functions for the rn->lock data. We should be accessing this data through accessor functions since it is private data to the data structure. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-17 13:39:10 -04:00
Donald Sharp	7e26b9d4a2	zebra: Fix use after free in debug path When zebra is running with debugs turned on there is a use after free reported by the address sanitizer: 2020/10/16 12:58:02 ZEBRA: rib_delnode: (0:254):4.5.6.16/32: rn 0x60b000026f20, re 0x6080000131a0, removing 2020/10/16 12:58:02 ZEBRA: rib_meta_queue_add: (0:254):4.5.6.16/32: queued rn 0x60b000026f20 into sub-queue 3 ================================================================= ==3101430==ERROR: AddressSanitizer: heap-use-after-free on address 0x608000011d28 at pc 0x555555705ab6 bp 0x7fffffffdab0 sp 0x7fffffffdaa8 READ of size 8 at 0x608000011d28 thread T0 #0 0x555555705ab5 in re_list_const_first zebra/rib.h:222 #1 0x555555705b54 in re_list_first zebra/rib.h:222 #2 0x555555711a4f in process_subq_route zebra/zebra_rib.c:2248 #3 0x555555711d2e in process_subq zebra/zebra_rib.c:2286 #4 0x555555711ec7 in meta_queue_process zebra/zebra_rib.c:2320 #5 0x7ffff74701f7 in work_queue_run lib/workqueue.c:291 #6 0x7ffff7450e9c in thread_call lib/thread.c:1581 #7 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #8 0x55555561a578 in main zebra/main.c:455 #9 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 #10 0x5555555e3429 in _start (/usr/lib/frr/zebra+0x8f429) 0x608000011d28 is located 8 bytes inside of 88-byte region [0x608000011d20,0x608000011d78) freed by thread T0 here: #0 0x7ffff768bb6f in __interceptor_free (/lib/x86_64-linux-gnu/libasan.so.6+0xa9b6f) #1 0x7ffff739ccad in qfree lib/memory.c:129 #2 0x555555709ee4 in rib_gc_dest zebra/zebra_rib.c:746 #3 0x55555570ca76 in rib_process zebra/zebra_rib.c:1240 #4 0x555555711a05 in process_subq_route zebra/zebra_rib.c:2245 #5 0x555555711d2e in process_subq zebra/zebra_rib.c:2286 #6 0x555555711ec7 in meta_queue_process zebra/zebra_rib.c:2320 #7 0x7ffff74701f7 in work_queue_run lib/workqueue.c:291 #8 0x7ffff7450e9c in thread_call lib/thread.c:1581 #9 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #10 0x55555561a578 in main zebra/main.c:455 #11 0x7ffff7079cc9 
in __libc_start_main ../csu/libc-start.c:308 previously allocated by thread T0 here: #0 0x7ffff768c037 in calloc (/lib/x86_64-linux-gnu/libasan.so.6+0xaa037) #1 0x7ffff739cb98 in qcalloc lib/memory.c:110 #2 0x555555712ace in zebra_rib_create_dest zebra/zebra_rib.c:2515 #3 0x555555712c6c in rib_link zebra/zebra_rib.c:2576 #4 0x555555712faa in rib_addnode zebra/zebra_rib.c:2607 #5 0x555555715bf0 in rib_add_multipath_nhe zebra/zebra_rib.c:3012 #6 0x555555715f56 in rib_add_multipath zebra/zebra_rib.c:3049 #7 0x55555571788b in rib_add zebra/zebra_rib.c:3327 #8 0x5555555e584a in connected_up zebra/connected.c:254 #9 0x5555555e42ff in connected_announce zebra/connected.c:94 #10 0x5555555e4fd3 in connected_update zebra/connected.c:195 #11 0x5555555e61ad in connected_add_ipv4 zebra/connected.c:340 #12 0x5555555f26f5 in netlink_interface_addr zebra/if_netlink.c:1213 #13 0x55555560f756 in netlink_information_fetch zebra/kernel_netlink.c:350 #14 0x555555612e49 in netlink_parse_info zebra/kernel_netlink.c:941 #15 0x55555560f9f1 in kernel_read zebra/kernel_netlink.c:402 #16 0x7ffff7450e9c in thread_call lib/thread.c:1581 #17 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #18 0x55555561a578 in main zebra/main.c:455 #19 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 SUMMARY: AddressSanitizer: heap-use-after-free zebra/rib.h:222 in re_list_const_first This is happening because we are using the dest pointer after a call into rib_gc_dest. In process_subq_route, we call rib_process() and if the dest is deleted dest pointer is now garbage. We must reload the dest pointer in this case. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-17 08:55:26 -04:00
Mark Stapp	87009d7df0	zebra: support multiple connected subnets on an interface We support configuration of multiple addresses in the same subnet on a single interface: make sure that zebra supports multiple instances of the corresponding connected route. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-16 16:46:33 -04:00
Mark Stapp	027b3ca2e0	Merge pull request #7244 from donaldsharp/mlag_backout_and_fix Mlag backout and fix	2020-10-14 08:30:54 -04:00
Donald Sharp	4fe30ff1eb	Merge pull request #7298 from mjstapp/quiet_opaque_debugs zebra: quiet the zebra opaque message debugs	2020-10-14 07:27:27 -04:00
Donald Sharp	ca3491262b	zebra: Isolate mlag_rd_buf_offset to the actual using function Isolate the mlag_rd_buf_offset variable to the actual used function, instead of having it a global. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-13 16:02:05 -04:00
Donald Sharp	ded3e3e39c	Revert "zebra: the mlag_rd_buf_offset variable was write only" This reverts commit `00e0d113e5`.	2020-10-13 15:57:54 -04:00
Donald Sharp	82b4a8bf2c	Merge pull request #7258 from mjstapp/zebra_remove_slsp zebra: remove 'static' lsp objects	2020-10-13 15:51:18 -04:00
Mark Stapp	674afc2b0a	zebra: quiet the zebra opaque message debugs Put most of the debugs about opaque ZAPI messages under 'detail' to reduce the noise. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-13 14:07:17 -04:00
Donald Sharp	c8c5009ec5	Merge pull request #7288 from rsmarples/BSD-link_state BSD: ifi_link_state is the link state	2020-10-13 13:43:07 -04:00
Stephen Worley	475852b263	zebra: only track NHEs from the dataplane for ID usage Let's just track the NHEs we get from the kernel (dplane) for ID usage with internal routes. I tried to be smart originally and allow them to be re-used internal to zebra but it's proving to cause more bugs than it's worth. This doesn't break any functionality. It just means we won't use NHEs we get from the kernel with our routes, we will create new ones. Decided this based on various bugs seen, with the latest one being on startup with this kernel state: ``` [root@alfred frr-2]# ip next ls id 15 via 192.168.161.1 dev doof scope link proto zebra id 17 group 15 proto zebra [root@alfred frr-2]# ip ro show 3.3.3.1 3.3.3.1 nhid 17 via 192.168.161.1 dev doof ``` Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-13 11:23:57 -04:00
Emanuele Bovisio	2a3a97be8c	doc, zebra: remove keep_kernel option everywhere remove all remaining parts related to keep_kernel option Signed-off-by: Emanuele Bovisio <emanuele.bovisio@eolo.it>	2020-10-13 12:59:50 +02:00
Roy Marples	98f3df554b	zebra: ifi_link_state is the link state SIOCGIFMEDIA returns the media state. SIOCGIFDATA returns interface data which includes the link state. While the status of the former is usually indicative of the latter, this is not always the case. In fact some recent net80211 changes in at least NetBSD and OpenBSD have MONITOR media set to active but the link status set to DOWN. All interfaces will return link state with SIOCGIFDATA, unlike SIOCGIFMEDIA. However not all BSDs support SIOCGIFDATA - it has recently been accepted into FreeBSD-13. However, all BSDs do report the same structure in ifa_data for AF_LINK addresses from getifaddrs(3) so the information has always been available. Signed-off-by: Roy Marples <roy@marples.name>	2020-10-13 11:32:36 +01:00
Stephen Worley	5588801e7a	zebra: add from_dplane info for NHE creation Add a param to the common NHE creation callstack so we can know if this is one we have read in from the dataplane. We can add some logic on how to handle these special ones later. I considered putting this on a struct as a flag or something but it would have required it being put on struct nexthop since we have some `*_find_nexthop()` functions that can be called when given NHEs from the dataplane. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-12 20:39:28 -04:00
Donald Sharp	ba49e033f5	zebra: zevpn cannot be null passed into zebra_evpn_es_evi_show_one_evpn In zebra_evpn_es_evi_show_vni the zevpn pointer if passed into zebra_evpn_es_evi_show_one_evi will crash if it is null and we have code that checks that it is non null and then immediately calls the function. Add a return to prevent a crash. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-10-11 10:47:37 -04:00
Donald Sharp	bc3cd39bc4	zebra: n->mac is derefed in all paths No need to check for n->mac existence, as all paths leading to this code have n->mac already derefed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-10-11 10:47:37 -04:00
Renato Westphal	8b6b6b694d	Merge pull request #7222 from idryzhov/fix-debug fix debug commands node inconsistencies	2020-10-09 21:58:24 -03:00
Mark Stapp	608a57c08b	zebra: remove 'static' lsp objects Use the same lsp and nexthop/nhlfe objects for 'static' and dynamic LSPs; remove the 'static' objects and their supporting code. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-08 15:50:41 -04:00
Renato Westphal	9cfb2747ad	Merge pull request #7241 from chiragshah6/evpn_dev1 lib: add errmsg to nb rpc	2020-10-07 11:50:52 -03:00
Mark Stapp	628995a30c	Merge pull request #7214 from donaldsharp/more_vrf_usefulness zebra: cleanup zebra_rnh.c debugs	2020-10-06 08:29:45 -04:00
Chirag Shah	9bee02322f	zebra: display rpc error msg to vtysh Zebra's clear duplicate detect command is rpc converted. There is a condition where the cli fails with a human readable message. Use northbound's errmsg buffer to display the error message to the user. Testing: bharat# clear evpn dup-addr vni 1002 ip 2011:11::11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state bharat# clear evpn dup-addr vni 1002 ip 11.11.11.11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-10-05 13:57:54 -07:00
Chirag Shah	f63f5f1947	*: add errmsg to nb rpc Display human readable error message in northbound rpc transaction failure. In case of vtysh nb client, the error message will be displayed to user. Testing: bharat# clear evpn dup-addr vni 1002 ip 11.11.11.11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-10-05 13:15:59 -07:00
Mark Stapp	10da81824a	Merge pull request #7219 from donaldsharp/rib_fixes Rib fixes	2020-10-05 09:11:50 -04:00
Roy Marples	355c74b7e9	BSD: Add whitespace between declaration and code Signed-off-by: Roy Marples <roy@marples.name>	2020-10-05 08:10:42 +01:00
Roy Marples	68cd699df5	BSD: Detect route(4) overflows NetBSD and DragonFlyBSD support reporting of route(4) overflows by setting the socket option SO_RERROR. This is handled the same as on Linux by exiting with a -1 error code. Signed-off-by: Roy Marples <roy@marples.name>	2020-10-04 20:32:26 +01:00
Donald Sharp	5c30573e2a	zebra: cleanup zebra_rnh.c debugs a) Use appropriate %p modifiers for output b) Display vrf name in addition to vrf id c) Remove now unused function Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-02 12:15:03 -04:00
Igor Ryzhov	d7b86ae4fe	vtysh: dynamically generate the list of daemons for commands Some daemons were actually missing from the static definitions: nhrpd, babeld, eigrpd and bfdd. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-10-02 15:06:27 +03:00
Igor Ryzhov	dd73744d8c	*: move "show debugging ..." commands to enable node Use the same node for "show debugging" commands in all daemons. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-10-02 15:06:05 +03:00
Donald Sharp	c17b2d5b6b	zebra: Make connected routes their own entry on the meta_q During quick ifdown / ifup events from the linux kernel there exists a situation where a prefix that has both a kernel route and a static route can be queued up on the meta-q. Suppose the static route points at a connected route for nexthop resolution and we receive a series of quick up/down events after the static route and kernel route are queued up for rib reprocessing. Since the static route and kernel route are queued on meta-q 1 and the connected route is also on meta-q 1, the connected route can be resolved after the static route fails to resolve, leaving the static route in an unresolved state. Add a new queue level and put connected routes on their own level, since they are the fundamental building blocks of pretty much all the other routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-01 15:17:06 -04:00
Donald Sharp	9d221fac7e	zebra: When processing route_entries ignore unusable routes When zebra is processing routes to determine what to send to the rib, suppose we have two routes: (a) a route processed earlier, none of whose nexthops were active, and (b) a route that has good nexthops but a worse admin distance. rib_process would not relook at (a)'s nexthops because the ROUTE_ENTRY_CHANGED flag was not set, and (a) would win when compared to (b) because its admin distance was better, leaving us with a state where we would attempt and fail to install route (a) because it was not valid. Modify the code to consider the number of nexthops we have when determining whether we can use the route. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-01 15:17:06 -04:00
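The selection rule in this commit message can be sketched roughly as follows. This is an illustrative model only, assuming a route is reduced to an admin distance and a count of active nexthops; `struct demo_route` and `demo_select_best()` are hypothetical names, not zebra's rib code:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified view of a route entry. */
struct demo_route {
	const char *name;
	uint8_t distance;             /* admin distance, lower is better */
	unsigned int active_nexthops; /* nexthops that resolved as usable */
};

/*
 * Pick the usable route with the lowest admin distance. A route with no
 * active nexthops is skipped even if its distance is better, so it can
 * never win and then fail to install.
 */
static const struct demo_route *
demo_select_best(const struct demo_route *routes, size_t count)
{
	const struct demo_route *best = NULL;

	for (size_t i = 0; i < count; i++) {
		const struct demo_route *r = &routes[i];

		if (r->active_nexthops == 0)
			continue;
		if (best == NULL || r->distance < best->distance)
			best = r;
	}
	return best;
}
```

With route (a) at distance 1 and zero active nexthops and route (b) at distance 20 and two active nexthops, this sketch selects (b).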
Donald Sharp	5c18e66208	zebra: Prevent uninstall attempts when new entry is not happy In rib_process_update_fib, the function is sent two route entries the old ( previously installed ) and new ( the one to install ) When the function detects that the new is unusable because the number of nexthops that are usable for that route is 0, then we uninstall the old route. The problem here is that we should not attempt to uninstall any route that is not owned by FRR. Modify the code to not attempt this behavior Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-30 17:26:44 -04:00
Quentin Young	fb3bc7a74b	Merge pull request #7215 from mjstapp/fix_z_mlag_read zebra: don't touch mlag read event pointer	2020-09-30 16:27:01 -04:00
Mark Stapp	f5d8487244	zebra: don't touch mlag read event pointer Don't touch the mlag read event pointer, it's not safe. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-09-30 13:24:54 -04:00
Mark Stapp	4fdfda2e34	Merge pull request #7167 from donaldsharp/mlag_rd_killer zebra: the mlag_rd_buf_offset variable was write only	2020-09-30 11:40:40 -04:00
Donald Sharp	dbbae374d4	Merge pull request #7192 from deastoe/zebra-fpm-blackhole-abort zebra: fix FPM abort for unreach/prohibit routes	2020-09-29 13:47:38 -04:00
Patrick Ruddy	aa1f6a8795	Merge pull request #7188 from chiragshah6/evpn_dev zebra: EVPN avoid duplicate list-node in l3vni's l2vni-list	2020-09-29 16:33:19 +01:00
Duncan Eastoe	94f7786375	zebra: fix FPM abort for unreach/prohibit routes `b0e9567ed1` fixed an issue whereby zebra would abort while building an update for a blackhole route. The same issue, `assert(data_len)` failing in `zfpm_build_route_updates()`, can be observed when building updates for unreachable and prohibit routes. To address this `netlink_route_info_fill()` is updated to not indicate failure, due to lack of nexthops, for any blackhole routes. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-09-29 12:59:30 +01:00
Donald Sharp	a24d04f4db	zebra: Make nexthop_active check use the same debug When debugging why a route was not successfully installed into the rib, it would be preferable that the end user only have to turn on `debug zebra rib detail` as that is what we have been telling people to do for the last couple of years. Consolidate back to this. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:54:35 -04:00
Donald Sharp	81194feec9	zebra: Add missing reason we could not make an active_nexthop check Add a missing reason as to why we are unable to make an active nexthop check be successful. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:45:19 -04:00
Chirag Shah	c7e83a4efe	zebra: avoid duplication node in l3vni l2vni-list An l2vni flap leads to duplicate entry creation in the l3vni's l2vni-list. Use a sorted list add with no duplicates. root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 1002 Ticket:CM-31545 Reviewed By: Testing Done: With Fix: After multiple flaps the vni counts remained the same. root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# net show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1002 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-09-28 21:44:30 -07:00
Stephen Worley	66c28560ba	zebra: set NHG/backup NHG pointers on success zapi read Only set the NHG/backup NHG pointers of the caller if the read of the nexthops was successful. Otherwise, we might free when not necessary or double free. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
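As a rough illustration of a sorted add that refuses duplicates, here is a sketch over a plain singly linked list of VNI numbers. `struct vni_node`, `vni_list_add_unique()` and `vni_list_count()` are hypothetical helpers written for this example, not the list API zebra uses:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical list node holding one L2 VNI number. */
struct vni_node {
	uint32_t vni;
	struct vni_node *next;
};

/*
 * Insert vni in ascending order. Returns false, and leaves the list
 * untouched, if the value is already present or allocation fails.
 */
static bool vni_list_add_unique(struct vni_node **head, uint32_t vni)
{
	struct vni_node **pp = head;
	struct vni_node *node;

	/* Walk to the first node whose value is not smaller than vni. */
	while (*pp != NULL && (*pp)->vni < vni)
		pp = &(*pp)->next;

	if (*pp != NULL && (*pp)->vni == vni)
		return false;

	node = malloc(sizeof(*node));
	if (node == NULL)
		return false;

	node->vni = vni;
	node->next = *pp;
	*pp = node;
	return true;
}

/* Count the nodes in the list. */
static size_t vni_list_count(const struct vni_node *head)
{
	size_t n = 0;

	for (; head != NULL; head = head->next)
		n++;
	return n;
}
```

Re-adding VNI 1002 after a simulated flap leaves the count unchanged in this sketch, which matches the "With Fix" output in the commit message.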
Stephen Worley	2173535298	lib,zebra,sharpd: add code for backup proto-NHs but disabled Add the zapi code for encoding/decoding of backup nexthops for when we are ready for it, but disable it for now so that we revert to the old way with them. When zebra gets a proto-NHG with a backup in it, we early fail and tell the upper level proto. In this case sharpd. Sharpd then reverts to the old way of installation with the route. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	aaa42e056f	zebra: add type to nhg_prot_del API for sanity check Add type to the nhg_proto_del API params for sanity checking that the types of the route sent by the proto matches the type found with the ID. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	841f77ff04	zebra: free ctx if we skip replace for NHG PROTO routes Free the ctx if we decide we don't need to do anything with this route update. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	3d3a9dc8a7	zebra: limit no re-install to NHG PROTO using routes Limit the not re-installation of routes with the same NHG ID to routes that are using the new NHG PROTO API. This would only include sharpd and EVPN-MH for now. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8f830b8c64	zebra: use list to mark for removal when scoring In scoring our NHEs during shutdown there is a chance we could release multiple NHEs at the same time during one iteration. This can cause memory corruption if the two being released are directly next to each other in the hash table. hash_iterate accounts for releasing one entry during the iteration by saving hbnext before the release, but not two: if hbnext is also freed, we obviously can have a problem. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
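A minimal sketch of the "mark first, release afterwards" idea from this commit title, assuming a single chain of entries in place of a hash table. All names here (`struct demo_nhe`, `demo_sweep()`, and so on) are hypothetical and the fixed-size mark array is a simplification:

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical entry on one bucket chain. */
struct demo_nhe {
	int id;
	bool stale;
	struct demo_nhe *next;
};

/* Push a new entry at the front of the chain; returns NULL on failure. */
static struct demo_nhe *demo_push(struct demo_nhe **head, int id, bool stale)
{
	struct demo_nhe *e = calloc(1, sizeof(*e));

	if (e == NULL)
		return NULL;
	e->id = id;
	e->stale = stale;
	e->next = *head;
	*head = e;
	return e;
}

/* Pass 1: only record stale entries; the chain is not modified here. */
static size_t demo_collect_stale(struct demo_nhe *head,
				 struct demo_nhe **out, size_t cap)
{
	size_t n = 0;

	for (struct demo_nhe *e = head; e != NULL; e = e->next)
		if (e->stale && n < cap)
			out[n++] = e;
	return n;
}

/* Unlink one entry from the chain and free it. */
static void demo_release(struct demo_nhe **head, struct demo_nhe *victim)
{
	for (struct demo_nhe **pp = head; *pp != NULL; pp = &(*pp)->next) {
		if (*pp == victim) {
			*pp = victim->next;
			free(victim);
			return;
		}
	}
}

/*
 * Pass 2 runs only after iteration has finished, so releasing adjacent
 * entries cannot invalidate a saved "next" pointer. Returns the number
 * released (at most 16 per call in this sketch).
 */
static size_t demo_sweep(struct demo_nhe **head)
{
	struct demo_nhe *marked[16];
	size_t n = demo_collect_stale(*head, marked, 16);

	for (size_t i = 0; i < n; i++)
		demo_release(head, marked[i]);
	return n;
}
```

The design point is that nothing is freed while the chain is being walked, so two neighbouring stale entries are handled the same way as one.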
Stephen Worley	70f3cda6c1	zebra: reject proto NHGs of blackhole/interface Reject proto NHGs of type blackhole/interface for now. We need to think a bit more about how to resolve these given the linux kernel needs to know the Address Family of the routes that will use them and install it with them. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	73937edb73	zebra,sharpd: checkpatch fixes Checkpatch fixes for NHG API patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	ff9aca4f8d	lib,zebra,sharpd: clang format Clang format for NHG API and sharpd patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8b2d3a0fb6	zebra: clean up the NHG proto zapi code a bit Clean up the function names and remove some TODOs that are no longer needed/hacks we used for testing. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	e270f004ae	zebra: multipath number checks with NHG proto Get the multipath number checks working with proto-based NHG message decoding in zapi_msg.c Modify the function that checks this for routes to work without being passed a prefix as is the case with NHG creates. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8155e8c592	zebra: add flag track released state of proto NHGS Add a flag to track the released state of a proto-based NHG. This flag is used to know whether the upper level proto has called the *_del API. Typically, the NHG would just get removed and uninstalled at this point but there is a chance we are being sent it while routes are still being owned or we were sent it multiple times. This flag and associated code handles that. Ticket: CM-30369 Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70347b7ad6	zebra: reply fail on NHG add if not ifindex/onlink We currently don't support ADD/DEL/REPLACE with proto-based NHGs that are not already fully resolved and ifindex/onlink based. If we are handed one that doesn't have ifindex set, i.e. recursive, gracefully fail with a notification. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2c7819b9d4	lib,zebra: fixup NHG notify zapi messaging Make the message parameters align better with other zapi notifications and change the ID to correctly be a uint32. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Donald Sharp	27805e74f0	zebra: Properly set NEXTHOP_FLAG_FIB when skipping install When the dataplane detects that we have no need to reinstall the same route, setup the NEXTHOP_FLAG_FIB appropriately. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	e3b9c0f2f6	zebra: Only install a minimal amount of times The code was installing the nexthop group again using the NLM_F_REPLACE function causing extremely large route installation times. This reduces the time for installing 1 million routes from sharpd with a nhg from > 200 seconds ( where I gave up ) to ~15 seconds on my machine for 32 x ecmp. As a side note 1 million routes using master sharpd takes ~50 seconds to do the same thing. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	72938edfbc	zebra: add logging for NHG ignoring in netlink Add some logging for when we choose to ignore a NHG install for one reason or another. Also, cleanup some of the code using the same accessor functions for the context object. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	9c6c48bc10	zebra: return the proto nhe on del even with refs Return the proto nhe on del even if there are still possible route references. We may get a del before the routes are removed. So we still need to return this to the caller so they can decrement the ref. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	3bccc0f5eb	zebra: fix releasing proto-owned singletons Fix the releasing of proto-owned singletons from the attribute hashed table. Proto-owned singleton nexthops are hashed so they can still be shared therefore they are present in this table and need to be released when the time comes. This check was only matching on zebra proto before. Changed to match IDs in zebra allocated range. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	f651b708e0	zebra: increment the nhg proto score iterator Increment the nhg proto score iterator we used to count leftover NHGs after client disconnect and log. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	1f65568046	zebra: fix refcnt/rib issues in NHG replace/delete Fix some reference counting issues seen when replacing a NHG and deleting one. For replacement, we should end with the same refcnt on the new one. For delete, it's the caller's job to decrement its ref after it's done with it. Further, update routes in the rib with the new pointer after replace. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	68671c7439	zebra: warn if zapi NHG add has no nexthops Log a warning and return if we receive a NHG add via zapi that has no nexthops. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	24db1a7b9a	zebra: handle proto NHG uninstall client disconnect Add code to handle proto-based NHG uninstalling after the owning client disconnects. This is handled the same way as rib_score_proto() but for now we are ignoring instance. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6fae63d2ba	zebra: inc/dec refcount on add/del NHG proto When we add a proto NHG, increment the refcount, when we del a proto NHG, decrement the refcount rather than deleting it explicitly. If the upper level proto is handling it properly, it should get decremented to zero when we receive a NHG del. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
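The add/del reference counting described here might look roughly like the sketch below. It assumes a proto add and each additional user both take one reference; `struct demo_nhg`, `demo_nhg_add_ref()` and `demo_nhg_del_ref()` are hypothetical names, not zebra's NHG functions:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, simplified nexthop group object. */
struct demo_nhg {
	uint32_t id;
	unsigned int refcnt;
	bool installed;
};

/* Take one reference, for example on a proto NHG add. */
static void demo_nhg_add_ref(struct demo_nhg *nhg)
{
	nhg->refcnt++;
	nhg->installed = true;
}

/*
 * Drop one reference, for example on a proto NHG del. The group is
 * uninstalled only when the last reference goes away; returns true in
 * exactly that case.
 */
static bool demo_nhg_del_ref(struct demo_nhg *nhg)
{
	if (nhg->refcnt == 0)
		return false; /* nothing to drop, ignore the extra del */

	if (--nhg->refcnt > 0)
		return false; /* still referenced elsewhere */

	nhg->installed = false;
	return true;
}
```

In this sketch a del that arrives while another reference is still held only decrements the count, so the group stays installed until the last holder lets go.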
Stephen Worley	2d8a9c544b	zebra: remove unneeded nhg repalce boilerplate Remove some leftover boilerplate from the old replace code path. That code ended up in the add API so it's no longer needed. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	df3cef24c5	zebra: Prevent duplicate re-install If we have received a route that is exactly the same as the already existing route, just note that it happened and move on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2b5ecd4ca6	zebra: fix route validity check with NHG ID Fix check in zread where we determine validity of a route based on reading in nexthops/checking ID is present. We had a bad conditional that was determining a route is bad if it's not NHG ID based. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	cd53e3a6e6	zebra: use the passed proto from zapi We were hard coding proto bgp for use with the NHG creation. Use the actual passed one from zapi now that it exists. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	ac5d1091dc	zebra: make NHG ID allocation smarter Make NHG ID allocation smarter so it wraps once it hits the lower bound for protos and performs a lookup to make sure we don't already have that ID in use. It's pretty unlikely we would wrap since the ID space is somewhere around 24 million for Zebra at this point in time. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
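A small sketch of a wrapping ID allocator with an in-use lookup, in the spirit of this commit message. The bound of 9 and the boolean array are assumptions chosen to keep the example tiny; `DEMO_PROTO_ID_LOWER_BOUND`, `struct demo_id_alloc` and `demo_id_alloc_next()` are hypothetical and unrelated to zebra's real macros or ID table:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical bound: IDs 1..8 belong to this allocator, 9 and up do not. */
#define DEMO_PROTO_ID_LOWER_BOUND 9u

struct demo_id_alloc {
	uint32_t counter;                       /* last ID handed out */
	bool in_use[DEMO_PROTO_ID_LOWER_BOUND]; /* stand-in for an ID lookup */
};

/*
 * Return the next free ID below the proto lower bound, wrapping back to 1
 * when the bound is reached and skipping IDs that are already in use.
 * Returns 0 if every ID in the range is taken.
 */
static uint32_t demo_id_alloc_next(struct demo_id_alloc *a)
{
	for (uint32_t tries = 0; tries < DEMO_PROTO_ID_LOWER_BOUND - 1; tries++) {
		a->counter++;
		if (a->counter >= DEMO_PROTO_ID_LOWER_BOUND)
			a->counter = 1; /* wrap instead of entering proto space */

		if (!a->in_use[a->counter]) {
			a->in_use[a->counter] = true;
			return a->counter;
		}
	}
	return 0;
}
```

Starting with the counter at 7 and ID 1 already taken, this sketch hands out 8, then wraps and returns 2 because the lookup rejects 1.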
Stephen Worley	54c89c9377	zebra: NHG ID bounds macros Determine the NHG ID spacing and lower bound with ZEBRA_ROUTE_MAX in macros. Directly set the upperbound to be the lower 28bits of the uint32_t ID space (the top 4 are reserved for l2-NHGs). Round that number down a bit to make it more even. Convert all former lower_bound calls to just use the macro. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	16b20ad062	zebra: don't update counter if outside of zebra ID range When we receive a NHG from the kernel, we set the ID counter to that to avoid using IDs owned by the kernel. If we get one outside of zebra's range, let's not update it since it's probably one we created and never deleted anyway. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2c41ef8c17	zebra: special handling for proto-NHG-based routes For now let's assume proto-NHG-based routes are good to go (we assume they are onlink/interface based anyway) and bypass route resolution altogether. Once we determine how to handle recursive nexthop-resolution for proto-NHGs we will revisit this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	50db3f2f1d	zebra: handle zapi routes with NHG ID set Add code to properly handle routes sent with NHG ID rather than a nexthop_group. For now, we separate this from backup nexthop handling since that should probably be added to the nhg_proto_add calls. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	dd1e105fe3	zebra: implement NHG proto replace Implement the ability to replace an NHG sent down from an upper level proto. With proto-owned NHGs, we make the assumption they are ecmp and always treat them as a group to make the replace from 1 -> 2 and 2 -> 1 quite a bit easier. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	08da8bbc22	zebra: hash proto-created but zebra ID spaced NHGs To prevent duplication of singleton NHGs, let's hash any zebra-ID spaced NHGs sent from an upper level proto. These would be singleton NHGs anyway and should prevent duplication of dataplane installs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6c67f41f9e	zebra,lib: command to only install proto-based nexthops Add a command/functionality to only install proto-based nexthops. That is, nexthops owned/created by upper level protocols, not ones implicitly created by zebra. There are some scenarios where you would not want zebra to be arbitrarily installing nexthop groups but you still want to use ones you have control over via lib/nexthop_group config and an upper level protocol. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	0885b1e3d9	zebra: implement protocol NHG Add/Del Implement the underlying zebra functionality to Add/Del an internal zebra and kernel NHG. These NHGs are managed by the upper level protocols that send them down via zapi messaging. They are not put into the overall zebra NHG hash table and only put into the ID table. Therefore, different protos cannot and will not share NHGs. The proto is also set appropriately when sent to the kernel. Expand the separation of Zebra hashed/shared/created NHGs and proto created and managed NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	5b27c09d4e	zebra: remove NHG unhashable flag and its code Remove the code for setting a NHG as unhashable. Originally this was to prevent us from attempting to put duplicates from the kernel in our hashtable. Now I think it's better to not use them in the hashtable at all and only track them in the ID table. Routes will still be able to use them if they specify the ID explicitly when sending Zebra the route, but 'normal' routes we hash the nexthop group on will not. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	27141ea94e	lib, zebra: Add ability to send down a nhgid over route install Modify the send down of a route to use the nexthop group id if we have one associated with the route. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	2f35a820bf	lib, zebra: Add ZAPI_NHG_ADD|DELETE Add the ability to send a NHG from an upper level protocol down to zebra. ZAPI_NHG_ADD encompasses both the addition and replace semantics (if the id passed down does not exist yet, it's an add, else it's a replace). Effectively zebra will take this nhg passed down, save the nhg in the id hash for nhg's, then create the appropriate nhg's and finally install them into the linux kernel. Notification will be the ZAPI_NHG_NOTIFY_OWNER zapi message for normal success/failure messaging to the installing protocol. This work is being done to allow us to work with EVPN MH which needs the ability to modify NHG's that BGP will own and operate on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	f70da2a390	zebra: Refactor nexthop reading from zapi messages Take the zebra code that reads nexthops and combine it into one function so that when we add zapi messages to send/receive nexthops we can take advantage of this function. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	786a9bd9eb	zebra: Convert zserv_nexthop_num_warn to return bool Allow us to key off the warning if we have one. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donatas Abraitis	b1f476731a	Merge pull request #7169 from donaldsharp/some_code_cleanup Some code cleanup	2020-09-25 10:19:34 +03:00
Sri Mohana Singamsetty	46dd92c522	Merge pull request #7164 from AnuradhaKaruppiah/mh-misc-fixes evpn-mh: miscellaneous cleanup/fixes	2020-09-24 08:37:45 -07:00
Donald Sharp	9781e6a047	zebra: Don't ignore setsockopt return When attempting to limit the amount of data sent from the kernel to FRR, some kernels we can run against may not have this ability in which case the setsockopt will fail. Notice that in the log. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-24 07:42:51 -04:00
Rafael Zalamena	eead0bc46b	zebra: human readable netlink dumps Add new compile option to enable human readable netlink dumps with `debug zebra kernel msgdump`. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-09-23 23:07:02 -03:00
Donald Sharp	00e0d113e5	zebra: the mlag_rd_buf_offset variable was write only The mlag_rd_buf_offset variable was only ever being set to 0 in the mlag_read function and only written in that function. There is no need for this global variable. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 20:36:51 -04:00
Mark Stapp	ccda0eadac	Merge pull request #7155 from donaldsharp/TRAP Offload/Trap	2020-09-23 16:06:37 -04:00
Mark Stapp	4020564a3c	Merge pull request #7163 from donaldsharp/zebra_mlag_bugs Zebra mlag bugs	2020-09-23 15:32:31 -04:00
Anuradha Karuppiah	e378f5020d	zebra: fix use of freed es during zebra shutdown This problem was reported by the sanitizer - ================================================================= ==24764==ERROR: AddressSanitizer: heap-use-after-free on address 0x60d0000115c8 at pc 0x55cb9cfad312 bp 0x7fffa0552140 sp 0x7fffa0552138 READ of size 8 at 0x60d0000115c8 thread T0 #0 0x55cb9cfad311 in zebra_evpn_remote_es_flush zebra/zebra_evpn_mh.c:2041 #1 0x55cb9cfad311 in zebra_evpn_es_cleanup zebra/zebra_evpn_mh.c:2234 #2 0x55cb9cf6ae78 in zebra_vrf_disable zebra/zebra_vrf.c:205 #3 0x7fc8d478f114 in vrf_delete lib/vrf.c:229 #4 0x7fc8d478f99a in vrf_terminate lib/vrf.c:541 #5 0x55cb9ceba0af in sigint zebra/main.c:176 #6 0x55cb9ceba0af in sigint zebra/main.c:130 #7 0x7fc8d4765d20 in quagga_sigevent_process lib/sigevent.c:103 #8 0x7fc8d4787e8c in thread_fetch lib/thread.c:1396 #9 0x7fc8d4708782 in frr_run lib/libfrr.c:1092 #10 0x55cb9ce931d8 in main zebra/main.c:488 #11 0x7fc8d43ee09a in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x2409a) #12 0x55cb9ce94c09 in _start (/usr/lib/frr/zebra+0x8ac09) ================================================================= Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:20:13 -07:00
Anuradha Karuppiah	4d8b658c8c	zebra: evpn-mh: add error logs on ES processing failures Cleanup some of the XXX added during development of MH. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:15:08 -07:00
Donatas Abraitis	5fde152be6	Merge pull request #7112 from AnuradhaKaruppiah/mac-neigh-ht evpn-mh: mac-ip sync hold timers	2020-09-23 21:11:56 +03:00
Patrick Ruddy	a3b5e4fdf7	Merge pull request #7157 from donaldsharp/nhg_speeds zebra: Move debug information gathering to inside guard	2020-09-23 18:42:00 +01:00
Donald Sharp	c19808acad	zebra: Increase the read/write mlag buffer sizes The read/write mlag buffer sizes of 2k were sufficient for ~100 S,G notifications at one go. Increase to 32k to give us 16 times the space. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:13:03 -04:00
Donald Sharp	7692744f2c	zebra: Ensure that message received from mlag will fit If we receive a message that is greater than our buffer size we are in a situation where both the read and write buffers are fubar'ed beyond the end. Assert when we notice this fact. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:26 -04:00
Donald Sharp	f24d9ab667	zebra: modify mlag code to only need 1 stream when generating data The normal pattern of writing the type/length at the beginning of the packet was not being quite followed. Modify the mlag code to respect the proper way of doing things and get rid of a stream_new and copy. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:20 -04:00
Anuradha Karuppiah	2b9e207e0e	zebra: stop neigh hold timer when the neigh is deleted The neigh hold timer was firing after the neigh was deleted resulting in the following crash - [ at ./zebra/zebra_evpn_neigh.h:155 at zebra/zebra_evpn_neigh.c:447 at lib/thread.c:1578 at zebra/main.c:488 ] Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Don Slice	f9f0463fb9	zebra: fix deletion of evpn mh neigh-holdtime Found that the command "evpn mh neigh-holdtime" can be set but not deleted. This fix solves the delete process Signed-off-by: Don Slice <dslice@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Anuradha Karuppiah	41c809b2a8	zebra: changes for configuring mac and neigh holdtime When an ES peer withdraws a MAC-IP route we hold the entry for N seconds to allow an external daemon (neighmgr) to establish host reachability independent of the peer. Add config commands to allow the user to set this holdtime (N). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Donald Sharp	aa178efd49	Merge pull request #7148 from pguibert6WIND/fix_fd_not_closed zebra: fix fd going out of scope leaks the handle	2020-09-23 07:40:14 -04:00
Donatas Abraitis	0ce5baaab1	Merge pull request #7018 from gouault6wind/show_ip_route Clean up in vrf management	2020-09-23 08:45:09 +03:00
Donald Sharp	bed74d178e	zebra: Move debug information gathering to inside guard Let's not make the entire `depend_finds` function pay for the data gathering needed for the debug. There are numerous other places in the code that check the NEXTHOP_FLAG_RECURSIVE and do the same output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 20:47:33 -04:00
Sri Mohana Singamsetty	efdd997dad	Merge pull request #7116 from AnuradhaKaruppiah/mh-neigh-fixes evpn-mh: changes for programming synced neighs as static in the dataplane	2020-09-22 15:45:09 -07:00
Mark Stapp	b6033bd1c1	Merge pull request #7067 from donaldsharp/remove_solaris Remove solaris	2020-09-22 17:04:19 -04:00
Donald Sharp	5a3cf85391	lib, zebra: Add ability to read kernel notice of TRAP/OFFLOAD The linux kernel is getting RTM_F_TRAP and RTM_F_OFFLOAD for kernel routes that have an underlying asic offload. Write the code to receive these notifications from the linux kernel and to store that data for display about the routes. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-22 15:57:43 -04:00
Donald Sharp	4c56ce1cea	zebra: Add basic knowledge of asic offload available Some linux kernels are starting to support the idea of knowledge about the underlying asic. Add a boolean that we can set/unset to track whether or not we think the router has this functionality available. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 15:57:43 -04:00

4314 Commits