Derive the rule family from src if available, otherwise from dst if
available, otherwise assume ipv4. We only support ipv4/ipv6 currently,
so if we can't tell from the src/dst it must be ipv4 and is likely a
dsfield match.
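A rough sketch of that fallback, assuming FRR's struct prefix from
lib/prefix.h (the helper name itself is hypothetical):

#include <sys/socket.h>         /* AF_INET */

/* hypothetical helper, not the actual zebra hunk */
static uint8_t rule_family(const struct prefix *src,
                           const struct prefix *dst)
{
        if (src)
                return src->family;
        if (dst)
                return dst->family;

        /* no src/dst at all - assume ipv4 (e.g. a dsfield match) */
        return AF_INET;
}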
Signed-off-by: Stephen Worley <sworley@nvidia.com>
When a new ES is created it is held in a non-DF state for 3 seconds,
as specified by RFC 7432. This gives the switch time to import the
Type-4 routes from the peers, and gives the peers time to receive the
new Type-4 route.
root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"|grep DF
DF status: non-df
DF delay: 00:00:01
DF preference: 50000
root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"|grep DF
DF status: df
DF preference: 50000
root@torm-11:mgmt:~#
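A minimal sketch of how such a hold timer might be armed on local ES
creation, using FRR's thread timer API (the constant, callback and es
fields are assumptions, not the actual zebra_evpn_mh.c names):

#define EVPN_MH_DF_DELAY_SECS 3         /* per RFC 7432 */

/* hypothetical expiry handler: the hold period is over, re-run the DF
 * election so this ES can take over DF duties if it wins */
static int zebra_evpn_es_df_delay_exp(struct thread *t)
{
        struct zebra_evpn_es *es = THREAD_ARG(t);

        zebra_evpn_es_run_df_election(es);      /* hypothetical helper */
        return 0;
}

/* at local ES creation: stay non-DF until the timer pops */
thread_add_timer(zrouter.master, zebra_evpn_es_df_delay_exp, es,
                 EVPN_MH_DF_DELAY_SECS, &es->df_delay_timer);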
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When all the uplinks go down the VTEP is disconnected from the
VxLAN overlay; this was handled by proto-downing the ES bonds. When
the uplinks come up again we need to re-enable the ES bonds, but that
needs to be done after a delay to allow the EVPN network to converge.
That is done by firing off the startup-delay timer on the first
uplink-up.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
1. When a bond is associated with an ES we may need to re-sync
the dplane protodown state (which may be stale or set by some other
app).
2. Also change the uplink state display to avoid confusion with the
protodown reason code (both used to be shown as "uplink-up").
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
The protodown state is a combination of the dplane and zebra states,
while the protodown reason is maintained exclusively by zebra. Display
this information on two separate lines to make that ownership clearer.
Also display n/a for bonds, as the dplane doesn't support protodowning
the bond device.
Sample output -
==============
root@torm-11:mgmt:~# vtysh -c "show interface hostbond1"|grep -i protodown
protodown: off (n/a)
protodown reasons: (uplinks-down)
root@torm-11:mgmt:~# vtysh -c "show interface swp5"|grep -i protodown
protodown: on
protodown reasons: (uplinks-down)
root@torm-11:mgmt:~#
PS: Cosmetic changes only, no functional change.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
The code for this was already there but was not kicking in because of a
zebra local reason-code dup check. Even if the reason-code is the same,
if the dplane and zebra disagree about the protodown state zebra will
need to re-program the dplane.
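A hedged sketch of the corrected check (variable names are
assumptions, not the actual zebra hunk):

/* a matching reason code alone is not a duplicate; the dplane's
 * protodown state has to agree as well before we can skip the update */
if (new_protodown_rc == zif->protodown_rc &&
    new_protodown == dplane_protodown)
        return 0;       /* genuine duplicate - nothing to re-program */

/* reason or state differs: push protodown to the dplane again */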
Fixed a couple of spelling errors in the protodown logs to make greps
easy.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
New work enqueued to the dplane_fpm_nl provider is initially de-queued
and re-enqueued, in fpm_nl_process(), to be processed by the provider's
own thread.
After performing this initial de-queue/enqueue we return to
dplane_thread_loop() and check the dplane_fpm_nl output queue for any
work which has been completed.
Since this work is being processed in another thread it is very likely
that there will be some (or all) work still outstanding at this point.
The dataplane thread finishes up any other tasks and then waits until
it is next scheduled. In the meantime the dplane_fpm_nl thread is
processing its work queue until completion.
The issue arises here as the dataplane thread is not explicitly
re-scheduled once dplane_fpm_nl has drained its work queue and
populated its output queue with completed work.
This completed work can sit in the output queue for an indeterminate
period of time, depending upon when the dataplane thread is next
scheduled for other work. If the RIB has reached a stable state then
this could be a significant period of time. During this period zebra
marks these routes as queued, even though they have actually been
processed by all dataplane providers.
An unrelated RIB change which triggers a FIB update will result in
the dataplane thread being scheduled and this completed work then
being processed. At this point the routes will then no longer be
marked as queued by zebra. However this new FIB update might itself
then fall victim to the same scenario!
We can observe the above behaviour in these detailed dplane logs.
11:24:47 zebra[7282]: dplane: incoming new work counter: 2
11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel'
11:24:47 zebra[7282]: dplane provider 'Kernel': processing
11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9
11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9
11:24:47 zebra[7282]: dplane dequeues 2 completed work from provider Kernel
11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl'
11:24:47 zebra[7282]: dplane dequeues 1 completed work from provider dplane_fpm_nl
11:24:47 zebra[7282]: dplane has 1 completed, 0 errors, for zebra main
2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good.
1 completed context was de-queued, so there is outstanding work.
11:24:58 zebra[7282]: dplane: incoming new work counter: 2
11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel'
11:24:58 zebra[7282]: dplane provider 'Kernel': processing
11:24:58 zebra[7282]: ID (193) Dplane nexthop update ctx 0x55c429b6fed0 op NH_INSTALL
11:24:58 zebra[7282]: 0:5.5.5.5/32 Dplane route update ctx 0x55c429b79690 op ROUTE_INSTALL
11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider Kernel
11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl'
11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl
11:24:58 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main
A further 2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good.
2 completed contexts were de-queued, which sounds good as that is what we en-queued.
However, there is an outstanding context from earlier, so there is still outstanding
work.
Indeed the new 5.5.5.5/32 route is marked as queued:
O>q 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:01:19
This remains the case until we trigger a FIB update by installation of the
(eg.) 10.10.10.10/32 route:
11:26:41 zebra[7282]: dplane: incoming new work counter: 2
11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel'
11:26:41 zebra[7282]: dplane provider 'Kernel': processing
11:26:41 zebra[7282]: ID (195) Dplane nexthop update ctx 0x55c429b78ce0 op NH_INSTALL
11:26:41 zebra[7282]: 0:10.10.10.10/32 Dplane route update ctx 0x55c429b7a040 op ROUTE_INSTALL
11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider Kernel
11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl'
11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl
11:26:41 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main
11:26:41 zebra[7282]: zebra2proto: Please add this protocol(2) to proper rt_netlink.c handling
11:26:41 zebra[7282]: Nexthop dplane ctx 0x55c429b6fed0, op NH_INSTALL, nexthop ID (193), result SUCCESS
11:26:41 zebra[7282]: default(0:254):5.5.5.5/32 Processing dplane result ctx 0x55c429b79690, op ROUTE_INSTALL result SUCCESS
We observe the same 2 enqueues and 2 dequeues as before, which again suggests
that there is outstanding work.
As expected, the 5.5.5.5/32 route is no longer marked as queued:
O>* 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:02:06
But the 10.10.10.10/32 route is, as we have not yet processed the completed
context:
C>q 10.10.10.10/32 is directly connected, lo, 00:26:05
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
Returns the current number of (completed) contexts in the provider's
output queue (dp_ctx_out_q), allowing access to this data from the
provider itself.
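A minimal sketch of what such an accessor could look like (the locking
macros and counter name are assumptions based on the surrounding
zebra_dplane.c conventions):

uint32_t dplane_provider_out_ctx_queue_count(struct zebra_dplane_provider *prov)
{
        uint32_t count;

        /* dp_ctx_out_q is shared with the dataplane thread, so take
         * the provider lock (or use an atomic counter) while reading */
        DPLANE_PROV_LOCK(prov);
        count = prov->dp_out_queued;    /* assumed count of contexts
                                           currently on dp_ctx_out_q */
        DPLANE_PROV_UNLOCK(prov);

        return count;
}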
Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>
The following functions, which are part of the label-manager
implementation, are not called from outside their own file. All of the
label-manager code currently lives in zebra/label_manager.c, so these
functions should be unexposed.
Functions:
- create_label_chunk
- assign_label_chunk
- delete_label_chunk
- release_label_chunk
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
In the case where the namespace pointer is already available, feed it
at vrf creation. This prevents a crash if netlink parsing has already
begun while vrf-lite is not yet enabled.
Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>
The following functions use writen() to dispatch messages to the
socket, while other functions use zserv_send_message(). This commit is
a small unification of zapi's socket messaging (sketched below).
Funcs:
- zsend_assign_label_chunk_response()
- zsend_label_manager_connect_response()
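A hedged before/after sketch of the change in those two functions
(context simplified, not the exact hunks):

/* before: write the stream buffer straight to the client socket */
writen(client->sock, s->data, stream_get_endp(s));

/* after: hand the stream to the common zserv send path */
return zserv_send_message(client, s);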
Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>
The `show ip nht` and `show ipv6 nht` commands were broken.
This is because the recent commit 0154d8ce45 assumed that p must not
be NULL, which is not always the case.
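The shape of the fix, illustratively (variable names assumed):

/* p is NULL when no prefix argument was given on the command line,
 * so only use it as a filter when it is actually present */
if (p && !prefix_match(p, &rn->p))
        continue;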
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Add a bit of code to allow bgp to send the AS-Path associated with
the route being installed to zebra so it can be displayed and
used as part of the `show ip route A` command in zebra.
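A hedged sketch of the sending side in bgpd (the opaque flag and field
names are assumptions about the zapi opaque mechanism, not the exact
change):

/* render the path ("60000 64539 15096 6939 8075") and tuck it into
 * the opaque blob carried with the zapi route */
const char *aspath_str = aspath_print(attr->aspath);

if (aspath_str && strlen(aspath_str) + 1 <= sizeof(api.opaque.data)) {
        SET_FLAG(api.message, ZAPI_MESSAGE_OPAQUE);
        api.opaque.length = strlen(aspath_str) + 1;
        memcpy(api.opaque.data, aspath_str, api.opaque.length);
}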
eva# show ip route 20.0.0.0/11
Routing entry for 20.0.0.0/11
Known via "bgp", distance 20, metric 0, best
Last update 00:00:00 ago
* 192.168.161.1, via enp39s0, weight 1
AS-Path: 60000 64539 15096 6939 8075
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
Just gather the opaque data into the route entry. Later
commits will display this data for end users as well as
send it down.
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
A MAC entry cannot be deleted while a neigh is referencing it. It seems
there is some race condition where this may be happening. The log is
to help identify those cases.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
It is now 4 bits of type and 28 bits of value, as sketched below -
1. type=0 is for L3 NHG
2. type=1 is for L2 NH
3. type=2 is for L2 NHG
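A self-contained sketch of that split (the macro and helper names are
assumptions, not the actual zebra ones). Note that an L2 NH with
value 5 encodes as (1 << 28) | 5 = 268435461, which matches the fdb
nexthop ids shown in the next message:

#include <stdint.h>

#define EVPN_NHG_ID_TYPE_POS    28
#define EVPN_NHG_ID_TYPE_L3     (0u << EVPN_NHG_ID_TYPE_POS)
#define EVPN_NHG_ID_TYPE_L2_NH  (1u << EVPN_NHG_ID_TYPE_POS)
#define EVPN_NHG_ID_TYPE_L2_NHG (2u << EVPN_NHG_ID_TYPE_POS)
#define EVPN_NHG_ID_VAL_MASK    ((1u << EVPN_NHG_ID_TYPE_POS) - 1)

/* build a 32-bit id from 4 bits of type and 28 bits of value */
static inline uint32_t evpn_nhg_id(uint32_t type_bits, uint32_t val)
{
        return type_bits | (val & EVPN_NHG_ID_VAL_MASK);
}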
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
This is an optimization to reduce the number of L2 nexthops. An L2
or FDB nexthop simply provides the dataplane with a nexthop IP -
torm-12:mgmt:~# ip nexthop
id 268435461 via 27.0.0.20 scope link fdb
id 268435463 via 27.0.0.20 scope link fdb
id 268435465 via 27.0.0.20 scope link fdb
So there is no need to allocate a nexthop per-ES/per-VTEP. There
can be 100+ ESs per-VTEP so this change cuts the scale down by a
factor of 100.
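A hedged sketch of the keying that makes this sharing possible (types
and function names are assumptions): the L2/FDB nexthop table is keyed
on the VTEP IP alone, so every ES behind the same VTEP resolves to a
single kernel nexthop.

/* hypothetical hash key helper for the shared L2-NH table */
static unsigned int zebra_evpn_l2_nh_hash_key(const void *p)
{
        const struct zebra_evpn_l2_nh *nh = p;

        /* key on the VTEP IP only - no ES in the key */
        return jhash_1word(nh->vtep_ip.s_addr, 0);
}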
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When a local ES flaps there are two modes in which the local
MACs are failed over -
1. Fast failover - A backup NHG (ES-peer group) is programmed in the
dataplane per-access port. When a local ES flaps the MAC entries
are left unaltered i.e. pointing to the down access port. And the
dataplane redirects traffic destined to the oper-down access port
via the backup NHG.
2. Slow failover - This mode needs to be turned on for dataplanes that
are not capable of redirecting traffic. In this mode local MAC entries
on a down local ES are re-programmed to point to the ES-peers'
NHG. And vice-versa i.e. when the ES comes up the MAC entries
are re-programmed with the access port as dest.
Fast failover is on by default. Slow failover can be enabled via the
following config -
evpn mh redirect-off
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
As a part of extended MM handling a MAC can be updated from local
to remote while still being referenced by SYNC neighs (this is really
a temporary/small window). During this window, if the MAC transitions
back to local again, we need to reinstate the previous SYNC flags
(based on the sync-neigh count), as subsequent SYNC updates to the
MAC will be de-duped and ignored.
Ticket: CM-29636
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
When a local MAC is deleted by the dataplane zebra can re-install it
if the MAC is a SYNC MAC (learned from ES peers). The "local_inactive"
bit must be set as a part of the re-install to prevent zebra from
turning around and advertising the MAC as locally active.
Also fixed up some debug logs in the slow-fail path to include the VNI.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
NHG and DST (VTEP-IP) are mutually exclusive attributes. If DST is
present the kernel ignores NHG.
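A hedged sketch of how the FDB netlink request might be built to
respect this (the wrapper names follow older FRR netlink helpers and
are assumptions here):

/* attach either the nexthop group or the remote VTEP, never both,
 * since the kernel ignores NDA_NH_ID whenever NDA_DST is present */
if (nhg_id)
        addattr32(&req.n, sizeof(req), NDA_NH_ID, nhg_id);
else
        addattr_l(&req.n, sizeof(req), NDA_DST, &vtep_ip, 4);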
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>
An NHG is activated, i.e. programmed in the dataplane, only if there
are active VTEPs associated with it. When an NHG is de-activated,
all the remote MAC entries associated with it need to be removed
before the NHG is removed.
Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>