mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-05-27 22:12:14 +00:00

Author	SHA1	Message	Date
Donald Sharp	3a15018892	zebra: Tell SA that we are intentionally ignoring the return fpm_nl_enqueue returns whether or not the data fit on the outgoing stream. Checking that value is not necessary here because the enclosing while loop has already ensured that the data being written will fit. CID -> 1499854 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-18 09:06:49 -05:00
Donald Sharp	d33da0e071	zebra: A `zebra route-map delay-timer 0` command should still run the route-map Setting `zebra route-map delay-timer 0` completely turns off any route-map processing in zebra, which is completely wrong. A timer of 0 means `do it now`. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	4dfcfabfa9	zebra: Push timer out if another route-map change comes in for zebra If we are running with a delayed timer to handle route-map changes in zebra and another route-map change is made via the cli, push out the timer instead of leaving it unmodified. This allows a large set of route-maps to be read in by the system without the timer popping in the middle of the read. Additionally convert to use THREAD_OFF, preventing a possible use after free as well as aligning the thread api usage with what we consider correct. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
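The delay-timer behaviour described in the two commits above (a delay of 0 still fires, and a new change pushes an armed timer out) can be sketched as follows. This is a minimal illustration under stated assumptions, not FRR's code: `struct rmap_timer`, `rmap_change` and `rmap_tick` are hypothetical names, and a plain integer clock stands in for the thread/timer API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for a one-shot timer; this is not FRR's thread API. */
struct rmap_timer {
	bool armed;
	uint64_t fires_at; /* absolute time, in seconds */
};

/* Called on every route-map change: (re)arm the timer `delay` seconds from
 * `now`. An already armed timer is pushed out instead of being left alone.
 * A delay of 0 still arms the timer, so it fires on the next tick. */
void rmap_change(struct rmap_timer *t, uint64_t now, uint32_t delay)
{
	t->armed = true;
	t->fires_at = now + delay;
}

/* Returns true once when the timer has expired, and disarms it so that no
 * stale timer state is left behind. */
bool rmap_tick(struct rmap_timer *t, uint64_t now)
{
	if (!t->armed || now < t->fires_at)
		return false;
	t->armed = false;
	return true;
}
```

With a 5 second delay, a change at t=100 followed by another at t=103 fires at t=108, not at t=105.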
Donald Sharp	cfcd844c0b	zebra: Limit routemap changes to reconsider only routes associated with that rm When a route map changes, the current code schedules a rerun of all routes in the particular table. So if you modify the route-map `FOO` used by `ip protocol XX route-map FOO`, all routes will be rechecked. This is extremely expensive. Modify zebra to only update the routes associated with the route-map. So if we have 800k bgp routes and 50 ospf routes and we are route-map'ing the ospf routes we'll only look at 50 routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	54aeba3540	zebra: Allow rib_update_table to receive a specified route type When we need to cause a reprocessing of data the code currently marks all routes as needing to be looked at. Modify the rib_update_table code to allow us to specify the one route type we want to reprocess. At this point none of the code is behaving differently; this is just setup for a future code change. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
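The idea of reprocessing only one route type, rather than every route in the table, can be sketched like this. The types and the function below are hypothetical illustrations (FRR's real route types are the `ZEBRA_ROUTE_*` values, and its real entry point is `rib_update_table`, whose signature is not reproduced here).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical route types; RT_ALL keeps the old "reprocess everything" mode. */
enum rtype { RT_ALL, RT_BGP, RT_OSPF };

/* Hypothetical, heavily simplified route entry. */
struct rentry {
	enum rtype type;
	bool needs_reprocess;
};

/* Mark entries for reprocessing, optionally restricted to one route type.
 * Returns how many entries were marked. */
size_t mark_for_update(struct rentry *tbl, size_t n, enum rtype only)
{
	size_t marked = 0;

	for (size_t i = 0; i < n; i++) {
		if (only != RT_ALL && tbl[i].type != only)
			continue; /* unrelated protocol: leave untouched */
		tbl[i].needs_reprocess = true;
		marked++;
	}
	return marked;
}
```

In the commit's example, passing the OSPF type would mark 50 entries and skip the 800k BGP ones.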
Donald Sharp	1866a6f65b	zebra: remove unused function rib_update_vrf The function rib_update_vrf is never used. Remove it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	3d34678f1d	doc: Document the "zebra route-map delay-timer" functionality Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Duncan Eastoe	869a5f7168	zebra: set nlmsg_pid in netlink msgs sent by 'fpm' Use nl_pid from the netlink socket used for programming the kernel (netlink_dplane) in netlink route messages sent by the 'fpm' module. This makes 'fpm' consistent with 'dplane_fpm_nl' which already behaves this way, and allows FPM server implementations to determine route origin via nlmsg_pid. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2021-01-15 16:28:06 +00:00
Donald Sharp	f7f52f0d2b	Merge pull request #7868 from mjstapp/fix_fpm_conn_up zebra: don't set connection-up event pointer directly	2021-01-15 06:55:29 -05:00
Mark Stapp	9fad1340d4	Merge pull request #7866 from kishorekunal01/fpm_dump_issue zebra: Scale setup RMAC is sent multiple times to fpm	2021-01-14 14:13:31 -05:00
Mark Stapp	ef1dbba83a	zebra: don't set connection-up event pointer directly Use thread_cancel to reset the connection-up processing timer. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-14 14:09:14 -05:00
Kishore Kunal	e840edcacb	zebra: Scale setup RMAC is sent multiple times to fpm Thread zfpm_conn_up_thread_cb can yield and send RMAC multiple times to FPM. Signed-off-by: Kishore Kunal <kishorekunal01@broadcom.com>	2021-01-14 15:53:52 +00:00
Donald Sharp	700cae7698	zebra: in zebra_evpn_mac.c use size_t for buffer length Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 13:22:29 -05:00
Donald Sharp	b16e800423	zebra: Create a dump function for mac->flags and use it Create a function that can dump the mac->flags in human readable output and convert all debugs to use it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 13:22:29 -05:00
Donald Sharp	bf902d4c52	zebra: Create function to dump MACIP flags Create a function to dump MACIP flags and to use it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 13:22:27 -05:00
Mark Stapp	d7ceaa8f5a	Merge pull request #7819 from donaldsharp/more_data_for_debug_dumps zebra: Add ability to display human readable format re->flags and status	2021-01-13 13:06:23 -05:00
Mark Stapp	3c57be5936	Merge pull request #7818 from donaldsharp/ip_proto_denied zebra: notify installing protocol when nexthops cannot be resolved	2021-01-13 10:33:33 -05:00
Donald Sharp	61e6de9d57	zebra: Add ability to display in human readable format re->flags and status The re->flags and re->status in debugs were being dumped as hex values. I can never quickly decode this. Here is an idea. Let's let FRR do it for me. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 10:16:06 -05:00
Donald Sharp	1afacb94e6	Merge pull request #6853 from mjstapp/fix_rib_dups zebra: reduce impact of route-update overload	2021-01-13 09:42:34 -05:00
Donald Sharp	7874422ad2	Merge pull request #7850 from mjstapp/build_dplane_plugin zebra: build the sample dataplane plugin	2021-01-12 08:43:53 -05:00
Mark Stapp	b9f15b49b2	zebra: add the sample dataplane plugin to the build Build the sample dataplane plugin with debug/dev builds. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-11 16:33:55 -05:00
Mark Stapp	fb913e53a5	zebra: remove unused local in dplane sample plugin Remove an unused local in the sample dataplane plugin. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-11 16:33:27 -05:00
Donald Sharp	7e010c4b78	zebra: notify installing protocol when nexthops cannot be resolved In the case where a route's nexthops cannot be resolved as part of route processing, immediately notify the upper level protocol that its routes failed to install, if it is interested in being informed about this issue. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-11 10:11:35 -05:00
Donatas Abraitis	88ffa95dc3	Merge pull request #7823 from donaldsharp/zebra_delay_timer Zebra delay timer	2021-01-11 16:46:23 +02:00
Donald Sharp	f10f8f0e98	Merge pull request #7652 from adharkar/frr-vni_switch zebra: L3VNI to L2VNI conversion is not handled	2021-01-10 18:44:49 -05:00
Donald Sharp	7df0e6bb3b	Merge pull request #7756 from pjdruddy/bgplu-fixes Bgplu fixes	2021-01-09 15:48:22 -05:00
Donald Sharp	24420c8200	Merge pull request #7787 from deastoe/fpm-work-ready-fixes dplane_fpm_nl: routes stuck with 'q' flag (revisited)	2021-01-09 15:38:46 -05:00
Donald Sharp	9df81095f8	zebra: zebra route-map delay-timer is global not per vrf The zebra route-map delay timer value is a global value, not a per vrf one. As such we should only print it out one time. With 2 vrf's configured we are seeing this: `zebra route-map delay-timer 33` / `exit-vrf` / `zebra route-map delay-timer 33`. Fix the code to only write it out for the default vrf. Ticket: CM-32888 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-08 22:34:41 -05:00
Donald Sharp	c70e585e05	zebra: Remove uncalled function Remove the dead function zebra_route_map_write_delay_timer Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-08 22:34:41 -05:00
Renato Westphal	dc70c83afa	Merge pull request #7816 from pjdruddy/revert_labelmanager_statics Revert labelmanager statics	2021-01-08 20:57:25 -03:00
Mark Stapp	6b66913275	Merge pull request #7762 from sworleys/PBR-Ipv4/Ipv6-Match-Fixes pbrd: pbr ipv4/ipv6 match fixes	2021-01-05 13:54:06 -05:00
Pat Ruddy	507d2737d6	zebra: expose label-manager util-funcs Revert "zebra: unexpose label-manager util-funcs as static" This reverts commit `d3d9639d9a`. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-05 18:19:44 +00:00
Patrick Ruddy	b567ed7eeb	Merge pull request #7722 from AnuradhaKaruppiah/mh-fixes bgpd, zebra: evpn mh fixes	2021-01-05 09:26:17 +00:00
Pat Ruddy	189982283a	zebra: labelmanager could return reserved labels When checking if there is a "hole" behind the current reservation marker, the calculation of whether the hole is big enough to satisfy the requested chunk is off by 1. This could result in returning a label which has already been allocated. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-04 14:29:44 +00:00
Pat Ruddy	3c84497943	zebra: label manager should never return a reserved block If the requested chunk size was less than 16 then a chunk within the reserved block would be returned. Make sure that we never return labels that are below MPLS_LABEL_UNRESERVED_MIN. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-04 14:29:44 +00:00
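Both label-manager fixes above come down to range arithmetic. The sketch below is an illustration only: `place_in_gap` is a hypothetical helper, not a function from zebra/label_manager.c, and the constant 16 is an assumption taken from the "less than 16" wording of the commit message.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumption mirroring MPLS_LABEL_UNRESERVED_MIN: labels 0-15 are reserved. */
#define LABEL_UNRESERVED_MIN 16u

/* Try to place `size` labels in the free gap [gap_first, gap_last], both ends
 * inclusive. On success store the chunk start in *start and return true. */
bool place_in_gap(uint32_t gap_first, uint32_t gap_last, uint32_t size,
		  uint32_t *start)
{
	/* Never hand out anything from the reserved block. */
	if (gap_first < LABEL_UNRESERVED_MIN)
		gap_first = LABEL_UNRESERVED_MIN;
	if (size == 0 || gap_last < gap_first)
		return false;
	/* An inclusive range holds (last - first + 1) labels. Getting this
	 * "+ 1" wrong is the kind of off-by-one that lets a chunk overlap
	 * a label that is already allocated. */
	if (gap_last - gap_first + 1 < size)
		return false;
	*start = gap_first;
	return true;
}
```

A gap of labels 100 through 109 holds exactly 10 labels, so a request for 10 fits and a request for 11 does not.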
Quentin Young	19ff5340a1	Merge pull request #7777 from volta-networks/fix_zebra_rib_c++ zebra: avoid c++ reserved keyword	2020-12-29 11:07:12 -05:00
Stephen Worley	a4525d25b5	Merge pull request #7788 from deastoe/zebra2proto-kernel-connect zebra: zebra2proto() handle kernel/connect type	2020-12-28 14:57:41 -05:00
Mark Stapp	7c08b70a53	Merge pull request #7724 from donaldsharp/pbr_zebra_was_wrong Pbr zebra was wrong	2020-12-23 13:34:18 -05:00
Duncan Eastoe	911d4d4804	zebra: zebra2proto() handle kernel/connect type When dplane_fpm_nl is used the "Please add this protocol(n) to proper rt_netlink.c handling" debug message is emitted for any route of type kernel or connected. This severely reduces performance of dplane_fpm_nl when large numbers of these routes are present in the RIB. The messages are not observed when using the original fpm module since this uses a custom function, netlink_proto_from_route_type(). zebra2proto() now returns RTPROT_KERNEL for ZEBRA_ROUTE_CONNECT and ZEBRA_ROUTE_KERNEL. This should only impact dplane_fpm_nl's use of the common netlink routines since these routes are generally ignored via checking of RSYSTEM_ROUTE(). Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-22 21:27:52 +00:00
Duncan Eastoe	b677907c99	zebra: fpm_nl_process() reschedule dp thread fpm_nl_process() now ensures that the dataplane thread is rescheduled if it hits the work limit while processing its incoming work queue. This would probably already occur due to some other event, such as fpm_process_queue() enqueuing completed work to the output queue, however it does no harm to add this explicit reschedule. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-22 21:14:03 +00:00
Duncan Eastoe	f1595ce439	zebra: resched dp thread if output queue limit hit If the dataplane thread hits the work limit while processing the output queue for any given provider, we now explicitly reschedule the thread. Otherwise, if the number of items in the output queue is greater than the work limit, draining of that output queue is dependent on new dataplane work. Routes which are not drained from the output queue are stuck with the 'q' flag, so this is a similar issue to that observed in `164d8e8608`. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-22 21:14:03 +00:00
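The "reschedule when the work limit is hit" pattern from the two commits above can be sketched as a bounded drain loop. The queue type and function here are hypothetical simplifications (the queue is reduced to a counter), and `limit` is assumed to be greater than zero.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical output queue, reduced to a count of completed contexts. */
struct out_queue {
	size_t pending;
};

/* Drain at most `limit` items. Returns true when the caller should
 * reschedule itself because it stopped at the limit rather than on an
 * empty queue. If the queue happened to empty exactly at the limit this
 * asks for one extra, harmless pass. */
bool drain_output(struct out_queue *q, size_t limit, size_t *drained)
{
	size_t n = 0;

	while (q->pending > 0 && n < limit) {
		q->pending--; /* stand-in for dequeue + result handling */
		n++;
	}
	*drained = n;
	return n >= limit;
}
```

Without the explicit reschedule, anything left beyond the limit would wait for unrelated new work to wake the thread, which is how routes end up stuck with the 'q' flag.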
Rafael Zalamena	fb1e954880	Merge pull request #7767 from mjstapp/fix_dplane_extra_info zebra: fix loop logic in dplane for extra intf info	2020-12-22 15:08:35 -03:00
Mark Stapp	700ff41ed3	Merge pull request #7472 from opensourcerouting/fpm-fixes fpm: frr-reload, IPv6 and an improvement	2020-12-22 11:37:58 -05:00
Anuradha Karuppiah	0b05c9bbe1	zebra: skip EVI setup if an ES is applied to a pseudo interface zebra maintains a pseudo interface for hanging off user config after the interface is deleted in the kernel. If a user tried to configure an ES against such an interface zebra would crash with the following call stack - at zebra/zebra_evpn_mh.c:2095 sysmac=sysmac@entry=0x55cfbadd3160) at zebra/zebra_evpn_mh.c:2258 at zebra/zebra_evpn_mh.c:3222 argv=<optimized out>, es_lid_str=<optimized out>, es_lid=1, no=0x0, vty=0x55cfbaf4c7b0) at zebra/zebra_evpn_mh.c:3222 argv=<optimized out>) at ./zebra/zebra_evpn_mh_clippy.c:202 vty=vty@entry=0x55cfbaf4c7b0, cmd=cmd@entry=0x0, filter=FILTER_RELAXED) at lib/command.c:1073 Ticket: CM-31702 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	16de1338a9	zebra: accept bgp remote mac-ip update if the higher-seq-local mac is not bgp-ready If a local-MAC or local-neigh is not active locally it is not sent to BGP. At this point if BGP rxes a remote route it accepts it and installs it in zebra. Zebra was rejecting BGP's update if it had a higher seq local (inactive) entry. This would result in bgp and zebra falling out of sync. In some cases zebra would delete the local-inactive entries after some time (as a part of the dplane/kernel garbage collection). This would leave zebra with missing remote entries (which were still present in bgpd). This change allows lower-seq BGP updates to overwrite zebra's local entry if that entry happens to be local-inactive. Note: This logic was already in use for sync-mac-ip updates. Extended the same logic to remote-mac-ip updates. Ticket: CM-31626 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
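The acceptance rule described above can be written as a small predicate. This is a sketch under assumptions: `accept_remote_update` is a hypothetical name, sequence-number wrap is ignored, and treating equal sequence numbers as acceptable is a guess, since the commit message only covers the lower-sequence case.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Decide whether a remote MAC-IP update from BGP may overwrite the local
 * entry. `local_active` is false for a local-inactive entry, i.e. one that
 * was never advertised to BGP. */
bool accept_remote_update(uint32_t remote_seq, uint32_t local_seq,
			  bool local_active)
{
	/* A local-inactive entry never wins, whatever its sequence number. */
	if (!local_active)
		return true;
	/* Otherwise a lower remote sequence number is ignored. Accepting
	 * the equal case is an assumption of this sketch. */
	return remote_seq >= local_seq;
}
```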
Anuradha Karuppiah	963b0c55fd	zebra: clean zevpn references in the access bd database when the VNI is deleted When a VNI was deleted as a part of FRR/zebra shutdown the zevpn entry was being freed without removing its reference in the access vlan entry (i.e. without clearing the VLAN->VNI mapping) used by MH. Ticket: CM-31197 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	7c0e4dc659	zebra: reinstall missing peer-sync flag If a netlink/dp notification is rxed for a neigh without the peer-sync flag FRR re-installs the entry with the right flags. This change is needed to handle cases where the dataplane and FRR may fall out of sync because of neigh learning on the network ports (i.e. via the VxLAN). Ticket: CM-30693 The problem was found during VM mobility "torture" tests where 100s of extended VM moves were done. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	2c89cb9017	zebra: changes to log ext_flags in neigh nl add Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	c1735c08c9	zebra: fix a problem with local MAC pointing to a remote ES If a remote MAC update is rxed from BGP with a lower sequence number than the local one zebra ignores the MAC update. This typically happens if there is a race condition (where updates are in flight from zebra to BGP). There was a bug in zebra because of which the dest ES was being updated before this check. This left the local MAC pointing to a remote ES. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Relevant Dumps: =============== root@leaf21:mgmt:~# net show evpn mac vni 101101 mac 00:93:00:00:00:01 MAC: 00:93:00:00:00:01 ESI: 03:00:00:00:77:01:03:00:00:0d Intf: - VLAN: 101 Sync-info: neigh#: 1 peer-proxy Local Seq: 3 Remote Seq: 0 Neighbors: 21.1.13.1 Active root@leaf21:mgmt:~# net sho evpn es Type: L local, R remote, N non-DF ESI Type ES-IF VTEPs 03:00:00:00:77:01:02:00:00:0c R - 6.0.0.10,6.0.0.11 03:00:00:00:77:01:03:00:00:0d R - 6.0.0.10,6.0.0.11,6.0.0.12 03:00:00:00:77:01:04:00:00:0e R - 6.0.0.10,6.0.0.11,6.0.0.12,6.0.0.13 03:00:00:00:77:02:02:00:00:16 LR bondP2-H2 6.0.0.15 03:00:00:00:77:02:03:00:00:17 LR bondP2-H3 6.0.0.15,6.0.0.16 03:00:00:00:77:02:04:00:00:18 LR bondP2-H4 6.0.0.15,6.0.0.16,6.0.0.17 root@leaf21:mgmt:~# Relevant logs: =============== 2020/07/29 15:41:27.110846 ZEBRA: Recv MACIP ADD VNI 101101 MAC 00:93:00:00:00:01 IP 21.1.13.1 flags 0x0 seq 2 VTEP 0.0.0.0 ESI 03:00:00:00:77:01:03:00:00:0d from bgp 2020/07/29 15:41:27.110867 ZEBRA: Ignore remote MACIP ADD VNI 101101 MAC 00:93:00:00:00:01 IP 21.1.13.1 as existing MAC has higher seq 3 flags 0x401 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Ticket: CM-30273 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	c7bfd08568	zebra: advertise stale neighs if EVPN-MH is not enabled With EVPN-MH, Type-2 routes are also used for MAC-IP syncing between ES peers so a change was done to only treat REACHABLE local neigh entries as local-active and advertise them as Type-2 routes i.e. STALE neigh entries are no longer advertised as Type-2s. This however exposed some unexpected problems with MLAG where a secondary reboot followed by a primary reboot left a lot of neighs in STALE state (on the primary) resulting in them not being advertised, and in remote routed traffic to those hosts being blackholed in a sym-IRB setup. This commit is a workaround to fix the regression (it doesn't fix the underlying problems with entries not becoming REACHABLE, which may be a day-1 problem). The workaround is to continue advertising STALE neighbors if EVPN-MH is not enabled. Ticket: CM-30303 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:15 -08:00
Anuradha Karuppiah	362c8f2d73	zebra: handle "show evpn es-evi" for a non-existent VNI zebra was crashing when the command was run on a non-existent VNI. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 VNI 16777215 doesn't exist root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 detail VNI 16777215 doesn't exist root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 json [ ] root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 detail json [ ] root@torm-12:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Ticket: CM-30232 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:40:07 -08:00
Emanuele Di Pascale	2e8db20d7e	zebra: avoid c++ reserved keyword In rib_handle_nhg_replace, do not use `new` as a parameter name, to allow compilation of c++ code including zebra headers. Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>	2020-12-21 14:34:55 +01:00
Mark Stapp	b364e87d56	zebra: fix loop logic in dplane for extra intf info The way a couple of clauses were placed in a loop meant that some info might not be collected - re-order things just a bit. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-18 13:49:07 -05:00
Stephen Worley	e36ea40d3b	zebra: derive rule family from src->dst->ipv4 Derive the rule family from src if available, otherwise dst if available, otherwise assume ipv4. We only support ipv4/ipv6 currently, so if we can't tell from the src/dst it must be ipv4 and likely a dsfield match. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2020-12-18 11:53:18 -05:00
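The src, then dst, then ipv4 fallback order above can be sketched as follows. `struct match_prefix` and `rule_family` are hypothetical names for illustration; only the `AF_*` constants come from the standard socket headers.

```c
#include <assert.h>
#include <stddef.h>
#include <sys/socket.h>

/* Hypothetical match prefix: family is AF_UNSPEC when the match is unset. */
struct match_prefix {
	int family;
};

/* Pick the address family for a rule: src wins, then dst, else IPv4. */
int rule_family(const struct match_prefix *src, const struct match_prefix *dst)
{
	if (src && src->family != AF_UNSPEC)
		return src->family;
	if (dst && dst->family != AF_UNSPEC)
		return dst->family;
	/* Neither side tells us, e.g. a dsfield-only match: assume IPv4. */
	return AF_INET;
}
```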
Duncan Eastoe	438dd3e7df	zebra: reduce atomic ops in fpm_process_queue() Maintain the count of contexts which have been processed in a local variable, and perform a single atomic update after we have consumed all queued contexts. Generally this results in at least one less atomic operation per context. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:37:13 +00:00
Duncan Eastoe	3f2b998f61	zebra: local var in fpm_process_queue() sched cond Don't use an atomic operation to determine whether fpm_process_queue() needs to be re-scheduled. Instead we can simply use a local variable to determine if we stopped processing because we ran out of buffers. In the case where we would have re-scheduled due to new context objects in the queue (enqueued after we stopped processing), fpm_nl_process() will schedule us (or will have done already). Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:36:39 +00:00
Duncan Eastoe	bf2f783945	zebra: reduce atomic ops in fpm_nl_process() Maintain the peak ctxqueue length in a local variable, and perform a single atomic update after processing all contexts. Generally this results in at least one less atomic operation per context. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:36:38 +00:00
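The "count locally, publish once" technique used in the atomic-op reduction commits above can be sketched with C11 atomics. The function names are hypothetical and the per-context work is elided; relaxed ordering is assumed to be sufficient because these are statistics counters only.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

/* Process `n` contexts, counting in a local variable and publishing the
 * total with a single atomic add instead of one add per context. */
void process_batch(atomic_uint_fast64_t *shared_total, size_t n)
{
	uint_fast64_t done = 0;

	for (size_t i = 0; i < n; i++) {
		/* ... per-context work would go here ... */
		done++;
	}
	if (done > 0)
		atomic_fetch_add_explicit(shared_total, done,
					  memory_order_relaxed);
}

/* Raise *peak to `candidate` if it is larger. Called once per batch with
 * the largest queue length seen locally during that batch. */
void update_peak(atomic_uint_fast64_t *peak, uint_fast64_t candidate)
{
	uint_fast64_t cur = atomic_load_explicit(peak, memory_order_relaxed);

	while (cur < candidate &&
	       !atomic_compare_exchange_weak_explicit(peak, &cur, candidate,
						       memory_order_relaxed,
						       memory_order_relaxed))
		; /* cur was refreshed by the failed exchange; retry */
}
```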
Duncan Eastoe	dc693fe057	zebra: reduce dplane_fpm_nl ctxqueue_mutex contention Reduce code in the critical sections of fpm_nl_process() and fpm_process_queue() to the bare minimum - basically only enqueue and dequeue operations on the shared ctxqueue. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:33:46 +00:00
Mark Stapp	86723fe89b	zebra: nht resolve-via-default doesn't need force We don't need to use the 'force' flag when processing the resolve-via-default clis for ip and ipv6: we can just do normal nht processing. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-17 11:22:09 -05:00
Ameya Dharkar	3b0a590bf3	zebra: L3VNI to L2VNI conversion is not handled After removal of L3VNI config, the VNI should become an L2VNI if a VxLAN interface is present for the VNI. This case is not handled in the code. Changes: 1. After unconfiguring L3VNI, create an L2VNI if VxLAN interface is present for the VNI. 2. Trigger an update to BGP. 3. Read MAC and ARP entries from kernel. This PR fixes the issue only for route type-2, 3 and 5. This PR does not address states regarding route type-1, 4 and multicast group for VxLAN interface. Signed-off-by: Ameya Dharkar <adharkar@vmware.com>	2020-12-16 18:06:37 -08:00
Anuradha Karuppiah	35f5c31b0e	zebra: add support for DF delay timer When a new ES is created it is held in a non-DF state for 3 seconds as specified by RFC7432. This allows the switch time to import the Type-4 routes from the peers. And the peers time to rx the new Type-4 route. root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"\|grep DF DF status: non-df DF delay: 00:00:01 DF preference: 50000 root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"\|grep DF DF status: df DF preference: 50000 root@torm-11:mgmt:~# Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-15 10:03:50 -08:00
Anuradha Karuppiah	0109f42f86	zebra: display DF status only for local ESs For remote ESs it is not relevant and confuses the admin. Local ES sample - =============== root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01" ESI: 03:44:38:39:ff:ff:01:00:00:01 Type: Local,Remote Interface: hostbond1 State: up Bridge port: yes Ready for BGP: yes VNI Count: 10 MAC Count: 3 DF: status: df preference: 50000 >>>>>>>>>>>>>>> Nexthop group: 536870913 VTEPs: 27.0.0.16 df_alg: preference df_pref: 32767 nh: 268435465 27.0.0.17 df_alg: preference df_pref: 32767 nh: 268435466 root@torm-11:mgmt:~# Remote ES sample - =============== root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:02:00:00:01" ESI: 03:44:38:39:ff:ff:02:00:00:01 Type: Remote Interface: - Ready for BGP: no VNI Count: 0 MAC Count: 6 DF: status: - preference: 0 >>>>>>>>>>>>>>> Nexthop group: 536870919 VTEPs: 27.0.0.18 nh: 268435464 27.0.0.19 nh: 268435467 27.0.0.20 nh: 268435461 root@torm-11:mgmt:~# Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-15 10:02:03 -08:00
Patrick Ruddy	a119a429e4	Merge pull request #7637 from AnuradhaKaruppiah/evpn-pim-fixes evpn-pim: cleanup and display fixes	2020-12-15 17:36:24 +00:00
Patrick Ruddy	bedf36e327	Merge pull request #7636 from AnuradhaKaruppiah/type-0-esi zebra: support for type-0 ESI	2020-12-15 17:33:46 +00:00
Patrick Ruddy	01c65ba77e	Merge pull request #7633 from AnuradhaKaruppiah/protodown-fixes evpn-mh: protodown handling fixes	2020-12-15 17:23:32 +00:00
Russ White	930c9b7be8	Merge pull request #7736 from ton31337/fix/s_addr_INADDR_ANY *: Replace s_addr check against 0 with INADDR_ANY	2020-12-15 07:12:49 -05:00
Donatas Abraitis	3a6290bdd1	*: Replace s_addr check against 0 with INADDR_ANY Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-12-14 21:03:38 +02:00
Stephen Worley	3bece1e0e3	Merge pull request #7162 from opensourcerouting/zebra-human-netlink zebra: human readable netlink dumps	2020-12-14 14:03:35 -05:00
Anuradha Karuppiah	dc261b8de4	zebra: restart start-up delay timer when the first uplink comes up When all the uplinks go down the VTEP is disconnected from the VxLAN overlay and this was handled by proto-downing the ES bonds. When the uplinks come up again we need to re-enable the ES bonds but that needs to be done after a delay to allow the EVPN network to converge. And that is done by firing off the startup-delay timer on first uplink-up. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:41 -08:00
Anuradha Karuppiah	2bcf92e18b	zebra: re-sync protodown state with the dplane on new ES add 1. When a bond is associated with an ES we may need to re-sync the dplane protodown state (which may be stale/set by some other app). 2. Also change the uplink state display to avoid confusion with protodown reason code (both used to show uplink-up). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:40 -08:00
Anuradha Karuppiah	26ba45e33d	zebra: update protodown display protodown state is a combination of the dplane and zebra states. protodown reason is maintained exclusively by zebra. Display this information on two separate lines to make that ownership clearer. Also display n/a for bonds as the dplane doesn't support protodowning the bond device. Sample output - ============== root@torm-11:mgmt:~# vtysh -c "show interface hostbond1"\|grep -i protodown protodown: off (n/a) protodown reasons: (uplinks-down) root@torm-11:mgmt:~# vtysh -c "show interface swp5"\|grep -i protodown protodown: on protodown reasons: (uplinks-down) root@torm-11:mgmt:~# PS: Cosmetic changes only, no functional change. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:40 -08:00
Anuradha Karuppiah	5c84327054	zebra: re-sync protodown state when a port/mbr is linked to an ES-bond The code for this was already there but was not kicking in because of a zebra local reason-code dup check. Even if the reason-code is the same, if the dplane and zebra disagree about the protodown state zebra will need to re-program the dplane. Fixed a couple of spelling errors in the protodown logs to make greps easy. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:40 -08:00
Donatas Abraitis	219218d964	Merge pull request #7664 from donaldsharp/global_bgp_wait Global bgp wait	2020-12-14 10:28:02 +02:00
Donald Sharp	3ceae22b7f	Revert "zebra: When shutting down an interface immediately notify about rnh" This reverts commit `0aaa722883`.	2020-12-11 20:45:43 -05:00
Nikolay Aleksandrov	4bcdb6086c	zebra: move from NDA_NOTIFY to NDA_FDB_EXT_ATTRS Use the new nested NDA_FDB_EXT_ATTRS attribute to control per-fdb notifications. PS: The attributes were updated as a part of the kernel upstreaming, hence the change. Signed-off-by: Nikolay Aleksandrov <nikolay@cumulusnetworks.com> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-11 12:13:36 -08:00
Duncan Eastoe	164d8e8608	zebra: routes stuck with 'q' when using dplane FPM New work enqueued to the dplane_fpm_nl provider is initially de-queued and re-enqueued, in fpm_nl_process(), to be processed by the provider's own thread. After performing this initial de-queue/enqueue we return to dplane_thread_loop() and check the dplane_fpm_nl output queue for any work which has been completed. Since this work is being processed in another thread it is very likely that there will be some (or all) work still outstanding at this point. The dataplane thread finishes up any other tasks and then waits until it is next scheduled. In the meantime the dplane_fpm_nl thread is processing its work queue until completion. The issue arises here as the dataplane thread is not explicitly re-scheduled once dplane_fpm_nl has drained its work queue and populated its output queue with completed work. This completed work can sit in the output queue for an indeterminate period of time, depending upon when the dataplane thread is next scheduled for other work. If the RIB has reached a stable state then this could be a significant period of time. During this period zebra marks these routes as queued, even though they have actually been processed by all dataplane providers. An un-related RIB change which triggers a FIB update will result in the dataplane thread being scheduled and this completed work then being processed. At this point the routes will then no longer be marked as queued by zebra. However this new FIB update might itself then fall victim to the same scenario! We can observe the above behaviour in these detailed dplane logs. 
11:24:47 zebra[7282]: dplane: incoming new work counter: 2 11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel' 11:24:47 zebra[7282]: dplane provider 'Kernel': processing 11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9 11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9 11:24:47 zebra[7282]: dplane dequeues 2 completed work from provider Kernel 11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl' 11:24:47 zebra[7282]: dplane dequeues 1 completed work from provider dplane_fpm_nl 11:24:47 zebra[7282]: dplane has 1 completed, 0 errors, for zebra main 2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good. 1 completed context was de-queued, so there is outstanding work. 11:24:58 zebra[7282]: dplane: incoming new work counter: 2 11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel' 11:24:58 zebra[7282]: dplane provider 'Kernel': processing 11:24:58 zebra[7282]: ID (193) Dplane nexthop update ctx 0x55c429b6fed0 op NH_INSTALL 11:24:58 zebra[7282]: 0:5.5.5.5/32 Dplane route update ctx 0x55c429b79690 op ROUTE_INSTALL 11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider Kernel 11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl' 11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl 11:24:58 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main A further 2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good. 2 completed contexts were de-queued, which sounds good as that is what we en-queued. However, there is an outstanding context from earlier, so there is still outstanding work. Indeed the new 5.5.5.5/32 route is marked as queued: O>q 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:01:19 This remains the case until we trigger a FIB update by installation of the (eg.) 
10.10.10.10/32 route: 11:26:41 zebra[7282]: dplane: incoming new work counter: 2 11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel' 11:26:41 zebra[7282]: dplane provider 'Kernel': processing 11:26:41 zebra[7282]: ID (195) Dplane nexthop update ctx 0x55c429b78ce0 op NH_INSTALL 11:26:41 zebra[7282]: 0:10.10.10.10/32 Dplane route update ctx 0x55c429b7a040 op ROUTE_INSTALL 11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider Kernel 11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl' 11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl 11:26:41 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main 11:26:41 zebra[7282]: zebra2proto: Please add this protocol(2) to proper rt_netlink.c handling 11:26:41 zebra[7282]: Nexthop dplane ctx 0x55c429b6fed0, op NH_INSTALL, nexthop ID (193), result SUCCESS 11:26:41 zebra[7282]: default(0:254):5.5.5.5/32 Processing dplane result ctx 0x55c429b79690, op ROUTE_INSTALL result SUCCESS We observe the same 2 enqueues and 2 dequeues as before, which again suggests that there is outstanding work. As expected, the 5.5.5.5/32 route is no longer marked as queued: O>* 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:02:06 But the 10.10.10.10/32 route is, as we have not yet processed the completed context: C>q 10.10.10.10/32 is directly connected, lo, 00:26:05 Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-11 15:04:15 +00:00
Duncan Eastoe	53706b4e51	zebra: dplane API to get provider output q length Returns the current number of (completed) contexts in the provider's output queue (dp_ctx_out_q), allowing access to this data from the provider itself. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-11 15:04:11 +00:00
Duncan Eastoe	7545bda0a4	dplane_fpm_nl: queue peak counter never increments The context queue length peak counter is always set to its current value, hence never increments. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-11 12:09:56 +00:00
Donald Sharp	7ed5844bef	zebra: Allow `show zebra client` to give clues about route update status When entering `show zebra client` allow the display of the client->notify_status for route updates. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-10 12:59:14 -05:00
Russ White	101ad544fa	Merge pull request #7678 from donaldsharp/aspath_to_zebra Aspath to zebra	2020-12-10 10:38:14 -05:00
Donald Sharp	b2c7cf18b2	Merge pull request #7706 from slankdev/slankdev-unexpose-lm-func-1 zebra: unexpose label-manager util-funcs as static	2020-12-10 07:43:02 -05:00
Rafael Zalamena	0c7e0f2f70	Merge pull request #7697 from pguibert6WIND/zebra_crash_startup_zns zebra: anticipate zns creation at vrf creation when backend is vrf-lite	2020-12-10 09:10:34 -03:00
Donatas Abraitis	82b773e63b	Merge pull request #7524 from donaldsharp/zebra_route_map_tighten zebra: deny when route map is specified but does not exist yet	2020-12-10 11:01:25 +02:00
Hiroki Shirokura	d3d9639d9a	zebra: unexpose label-manager util-funcs as static The following functions, which are part of the label-manager implementation, aren't called from outside their file, and all label-manager code currently lives in zebra/label_manager.c. So these functions should be unexposed. Functions: - create_label_chunk - assign_label_chunk - delete_label_chunk - release_label_chunk Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2020-12-10 09:56:55 +09:00
Philippe Guibert	91b1421e84	zebra: anticipate zns creation at vrf creation when backend is vrf-lite In the case where the namespace pointer is already available, feed it at vrf creation. This prevents a crash if netlink parsing has already begun and vrf-lite is not enabled yet. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-12-09 13:26:20 +00:00
Mark Stapp	e386d2b154	Merge pull request #7690 from donaldsharp/nht_show_is_not_not_not zebra, tests: Fix `show ip nht`	2020-12-09 07:58:37 -05:00
Hiroki Shirokura	732d22cbf2	zebra: use zserv_send_message instead of writen The following functions use writen to dispatch messages into the socket, while other functions use zserv_send_message. This commit makes a small unification of zapi's socket messaging. Funcs: - zsend_assign_label_chunk_response() - zsend_label_manager_connect_response() Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2020-12-09 17:17:21 +09:00
Donald Sharp	dda33b6e0c	zebra, tests: Fix `show ip nht` The `show ip nht` and `show ipv6 nht` commands were broken. This is because recent code commit: `0154d8ce45` assumed that p must not be NULL and this is not the case. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 15:50:46 -05:00
Donald Sharp	e46723a50e	bgpd, zebra: Add ability for bgp to send AS-Path information to zebra Add a bit of code to allow bgp to send the AS-Path associated with the route being installed to zebra so it can be displayed and used as part of the `show ip route A` command in zebra. eva# show ip route 20.0.0.0/11 Routing entry for 20.0.0.0/11 Known via "bgp", distance 20, metric 0, best Last update 00:00:00 ago * 192.168.161.1, via enp39s0, weight 1 AS-Path: 60000 64539 15096 6939 8075 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:07:21 -05:00
Donald Sharp	cfa2a35d8d	sharpd, zebra: Pass and display opaque data as PoC Pass data from sharpd to zebra as opaque data and display it as part of the detailed route data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:09 -05:00
Donald Sharp	80a6ee90c3	zebra: Setup structure for opaque data to be displayed Setup the output mechanism for opaque data to be displayed to the end operator. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:08 -05:00
Donald Sharp	a29a60016e	zebra: Gather opaque data into the route entry for storage Just gather the opaque data into the route entry. Later commits will display this data for end users as well as to send it down. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:08 -05:00
Donald Sharp	aab4eca1c0	lib, zebra: Fix overlapping message types We had duplicate message id's. Shit's broke yo. Fix. I have no idea how this properly worked. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:08 -05:00
Karen Schoener	581e797e02	zebra: Adding zapi client close notification When zebra detects a client close, send a zapi client close notification. Signed-off-by: Karen Schoener <karen@voltanet.io>	2020-12-07 18:22:36 -05:00
Mark Stapp	a88a7c8d43	zebra: improve dataplane plugin queue counters Add the current queue depths for each plugin to the 'show dplane providers' output. Maintain the out-bound queue max counter properly, that was being ignored. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-07 13:54:08 -05:00
Mark Stapp	0ca6f3b1e6	zebra: remove useless deleted route_entries promptly Zebra accumulates route-entry objects and then processes them as a group. If that rib processing is delayed, because the dataplane/fib programming has built up a queue e.g., zebra can hold multiple deleted route objects in memory. At scale, this can be a problem. Delete unneeded route entries promptly, if they can't contribute to rib processing. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-07 13:54:08 -05:00
Patrick Ruddy	dd662ca570	Merge pull request #7399 from AnuradhaKaruppiah/mh-mac-ecmp-fixes evpn-mh: miscellaneous fixes in MAC-sync and MAC-ECMP handling	2020-12-03 16:27:49 +00:00
Rafael Zalamena	f584de526d	fpm: reset/walk data structures on connection Don't attempt to walk data structures while not connected so we can save some CPU usage when FPM server is offline. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-12-03 07:30:23 -03:00
Rafael Zalamena	1f9193c1f0	fpm: simplify reset logic Instead of checking for next group reset, always do it and skip sending if next hop group support is disabled. Also remove unused `*_complete` variables. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-12-03 07:30:23 -03:00
Rafael Zalamena	a3adec468e	zebra,fpm: fix configuration display Use `pI4` and `pI6` to format addresses and fix a bug when displaying IPv6 addresses. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-12-03 07:30:23 -03:00


4314 Commits