mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-12-06 16:03:23 +00:00

Author	SHA1	Message	Date
Donald Sharp	5c18e66208	zebra: Prevent uninstall attempts when new entry is not happy In rib_process_update_fib, the function is sent two route entries the old ( previously installed ) and new ( the one to install ) When the function detects that the new is unusable because the number of nexthops that are usable for that route is 0, then we uninstall the old route. The problem here is that we should not attempt to uninstall any route that is not owned by FRR. Modify the code to not attempt this behavior Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-30 17:26:44 -04:00
Quentin Young	fb3bc7a74b	Merge pull request #7215 from mjstapp/fix_z_mlag_read zebra: don't touch mlag read event pointer	2020-09-30 16:27:01 -04:00
Mark Stapp	f5d8487244	zebra: don't touch mlag read event pointer Don't touch the mlag read event pointer, it's not safe. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-09-30 13:24:54 -04:00
Mark Stapp	4fdfda2e34	Merge pull request #7167 from donaldsharp/mlag_rd_killer zebra: the mlag_rd_buf_offset variable was write only	2020-09-30 11:40:40 -04:00
Donald Sharp	dbbae374d4	Merge pull request #7192 from deastoe/zebra-fpm-blackhole-abort zebra: fix FPM abort for unreach/prohibit routes	2020-09-29 13:47:38 -04:00
Patrick Ruddy	aa1f6a8795	Merge pull request #7188 from chiragshah6/evpn_dev zebra: EVPN avoid duplicate list-node in l3vni's l2vni-list	2020-09-29 16:33:19 +01:00
Duncan Eastoe	94f7786375	zebra: fix FPM abort for unreach/prohibit routes `b0e9567ed1` fixed an issue whereby zebra would abort while building an update for a blackhole route. The same issue, `assert(data_len)` failing in `zfpm_build_route_updates()`, can be observed when building updates for unreachable and prohibit routes. To address this `netlink_route_info_fill()` is updated to not indicate failure, due to lack of nexthops, for any blackhole routes. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-09-29 12:59:30 +01:00
Donald Sharp	a24d04f4db	zebra: Make nexthop_active check use the same debug When debugging why a route was not successfully installed into the rib, it would be preferable that the end user only have to turn on `debug zebra rib detail` as that is what we have been telling people to do for the last couple of years. Consolidate back to this. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:54:35 -04:00
Donald Sharp	81194feec9	zebra: Add missing reason we could not make an active_nexthop check Add a missing reason as to why we are unable to make an active nexthop check be successful. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:45:19 -04:00
Chirag Shah	c7e83a4efe	zebra: avoid duplication node in l3vni l2vni-list With l2vni flap leading to duplicate entry creation in l3vni's l2vni-list. Use list sorted add with no duplicates. root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 1002 Ticket:CM-31545 Reviewed By: Testing Done: With Fix: Multiple time flaps vni counts remained the same. root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# net show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1002 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-09-28 21:44:30 -07:00
Stephen Worley	66c28560ba	zebra: set NHG/backup NHG pointers on success zapi read Only set the NHG/backup NHG pointers of the caller if the read of the nexthops was successfull. Otherwise, we might free when not neccessary or double free. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2173535298	lib,zebra,sharpd: add code for backup proto-NHs but disabled Add the zapi code for encoding/decoding of backup nexthops for when we are ready for it, but disable it for now so that we revert to the old way with them. When zebra gets a proto-NHG with a backup in it, we early fail and tell the upper level proto. In this case sharpd. Sharpd then reverts to the old way of installation with the route. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	aaa42e056f	zebra: add type to nhg_prot_del API for sanity check Add type to the nhg_proto_del API params for sanity checking that the types of the route sent by the proto matches the type found with the ID. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	841f77ff04	zebra: free ctx if we skip replace for NHG PROTO routes Free the ctx if we decide we dont need to do anything with this route update. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	3d3a9dc8a7	zebra: limit no re-install to NHG PROTO using routes Limit the not re-installation of routes with the same NHG ID to routes that are using the new NHG PROTO API. This would only include sharpd and EVPN-MH for now. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8f830b8c64	zebra: use list to mark for removal when scoring In scoring our NHEs during shutdown there is a chance we could release mutliple NHEs at the same time during one iteration. This can cause memory corruption if the two being released are directly next to each other in the hash table. hash_iterate accounts for releasing one during the iteration but not two by setting hbnext before release but if hbnext is also freed, we obviously can have a problem. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70f3cda6c1	zebra: reject proto NHGs of blackhole/interface Reject proto NHGs of type blackhole/interface for now. We need to think a bit more about how to resolve these given the linux kernel needs to know the Address Family of the routes that will use them and install it with them. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	73937edb73	zebra,sharpd: checkpatch fixes Check patches fixes for NHG API pathes. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	ff9aca4f8d	lib,zebra,sharpd: clang format Clang format for NHG API and sharpd patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8b2d3a0fb6	zebra: clean up the NHG proto zapi code a bit Clean up the function names and remove some TODOs that are no longer needed/hacks we used for testing. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	e270f004ae	zebra: multipath number checks with NHG proto Get the multipath number checks working with proto-based NHG message decoding in zapi_msg.c Modify the function that checks this for routes to work without being passed a prefix as is the case with NHG creates. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8155e8c592	zebra: add flag track released state of proto NHGS Add a flag to track the released state of a proto-based NHG. This flag is used to know whether the upper level proto has called the *_del API. Typically, the NHG would just get removed and uninstalled at this point but there is a chance we are being sent it while routes are still being owned or we were sent it multiple times. This flag and associated code handles that. Ticket: CM-30369 Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70347b7ad6	zebra: reply fail on NHG add if not ifindex/onlink We currently don't support ADD/DEL/REPLACE with proto-based NHGs that are not already fully resolved and ifindex/onlink based. If we are handed one that doesn't have ifindex set i.e. recursive, gracefully fail and with a notification. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2c7819b9d4	lib,zebra: fixup NHG notify zapi messaging Make the message parameters align better with other zapi notifications and change the ID to correctly be a uint32. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Donald Sharp	27805e74f0	zebra: Properly set NEXTHOP_FLAG_FIB when skipping install When the dataplane detects that we have no need to reinstall the same route, setup the NEXTHOP_FLAG_FIB appropriately. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	e3b9c0f2f6	zebra: Only install a minimal amount of times The code was installing the nexthop group again using the NLM_F_REPLACE function causing extremely large route installation times. This reduces the time from installing 1 million routes from sharpd with a nhg from > 200 seconds ( where I gave up ) to ~15 seconds on my machine for 32 x ecmp. As a side note 1 million routes using master sharpd takes ~50 seconds to do the same thing. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	72938edfbc	zebra: add logging for NHG ignoring in netlink Add some logging for when we choose to ignore a NHG install for one reason or another. Also, cleanup some of the code using the same accessor functions for the context object. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	9c6c48bc10	zebra: return the proto nhe on del even with refs Return the proto nhe on del even if their are still possible route references. We may get a del before the routes are removed. So we still need to return this to the caller so they can decrement the ref. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	3bccc0f5eb	zebra: fix releasing proto-owned singletons Fix the releasing of proto-owned singletons from the attribute hashed table. Proto-owned singleton nexthops are hashed so they can still be shared therefore they are present in this table and need to be released when the time comes. This check was only matching on zebra proto before. Changed to match IDs in zebra allocated range. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	f651b708e0	zebra: increment the nhg proto score iterator Increment the nhg proto score iterator we used to count leftover NHGs after client disconnect and log. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	1f65568046	zebra: fix refcnt/rib issues in NHG replace/delete Fix some reference counting issues seen when replacing a NHG and deleting one. For replacement, we should end with the same refcnt on the new one. For delete, its the caller's job to decrement its ref after its done with it. Further, update routes in the rib with the new pointer after replace. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	68671c7439	zebra: warn if zapi NHG add has no nexthops Log a warning and return if we receive a NHG add via zapi that has no nexthops. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	24db1a7b9a	zebra: handle proto NHG uninstall client disconnect Add code to handle proto-based NHG uninstalling after the owning client disconnects. This is handled the same way as rib_score_proto() but for now we are ignoring instance. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6fae63d2ba	zebra: inc/dec refcount on add/del NHG proto When we add a proto NHG, increment the refcount, when we del a proto NHG, decrement the refcount rather than deleting it explicitly. If the upper level proto is handling it properly, it should get decremented to zero when we receive a NHG del. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2d8a9c544b	zebra: remove unneeded nhg repalce boilerplate Remove some leftover boilerplate from the old replace code path. That code ended up in the add API so its no longer needed. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	df3cef24c5	zebra: Prevent duplicate re-install If we have received a route that the already existing route is exactly the same, just note that it happened and move on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2b5ecd4ca6	zebra: fix route validity check with NHG ID Fix check in zread where we determine validity of a route based on reading in nexthops/checking ID is present. We had a bad conditional that was determining a route is bad if its not NHG ID based. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	cd53e3a6e6	zebra: use the passed proto from zapi We were hard coding proto bgp for use with the NHG creation. Use the actual passed one from zapi now that it exists. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	ac5d1091dc	zebra: make NHG ID allocation smarter Make NHG ID allocation smarter so it wraps once it hits the lower bound for protos and performs a lookup to make sure we don't already have that ID in use. Its pretty unlikely we would wrap since the ID space is somewhere around 24million for Zebra at this point in time. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	54c89c9377	zebra: NHG ID bounds macros Determine the NHG ID spacing and lower bound with ZEBRA_ROUTE_MAX in macros. Directly set the upperbound to be the lower 28bits of the uint32_t ID space (the top 4 are reserved for l2-NHGs). Round that number down a bit to make it more even. Convert all former lower_bound calls to just use the macro. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	16b20ad062	zebra: dont update counter if outside of zebra ID range When we receive a NHG from the kernel, we set the ID counter to that to avoid using IDs owned from the kernel. If we get one outside of zebra's range, lets not update it since its probably one we created and never deleted anyway. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2c41ef8c17	zebra: special handling for proto-NHG-based routes For now let's assume proto-NHG-based routes are good to go (we assume they are onlink/interface based anyway) and bypass route resolution altogether. Once we determine how to handle recursive nexthop-resolution for proto-NHGs we will revisit this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	50db3f2f1d	zebra: handle zapi routes with NHG ID set Add code to properly handle routes sent with NHG ID rather than a nexthop_group. For now, we separate this from backup nexthop handling since that should probably be added to the nhg_proto_add calls. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	dd1e105fe3	zebra: implement NHG proto replace Implement the ability to replace an NHG sent down from an upper level proto. With proto-owned NHGs, we make the assumption they are ecmp and always treat them as a group to make the replace from 1 -> 2 and 2 -> 1 quite a bit easier. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	08da8bbc22	zebra: hash proto-created but zebra ID spaced NHGS To prevent duplication of singleton NHGs, lets hash any zebra-ID spaced NHGs sent from an upper level proto. These would be singleton NHGs anyway and should prevent duplication of dataplane installs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6c67f41f9e	zebra,lib: command to only install proto-based nexthops Add a command/functionality to only install proto-based nexthops. That is nexthops owned/created by upper level protocols, not ones implicitly created by zebra. There are some scenarios where you would not want zebra to be arbitrarily installing nexthop groups and but you still want to use ones you have control over via lib/nexthop_group config and an upper level protocol. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	0885b1e3d9	zebra: implement protocol NHG Add/Del Implement the underlying zebra functionality to Add/Del an internal zebra and kernel NHG. These NHGs are managed by the upperlevel protocols that send them down via zapi messaging. They are not put into the overall zebra NHG hash table and only put into to the ID table. Therefore, different protos cannot and will not share NHGs. The proto is also set appropriately when sent to the kernel. Expand the separation of Zebra hashed/shared/created NHGs and proto created and mangaged NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	5b27c09d4e	zebra: remove NHG unhashable flag and its code Remove the code for setting a NHG as unhashable. Originally this was to prevent us from attempting to put duplicates from the kernel in our hashtable. Now I think its better to not use them in the hashtable at all and only track them in the ID table. Routes will still be able to use them if they specify the ID explicitly when sending Zebra the route, but 'normal' routes we hash the nexthop group on will not. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	27141ea94e	lib, zebra: Add ability to send down a nhgid over route install Modify the send down of a route to use the nexthop group id if we have one associated with the route. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	2f35a820bf	lib, zebra: Add ZAPI_NHG_ADD\|DELETE Add the ability to send a NHG from an upper level protocol down to zebra. ZAPI_NHG_ADD encompasses both the addition and replace semantics ( If the id passed down does not exist yet, it's Add, else it's a replace ). Effectively zebra will take this nhg passed down save the nhg in the id hash for nhg's and then create the appropriate nhg's and finally install them into the linux kernel. Notification will be the ZAPI_NHG_NOTIFY_OWNER zapi message for normal success/failure messaging to the installing protocol. This work is being done to allow us to work with EVPN MH which needs the ability to modify NHG's that BGP will own and operate on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	f70da2a390	zebra: Refactor nexthop reading from zapi messages Take the zebra code that reads nexthops and combine it into one function so that when we add zapi messages to send/receive nexthops we can take advantage of this function. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	786a9bd9eb	zebra: Convert zserv_nexthop_num_warn to return bool Allow us to key of the warning if we have one. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donatas Abraitis	b1f476731a	Merge pull request #7169 from donaldsharp/some_code_cleanup Some code cleanup	2020-09-25 10:19:34 +03:00
Sri Mohana Singamsetty	46dd92c522	Merge pull request #7164 from AnuradhaKaruppiah/mh-misc-fixes evpn-mh: miscellaneous cleanup/fixes	2020-09-24 08:37:45 -07:00
Donald Sharp	9781e6a047	zebra: Don't ignore setsockopt return When attempting to limit the amount of data sent from the kernel to FRR, some kernels we can run against may not have this ability in which case the setsockopt will fail. Notice that in the log. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-24 07:42:51 -04:00
Donald Sharp	00e0d113e5	zebra: the mlag_rd_buf_offset variable was write only The mlag_rd_buf_offset function was only ever being set to 0 in the mlag_read function and only written in that function. There is no need for this global variable. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 20:36:51 -04:00
Mark Stapp	ccda0eadac	Merge pull request #7155 from donaldsharp/TRAP Offload/Trap	2020-09-23 16:06:37 -04:00
Mark Stapp	4020564a3c	Merge pull request #7163 from donaldsharp/zebra_mlag_bugs Zebra mlag bugs	2020-09-23 15:32:31 -04:00
Anuradha Karuppiah	e378f5020d	zebra: fix use of freed es during zebra shutdown This problem was reported by the sanitizer - ================================================================= ==24764==ERROR: AddressSanitizer: heap-use-after-free on address 0x60d0000115c8 at pc 0x55cb9cfad312 bp 0x7fffa0552140 sp 0x7fffa0552138 READ of size 8 at 0x60d0000115c8 thread T0 #0 0x55cb9cfad311 in zebra_evpn_remote_es_flush zebra/zebra_evpn_mh.c:2041 #1 0x55cb9cfad311 in zebra_evpn_es_cleanup zebra/zebra_evpn_mh.c:2234 #2 0x55cb9cf6ae78 in zebra_vrf_disable zebra/zebra_vrf.c:205 #3 0x7fc8d478f114 in vrf_delete lib/vrf.c:229 #4 0x7fc8d478f99a in vrf_terminate lib/vrf.c:541 #5 0x55cb9ceba0af in sigint zebra/main.c:176 #6 0x55cb9ceba0af in sigint zebra/main.c:130 #7 0x7fc8d4765d20 in quagga_sigevent_process lib/sigevent.c:103 #8 0x7fc8d4787e8c in thread_fetch lib/thread.c:1396 #9 0x7fc8d4708782 in frr_run lib/libfrr.c:1092 #10 0x55cb9ce931d8 in main zebra/main.c:488 #11 0x7fc8d43ee09a in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x2409a) #12 0x55cb9ce94c09 in _start (/usr/lib/frr/zebra+0x8ac09) ================================================================= Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:20:13 -07:00
Anuradha Karuppiah	4d8b658c8c	zebra: evpn-mh: add error logs on ES processing failures Cleanup some of the XXX added during development of MH. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:15:08 -07:00
Donatas Abraitis	5fde152be6	Merge pull request #7112 from AnuradhaKaruppiah/mac-neigh-ht evpn-mh: mac-ip sync hold timers	2020-09-23 21:11:56 +03:00
Patrick Ruddy	a3b5e4fdf7	Merge pull request #7157 from donaldsharp/nhg_speeds zebra: Move debug information gathering to inside guard	2020-09-23 18:42:00 +01:00
Donald Sharp	c19808acad	zebra: Increase the read/write mlag buffer sizes The read/write mlag buffer sizes of 2k were sufficient for ~100 S,G notifications at one go. Increase to 32k to give us 16 times the space. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:13:03 -04:00
Donald Sharp	7692744f2c	zebra: Ensure that message received from mlag will fit If we receive a message that is greater than our buffer size we are in a situation where both the read and write buffers are fubar'ed beyond the end. Assert when we notice this fact. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:26 -04:00
Donald Sharp	f24d9ab667	zebra: modify mlag code to only need 1 stream when generating data The normal pattern of writing the type/length at the beginning of the packet was not being quite followed. Modify the mlag code to respect the proper way of doing things and get rid of a stream_new and copy. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:20 -04:00
Anuradha Karuppiah	2b9e207e0e	zebra: stop neigh hold timer when the neigh is deleted The neigh hold timer was firing after the neigh was deleted resulting in the following crash - [ at ./zebra/zebra_evpn_neigh.h:155 at zebra/zebra_evpn_neigh.c:447 at lib/thread.c:1578 at zebra/main.c:488 ] Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Don Slice	f9f0463fb9	zebra: fix deletion of evpn mh neigh-holdtime Found that the command "evpn mh neigh-holdtime" can be set but not deleted. This fix solves the delete process Signed-off-by: Don Slice <dslice@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Anuradha Karuppiah	41c809b2a8	zebra: changes for configuring mac and neigh holdtime When an ES peer withdraws a MAC-IP route we hold the entry for N seconds to allow an external daemon (neighmgr) to establish host reachability independent of the peer. Add config commands to allow the user to set this holdtime (N). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Donald Sharp	aa178efd49	Merge pull request #7148 from pguibert6WIND/fix_fd_not_closed zebra: fix fd going out of scope leaks the handle	2020-09-23 07:40:14 -04:00
Donatas Abraitis	0ce5baaab1	Merge pull request #7018 from gouault6wind/show_ip_route Clean up in vrf management	2020-09-23 08:45:09 +03:00
Donald Sharp	bed74d178e	zebra: Move debug information gathering to inside guard Let's not make the entire `depend_finds` function pay for the data gathering needed for the debug. There are numerous other places in the code that check the NEXTHOP_FLAG_RECURSIVE and do the same output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 20:47:33 -04:00
Sri Mohana Singamsetty	efdd997dad	Merge pull request #7116 from AnuradhaKaruppiah/mh-neigh-fixes evpn-mh: changes for programming synced neighs as static in the dataplane	2020-09-22 15:45:09 -07:00
Mark Stapp	b6033bd1c1	Merge pull request #7067 from donaldsharp/remove_solaris Remove solaris	2020-09-22 17:04:19 -04:00
Donald Sharp	5a3cf85391	lib, zebra: Add ability to read kernel notice of TRAP/OFFLOAD The linux kernel is getting RTM_F_TRAP and RTM_F_OFFLOAD for kernel routes that have an underlying asic offload. Write the code to receive these notifications from the linux kernel and to store that data for display about the routes. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-22 15:57:43 -04:00
Donald Sharp	4c56ce1cea	zebra: Add basic knowledge of asic offload available Some linux kernels are starting to support the idea of knowledge about the underlying asic. Add a boolean that we can set/unset to track whether or not we think the router has this functionality available. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 15:57:43 -04:00
Philippe Guibert	7529bf8f05	zebra: fix fd going out of scope leaks the handle the file descriptor is closed if it has been locally created. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-22 21:09:13 +02:00
Donald Sharp	cae8bc967c	*: Remove solaris from FRR The Solaris code has gone through a deprecation cycle. No-one has said anything to us and worse of all we don't have any test systems running Solaris to know if we are making changes that are breaking on Solaris. Remove it from the system so we can clean up a bit. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-21 10:02:20 -04:00
Christophe Gouault	1eb92f06c4	vrf: VRF_DEFAULT must be 0, remove useless code Code was added in the past to support a value of VRF_DEFAULT different from 0. This option was abandoned, the default vrf id is always 0. Remove this code, this will simplify the code and improve performance (use a constant value instead of a function that performs tests). Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:17:35 +02:00
Christophe Gouault	d58b6f7568	zebra: always display vrf in show ip route json In route json outputs, always display the vrf even if it is the default vrf. Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:07:56 +02:00
Christophe Gouault	6ac9404177	zebra: simplify and optimize vrf display in show ip route In all outputs (text and json): simplify and optimize the vrf name display, use the vrf_id_to_name() handler. Note: vrf_id_to_name() has a safeguard system that prevents from crashing when the vrf cannot be found because it changed in some (unexpected) manner, it returns "n/a". Note: "vrf n/a" will now be displayed instead of "vrf UNKNOWN" in this case, like in most other frr components. This safeguard was missing for show ip route json, so this optimization also fixes a potential crash. Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:07:56 +02:00
Christophe Gouault	8277d4b796	zebra: fix show ip route output Variable "show ip route" commands invoke the same helper (do_show_ip_route), potentially several times. When asking to dump a non-default vrf, all vrfs or all tables, the output is messy, the header summarizing abbreviations is repeated several times, excess line feeds appear, the default table of default VRF is concatenated to the previous table output... Normalize the output: - whatever the case, display the common header at most once, if there is at least an entry to dump. - when using a "vrf all" or "table all" command, prepend a line with the VRF and table (even for the default vrf or table). - when dumping a specific vrf or table, prepend a line with the VRF and table. Example (vrf all) ================= router# show ip route vrf all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main: C>* 10.0.2.0/24 is directly connected, mgmt0, 00:24:09 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:24:09 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:00:26 VRF private: S>* 1.1.1.0/24 [1/0] via 10.125.0.2, loop0, 00:00:29 C>* 10.125.0.0/24 is directly connected, loop0, 00:00:42 Example (main vrf) ================== router# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route C>* 10.0.2.0/24 is directly connected, mgmt0, 00:24:41 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:24:41 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:00:58 Example (specific vrf) ====================== router# show ip route vrf private Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF private: S>* 1.1.1.0/24 [1/0] via 10.125.0.2, loop0, 00:01:23 C>* 10.125.0.0/24 is directly connected, loop0, 00:01:36 Example (all tables) ==================== router# show ip route table all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main table 200: S>* 4.4.4.4/32 [1/0] via 10.125.0.3, ntfp2, 00:01:51 VRF main table 254: C>* 10.0.2.0/24 is directly connected, mgmt0, 00:25:34 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:25:34 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:01:51 Example (all vrf, all table) ============================ router# show ip route table all vrf all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main table 200: S>* 4.4.4.4/32 [1/0] via 10.125.0.3, ntfp2, 00:02:15 VRF main table 254: C>* 10.0.2.0/24 is directly connected, mgmt0, 00:25:58 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:25:58 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:02:15 VRF private table 200: S>* 2.2.2.0/24 [1/0] via 10.125.0.2, loop0, 00:02:18 VRF private table 254: S>* 1.1.1.0/24 [1/0] via 10.125.0.2, loop0, 00:02:18 C>* 10.125.0.0/24 is directly connected, loop0, 00:02:31 Example (specific table) ======================== router# show ip route table 200 Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main table 200: S>* 4.4.4.4/32 [1/0] via 10.125.0.3, ntfp2, 00:05:26 Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:07:56 +02:00
Donald Sharp	8c36640b7c	zebra: Allow FreeBSD to set and delete addresses from an interface This series of events: $ sudo ifconfig lo0 add 4.4.4.4/32 $ sudo ifconfig lo0 inet 4.4.4.4/32 delete would end up leaving the 4.4.4.4/32 address on the interface under freebsd. This all boils down to the fact that the interface is not considered connected yet we have a destination. If the destination is the same and we are not connected ignore it on freebsd. I am sure there are other fun scenarios that someone will have to squirrel out. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-20 20:53:27 -04:00
Russ White	0a6e6613d7	Merge pull request #7089 from pguibert6WIND/netns-refactor Netns refactor	2020-09-18 11:02:30 -04:00
Anuradha Karuppiah	fb8f609d48	zebra: fixup a neigh del bug introduced by a previous commit for MAC-IP sync Problem commit - [ `b169fd6fd5` zebra: support for MAC-IP sync routes ] That commit had accidentally replaced a mac-ip del to bgp with a mac del (consequence of a bad cut-paste). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-16 17:32:33 -07:00
Anuradha Karuppiah	ccd187cdaa	zebra: setup static neigh flag in the dataplane Changes to setup peer-synced as static in the dataplane. This prevents them from being flushed out when the local switch cannot establish their reachability. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-16 17:24:18 -07:00
Patrick Ruddy	f6374bd9b7	Merge pull request #7107 from AnuradhaKaruppiah/mh-cleanup zebra: re-name some mh functions to make the code more readable	2020-09-16 15:42:49 +01:00
Anuradha Karuppiah	945ee7b272	zebra: re-name some mh functions to make the code more readable As a part of the re-factoring some of the evpn_vni_es apis got re-named as evpn_evpn_es. Changed them to evpn_es_evi to make it common to vxlan and mpls. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-15 16:24:59 -07:00
Chirag Shah	839dfe29a6	zebra: dup detected mac avoid update to bgpd When a MAC is detected duplicate on a local learn event (with freeze action), do not send update to bgp to advertise into evpn control plane. With evpn mh, inform_client flag is set and sends notification to bgp albeit dup detect is set. Check mac are detected as duplicate before setting inform_client to true. Ticket:CM-29817 Reviewed By:CCR-10329 Testing Done: Enable DAD with freeze action Upon local learn MAC detected as duplica Signed-off-by: Chirag Shah <chirag@cumulusnetworks.com>	2020-09-15 16:12:44 -07:00
Donald Sharp	58a1d24924	bgpd, lib, pbrd, zebra: Pass by ifname When installing rules pass by the interface name across zapi. This is being changed because we have a situation where if you quickly create/destroy ephermeal interfaces under linux the upper level protocol may be trying to add a rule for a interface that does not quite exist at the moment. Since ip rules actually want the interface name ( to handle just this sort of situation ) convert over to passing the interface name and storing it and using it in zebra. Ticket: CM-31042 Signed-off-by: Stephen Worley <sworley@nvidia.com> Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-11 20:04:45 -04:00
Renato Westphal	2525fc315a	Merge pull request #6974 from liron-ze/high-cpu-usage zebra: Solve the problem of high CPU and memory usage when creating t…	2020-09-11 15:59:27 -03:00
Philippe Guibert	a1ce03e1fd	zebra: support for macvlan interfaces mac vlan interfaces are supported in different network namespaces. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	289b0f0dfd	zebra: dynamically detect vxlan link interfaces in other netns this is used when parsing the newly network namespaces. actually, to track the link of some interfaces like vxlan interfaces, both link index and link nsid are necessary. if a vxlan interface is moved to a new netns, the link information is in the default network namespace, then LINK_NSID is the value of the netns by default in the new netns. That value of the default netns in the new netns is not known, because the system does not automatically assign an NSID of default network namespace in the new netns. Now a new NSID of default netns, seen from that new netns, is created. This permits to store at netns creation the default netns relative value for further usage. Because the default netns value is set from the new netns perspective, it is not needed anymore to use the NETNSA_TARGET_NSID attribute only available in recent kernels. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	2961d0601c	lib, zebra: reuse and adapt ns_list walk functionality the walk routine is used by vxlan service to identify some contexts in each specific network namespace, when vrf netns backend is used. that walk mechanism is extended with some additional paramters to the walk routine. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	b5fde6fd5c	zebra: local mac entries populated in correct netnamespace MAC address neighbours are read on the same netnamespace where the vxlan interface is. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	d6bf8f136d	zebra: when parsing local entry against dad, retrieve config when duplicate address detection is observed, some incrementation, some timing mechanisms need to be done. For that the main evpn configuration is retrieved. Until now, the VRF that was storing the dad config parameters was the same VRF that hosted the VXLAN interface. With netns backend, this is not true, as the VXLAN interface is in the same VRF as the bridge interface. The modification takes same definition as in BGP, that is to say that there is a single bgp evpn instance, and this is that instance that will give the correct config settings. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	9609fab712	zebra: zvni_map_to_vlan() adaptation for all namespaces this change is needed when a MAC/IP entry is learned by zebra, and the entry happens to be in a different namespace. So that the entry be active, the correct vni match has to be found. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:02:50 +02:00
Jakub Urbańczyk	97f8514460	zebra: increase netlink receive buffer size Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-09-01 11:35:16 +02:00
Anuradha Karuppiah	4cd940502d	zebra: fix misc problems with ES management 1. MAC ref of a zero ESI was accidentally creating a new ES with zero ES id. 2. When an ES was deleted and re-added the ES was not being sent to BGP because of a stale flag that suppressed the update as a dup. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-28 15:10:18 -07:00
Donald Sharp	9271987f1e	zebra: When we get a rib deletion event be smarter When we get a rib deletion event and we already have that particular route node in the queue to be reprocessed, just note that someone from kernel land has done us dirty and allow it to be cleaned up by normal processing Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-28 14:45:59 -04:00
Donald Sharp	0aaa722883	zebra: When shutting down an interface immediately notify about rnh Imagine a situation where a interface is bouncing up/down. The interface comes up and daemons like pbr will get a nht tracking callback for a connected interface up and will install the routes down to zebra. At this same time the interface can go down. But since zebra is busy handling route changes ( from pbr ) it has not read the netlink message and can get into a situation where the route resolves properly and then we attempt to install it into the kernel( which is rejected ). If the interface bounces back up fast at this point, the down then up netlink message will be read and create two route entries off the connected route node. Zebra will then enqueue both route entries for future processing. After this processing happens the down/up is collapsed into an up and nexthop tracking sees no changes and does not inform any upper level protocol( in this case pbr ) that nexthop tracking has changed. So pbr still believes the nexthops are good but the routes are not installed since pbr has taken no action. Fix this by immediately running rnh when we signal a connected route entry is scheduled for removal. This should cause upper level protocols to get a rnh notification for the small amount of time that the connected route was bouncing around like a madman. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-28 14:45:59 -04:00

1 2 3 4 5 ...

4089 Commits