mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-12-26 18:40:48 +00:00

Author	SHA1	Message	Date
Anuradha Karuppiah	bda6be1c8b	zebra: Send path del to bgp for local-inactive path Problem: When IP1:M1 (local) moved to IP1:M2 (remote-VTEP) bgpd continues to advertise IP1:M1. Fix: Local path del is sent to bgp if the neigh was {local-active||peer-active}. So path del needs to be called before the sync flags (including peer-active) are cleared. Ticket: #2706744 Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2021-09-07 09:53:48 -07:00
Donald Sharp	a1f35d7e7c	zebra: If we hand set the router-id only update everyone if it changes When we hand set the router-id, but we have chosen a router-id that is already the `winner`, there is no point in updating anyone with this data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-07 12:53:38 -04:00
Donald Sharp	0114135890	zebra: Do not send a router-id of 0.0.0.0 when we don't know it yet At startup there exists a time frame where we might not know a particular vrf's router id. When zebra gets a request for it let's not just blindly send whatever we have. Let's be a bit smart and only respond with one if we have one. The upper level protocol can wait for it to have one. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-07 12:53:37 -04:00
G. Paul Ziemba	a383d4d201	vrf_name_to_id(): remove vrf_name_to_id() returned VRF_DEFAULT when the vrf name was unknown, hiding errors. Per community recommendation, vrf_name_to_id() is now removed and the few callers now use vrf_lookup_by_name() directly. Signed-off-by: G. Paul Ziemba <paulz@labn.net>	2021-09-07 09:47:24 -07:00
Hiroki Shirokura	0a735cd523	zebra: add srv6's no commands CURRENT_CONFIGURATION: configure terminal segment-routing srv6 locators locator loc1 locator loc2 locator loc3 CMD1: delete single locator configure terminal segment-routing srv6 locators no locator loc1 CMD2: delete srv6 whole config (== delete all locators) configure terminal segment-routing no srv6 Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-09-07 12:54:39 +00:00
Hiroki Shirokura	f6e52a81dc	zebra: eliminate srv6 locator auto allocation by zclient request Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-09-07 12:54:39 +00:00
Hiroki Shirokura	f5ca329b2d	zebra: implement srv6 locator add/delete notification via ZAPI Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-09-07 12:54:37 +00:00
Philippe Guibert	2490726201	zebra: update zl3vni when bridge link refreshed in other namespaces When running a bgp evpn rt5 setup with the vrf namespace backend, once the BGP configuration is loaded, some refreshes, such as a config change of a vxlan interface, are not taken into account. As a consequence, the BGP l2vpn evpn entries are empty. This can happen by recreating the vxlan interface as follows: ip netns exec cust1 ip li del vxlan1000 ip link add vxlan1000 type vxlan id 1000 dev loopback0 local 10.209.36.1 learning ip link set dev vxlan1000 mtu 9000 ip link set dev vxlan1000 netns cust1 ip netns exec cust1 bash ip link set dev vxlan1000 up ip link set dev vxlan1000 master br1000 Actually, changing the learning attribute requires recreation, and this change requires manually reloading the frr configuration. The update mechanism in zebra for vxlan interface updates is already in place, but it does not work well with the namespace based vrf backend. The function zl3vni_from_svi() is therefore modified to parse all the interfaces of each namespace. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-09-07 14:10:58 +02:00
Kantesh Mundaragi	0789eb69e5	bgpd: VRF-Lite fix nexthop type Description: Change is intended for fixing the following issues related to vrf route leaking: Routes with special nexthops i.e. blackhole/sink routes when imported, are not programmed into the FIB and corresponding nexthop is set as 'inactive', nexthop interface as 'unknown'. While importing/leaking routes between VRFs, in case of special nexthop(ipv4/ipv6) once bgp announces route(s) to zebra, nexthop type is incorrectly set as NEXTHOP_TYPE_IPV6_IFINDEX/NEXTHOP_TYPE_IFINDEX i.e. directly connected even though we are not able to resolve through an interface. This leads to nexthop_active_check marking nexthop !NEXTHOP_FLAG_ACTIVE. Unable to find the active nexthop(s), route is not programmed into the FIB. Whenever BGP leaks routes, set the correct nexthop type, so that route gets resolved and correctly programmed into the FIB, in the imported vrf. Co-authored-by: Kantesh Mundaragi <kmundaragi@vmware.com> Signed-off-by: Iqra Siddiqui <imujeebsiddi@vmware.com>	2021-09-07 01:50:06 -07:00
Donald Sharp	a81982fa56	zebra: Convert to `enum zebra_slave_iftype` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	e6f2bec087	zebra: Convert to `enum zebra_iftype` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	60e3656140	zebra: Convert to `struct zebra_fec` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	8f74a383b3	zebra: Convert to `struct zebra_lsp` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	f2595bd505	zebra: Convert to `struct zebra_nhlfe` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	a7d2146a41	zebra: Convert to `struct zebra_ile` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	72de4110dc	zebra: Convert to `struct zebra_neigh` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:23 -04:00
Donald Sharp	05843a27f5	zebra: Convert to `struct zebra_l3vni` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:22 -04:00
Donald Sharp	847f168d76	zebra: Convert to `struct zebra_vxlan_sg` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:22 -04:00
Donald Sharp	3198b2b347	zebra: Convert to `struct zebra_mac` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:22 -04:00
Donald Sharp	c172c032ef	zebra: Convert to `struct zebra_vtep` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:22 -04:00
Donald Sharp	f6371c343a	zebra: Convert to `struct zebra_evpn` as per our internal standard We do not use typedef's to talk about structures as per our standard. Fixing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-02 10:33:22 -04:00
Russ White	648c73647d	Merge pull request #9488 from pguibert6WIND/fix_nhrp_neigh_state Fix nhrp neigh state	2021-08-27 19:00:45 -04:00
David Lamparter	8268be3d16	Merge pull request #9496 from idryzhov/vrf-cmd-init-unused-arg lib: remove unused argument from vrf_cmd_init	2021-08-27 10:39:45 +02:00
Christian Hopps	d448e2c5f9	Merge pull request #9331 from idryzhov/explicit-exit *: explicitly print "exit" at the end of every node config	2021-08-26 11:57:33 -04:00
Igor Ryzhov	cfc369c43a	lib: remove unused argument from vrf_cmd_init Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-08-26 12:01:22 +03:00
Philippe Guibert	80f6b5faeb	lib, zebra: complete the ndm flags on zclient api Insist on the fact that zclient neighbor state flags are mapped over netlink state flags. List all the defines currently known on kernel, and create a netlink API to convert netlink values to zclient values. The function is simplified as it is a 1-1 match. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-08-26 09:19:42 +02:00
Philippe Guibert	c4e1fd52a1	nhrp, zebra, lib: pass exact received neighbor state value to nhrp As NHRP expects some notification of neighboring entries on GRE interface, when a new interface notification is encountered, the exact neighbor state flag is found. Previously, the flag passed to the upper layer was forced to NDM_STATE which is REACHABLE, as can be seen in the trace below: 2021/08/25 10:58:39 NHRP: [QQ0NK-1H449] Netlink: new-neigh 102.1.1.1 dev gre1 lladdr 10.125.0.2 nud 0x2 cache used 1 type 5 When passing the real value, NHRP receives another value like STALE. 2021/08/25 11:28:44 NHRP: [QQ0NK-1H449] Netlink: new-neigh 102.1.1.1 dev gre1 lladdr 10.125.0.2 nud 0x4 cache used 0 type 5 This flag is important for NHRP, as it allows monitoring the link layer of NHRP entries. Fixes: `d603c0774e` ("nhrp, zebra, lib: enforce usage of zapi_neigh_ip structure") Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-08-26 09:19:42 +02:00
Donatas Abraitis	3e324ff419	Merge pull request #9466 from idryzhov/vrf-netns lib, zebra: move vrf netns commands from lib to zebra	2021-08-26 07:46:19 +03:00
Donatas Abraitis	d10bda270e	*: Drop `break` after using frr_help_exit() in switch/case Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-08-25 10:49:05 +03:00
Igor Ryzhov	37cb0475e1	lib, zebra: move vrf netns commands from lib to zebra "[no] netns NAME" commands are part of the lib, but they are actually zebra-only: - they are using vrf_netns_handler_create and its description clearly says that it "should be called from zebra only" - vtysh sends these commands only to zebra - only zebra outputs the netns related config - zebra notifies other daemons about netns attachment Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-08-23 23:54:12 +03:00
Igor Ryzhov	07679ad98a	*: explicitly print "exit" at the end of every node config There is a possibility that the same line can be matched as a command in some node and its parent node. In this case, when reading the config, this line is always executed as a command of the child node. For example, with the following config: ``` router ospf network 193.168.0.0/16 area 0 ! mpls ldp discovery hello interval 111 ! ``` Line `mpls ldp` is processed as command `mpls ldp-sync` inside the `router ospf` node. This leads to a complete loss of `mpls ldp` node configuration. To eliminate this issue and all possible similar issues, let's print an explicit "exit" at the end of every node config. This commit also changes indentation for a couple of existing exit commands so that all existing commands are on the same level as their corresponding node-entering commands. Fixes #9206. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-08-23 22:08:20 +03:00
Mark Stapp	b87c5f4dd1	Merge pull request #9434 from anlancs/fix-zebra-mpls-cmd zebra: fix wrong check of mpls command	2021-08-23 09:02:15 -04:00
Donald Sharp	33c0851873	zebra: Fix usage to enum in notify functions For some reason commit #ef524230a6baa decided to remove enums and switch to uint16_t. Which is not the right thing to do. Put it back Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-19 11:31:05 -04:00
anlan_cs	21683186a0	zebra: fix wrong check of mpls command zebra_mpls_transit_lsp() may be called with an empty nexthop: "no mpls lsp (16-1048575)". So just remove this "gate_str" check. If there is no "gate" in the command, "gtype" is set to NEXTHOP_TYPE_BLACKHOLE for subsequent processing. Signed-off-by: anlan_cs <anlan_cs@tom.com>	2021-08-18 19:34:03 -04:00
Philippe Guibert	7a52f27e75	zebra: RTM_GETNEIGH messages may be used by nhrp When NHRP registers to zebra to receive link layer events related to gre interfaces, then it is interested in receiving also RTM_GETNEIGH messages. Fixes ("b3b751046495") nhrpd: link layer registration to notifications Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-08-17 09:07:31 +02:00
Renato Westphal	1dfa8b8991	Merge pull request #9380 from mjstapp/fix_static_lsp_cli zebra: mpls validation and static lsp fixes	2021-08-16 12:06:01 -03:00
Igor Ryzhov	f0010840e8	Merge pull request #9389 from mjstapp/fix_netlink_if_name_sa zebra: interface name must be a valid string	2021-08-14 02:14:44 +03:00
Mark Stapp	e9f79fff57	zebra: interface name must be a valid string Validate incoming netlink interface name strings. Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>	2021-08-13 16:06:07 -04:00
Igor Ryzhov	1523c0f9ee	Merge pull request #9371 from donaldsharp/zebra_evpn_getl zebra: Ensure stream is long enough	2021-08-13 14:06:37 +03:00
Donald Sharp	a876da9b08	Merge pull request #9374 from mjstapp/fix_nhg_add_leak zebra: clean up nhg allocations in error path	2021-08-12 15:34:07 -04:00
Donald Sharp	86d87c5352	zebra: Ensure stream is long enough In zebra_evpn_proc_remote_nh if we do not pass in a long enough stream, the stream reads will fail. Ensure that we have enough data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-12 15:29:47 -04:00
Mark Stapp	1722cef455	Merge pull request #9304 from donaldsharp/zebra_random_stuff Zebra random stuff	2021-08-12 10:16:46 -04:00
Mark Stapp	a44e310631	zebra: mpls validation and static lsp fixes Handle TYPE_IFINDEX nexthops more consistently in a few places; be more specific about a few integer return values that were being treated as booleans. Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>	2021-08-12 08:53:53 -04:00
Mark Stapp	fd99142ab7	zebra: clean up nhg allocations in error path Clean up allocated nhgs in error path in zread_nhg_add(). Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>	2021-08-11 10:41:53 -04:00
Donald Sharp	c472a97080	Merge pull request #9367 from mjstapp/fix_rt_netlink_af zebra: ignore unknown address-family in netlink route msg	2021-08-11 08:11:39 -04:00
Mark Stapp	deb28338de	zebra: ignore unknown address-family in netlink route msg Ignore AFs we don't handle in incoming netlink route updates. Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>	2021-08-10 11:44:08 -04:00
Sri Mohana Singamsetty	dd4c59d79a	Merge pull request #9236 from AnuradhaKaruppiah/v6-nh-rmac zebra: use a separate dummy prefix for referencing v6 nexthops	2021-08-10 08:20:55 -07:00
Igor Ryzhov	bdb7b7c5d9	Merge pull request #9321 from donaldsharp/no_leak_re zebra: Prevent memory leak if route is rejected early	2021-08-10 11:39:30 +03:00
Donald Sharp	38c764dde4	zebra: Properly note add/update for rib_add_multipath_nhe When calling rib_add_multipath_nhe ensure that we have well aligned return codes that mean something so that interested parties can properly handle the situation. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-09 08:06:33 -04:00
Donald Sharp	f94a7703c0	zebra: Prevent memory leak if route is rejected early When receiving a route via zapi, if the route is rejected there exists a code path where we would not free the corresponding re created. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-09 07:55:07 -04:00
Donald Sharp	572bc3167f	zebra: Delete rib_lookup_and_dump since it is not used The rib_lookup_and_dump function is never used, remove Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-06 10:04:40 -04:00
Donald Sharp	42aa465ed1	zebra: Remove rib_lookup_and_pushup function The rib_lookup_and_pushup function, from what I can tell, was used more when static route processing and connected routes were more closely integrated in zebra. The goal was when we were adding a new address to remove the connected route and then allow processing of the new address. With the re-org a few years ago to seperate out connected routes as well as static routes, I believe this is no longer needed. on BSD, without this code change we have this log: 2021/08/05 14:33:38 ZEBRA: [QEVVE-G3FQQ] rib_meta_queue_add: (0:0):10.40.30.0/24: queued rn 0x802022bb0 into sub-queue 4 2021/08/05 14:33:38 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 200 Type: RTM_DELETE 2021/08/05 14:33:38 ZEBRA: [V3NSB-BPKBD] Kernel: GATEWAY DONE PROTO1 2021/08/05 14:33:38 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 15 2021/08/05 14:33:38 ZEBRA: [MJD4M-0AAAR] Kernel: pid 53305, rtm_addrs {DST,GATEWAY,NETMASK} 2021/08/05 14:33:38 ZEBRA: [Y9Y5K-JJ7NT] rtm_read: got rtm of type 2 (RTM_DELETE) addrs {DST,GATEWAY,NETMASK} 2021/08/05 14:33:38 ZEBRA: [V17DT-1FJEN] kernel_rtm: 10.40.30.0/24: successfully did NH 9.8.6.7 2021/08/05 14:33:38 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 164 Type: RTM_NEWADDR 2021/08/05 14:33:38 ZEBRA: [V3NSB-BPKBD] Kernel: 2021/08/05 14:33:38 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 4664 2021/08/05 14:33:38 ZEBRA: [MJD4M-0AAAR] Kernel: pid 0, rtm_addrs {DST} 2021/08/05 14:33:38 ZEBRA: [M09CX-TKB4N] ifam_read_mesg: ifindex 1, ifname vtnet0, ifam_addrs {NETMASK,IFP,IFA,BRD}, ifam_flags 0x0, addr 10.40.30.1/24 broad 10.40.30.255 dst (unspec) gateway (unspec) 2021/08/05 14:33:38 ZEBRA: [MFYWV-KH3MC] rib_add_multipath_nhe: (0:0):10.40.30.0/24: Inserting route rn 0x802022bb0, re 0x8032973a0 (connected) existing 0x0, same_count 0 2021/08/05 14:33:38 ZEBRA: [Q4T2G-E2SQF] rib_add_multipath_nhe: dumping RE entry 0x8032973a0 for 10.40.30.0/24 vrf default(0) 2021/08/05 14:33:38 ZEBRA: [M5M58-9PD2R] 10.40.30.0/24: uptime == 1379355, type 
== 2, instance == 0, table == 0 2021/08/05 14:33:38 ZEBRA: [RVZMM-N7DME] 10.40.30.0/24: metric == 1, mtu == 0, distance == 0, flags == None status == None 2021/08/05 14:33:38 ZEBRA: [Q1NW5-NWY7P] 10.40.30.0/24: nexthop_num == 1, nexthop_active_num == 0 2021/08/05 14:33:38 ZEBRA: [TFHQ8-TC30H] 10.40.30.0/24: NH vtnet0[1] vrf default(0) with flags 2021/08/05 14:33:38 ZEBRA: [SCETK-GQ9E4] 10.40.30.0/24: dump complete 2021/08/05 14:33:38 ZEBRA: [QEVVE-G3FQQ] rib_meta_queue_add: (0:0):10.40.30.0/24: queued rn 0x802022bb0 into sub-queue 2 2021/08/05 14:33:38 ZEBRA: [MFYWV-KH3MC] rib_add_multipath_nhe: (0:?):10.40.30.0/24 (MRIB): Inserting route rn 0x802022f30, re 0x803297340 (connected) existing 0x0, same_count 0 2021/08/05 14:33:38 ZEBRA: [Q4T2G-E2SQF] rib_add_multipath_nhe: dumping RE entry 0x803297340 for 10.40.30.0/24 vrf default(0) 2021/08/05 14:33:38 ZEBRA: [M5M58-9PD2R] 10.40.30.0/24: uptime == 1379355, type == 2, instance == 0, table == 0 2021/08/05 14:33:38 ZEBRA: [RVZMM-N7DME] 10.40.30.0/24: metric == 1, mtu == 0, distance == 0, flags == None status == None 2021/08/05 14:33:38 ZEBRA: [Q1NW5-NWY7P] 10.40.30.0/24: nexthop_num == 1, nexthop_active_num == 0 2021/08/05 14:33:38 ZEBRA: [TFHQ8-TC30H] 10.40.30.0/24: NH vtnet0[1] vrf default(0) with flags 2021/08/05 14:33:38 ZEBRA: [SCETK-GQ9E4] 10.40.30.0/24: dump complete 2021/08/05 14:33:38 ZEBRA: [GCGMT-SQR82] rib_link: (0:?):10.40.30.0/24 (MRIB): rn 0x802022f30 adding dest 2021/08/05 14:33:38 ZEBRA: [QEVVE-G3FQQ] rib_meta_queue_add: (0:0):10.40.30.0/24 (MRIB): queued rn 0x802022f30 into sub-queue 2 2021/08/05 14:33:38 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 240 Type: RTM_ADD 2021/08/05 14:33:38 ZEBRA: [V3NSB-BPKBD] Kernel: UP PINNED 2021/08/05 14:33:38 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 0 2021/08/05 14:33:38 ZEBRA: [MJD4M-0AAAR] Kernel: pid 0, rtm_addrs {DST,GATEWAY,NETMASK} 2021/08/05 14:33:38 ZEBRA: [K0KVE-2GJA1] default(0:0):10.40.30.0/24: Processing rn 0x802022bb0 2021/08/05 14:33:38 ZEBRA: [RWCK7-TX4HT] 
default(0:0):10.40.30.0/24: Examine re 0x8032973a0 (connected) status: Changed flags: None dist 0 metric 1 2021/08/05 14:33:38 ZEBRA: [RWCK7-TX4HT] default(0:0):10.40.30.0/24: Examine re 0x8032970a0 (static) status: None flags: Recursion RR Distance dist 1 metric 0 2021/08/05 14:33:38 ZEBRA: [NYYJJ-0Q8QG] default(0:0):10.40.30.0/24: After processing: old_selected 0x0 new_selected 0x8032973a0 old_fib 0x0 new_fib 0x8032973a0 2021/08/05 14:33:38 ZEBRA: [RT9DY-ZS2KN] default(0:0):10.40.30.0/24: Adding route rn 0x802022bb0, re 0x8032973a0 (connected) 2021/08/05 14:33:38 ZEBRA: [PP3BZ-RABJN] default(0:0):10.40.30.0/24: rn 0x802022bb0 dequeued from sub-queue 2 2021/08/05 14:33:38 ZEBRA: [K0KVE-2GJA1] default(0:0):10.40.30.0/24: Processing rn 0x802022f30 2021/08/05 14:33:38 ZEBRA: [RWCK7-TX4HT] default(0:0):10.40.30.0/24: Examine re 0x803297340 (connected) status: Changed flags: None dist 0 metric 1 2021/08/05 14:33:38 ZEBRA: [NYYJJ-0Q8QG] default(0:0):10.40.30.0/24: After processing: old_selected 0x0 new_selected 0x803297340 old_fib 0x0 new_fib 0x803297340 2021/08/05 14:33:38 ZEBRA: [RT9DY-ZS2KN] default(0:0):10.40.30.0/24: Adding route rn 0x802022f30, re 0x803297340 (connected) 2021/08/05 14:33:38 ZEBRA: [PP3BZ-RABJN] default(0:0):10.40.30.0/24: rn 0x802022f30 dequeued from sub-queue 2 2021/08/05 14:33:38 ZEBRA: [K0KVE-2GJA1] default(0:0):10.40.30.0/24: Processing rn 0x802022bb0 2021/08/05 14:33:38 ZEBRA: [RWCK7-TX4HT] default(0:0):10.40.30.0/24: Examine re 0x8032973a0 (connected) status: Queued flags: Selected dist 0 metric 1 2021/08/05 14:33:38 ZEBRA: [RWCK7-TX4HT] default(0:0):10.40.30.0/24: Examine re 0x8032970a0 (static) status: None flags: Recursion RR Distance dist 1 metric 0 2021/08/05 14:33:38 ZEBRA: [NYYJJ-0Q8QG] default(0:0):10.40.30.0/24: After processing: old_selected 0x8032973a0 new_selected 0x8032973a0 old_fib 0x8032973a0 new_fib 0x8032973a0 2021/08/05 14:33:38 ZEBRA: [PP3BZ-RABJN] default(0:0):10.40.30.0/24: rn 0x802022bb0 dequeued from sub-queue 4 
2021/08/05 14:33:38 ZEBRA: [GHWHS-ZKQM5] update_from_ctx: default(0:0):10.40.30.0/24: SELECTED, re 0x8032973a0 2021/08/05 14:33:38 ZEBRA: [TS3SH-1276M] default(0:0):10.40.30.0/24 update_from_ctx(): no fib nhg 2021/08/05 14:33:38 ZEBRA: [HKQXC-4STSK] default(0:0):10.40.30.0/24 update_from_ctx(): rib nhg matched, changed 'false' 2021/08/05 14:33:38 ZEBRA: [HBZNK-5H1X0] (0:0):10.40.30.0/24: Redist update re 0x8032973a0 (connected), old 0x0 (None) 2021/08/05 14:33:38 ZEBRA: [GHWHS-ZKQM5] update_from_ctx: default(0:0):10.40.30.0/24: SELECTED, re 0x8032973a0 2021/08/05 14:33:38 ZEBRA: [TS3SH-1276M] default(0:0):10.40.30.0/24 update_from_ctx(): no fib nhg 2021/08/05 14:33:38 ZEBRA: [HKQXC-4STSK] default(0:0):10.40.30.0/24 update_from_ctx(): rib nhg matched, changed 'false' 2021/08/05 14:33:38 ZEBRA: [HBZNK-5H1X0] (0:0):10.40.30.0/24: Redist update re 0x8032973a0 (connected), old 0x0 (None) With this code change: 2021/08/05 14:41:24 ZEBRA: [MFYWV-KH3MC] rib_add_multipath_nhe: (0:?):10.10.40.0/24: Inserting route rn 0x802022f30, re 0x8021cbe60 (static) existing 0x0, same_count 0 2021/08/05 14:41:24 ZEBRA: [RT9DY-ZS2KN] default(0:0):10.10.40.0/24: Adding route rn 0x802022f30, re 0x8021cbe60 (static) 2021/08/05 14:41:24 ZEBRA: [V17DT-1FJEN] kernel_rtm: 10.10.40.0/24: successfully did NH 9.8.6.7 2021/08/05 14:41:24 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 200 Type: RTM_ADD 2021/08/05 14:41:24 ZEBRA: [V3NSB-BPKBD] Kernel: UP GATEWAY DONE PROTO1 2021/08/05 14:41:24 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 0 2021/08/05 14:41:24 ZEBRA: [MJD4M-0AAAR] Kernel: pid 60818, rtm_addrs {DST,GATEWAY,NETMASK} 2021/08/05 14:41:24 ZEBRA: [Y9Y5K-JJ7NT] rtm_read: got rtm of type 1 (RTM_ADD) addrs {DST,GATEWAY,NETMASK} 2021/08/05 14:41:24 ZEBRA: [TS3SH-1276M] default(0:0):10.10.40.0/24 update_from_ctx(): no fib nhg 2021/08/05 14:41:24 ZEBRA: [HKQXC-4STSK] default(0:0):10.10.40.0/24 update_from_ctx(): rib nhg matched, changed 'true' 2021/08/05 14:41:24 ZEBRA: [HBZNK-5H1X0] (0:0):10.10.40.0/24: Redist 
update re 0x8021cbe60 (static), old 0x0 (None) 2021/08/05 14:42:06 ZEBRA: [ZJ4AV-JEMJ3] dplane_intf_addr_set 2021/08/05 14:42:06 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 164 Type: RTM_NEWADDR 2021/08/05 14:42:06 ZEBRA: [V3NSB-BPKBD] Kernel: 2021/08/05 14:42:06 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 4664 2021/08/05 14:42:06 ZEBRA: [MJD4M-0AAAR] Kernel: pid 0, rtm_addrs {DST} 2021/08/05 14:42:06 ZEBRA: [M09CX-TKB4N] ifam_read_mesg: ifindex 1, ifname vtnet0, ifam_addrs {NETMASK,IFP,IFA,BRD}, ifam_flags 0x0, addr 10.10.40.3/24 broad 10.10.40.255 dst (unspec) gateway (unspec) 2021/08/05 14:42:06 ZEBRA: [MFYWV-KH3MC] rib_add_multipath_nhe: (0:0):10.10.40.0/24: Inserting route rn 0x802022f30, re 0x80308c4c0 (connected) existing 0x0, same_count 0 2021/08/05 14:42:06 ZEBRA: [MFYWV-KH3MC] rib_add_multipath_nhe: (0:?):10.10.40.0/24 (MRIB): Inserting route rn 0x802023160, re 0x80308c460 (connected) existing 0x0, same_count 0 2021/08/05 14:42:06 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 240 Type: RTM_ADD 2021/08/05 14:42:06 ZEBRA: [V3NSB-BPKBD] Kernel: UP PINNED 2021/08/05 14:42:06 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 0 2021/08/05 14:42:06 ZEBRA: [MJD4M-0AAAR] Kernel: pid 0, rtm_addrs {DST,GATEWAY,NETMASK} 2021/08/05 14:42:06 ZEBRA: [RG9Y6-E93A0] default(0:0):10.10.40.0/24: Updating route rn 0x802022f30, re 0x80308c4c0 (connected) old 0x8021cbe60 (static) 2021/08/05 14:42:06 ZEBRA: [RT9DY-ZS2KN] default(0:0):10.10.40.0/24: Adding route rn 0x802023160, re 0x80308c460 (connected) 2021/08/05 14:42:06 ZEBRA: [THSYN-E2XFY][EC 100663299] rtm_write: write : Address already in use (48) 2021/08/05 14:42:06 ZEBRA: [RV5F2-MQGZG][EC 100663303] kernel_rtm: 10.10.40.0/24: rtm_write() unexpectedly returned -5 for command RTM_DELETE 2021/08/05 14:42:06 ZEBRA: [ZPR30-5G1FB] Kernel: Len: 200 Type: RTM_DELETE 2021/08/05 14:42:06 ZEBRA: [V3NSB-BPKBD] Kernel: UP PROTO1 2021/08/05 14:42:06 ZEBRA: [HDTM1-ENZNM] Kernel: message seq 1 2021/08/05 14:42:06 ZEBRA: [MJD4M-0AAAR] Kernel: pid 60818, rtm_addrs 
{DST,GATEWAY,NETMASK} 2021/08/05 14:42:06 ZEBRA: [XASXT-GF69Y] kernel_rtm: No useful nexthops were found in RIB prefix 10.10.40.0/24 2021/08/05 14:42:06 ZEBRA: [TS3SH-1276M] default(0:0):10.10.40.0/24 update_from_ctx(): no fib nhg 2021/08/05 14:42:06 ZEBRA: [HKQXC-4STSK] default(0:0):10.10.40.0/24 update_from_ctx(): rib nhg matched, changed 'false' 2021/08/05 14:42:06 ZEBRA: [HBZNK-5H1X0] (0:0):10.10.40.0/24: Redist update re 0x80308c4c0 (connected), old 0x8021cbe60 (static) netstat -rn: 10.10.40.0/24 link#1 U vtnet0 10.10.40.3 link#1 UHS lo0 show ip route: C>* 10.10.40.0/24 [0/1] is directly connected, vtnet0, 00:18:48 S 10.10.40.0/24 [1/0] via 9.8.6.7, vtnet0, weight 1, 00:19:30 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-06 10:04:40 -04:00
Donald Sharp	38ef05ea33	zebra: `debug zebra kernel msgdump` is linux specific The command `debug zebra kernel msgdump` is netlink specific. There is no point at all in allowing this to be configured on non-netlink platforms. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-06 10:04:40 -04:00
Donald Sharp	e658173ae6	zebra: Convert srcdest_rnode2str to %pRN in zebra_rib.c There were a bunch of places where we converted the route node to a prefix string via srcdest_rnode2str when we should have been using %pRN in zebra_rib.c. Just convert over the ones we should to use it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-06 10:04:40 -04:00
Donald Sharp	f0afc61d58	zebra: short-circuit rib_process when nothing to do When we are calling rib_process and the route_node in question has no dest, there is no work to do here at all. As such we should just return before attempting to do any other work. This is just a tiny bit of simplification being done. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-06 10:02:53 -04:00
Donald Sharp	6140b3b41b	zebra: prevent crash when nhlfe is NULL There exists a call path where the nhlfe_alloc can return NULL for blackhole nexthops. In this case we were still trying to save the nhlfe pointer causing a crash when we attempted to add it to a self-contained list. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-04 13:38:25 -04:00
Donald Sharp	10cc80cafd	zebra: don't use default case when switching over enum nexthop Do not use the `default` case when switching over an enumerated type. This allows the code to fail to compile when we add a new enumeration. Thus allowing us developers to know all the places in the code we'll need to touch. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-08-04 13:34:03 -04:00
Russ White	11093fc905	Merge pull request #9231 from idryzhov/zebra-rmap-set-src zebra: remove checks for src address existence when using "set src"	2021-08-03 09:22:18 -04:00
Russ White	1358f9d10a	Merge pull request #9259 from opensourcerouting/moar-json *: can't get enough JSON	2021-08-03 09:13:12 -04:00
Donatas Abraitis	71c06f610f	Merge pull request #9258 from mjstapp/fix_rule_strlcpy zebra: use strlcpy in dplane_rule_init	2021-08-03 09:12:38 +03:00
Renato Westphal	488599bfa2	Merge pull request #9232 from idryzhov/interface-node-cleanup *: cleanup interface node installation	2021-08-02 21:13:29 -03:00
Renato Westphal	c15dc24f79	zebra: add "json" option to "show interface" Signed-off-by: Renato Westphal <renato@opensourcerouting.org>	2021-08-02 17:19:45 -03:00
Mark Stapp	bc86b347db	zebra: use strlcpy in dplane_rule_init Use strlcpy for safety in dplane rule init api. Signed-off-by: Mark Stapp <mjs.ietf@gmail.com>	2021-08-02 12:35:50 -04:00
Igor Ryzhov	1f74d96c41	zebra: remove checks for src address existence when using "set src" 1. This check is absolutely useless. Nothing keeps user from deleting the address right after this check. 2. This check prevents zebra from correctly reading the user config with "set src" because of a race with interface startup (see #4249). 3. NO OPERATIONAL DATA USAGE ON VALIDATION STAGE. Fixes #7319. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-08-02 18:35:30 +03:00
Igor Ryzhov	72928fa1aa	Merge pull request #9238 from leonshaw/fix/netns-delete lib, zebra: Preserve user-configured VRF on netns deletion	2021-08-02 18:12:19 +03:00
Xiao Liang	6910315f6f	lib, zebra: Preserve user-configured VRF on netns deletion Don't clear VRF's user-configured flag when netns is deleted. Signed-off-by: Xiao Liang <shaw.leon@gmail.com>	2021-07-30 14:53:45 +08:00
Anuradha Karuppiah	82732723da	zebra: use a separate dummy prefix for referencing v6 nexthops v4 and v6 host/reference prefixes need to be set up separately for [RMAC, VTEP] entries as the VTEP is always normalized to a v4 addr. Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2021-07-29 17:25:11 -07:00
Igor Ryzhov	9da01b0b7b	*: cleanup interface node installation The only difference in daemons' interface node definition is the config write function. No need to define the node in every daemon, just pass the callback as an argument to a library function and define the node there. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-07-29 21:35:25 +03:00
batmancn	5306e6cf00	zebra: bugfix of error quit of zebra, due to no nexthop ACTIVE There exist some rare situations where fpm will attempt to send a route update with no valid nexthops. In that case an assert would be hit. This is not good for trying to keep your routing daemons up and running when we can safely just recover the situation. Fixes #7588 Signed-off-by: batmancn <batmanustc@gmail.com> <fixed commit message, and used zlog_err> Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-28 16:13:59 -04:00
Jafar Al-Gharaibeh	213d980ff9	Merge pull request #9007 from donaldsharp/pbr_stuff add ability to match on proto to pbr	2021-07-27 15:09:29 -05:00
David Lamparter	631fce38ff	Merge pull request #9107 from donaldsharp/label_destruction zebra: On client shutdown cleanup any vrf labels associated with it	2021-07-27 14:28:13 +02:00
David Lamparter	9c9d8a6129	Merge pull request #9088 from donaldsharp/zebra_redistribute_wrong_tables zebra: Do not allow redistribution for non-vrf tables	2021-07-27 14:14:23 +02:00
Trey Aspelund	fb0b54b361	zebra: Remove MM seq from evpn rmac json output Currently 'show evpn rmac vni .. mac .. json' includes fields for localSequence and remoteSequence, which are misleading since they aren't applicable to MACs in the IP-VRF mac table (RMAC). This removes the localSequence + remoteSequence fields from the output. Signed-off-by: Trey Aspelund <taspelund@nvidia.com>	2021-07-22 20:23:56 +00:00
Donald Sharp	9fbbcbeb1f	Merge pull request #9091 from gord1306/remove_lst_vlan zebra: trigger remove all access vlans info for access port	2021-07-22 07:04:20 -04:00
Donald Sharp	06302ecb88	zebra: On client shutdown cleanup any vrf labels associated with it When a vrf label is created by a client and the client disconnects we should clean up any vrf labels associated with that client. eva# show mpls table Inbound Label Type Nexthop Outbound Label ----------------------------------------------- 1000 SHARP RED - eva# exit sharpd@eva ~/f/zebra (label_destruction)> ps -ef \| grep frr root 4017793 1 0 13:57 ? 00:00:00 /usr/lib/frr/watchfrr -d -F datacenter --log file:/var/log/frr/watchfrr.log --log-level debug zebra bgpd ospfd isisd pimd eigrpd sharpd staticd frr 4017824 1 0 13:57 ? 00:00:00 /usr/lib/frr/zebra -d -F datacenter --log file:/tmp/zebra.log -r --graceful_restart 60 -A 127.0.0.1 -s 90000000 frr 4017829 1 0 13:57 ? 00:00:00 /usr/lib/frr/bgpd -d -F datacenter -M rpki -A 127.0.0.1 frr 4017836 1 0 13:57 ? 00:00:00 /usr/lib/frr/ospfd -d -F datacenter -A 127.0.0.1 frr 4017839 1 0 13:57 ? 00:00:00 /usr/lib/frr/isisd -d -F datacenter -A 127.0.0.1 frr 4017842 1 0 13:57 ? 00:00:00 /usr/lib/frr/pimd -d -F datacenter -A 127.0.0.1 frr 4017865 1 0 13:57 ? 00:00:00 /usr/lib/frr/eigrpd -d -F datacenter -A 127.0.0.1 frr 4017869 1 0 13:57 ? 00:00:00 /usr/lib/frr/sharpd -d -F datacenter -A 127.0.0.1 frr 4017888 1 0 13:57 ? 00:00:00 /usr/lib/frr/staticd -d -F datacenter -A 127.0.0.1 sharpd 4018624 3938423 0 14:02 pts/10 00:00:00 grep --color=auto frr sharpd@eva ~/f/zebra (label_destruction)> sudo kill -9 4017869 sharpd@eva ~/f/zebra (label_destruction)> sudo vtysh -c "show mpls table" sharpd@eva ~/f/zebra (label_destruction)> Fixes: #1787 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-21 14:04:36 -04:00
David Lamparter	63116a7008	build: fix `AM_LDFLAGS` usage (and gcov) like the other automake variables, setting `xyz_LDFLAGS` causes `AM_LDFLAGS` to be ignored for `xyz`. For some reason I had in my mind that automake doesn't do this for LDFLAGS, but... it does. (Which is consistent with `_CFLAGS` and co.) So, all the libraries and modules have been ignoring `AM_LDFLAGS` (which includes `SAN_FLAGS` too). Set up new `LIB_LDFLAGS` and `MODULE_LDFLAGS` to handle all of this correctly (and move these bits to a central location.) Fixes: #9034 Fixes: `0c4285d77e` ("build: properly split CFLAGS from AC_CFLAGS") Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-07-21 17:10:08 +02:00
Donald Sharp	ecff5258a0	zebra: Mark some bsd interface prefixes as SECONDARY Notice when an ip address on a bsd interface is considered an alias, and mark the connected prefix we generate as SECONDARY. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-20 10:12:04 -04:00
gord_chen	ec8977510e	zebra: trigger remove all access vlans for access port When a port is removed from its last access vlan, the linux kernel won't send any vlan info in the netlink message; this might cause evpn mh to not withdraw EAD-EVI routes. Signed-off-by: Gord Chen <gord_chen@edge-core.com>	2021-07-20 09:39:45 +00:00
Donald Sharp	79a9ad1450	zebra: Do not allow redistribution for non-vrf tables Current code was allowing redistribution of kernel routes from the non-default non vrf tables once FRR was already up and running. In the case where we add `redistribute kernel` in an upper level protocol we never consider the non-default vrf or non-vrf tables so it is never accepted. In the case where a kernel route is added after `redistribute kernel` is already in place we were never looking at the fact that the route was in a non-default non-vrf table. This code fixes that issue. Fixes: #9073 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-19 20:04:03 -04:00
Mark Stapp	80ff3f05ea	zebra: replace ipaddr2str in dplane module Replace a couple of ipaddr2str calls with pIA in the dplane module. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-07-19 10:36:12 -04:00
Mark Stapp	7e5b0b2b36	zebra: process EVPN remote VTEP updates from the workqueue Move remote VTEP updates from immediate, inline processing in their ZAPI message handlers to the main workqueue. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-07-19 10:36:12 -04:00
Mark Stapp	7f7e49d11a	zebra: use workqueue for vxlan remote macip updates Enqueue incoming vxlan remote macip updates on the main workqueue, instead of performing the updates immediately, in-line. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-07-19 10:36:12 -04:00
Mark Stapp	1a3bd37f7c	zebra: use more const Use const in many more evpn apis, especially for macaddr, ipaddr arguments. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-07-19 10:36:12 -04:00
Mark Stapp	32367e7a3b	zebra: add workqueue support for EVPN updates Add workqueue subqueue for EVPN/VxLAN updates; migrate the evpn route and remote ES processing from their ZAPI handlers to the workqueue. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-07-19 10:36:12 -04:00
Mark Stapp	272e11bfc4	zebra: give some evpn apis better names Use more useful names for a few evpn apis. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-07-19 08:43:48 -04:00
Mark Stapp	12e1fe1251	Merge pull request #9063 from sworleys/Fix-IFP-NHG zebra: fix ifp pointer for groups/recursives	2021-07-16 09:33:52 -04:00
Stephen Worley	bf157b9263	zebra: fix ifp pointer for groups/recursives At some point we broke the ifp pointer for nhe->ifp such that it was pointing to an interface even in groups/recursive instances. Add checks here to make it again so that we only set the ifp pointer if it is a fully resolved singleton NHE. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-07-15 11:24:24 -04:00
Donald Sharp	b59839af7d	zebra: When passing lookup information back pass the fully resolved In the reachability code we auto pass back the fully resolved nexthops. Modify the ZEBRA_IPV4_NEXTHOP_LOOKUP_MRIB code to do the exact same thing so that the zclient_lookup_nexthop code does not need to recursively look for the data that zebra already has. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-15 08:50:09 -04:00
Donald Sharp	f56697eff3	bgpd, pbrd, zebra: Encode/decode the ip proto from daemons to zebra Ensure that we properly encode/decode the ip protocol from daemons to zebra. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-08 11:12:47 -04:00
Donald Sharp	b94683f0db	lib, zebra: add ip_proto to the filter data structure Add ip_proto to the filter data structure and also account for it in the hash when stored. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-08 11:12:47 -04:00
Donald Sharp	8ccbc778cf	zebra: Add ability for dataplane code to understand rule ip protocols The zebra dplane needs to be taught about the rule ip_proto that can be installed. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-08 11:12:47 -04:00
Donald Sharp	8096bd72aa	zebra: Add ability to encode/decode netlink FRA_IP_PROTO for rule changes Encode/Decode the FRA_IP_PROTO but do nothing with it at the moment. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-08 11:12:47 -04:00
Donald Sharp	94d70a6533	zebra: Add nl_attr_put8 so we can put uint8_t in netlink messages Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-07-08 11:12:46 -04:00
Donatas Abraitis	24447a70d0	zebra: Show prefixLen in `show ip route json` output additionally Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-03 14:21:06 +03:00
Donatas Abraitis	45c8ba8fb3	zebra: Do not escape forward slashes for `show ip route json` Basically, this is handled by JSON-C library. I've compiled with the latest release of json-c and it works well. Didn't test with various distribution versions, but this change is kinda dependent on the json-c lib version the distro has. Before: ``` "192.168.100.1\/32":[ { "prefix":"192.168.100.1\/32", ``` After: ``` "192.168.100.1/32":[ { "prefix":"192.168.100.1/32", ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-03 14:19:48 +03:00
Donatas Abraitis	8643c2e5f7	*: Replace 4/16 integers to IPV4_MAX_BYTELEN/IPV6_MAX_BYTELEN Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-01 23:54:39 +03:00
Donatas Abraitis	12256b84a5	*: Convert numeric 32 into IPV4_MAX_BITLEN for prefixlen Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-01 23:50:39 +03:00
Donatas Abraitis	13ccce6e7e	*: Convert numeric 128 into IPV6_MAX_BITLEN for prefixlen Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-01 17:53:21 +03:00
Donatas Abraitis	936fbaef47	*: Replace IPV4_MAX_PREFIXLEN to IPV4_MAX_BITLEN Just drop IPV4_MAX_PREFIXLEN at all, no need keeping both. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-01 17:44:09 +03:00
Donatas Abraitis	f4d81e5507	*: Replace IPV6_MAX_PREFIXLEN to IPV6_MAX_BITLEN Just drop IPV6_MAX_PREFIXLEN at all, no need keeping both. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-01 17:41:09 +03:00
Renato Westphal	8b0ab1f8a0	Merge pull request #8780 from idryzhov/fix-zebra-coverity zebra: fix a couple of coverity warnings	2021-06-30 16:08:35 -03:00
Philippe Guibert	eed936b334	Merge pull request #8744 from sworleys/RTADV-Fix-Upstream zebra: rework RA handling for vrf-lite	2021-06-29 19:20:54 +02:00
Igor Ryzhov	b08dcc3f3f	*: unify prefix copying There are a few places in the code where we use PREFIX_COPY(_IPV4/IPV6) macro to copy a prefix. Let's always use prefix_copy function for this. This should fix CID 1482142 and 1504610. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-06-29 16:11:47 +03:00
Stephen Worley	a7c91c4246	Merge pull request #8731 from mjstapp/fix_pw_backups zebra: Fix pseudowires with backup nexthops	2021-06-24 12:46:31 -04:00
Patrick Ruddy	fa855f8fa3	Merge pull request #6695 from adharkar/frr-master-gateway_ip EVPN route type-5 gateway IP overlay Index	2021-06-23 09:23:54 +01:00
Donald Sharp	3caaa17764	zebra: We already store the last command as part of zserv_write when sending nexthop information. We do not need to reset the last_write_cmd since that is taken care of in the send routine. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-06-18 08:37:52 -04:00
Mark Stapp	072b487b8f	zebra: update pw dataplane info Include the complete set of primary and backup nexthops from the resolving route for a pseudowire. Add accessors for that info. Modify the logic that creates the fib set of pw nexthops so that only installed, labelled nexthops are included. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-06-11 09:30:09 -04:00
Mark Stapp	0d145d47c8	zebra: revise pw reachability logic Modify the pseudowire reachability logic so that it returns success if there is at least one installed labelled nexthop for the route resolving the pw destination. We also check for valid backup nexthops if necessary, in case there's been a switchover event. Only OpenBSD requires that _all_ nexthops be labelled, so we have a more strict version of the logic also. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-06-11 09:30:09 -04:00
Mark Stapp	6fb3580882	zebra: add boolean to control pw reachability checking Add a boolean to control whether pseudowire reachability checking needs to be strict. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-06-11 09:29:13 -04:00
Mark Stapp	bc77c3bb8a	zebra: use const in rib_match Use const in common rib_match api. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-06-11 09:29:13 -04:00
Donald Sharp	9691937d8b	zebra: Move individual lines to table in `show zebra client` command Move some individual add/delete lines to the table format in the `show zebra client` command Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-06-10 20:41:35 -04:00
Donald Sharp	a9d8faf7ab	zebra: Add message counts for `show zebra client` There were counters FRR was keeping but never displaying. Add them in. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-06-10 20:24:44 -04:00
Donald Sharp	6dbaa012be	Merge pull request #8807 from mjstapp/fix_srv6_delete lib,zebra: srv6 cleanup	2021-06-09 09:07:53 -04:00
Donald Sharp	010b575b7d	zebra: Give extra space and stop processing if we run out of space When processing bulk messages we need more space to handle more mroutes. In this case we are doubling the stream size from 16k -> 32k, which should roughly double the number of mroutes we can handle in one go. Additionally, if we cannot parse the passed message into the stream to pass up to pimd then gracefully stop processing. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-06-09 06:43:28 -04:00
Stephen Worley	0bcf7589a6	zebra: print adv_if count with %zu Use the %zu formatter for adv_if count printing for portability. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-06-08 16:27:12 -04:00
Stephen Worley	2a356cee0d	zebra: add show command for RA interface lists Add a show command so we can easily get info on what interfaces are turned on per ver and in which list. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-06-08 15:06:04 -04:00
Stephen Worley	7c2ddfb976	zebra: rework RA handling for vrf-lite Rework RA handling for vrf-lite scenarios. Before we were using a single FD descriptor for polling across multiple zvrf's. This would cause us to hit this assert() in some bgp unnumbered and vrrp configs: ``` /* * What happens if we have a thread already * created for this event? */ if (thread_array[fd]) assert(!"Thread already scheduled for file descriptor"); ``` We were scheduling a thread_read on the same FD for every zvrf. With vrf-lite, RAs and ARPs are not vrf-bound, so we can just use one rtadv instance to manage them for all VRFs. We will choose the default VRF for this. This patch removes the rtadv_sock altogether for zrouter and moves the functionality this represented to the default VRF. All RAs will be handled in the default VRF under vrf-lite configs with only one poll thread started for it. This patch also extends how we track subscribed interfaces (s or msec) to use an actual sorted list by interface names rather than just a counter. With multiple daemons turning interfaces/on/off these counters can get very wrong during ifup/down events. Making them a sorted list prevents this from happening by preventing duplicates. With netns-vrf's nothing should change other than the interface list. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-06-08 15:05:43 -04:00
Renato Westphal	98cb53f96a	zebra, ospfd: fix typos in the graceful restart code Signed-off-by: Renato Westphal <renato@opensourcerouting.org>	2021-06-08 11:41:33 -03:00
Ameya Dharkar	1b09e77e4d	Zebra: FPM support for gateway IP overlay Index FPM sends VNI to the data plane with the EVPN prefix. For pure type-5 EVPN route, nexthop interface of EVPN prefix is L3VNI SVI. Thus, we encode L3VNI corresponding to the nexthop vrf with rtmsg for this prefix. For EVPN type-5 route with gateway IP overlay index, we support asymmetric IRB. Thus, nexthop interface is L2VNI SVI. So, instead of fetching vrf VNI, fetch VNI corresponding to the nexthop SVI and encode it in the rtmsg for EVPN prefix. Signed-off-by: Ameya Dharkar <adharkar@vmware.com>	2021-06-07 17:59:45 -07:00
Ameya Dharkar	9daa5d471a	bgpd, zebra: Add svi_interface to zebra VNI and bgp EVPN structures SVI ifindex for L2VNI is required in BGP to perform EVPN type-5 to type-2 recursive resolution using gateway IP overlay index. Program this svi_ifindex in struct zebra_vni_t as well as in struct bgpevpn Changes include: 1. Add svi_if field to struct zebra_evpn_t 2. Add svi_ifindex field to struct bgpevpn 3. When SVI (bridge or VLAN) is bound to a VxLAN interface, store it in the zebra_evpn_t structure. 4. Add this SVI ifindex to ZEBRA_VNI_ADD 5. Store svi_ifindex in struct bgpevpn Signed-off-by: Ameya Dharkar <adharkar@vmware.com>	2021-06-07 17:58:23 -07:00
Mark Stapp	f502d7af0f	zebra: srv6 cleanup Use NO_PROTO consistently in tests; make sure zapi client instance and session are used for srv6 'chunks'. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-06-07 14:26:25 -04:00
Mark Stapp	16bd37d687	zebra: small srv6 text cleanup Couple of small typos in srv6 zapi code. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-06-07 14:25:46 -04:00
Rafael Zalamena	455d14ae31	Merge pull request #8778 from idryzhov/fix-zebra-vrf zebra: fix config after exit from vrf	2021-06-07 08:59:10 -03:00
Igor Ryzhov	58929633fb	zebra: fix config after exit from vrf When the VRF node is exited using "exit" or "quit", there's still a VRF pointer stored in the vty context. If you try to configure some router related command, it will be applied to the previous VRF instead of the default VRF. For example: ``` (config)# vrf test (config-vrf)# ip router-id 1.1.1.1 (config-vrf)# do show run ... ! vrf test ip router-id 1.1.1.1 exit-vrf ! ... (config-vrf)# exit (config)# ip router-id 2.2.2.2 (config)# do show run ... ! vrf test ip router-id 2.2.2.2 exit-vrf ! ... ``` `exit-vrf` works correctly, because it stores a pointer to the default VRF into the vty context (but weirdly keeping the VRF_NODE instead of changing it to CONFIG_NODE). Instead of relying on the behavior of exit function, always use the default VRF when in CONFIG_NODE. Another problem is missing `VTY_CHECK_CONTEXT`. If someone deletes the VRF in which node the user enters the command, then zebra applies the command to the default VRF instead of throwing an error. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-06-04 19:02:32 +03:00
Hiroki Shirokura	2ba6be5b24	bgpd,sharpd,zebra: fix code style Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	c60c1ade86	*: delete ZEBRA_FLAG_SEG6_ROUTE and add ZAPI_NEXTHOP_FLAG_SEG6* https://github.com/FRRouting/frr/pull/5865#discussion_r597670225 As this comment says, ZEBRA_FLAG_XXX should not have been used to communicate SRv6 route information. A simple nexthop flag would have been sufficient for SRv6 information, and I fixed the whole thing that way. Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	0a543b7929	zebra: early return on seg6local nlmsg crafting Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	eab0f8f0a2	lib,sharpd,zebra: update nexthop object with nh_srv6 Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	f463eac768	zebra: fill_seg6ipt_encap func with boundary check Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	52026569ca	zebra: error check for nl_attr_xxx Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	8a18a7c0c9	zebra: use const on fill_seg6ipt_encap func Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	b9596f139b	zebra: fix implicit conversion Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	7b778857f8	zebra: drop un-needed info in locator-zapi Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	a9510347aa	zebra: delete unneeded zebra_srv6_manager_connect Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	1bda3e627d	*: use one line init instead of memset and format it Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	9f900cda30	zebra: fix typo Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	1d5f59a235	zebra: fix Dereference of null pointer Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	f29aed7480	*: fix code format according to checkpatch Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	ac6a9479af	zebra: add zapi_srv6_locator_chunk_{en,de}code Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	361a62ac9d	zebra: fix compile error of missing-braces Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	daedb8b3cf	zebra: rewrite locator_prefix_cmd with DEFPY Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	4df9d8592b	*: fix code format according to checkpatch Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	f16de90b8c	zebra: parse non-zebra seg6 configuration via netlink (step3) An FRRouting operator can install a seg6 route via ZAPI, but a linux kernel operator can also install a seg6 route via Netlink directly (i.e. iproute2). This commit makes zebra parse non-frr seg6 route configuration via netlink and audit Zebra's RIB. Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	76fb7ae4de	zebra: ZEBRA_ROUTE_ADD supports seg6 route (step3) With this patch, zclient can install seg6 routes when they set the property "nh_seg6_segs" on struct nexthop and set ZEBRA_FLAG_SEG6_ROUTE on zapi_route's flag. Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:48 -04:00
Hiroki Shirokura	0097897734	zebra: add new CLI to manipulate srv6-locator (step2) This commit is a part of the #5853 work that adds new clis to configure SRv6 locator and its show commands. The following clis are added in this commit. vtysh -c 'conf te' \ -c 'segment-routing' \ -c ' srv6' \ -c ' locators' \ -c ' locator LOC1' \ -c ' prefix A::/64' - "show segment-routing srv6 sid [json]" - "show segment-routing srv6 locator [json]" - "show segment-routing srv6 locator NAME detail [json]" - "show running-config" (make it print the srv6 configuration) Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:47 -04:00
Hiroki Shirokura	6e68a08484	zebra: ZAPI add new api to manipulate srv6-locator (step2) This commit is a part of the #5853 work that adds new ZAPI to configure SRv6 locator, which manages the chunk prefix for SRv6 SID IPv6 addresses for each routing protocol daemon. NEW-ZAPIs: * ZEBRA_SRV6_LOCATOR_ADD * ZEBRA_SRV6_LOCATOR_DELETE * ZEBRA_SRV6_MANAGER_CONNECT * ZEBRA_SRV6_MANAGER_GET_LOCATOR_CHUNK * ZEBRA_SRV6_MANAGER_RELEASE_LOCATOR_CHUNK Zclient can connect to zebra's srv6-manager with the ZEBRA_SRV6_MANAGER_CONNECT api like a label-manager. Then zclient uses ZEBRA_SRV6_MANAGER_GET_LOCATOR_CHUNK to allocate a dedicated locator chunk for its routing protocol. Zebra only handles prefix reservation and distributes ownership of the locator chunks to zclients. Then, zclient installs SRv6 function with ZEBRA_ROUTE_ADD api with nh_seg6local_* fields. This feature is already implemented by another PR(#7680). Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:47 -04:00
Hiroki Shirokura	6c0a7c0941	*: new cli-nodes for SRv6 manager (step2) This commit is a part of #5853 that adds new cmd-nodes for SRv6 configuration. This commit just adds the cmd-nodes and node-moving cli only; the actual SRv6 config command isn't added (that is added in a later commit of this branch). new cli nodes: SRv6 * SRv6-locators * SRv6-locator Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:47 -04:00
Hiroki Shirokura	d49e6c4afd	zebra: parse non-zebra seg6local configuration via netlink (step1) An FRRouting operator can install a seg6local route via ZAPI, but a linux kernel operator can also install a seg6local route via Netlink directly (i.e. iproute2). This commit makes zebra parse non-frr seg6local route configuration via netlink and audit Zebra's RIB. Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:47 -04:00
Hiroki Shirokura	8689b25a08	zebra: ZEBRA_ROUTE_ADD supports seg6local route (step1) With this patch, zclient can install seg6local routes when they set properties nh_seg6local_{action,ctx} on struct nexthop and set ZEBRA_FLAG_SEG6LOCAL_ROUTE on zapi_route's flag. Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2021-06-02 10:24:47 -04:00
Rafael Zalamena	6c1a2a6538	Merge pull request #6317 from rgirada/fix_route_dump zebrad: Added a command to dump routes in support bundle	2021-05-28 18:12:17 -03:00
Stephen Worley	7d4651cc9c	Merge pull request #8174 from mjstapp/backup_nht zebra: hide backup-nexthop activations in nht	2021-05-27 09:49:41 -04:00
Mark Stapp	2d3eb91699	Merge pull request #8498 from ton31337/feature/opaque_data_void_zebra Zebra OPAQUE data from other daemons stuff	2021-05-24 07:48:02 -04:00
Igor Ryzhov	389faf93b7	zebra: fix possible uninitialized value Found by Coverity. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-05-19 14:59:00 +03:00
Patrick Ruddy	4006e41baf	Merge pull request #8646 from chiragshah6/mdev zebra: evpn check vni oper state in svi up/down event	2021-05-18 11:45:56 +01:00
Donatas Abraitis	82689214b5	Merge pull request #8535 from opensourcerouting/zlog-rnode zebra: replace _rnode_zlog with %pZN ext	2021-05-18 09:50:42 +03:00
Donatas Abraitis	94effaf032	zebra: Send more OPAQUE data from BGP This includes community and large-community data. ``` exit1-debian-9# show ip route 172.16.16.1/32 Routing entry for 172.16.16.1/32 Known via "bgp", distance 20, metric 0, best Last update 00:00:23 ago * 192.168.0.2, via eth1, weight 1 AS-Path : 65030 Communities : 65001:1 65001:2 65001:3 65001:4 65001:5 65001:6 Large-Communities: 65001:123:1 65001:123:2 ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-05-14 22:12:33 +03:00
Donatas Abraitis	638fc64c64	zebra: Format changes for evpn_mh_neigh_holdtime_cmd Just to avoid fixing all the time manually this stuff after not relevant changes. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-05-14 22:12:33 +03:00
Donald Sharp	e524fc1e2c	Merge pull request #8659 from mjstapp/fix_connected_multi lib,zebra: Use a flag to track down status for connected addrs	2021-05-13 07:23:42 -04:00
Donald Sharp	7d7be47ef0	zebra: Use __func__ instead of __PRETTY_FUNCTION__ Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-12 12:02:05 -04:00
Mark Stapp	e3d901f863	lib,zebra: Use a flag to track down status for connected addrs Track 'down' state of connected addresses with a new flag. We may have multiple addresses on an interface that share a prefix; in those cases, we need to determine when the first address is valid, to install a connected route, and similarly detect when the last address goes 'down', to remove the connected route. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-05-12 09:37:00 -04:00
Donald Sharp	c9d842c710	zebra: Consolidate on 1 function netlink_parse_rtattr_nested if_netlink.c created its own nested parsing #define which is identical to netlink_parse_rtattr_nested. Consolidate on one instead of having this duality. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-11 20:05:51 -04:00
Donald Sharp	269b69d703	zebra: memset the `struct rtattr tb[SIZE]` in setting function In order to parse the netlink message into the `struct rtattr tb[size]` it is assumed that the buffer is memset to 0 before the parsing. As such if you attempt to read a value that was not returned in the message you will not crash when you test for it. The code has places where we memset it and places where we don't. This will lead to crashes when the kernel changes. In our parsing routines let's have them memset instead of having to remember to do it before passing in to the parser. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-11 20:05:51 -04:00
Russ White	6099bb989d	Merge pull request #8650 from idryzhov/bgp-fix-redist bgpd: fix redistribution in vrf	2021-05-11 07:28:42 -04:00
Igor Ryzhov	d9083050c8	Revert "bgpd: vrf route leaking, fix vrf redistribute" This reverts commit `6b2433c63f`.	2021-05-09 22:28:36 +03:00
David Lamparter	e207132594	zebra: fix style warnings in previous commits Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-05-09 19:37:12 +02:00
Chirag Shah	196d7a86d0	zebra: check vni oper state in svi up notif When clagd is stopped on secondary device, all vxlan interfaces (vnis) are kept in protodown state. FRR treats protodown vxlan interfaces (vnis) as interface down and sends vni delete to bgpd. In the event of clagd down, SVIs are flapping as underlying bridge is going through churn. When FRR receives SVI up notification do not trigger event to bgpd if vnis are operationally down. Ticket:#2600210 CM-22929 Reviewed By:CCR-11544 Testing Done: Performed CLAG stop/start on secondary device, all vxlan devices remained in protodown along with this validated the vnis are cleaned up and added back in bgpd. Signed-off-by: Chirag Shah <chirag@nvidia.com>	2021-05-07 15:02:05 -07:00
rgirada	d29fd1b72e	zebrad: Added a command to dump routes in support bundle Description: Added a new show command ("show ip zebra route dump") to dump all routes with detailed information including nexthops, flags, status, etc. This helps for debugging and is added to support_bundle_command.conf. Defined this command as a hidden command. Signed-off-by: Rajesh Girada <rgirada@vmware.com>	2021-05-06 02:40:12 -07:00
Donald Sharp	4a73887e0f	zebra: Reduce per vrf memory usage from hash table creation When creating a large number of vrf's we are creating a fairly large number of hash tables per vrf. Reduce memory usage on startup as well as let us identify the table these things come from. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-05 10:08:06 -04:00
Donald Sharp	da55bcbcb3	zebra: Reduce size of vni hash tables to a more reasonable start size We are creating 2 hash tables per vni in zebra. Once we start to scale the number of vni's we start to see some serious memory usage in zebra. Let's reduce the memory usage at startup for scale of vni's. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-05 10:08:06 -04:00
Donald Sharp	38078b1d5a	zebra: Add some ability to know what hash is for what vni Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-05 10:08:06 -04:00
Donald Sharp	ec64a634c2	zebra: Allow the zvrf to know it's vrf when allocing Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-05 10:08:06 -04:00
Mark Stapp	3d4b999fab	Merge pull request #8237 from pguibert6WIND/nhrp_use_zebra_2 Nhrp use zebra 2	2021-05-05 07:57:04 -04:00
Russ White	4ae7bb11fc	Merge pull request #8620 from donaldsharp/redistribution_and_infinite zebra: Allow redistribution for routes selected	2021-05-04 11:14:35 -04:00
Russ White	8ad44ef497	Merge pull request #8514 from donaldsharp/connected_is_limited zebra: Allow one connected route per network mask on a interface	2021-05-04 07:45:33 -04:00
Donald Sharp	c3d0d6e8a1	zebra: Allow redistribution for routes selected Current code has an inconsistent behavior with redistribute routes. Suppose you have a kernel route that is being read w/ a distance of 255: eva# show ip route kernel Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure K>* 0.0.0.0/0 [0/100] via 192.168.161.1, enp39s0, 00:06:39 K>* 4.4.4.4/32 [255/8192] via 192.168.161.1, enp39s0, 00:01:26 eva# If you have redistribution already turned on for kernel routes you will be notified of the 4.4.4.4/32 route. If you turn on kernel route redistribution watching after the 4.4.4.4/32 route has been read by zebra you will never learn of it. There is no need to look for infinite distance in the redistribution code. Either we are selected or not. In other words non kernel routes with an 255 distance are never installed so the checks were pointless. So let's just remove the distance checking and tell interested parties about the 255 kernel route if it exists. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-03 19:53:12 -04:00
Mark Stapp	f71e1ff6a9	Merge pull request #8545 from opensourcerouting/assert-our-own *: make our own assert() actually work	2021-05-03 11:17:36 -04:00
Donald Sharp	9298056138	zebra: Allow one connected route per network mask on an interface Currently FRR reads the kernel for interface state and FRR creates a connected route per address on an interface. If you are in a situation where you have multiple addresses on an interface just create 1 connected route for them: sharpd@eva:/tmp/topotests$ vtysh -c "show int dummy302" Interface dummy302 is up, line protocol is up Link ups: 0 last: (never) Link downs: 0 last: (never) vrf: default index 3279 metric 0 mtu 1500 speed 0 flags: <UP,BROADCAST,RUNNING,NOARP> Type: Ethernet HWaddr: aa:4a:ed:95:9f:18 inet 10.4.1.1/24 inet 10.4.1.2/24 secondary inet 10.4.1.3/24 secondary inet 10.4.1.4/24 secondary inet 10.4.1.5/24 secondary inet6 fe80::a84a:edff:fe95:9f18/64 Interface Type Other Interface Slave Type None protodown: off sharpd@eva:/tmp/topotests$ vtysh -c "show ip route connected" Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure C>* 10.4.1.0/24 is directly connected, dummy302, 00:10:03 C>* 192.168.161.0/24 is directly connected, enp39s0, 00:10:03 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-03 09:17:22 -04:00
David Lamparter	9d75e30960	zebra: replace _rnode_zlog with %pZN ext Since _rnode_zlog was wrapping zlog(), these messages weren't getting a unique ID assigned through the xref mechanism. Replace macro with a small extension that prints (almost) the same thing. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-05-02 16:20:30 +02:00
Donald Sharp	c490437e6f	zebra: Allow interface up events to read speed Initially the reading of the speed of an interface happened upon interface creation and happened until the speed of a link settled down to a single value. The speed of an interface can also change because a new optic can be inserted that changes the speed, in which case FRR would see an interface down (optic removal) and then an interface up (optic insertion). In this case FRR would not treat this as an event that changed the speed. Let's expand the checking a bit more. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-02 07:30:02 -04:00
Philippe Guibert	e3d3fa06f7	zebra: collect gre information and push it when needed - gre keys are collected and stored locally. - when gre source set is requested, and the link interface configured is different, the gre information collected is pushed in the query, namely source ip or gre keys if present. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 10:33:18 +02:00
Philippe Guibert	db51f0cd10	nhrp: Preserve mtu during interface up/down and tunnel source change preserve mtu upon interface flapping and tunnel source change. Signed-off-by: Reuben Dowle <reuben.dowle@4rf.com> Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 10:33:18 +02:00
Philippe Guibert	62b4b7e44a	zebra: new dplane action to set gre link interface This action is initiated by nhrp and has been stubbed when moving to zebra. Now, a netlink request is forged to set the link interface of a gre interface if that gre interface does not have already a link interface. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 10:33:18 +02:00
Philippe Guibert	d17af8dd04	lib, zebra: get gre information the get gre information code is obtained by nhrp, via zebra. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 10:33:18 +02:00
Philippe Guibert	b716ab61e2	zebra: add stub implementation for zebra gre source set this functionality is stubbed. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 10:33:18 +02:00
Philippe Guibert	077c07cc58	zebra: storage of gre information in zebra layer zebra is able to get information about gre tunnels. zebra_gre file is created to handle hooks, but is not yet used. also, debug zebra gre command is done to add gre traces. A zebra_gre file is used for complementary actions that may be needed. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 10:33:15 +02:00
Philippe Guibert	357b150dae	zebra: at startup, fix links on all namespaces when zebra has vrf backend mapped to namespaces, the polling of interfaces leads to fix all linkages of interfaces. This was not done on non default namespace. do it for other namespaces. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-30 08:05:01 +02:00
Philippe Guibert	ecffe9167b	zebra: add the link interface information on interface updates There are cases where either link information is not present at interface creation or link information changed. handle this situation. Signed-off-by: Philippe.Guibert <philippe.guibert@6wind.com> zebra dd link	2021-04-30 08:05:01 +02:00
Rafael Zalamena	5418880923	Merge pull request #7165 from qlyoung/fix-zapi-codec-badness Fix zapi codec badness	2021-04-29 13:50:16 -03:00
Donald Sharp	4d0773c4ea	zebra: msgdump debug strangeness cleanup a) `debug zebra kernel` turns off `debug zebra kernel msgdump....` this is odd and bad b) `debug zebra kernel msgdump send` turns off receive and vice versa this is counter intuitive as well c) `no zebra kernel msgdump ...` turns off all kernel level debugging we should only turn off msgdump specific debugs d) `no debug zebra kernel` turns off all kernel level debugging we should leave msgdump on. e) Fix `show run` and show debug output Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-04-29 08:22:53 -04:00
Quentin Young	693fc882d7	zebra: use safe stream decodes for evpn zapi msg Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2021-04-28 11:43:50 -04:00
Quentin Young	f3aa221ffd	pimd, zebra: explicit cast int netlink val to uint encoding signed int as unsigned is bad practice; since we want to do it here let's at least be explicit about it Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2021-04-28 11:43:50 -04:00
Quentin Young	bbad027684	lib, bgpd, zebra: RA interval is unsigned Use unsigned value for all RA requests to Zebra - encoding signed int as unsigned is bad practice - RA interval is never, and should never be, negative Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2021-04-28 11:43:50 -04:00
Quentin Young	0ffd0fb536	bgpd, zebra: encode ip addr len as uint16 This is always a 16 bit unsigned value. - signed int is the wrong type to use - encoding a signed int as a uint32 is bad practice - decoding a signed int encoded as a uint32 into a uint16 is bad practice Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2021-04-28 11:43:45 -04:00
Russ White	d8c3daca19	Merge pull request #8531 from mjstapp/fix_backups_misc zebra: Misc fixups for backup nexthops	2021-04-27 16:04:24 -04:00
Stephen Worley	829c939a88	Merge pull request #8488 from mjstapp/more_workqueue lib, zebra: use zebra workqueue for NHG updates	2021-04-27 11:59:33 -04:00
Renato Westphal	120dab7e17	Merge pull request #8517 from volta-networks/ldp_defer_zebra_updates ldpd: defer register for info until configured	2021-04-26 23:57:57 -03:00
Renato Westphal	54e9f5138c	Merge pull request #8538 from mjstapp/re_dump_nh_labels zebra: include nexthops' label stacks in zebra rib debug	2021-04-26 23:57:03 -03:00
Emanuele Di Pascale	67da957372	zebra: debug log for redistribute_del We're firing an event debug log for zebra_redistribute_add, but not one for zebra_redistribute_delete. Let's make it symmetric. Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>	2021-04-26 10:00:37 +02:00
David Lamparter	6a0eb6885b	*: drop zassert.h It's not actually working properly... Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-04-23 12:06:35 +02:00
David Lamparter	1f8031f79a	*: make sure `config.h` or `zebra.h` is first `config.h` has all the defines from autoconf, which may include things that switch behavior of other included headers (e.g. _GNU_SOURCE enabling prototypes for additional functions.) So, the first include in any `.c` file must be either `config.h` (with the appropriate guard) or `zebra.h` (which includes `config.h` first thing.) Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-04-23 12:06:35 +02:00
Stephen Worley	dc65cd999d	zebra: handle gracefulRS/retain with proto NHGs Properly handle refcounting of Proto-owned NHGs when zebra is operating under graceful restart and retain conditions. We have an extra refcnt of 1 we keep for proto-owned NHGs to indicate the upper level proto has created and owns it. When we are reading these in from the kernel, we need to set them to 1 as appropriate. Without this, we fail in the assert() during zebra_nhg_proto_add() after the owning daemon resends the NHG and the refcnts are off by one. Also add in the same logic we use for routes when sweeping with respect to uptimes. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-04-22 17:25:15 -04:00
Stephen Worley	45691de9a0	zebra: add uptime to NHEs Add uptime for use with NHEs to keep track of how long we have had this NHE in our rib without an update. This is treated exactly the same as the re->uptime for routes. When we get an update for a route, we reset the uptime. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-04-22 17:25:15 -04:00
Stephen Worley	65f137fe3c	zebra: add PROTO_OWNED macro for NHE id bounds checking Add a PROTO_OWNED macro for code readability when checking ID bounds for whether a NHG is proto owned. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-04-22 17:25:15 -04:00
Mark Stapp	cbe5bafbd5	zebra: include nexthops' label stacks in debugs Include nexthops' labels in an important debug early in route processing. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-22 11:51:50 -04:00
Mark Stapp	8283551d3c	zebra: handle TE policy changes in LSP async notifs Handle SR-TE policy changes in the LSP async notification handler, as we do in the normal LSP dplane results handler. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-21 14:30:15 -04:00
Mark Stapp	a082cd9a51	zebra: include inner labels with recursive backups When capturing backup nexthops with recursive resolution, ensure that inner labels from the recursive nexthop are included in each backup (as they are with the resolving primary nexthops). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-21 14:30:15 -04:00
Mark Stapp	c56c16eb2c	zebra: fix some issues in recursive backup nexthop code Fix a couple of small things in the code that captures backup nexthops during recursive resolution. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-21 14:30:15 -04:00
David Lamparter	0c4285d77e	build: properly split CFLAGS from AC_CFLAGS `CFLAGS` is a "user variable", not intended to be controlled by configure itself. Let's put all the "important" stuff in AC_CFLAGS and only leave debug/optimization controls in CFLAGS. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-04-21 15:42:36 +02:00
David Lamparter	09781197b6	build: make builddir include path consistent ... by referencing all autogenerated headers relative to the root directory. (90% of the changes here is `version.h`.) Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2021-04-21 15:42:33 +02:00
Russ White	2bbf1bd88b	Merge pull request #8361 from rameshabhinay/change_1 bgpd: vrf route leaking related fixes	2021-04-20 11:23:49 -04:00
Mark Stapp	04bec7b217	zebra: use workqueue for daemon-owned NHGs Use the main zebra workqueue for daemon-owned NHGs, in addition to processing kernel-owned NHGs. The zapi message processing creates a temporary object that's enqueued to the workqueue, then processed/installed as part of the workqueue processing. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-15 14:20:39 -04:00
David Lamparter	c574670847	build: don't use $(top_srcdir) in vtysh_scan It's not necessary and can confuse scripts. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-04-13 23:57:14 +02:00
Quentin Young	54bb4ab3ec	Merge pull request #8426 from idryzhov/fix-interface-nb-stale-pointers lib: fix interface nb stale pointers	2021-04-13 15:26:51 +00:00
Mark Stapp	f3dbd9d3ef	Merge pull request #8145 from pguibert6WIND/nhrp_use_zebra nhrp: use zebra	2021-04-13 08:02:56 -04:00
Philippe Guibert	88217099de	zebra, lib: replace ZEBRA_ROUTE_NEIGH with simplified version do not add a new route type, and consider 0 as a value meaning that zebra should be the owner. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-13 08:58:54 +02:00
Philippe Guibert	d603c0774e	nhrp, zebra, lib: enforce usage of zapi_neigh_ip structure zapi_nbr structure is renamed to zapi_neigh_ip. Initially used to set a neighbor ip entry for gre interfaces, this structure is used to get events from the zebra layer to nhrp layer. The ndm state has been added, as it is needed on both sides. The zebra dplane layer is slightly modified. Also, to clarify what ZEBRA_NEIGH_ADD/DEL means, a rename is done: it is now called ZEBRA_NEIGH_IP_ADD/DEL, signifying that this zapi interface permits setting link operations by associating ip addresses to link addresses. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-13 08:58:49 +02:00
Igor Ryzhov	af736200e1	lib: fix interface nb stale pointers The first change in this commit is the processing of the VRF termination. When we terminate the VRF, we should not delete the underlying interfaces, because there may be pointers to them in the northbound configuration. We should move them to the default VRF instead. Because of the first change, the VRF interface itself is also not deleted when deleting the VRF. It should be handled in netlink_link_change. This is done by the second change. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-04-12 10:56:04 +03:00
Quentin Young	b832909b42	*: remove .conf.sample files Most of these are many, many years out of date. All of them vary randomly in quality. They show up by default in packages where they aren't really useful now that we use integrated config. Remove them. The useful ones have been moved to the docs. Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2021-04-09 13:14:30 -04:00
Philippe Guibert	e18747a967	zebra: move neighbor table configuration to dplane contexts Instead of directly configuring the neighbor table after read from zapi interface, a zebra dplane context is prepared to host the interface and the family where the neighbor table is updated. Also, some other fields are hosted: app_probes, ucast_probes, and mcast_probes. More information on those fields can be found on ip-ntable configuration. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-09 18:29:58 +02:00
Philippe Guibert	0a27a2fef5	zebra, lib: handle NEIGH_ADD/DELETE to zebra dataplane framework EVPN neighbor operations were already done in the zebra dataplane framework. Now that NHRP is able to use zebra to perform neighbor IP operations (by programming link IP operations), handle this operation under dataplane framework: - assign two new operations NEIGH_IP_INSTALL and NEIGH_IP_DELETE; this is reserved for GRE like interfaces: example: ip neigh add A.B.C.D lladdr E.F.G.H - use 'struct ipaddr' to store and encode the link ip address - reuse dplane_neigh_info, and create a union with mac address - reuse the protocol type and use it for neighbor operations; this permits to store the daemon originating this neighbor operation. a new route type is created: ZEBRA_ROUTE_NEIGH. - the netlink level functions will handle a pointer, and a type; the type indicates the family of the pointer: AF_INET or AF_INET6 if the link type is an ip address, mac address otherwise. - to keep backward compatibility with old queries, as no extension was done, an option NEIGH_NO_EXTENSION has been put in place - also, 2 new state flags are used: NUD_PERMANENT and NUD_FAILED. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-09 18:29:58 +02:00
Philippe Guibert	541025d6ff	zebra: handler for configuring neighbor table neighbor table api in zebra is added. a netlink api is created for that. the handler is called from the api defined in the previous commit. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-09 18:29:58 +02:00
Philippe Guibert	df948efc56	zebra: fixes NDA_DST in netlink_neigh_update() function When netlink_neigh_update() is called, the link registration was failing, due to bad request length. Also, the query was failing if NDA_DST was an ipv6 address. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-09 18:29:58 +02:00
Philippe Guibert	05657ec2b7	nhrp, lib, zebra: add/del neighbor entry possible from nhrp a zebra api is extended to offer ability to add or remove neighbor entry from daemon. Also this extension makes possible to add neigh entry, not only between IPs and macs, but also between IPs and NBMA IPs. This API supports configuring ipv6/ipv4 entries with ipv4/ipv6 lladdr. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-09 18:29:58 +02:00
Philippe Guibert	7723e8d3fd	zebra: link layer config and notification, implementation in zebra zebra implements zebra api for configuring link layer information. that can be an arp entry (for ipv4) or ipv6 neighbor discovery entry. This can also be an ipv4/ipv6 entry associated to an underlay ipv4 address, as it is used in gre point to multipoint interfaces. this api will also be used as monitoring. a hash list is instantiated into zebra (this is the vrf bitmap). each client interested in those entries in a specific vrf, will listen for following messages: entries added, removed, or who-has messages. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-04-09 18:29:58 +02:00
Mark Stapp	b254f784ae	zebra: optionally hide backup-nexthop events in nht Optionally hide route changes that only involve backup nexthop activation/deactivation. The goal is to avoid route churn during backup nexthop switchover events, before the resolving routes re-converge. A UI config enables this 'hiding' behavior. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-08 11:03:49 -04:00
Mark Stapp	aef1d5404f	zebra: add config control to hide backup nh events in nht Add a config that can control hiding of backup-nexthop activation changes in nexthop-tracking. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-07 15:38:09 -04:00
Abhinay Ramesh	6b2433c63f	bgpd: vrf route leaking, fix vrf redistribute Description: After FRR restart, routes are not getting redistributed; when routes added first and then 'redistribute static' cmd is issued. During the frr restart, vrf_id will be unknown, so irrespective of redistribution, we set the redistribute vrf bitmap. Later, when we add a route and then issue 'redistribute' cmd, we check the redistribute vrf bitmap and return CMD_WARNING; zebra_redistribute_add also checks the redistribute vrf bitmap and returns. Instead of checking the redistribute vrf bitmap, always set it anyways. Co-authored-by: Santosh P K <sapk@vmware.com> Co-authored-by: Kantesh Mundaragi <kmundaragi@vmware.com> Signed-off-by: Abhinay Ramesh <rabhinay@vmware.com>	2021-04-07 06:09:42 +00:00
Mark Stapp	2aa2a407e4	zebra: be more selective about processing LSPs When certain events occur (connected route changes e.g.) zebra examines LSPs to see if they might have been affected. For LSPs with backup nhlfes, skip this immediate processing and wait for the owning protocol daemon to react. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-05 15:53:48 -04:00
Mark Stapp	04dda09218	zebra: add 'detail' mpls debug setting Add setting and cli for 'debug zebra mpls detail'. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-04-05 15:53:48 -04:00
Mark Stapp	cc6e7d13d5	Merge pull request #8358 from idryzhov/fix-nb-vrf-crash *: modify VRF_CONFIGURED flag only in VRF NB layer	2021-04-01 16:42:03 -04:00
Sarita Patra	e71627cbcb	zebra: North-bound implementation for zebra rmaps This commit introduces the implementation for the north-bound callbacks for the zebra-specific route-map match and set clauses. Signed-off-by: NaveenThanikachalam <nthanikachal@vmware.com> Signed-off-by: Sarita Patra <saritap@vmware.com>	2021-03-30 22:58:42 +03:00
Igor Ryzhov	b9b794db21	*: modify VRF_CONFIGURED flag only in VRF NB layer This is to fix the crash reproduced by the following steps: * ip link add red type vrf table 1 Creates VRF. * vtysh -c "conf" -c "vrf red" Creates VRF NB node and marks VRF as configured. * ip route 1.1.1.0/24 2.2.2.2 vrf red * no ip route 1.1.1.0/24 2.2.2.2 vrf red (or similar l3vni set/unset in zebra) Marks VRF as NOT configured. * ip link del red VRF is deleted, because it is marked as not configured, but NB node stays. Subsequent attempt to configure something in the VRF leads to a crash because of the stale pointer in NB layer. Fixes #8357. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-03-29 00:52:39 +03:00
Anuradha Karuppiah	7bfa7d0233	lib/zebra: zapi for installing EVPN nexthops from bgp EVPN nexthops are installed as remote neighs by zebra. This was earlier done only via VRF IPvX uni routes imported from EVPN routes. With EVPN-MH these VRF routes now reference a L3NHG which is setup based on the EAD and doesn't include the RMAC. To workaround that BGP now consolidates and maintains EVPN nexthops which are then sent to zebra. zebra sets up these nexthops as L3-VNI nh entries using a dummy type-1 route as reference. Ticket: CM-31398 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-03-25 17:09:53 -07:00
Renato Westphal	b1c875d692	Merge pull request #8250 from idryzhov/fix-nb-running-get-entry Fix aborts when using nb_running_get_entry during validation stage	2021-03-24 19:39:09 -03:00
Rafael Zalamena	b9f1b4d3d3	Merge pull request #8078 from idryzhov/fix-zebra-vni zebra: fix vni configuration in default vrf	2021-03-24 13:32:44 +00:00
David Lamparter	224ccf29d9	zebra: kill zebra_memory.h, use MTYPE_STATIC This one also needed a bit of shuffling around, but MTYPE_RE is the only one left used across file boundaries now. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-22 20:02:17 +01:00
Donatas Abraitis	37916b2b11	Merge pull request #8121 from opensourcerouting/macro-cleanup *: require ISO C11 + semicolons after file-scope macros	2021-03-22 11:00:34 +02:00
David Lamparter	80413c2073	*: require semicolon after FRR_DAEMON_INFO & co. ... again ... Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:39 +01:00
David Lamparter	960b9a5383	*: require semicolon after DEFINE_<typesafe...> Again, see previous commits. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:39 +01:00
David Lamparter	96244aca23	*: require semicolon after DEFINE_QOBJ & co. Again, see previous commits. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:37 +01:00
David Lamparter	8451921b70	*: require semicolon after DEFINE_HOOK & co. See previous commit. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:17 +01:00
David Lamparter	bf8d3d6aca	*: require semicolon after DEFINE_MTYPE & co Back when I put this together in 2015, ISO C11 was still reasonably new and we couldn't require it just yet. Without ISO C11, there is no "good" way (only bad hacks) to require a semicolon after a macro that ends with a function definition. And if you added one anyway, you'd get "spurious semicolon" warnings on some compilers... With C11, `_Static_assert()` at the end of a macro will make it so that the semicolon is properly required, consumed, and not warned about. Consistently requiring semicolons after "file-level" macros matches Linux kernel coding style and helps some editors against mis-syntax'ing these macros. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:17 +01:00
David Lamparter	247c7e27a9	snmp: change -std=gnu99 to -std=gnu11 The point of the `-std=gnu99` was to override a `-std=c99` that may be coming in from net-snmp. However, we want C11, not C99. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:17 +01:00
Mark Stapp	5530d55d3c	zebra: capture backup nexthop info with recursive resolution When resolving a recursive route, capture backup nexthop info along with the resolving nexthops. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-03-16 12:14:53 -04:00
Mark Stapp	aa45883818	zebra: add ui control for use of backup nexthops in resolution Add a control and api for the use of backup nexthops in recursive resolution. With 'no', we won't try to use installed backup nexthops when resolving a recursive route. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-03-16 12:14:53 -04:00
Stephen Worley	0a7edab036	Merge pull request #7993 from mjstapp/reorg_resolve zebra: reorg nexthop resolution code	2021-03-16 11:34:33 -04:00
Igor Ryzhov	6c38095749	zebra: make ribs config false Zebra routing tables are not controlled by the user and can not be created/deleted manually. Current NB create/destroy callbacks are incorrectly implemented because instead of creating/deleting the RIB they are only checking for its existence. YANG model should reflect the real situation. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-03-16 17:25:49 +03:00
Igor Ryzhov	4ba756ed9c	*: fix aborts when validating configuration There are places in the code where function nb_running_get_entry is used with abort_if_not_found set to true during the config validation stage. This is incorrect because when used in transactional CLI, the running entry won't be set until the apply stage, and such usage leads to crash. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-03-16 17:25:49 +03:00
David Lamparter	ad6f7449ef	*: remove remaining severity prefixes Having a "warning:" prefix on a debug message is particularly dumb... Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-14 22:56:07 +01:00
David Lamparter	5d27875b7d	zebra: move up prefix2str call in rib dump Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-14 22:56:07 +01:00
David Lamparter	ef7b8be459	zebra: use printfrr exts in EVPN/VXLAN code Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-14 22:56:07 +01:00
David Lamparter	5e9f9adbb4	fpm: use printfrr exts Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-14 22:56:07 +01:00
Jafar Al-Gharaibeh	d532dd6d6a	Revert "zebra: Remove `first_p` which is never used" This reverts commit `8617eb7c5f`.	2021-03-12 01:02:25 -06:00
Donald Sharp	8617eb7c5f	zebra: Remove `first_p` which is never used Remove dead code. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-03-11 21:22:53 -05:00
Mark Stapp	6ff2514b41	Merge pull request #8124 from pguibert6WIND/ipsec_iptable_dplane zebra: move netfilter contexts to zebra dplane	2021-03-10 16:43:15 -05:00
Philippe Guibert	ef524230a6	zebra: move ipset and ipset_entry to zebra dplane contexts like it has been done for iptable contexts, a zebra dplane context is created for each ipset/ipset entry event. The zebra_dplane_ctx job is then enqueued and processed by separate thread. Like it has been done for zebra_pbr_iptable context, the ipset and ipset entry contexts are encapsulated into a union of structures in zebra_dplane_ctx. There is a specificity in that when storing ipset_entry structure, there was a backpointer to the ipset structure that is necessary to get some complementary information before calling the hook. The proposal is to use an ipset_entry_info structure next to the ipset_entry, in the zebra_dplane context. That information is used for ipset_entry processing. The ipset name and the ipset type are the only fields necessary. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-03-10 14:57:32 +01:00
Philippe Guibert	5162e00045	zebra: move iptable handling in zebra_dplane The iptable processing was not handled in remote dataplane, and was directly processed by the thread in charge of zapi calls. Now that call can be handled in the zebra_dplane separate thread. once a zebra_dplane_ctx is allocated for iptable handling, the hook call is performed later. Subsequently, a return code may be triggered to zclient interface if any problem occurs when calling the hook call. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-03-04 11:50:25 +01:00
Stephen Worley	b1077f0fa2	Merge pull request #8152 from idryzhov/fix-zebra-blackhole zebra: don't use kernel nexthops for blackhole routes	2021-03-02 11:50:46 -05:00
Patrick Ruddy	e1cfd75ffb	Merge pull request #8021 from AnuradhaKaruppiah/evpn-weak-override-fix zebra: disable setting weak override flag in neigh updates	2021-03-02 10:44:43 +00:00
Igor Ryzhov	4be03ff4ca	zebra: don't use kernel nexthops for blackhole routes Fixes #6522 and #8149. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-03-01 17:47:38 +03:00
Anuradha Karuppiah	3f589fa8ec	zebra: fix problem with bypass getting set accidentally on all ESs This was caused because of uninitialized netlink attrs in the bond-member netlink parse API. PS: It was caught by the upstream topotests on ARM8 (passed everywhere else). Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2021-02-24 08:11:26 -08:00
Anuradha Karuppiah	8e1337c5dd	zebra: del/add remote mac if there is a change from es->non-es dst and vice versa This is needed as kernel currently doesn't allow a mac replace if the dst changes from a L2NHG to a single-VTEP and vice versa. Ticket: CM-31561 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-24 08:11:26 -08:00
Anuradha Karuppiah	fd40906be9	zebra: flush macs linked to the bond when it moves out of bypass When an ES-bond is in bypass state MACs learnt on it are linked to the access port instead of the ES. When LACP converges on the bond it moves out of bypass and the MACs previously learnt on it are flushed to force a re-learn on new traffic. Ticket: CM-31326 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-24 08:11:26 -08:00
Anuradha Karuppiah	8b07f173e8	zebra: link local MACs to destination port for efficient lacp-bypass processing When an ES-bond comes out of bypass FRR needs to flush the local MACs learnt while the bond was in bypass. To do that efficiently local MACs are linked to the dest-access port. This only happens if the access-port is in LACP-bypass or if it is non-ES. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-24 08:11:24 -08:00
Anuradha Karuppiah	00a7710c25	zebra: support for lacp bypass with EVPN MH Feature overview: ================= An 802.3ad bond can be setup to allow lacp-bypass. This is done to enable servers to pxe boot without a LACP license i.e. allows the bond to go oper up (with a single link) without LACP converging. If an ES-bond is oper-up in an "LACP-bypass" state MH treats it as a non-ES bond. This involves the following special handling - 1. If the bond is in a bypass-state the associated ES is placed in a bypass state. 2. If an ES is in a bypass state - a. DF election is disabled (i.e. assumed DF) b. SPH filter is not installed. 3. MACs learnt via the host bond are advertised with a zero ESI. When the ES moves out of "bypass" the MACs are moved from a zero-ESI to the correct non-zero id. This is treated as a local station move. Implementation: =============== When (a) an ES is detached from a hostbond or (b) an ES-bond goes into LACP bypass zebra deletes all the local macs (with that ES as destination) in the kernel and its local db. BGP re-sends any imported MAC-IP routes that may exist with this ES destination as remote routes i.e. zebra can end up programming a MAC that was previously local as remote pointing to a VTEP-ECMP group. When an ES is attached to a hostbond or an ES-bond goes LACP-up (out of bypass) zebra again deletes all the local macs in the kernel and its local db. At this point BGP resends any imported MAC-IP routes that may exist with this ES destination as sync routes i.e. zebra can end up programming a MAC that was previously remote as local pointing to an access port. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-24 08:09:33 -08:00
Igor Ryzhov	db1f688d44	zebra: fix duplicated definitions Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-02-24 14:51:00 +03:00
Igor Ryzhov	9082b3eb3d	zebra: fix vni configuration in default vrf VNI configuration is done without NB layer in default VRF. It leads to the following problems: ``` vtysh -c "conf" -c "vni 1" vtysh -c "conf" -c "vrf default" -c "no vni" ``` Second command does nothing, because the NB node is not created by the first command. ``` vtysh -c "conf" -c "vrf default" -c "vni 1" vtysh -c "conf" -c "no vni 1" ``` Second command doesn't delete the NB node created by the first command. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-02-24 14:51:00 +03:00
Patrick Ruddy	0ff7911386	Merge pull request #7879 from AnuradhaKaruppiah/advertise-svi-mac evpn-mh: Advertise SVI MAC as a type-2 route if EVPN MH is enabled	2021-02-24 10:20:24 +00:00
Mark Stapp	15869cd81d	Merge pull request #8035 from qlyoung/remove-more-sprintf *: remove more sprintf()	2021-02-23 15:55:02 -05:00
Anuradha Karuppiah	736475cdf6	zebra: disable setting weak override flag in neigh updates This is causing problems with VM move i.e. transition from remote neigh to local neigh. This transition involves changing the NUD_STATE NUD_NOARP to NUD_STALE. And the weak override flag prevents changing the state from connected (REACHABLE, NOARP, PERMANENT) to STALE. PS: Weak-override was originally used to prevent race conditions where FRR can end up making a REACHABLE neigh STALE. We may need to revisit and address that case at a later point. Ticket: CM-30273 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-22 11:56:30 -08:00
Mark Stapp	9b4ab90984	zebra: support nh resolution without a route Start reorg of zebra nexthop-resolution so that we can use the resolution logic for nexthop-groups as well as routes. Change the signature of the core nexthop_active() api so that it does not require a route-entry or route-node. Move some of the logic around so that nexthop-specific logic is in nexthop_active(), while route-oriented logic is in nexthop_active_check(). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-02-19 15:38:37 -05:00
Anuradha Karuppiah	e4c3ece6e0	zebra: fix problem with SVI MAC not being sent to BGP For MH the SVI MAC is advertised to prevent flooding of ARP replies. But because of a bug the SVI MAC was being added to the zebra database but not sent to bgpd for advertising. Ticket: CM-33329 Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2021-02-19 08:11:15 -08:00
Anuradha Karuppiah	bd2ac9a794	zebra: drop the SVI MAC cleanup done as a part of interface delete As a part of FRR shutdown interfaces are force flushed (in an arbitrary order). Interfaces are already down at that point i.e. resources like SVI-MAC have already been released. Attempting to clean it up again as a part of the force-flush was resulting in access of freed up memory - >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ==26457== Thread 1: ==26457== Invalid read of size 8 ==26457== at 0x1AE6B0: zebra_evpn_acc_bd_svi_set (zebra_evpn_mh.c:606) ==26457== by 0x1B1460: zebra_evpn_if_cleanup (zebra_evpn_mh.c:1040) ==26457== by 0x13CA69: if_zebra_delete_hook (interface.c:244) ==26457== by 0x48A0E34: hook_call_if_del (if.c:59) ==26457== by 0x48A0E34: if_delete_retain (if.c:290) ==26457== by 0x48A2F94: if_delete (if.c:313) ==26457== by 0x48A3169: if_terminate (if.c:1217) ==26457== by 0x48E0024: vrf_delete (vrf.c:254) ==26457== by 0x48E0024: vrf_delete (vrf.c:225) ==26457== by 0x48E02FE: vrf_terminate (vrf.c:551) ==26457== by 0x1442E1: sigint (main.c:203) ==26457== by 0x1442E1: sigint (main.c:141) ==26457== by 0x48CF862: quagga_sigevent_process (sigevent.c:103) ==26457== by 0x48DD324: thread_fetch (thread.c:1404) ==26457== by 0x48A926A: frr_run (libfrr.c:1122) >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> (gdb) bt (gdb) fr 5 1037 zebra/zebra_evpn_mh.c: No such file or directory. (gdb) p zif->ifp->name $2 = "vlan131", '\000' <repeats 12 times> (gdb) p zif->link->info $5 = (void *) 0x1 (gdb) p/x zif->ifp->flags $7 = 0x1002 (gdb) >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Ticket: CM-32435 Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>	2021-02-19 08:11:15 -08:00
Chirag Shah	3b63732a42	zebra: prevent crash in evpn if cleanup zebra crash is seen while cleaning up evpn interface during shutdown event. evpn interface clean up is called from vrf_delete callback (gdb) frame 4 (is_up=false, br_zif=0x0, vlan_zif=0x557f31fb36f0) at zebra/zebra_evpn_mh.c:614 614 zebra/zebra_evpn_mh.c: No such file or directory. (gdb) p tmp_br_zif $1 = (struct zebra_if ) 0x0 (gdb) p vlan_zif->link $2 = (struct interface ) 0x557f31fb2d40 (gdb) p vlan_zif->link->info $3 = (void *) 0x0 (gdb) p zebra_if->ifp->name No symbol "zebra_if" in current context. (gdb) p vlan_zif->ifp->name $4 = "peerlink-3.4094\000\000\000\000" Ticket:CM-32435 Reviewed By:CCR-10957 Testing Done: Signed-off-by: Chirag Shah <chirag@nvidia.com>	2021-02-19 08:11:15 -08:00
Anuradha Karuppiah	243b74eda6	zebra: changes to advertise SVI mac by default if evpn-mh is enabled Added support for advertising SVI MAC if EVPN-MH is enabled. In the case of EVPN MH arp replies from an attached server can be sent to the ES-peer. To prevent flooding of the reply the SVI MAC needs to be advertised by default. Note: advertise-svi-ip could have been used as an alternate way to advertise SVI MAC. However that config cannot be turned on if SVI IPs are re-used (which is done to avoid wasting IP addresses in a subnet). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-19 08:11:15 -08:00
Anuradha Karuppiah	c0c7707d0d	zebra: fix problem with SVI IP being advertised even if disabled SVI IP is being advertised unconditionally i.e. even if disabled (and that is the default config). This can be problematic when the SVI address is re-used across racks. Added the user config condition in all the relevant places where the SVI advertisement is triggered. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2021-02-19 08:11:15 -08:00
Donald Sharp	d6816f68bd	zebra: use AF_INET for protocol family When looking up the conversion from kernel protocol to internal protocol family make sure we use the correct AF_INET (what the kernel uses) instead of AFI_IP (which is what FRR uses). Otherwise routes from OSPF will show up from the kernel as OSPF6 instead of OSPF, which will cause mayhem. Ticket: CM-33306 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-02-16 15:54:08 -05:00
David Lamparter	1d5453d607	*: remove tabs & newlines from log messages Neither tabs nor newlines are acceptable in syslog messages. They also break line-based parsing of file logs. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-02-14 15:36:51 +01:00
Stephen Worley	3d26211e08	Merge pull request #7508 from sudhanshukumar22/zebra-vrf-delete zebra: treat vrf add for existing vrf as update	2021-02-10 02:05:10 -05:00
Quentin Young	7533cad751	*: remove more sprintf() Should be just a couple non-development, non-test occurrences of this function left now. Signed-off-by: Quentin Young <qlyoung@qlyoung.net>	2021-02-09 15:40:40 -05:00
Russ White	d887c7bf04	Merge pull request #7973 from sworleys/Pbr-More-Fixes zebra,pbrd,doc: PBR more fixes	2021-02-09 07:37:09 -05:00
Donald Sharp	4b24d96930	Merge pull request #8009 from pjdruddy/evpn-cleanup zebra: resolve multiple functions for local MAC delete	2021-02-04 13:37:24 -05:00
Donald Sharp	99a30b4760	zebra: Display instance id as part of `show zebra client summ` When displaying `show zebra client summ` when we have instances running, display the instance number as well. New Output: sharpd@eva ~/frr7 (instance_data)> vtysh -c "show zebra client summ" Name Connect Time Last Read Last Write IPv4 Routes IPv6 Routes -------------------------------------------------------------------------------- ospf[1] 00:00:02 00:00:02 00:00:02 0/0 0/0 ospf[5] 00:00:02 00:00:02 00:00:02 0/0 0/0 sharp 00:00:02 00:00:02 00:00:02 0/0 0/0 static 00:00:02 00:00:02 00:00:02 0/0 0/0 Routes column shows (added+updated)/deleted Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-02-04 08:35:14 -05:00
Pat Ruddy	46d6f5a2c6	zebra: resolve multiple functions for local MAC delete the old VXLAN function for local MAC deletion was still in existence and being called from the VXLAN code whilst the new generic function was not being called at all. Resolve this so the generic function matches the old function and is called exclusively. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-02-03 12:22:00 +00:00
Igor Ryzhov	1ac88792c0	*: fix all backets Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-02-02 19:11:25 +03:00
Russ White	a67b8731d2	Merge pull request #7991 from donaldsharp/valgrind_cleanups1 Valgrind cleanups	2021-02-02 07:30:06 -05:00
Stephen Worley	f7692085cb	zebra: move pbr hash create after update release Move the pbr hash creation to be after the update release and dplane install. Now that rules are installed in a separate dplane pthread, we can have scenarios where we have an interface flapping and we install/remove rules fast enough that we could issue what we think is an update for an identical rule and end up releasing the rule right after we created it and sent it to the dplane. This solves the problem of receiving duplicate rules during interface flapping. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-02-01 13:32:37 -05:00
Stephen Worley	8eeca5a201	zebra: add some debugging for PBR events in zebra Add some debugging for PBR events internal to zebra, specifically ADD/UPDATE/DELETE of pbr rules. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-02-01 13:32:37 -05:00
Stephen Worley	3d30f6defb	zebra: disallow resolution to duplicate nexthops Disallow the resolution to nexthops that are marked duplicate. When we are resolving to an ecmp group, it's possible this group has duplicates. I found this when I hit a bug where we can have groups resolving to each other and cause the resolved->next->next pointer to increase exponentially. Sufficiently large ecmp and zebra will grind to a halt. Like so: ``` D> 4.4.4.14/32 [150/0] via 1.1.1.1 (recursive), weight 1, 00:00:02 * via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02 via 4.4.4.1 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.2 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.3 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.4 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.5 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.6 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.7 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.8 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.9 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.10 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.11 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.12 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.13 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.15 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, 
dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 4.4.4.16 (recursive), weight 1, 00:00:02 via 1.1.1.1, dummy1 onlink, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 via 1.1.1.1, dummy1, weight 1, 00:00:02 D> 4.4.4.15/32 [150/0] via 1.1.1.1 (recursive), weight 1, 00:00:09 * via 1.1.1.1, dummy1 onlink, weight 1, 00:00:09 via 4.4.4.1 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.2 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.3 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, 
weight 1, 00:00:09 via 4.4.4.4 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.5 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.6 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.7 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.8 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.9 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.10 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.11 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.12 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.13 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.14 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 4.4.4.16 (recursive), weight 1, 00:00:09 via 1.1.1.1, dummy1 onlink, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 via 1.1.1.1, dummy1, weight 1, 00:00:09 D> 4.4.4.16/32 [150/0] via 1.1.1.1 (recursive), weight 1, 00:00:19 * via 1.1.1.1, dummy1 onlink, weight 1, 00:00:19 via 4.4.4.1 (recursive), weight 1, 00:00:19 via 1.1.1.1, dummy1, weight 1, 00:00:19 via 4.4.4.2 (recursive), weight 1, 00:00:19 ............... ................ and on... 
``` You can repro the above via: ``` kernel routes: 1.1.1.1 dev dummy1 scope link 4.4.4.0/24 via 1.1.1.1 dev dummy1 ============================== config: nexthop-group doof nexthop 1.1.1.1 nexthop 4.4.4.1 nexthop 4.4.4.10 nexthop 4.4.4.11 nexthop 4.4.4.12 nexthop 4.4.4.13 nexthop 4.4.4.14 nexthop 4.4.4.15 nexthop 4.4.4.16 nexthop 4.4.4.2 nexthop 4.4.4.3 nexthop 4.4.4.4 nexthop 4.4.4.5 nexthop 4.4.4.6 nexthop 4.4.4.7 nexthop 4.4.4.8 nexthop 4.4.4.9 ! =========================== Then use sharpd to install 4.4.4.16 -> 4.4.4.1 pointing to that nexthop group in descending order. ``` These changes prevent the growing ecmp above by disallowing duplicates from being considered in the resolution decision. These nexthops are not installed anyway, so why should we be resolving to them? Signed-off-by: Stephen Worley <sworley@nvidia.com>	2021-02-01 13:02:40 -05:00
sudhanshukumar22	75d26fb313	zebra: treat vrf add for existing vrf as update Description: When we get a new vrf add and vrf with same name, but different vrf-id already exists in the database, we should treat vrf add as update. This happens mostly when there are lots of vrf and other configuration being replayed. There may be a stale vrf delete followed by new vrf add. This can cause timing race condition where vrf delete could be missed and further same vrf add would get rejected instead of treating last arrived vrf add as update. Treat vrf add for existing vrf as update. Implicitly disable this VRF to cleanup routes and other functions as part of vrf disable. Update vrf_id for the vrf and update vrf_id tree. Re-enable VRF so that all routes are freshly installed. Above 3 steps are mandatory since it can happen that with config reload stale routes which are installed in vrf-1 table might contain routes from older vrf-0 table which might have got deleted due to missing vrf-0 in new configuration. Signed-off-by: sudhanshukumar22 <sudhanshu.kumar@broadcom.com>	2021-02-01 08:33:13 -08:00
Donald Sharp	a013777abc	zebra: Prevent sending of uninitialized data valgrind is reporting: 2448137-==2448137== Thread 5 zebra_apic: 2448137-==2448137== Syscall param writev(vector[...]) points to uninitialised byte(s) 2448137:==2448137== at 0x4D6FDDD: __writev (writev.c:26) 2448137-==2448137== by 0x4D6FDDD: writev (writev.c:24) 2448137-==2448137== by 0x48A35F5: buffer_flush_available (buffer.c:431) 2448137-==2448137== by 0x48A3504: buffer_flush_all (buffer.c:237) 2448137-==2448137== by 0x495948: zserv_write (zserv.c:263) 2448137-==2448137== by 0x4904B7E: thread_call (thread.c:1681) 2448137-==2448137== by 0x48BD3E5: fpt_run (frr_pthread.c:308) 2448137-==2448137== by 0x4C61EA6: start_thread (pthread_create.c:477) 2448137-==2448137== by 0x4D78DEE: clone (clone.S:95) 2448137-==2448137== Address 0x720c3ce is 62 bytes inside a block of size 4,120 alloc'd 2448137:==2448137== at 0x483877F: malloc (vg_replace_malloc.c:307) 2448137-==2448137== by 0x48D2977: qmalloc (memory.c:110) 2448137-==2448137== by 0x48A30E3: buffer_add (buffer.c:135) 2448137-==2448137== by 0x48A30E3: buffer_put (buffer.c:161) 2448137-==2448137== by 0x49591B: zserv_write (zserv.c:256) 2448137-==2448137== by 0x4904B7E: thread_call (thread.c:1681) 2448137-==2448137== by 0x48BD3E5: fpt_run (frr_pthread.c:308) 2448137-==2448137== by 0x4C61EA6: start_thread (pthread_create.c:477) 2448137-==2448137== by 0x4D78DEE: clone (clone.S:95) 2448137-==2448137== Uninitialised value was created by a stack allocation 2448137:==2448137== at 0x43E490: zserv_encode_vrf (zapi_msg.c:103) Effectively we are sending `struct vrf_data` without ensuring data has been properly initialized. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-02-01 08:57:51 -05:00
Donald Sharp	cadc15cfe2	zebra: Remove #if 0 code Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-28 13:57:49 -05:00
Mark Stapp	4c99d413e6	zebra: debug messages go under conditionals Move a couple of unprotected debug calls in the netlink code under DEBUG_KERNEL. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-26 12:29:39 -05:00
Donald Sharp	431deca7ea	Merge pull request #7905 from mjstapp/fix_zapi_nhg zebra, sharpd: async results for NHGs	2021-01-25 10:29:04 -05:00
Mark Stapp	ee94437e28	zebra: send async nhg update results Send the results of daemons' nhg updates asynchronously, after the update has actually completed. Capture additional info about the source daemon in order to locate the correct zapi session. Simplify the result types considered by the zebra_nhg module. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-22 16:33:01 -05:00
Mark Stapp	f5b7e50f9a	zebra: use afi_t for route-map address family arg Use afi_t in the route_map_check api Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-21 10:13:57 -05:00
Mark Stapp	bdbef5edc6	Merge pull request #7233 from donaldsharp/route_map_optimizations Route map optimizations	2021-01-19 13:20:02 -05:00
Patrick Ruddy	f87fe77aeb	Merge pull request #7723 from AnuradhaKaruppiah/fdb-ext-attrs zebra: move from NDA_NOTIFY to NDA_FDB_EXT_ATTRS	2021-01-19 16:27:54 +00:00
Mark Stapp	5898ce6f35	libs,zebra: remove zapi nhg encode and decode public apis The raw zapi apis to encode and decode NHGs don't need to be public; also add a little more validity-checking. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-19 08:48:54 -05:00
Donald Sharp	3a15018892	zebra: Tell SA that we are intentionally ignoring the return When calling fpm_nl_enqueue we should expect a return value indicating whether or not the data fit on the outgoing stream. This is not necessary to check here because the while loop where we are checking this already has ensured that the data being written will fit. CID -> 1499854 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-18 09:06:49 -05:00
Donald Sharp	d33da0e071	zebra: A `zebra route-map delay-timer 0` command should still run the route-map Setting `zebra route-map delay-timer 0` completely turns off any route-map processing in zebra, which is completely wrong. A timer of 0 means `do it now`. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	4dfcfabfa9	zebra: Push timer out if another route-map change comes in for zebra If we are running with a delayed timer to handle route-map changes in zebra, and another route-map change is made in the cli, push out the timer instead of leaving it unmodified. This allows a large set of route-maps to be read in by the system without the timer popping in the middle while new route-map changes are still being read in. Additionally convert to use THREAD_OFF, preventing a possible use after free as well as aligning the thread api usage with what we consider correct. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	cfcd844c0b	zebra: Limit routemap changes to reconsider only routes associated with that rm Current code when a route map changes schedules a rerun of all routes in the particular table. So if you modify the `ip protocol XX route-map FOO` route-map `FOO` all routes will be rechecked. This is extremely expensive. Modify zebra to only update the routes associated with the route-map. So if we have 800k bgp routes and 50 ospf routes and we are route-map'ing the ospf routes we'll only look at 50 routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	54aeba3540	zebra: Allow rib_update_table to receive a specified route type When we need to cause a reprocessing of data the code currently marks all routes as needing to be looked at. Modify the rib_update_table code to allow us to specify the one route type we want to reprocess. At this point none of the code is behaving differently; this is just setup for a future code change. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	1866a6f65b	zebra: remove unused function rib_update_vrf The function rib_update_vrf is never used. Remove it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Donald Sharp	3d34678f1d	doc: Document the "zebra route-map delay-timer" functionality Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-15 19:34:33 -05:00
Duncan Eastoe	869a5f7168	zebra: set nlmsg_pid in netlink msgs sent by 'fpm' Use nl_pid from the netlink socket used for programming the kernel (netlink_dplane) in netlink route messages sent by the 'fpm' module. This makes 'fpm' consistent with 'dplane_fpm_nl' which already behaves this way, and allows FPM server implementations to determine route origin via nlmsg_pid. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2021-01-15 16:28:06 +00:00
Donald Sharp	f7f52f0d2b	Merge pull request #7868 from mjstapp/fix_fpm_conn_up zebra: don't set connection-up event pointer directly	2021-01-15 06:55:29 -05:00
Mark Stapp	9fad1340d4	Merge pull request #7866 from kishorekunal01/fpm_dump_issue zebra: Scale setup RMAC is send multiple time to fpm	2021-01-14 14:13:31 -05:00
Mark Stapp	ef1dbba83a	zebra: don't set connection-up event pointer directly Use thread_cancel to reset the connection-up processing timer. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-14 14:09:14 -05:00
Kishore Kunal	e840edcacb	zebra: Scale setup RMAC is send multiple time to fpm Thread zfpm_conn_up_thread_cb can Yield and send RMAC multiple times to FPM. Signed-off-by: Kishore Kunal <kishorekunal01@broadcom.com>	2021-01-14 15:53:52 +00:00
Donald Sharp	700cae7698	zebra: in zebra_evpn_mac.c use size_t for buffer length Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 13:22:29 -05:00
Donald Sharp	b16e800423	zebra: Create a dump function for mac->flags and use it Create a function that can dump the mac->flags in human readable output and convert all debugs to use it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 13:22:29 -05:00
Donald Sharp	bf902d4c52	zebra: Create function to dump MACIP flags Create a function to dump MACIP flags and to use it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 13:22:27 -05:00
Mark Stapp	d7ceaa8f5a	Merge pull request #7819 from donaldsharp/more_data_for_debug_dumps zebra: Add ability to display human readable format re->flags and status	2021-01-13 13:06:23 -05:00
Mark Stapp	3c57be5936	Merge pull request #7818 from donaldsharp/ip_proto_denied zebra: notify installing protocol when nexthops cannot be resolved	2021-01-13 10:33:33 -05:00
Donald Sharp	61e6de9d57	zebra: Add ability to display in human readable format re->flags and status The re->flags and re->status in debugs were being dumped as hex values. I can never quickly decode this. Here is an idea. Let's let FRR do it for me. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-13 10:16:06 -05:00
Donald Sharp	1afacb94e6	Merge pull request #6853 from mjstapp/fix_rib_dups zebra: reduce impact of route-update overload	2021-01-13 09:42:34 -05:00
Donald Sharp	7874422ad2	Merge pull request #7850 from mjstapp/build_dplane_plugin zebra: build the sample dataplane plugin	2021-01-12 08:43:53 -05:00
Mark Stapp	b9f15b49b2	zebra: add the sample dataplane plugin to the build Build the sample dataplane plugin with debug/dev builds. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-11 16:33:55 -05:00
Mark Stapp	fb913e53a5	zebra: remove unused local in dplane sample plugin Remove an unused local in the sample dataplane plugin. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-01-11 16:33:27 -05:00
Donald Sharp	7e010c4b78	zebra: notify installing protocol when nexthops cannot be resolved In the case where a route's nexthops cannot be resolved as part of route processing, immediately notify the upper level protocol that its routes failed to install if it is interested in being informed about this issue. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-11 10:11:35 -05:00
Donatas Abraitis	88ffa95dc3	Merge pull request #7823 from donaldsharp/zebra_delay_timer Zebra delay timer	2021-01-11 16:46:23 +02:00
Donald Sharp	f10f8f0e98	Merge pull request #7652 from adharkar/frr-vni_switch zebra: L3VNI to L2VNI conversion is not handled	2021-01-10 18:44:49 -05:00
Donald Sharp	7df0e6bb3b	Merge pull request #7756 from pjdruddy/bgplu-fixes Bgplu fixes	2021-01-09 15:48:22 -05:00
Donald Sharp	24420c8200	Merge pull request #7787 from deastoe/fpm-work-ready-fixes dplane_fpm_nl: routes stuck with 'q' flag (revisited)	2021-01-09 15:38:46 -05:00
Donald Sharp	9df81095f8	zebra: zebra route-map delay-timer is global not per vrf The zebra route-map delay timer value is a global value not a per vrf change. As such we should only print it out one time. We are seeing this: zebra route-map delay-timer 33 exit-vrf zebra route-map delay-timer 33 When we have 2 vrf's configured. Fix the code to only write it out for the default vrf Ticket: CM-32888 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-08 22:34:41 -05:00
Donald Sharp	c70e585e05	zebra: Remove uncalled function Remove the dead function zebra_route_map_write_delay_timer Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-08 22:34:41 -05:00
Renato Westphal	dc70c83afa	Merge pull request #7816 from pjdruddy/revert_labelmanager_statics Revert labelmanager statics	2021-01-08 20:57:25 -03:00
Mark Stapp	6b66913275	Merge pull request #7762 from sworleys/PBR-Ipv4/Ipv6-Match-Fixes pbrd: pbr ipv4/ipv6 match fixes	2021-01-05 13:54:06 -05:00
Pat Ruddy	507d2737d6	zebra: expose label-manager util-funcs Revert "zebra: unexpose label-manager util-funcs as static" This reverts commit `d3d9639d9a`. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-05 18:19:44 +00:00
Patrick Ruddy	b567ed7eeb	Merge pull request #7722 from AnuradhaKaruppiah/mh-fixes bgpd, zebra: evpn mh fixes	2021-01-05 09:26:17 +00:00
Pat Ruddy	189982283a	zebra: labelmanager could return reserved labels when checking if there is a "hole" behind the current reservation marker the calculation of whether the hole is big enough to satisfy the requested chunk is out by 1. This could result in returning a label which has already been allocated. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-04 14:29:44 +00:00
Pat Ruddy	3c84497943	zebra: label manager should never return a reserved block if the requested chunk size was less than 16 then a chunk within the reserved block would be returned. Make sure that we never return labels that are below MPLS_LABEL_UNRESERVED_MIN Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-04 14:29:44 +00:00
Quentin Young	19ff5340a1	Merge pull request #7777 from volta-networks/fix_zebra_rib_c++ zebra: avoid c++ reserved keyword	2020-12-29 11:07:12 -05:00
Stephen Worley	a4525d25b5	Merge pull request #7788 from deastoe/zebra2proto-kernel-connect zebra: zebra2proto() handle kernel/connect type	2020-12-28 14:57:41 -05:00
Mark Stapp	7c08b70a53	Merge pull request #7724 from donaldsharp/pbr_zebra_was_wrong Pbr zebra was wrong	2020-12-23 13:34:18 -05:00
Duncan Eastoe	911d4d4804	zebra: zebra2proto() handle kernel/connect type When dplane_fpm_nl is used the "Please add this protocol(n) to proper rt_netlink.c handling" debug message is emitted for any route of type kernel or connected. This severely reduces performance of dplane_fpm_nl when large numbers of these routes are present in the RIB. The messages are not observed when using the original fpm module since this uses a custom function, netlink_proto_from_route_type(). zebra2proto() now returns RTPROT_KERNEL for ZEBRA_ROUTE_CONNECT and ZEBRA_ROUTE_KERNEL. This should only impact dplane_fpm_nl's use of the common netlink routines since these routes are generally ignored via checking of RSYSTEM_ROUTE(). Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-22 21:27:52 +00:00
Duncan Eastoe	b677907c99	zebra: fpm_nl_process() reschedule dp thread fpm_nl_process() now ensures that the dataplane thread is rescheduled if it hits the work limit while processing its incoming work queue. This would probably already occur due to some other event, such as fpm_process_queue() enqueuing completed work to the output queue, however it does no harm to add this explicit reschedule. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-22 21:14:03 +00:00
Duncan Eastoe	f1595ce439	zebra: resched dp thread if output queue limit hit If the dataplane thread hits the work limit while processing the output queue for any given provider, we now explicitly reschedule the thread. Otherwise, if the number of items in the output queue is greater than the work limit, draining of that output queue is dependent on new dataplane work. Routes which are not drained from the output queue are stuck with the 'q' flag, so this is a similar issue to that observed in `164d8e8608`. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-22 21:14:03 +00:00
Rafael Zalamena	fb1e954880	Merge pull request #7767 from mjstapp/fix_dplane_extra_info zebra: fix loop logic in dplane for extra intf info	2020-12-22 15:08:35 -03:00
Mark Stapp	700ff41ed3	Merge pull request #7472 from opensourcerouting/fpm-fixes fpm: frr-reload, IPv6 and an improvement	2020-12-22 11:37:58 -05:00
Anuradha Karuppiah	0b05c9bbe1	zebra: skip EVI setup if an ES is applied to a pseudo interface zebra maintains a pseudo interface for hanging off user config after the interface is deleted in the kernel. If a user tried to configure an ES against such an interface zebra would crash with the following call stack - at zebra/zebra_evpn_mh.c:2095 sysmac=sysmac@entry=0x55cfbadd3160) at zebra/zebra_evpn_mh.c:2258 at zebra/zebra_evpn_mh.c:3222 argv=<optimized out>, es_lid_str=<optimized out>, es_lid=1, no=0x0, vty=0x55cfbaf4c7b0) at zebra/zebra_evpn_mh.c:3222 argv=<optimized out>) at ./zebra/zebra_evpn_mh_clippy.c:202 vty=vty@entry=0x55cfbaf4c7b0, cmd=cmd@entry=0x0, filter=FILTER_RELAXED) at lib/command.c:1073 Ticket: CM-31702 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	16de1338a9	zebra: accept bgp remote mac-ip update if the higher-seq-local mac is not bgp-ready If a local-MAC or local-neigh is not active locally it is not sent to BGP. At this point if BGP receives a remote route it accepts it and installs it in zebra. Zebra was rejecting BGP's update if it had a higher seq local (inactive) entry. This would result in bgp and zebra falling out of sync. In some cases zebra would delete the local-inactive entries after some time (as a part of the dplane/kernel garbage collection). This would leave zebra with missing remote entries (which were still present in bgpd). This change allows lower-seq BGP updates to overwrite zebra's local entry if that entry happens to be local-inactive. Note: This logic was already in use for sync-mac-ip updates. Extended the same logic to remote-mac-ip updates. Ticket: CM-31626 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	963b0c55fd	zebra: clean zevpn references in the access bd database when the VNI is deleted When a VNI was deleted as a part of FRR/zebra shutdown the zevpn entry was being freed without removing its reference in the access vlan entry (i.e. without clearing the VLAN->VNI mapping) used by MH. Ticket: CM-31197 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	7c0e4dc659	zebra: reinstall missing peer-sync flag If a netlink/dp notification is rxed for a neigh without the peer-sync flag FRR re-installs the entry with the right flags. This change is needed to handle cases where the dataplane and FRR may fall out of sync because of neigh learning on the network ports (i.e. via the VxLAN). Ticket: CM-30693 The problem was found during VM mobility "torture" tests where 100s of extended VM moves were done. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	2c89cb9017	zebra: changes to log ext_flags in neigh nl add Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	c1735c08c9	zebra: fix a problem with local MAC pointing to a remote ES If a remote MAC update is rxed from BGP with a lower sequence number than the local one zebra ignores the MAC update. This typically happens if there is a race condition (where updates are in flight from zebra to BGP). There was a bug in zebra because of which the dest ES was being updated before this check. This left the local MAC pointing to a remote ES. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Relevant Dumps: =============== root@leaf21:mgmt:~# net show evpn mac vni 101101 mac 00:93:00:00:00:01 MAC: 00:93:00:00:00:01 ESI: 03:00:00:00:77:01:03:00:00:0d Intf: - VLAN: 101 Sync-info: neigh#: 1 peer-proxy Local Seq: 3 Remote Seq: 0 Neighbors: 21.1.13.1 Active root@leaf21:mgmt:~# net sho evpn es Type: L local, R remote, N non-DF ESI Type ES-IF VTEPs 03:00:00:00:77:01:02:00:00:0c R - 6.0.0.10,6.0.0.11 03:00:00:00:77:01:03:00:00:0d R - 6.0.0.10,6.0.0.11,6.0.0.12 03:00:00:00:77:01:04:00:00:0e R - 6.0.0.10,6.0.0.11,6.0.0.12,6.0.0.13 03:00:00:00:77:02:02:00:00:16 LR bondP2-H2 6.0.0.15 03:00:00:00:77:02:03:00:00:17 LR bondP2-H3 6.0.0.15,6.0.0.16 03:00:00:00:77:02:04:00:00:18 LR bondP2-H4 6.0.0.15,6.0.0.16,6.0.0.17 root@leaf21:mgmt:~# Relevant logs: =============== 2020/07/29 15:41:27.110846 ZEBRA: Recv MACIP ADD VNI 101101 MAC 00:93:00:00:00:01 IP 21.1.13.1 flags 0x0 seq 2 VTEP 0.0.0.0 ESI 03:00:00:00:77:01:03:00:00:0d from bgp 2020/07/29 15:41:27.110867 ZEBRA: Ignore remote MACIP ADD VNI 101101 MAC 00:93:00:00:00:01 IP 21.1.13.1 as existing MAC has higher seq 3 flags 0x401 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Ticket: CM-30273 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:17 -08:00
Anuradha Karuppiah	c7bfd08568	zebra: advertise stale neighs if EVPN-MH is not enabled With EVPN-MH, Type-2 routes are also used for MAC-IP syncing between ES peers so a change was done to only treat REACHABLE local neigh entries as local-active and advertise them as Type-2 routes i.e. STALE neigh entries are no longer advertised as Type-2s. This however exposed some unexpected problems with MLAG where a secondary reboot followed by a primary reboot left a lot of neighs in STALE state (on the primary) resulting in them not being advertised, and in remote routed traffic to those hosts being blackholed in a sym-IRB setup. This commit is a workaround to fix the regression (it doesn't fix the underlying problems with entries not becoming REACHABLE; which may be a day-1 problem). The workaround is to continue advertising STALE neighbors if EVPN-MH is not enabled. Ticket: CM-30303 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:41:15 -08:00
Anuradha Karuppiah	362c8f2d73	zebra: handle "show evpn es-evi" a non-existent VNI zebra was crashing when the command was run on a non-existent VNI. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 VNI 16777215 doesn't exist root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 detail VNI 16777215 doesn't exist root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 json [ ] root@torm-12:mgmt:~# net show evpn es-evi vni 16777215 detail json [ ] root@torm-12:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Ticket: CM-30232 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-21 08:40:07 -08:00
Emanuele Di Pascale	2e8db20d7e	zebra: avoid c++ reserved keyword in rib_handle_nhg_replace, do not use new as a parameter name to allow compilation of c++ code including zebra headers. Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>	2020-12-21 14:34:55 +01:00
Mark Stapp	b364e87d56	zebra: fix loop logic in dplane for extra intf info The way a couple of clauses were placed in a loop meant that some info might not be collected - re-order things just a bit. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-18 13:49:07 -05:00
Stephen Worley	e36ea40d3b	zebra: derive rule family from src->dst->ipv4 Derive the rule family from src if available, otherwise dst if available, otherwise assume ipv4. We only support ipv4/ipv6 currently so if we can't tell from the src/dst it must be ipv4 and likely a dsfield match. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2020-12-18 11:53:18 -05:00
Duncan Eastoe	438dd3e7df	zebra: reduce atomic ops in fpm_process_queue() Maintain the count of contexts which have been processed in a local variable, and perform a single atomic update after we have consumed all queued contexts. Generally this results in at least one less atomic operation per context. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:37:13 +00:00
Duncan Eastoe	3f2b998f61	zebra: local var in fpm_process_queue() sched cond Don't use an atomic operation to determine whether fpm_process_queue() needs to be re-scheduled. Instead we can simply use a local variable to determine if we stopped processing because we ran out of buffers. In the case where we would have re-scheduled due to new context objects in the queue (enqueued after we stopped processing), fpm_nl_process() will schedule us (or will have done already). Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:36:39 +00:00
Duncan Eastoe	bf2f783945	zebra: reduce atomic ops in fpm_nl_process() Maintain the peak ctxqueue length in a local variable, and perform a single atomic update after processing all contexts. Generally this results in at least one less atomic operation per context. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:36:38 +00:00
Duncan Eastoe	dc693fe057	zebra: reduce dplane_fpm_nl ctxqueue_mutex contention Reduce code in the critical sections of fpm_nl_process() and fpm_process_queue() to the bare minimum - basically only enqueue and dequeue operations on the shared ctxqueue. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-18 15:33:46 +00:00
Mark Stapp	86723fe89b	zebra: nht resolve-via-default doesn't need force We don't need to use the 'force' flag when processing the resolve-via-default clis for ip and ipv6: we can just do normal nht processing. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-17 11:22:09 -05:00
Ameya Dharkar	3b0a590bf3	zebra: L3VNI to L2VNI conversion is not handled After removal of L3VNI config, the VNI should become an L2VNI if a VxLAN interface is present for the VNI. This case is not handled in the code. Changes: 1. After unconfiguring L3VNI, create an L2VNI if VxLAN interface is present for the VNI. 2. Trigger an update to BGP. 3. Read MAC and ARP entries from kernel. This PR fixes the issue only for route type-2, 3 and 5. This PR does not address states regarding route type-1, 4 and multicast group for VxLAN interface. Signed-off-by: Ameya Dharkar <adharkar@vmware.com>	2020-12-16 18:06:37 -08:00
Anuradha Karuppiah	35f5c31b0e	zebra: add support for DF delay timer When a new ES is created it is held in a non-DF state for 3 seconds as specified by RFC7432. This allows the switch time to import the Type-4 routes from the peers. And the peers time to rx the new Type-4 route. root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"\|grep DF DF status: non-df DF delay: 00:00:01 DF preference: 50000 root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01"\|grep DF DF status: df DF preference: 50000 root@torm-11:mgmt:~# Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-15 10:03:50 -08:00
Anuradha Karuppiah	0109f42f86	zebra: display DF status only for local ESs For remote ESs it is not relevant and confuses the admin. Local ES sample - =============== root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01" ESI: 03:44:38:39:ff:ff:01:00:00:01 Type: Local,Remote Interface: hostbond1 State: up Bridge port: yes Ready for BGP: yes VNI Count: 10 MAC Count: 3 DF: status: df preference: 50000 >>>>>>>>>>>>>>> Nexthop group: 536870913 VTEPs: 27.0.0.16 df_alg: preference df_pref: 32767 nh: 268435465 27.0.0.17 df_alg: preference df_pref: 32767 nh: 268435466 root@torm-11:mgmt:~# Remote ES sample - =============== root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:02:00:00:01" ESI: 03:44:38:39:ff:ff:02:00:00:01 Type: Remote Interface: - Ready for BGP: no VNI Count: 0 MAC Count: 6 DF: status: - preference: 0 >>>>>>>>>>>>>>> Nexthop group: 536870919 VTEPs: 27.0.0.18 nh: 268435464 27.0.0.19 nh: 268435467 27.0.0.20 nh: 268435461 root@torm-11:mgmt:~# Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-15 10:02:03 -08:00
Patrick Ruddy	a119a429e4	Merge pull request #7637 from AnuradhaKaruppiah/evpn-pim-fixes evpn-pim: cleanup and display fixes	2020-12-15 17:36:24 +00:00
Patrick Ruddy	bedf36e327	Merge pull request #7636 from AnuradhaKaruppiah/type-0-esi zebra: support for type-0 ESI	2020-12-15 17:33:46 +00:00
Patrick Ruddy	01c65ba77e	Merge pull request #7633 from AnuradhaKaruppiah/protodown-fixes evpn-mh: protodown handling fixes	2020-12-15 17:23:32 +00:00
Russ White	930c9b7be8	Merge pull request #7736 from ton31337/fix/s_addr_INADDR_ANY *: Replace s_addr check against 0 with INADDR_ANY	2020-12-15 07:12:49 -05:00
Donatas Abraitis	3a6290bdd1	*: Replace s_addr check against 0 with INADDR_ANY Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-12-14 21:03:38 +02:00
Stephen Worley	3bece1e0e3	Merge pull request #7162 from opensourcerouting/zebra-human-netlink zebra: human readable netlink dumps	2020-12-14 14:03:35 -05:00
Anuradha Karuppiah	dc261b8de4	zebra: restart start-up delay timer when the first uplink comes up When all the uplinks go down the VTEP is disconnected from the VxLAN overlay and this was handled by proto-downing the ES bonds. When the uplinks come up again we need to re-enable the ES bonds but that needs to be done after a delay to allow the EVPN network to converge. And that is done by firing off the startup-delay timer on first uplink-up. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:41 -08:00
Anuradha Karuppiah	2bcf92e18b	zebra: re-sync protodown state with the dplane on new ES add 1. When a bond is associated with an ES we may need to re-sync the dplane protodown state (which may be stale/set by some other app). 2. Also change the uplink state display to avoid confusion with protodown reason code (both used to show uplink-up). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:40 -08:00
Anuradha Karuppiah	26ba45e33d	zebra: update protodown display protodown state is a combination of the dplane and zebra states. protodown reason is maintained exclusively by zebra. Display this information on two separate lines to make that ownership clearer. Also display n/a for bonds as the dplane doesn't support protodowning the bond device. Sample output - ============== root@torm-11:mgmt:~# vtysh -c "show interface hostbond1"\|grep -i protodown protodown: off (n/a) protodown reasons: (uplinks-down) root@torm-11:mgmt:~# vtysh -c "show interface swp5"\|grep -i protodown protodown: on protodown reasons: (uplinks-down) root@torm-11:mgmt:~# PS: Cosmetic changes only, no functional change. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:40 -08:00
Anuradha Karuppiah	5c84327054	zebra: re-sync protodown state when a port/mbr is linked to an ES-bond The code for this was already there but was not kicking in because of a zebra local reason-code dup check. Even if the reason-code is the same, if the dplane and zebra disagree about the protodown state zebra will need to re-program the dplane. Fixed a couple of spelling errors in the protodown logs to make greps easy. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-14 10:32:40 -08:00
Donatas Abraitis	219218d964	Merge pull request #7664 from donaldsharp/global_bgp_wait Global bgp wait	2020-12-14 10:28:02 +02:00
Donald Sharp	3ceae22b7f	Revert "zebra: When shutting down an interface immediately notify about rnh" This reverts commit `0aaa722883`.	2020-12-11 20:45:43 -05:00
Nikolay Aleksandrov	4bcdb6086c	zebra: move from NDA_NOTIFY to NDA_FDB_EXT_ATTRS Use the new nested NDA_FDB_EXT_ATTRS attribute to control per-fdb notifications. PS: The attributes were updated as a part of the kernel upstreaming hence the change. Signed-off-by: Nikolay Aleksandrov <nikolay@cumulusnetworks.com> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-11 12:13:36 -08:00
Duncan Eastoe	164d8e8608	zebra: routes stuck with 'q' when using dplane FPM New work enqueued to the dplane_fpm_nl provider is initially de-queued and re-enqueued, in fpm_nl_process(), to be processed by the provider's own thread. After performing this initial de-queue/enqueue we return to dplane_thread_loop() and check the dplane_fpm_nl output queue for any work which has been completed. Since this work is being processed in another thread it is very likely that there will be some (or all) work still outstanding at this point. The dataplane thread finishes up any other tasks and then waits until it is next scheduled. In the meantime the dplane_fpm_nl thread is processing its work queue until completion. The issue arises here as the dataplane thread is not explicitly re-scheduled once dplane_fpm_nl has drained its work queue and populated its output queue with completed work. This completed work can sit in the output queue for an indeterminate period of time, depending upon when the dataplane thread is next scheduled for other work. If the RIB has reached a stable state then this could be a significant period of time. During this period zebra marks these routes as queued, even though they have actually been processed by all dataplane providers. An un-related RIB change which triggers a FIB update will result in the dataplane thread being scheduled and this completed work then being processed. At this point the routes will then no longer be marked as queued by zebra. However this new FIB update might itself then fall victim to the same scenario! We can observe the above behaviour in these detailed dplane logs. 
11:24:47 zebra[7282]: dplane: incoming new work counter: 2 11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel' 11:24:47 zebra[7282]: dplane provider 'Kernel': processing 11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9 11:24:47 zebra[7282]: Dplane NEIGH_DISCOVER, ip 192.168.2.2, ifindex 9 11:24:47 zebra[7282]: dplane dequeues 2 completed work from provider Kernel 11:24:47 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl' 11:24:47 zebra[7282]: dplane dequeues 1 completed work from provider dplane_fpm_nl 11:24:47 zebra[7282]: dplane has 1 completed, 0 errors, for zebra main 2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good. 1 completed context was de-queued, so there is outstanding work. 11:24:58 zebra[7282]: dplane: incoming new work counter: 2 11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel' 11:24:58 zebra[7282]: dplane provider 'Kernel': processing 11:24:58 zebra[7282]: ID (193) Dplane nexthop update ctx 0x55c429b6fed0 op NH_INSTALL 11:24:58 zebra[7282]: 0:5.5.5.5/32 Dplane route update ctx 0x55c429b79690 op ROUTE_INSTALL 11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider Kernel 11:24:58 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl' 11:24:58 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl 11:24:58 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main A further 2 contexts (all incoming work) have been queued to dplane_fpm_nl - all good. 2 completed contexts were de-queued, which sounds good as that is what we en-queued. However, there is an outstanding context from earlier, so there is still outstanding work. Indeed the new 5.5.5.5/32 route is marked as queued: O>q 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:01:19 This remains the case until we trigger a FIB update by installation of the (eg.) 
10.10.10.10/32 route: 11:26:41 zebra[7282]: dplane: incoming new work counter: 2 11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'Kernel' 11:26:41 zebra[7282]: dplane provider 'Kernel': processing 11:26:41 zebra[7282]: ID (195) Dplane nexthop update ctx 0x55c429b78ce0 op NH_INSTALL 11:26:41 zebra[7282]: 0:10.10.10.10/32 Dplane route update ctx 0x55c429b7a040 op ROUTE_INSTALL 11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider Kernel 11:26:41 zebra[7282]: dplane enqueues 2 new work to provider 'dplane_fpm_nl' 11:26:41 zebra[7282]: dplane dequeues 2 completed work from provider dplane_fpm_nl 11:26:41 zebra[7282]: dplane has 2 completed, 0 errors, for zebra main 11:26:41 zebra[7282]: zebra2proto: Please add this protocol(2) to proper rt_netlink.c handling 11:26:41 zebra[7282]: Nexthop dplane ctx 0x55c429b6fed0, op NH_INSTALL, nexthop ID (193), result SUCCESS 11:26:41 zebra[7282]: default(0:254):5.5.5.5/32 Processing dplane result ctx 0x55c429b79690, op ROUTE_INSTALL result SUCCESS We observe the same 2 enqueues and 2 dequeues as before, which again suggests that there is outstanding work. As expected, the 5.5.5.5/32 route is no longer marked as queued: O>* 5.5.5.5/32 [110/10] via 192.168.2.2, dp0p1s3, weight 1, 00:02:06 But the 10.10.10.10/32 route is, as we have not yet processed the completed context: C>q 10.10.10.10/32 is directly connected, lo, 00:26:05 Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-11 15:04:15 +00:00
Duncan Eastoe	53706b4e51	zebra: dplane API to get provider output q length Returns the current number of (completed) contexts in the provider's output queue (dp_ctx_out_q), allowing access to this data from the provider itself. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-11 15:04:11 +00:00
Duncan Eastoe	7545bda0a4	dplane_fpm_nl: queue peak counter never increments The context queue length peak counter is always set to its current value, hence never increments. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-12-11 12:09:56 +00:00
Donald Sharp	7ed5844bef	zebra: Allow `show zebra client` to give clues about route update status When entering `show zebra client` allow the display of the client->notify_status for route updates. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-10 12:59:14 -05:00
Russ White	101ad544fa	Merge pull request #7678 from donaldsharp/aspath_to_zebra Aspath to zebra	2020-12-10 10:38:14 -05:00
Donald Sharp	b2c7cf18b2	Merge pull request #7706 from slankdev/slankdev-unexpose-lm-func-1 zebra: unexpose label-manager util-funcs as static	2020-12-10 07:43:02 -05:00
Rafael Zalamena	0c7e0f2f70	Merge pull request #7697 from pguibert6WIND/zebra_crash_startup_zns zebra: anticipate zns creation at vrf creation when backend is vrf-lite	2020-12-10 09:10:34 -03:00
Donatas Abraitis	82b773e63b	Merge pull request #7524 from donaldsharp/zebra_route_map_tighten zebra: deny when route map is specified but does not exist yet	2020-12-10 11:01:25 +02:00
Hiroki Shirokura	d3d9639d9a	zebra: unexpose label-manager util-funcs as static The following functions, which are part of the label-manager implementation, are not called from outside their file, and all of the label-manager code currently lives in zebra/label_manager.c. So these functions should be unexposed. Functions: - create_label_chunk - assign_label_chunk - delete_label_chunk - release_label_chunk Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2020-12-10 09:56:55 +09:00
Philippe Guibert	91b1421e84	zebra: anticipate zns creation at vrf creation when backend is vrf-lite If the namespace pointer is already available, feed it at vrf creation. This prevents a crash if the netlink parsing has already begun and the vrf-lite is not enabled yet. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-12-09 13:26:20 +00:00
Mark Stapp	e386d2b154	Merge pull request #7690 from donaldsharp/nht_show_is_not_not_not zebra, tests: Fix `show ip nht`	2020-12-09 07:58:37 -05:00
Hiroki Shirokura	732d22cbf2	zebra: use zserv_send_message instead of writen The following functions use writen to dispatch messages to the socket, while other functions use zserv_send_message. This commit makes a small unification of zapi's socket messaging. Funcs: - zsend_assign_label_chunk_response() - zsend_label_manager_connect_response() Signed-off-by: Hiroki Shirokura <slank.dev@gmail.com>	2020-12-09 17:17:21 +09:00
Donald Sharp	dda33b6e0c	zebra, tests: Fix `show ip nht` The `show ip nht` and `show ipv6 nht` commands were broken. This is because recent code commit: `0154d8ce45` assumed that p must not be NULL and this is not the case. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 15:50:46 -05:00
Donald Sharp	e46723a50e	bgpd, zebra: Add ability for bgp to send AS-Path information to zebra Add a bit of code to allow bgp to send the AS-Path associated with the route being installed to zebra so it can be displayed and used as part of the `show ip route A` command in zebra. eva# show ip route 20.0.0.0/11 Routing entry for 20.0.0.0/11 Known via "bgp", distance 20, metric 0, best Last update 00:00:00 ago * 192.168.161.1, via enp39s0, weight 1 AS-Path: 60000 64539 15096 6939 8075 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:07:21 -05:00
Donald Sharp	cfa2a35d8d	sharpd, zebra: Pass and display opaque data as PoC Pass data from sharpd to zebra as opaque data and display it as part of the detailed route data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:09 -05:00
Donald Sharp	80a6ee90c3	zebra: Setup structure for opaque data to be displayed Setup the output mechanism for opaque data to be displayed to the end operator. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:08 -05:00
Donald Sharp	a29a60016e	zebra: Gather opaque data into the route entry for storage Just gather the opaque data into the route entry. Later commits will display this data for end users as well as to send it down. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:08 -05:00
Donald Sharp	aab4eca1c0	lib, zebra: Fix overlapping message types We had duplicate message id's. Shit's broke yo. Fix. I have no idea how this properly worked. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-08 09:06:08 -05:00
Karen Schoener	581e797e02	zebra: Adding zapi client close notification When zebra detects a client close, send a zapi client close notification. Signed-off-by: Karen Schoener <karen@voltanet.io>	2020-12-07 18:22:36 -05:00
Mark Stapp	a88a7c8d43	zebra: improve dataplane plugin queue counters Add the current queue depths for each plugin to the 'show dplane providers' output. Maintain the out-bound queue max counter properly, that was being ignored. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-07 13:54:08 -05:00
Mark Stapp	0ca6f3b1e6	zebra: remove useless deleted route_entries promptly Zebra accumulates route-entry objects and then processes them as a group. If that rib processing is delayed, because the dataplane/fib programming has built up a queue e.g., zebra can hold multiple deleted route objects in memory. At scale, this can be a problem. Delete unneeded route entries promptly, if they can't contribute to rib processing. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-12-07 13:54:08 -05:00
Patrick Ruddy	dd662ca570	Merge pull request #7399 from AnuradhaKaruppiah/mh-mac-ecmp-fixes evpn-mh: miscellaneous fixes in MAC-sync and MAC-ECMP handling	2020-12-03 16:27:49 +00:00
Rafael Zalamena	f584de526d	fpm: reset/walk data structures on connection Don't attempt to walk data structures while not connected so we can save some CPU usage when FPM server is offline. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-12-03 07:30:23 -03:00
Rafael Zalamena	1f9193c1f0	fpm: simplify reset logic Instead of checking for next group reset, always do it and skip sending if next hop group support is disabled. Also remove unused `*_complete` variables. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-12-03 07:30:23 -03:00
Rafael Zalamena	a3adec468e	zebra,fpm: fix configuration display Use `pI4` and `pI6` to format addresses and fix a bug when displaying IPv6 addresses. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-12-03 07:30:23 -03:00
Donatas Abraitis	c49042b407	Merge pull request #7638 from donaldsharp/reduce_warn zebra: Reduce warn -> debug	2020-12-03 08:17:59 +02:00
Donald Sharp	0fb4ab0388	Merge pull request #6950 from opensourcerouting/bfd-distributed-v3 bfdd: distributed BFD	2020-12-02 20:50:47 -05:00
Donald Sharp	af8a77d636	Merge pull request #7644 from mjstapp/dplane_cleaner zebra: add an api to process/clean the pending dplane queue	2020-12-02 09:01:44 -05:00
Donald Sharp	fe76cf322e	Merge pull request #7646 from volta-networks/fix_show_route_summary zebra: fix show ip route vrf X summary	2020-12-02 08:59:54 -05:00
Mark Stapp	b238167a9b	Merge pull request #7645 from sworleys/NHG-IFP-Error2Log zebra: make a couple NHG errors debugs	2020-12-01 17:17:59 -05:00
Rafael Zalamena	de5fa92042	Merge pull request #7617 from deastoe/dplane-fpm-lsp zebra: dplane FPM LSP support	2020-12-01 16:01:09 -03:00
Stephen Worley	8c74d904d4	zebra: remove unused EC_ZEBRA_IF_LOOKUP_FAILED EC_ZEBRA_IF_LOOKUP_FAILED is no longer being used, remove it. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2020-12-01 13:05:36 -05:00
Anuradha Karuppiah	46bf266c1c	zebra: debug logs to detect incorrect mac deletions A MAC entry cannot be deleted while a neigh is referencing it. It seems there is some race condition where this may be happening. The log is to help identify those cases. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:28 -08:00
Anuradha Karuppiah	4f9bb78eca	zebra: change the L2 NHG id format to co-exist with the L3NHG ids It is now 4bits of type and 28bits of value - 1. type=0 is for L3 NHG 2. type=1 is for L2 NH 3. type=2 is for L2 NHG Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:28 -08:00
Anuradha Karuppiah	5de10c3705	zebra: allocate one nexthop id per-VTEP instead of one per-ES-VTEP This is an optimization to reduce the number of L2 nexthops. A l2 or fdb nexthop simply provides the dataplane with a nexthop ip- torm-12:mgmt:~# ip nexthop id 268435461 via 27.0.0.20 scope link fdb id 268435463 via 27.0.0.20 scope link fdb id 268435465 via 27.0.0.20 scope link fdb So there is no need to allocate a nexthop per-ES/per-VTEP. There can be 100+ ESs per-VTEP so this change cuts the scale down by a factor of 100. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:28 -08:00
Anuradha Karuppiah	15400f95b7	zebra: support for slow-failover of local MACs on an ES When a local ES flaps there are two modes in which the local MACs are failed over - 1. Fast failover - A backup NHG (ES-peer group) is programmed in the dataplane per-access port. When a local ES flaps the MAC entries are left unaltered i.e. pointing to the down access port. And the dataplane redirects traffic destined to the oper-down access port via the backup NHG. 2. Slow failover - This mode needs to be turned on to allow dataplanes not capable of re-directing traffic. In this mode local MAC entries on a down local ES are re-programmed to point to the ES-peers' NHG. And vice-versa i.e. when the ES comes up the MAC entries are re-programmed with the access port as dest. Fast failover is on by default. Slow failover can be enabled via the following config - evpn mh redirect-off Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:46:26 -08:00
Anuradha Karuppiah	69711b3f83	zebra: on local mac add from the dplane a re-install may be needed as static As a part of extended MM handling a MAC can be updated from local to remote while being referenced by SYNC neighs (this is really a temporary/small window). During this window if the MAC transitions back to local again we need to reinforce the previous SYNC flags (based on the sync-neigh count) as subsequent SYNC updates to the MAC will be de-duped and ignored. Ticket: CM-29636 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	1a4f9efd54	zebra: set inactive bit when zebra re-installs the MAC on dplane del When a local mac is deleted by the dataplane zebra can re-install it if the MAC is a SYNC MAC (learned from ES peers). The "local_inactive" bit must be set as a part of the re-install to prevent zebra turning around and advertising the MAC as locally active. Also fixed up some debug logs in the slow-fail path to include the VNI. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	80e19eb71f	zebra: skip NDA_DST attr if NHG is present NHG and DST (VTEP-IP) are mutually exclusive attributes. If DST is present the kernel ignores NHG. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	de86cc5bb1	zebra: free up the L2 NHG bitmap as a part of shutdown Fix for a shutdown time memory leak found during review. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Anuradha Karuppiah	f3722826a4	zebra: remove FDB entries before de-activating a L2-NHG NHG is activated i.e. programmed in the dataplane only if there are active-VTEPs associated with it. When a NHG is de-activated all the remote-mac entries associated with it need to be removed before the NHG is removed. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-12-01 09:44:37 -08:00
Patrick Ruddy	0091461961	Merge pull request #7483 from AnuradhaKaruppiah/evpn-mh-dad bgpd, zebra: Keep DAD disabled if EVPN MH is turned on	2020-12-01 17:37:32 +00:00
Emanuele Di Pascale	265ac74a87	zebra: fix show ip route vrf X summary The lookup for non default VRFs was always using a tableId; if not provided, we were defaulting to RT_TABLE_MAIN. This is fine for the default VRF but not for others. As a result, the command was silently failing for non-default VRFs unless we also specified the correct tableId. Fix this by only performing the lookup using the tableId if it is provided; else use zebra_vrf_table. Signed-off-by: Emanuele Di Pascale <emanuele@voltanet.io>	2020-12-01 18:34:05 +01:00
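The lookup change described in the commit above can be sketched as follows. This is a simplified Python model under stated assumptions: tables are identified by plain integer ids, and `select_table_id` / `select_table_id_old` are hypothetical names, not functions from zebra.

```python
RT_TABLE_MAIN = 254  # id of the Linux main routing table


def select_table_id_old(vrf_table_id: int, requested_table_id=None) -> int:
    """Model of the old behaviour: fall back to RT_TABLE_MAIN when no
    tableId is given, which is only right for the default VRF."""
    if requested_table_id is not None:
        return requested_table_id
    return RT_TABLE_MAIN


def select_table_id(vrf_table_id: int, requested_table_id=None) -> int:
    """Model of the fix: use the tableId only if it was provided,
    otherwise use the table that belongs to the VRF itself."""
    if requested_table_id is not None:
        return requested_table_id
    return vrf_table_id
```

For a non-default VRF whose table id is, say, 1001, the old model resolves to 254 and finds nothing, while the fixed model resolves to 1001.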
Stephen Worley	306720345a	zebra: make a couple NHG errors debugs A couple of NHG messages we were logging as errors are a bit spammy in use cases where you routinely add/remove interfaces (VM-heavy deployments). It's not really an error a user cares about; it is more for a developer to know what went wrong after the fact. So it makes more sense for these to be under a debug rather than an error, since seeing them does not implicitly mean an error during those use cases. Signed-off-by: Stephen Worley <sworley@nvidia.com>	2020-12-01 12:04:30 -05:00
Donald Sharp	34c9b28ba8	zebra: Reduce warn -> debug During times of network trauma and when we are at large network scale the process_remote_macip_add function can issue a zlog_warn for a common occurrence. Modify the code to be a debug statement. This behavior is the same now as the process_remote_macip_del function Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-30 19:37:53 -05:00
Mark Stapp	aa21da071c	zebra: add an api to process/clean the pending dplane queue Add an api that allows a caller in the zebra main pthread to process the queue of pending dplane updates. The caller supplies a function to call to test each pending context. Selected contexts are dequeued, and freed without being processed. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-11-30 16:42:18 -05:00
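The queue-cleaning API described above takes a caller-supplied test function and drops the contexts it selects without processing them. A minimal Python sketch of that idea follows; it models the queue as a `collections.deque`, and `clean_pending_queue` and `should_drop` are hypothetical names rather than the actual zebra API.

```python
from collections import deque


def clean_pending_queue(pending: deque, should_drop) -> int:
    """Remove every context for which should_drop(ctx) is true.

    Selected contexts are dequeued and discarded without being processed
    (in C this is where each context would be freed). The remaining
    contexts keep their original order. Returns the number removed.
    """
    kept = deque()
    dropped = 0
    while pending:
        ctx = pending.popleft()
        if should_drop(ctx):
            dropped += 1
        else:
            kept.append(ctx)
    pending.extend(kept)
    return dropped
```

For example, a caller could drop all pending updates belonging to one VRF by passing `lambda ctx: ctx["vrf"] == 1`.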
Anuradha Karuppiah	0c16fb7262	zebra: fix crash seen on VxLAN SG table cleanup done as a part of vrf disable There are two fixes in this commit - 1. Prevent implicit deletion of (*,G) entries during (S,G) cleanup. This is done by creating a dummy reference on all (*,G) entries. This is needed for a hash-walk based table cleanup. 2. Free up the SG hash table when the VRF is deleted. Ticket: CM-30151 Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-30 12:50:38 -08:00
Anuradha Karuppiah	325d694b93	zebra: support for type-0 ESI Earlier type-3 ESI was the only format supported for evpn-mh. Updated the CLI to allow a 10-byte type-0 ESI. Both type-0 and type-3 ESIs are statically configured; just in two different ways - 1. type-0 is configured as a complete 10-byte string 2. type-3 is configured as a 6-byte es-sys-mac and a 3-byte local-discriminator. Sample config - ! interface hostbond1 evpn mh es-id 00:44:38:39:ff:ff:01:00:00:01 ! This is a CLI-only change and has no functional impact. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-30 12:36:41 -08:00
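The two ways of configuring an ESI described above can be illustrated with a short Python sketch. It assumes the 10-byte layout implied by the sample values in this log: a type-0 ESI is given whole and starts with `00`, while a type-3 ESI is `03` followed by the 6-byte es-sys-mac and the 3-byte local discriminator. The helper names are hypothetical and this is not the FRR CLI code.

```python
def _parse_hex_bytes(text: str, expected_len: int) -> bytes:
    """Parse colon-separated hex octets such as '00:00:00:00:01:11'."""
    parts = text.split(":")
    if len(parts) != expected_len:
        raise ValueError(f"expected {expected_len} octets, got {len(parts)}")
    return bytes(int(p, 16) for p in parts)


def _format_esi(raw: bytes) -> str:
    return ":".join(f"{b:02x}" for b in raw)


def type0_esi(esi_text: str) -> str:
    """Validate a complete 10-byte type-0 ESI string."""
    raw = _parse_hex_bytes(esi_text, 10)
    if raw[0] != 0x00:
        raise ValueError("a type-0 ESI must start with octet 00")
    return _format_esi(raw)


def type3_esi(es_sys_mac: str, local_discriminator: int) -> str:
    """Build a type-3 ESI from a 6-byte MAC and a 3-byte discriminator."""
    mac = _parse_hex_bytes(es_sys_mac, 6)
    if not 0 <= local_discriminator < (1 << 24):
        raise ValueError("local discriminator must fit in 3 bytes")
    return _format_esi(bytes([0x03]) + mac + local_discriminator.to_bytes(3, "big"))
```

With es-sys-mac `00:00:00:00:01:11` and es-id `1`, `type3_esi` yields `03:00:00:00:00:01:11:00:00:01`, which matches the ESI shown for `hostbond1` in the DF election commit's sample output elsewhere in this log.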
Mark Stapp	a20e6c32a2	zebra: free dplane ctx after pw update Free the dplane contexts used for pseudowire updates; we were leaking these. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-11-30 10:02:40 -05:00
Duncan Eastoe	f9bf1ecc38	zebra: dplane FPM LSP table walk Add routines to walk the LSP table and generate FPM updates for all entries. A walk of the LSP table is triggered when (re-)connecting to an FPM. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-11-30 12:13:43 +00:00
Duncan Eastoe	b300c8bbcf	zebra: dplane FPM handle LSP install/update/delete Export netlink_lsp_msg_encoder() and use it to encode and send netlink messages concerning LSP updates to connected FPMs. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-11-27 16:32:01 +00:00
Anuradha Karuppiah	dfa3d3d70a	zebra: change the nhg format from hex to dec for easy match up with the dp Dataplane/kernel prints the NHG and NH ids as decimal. Zebra was printing it as hex (to display type vs. val). This became a debugging hassle hence normalizing the format. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-24 11:06:08 -08:00
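When comparing older hex output with the decimal ids the dataplane prints, the conversion is a one-liner. The sketch below is only a debugging aid in Python with hypothetical helper names; the sample values are taken from outputs quoted elsewhere in this log.

```python
def nhg_hex_to_dec(hex_text: str) -> int:
    """Convert an id printed as hex (e.g. '0x2000001') to decimal."""
    return int(hex_text, 16)


def nhg_dec_to_hex(value: int) -> str:
    """Convert a decimal id back to the older hex form."""
    return hex(value)
```

For instance, `nhg_hex_to_dec("0x2000001")` gives `33554433`, and `nhg_dec_to_hex(536870913)` gives `0x20000001`.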
Anuradha Karuppiah	b2ee2b71f4	zebra: Keep DAD disabled if EVPN MH is turned on DAD is not supported currently with EVPN-MH so we turn it off internally when the first ES config is detected. PS: Note that when all local ESs are deleted DAD will stay off and will need to be cleared via a daemon restart. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-24 10:20:32 -08:00
Rafael Zalamena	91804f630c	lib: add new stream function to reorganize buffer The function was originally implemented for the zebra data plane FPM plugin, but other code could use it. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-11-24 07:54:07 -03:00
Donatas Abraitis	53a85efa51	Merge pull request #7554 from donaldsharp/sockunion2hostprefix_watch_returns bgpd, lib, nhrpd, zebra: verify return of sockunion2hostprefix	2020-11-19 11:26:02 +02:00
Mark Stapp	84c709bc6e	Merge pull request #7555 from idryzhov/cppcheck-fixes fix a couple of issues found by cppcheck	2020-11-18 14:29:25 -05:00
Igor Ryzhov	b0efbc16e4	zebra: fix writing to pointer instead of value Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-11-18 19:05:30 +03:00
Donald Sharp	0154d8ce45	bgpd, lib, nhrpd, zebra: verify return of sockunion2hostprefix The return from sockunion2hostprefix tells us if the conversion succeeded or not. There are places in the code where we always assume that it just `works`. Since it can fail, notice the failure and try to do the right thing. Please note that failure of sockunion2hostprefix is highly unlikely in most cases, as the sockunion was already created and tested elsewhere; it's just that this function can fail. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-18 11:04:27 -05:00
Mark Stapp	926bc58f78	Merge pull request #7478 from donaldsharp/buffer Buffer	2020-11-18 08:30:47 -05:00
Russ White	7dce3c57c2	Merge pull request #7518 from donaldsharp/asic_offload_more Asic offload more	2020-11-17 07:27:41 -05:00
Russ White	2bd9d50ca1	Merge pull request #7523 from donaldsharp/route_map_object_t *: Remove route_map_object_t from the system	2020-11-17 07:16:12 -05:00
Mark Stapp	55e74ca925	zebra: use smaller stream buffer for zapi route notifications The owner-notification zapi message is small; use a small buffer for it. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-11-15 14:50:17 -05:00
Donald Sharp	f7a9d0120d	zebra: Add offload and trap counts to summary command for json output Add offload and trap route counts to the json output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-15 10:19:25 -05:00
Donald Sharp	e4876266e4	zebra: Add `--asic-offload` command Add a command that allows FRR to know it's being used with an underlying asic offload, from the linux kernel perspective. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-15 10:19:25 -05:00
Donald Sharp	0d32fbee6d	lib, zebra: Add ability to read kernel notice of Offload Failed The linux kernel is getting RTM_F_OFFLOAD_FAILED for kernel routes that have failed to offload. Write the code to receive these notifications from the linux kernel and store that data for display about the routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-15 10:12:50 -05:00
Donald Sharp	fd303a4ba1	zebra: deny when route map is specified but does not exist yet If we have `ip protocol <proto> route-map FOO` and FOO has not been defined in any way shape fashion or form, we should deny the match instead of permitting it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-13 21:11:48 -05:00
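The rule in the commit above (a configured but undefined route-map denies rather than permits) can be sketched in a few lines. This Python model is an illustration only: route-maps are represented as callables in a dict, and `apply_protocol_route_map` is a hypothetical name, not zebra's API.

```python
def apply_protocol_route_map(route_maps: dict, configured_name, route) -> str:
    """Return 'permit' or 'deny' for a route.

    - no route-map configured for the protocol: permit
    - route-map configured but not (yet) defined: deny
    - route-map configured and defined: let the route-map decide
    """
    if configured_name is None:
        return "permit"
    rmap = route_maps.get(configured_name)
    if rmap is None:
        return "deny"
    return rmap(route)
```

So with `ip protocol <proto> route-map FOO` present and `FOO` absent from `route_maps`, every route is denied until `FOO` is defined.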
Donald Sharp	1782514fb9	*: Remove route_map_object_t from the system The route_map_object_t was being used to track what protocol we were being called against. But each protocol was only ever calling itself. So we had a variable that was only ever being passed in from route_map_apply that had to be carried against and everyone was testing if that variable was for their own stack. Clean up this route_map_object_t from the entire system. We should speed some stuff up. Yes I know not a bunch but this will add up. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-13 19:35:20 -05:00
Donald Sharp	6d12b20703	zebra: Allow `set src X` to work on startup If a route-map in zebra has `set src X` and the interface X is on has not been configured yet, we are rejecting the command outright. This is a problem especially on boot up (where I found this issue), in that interfaces can and will be slow on startup and config can easily be read in before the interface has an ip address. Let's modify zebra to just warn the user that we may have a problem and let the chips fall where they may. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-13 16:12:26 -05:00
Santosh P K	9b936c5c36	Merge pull request #4770 from kssoman/fib Advertise FIB installed routes to bgp peers	2020-11-12 18:59:24 +05:30
Anuradha Karuppiah	60e372e9cb	zebra: Set NUD_NOARP on sticky MAC entries in addition to NTF_STICKY (ndm_state & NUD_NOARP) - prevents the entry from expiring (ndm_flags & NTF_STICKY) - prevents station moves on the entry Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-11-06 17:21:12 -08:00
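The two bits named in the commit above live in different fields of the neighbour message, which the following Python sketch makes explicit. The numeric values are my understanding of `linux/neighbour.h` and should be checked against the kernel headers in use; `sticky_mac_fields` is a hypothetical helper, not zebra code.

```python
# Assumed values from linux/neighbour.h; verify against your headers.
NUD_NOARP = 0x40   # ndm_state bit: entry does not expire
NTF_STICKY = 0x40  # ndm_flags bit: entry is not subject to station moves


def sticky_mac_fields(ndm_state: int = 0, ndm_flags: int = 0):
    """Return (ndm_state, ndm_flags) with both sticky-MAC bits set."""
    return ndm_state | NUD_NOARP, ndm_flags | NTF_STICKY
```

The two constants happen to share a numeric value, so it matters that each is OR-ed into its own field.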
Soman K S	77b38a4a7d	bgpd: Advertise FIB installed routes to bgp peers (Part 1) Issue: The bgp routes learnt from peers which are not installed in kernel are advertised to peers. This can cause routers to send traffic to these destinations only to get dropped. The fix is to provide a configurable option "bgp suppress-fib-pending". When the option is enabled, bgp will advertise routes only if they are successfully installed in kernel. Fix (Part1) : * Added message ZEBRA_ROUTE_NOTIFY_REQUEST used by client to request FIB install status for routes * Added AFI/SAFI to ZAPI messages * Modified the functions zapi_route_notify_decode(), zsend_route_notify_owner() and route_notify_internal() to include AFI, SAFI as parameters Signed-off-by: kssoman <somanks@gmail.com>	2020-11-06 08:39:28 +05:30
Donald Sharp	9ea714e143	zebra: Rework code to make SA happy Clang SA was saying: ./zebra/zebra_vty_clippy.c: In function ‘show_route’: zebra/zebra_vty.c:1775:4: warning: ‘zvrf’ may be used uninitialized in this function [-Wmaybe-uninitialized] do_show_ip_route_all(vty, zvrf, afi, !!fib, !!json, tag, ^ I do not see a way that zvrf could ever be uninitialized in the code path but rearrange the code a tiny bit to make it happier. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-04 11:48:49 -05:00
Mark Stapp	5917df094a	zebra: add optional extra data about routes' interfaces Add extra data about the interfaces used in route updates' nexthops - some consumers of route updates may want additional data, but dataplane plugins running in the dplane pthread cannot safely access the normal zebra data structures. Capturing this info is optional - a plugin must request it (via an api). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-30 10:51:54 -04:00
Mark Stapp	93ca501b61	Merge pull request #7418 from donaldsharp/manuall *: spelling fixes	2020-10-30 08:16:46 -04:00
Donald Sharp	cd8cae5489	Merge pull request #7415 from mjstapp/fix_sa_strlen ospfd, zebra: Fix SA warnings	2020-10-30 07:21:45 -04:00
Jafar Al-Gharaibeh	b131b5f539	Merge pull request #7414 from donaldsharp/32bitflags zebra: Consolidate on 32 bits as the flag size for route flags	2020-10-29 21:47:15 -05:00
Donald Sharp	02c671af40	*: Correct spelling stuff Pretty obvious. WE R SPELL GOOD Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 16:16:00 -04:00
Mark Stapp	904e9b0570	zebra: clean up zebra_protodown_rc_str() Clean up api SA warning, use 'const', and replace snprintf+ pointer math with strlcat. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-29 12:03:25 -04:00
Donald Sharp	acde7f6b8e	zebra: Consolidate on 32 bits as the flag size for route flags When we get a route for installation via any method we should consolidate on 32 bits as the flag size, since we actually have more than 8 bits of data to pass around. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 09:13:59 -04:00
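Why the flag width matters can be shown with a tiny Python model of storing a flag word in a fixed-width field. The flag below is made up for illustration (`HYPOTHETICAL_FLAG` is not a real zebra flag) and `store_flags` is a hypothetical helper.

```python
def store_flags(flags: int, width_bits: int) -> int:
    """Model storing flags in an unsigned field of the given width:
    any bits above the width are silently lost."""
    return flags & ((1 << width_bits) - 1)


HYPOTHETICAL_FLAG = 1 << 9  # a flag that does not fit in 8 bits
```

Here `store_flags(HYPOTHETICAL_FLAG, 8)` is `0`, so the flag disappears, while `store_flags(HYPOTHETICAL_FLAG, 32)` keeps it as `512`.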
Donald Sharp	82144f532b	zebra: Don't do expensive string manip if not in debug Modify the code to not load up a string that is only used in debugging unless we are debugging. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 09:00:43 -04:00
Russ White	763a60663c	Merge pull request #7371 from AnuradhaKaruppiah/mh-uplink-tracking-1 evpn-mh: uplink tracking and startup delay	2020-10-28 12:13:57 -04:00
Donald Sharp	4d8fa81fbe	Merge pull request #7352 from mjstapp/fix_rt_netlink_indent zebra: fix strange indentation	2020-10-27 20:07:15 -04:00
Anuradha Karuppiah	c36e442c4b	zebra: uplink tracking and startup delay for EVPN-MH Local ethernet segments are held in a protodown or error-disabled state if access to the VxLAN overlay is not ready - 1. When FRR comes up the local-ESs/access-port are kept protodown for the startup-delay duration. During this time the underlay and EVPN routes via it are expected to converge. 2. When all the uplinks/core-links attached to the underlay go down the access-ports are similarly protodowned. The ES-bond protodown state is propagated to each ES-bond member and programmed in the dataplane/kernel (per-bond-member). Configuring uplinks - vtysh -c "conf t" vtysh -c "interface swp4" vtysh -c "evpn mh uplink" Configuring startup delay - vtysh -c "conf t" vtysh -c "evpn mh startup-delay 100" >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> EVPN protodown display - ======================== root@torm-11:mgmt:~# vtysh -c "show evpn" L2 VNIs: 10 L3 VNIs: 3 Advertise gateway mac-ip: No Advertise svi mac-ip: No Duplicate address detection: Disable Detection max-moves 5, time 180 EVPN MH: mac-holdtime: 60s, neigh-holdtime: 60s startup-delay: 180s, start-delay-timer: 00:01:14 <<<<<<<<<<<< uplink-cfg-cnt: 4, uplink-active-cnt: 4 protodown: startup-delay <<<<<<<<<<<<<<<<<<<<<<< >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ES-bond protodown display - =========================== root@torm-11:mgmt:~# vtysh -c "show interface hostbond1" Interface hostbond1 is up, line protocol is down Link ups: 0 last: (never) Link downs: 1 last: 2020/04/26 20:38:03.53 PTM status: disabled vrf: default OS Description: Local Node/s torm-11 and Ports swp5 <==> Remote Node/s hostd-11 and Ports swp1 index 58 metric 0 mtu 9152 speed 4294967295 flags: <UP,BROADCAST,MULTICAST> Type: Ethernet HWaddr: 00:02:00:00:00:35 Interface Type bond Master interface: bridge EVPN-MH: ES id 1 ES sysmac 
00:00:00:00:01:11 protodown: off rc: startup-delay <<<<<<<<<<<<<<<<< >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ES-bond member protodown display - ================================== root@torm-11:mgmt:~# vtysh -c "show interface swp5" Interface swp5 is up, line protocol is down Link ups: 0 last: (never) Link downs: 3 last: 2020/04/26 20:38:03.52 PTM status: disabled vrf: default index 7 metric 0 mtu 9152 speed 10000 flags: <UP,BROADCAST,MULTICAST> Type: Ethernet HWaddr: 00:02:00:00:00:35 Interface Type Other Master interface: hostbond1 protodown: on rc: startup-delay <<<<<<<<<<<<<<<< root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-27 09:34:09 -07:00
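The two protodown conditions described in the commit above can be modelled as a set of reasons, with the ES-bond held protodown while any reason is present. This Python sketch is an assumption-laden illustration: the `"startup-delay"` string comes from the sample output, whereas the `"uplink-down"` label, the requirement that at least one uplink be configured, and both function names are hypothetical.

```python
def protodown_reasons(startup_delay_running: bool,
                      uplink_cfg_cnt: int,
                      uplink_active_cnt: int) -> set:
    """Collect the reasons for holding local ESs protodown."""
    reasons = set()
    if startup_delay_running:
        # Underlay and EVPN routes are still expected to converge.
        reasons.add("startup-delay")
    if uplink_cfg_cnt > 0 and uplink_active_cnt == 0:
        # Assumed label: all configured uplinks/core-links are down.
        reasons.add("uplink-down")
    return reasons


def es_bond_protodown(reasons: set) -> bool:
    """The bond (and each bond member) is protodown while any reason holds."""
    return bool(reasons)
```

With the values from the `show evpn` sample (timer running, `uplink-cfg-cnt: 4`, `uplink-active-cnt: 4`) the only reason is `startup-delay`.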
Patrick Ruddy	dd51171227	Merge pull request #7158 from AnuradhaKaruppiah/mh-df-election evpn-mh: support for DF election	2020-10-27 16:09:45 +00:00
Mark Stapp	bdd085a874	zebra: fix strange indentation Fix some odd indentation in rt_netlink.c - merge damage, maybe? Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-27 12:03:41 -04:00
Mark Stapp	aa9d75efaf	Merge pull request #7381 from sworleys/NHG-Show-Proto-Filter zebra: add type specifier to show nexthop-group	2020-10-27 11:33:00 -04:00
Donald Sharp	f1dbb1c7e1	zebra: Add uptime to `show evpn mac vni ... detail` Add the uptime a mac entry has been in the system. New Output: eva# show evpn mac vni all detail VNI 1000 #MACs (local and remote) 16 MAC: 4e:2d:f3:75:ff:db ESI: 03:44:38:39:ff:ff:01:00:00:02 Intf: hostbond2(10) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 7a:a4:f2:30:dd:5d ESI: 03:44:38:39:ff:ff:01:00:00:01 Intf: hostbond1(9) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 66:9e:d7:3a:f1:f1 Remote VTEP: 192.168.100.18 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:26 Neighbors: 45.0.0.5 Active fe80::649e:d7ff:fe3a:f1f1 Active MAC: 26:f1:bd:5f:e1:77 Remote ES: 03:44:38:39:ff:ff:02:00:00:02 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:23 Neighbors: No Neighbors MAC: 16:80:eb:c4:43:6d ESI: 03:44:38:39:ff:ff:01:00:00:01 Intf: hostbond1(9) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 00:00:00:00:00:22 Remote ES: 03:44:38:39:ff:ff:02:00:00:02 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:26 Neighbors: No Neighbors Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 16:47:07 -04:00
Donald Sharp	a05111ba3d	zebra: Add uptime to `show evpn arp-cache vni .. detail` Add uptime data to `show evpn arp-cache vni ... detail` command. Effectively when we create a neighbor entry store the time it was created. When we modify the neighbor entry store the time it was modified. Display under detail output and json output. New output: eva# show evpn arp-cache vni all detail VNI 1000 #ARP (IPv4 and IPv6, local and remote) 8 IP: 45.0.0.5 Type: remote State: active Uptime: 00:01:59 MAC: 0a:fd:87:ca:7c:00 Sync-info: - Remote VTEP: 192.168.100.18 Local Seq: 0 Remote Seq: 0 IP: fe80::8fd:87ff:feca:7c00 Type: remote State: active Uptime: 00:01:59 MAC: 0a:fd:87:ca:7c:00 Sync-info: - Remote VTEP: 192.168.100.18 Local Seq: 0 Remote Seq: 0 IP: fe80::14e5:c2ff:fe50:fa59 Type: local State: active Uptime: 00:02:04 MAC: 16:e5:c2:50:fa:59 Sync-info: - Local Seq: 0 Remote Seq: 0 IP: 45.0.0.3 Type: remote State: active Uptime: 00:02:02 MAC: 0e:50:e8:cf:6b:eb Sync-info: - Remote VTEP: 192.168.100.16 Local Seq: 0 Remote Seq: 0 IP: 45.0.0.2 Type: local State: active Uptime: 00:02:05 MAC: 16:e5:c2:50:fa:59 Sync-info: - Local Seq: 0 Remote Seq: 0 IP: fe80::c50:e8ff:fecf:6beb Type: remote State: active Uptime: 00:02:02 MAC: 0e:50:e8:cf:6b:eb Sync-info: - Remote VTEP: 192.168.100.16 Local Seq: 0 Remote Seq: 0 IP: 45.0.0.4 Type: remote State: active Uptime: 00:01:55 MAC: 02:ad:5f:d8:da:80 Sync-info: - Remote VTEP: 192.168.100.17 Local Seq: 0 Remote Seq: 0 IP: fe80::ad:5fff:fed8:da80 Type: remote State: active Uptime: 00:01:55 MAC: 02:ad:5f:d8:da:80 Sync-info: - Remote VTEP: 192.168.100.17 Local Seq: 0 Remote Seq: 0 eva# Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 16:47:07 -04:00
Stephen Worley	a8ad9a89ea	zebra,doc: add type specifier to show nexthop-group Add a type specifier to the `show nexthop-group` command so we can easily filter by type when using proto created nexthop groups. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-26 15:55:02 -04:00
Anuradha Karuppiah	2747f6f786	zebra: cleanup inet_ntoa usage in zebra_evpn_mh.c logs Replaced inet_ntoa with %pI4 in the zebra debugs logs. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:43:05 -07:00
Anuradha Karuppiah	acffa256ba	zebra: add json output for zebra ES, ES-EVI and access vlan dumps 1. ES root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01 json" \|python -m json.tool { "accessPort": "hostbond1", "dfPreference": 50000, "esi": "03:44:38:39:ff:ff:01:00:00:01", "flags": [ "local", "remote", "readyForBgp", "bridgePort", "operUp", "nexthopGroupActive" ], "macCount": 10, "nexthopGroup": 536870913, "vniCount": 10, "vteps": [ { "dfAlgorithm": "preference", "dfPreference": 32767, "nexthopId": 268435460, "vtep": "27.0.0.16" }, { "dfAlgorithm": "preference", "dfPreference": 32767, "nexthopId": 268435463, "vtep": "27.0.0.17" } ] } >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 2. ES-EVI - root@torm-11:mgmt:~# vtysh -c "show evpn es-evi vni 1001 detail json" \|python -m json.tool [ { "esi": "03:44:38:39:ff:ff:01:00:00:01", "flags": [ "local", "readyForBgp" ], "vni": 1001 }, { "esi": "03:44:38:39:ff:ff:01:00:00:02", "flags": [ "local", "readyForBgp" ], "vni": 1001 }, ] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 3. access-vlan root@torm-11:mgmt:~# vtysh -c "show evpn access-vlan 1001 json" \|python -m json. tool { "memberIfCount": 4, "members": [ { "ifName": "hostbond4" }, { "ifName": "hostbond1" }, { "ifName": "hostbond2" }, { "ifName": "hostbond3" } ], "vlan": 1001, "vni": 1001, "vxlanIf": "vx-1001" } root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:21 -07:00
Anuradha Karuppiah	72f2674a95	zebra: handle local-es bridge port association A local ES can be added or removed to a bridge after it is created. When it becomes a bridge port member the dataplane attributes need to be programmed. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:21 -07:00
Anuradha Karuppiah	28e80a037f	zebra: changes for programming SPH, non-DF and backup NHG br-port attrs split horizon filter, non-DF block filter and backup nexthop group are passed as bridge port attributes to the dataplane. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:19 -07:00
Anuradha Karuppiah	c60522f702	zebra: dplane APIs for programming evpn-mh access port attributes This includes - 1. non-DF block filter 2. List of es-peers that need to be blocked per-access port (for split horizon filtering) 3. Backup nexthop group to failover local-es via the VxLAN overlay Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:32:51 -07:00
Anuradha Karuppiah	1103c5c6cd	zebra: changes to run DF election 1. DF preference is configurable per-ES ! interface hostbond1 evpn mh es-df-pref 100 >>>>>>>>>>> evpn mh es-id 1 evpn mh es-sys-mac 00:00:00:00:01:11 ! 2. This parameter is sent to BGP and advertised via the ESR. 3. The peer-ESs' DF params are sent to zebra (by BGP) and used for running the DF election. 4. If the local VTEP becomes non-DF on an ES a block filter is programmed in the dataplane to drop de-capsulated BUM packets destined to that ES. Sample output ============= >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> torm-11# sh evpn es Type: L local, R remote, N non-DF ESI Type ES-IF VTEPs 03:00:00:00:00:01:11:00:00:01 LRN hostbond1 27.0.0.16 03:00:00:00:00:01:22:00:00:02 LR hostbond2 27.0.0.16 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> torm-11# sh evpn es 03:00:00:00:00:01:11:00:00:01 ESI: 03:00:00:00:00:01:11:00:00:01 Type: Local,Remote Interface: hostbond1 State: up Ready for BGP: yes VNI Count: 10 MAC Count: 2 DF: status: non-df preference: 100 >>>>>>>> Nexthop group: 0x2000001 VTEPs: 27.0.0.16 df_alg: preference df_pref: 32767 nh: 0x100000d >>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:32:49 -07:00
Donald Sharp	b467b4b462	zebra: Fix prefix2str buf and some invalid data output in zebra_mpls.c There are several places where prefix2str was used to convert a prefix but they were debug guarded and the buffer was used for flog_err/warn. This would lead to corrupt data being output in the failure cases if debugs were not turned on. Modify the code in zebra_mpls.c to not use prefix2str Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 09:38:33 -04:00
Donald Sharp	2919eea86a	zebra: Replace some prefix2str with %pFX We are loading a buffer with the prefix2str results then using it in the debugs throughout functions. Replace with just using %pFX and remove the buffer. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 09:38:29 -04:00
Patrick Ruddy	d7bd0c043c	Merge pull request #7217 from AnuradhaKaruppiah/fix-es-del-regression zebra: fix double clearing of zif->es_info.es	2020-10-26 10:12:54 +00:00
Mark Stapp	874e77acce	Merge pull request #7374 from sworleys/Revert-Revert-NHG-Dependents zebra: Fix the NHG dependents relationship	2020-10-24 16:49:09 -04:00
Mark Stapp	33fa4b14db	Merge pull request #7382 from sworleys/Fix-Msg-Buff zebra: fix unitialized msg header reading at startup	2020-10-23 18:05:04 -04:00
Quentin Young	939bd6ac52	Merge pull request #6788 from mjstapp/thread_cancel_off *: unify thread/task cancel apis	2020-10-23 15:02:50 -04:00
Stephen Worley	9d06e1219a	zebra: fix unitialized msg header reading at startup Fixes the valgrind error we were seeing on startup due to initializing the msg header struct: ``` ==2534283== Thread 3 zebra_dplane: ==2534283== Syscall param recvmsg(msg) points to uninitialised byte(s) ==2534283== at 0x4D616DD: recvmsg (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x43107C: netlink_recv_msg (kernel_netlink.c:744) ==2534283== by 0x4330E4: nl_batch_read_resp (kernel_netlink.c:1070) ==2534283== by 0x431D12: nl_batch_send (kernel_netlink.c:1201) ==2534283== by 0x431E8B: kernel_update_multi (kernel_netlink.c:1369) ==2534283== by 0x46019B: kernel_dplane_process_func (zebra_dplane.c:3979) ==2534283== by 0x45EB7F: dplane_thread_loop (zebra_dplane.c:4368) ==2534283== by 0x493F5CC: thread_call (thread.c:1585) ==2534283== by 0x48D3450: fpt_run (frr_pthread.c:303) ==2534283== by 0x48D3D41: frr_pthread_inner (frr_pthread.c:156) ==2534283== by 0x4D56431: start_thread (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x4E709D2: clone (in /usr/lib64/libc-2.31.so) ==2534283== Address 0x85cd850 is on thread 3's stack ==2534283== in frame #2, created by nl_batch_read_resp (kernel_netlink.c:1051) ==2534283== ==2534283== Syscall param recvmsg(msg.msg_control) points to unaddressable byte(s) ==2534283== at 0x4D616DD: recvmsg (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x43107C: netlink_recv_msg (kernel_netlink.c:744) ==2534283== by 0x4330E4: nl_batch_read_resp (kernel_netlink.c:1070) ==2534283== by 0x431D12: nl_batch_send (kernel_netlink.c:1201) ==2534283== by 0x431E8B: kernel_update_multi (kernel_netlink.c:1369) ==2534283== by 0x46019B: kernel_dplane_process_func (zebra_dplane.c:3979) ==2534283== by 0x45EB7F: dplane_thread_loop (zebra_dplane.c:4368) ==2534283== by 0x493F5CC: thread_call (thread.c:1585) ==2534283== by 0x48D3450: fpt_run (frr_pthread.c:303) ==2534283== by 0x48D3D41: frr_pthread_inner (frr_pthread.c:156) ==2534283== by 0x4D56431: start_thread (in 
/usr/lib64/libpthread-2.31.so) ==2534283== by 0x4E709D2: clone (in /usr/lib64/libc-2.31.so) ==2534283== Address 0xa0 is not stack'd, malloc'd or (recently) free'd ==2534283== ``` Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-23 14:57:29 -04:00
Mark Stapp	5047884528	*: unify thread/event cancel macros Replace all lib/thread cancel macros, use thread_cancel() everywhere. Only the THREAD_OFF macro and thread_cancel() api are supported. Also adjust thread_cancel_async() to NULL caller's pointer (if present). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 12:16:52 -04:00
Mark Stapp	1e4fa7f46c	Merge pull request #7364 from donaldsharp/zebra_nhg_keep zebra: Do not delete nhg's when retain_mode is engaged	2020-10-23 10:28:31 -04:00
Mark Stapp	b3d6bc6ef0	* : update signature of thread_cancel api Change thread_cancel to take a ** to an event, NULL-check before dereferencing, and NULL the caller's pointer. Update many callers to use the new signature. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 08:59:34 -04:00
Stephen Worley	7fa239f165	zebra: disable dependent backpointers for backup nexthops Because the backup nexthop groups currently are more like pseudo-NHEs (they don't have IDs and are not inserted into the ID table or hashed), they can't really have this depends/dependents relationship yet in both directions. Some work needs to be done there to make them more like first class citizens like "normal" NHGs to enable this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-22 18:10:44 -04:00
Stephen Worley	8459128259	Revert "Revert "zebra: fix NHE dependents backpointer relationship"" This reverts commit `a682deea0f`.	2020-10-22 18:09:44 -04:00
Mark Stapp	9bcef951be	zebra: replace inet_ntoa Stop using inet_ntoa - use %pI4 or inet_ntop instead Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-22 13:37:25 -04:00
Donald Sharp	b1b07ef5a6	zebra: Do not delete nhg's when retain_mode is engaged When `-r` is specified to zebra, on shutdown we should not remove any routes from the fib. This was a problem with nhg's on shutdown due to their ref-count behavior. Introduce a methodology where on shutdown we don't mess with the nexthop groups in the kernel. That way on next startup things will be ok. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-22 08:02:33 -04:00
Donatas Abraitis	2dbe669bdf	:* Convert prefix2str to %pFX Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-10-22 09:07:41 +03:00
Stephen Worley	a682deea0f	Revert "zebra: fix NHE dependents backpointer relationship" This reverts commit `f9f9466e04`.	2020-10-20 17:11:35 -04:00
Donald Sharp	203098301c	Merge pull request #7348 from mjstapp/fix_router_id_lists zebra: clean up all router id lists	2020-10-20 15:53:52 -04:00
Donatas Abraitis	9072f5c89a	Merge pull request #7311 from donaldsharp/table_lock_count Abstract rn->lock accessing and cleanup usage to %pFX and %pRN	2020-10-20 16:04:15 +03:00
Mark Stapp	cdc09a4b04	zebra: clean up all router id lists Clean up the ipv6 router-id lists associated with a zvrf - these were being leaked. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-20 08:48:12 -04:00
Mark Stapp	658084c229	Merge pull request #7289 from sworleys/NHG-Crash-Start zebra: a couple NHG fixes	2020-10-20 08:41:36 -04:00
Stephen Worley	dc1c436278	zebra: add alias for "show ip/ipv6 ro" Add an alias so people can still type `show ip ro`. It became ambiguous in a recent release. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-19 14:08:18 -04:00
Stephen Worley	f9f9466e04	zebra: fix NHE dependents backpointer relationship Apparently the dependents backpointer trees for singletons got broken at some point and we never noticed. There is not really any code making use of this right now, so it is not surprising, but let's go ahead and fix it for zebra and proto NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-19 12:52:39 -04:00
Anuradha Karuppiah	ab06b03315	zebra: fix double clearing of zif->es_info.es This problem was accidentally introduced as a part of another fixup - [ commit `e378f5020d` (anuradhak/mh-misc-fixes, mh-misc-fixes) Author: Anuradha Karuppiah <anuradhak@cumulusnetworks.com> Date: Tue Sep 15 16:50:14 2020 -0700 zebra: fix use of freed es during zebra shutdown ] zif->es_info.es is cleared as a part of zebra_evpn_es_local_info_clear so it cannot be passed around as a pointer from zebra_evpn_local_es_update/del. Because of this bug removing ES from an interface resulted in a zebra crash. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-19 09:36:44 -07:00
Donald Sharp	c85b63238a	Merge pull request #7333 from mjstapp/fix_multi_connected zebra: support multiple connected subnets on an interface	2020-10-18 08:29:19 -04:00
Donald Sharp	c10e14e96d	*: Create/Use accessor functions for lock count Create appropriate accessor functions for the rn->lock data. We should be accessing this data through accessor functions since it is private data to the data structure. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-17 13:39:10 -04:00
Donald Sharp	7e26b9d4a2	zebra: Fix use after free in debug path When zebra is running with debugs turned on there is a use after free reported by the address sanitizer: 2020/10/16 12:58:02 ZEBRA: rib_delnode: (0:254):4.5.6.16/32: rn 0x60b000026f20, re 0x6080000131a0, removing 2020/10/16 12:58:02 ZEBRA: rib_meta_queue_add: (0:254):4.5.6.16/32: queued rn 0x60b000026f20 into sub-queue 3 ================================================================= ==3101430==ERROR: AddressSanitizer: heap-use-after-free on address 0x608000011d28 at pc 0x555555705ab6 bp 0x7fffffffdab0 sp 0x7fffffffdaa8 READ of size 8 at 0x608000011d28 thread T0 #0 0x555555705ab5 in re_list_const_first zebra/rib.h:222 #1 0x555555705b54 in re_list_first zebra/rib.h:222 #2 0x555555711a4f in process_subq_route zebra/zebra_rib.c:2248 #3 0x555555711d2e in process_subq zebra/zebra_rib.c:2286 #4 0x555555711ec7 in meta_queue_process zebra/zebra_rib.c:2320 #5 0x7ffff74701f7 in work_queue_run lib/workqueue.c:291 #6 0x7ffff7450e9c in thread_call lib/thread.c:1581 #7 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #8 0x55555561a578 in main zebra/main.c:455 #9 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 #10 0x5555555e3429 in _start (/usr/lib/frr/zebra+0x8f429) 0x608000011d28 is located 8 bytes inside of 88-byte region [0x608000011d20,0x608000011d78) freed by thread T0 here: #0 0x7ffff768bb6f in __interceptor_free (/lib/x86_64-linux-gnu/libasan.so.6+0xa9b6f) #1 0x7ffff739ccad in qfree lib/memory.c:129 #2 0x555555709ee4 in rib_gc_dest zebra/zebra_rib.c:746 #3 0x55555570ca76 in rib_process zebra/zebra_rib.c:1240 #4 0x555555711a05 in process_subq_route zebra/zebra_rib.c:2245 #5 0x555555711d2e in process_subq zebra/zebra_rib.c:2286 #6 0x555555711ec7 in meta_queue_process zebra/zebra_rib.c:2320 #7 0x7ffff74701f7 in work_queue_run lib/workqueue.c:291 #8 0x7ffff7450e9c in thread_call lib/thread.c:1581 #9 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #10 0x55555561a578 in main zebra/main.c:455 #11 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 previously allocated by thread T0 here: #0 0x7ffff768c037 in calloc (/lib/x86_64-linux-gnu/libasan.so.6+0xaa037) #1 0x7ffff739cb98 in qcalloc lib/memory.c:110 #2 0x555555712ace in zebra_rib_create_dest zebra/zebra_rib.c:2515 #3 0x555555712c6c in rib_link zebra/zebra_rib.c:2576 #4 0x555555712faa in rib_addnode zebra/zebra_rib.c:2607 #5 0x555555715bf0 in rib_add_multipath_nhe zebra/zebra_rib.c:3012 #6 0x555555715f56 in rib_add_multipath zebra/zebra_rib.c:3049 #7 0x55555571788b in rib_add zebra/zebra_rib.c:3327 #8 0x5555555e584a in connected_up zebra/connected.c:254 #9 0x5555555e42ff in connected_announce zebra/connected.c:94 #10 0x5555555e4fd3 in connected_update zebra/connected.c:195 #11 0x5555555e61ad in connected_add_ipv4 zebra/connected.c:340 #12 0x5555555f26f5 in netlink_interface_addr zebra/if_netlink.c:1213 #13 0x55555560f756 in netlink_information_fetch zebra/kernel_netlink.c:350 #14 0x555555612e49 in netlink_parse_info zebra/kernel_netlink.c:941 #15 0x55555560f9f1 in kernel_read zebra/kernel_netlink.c:402 #16 0x7ffff7450e9c in thread_call lib/thread.c:1581 #17 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #18 0x55555561a578 in main zebra/main.c:455 #19 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 SUMMARY: AddressSanitizer: heap-use-after-free zebra/rib.h:222 in re_list_const_first This is happening because we are using the dest pointer after a call into rib_gc_dest. In process_subq_route, we call rib_process() and if the dest is deleted dest pointer is now garbage. We must reload the dest pointer in this case. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-17 08:55:26 -04:00
Mark Stapp	87009d7df0	zebra: support multiple connected subnets on an interface We support configuration of multiple addresses in the same subnet on a single interface: make sure that zebra supports multiple instances of the corresponding connected route. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-16 16:46:33 -04:00
Mark Stapp	027b3ca2e0	Merge pull request #7244 from donaldsharp/mlag_backout_and_fix Mlag backout and fix	2020-10-14 08:30:54 -04:00
Donald Sharp	4fe30ff1eb	Merge pull request #7298 from mjstapp/quiet_opaque_debugs zebra: quiet the zebra opaque message debugs	2020-10-14 07:27:27 -04:00
Donald Sharp	ca3491262b	zebra: Isolate mlag_rd_buf_offset to the actual using function Isolate the mlag_rd_buf_offset variable to the actual used function, instead of having it a global. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-13 16:02:05 -04:00
Donald Sharp	ded3e3e39c	Revert "zebra: the mlag_rd_buf_offset variable was write only" This reverts commit `00e0d113e5`.	2020-10-13 15:57:54 -04:00
Donald Sharp	82b4a8bf2c	Merge pull request #7258 from mjstapp/zebra_remove_slsp zebra: remove 'static' lsp objects	2020-10-13 15:51:18 -04:00
Mark Stapp	674afc2b0a	zebra: quiet the zebra opaque message debugs Put most of the debugs about opaque ZAPI messages under 'detail' to reduce the noise. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-13 14:07:17 -04:00
Donald Sharp	c8c5009ec5	Merge pull request #7288 from rsmarples/BSD-link_state BSD: ifi_link_state is the link state	2020-10-13 13:43:07 -04:00
Stephen Worley	475852b263	zebra: only track NHEs from the dataplane for ID usage Let's just track the NHEs we get from the kernel(dplane) for ID usage with internal routes. I tried to be smart originally and allow them to be re-used internal to zebra but it's proving to cause more bugs than it's worth. This doesn't break any functionality. It just means we won't use NHEs we get from the kernel with our routes, we will create new ones. Decided this based on various bugs seen with the latest one being on startup with this kernel state: ``` [root@alfred frr-2]# ip next ls id 15 via 192.168.161.1 dev doof scope link proto zebra id 17 group 15 proto zebra [root@alfred frr-2]# ip ro show 3.3.3.1 3.3.3.1 nhid 17 via 192.168.161.1 dev doof ``` Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-13 11:23:57 -04:00
Emanuele Bovisio	2a3a97be8c	doc, zebra: remove keep_kernel option everywhere remove all remaining parts related to keep_kernel option Signed-off-by: Emanuele Bovisio <emanuele.bovisio@eolo.it>	2020-10-13 12:59:50 +02:00
Roy Marples	98f3df554b	zebra: ifi_link_state is the link state SIOCGIFMEDIA returns the media state. SIOCGIFDATA returns interface data which includes the link state. While the status of the former is usually indicative of the latter, this is not always the case. In fact some recent net80211 changes in at least NetBSD and OpenBSD have MONITOR media set to active but the link status set to DOWN. All interfaces will return link state with SIOCGIFDATA, unlike SIOCGIFMEDIA. However not all BSDs support SIOCGIFDATA - it has recently been accepted into FreeBSD-13. However, all BSDs do report the same structure in ifa_data for AF_LINK addresses from getifaddrs(3) so the information has always been available. Signed-off-by: Roy Marples <roy@marples.name>	2020-10-13 11:32:36 +01:00
Stephen Worley	5588801e7a	zebra: add from_dplane info for NHE creation Add a param to the common NHE creation callstack so we can know if this is one we have read in from the dataplane. We can add some logic on how to handle these special ones later. I considered putting this on a struct as a flag or something but it would have required it being put on struct nexthop since we have some `*_find_nexthop()` functions that can be called when given NHEs from the dataplane. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-12 20:39:28 -04:00
Donald Sharp	ba49e033f5	zebra: zevpn cannot be null passed into zebra_evpn_es_evi_show_one_evpn In zebra_evpn_es_evi_show_vni the zevpn pointer if passed into zebra_evpn_es_evi_show_one_evi will crash if it is null and we have code that checks that it is non null and then immediately calls the function. Add a return to prevent a crash. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-10-11 10:47:37 -04:00
Donald Sharp	bc3cd39bc4	zebra: n->mac is derefed in all paths No need to check for n->mac existence as that all paths leading to this code have n->mac already derefed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-10-11 10:47:37 -04:00
Renato Westphal	8b6b6b694d	Merge pull request #7222 from idryzhov/fix-debug fix debug commands node inconsistencies	2020-10-09 21:58:24 -03:00
Mark Stapp	608a57c08b	zebra: remove 'static' lsp objects Use the same lsp and nexthop/nhlfe objects for 'static' and dynamic LSPs; remove the 'static' objects and their supporting code. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-08 15:50:41 -04:00
Renato Westphal	9cfb2747ad	Merge pull request #7241 from chiragshah6/evpn_dev1 lib: add errmsg to nb rpc	2020-10-07 11:50:52 -03:00
Mark Stapp	628995a30c	Merge pull request #7214 from donaldsharp/more_vrf_usefulness zebra: cleanup zebra_rnh.c debugs	2020-10-06 08:29:45 -04:00
Chirag Shah	9bee02322f	zebra: display rpc error msg to vtysh Zebra's clear duplicate detect command is rpc converted. There is a condition where the cli fails with a human readable message. Using northbound's errmsg buffer to display the error message to the user. Testing: bharat# clear evpn dup-addr vni 1002 ip 2011:11::11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state bharat# clear evpn dup-addr vni 1002 ip 11.11.11.11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-10-05 13:57:54 -07:00
Chirag Shah	f63f5f1947	*: add errmsg to nb rpc Display human readable error message in northbound rpc transaction failure. In case of vtysh nb client, the error message will be displayed to user. Testing: bharat# clear evpn dup-addr vni 1002 ip 11.11.11.11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-10-05 13:15:59 -07:00
Mark Stapp	10da81824a	Merge pull request #7219 from donaldsharp/rib_fixes Rib fixes	2020-10-05 09:11:50 -04:00
Roy Marples	355c74b7e9	BSD: Add whitespace between declaration and code Signed-off-by: Roy Marples <roy@marples.name>	2020-10-05 08:10:42 +01:00
Roy Marples	68cd699df5	BSD: Detect route(4) overflows NetBSD and DragonFlyBSD support reporting of route(4) overflows by setting the socket option SO_RERROR. This is handled the same as on Linux by exiting with a -1 error code. Signed-off-by: Roy Marples <roy@marples.name>	2020-10-04 20:32:26 +01:00
Donald Sharp	5c30573e2a	zebra: cleanup zebra_rnh.c debugs a) Use appropriate %p modifiers for output b) Display vrf name in addition to vrf id c) Remove now unused function Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-02 12:15:03 -04:00
Igor Ryzhov	d7b86ae4fe	vtysh: dynamically generate the list of daemons for commands Some daemons were actually missing from the static definitions: nhrpd, babeld, eigrpd and bfdd. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-10-02 15:06:27 +03:00
Igor Ryzhov	dd73744d8c	*: move "show debugging ..." commands to enable node Use the same node for "show debugging" commands in all daemons. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-10-02 15:06:05 +03:00
Donald Sharp	c17b2d5b6b	zebra: Make connected routes their own entry on the meta_q During quick ifdown / ifup events from the linux kernel there exists a situation where a prefix that has both a kernel route and a static route can be queued up on the meta-q. Suppose the static route happens to point at a connected route for nexthop resolution and we receive a series of quick up/down events after the static route and kernel route are queued up for rib reprocessing. Since the static route and kernel route are queued on meta-q 1 and the connected route is also on meta-q 1 there exists a situation where the connected route will be resolved after the static route fails to resolve, leaving the static route in an unresolved state. Add a new queue level and put connected routes on their own level, since they are the fundamental building blocks of pretty much all the other routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-01 15:17:06 -04:00
Donald Sharp	9d221fac7e	zebra: When processing route_entries ignore unusable routes When zebra is processing routes to determine what to send to the rib, suppose we have two routes (a) a route processed earlier where none of its nexthops were active and (b) a route that has good nexthops but has a worse admin distance. rib_process would not relook at (a)'s nexthops because the ROUTE_ENTRY_CHANGED flag was not true and it would win when compared to (b) because its admin distance was better, leaving us with a state where we would attempt and fail to install route (a) because it was not valid. Modify the code to consider the number of nexthops we have as a determiner of whether we can use the route. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-01 15:17:06 -04:00
Donald Sharp	5c18e66208	zebra: Prevent uninstall attempts when new entry is not happy In rib_process_update_fib, the function is sent two route entries the old ( previously installed ) and new ( the one to install ) When the function detects that the new is unusable because the number of nexthops that are usable for that route is 0, then we uninstall the old route. The problem here is that we should not attempt to uninstall any route that is not owned by FRR. Modify the code to not attempt this behavior Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-30 17:26:44 -04:00
Quentin Young	fb3bc7a74b	Merge pull request #7215 from mjstapp/fix_z_mlag_read zebra: don't touch mlag read event pointer	2020-09-30 16:27:01 -04:00
Mark Stapp	f5d8487244	zebra: don't touch mlag read event pointer Don't touch the mlag read event pointer, it's not safe. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-09-30 13:24:54 -04:00
Mark Stapp	4fdfda2e34	Merge pull request #7167 from donaldsharp/mlag_rd_killer zebra: the mlag_rd_buf_offset variable was write only	2020-09-30 11:40:40 -04:00
Donald Sharp	dbbae374d4	Merge pull request #7192 from deastoe/zebra-fpm-blackhole-abort zebra: fix FPM abort for unreach/prohibit routes	2020-09-29 13:47:38 -04:00
Patrick Ruddy	aa1f6a8795	Merge pull request #7188 from chiragshah6/evpn_dev zebra: EVPN avoid duplicate list-node in l3vni's l2vni-list	2020-09-29 16:33:19 +01:00
Duncan Eastoe	94f7786375	zebra: fix FPM abort for unreach/prohibit routes `b0e9567ed1` fixed an issue whereby zebra would abort while building an update for a blackhole route. The same issue, `assert(data_len)` failing in `zfpm_build_route_updates()`, can be observed when building updates for unreachable and prohibit routes. To address this `netlink_route_info_fill()` is updated to not indicate failure, due to lack of nexthops, for any blackhole routes. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-09-29 12:59:30 +01:00
Donald Sharp	a24d04f4db	zebra: Make nexthop_active check use the same debug When debugging why a route was not successfully installed into the rib, it would be preferable that the end user only have to turn on `debug zebra rib detail` as that is what we have been telling people to do for the last couple of years. Consolidate back to this. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:54:35 -04:00
Donald Sharp	81194feec9	zebra: Add missing reason we could not make an active_nexthop check Add a missing reason as to why we are unable to make an active nexthop check be successful. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:45:19 -04:00
Chirag Shah	c7e83a4efe	zebra: avoid duplication node in l3vni l2vni-list With l2vni flap leading to duplicate entry creation in l3vni's l2vni-list. Use list sorted add with no duplicates. root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 1002 Ticket:CM-31545 Reviewed By: Testing Done: With Fix: Multiple time flaps vni counts remained the same. root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# net show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1002 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-09-28 21:44:30 -07:00
Stephen Worley	66c28560ba	zebra: set NHG/backup NHG pointers on success zapi read Only set the NHG/backup NHG pointers of the caller if the read of the nexthops was successful. Otherwise, we might free when not necessary or double free. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2173535298	lib,zebra,sharpd: add code for backup proto-NHs but disabled Add the zapi code for encoding/decoding of backup nexthops for when we are ready for it, but disable it for now so that we revert to the old way with them. When zebra gets a proto-NHG with a backup in it, we early fail and tell the upper level proto. In this case sharpd. Sharpd then reverts to the old way of installation with the route. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	aaa42e056f	zebra: add type to nhg_prot_del API for sanity check Add type to the nhg_proto_del API params for sanity checking that the types of the route sent by the proto matches the type found with the ID. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	841f77ff04	zebra: free ctx if we skip replace for NHG PROTO routes Free the ctx if we decide we don't need to do anything with this route update. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	3d3a9dc8a7	zebra: limit no re-install to NHG PROTO using routes Limit the not re-installation of routes with the same NHG ID to routes that are using the new NHG PROTO API. This would only include sharpd and EVPN-MH for now. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8f830b8c64	zebra: use list to mark for removal when scoring In scoring our NHEs during shutdown there is a chance we could release multiple NHEs at the same time during one iteration. This can cause memory corruption if the two being released are directly next to each other in the hash table. hash_iterate accounts for releasing one during the iteration but not two by setting hbnext before release but if hbnext is also freed, we obviously can have a problem. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70f3cda6c1	zebra: reject proto NHGs of blackhole/interface Reject proto NHGs of type blackhole/interface for now. We need to think a bit more about how to resolve these given the linux kernel needs to know the Address Family of the routes that will use them and install it with them. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	73937edb73	zebra,sharpd: checkpatch fixes Checkpatch fixes for NHG API patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	ff9aca4f8d	lib,zebra,sharpd: clang format Clang format for NHG API and sharpd patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8b2d3a0fb6	zebra: clean up the NHG proto zapi code a bit Clean up the function names and remove some TODOs that are no longer needed/hacks we used for testing. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	e270f004ae	zebra: multipath number checks with NHG proto Get the multipath number checks working with proto-based NHG message decoding in zapi_msg.c Modify the function that checks this for routes to work without being passed a prefix as is the case with NHG creates. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8155e8c592	zebra: add flag track released state of proto NHGS Add a flag to track the released state of a proto-based NHG. This flag is used to know whether the upper level proto has called the *_del API. Typically, the NHG would just get removed and uninstalled at this point but there is a chance we are being sent it while routes are still being owned or we were sent it multiple times. This flag and associated code handles that. Ticket: CM-30369 Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70347b7ad6	zebra: reply fail on NHG add if not ifindex/onlink We currently don't support ADD/DEL/REPLACE with proto-based NHGs that are not already fully resolved and ifindex/onlink based. If we are handed one that doesn't have ifindex set i.e. recursive, gracefully fail and with a notification. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2c7819b9d4	lib,zebra: fixup NHG notify zapi messaging Make the message parameters align better with other zapi notifications and change the ID to correctly be a uint32. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Donald Sharp	27805e74f0	zebra: Properly set NEXTHOP_FLAG_FIB when skipping install When the dataplane detects that we have no need to reinstall the same route, setup the NEXTHOP_FLAG_FIB appropriately. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	e3b9c0f2f6	zebra: Only install a minimal amount of times The code was installing the nexthop group again using the NLM_F_REPLACE function causing extremely large route installation times. This reduces the time from installing 1 million routes from sharpd with a nhg from > 200 seconds ( where I gave up ) to ~15 seconds on my machine for 32 x ecmp. As a side note 1 million routes using master sharpd takes ~50 seconds to do the same thing. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	72938edfbc	zebra: add logging for NHG ignoring in netlink Add some logging for when we choose to ignore a NHG install for one reason or another. Also, cleanup some of the code using the same accessor functions for the context object. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	9c6c48bc10	zebra: return the proto nhe on del even with refs Return the proto nhe on del even if there are still possible route references. We may get a del before the routes are removed. So we still need to return this to the caller so they can decrement the ref. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	3bccc0f5eb	zebra: fix releasing proto-owned singletons Fix the releasing of proto-owned singletons from the attribute hashed table. Proto-owned singleton nexthops are hashed so they can still be shared therefore they are present in this table and need to be released when the time comes. This check was only matching on zebra proto before. Changed to match IDs in zebra allocated range. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	f651b708e0	zebra: increment the nhg proto score iterator Increment the nhg proto score iterator we used to count leftover NHGs after client disconnect and log. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	1f65568046	zebra: fix refcnt/rib issues in NHG replace/delete Fix some reference counting issues seen when replacing a NHG and deleting one. For replacement, we should end with the same refcnt on the new one. For delete, it's the caller's job to decrement its ref after it's done with it. Further, update routes in the rib with the new pointer after replace. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	68671c7439	zebra: warn if zapi NHG add has no nexthops Log a warning and return if we receive a NHG add via zapi that has no nexthops. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	24db1a7b9a	zebra: handle proto NHG uninstall client disconnect Add code to handle proto-based NHG uninstalling after the owning client disconnects. This is handled the same way as rib_score_proto() but for now we are ignoring instance. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6fae63d2ba	zebra: inc/dec refcount on add/del NHG proto When we add a proto NHG, increment the refcount, when we del a proto NHG, decrement the refcount rather than deleting it explicitly. If the upper level proto is handling it properly, it should get decremented to zero when we receive a NHG del. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2d8a9c544b	zebra: remove unneeded nhg repalce boilerplate Remove some leftover boilerplate from the old replace code path. That code ended up in the add API so it's no longer needed. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	df3cef24c5	zebra: Prevent duplicate re-install If we have received a route that the already existing route is exactly the same, just note that it happened and move on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2b5ecd4ca6	zebra: fix route validity check with NHG ID Fix check in zread where we determine validity of a route based on reading in nexthops/checking ID is present. We had a bad conditional that was determining a route is bad if it's not NHG ID based. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	cd53e3a6e6	zebra: use the passed proto from zapi We were hard coding proto bgp for use with the NHG creation. Use the actual passed one from zapi now that it exists. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	ac5d1091dc	zebra: make NHG ID allocation smarter Make NHG ID allocation smarter so it wraps once it hits the lower bound for protos and performs a lookup to make sure we don't already have that ID in use. It's pretty unlikely we would wrap since the ID space is somewhere around 24 million for Zebra at this point in time. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	54c89c9377	zebra: NHG ID bounds macros Determine the NHG ID spacing and lower bound with ZEBRA_ROUTE_MAX in macros. Directly set the upperbound to be the lower 28bits of the uint32_t ID space (the top 4 are reserved for l2-NHGs). Round that number down a bit to make it more even. Convert all former lower_bound calls to just use the macro. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	16b20ad062	zebra: dont update counter if outside of zebra ID range When we receive a NHG from the kernel, we set the ID counter to that to avoid using IDs owned by the kernel. If we get one outside of zebra's range, let's not update it since it's probably one we created and never deleted anyway. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2c41ef8c17	zebra: special handling for proto-NHG-based routes For now let's assume proto-NHG-based routes are good to go (we assume they are onlink/interface based anyway) and bypass route resolution altogether. Once we determine how to handle recursive nexthop-resolution for proto-NHGs we will revisit this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	50db3f2f1d	zebra: handle zapi routes with NHG ID set Add code to properly handle routes sent with NHG ID rather than a nexthop_group. For now, we separate this from backup nexthop handling since that should probably be added to the nhg_proto_add calls. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	dd1e105fe3	zebra: implement NHG proto replace Implement the ability to replace an NHG sent down from an upper level proto. With proto-owned NHGs, we make the assumption they are ecmp and always treat them as a group to make the replace from 1 -> 2 and 2 -> 1 quite a bit easier. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	08da8bbc22	zebra: hash proto-created but zebra ID spaced NHGS To prevent duplication of singleton NHGs, let's hash any zebra-ID spaced NHGs sent from an upper level proto. These would be singleton NHGs anyway and should prevent duplication of dataplane installs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6c67f41f9e	zebra,lib: command to only install proto-based nexthops Add a command/functionality to only install proto-based nexthops. That is nexthops owned/created by upper level protocols, not ones implicitly created by zebra. There are some scenarios where you would not want zebra to be arbitrarily installing nexthop groups but you still want to use ones you have control over via lib/nexthop_group config and an upper level protocol. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	0885b1e3d9	zebra: implement protocol NHG Add/Del Implement the underlying zebra functionality to Add/Del an internal zebra and kernel NHG. These NHGs are managed by the upper level protocols that send them down via zapi messaging. They are not put into the overall zebra NHG hash table and only put into the ID table. Therefore, different protos cannot and will not share NHGs. The proto is also set appropriately when sent to the kernel. Expand the separation of Zebra hashed/shared/created NHGs and proto created and managed NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	5b27c09d4e	zebra: remove NHG unhashable flag and its code Remove the code for setting a NHG as unhashable. Originally this was to prevent us from attempting to put duplicates from the kernel in our hashtable. Now I think it's better to not use them in the hashtable at all and only track them in the ID table. Routes will still be able to use them if they specify the ID explicitly when sending Zebra the route, but 'normal' routes we hash the nexthop group on will not. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	27141ea94e	lib, zebra: Add ability to send down a nhgid over route install Modify the send down of a route to use the nexthop group id if we have one associated with the route. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	2f35a820bf	lib, zebra: Add ZAPI_NHG_ADD\|DELETE Add the ability to send a NHG from an upper level protocol down to zebra. ZAPI_NHG_ADD encompasses both the addition and replace semantics ( If the id passed down does not exist yet, it's Add, else it's a replace ). Effectively zebra will take this nhg passed down save the nhg in the id hash for nhg's and then create the appropriate nhg's and finally install them into the linux kernel. Notification will be the ZAPI_NHG_NOTIFY_OWNER zapi message for normal success/failure messaging to the installing protocol. This work is being done to allow us to work with EVPN MH which needs the ability to modify NHG's that BGP will own and operate on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	f70da2a390	zebra: Refactor nexthop reading from zapi messages Take the zebra code that reads nexthops and combine it into one function so that when we add zapi messages to send/receive nexthops we can take advantage of this function. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	786a9bd9eb	zebra: Convert zserv_nexthop_num_warn to return bool Allow us to key off the warning if we have one. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donatas Abraitis	b1f476731a	Merge pull request #7169 from donaldsharp/some_code_cleanup Some code cleanup	2020-09-25 10:19:34 +03:00
Sri Mohana Singamsetty	46dd92c522	Merge pull request #7164 from AnuradhaKaruppiah/mh-misc-fixes evpn-mh: miscellaneous cleanup/fixes	2020-09-24 08:37:45 -07:00
Donald Sharp	9781e6a047	zebra: Don't ignore setsockopt return When attempting to limit the amount of data sent from the kernel to FRR, some kernels we can run against may not have this ability in which case the setsockopt will fail. Notice that in the log. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-24 07:42:51 -04:00
Rafael Zalamena	eead0bc46b	zebra: human readable netlink dumps Add new compile option to enable human readable netlink dumps with `debug zebra kernel msgdump`. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-09-23 23:07:02 -03:00
Donald Sharp	00e0d113e5	zebra: the mlag_rd_buf_offset variable was write only The mlag_rd_buf_offset variable was only ever being set to 0 in the mlag_read function and only written in that function. There is no need for this global variable. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 20:36:51 -04:00
Mark Stapp	ccda0eadac	Merge pull request #7155 from donaldsharp/TRAP Offload/Trap	2020-09-23 16:06:37 -04:00
Mark Stapp	4020564a3c	Merge pull request #7163 from donaldsharp/zebra_mlag_bugs Zebra mlag bugs	2020-09-23 15:32:31 -04:00
Anuradha Karuppiah	e378f5020d	zebra: fix use of freed es during zebra shutdown This problem was reported by the sanitizer - ================================================================= ==24764==ERROR: AddressSanitizer: heap-use-after-free on address 0x60d0000115c8 at pc 0x55cb9cfad312 bp 0x7fffa0552140 sp 0x7fffa0552138 READ of size 8 at 0x60d0000115c8 thread T0 #0 0x55cb9cfad311 in zebra_evpn_remote_es_flush zebra/zebra_evpn_mh.c:2041 #1 0x55cb9cfad311 in zebra_evpn_es_cleanup zebra/zebra_evpn_mh.c:2234 #2 0x55cb9cf6ae78 in zebra_vrf_disable zebra/zebra_vrf.c:205 #3 0x7fc8d478f114 in vrf_delete lib/vrf.c:229 #4 0x7fc8d478f99a in vrf_terminate lib/vrf.c:541 #5 0x55cb9ceba0af in sigint zebra/main.c:176 #6 0x55cb9ceba0af in sigint zebra/main.c:130 #7 0x7fc8d4765d20 in quagga_sigevent_process lib/sigevent.c:103 #8 0x7fc8d4787e8c in thread_fetch lib/thread.c:1396 #9 0x7fc8d4708782 in frr_run lib/libfrr.c:1092 #10 0x55cb9ce931d8 in main zebra/main.c:488 #11 0x7fc8d43ee09a in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x2409a) #12 0x55cb9ce94c09 in _start (/usr/lib/frr/zebra+0x8ac09) ================================================================= Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:20:13 -07:00
Anuradha Karuppiah	4d8b658c8c	zebra: evpn-mh: add error logs on ES processing failures Cleanup some of the XXX added during development of MH. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:15:08 -07:00
Donatas Abraitis	5fde152be6	Merge pull request #7112 from AnuradhaKaruppiah/mac-neigh-ht evpn-mh: mac-ip sync hold timers	2020-09-23 21:11:56 +03:00
Patrick Ruddy	a3b5e4fdf7	Merge pull request #7157 from donaldsharp/nhg_speeds zebra: Move debug information gathering to inside guard	2020-09-23 18:42:00 +01:00
Donald Sharp	c19808acad	zebra: Increase the read/write mlag buffer sizes The read/write mlag buffer sizes of 2k were sufficient for ~100 S,G notifications at one go. Increase to 32k to give us 16 times the space. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:13:03 -04:00
Donald Sharp	7692744f2c	zebra: Ensure that message received from mlag will fit If we receive a message that is greater than our buffer size we are in a situation where both the read and write buffers are fubar'ed beyond the end. Assert when we notice this fact. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:26 -04:00
Donald Sharp	f24d9ab667	zebra: modify mlag code to only need 1 stream when generating data The normal pattern of writing the type/length at the beginning of the packet was not being quite followed. Modify the mlag code to respect the proper way of doing things and get rid of a stream_new and copy. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:20 -04:00
Anuradha Karuppiah	2b9e207e0e	zebra: stop neigh hold timer when the neigh is deleted The neigh hold timer was firing after the neigh was deleted resulting in the following crash - [ at ./zebra/zebra_evpn_neigh.h:155 at zebra/zebra_evpn_neigh.c:447 at lib/thread.c:1578 at zebra/main.c:488 ] Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Don Slice	f9f0463fb9	zebra: fix deletion of evpn mh neigh-holdtime Found that the command "evpn mh neigh-holdtime" can be set but not deleted. This fix solves the delete process Signed-off-by: Don Slice <dslice@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Anuradha Karuppiah	41c809b2a8	zebra: changes for configuring mac and neigh holdtime When an ES peer withdraws a MAC-IP route we hold the entry for N seconds to allow an external daemon (neighmgr) to establish host reachability independent of the peer. Add config commands to allow the user to set this holdtime (N). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Donald Sharp	aa178efd49	Merge pull request #7148 from pguibert6WIND/fix_fd_not_closed zebra: fix fd going out of scope leaks the handle	2020-09-23 07:40:14 -04:00
Donatas Abraitis	0ce5baaab1	Merge pull request #7018 from gouault6wind/show_ip_route Clean up in vrf management	2020-09-23 08:45:09 +03:00
Donald Sharp	bed74d178e	zebra: Move debug information gathering to inside guard Let's not make the entire `depend_finds` function pay for the data gathering needed for the debug. There are numerous other places in the code that check the NEXTHOP_FLAG_RECURSIVE and do the same output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 20:47:33 -04:00
Sri Mohana Singamsetty	efdd997dad	Merge pull request #7116 from AnuradhaKaruppiah/mh-neigh-fixes evpn-mh: changes for programming synced neighs as static in the dataplane	2020-09-22 15:45:09 -07:00
Mark Stapp	b6033bd1c1	Merge pull request #7067 from donaldsharp/remove_solaris Remove solaris	2020-09-22 17:04:19 -04:00
Donald Sharp	5a3cf85391	lib, zebra: Add ability to read kernel notice of TRAP/OFFLOAD The linux kernel is getting RTM_F_TRAP and RTM_F_OFFLOAD for kernel routes that have an underlying asic offload. Write the code to receive these notifications from the linux kernel and to store that data for display about the routes. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-22 15:57:43 -04:00
Donald Sharp	4c56ce1cea	zebra: Add basic knowledge of asic offload available Some linux kernels are starting to support the idea of knowledge about the underlying asic. Add a boolean that we can set/unset to track whether or not we think the router has this functionality available. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 15:57:43 -04:00
Philippe Guibert	7529bf8f05	zebra: fix fd going out of scope leaks the handle the file descriptor is closed if it has been locally created. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-22 21:09:13 +02:00

5113 Commits