mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-09-18 10:39:53 +00:00

Author	SHA1	Message	Date
Jafar Al-Gharaibeh	b131b5f539	Merge pull request #7414 from donaldsharp/32bitflags zebra: Consolidate on 32 bits as the flag size for route flags	2020-10-29 21:47:15 -05:00
Donald Sharp	02c671af40	*: Correct spelling stuff Pretty obvious. WE R SPELL GOOD Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 16:16:00 -04:00
Mark Stapp	904e9b0570	zebra: clean up zebra_protodown_rc_str() Clean up api SA warning, use 'const', and replace snprintf+ pointer math with strlcat. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-29 12:03:25 -04:00
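The strlcat approach named in the commit above can be sketched as follows. This is a minimal illustration, not the actual zebra_protodown_rc_str() code: `demo_strlcat`, `demo_rc_str`, and the `DEMO_RC_*` bits are hypothetical names, and the local `demo_strlcat` stands in for strlcat(3) because not every libc provides it.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for strlcat(3): append src to the NUL-terminated
 * dst without writing past size bytes; return the length the full string
 * would have had, so callers can detect truncation. */
static size_t demo_strlcat(char *dst, const char *src, size_t size)
{
	size_t dlen = 0;
	size_t slen = strlen(src);

	while (dlen < size && dst[dlen] != '\0')
		dlen++;
	if (dlen == size)
		return size + slen; /* dst was not terminated within size */

	size_t room = size - dlen - 1;
	size_t ncopy = slen < room ? slen : room;

	memcpy(dst + dlen, src, ncopy);
	dst[dlen + ncopy] = '\0';
	return dlen + slen;
}

/* Hypothetical reason-code bits, for illustration only. */
#define DEMO_RC_STARTUP_DELAY (1u << 0)
#define DEMO_RC_UPLINK_DOWN (1u << 1)

/* Build a comma-separated reason string with no manual pointer math. */
static const char *demo_rc_str(unsigned int rc, char *buf, size_t size)
{
	if (size == 0)
		return buf;
	buf[0] = '\0';
	if (rc & DEMO_RC_STARTUP_DELAY)
		demo_strlcat(buf, "startup-delay", size);
	if (rc & DEMO_RC_UPLINK_DOWN) {
		if (buf[0] != '\0')
			demo_strlcat(buf, ",", size);
		demo_strlcat(buf, "uplink-down", size);
	}
	return buf;
}
```

Each append re-derives the remaining space from the buffer itself, which is what removes the need to track an offset by hand.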
Donald Sharp	acde7f6b8e	zebra: Consolidate on 32 bits as the flag size for route flags When we get a route for installation via any method we should consolidate on 32 bits as the flag size, since we have actually more than 8 bits of data to pass around. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 09:13:59 -04:00
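Why the flag width matters can be shown with a small sketch. The flag names and helper functions here are hypothetical and do not come from zebra; the point is only that a flag bit above bit 7 is silently lost when it passes through an 8-bit field.

```c
#include <stdint.h>

/* Hypothetical flag bits: one fits in 8 bits, one does not. */
#define DEMO_FLAG_LOW (UINT32_C(1) << 0)
#define DEMO_FLAG_HIGH (UINT32_C(1) << 9)

/* Passing flags through an 8-bit field drops every bit above bit 7. */
static uint32_t demo_store_flags_8(uint32_t flags)
{
	uint8_t stored = (uint8_t)flags;

	return stored;
}

/* A 32-bit field carries the full flag word unchanged. */
static uint32_t demo_store_flags_32(uint32_t flags)
{
	uint32_t stored = flags;

	return stored;
}
```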
Donald Sharp	82144f532b	zebra: Don't do expensive string manip if not in debug Modify the code to not load up a string that is only used in debugging unless we are debugging. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-29 09:00:43 -04:00
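The pattern described in the commit above, building a debug-only string only when debugging is enabled, might look roughly like this. All names (`demo_debug_enabled`, `demo_format_prefix`, `demo_install_route`) are assumptions made for the sketch and are not zebra APIs.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical debug switch and call counter, for illustration only. */
static bool demo_debug_enabled;
static unsigned int demo_format_calls;

/* Stand-in for an expensive string-formatting helper. */
static void demo_format_prefix(char *buf, size_t size,
			       const unsigned char addr[4], unsigned int plen)
{
	demo_format_calls++;
	snprintf(buf, size, "%u.%u.%u.%u/%u", (unsigned int)addr[0],
		 (unsigned int)addr[1], (unsigned int)addr[2],
		 (unsigned int)addr[3], plen);
}

static void demo_install_route(const unsigned char addr[4], unsigned int plen)
{
	/* Only pay for the formatting when the string will be printed. */
	if (demo_debug_enabled) {
		char buf[32];

		demo_format_prefix(buf, sizeof(buf), addr, plen);
		fprintf(stderr, "installing %s\n", buf);
	}
	/* ... the actual installation work would go here ... */
}
```

The buffer is also scoped to the debug branch, so it cannot be read by mistake on a path where it was never filled in.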
Russ White	763a60663c	Merge pull request #7371 from AnuradhaKaruppiah/mh-uplink-tracking-1 evpn-mh: uplink tracking and startup delay	2020-10-28 12:13:57 -04:00
Donald Sharp	4d8fa81fbe	Merge pull request #7352 from mjstapp/fix_rt_netlink_indent zebra: fix strange indentation	2020-10-27 20:07:15 -04:00
Anuradha Karuppiah	c36e442c4b	zebra: uplink tracking and startup delay for EVPN-MH Local ethernet segments are held in a protodown or error-disabled state if access to the VxLAN overlay is not ready - 1. When FRR comes up the local-ESs/access-port are kept protodown for the startup-delay duration. During this time the underlay and EVPN routes via it are expected to converge. 2. When all the uplinks/core-links attached to the underlay go down the access-ports are similarly protodowned. The ES-bond protodown state is propagated to each ES-bond member and programmed in the dataplane/kernel (per-bond-member). Configuring uplinks - vtysh -c "conf t" vtysh -c "interface swp4" vtysh -c "evpn mh uplink" Configuring startup delay - vtysh -c "conf t" vtysh -c "evpn mh startup-delay 100" >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> EVPN protodown display - ======================== root@torm-11:mgmt:~# vtysh -c "show evpn" L2 VNIs: 10 L3 VNIs: 3 Advertise gateway mac-ip: No Advertise svi mac-ip: No Duplicate address detection: Disable Detection max-moves 5, time 180 EVPN MH: mac-holdtime: 60s, neigh-holdtime: 60s startup-delay: 180s, start-delay-timer: 00:01:14 <<<<<<<<<<<< uplink-cfg-cnt: 4, uplink-active-cnt: 4 protodown: startup-delay <<<<<<<<<<<<<<<<<<<<<<< >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ES-bond protodown display - =========================== root@torm-11:mgmt:~# vtysh -c "show interface hostbond1" Interface hostbond1 is up, line protocol is down Link ups: 0 last: (never) Link downs: 1 last: 2020/04/26 20:38:03.53 PTM status: disabled vrf: default OS Description: Local Node/s torm-11 and Ports swp5 <==> Remote Node/s hostd-11 and Ports swp1 index 58 metric 0 mtu 9152 speed 4294967295 flags: <UP,BROADCAST,MULTICAST> Type: Ethernet HWaddr: 00:02:00:00:00:35 Interface Type bond Master interface: bridge EVPN-MH: ES id 1 ES sysmac 
00:00:00:00:01:11 protodown: off rc: startup-delay <<<<<<<<<<<<<<<<< >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ES-bond member protodown display - ================================== root@torm-11:mgmt:~# vtysh -c "show interface swp5" Interface swp5 is up, line protocol is down Link ups: 0 last: (never) Link downs: 3 last: 2020/04/26 20:38:03.52 PTM status: disabled vrf: default index 7 metric 0 mtu 9152 speed 10000 flags: <UP,BROADCAST,MULTICAST> Type: Ethernet HWaddr: 00:02:00:00:00:35 Interface Type Other Master interface: hostbond1 protodown: on rc: startup-delay <<<<<<<<<<<<<<<< root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-27 09:34:09 -07:00
Patrick Ruddy	dd51171227	Merge pull request #7158 from AnuradhaKaruppiah/mh-df-election evpn-mh: support for DF election	2020-10-27 16:09:45 +00:00
Mark Stapp	bdd085a874	zebra: fix strange indentation Fix some odd indentation in rt_netlink.c - merge damage, maybe? Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-27 12:03:41 -04:00
Mark Stapp	aa9d75efaf	Merge pull request #7381 from sworleys/NHG-Show-Proto-Filter zebra: add type specifier to show nexthop-group	2020-10-27 11:33:00 -04:00
Donald Sharp	f1dbb1c7e1	zebra: Add uptime to `show evpn mac vni ... detail` Add the uptime a mac entry has been in the system. New Output: eva# show evpn mac vni all detail VNI 1000 #MACs (local and remote) 16 MAC: 4e:2d:f3:75:ff:db ESI: 03:44:38:39:ff:ff:01:00:00:02 Intf: hostbond2(10) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 7a:a4:f2:30:dd:5d ESI: 03:44:38:39:ff:ff:01:00:00:01 Intf: hostbond1(9) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 66:9e:d7:3a:f1:f1 Remote VTEP: 192.168.100.18 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:26 Neighbors: 45.0.0.5 Active fe80::649e:d7ff:fe3a:f1f1 Active MAC: 26:f1:bd:5f:e1:77 Remote ES: 03:44:38:39:ff:ff:02:00:00:02 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:23 Neighbors: No Neighbors MAC: 16:80:eb:c4:43:6d ESI: 03:44:38:39:ff:ff:01:00:00:01 Intf: hostbond1(9) VLAN: 1000 Sync-info: neigh#: 0 peer-active Local Seq: 0 Remote Seq: 0 Uptime: 00:00:28 Neighbors: No Neighbors MAC: 00:00:00:00:00:22 Remote ES: 03:44:38:39:ff:ff:02:00:00:02 Sync-info: neigh#: 0 Local Seq: 0 Remote Seq: 0 Uptime: 00:00:26 Neighbors: No Neighbors Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 16:47:07 -04:00
Donald Sharp	a05111ba3d	zebra: Add uptime to `show evpn arp-cache vni .. detail` Add uptime data to `show evpn arp-cache vni ... detail` command. Effectively when we create a neighbor entry store the time it was created. When we modify the neighbor entry store the time it was modified. Display under detail output and json output. New output: eva# show evpn arp-cache vni all detail VNI 1000 #ARP (IPv4 and IPv6, local and remote) 8 IP: 45.0.0.5 Type: remote State: active Uptime: 00:01:59 MAC: 0a:fd:87:ca:7c:00 Sync-info: - Remote VTEP: 192.168.100.18 Local Seq: 0 Remote Seq: 0 IP: fe80::8fd:87ff:feca:7c00 Type: remote State: active Uptime: 00:01:59 MAC: 0a:fd:87:ca:7c:00 Sync-info: - Remote VTEP: 192.168.100.18 Local Seq: 0 Remote Seq: 0 IP: fe80::14e5:c2ff:fe50:fa59 Type: local State: active Uptime: 00:02:04 MAC: 16:e5:c2:50:fa:59 Sync-info: - Local Seq: 0 Remote Seq: 0 IP: 45.0.0.3 Type: remote State: active Uptime: 00:02:02 MAC: 0e:50:e8:cf:6b:eb Sync-info: - Remote VTEP: 192.168.100.16 Local Seq: 0 Remote Seq: 0 IP: 45.0.0.2 Type: local State: active Uptime: 00:02:05 MAC: 16:e5:c2:50:fa:59 Sync-info: - Local Seq: 0 Remote Seq: 0 IP: fe80::c50:e8ff:fecf:6beb Type: remote State: active Uptime: 00:02:02 MAC: 0e:50:e8:cf:6b:eb Sync-info: - Remote VTEP: 192.168.100.16 Local Seq: 0 Remote Seq: 0 IP: 45.0.0.4 Type: remote State: active Uptime: 00:01:55 MAC: 02:ad:5f:d8:da:80 Sync-info: - Remote VTEP: 192.168.100.17 Local Seq: 0 Remote Seq: 0 IP: fe80::ad:5fff:fed8:da80 Type: remote State: active Uptime: 00:01:55 MAC: 02:ad:5f:d8:da:80 Sync-info: - Remote VTEP: 192.168.100.17 Local Seq: 0 Remote Seq: 0 eva# Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 16:47:07 -04:00
Stephen Worley	a8ad9a89ea	zebra,doc: add type specifier to show nexthop-group Add a type specifier to the `show nexthop-group` command so we can easily filter by type when using proto created nexthop groups. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-26 15:55:02 -04:00
Anuradha Karuppiah	2747f6f786	zebra: cleanup inet_ntoa usage in zebra_evpn_mh.c logs Replaced inet_ntoa with %pI4 in the zebra debugs logs. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:43:05 -07:00
Anuradha Karuppiah	acffa256ba	zebra: add json output for zebra ES, ES-EVI and access vlan dumps 1. ES root@torm-11:mgmt:~# vtysh -c "show evpn es 03:44:38:39:ff:ff:01:00:00:01 json" | python -m json.tool { "accessPort": "hostbond1", "dfPreference": 50000, "esi": "03:44:38:39:ff:ff:01:00:00:01", "flags": [ "local", "remote", "readyForBgp", "bridgePort", "operUp", "nexthopGroupActive" ], "macCount": 10, "nexthopGroup": 536870913, "vniCount": 10, "vteps": [ { "dfAlgorithm": "preference", "dfPreference": 32767, "nexthopId": 268435460, "vtep": "27.0.0.16" }, { "dfAlgorithm": "preference", "dfPreference": 32767, "nexthopId": 268435463, "vtep": "27.0.0.17" } ] } >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 2. ES-EVI - root@torm-11:mgmt:~# vtysh -c "show evpn es-evi vni 1001 detail json" | python -m json.tool [ { "esi": "03:44:38:39:ff:ff:01:00:00:01", "flags": [ "local", "readyForBgp" ], "vni": 1001 }, { "esi": "03:44:38:39:ff:ff:01:00:00:02", "flags": [ "local", "readyForBgp" ], "vni": 1001 }, ] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 3. access-vlan root@torm-11:mgmt:~# vtysh -c "show evpn access-vlan 1001 json" | python -m json.tool { "memberIfCount": 4, "members": [ { "ifName": "hostbond4" }, { "ifName": "hostbond1" }, { "ifName": "hostbond2" }, { "ifName": "hostbond3" } ], "vlan": 1001, "vni": 1001, "vxlanIf": "vx-1001" } root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:21 -07:00
Anuradha Karuppiah	72f2674a95	zebra: handle local-es bridge port association A local ES can be added or removed to a bridge after it is created. When it becomes a bridge port member the dataplane attributes need to be programmed. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:21 -07:00
Anuradha Karuppiah	28e80a037f	zebra: changes for programming SPH, non-DF and backup NHG br-port attrs split horizon filter, non-DF block filter and backup nexthop group are passed as bridge port attributes to the dataplane. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:33:19 -07:00
Anuradha Karuppiah	c60522f702	zebra: dplane APIs for programming evpn-mh access port attributes This includes - 1. non-DF block filter 2. List of es-peers that need to be blocked per-access port (for split horizon filtering) 3. Backup nexthop group to failover local-es via the VxLAN overlay Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:32:51 -07:00
Anuradha Karuppiah	1103c5c6cd	zebra: changes to run DF election 1. DF preference is configurable per-ES ! interface hostbond1 evpn mh es-df-pref 100 >>>>>>>>>>> evpn mh es-id 1 evpn mh es-sys-mac 00:00:00:00:01:11 ! 2. This parameter is sent to BGP and advertised via the ESR. 3. The peer-ESs' DF params are sent to zebra (by BGP) and used for running the DF election. 4. If the local VTEP becomes non-DF on an ES a block filter is programmed in the dataplane to drop de-capsulated BUM packets destined to that ES. Sample output ============= >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> torm-11# sh evpn es Type: L local, R remote, N non-DF ESI Type ES-IF VTEPs 03:00:00:00:00:01:11:00:00:01 LRN hostbond1 27.0.0.16 03:00:00:00:00:01:22:00:00:02 LR hostbond2 27.0.0.16 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> torm-11# sh evpn es 03:00:00:00:00:01:11:00:00:01 ESI: 03:00:00:00:00:01:11:00:00:01 Type: Local,Remote Interface: hostbond1 State: up Ready for BGP: yes VNI Count: 10 MAC Count: 2 DF: status: non-df preference: 100 >>>>>>>> Nexthop group: 0x2000001 VTEPs: 27.0.0.16 df_alg: preference df_pref: 32767 nh: 0x100000d >>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-26 10:32:49 -07:00
Donald Sharp	b467b4b462	zebra: Fix prefix2str buf and some invalid data output in zebra_mpls.c There are several places where prefix2str was used to convert a prefix but they were debug guarded and the buffer was used for flog_err/warn. This would lead to corrupt data being output in the failure cases if debugs were not turned on. Modify the code in zebra_mpls.c to not use prefix2str Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 09:38:33 -04:00
Donald Sharp	2919eea86a	zebra: Replace some prefix2str with %pFX We are loading a buffer with the prefix2str results then using it in the debugs throughout functions. Replace with just using %pFX and remove the buffer. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-26 09:38:29 -04:00
Patrick Ruddy	d7bd0c043c	Merge pull request #7217 from AnuradhaKaruppiah/fix-es-del-regression zebra: fix double clearing of zif->es_info.es	2020-10-26 10:12:54 +00:00
Mark Stapp	874e77acce	Merge pull request #7374 from sworleys/Revert-Revert-NHG-Dependents zebra: Fix the NHG dependents relationship	2020-10-24 16:49:09 -04:00
Mark Stapp	33fa4b14db	Merge pull request #7382 from sworleys/Fix-Msg-Buff zebra: fix uninitialized msg header reading at startup	2020-10-23 18:05:04 -04:00
Quentin Young	939bd6ac52	Merge pull request #6788 from mjstapp/thread_cancel_off *: unify thread/task cancel apis	2020-10-23 15:02:50 -04:00
Stephen Worley	9d06e1219a	zebra: fix uninitialized msg header reading at startup Fixes the valgrind error we were seeing on startup due to initializing the msg header struct: ``` ==2534283== Thread 3 zebra_dplane: ==2534283== Syscall param recvmsg(msg) points to uninitialised byte(s) ==2534283== at 0x4D616DD: recvmsg (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x43107C: netlink_recv_msg (kernel_netlink.c:744) ==2534283== by 0x4330E4: nl_batch_read_resp (kernel_netlink.c:1070) ==2534283== by 0x431D12: nl_batch_send (kernel_netlink.c:1201) ==2534283== by 0x431E8B: kernel_update_multi (kernel_netlink.c:1369) ==2534283== by 0x46019B: kernel_dplane_process_func (zebra_dplane.c:3979) ==2534283== by 0x45EB7F: dplane_thread_loop (zebra_dplane.c:4368) ==2534283== by 0x493F5CC: thread_call (thread.c:1585) ==2534283== by 0x48D3450: fpt_run (frr_pthread.c:303) ==2534283== by 0x48D3D41: frr_pthread_inner (frr_pthread.c:156) ==2534283== by 0x4D56431: start_thread (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x4E709D2: clone (in /usr/lib64/libc-2.31.so) ==2534283== Address 0x85cd850 is on thread 3's stack ==2534283== in frame #2, created by nl_batch_read_resp (kernel_netlink.c:1051) ==2534283== ==2534283== Syscall param recvmsg(msg.msg_control) points to unaddressable byte(s) ==2534283== at 0x4D616DD: recvmsg (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x43107C: netlink_recv_msg (kernel_netlink.c:744) ==2534283== by 0x4330E4: nl_batch_read_resp (kernel_netlink.c:1070) ==2534283== by 0x431D12: nl_batch_send (kernel_netlink.c:1201) ==2534283== by 0x431E8B: kernel_update_multi (kernel_netlink.c:1369) ==2534283== by 0x46019B: kernel_dplane_process_func (zebra_dplane.c:3979) ==2534283== by 0x45EB7F: dplane_thread_loop (zebra_dplane.c:4368) ==2534283== by 0x493F5CC: thread_call (thread.c:1585) ==2534283== by 0x48D3450: fpt_run (frr_pthread.c:303) ==2534283== by 0x48D3D41: frr_pthread_inner (frr_pthread.c:156) ==2534283== by 0x4D56431: start_thread (in /usr/lib64/libpthread-2.31.so) ==2534283== by 0x4E709D2: clone (in /usr/lib64/libc-2.31.so) ==2534283== Address 0xa0 is not stack'd, malloc'd or (recently) free'd ==2534283== ``` Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-23 14:57:29 -04:00
Mark Stapp	5047884528	*: unify thread/event cancel macros Replace all lib/thread cancel macros, use thread_cancel() everywhere. Only the THREAD_OFF macro and thread_cancel() api are supported. Also adjust thread_cancel_async() to NULL caller's pointer (if present). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 12:16:52 -04:00
Mark Stapp	1e4fa7f46c	Merge pull request #7364 from donaldsharp/zebra_nhg_keep zebra: Do not delete nhg's when retain_mode is engaged	2020-10-23 10:28:31 -04:00
Mark Stapp	b3d6bc6ef0	* : update signature of thread_cancel api Change thread_cancel to take a ** to an event, NULL-check before dereferencing, and NULL the caller's pointer. Update many callers to use the new signature. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 08:59:34 -04:00
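The signature change described in the commit above (take a pointer to the caller's event pointer, NULL-check it, then NULL it) can be sketched like this. `struct demo_event` and `demo_event_cancel` are hypothetical names for illustration and are not FRR's lib/thread types or functions.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical event type; not FRR's thread/event structure. */
struct demo_event {
	bool scheduled;
};

/* Cancel the event that *evp points to, if any, and clear the caller's
 * pointer so it cannot be reused after cancellation. */
static void demo_event_cancel(struct demo_event **evp)
{
	if (evp == NULL || *evp == NULL)
		return;

	(*evp)->scheduled = false;
	*evp = NULL;
}
```

Because the callee clears the caller's pointer, a second cancel on the same variable becomes a harmless no-op instead of touching a stale event.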
Stephen Worley	7fa239f165	zebra: disable dependent backpointers for backup nexthops Because the backup nexthop groups currently are more like pseudo-NHEs (they don't have IDs and are not inserted into the ID table or hashed), they can't really have this depends/dependents relationship yet in both directions. Some work needs to be done there to make them more like first class citizens like "normal" NHGs to enable this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-22 18:10:44 -04:00
Stephen Worley	8459128259	Revert "Revert "zebra: fix NHE dependents backpointer relationship"" This reverts commit `a682deea0f`.	2020-10-22 18:09:44 -04:00
Mark Stapp	9bcef951be	zebra: replace inet_ntoa Stop using inet_ntoa - use %pI4 or inet_ntop instead Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-22 13:37:25 -04:00
Donald Sharp	b1b07ef5a6	zebra: Do not delete nhg's when retain_mode is engaged When `-r` is specified to zebra, on shutdown we should not remove any routes from the fib. This was a problem with nhg's on shutdown due to their ref-count behavior. Introduce a methodology where on shutdown we don't mess with the nexthop groups in the kernel. That way on next startup things will be ok. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-22 08:02:33 -04:00
Donatas Abraitis	2dbe669bdf	*: Convert prefix2str to %pFX Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-10-22 09:07:41 +03:00
Stephen Worley	a682deea0f	Revert "zebra: fix NHE dependents backpointer relationship" This reverts commit `f9f9466e04`.	2020-10-20 17:11:35 -04:00
Donald Sharp	203098301c	Merge pull request #7348 from mjstapp/fix_router_id_lists zebra: clean up all router id lists	2020-10-20 15:53:52 -04:00
Donatas Abraitis	9072f5c89a	Merge pull request #7311 from donaldsharp/table_lock_count Abstract rn->lock accessing and cleanup usage to %pFX and %pRN	2020-10-20 16:04:15 +03:00
Mark Stapp	cdc09a4b04	zebra: clean up all router id lists Clean up the ipv6 router-id lists associated with a zvrf - these were being leaked. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-20 08:48:12 -04:00
Mark Stapp	658084c229	Merge pull request #7289 from sworleys/NHG-Crash-Start zebra: a couple NHG fixes	2020-10-20 08:41:36 -04:00
Stephen Worley	dc1c436278	zebra: add alias for "show ip/ipv6 ro" Add an alias so people can still type `show ip ro`. It became ambiguous in a recent release. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-19 14:08:18 -04:00
Stephen Worley	f9f9466e04	zebra: fix NHE dependents backpointer relationship Apparently the dependents backpointer trees for singletons got broken at some point and we never noticed. There is not really any code making use of this right now so not surprising but let's go ahead and fix it for zebra and proto NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-19 12:52:39 -04:00
Anuradha Karuppiah	ab06b03315	zebra: fix double clearing of zif->es_info.es This problem was accidentally introduced as a part of another fixup - [ commit `e378f5020d` (anuradhak/mh-misc-fixes, mh-misc-fixes) Author: Anuradha Karuppiah <anuradhak@cumulusnetworks.com> Date: Tue Sep 15 16:50:14 2020 -0700 zebra: fix use of freed es during zebra shutdown ] zif->es_info.es is cleared as a part of zebra_evpn_es_local_info_clear so it cannot be passed around as a pointer from zebra_evpn_local_es_update/del. Because of this bug removing ES from an interface resulted in a zebra crash. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-10-19 09:36:44 -07:00
Donald Sharp	c85b63238a	Merge pull request #7333 from mjstapp/fix_multi_connected zebra: support multiple connected subnets on an interface	2020-10-18 08:29:19 -04:00
Donald Sharp	c10e14e96d	*: Create/Use accessor functions for lock count Create appropriate accessor functions for the rn->lock data. We should be accessing this data through accessor functions since it is private data to the data structure. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-17 13:39:10 -04:00
Donald Sharp	7e26b9d4a2	zebra: Fix use after free in debug path When zebra is running with debugs turned on there is a use after free reported by the address sanitizer: 2020/10/16 12:58:02 ZEBRA: rib_delnode: (0:254):4.5.6.16/32: rn 0x60b000026f20, re 0x6080000131a0, removing 2020/10/16 12:58:02 ZEBRA: rib_meta_queue_add: (0:254):4.5.6.16/32: queued rn 0x60b000026f20 into sub-queue 3 ================================================================= ==3101430==ERROR: AddressSanitizer: heap-use-after-free on address 0x608000011d28 at pc 0x555555705ab6 bp 0x7fffffffdab0 sp 0x7fffffffdaa8 READ of size 8 at 0x608000011d28 thread T0 #0 0x555555705ab5 in re_list_const_first zebra/rib.h:222 #1 0x555555705b54 in re_list_first zebra/rib.h:222 #2 0x555555711a4f in process_subq_route zebra/zebra_rib.c:2248 #3 0x555555711d2e in process_subq zebra/zebra_rib.c:2286 #4 0x555555711ec7 in meta_queue_process zebra/zebra_rib.c:2320 #5 0x7ffff74701f7 in work_queue_run lib/workqueue.c:291 #6 0x7ffff7450e9c in thread_call lib/thread.c:1581 #7 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #8 0x55555561a578 in main zebra/main.c:455 #9 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 #10 0x5555555e3429 in _start (/usr/lib/frr/zebra+0x8f429) 0x608000011d28 is located 8 bytes inside of 88-byte region [0x608000011d20,0x608000011d78) freed by thread T0 here: #0 0x7ffff768bb6f in __interceptor_free (/lib/x86_64-linux-gnu/libasan.so.6+0xa9b6f) #1 0x7ffff739ccad in qfree lib/memory.c:129 #2 0x555555709ee4 in rib_gc_dest zebra/zebra_rib.c:746 #3 0x55555570ca76 in rib_process zebra/zebra_rib.c:1240 #4 0x555555711a05 in process_subq_route zebra/zebra_rib.c:2245 #5 0x555555711d2e in process_subq zebra/zebra_rib.c:2286 #6 0x555555711ec7 in meta_queue_process zebra/zebra_rib.c:2320 #7 0x7ffff74701f7 in work_queue_run lib/workqueue.c:291 #8 0x7ffff7450e9c in thread_call lib/thread.c:1581 #9 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #10 0x55555561a578 in main zebra/main.c:455 #11 0x7ffff7079cc9 
in __libc_start_main ../csu/libc-start.c:308 previously allocated by thread T0 here: #0 0x7ffff768c037 in calloc (/lib/x86_64-linux-gnu/libasan.so.6+0xaa037) #1 0x7ffff739cb98 in qcalloc lib/memory.c:110 #2 0x555555712ace in zebra_rib_create_dest zebra/zebra_rib.c:2515 #3 0x555555712c6c in rib_link zebra/zebra_rib.c:2576 #4 0x555555712faa in rib_addnode zebra/zebra_rib.c:2607 #5 0x555555715bf0 in rib_add_multipath_nhe zebra/zebra_rib.c:3012 #6 0x555555715f56 in rib_add_multipath zebra/zebra_rib.c:3049 #7 0x55555571788b in rib_add zebra/zebra_rib.c:3327 #8 0x5555555e584a in connected_up zebra/connected.c:254 #9 0x5555555e42ff in connected_announce zebra/connected.c:94 #10 0x5555555e4fd3 in connected_update zebra/connected.c:195 #11 0x5555555e61ad in connected_add_ipv4 zebra/connected.c:340 #12 0x5555555f26f5 in netlink_interface_addr zebra/if_netlink.c:1213 #13 0x55555560f756 in netlink_information_fetch zebra/kernel_netlink.c:350 #14 0x555555612e49 in netlink_parse_info zebra/kernel_netlink.c:941 #15 0x55555560f9f1 in kernel_read zebra/kernel_netlink.c:402 #16 0x7ffff7450e9c in thread_call lib/thread.c:1581 #17 0x7ffff738eaf7 in frr_run lib/libfrr.c:1099 #18 0x55555561a578 in main zebra/main.c:455 #19 0x7ffff7079cc9 in __libc_start_main ../csu/libc-start.c:308 SUMMARY: AddressSanitizer: heap-use-after-free zebra/rib.h:222 in re_list_const_first This is happening because we are using the dest pointer after a call into rib_gc_dest. In process_subq_route, we call rib_process() and if the dest is deleted dest pointer is now garbage. We must reload the dest pointer in this case. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-17 08:55:26 -04:00
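The fix described at the end of the commit above (reload the dest pointer after a call that may have freed it) can be illustrated with a reduced sketch. The types and functions are hypothetical stand-ins, not the real route_node, rib_dest_t, rib_process() or process_subq_route() code.

```c
#include <stdlib.h>

/* Hypothetical, heavily reduced stand-ins for a table node and its dest. */
struct demo_dest {
	int nroutes;
};

struct demo_node {
	struct demo_dest *dest;
};

/* Processing may garbage-collect the dest and clear the node's pointer. */
static void demo_process(struct demo_node *rn)
{
	struct demo_dest *dest = rn->dest;

	if (dest != NULL && dest->nroutes == 0) {
		free(dest);
		rn->dest = NULL;
	}
}

/* Returns the number of routes left, or -1 if the dest no longer exists. */
static int demo_process_and_inspect(struct demo_node *rn)
{
	struct demo_dest *dest = rn->dest;

	if (dest == NULL)
		return -1;

	demo_process(rn);

	/* The cached pointer may now be dangling: reload it from the node. */
	dest = rn->dest;
	if (dest == NULL)
		return -1;

	return dest->nroutes;
}
```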
Mark Stapp	87009d7df0	zebra: support multiple connected subnets on an interface We support configuration of multiple addresses in the same subnet on a single interface: make sure that zebra supports multiple instances of the corresponding connected route. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-16 16:46:33 -04:00
Mark Stapp	027b3ca2e0	Merge pull request #7244 from donaldsharp/mlag_backout_and_fix Mlag backout and fix	2020-10-14 08:30:54 -04:00
Donald Sharp	4fe30ff1eb	Merge pull request #7298 from mjstapp/quiet_opaque_debugs zebra: quiet the zebra opaque message debugs	2020-10-14 07:27:27 -04:00
Donald Sharp	ca3491262b	zebra: Isolate mlag_rd_buf_offset to the actual using function Isolate the mlag_rd_buf_offset variable to the actual used function, instead of having it a global. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-13 16:02:05 -04:00
Donald Sharp	ded3e3e39c	Revert "zebra: the mlag_rd_buf_offset variable was write only" This reverts commit `00e0d113e5`.	2020-10-13 15:57:54 -04:00
Donald Sharp	82b4a8bf2c	Merge pull request #7258 from mjstapp/zebra_remove_slsp zebra: remove 'static' lsp objects	2020-10-13 15:51:18 -04:00
Mark Stapp	674afc2b0a	zebra: quiet the zebra opaque message debugs Put most of the debugs about opaque ZAPI messages under 'detail' to reduce the noise. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-13 14:07:17 -04:00
Donald Sharp	c8c5009ec5	Merge pull request #7288 from rsmarples/BSD-link_state BSD: ifi_link_state is the link state	2020-10-13 13:43:07 -04:00
Stephen Worley	475852b263	zebra: only track NHEs from the dataplane for ID usage Let's just track the NHEs we get from the kernel(dplane) for ID usage with internal routes. I tried to be smart originally and allow them to be re-used internal to zebra but it's proving to cause more bugs than it's worth. This doesn't break any functionality. It just means we won't use NHEs we get from the kernel with our routes, we will create new ones. Decided this based on various bugs seen, with the latest one being on startup with this kernel state: ``` [root@alfred frr-2]# ip next ls id 15 via 192.168.161.1 dev doof scope link proto zebra id 17 group 15 proto zebra [root@alfred frr-2]# ip ro show 3.3.3.1 3.3.3.1 nhid 17 via 192.168.161.1 dev doof ``` Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-13 11:23:57 -04:00
Emanuele Bovisio	2a3a97be8c	doc, zebra: remove keep_kernel option everywhere remove all remaining parts related to keep_kernel option Signed-off-by: Emanuele Bovisio <emanuele.bovisio@eolo.it>	2020-10-13 12:59:50 +02:00
Roy Marples	98f3df554b	zebra: ifi_link_state is the link state SIOCGIFMEDIA returns the media state. SIOCGIFDATA returns interface data which includes the link state. While the status of the former is usually indicative of the latter, this is not always the case. In fact some recent net80211 changes in at least NetBSD and OpenBSD have MONITOR media set to active but the link status set to DOWN. All interfaces will return link state with SIOCGIFDATA, unlike SIOCGIFMEDIA. However not all BSD's support SIOCGIFDATA - it has recently been accepted into FreeBSD-13. However, all BSD's do report the same structure in ifa_data for AF_LINK addresses from getifaddrs(3) so the information has always been available. Signed-off-by: Roy Marples <roy@marples.name>	2020-10-13 11:32:36 +01:00
Stephen Worley	5588801e7a	zebra: add from_dplane info for NHE creation Add a param to the common NHE creation callstack so we can know if this is one we have read in from the dataplane. We can add some logic on how to handle these special ones later. I considered putting this on a struct as a flag or something but it would have required it being put on struct nexthop since we have some `*_find_nexthop()` functions that can be called when given NHEs from the dataplane. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-10-12 20:39:28 -04:00
Donald Sharp	ba49e033f5	zebra: zevpn cannot be null passed into zebra_evpn_es_evi_show_one_evpn In zebra_evpn_es_evi_show_vni the zevpn pointer if passed into zebra_evpn_es_evi_show_one_evi will crash if it is null and we have code that checks that it is non null and then immediately calls the function. Add a return to prevent a crash. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-10-11 10:47:37 -04:00
Donald Sharp	bc3cd39bc4	zebra: n->mac is derefed in all paths No need to check for n->mac existence as all paths leading to this code have n->mac already derefed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-10-11 10:47:37 -04:00
Renato Westphal	8b6b6b694d	Merge pull request #7222 from idryzhov/fix-debug fix debug commands node inconsistencies	2020-10-09 21:58:24 -03:00
Mark Stapp	608a57c08b	zebra: remove 'static' lsp objects Use the same lsp and nexthop/nhlfe objects for 'static' and dynamic LSPs; remove the 'static' objects and their supporting code. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-08 15:50:41 -04:00
Renato Westphal	9cfb2747ad	Merge pull request #7241 from chiragshah6/evpn_dev1 lib: add errmsg to nb rpc	2020-10-07 11:50:52 -03:00
Mark Stapp	628995a30c	Merge pull request #7214 from donaldsharp/more_vrf_usefulness zebra: cleanup zebra_rnh.c debugs	2020-10-06 08:29:45 -04:00
Chirag Shah	9bee02322f	zebra: display rpc error msg to vtysh Zebra's clear duplicate detect command is rpc converted. There is a condition where the cli fails with a human readable message. Using northbound's errmsg buffer to display error message to user. Testing: bharat# clear evpn dup-addr vni 1002 ip 2011:11::11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state bharat# clear evpn dup-addr vni 1002 ip 11.11.11.11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-10-05 13:57:54 -07:00
Chirag Shah	f63f5f1947	*: add errmsg to nb rpc Display human readable error message in northbound rpc transaction failure. In case of vtysh nb client, the error message will be displayed to user. Testing: bharat# clear evpn dup-addr vni 1002 ip 11.11.11.11 Error type: generic error Error description: Requested IP's associated MAC aa:aa:aa:aa:aa:aa is still in duplicate state Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-10-05 13:15:59 -07:00
Mark Stapp	10da81824a	Merge pull request #7219 from donaldsharp/rib_fixes Rib fixes	2020-10-05 09:11:50 -04:00
Roy Marples	355c74b7e9	BSD: Add whitespace between declaration and code Signed-off-by: Roy Marples <roy@marples.name>	2020-10-05 08:10:42 +01:00
Roy Marples	68cd699df5	BSD: Detect route(4) overflows NetBSD and DragonFlyBSD support reporting of route(4) overflows by setting the socket option SO_RERROR. This is handled the same as on Linux by exiting with a -1 error code. Signed-off-by: Roy Marples <roy@marples.name>	2020-10-04 20:32:26 +01:00
Donald Sharp	5c30573e2a	zebra: cleanup zebra_rnh.c debugs a) Use appropriate %p modifiers for output b) Display vrf name in addition to vrf id c) Remove now unused function Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-02 12:15:03 -04:00
Igor Ryzhov	d7b86ae4fe	vtysh: dynamically generate the list of daemons for commands Some daemons were actually missing from the static definitions: nhrpd, babeld, eigrpd and bfdd. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-10-02 15:06:27 +03:00
Igor Ryzhov	dd73744d8c	*: move "show debugging ..." commands to enable node Use the same node for "show debugging" commands in all daemons. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2020-10-02 15:06:05 +03:00
Donald Sharp	c17b2d5b6b	zebra: Make connected routes their own entry on the meta_q During quick ifdown / ifup events from the linux kernel there exists a situation where a prefix that has both a kernel route and a static route can be queued up on the meta-q. Suppose the static route points at a connected route for nexthop resolution and we receive a series of quick up/down events after the static route and kernel route are queued up for rib reprocessing. Since the static route and kernel route are queued on meta-q 1 and the connected route is also on meta-q 1 there exists a situation where the connected route will be resolved after the static route fails to resolve, leaving the static route in an unresolved state. Add a new queue level and put connected routes on their own level, since they are the fundamental building blocks of pretty much all the other routes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-01 15:17:06 -04:00
Donald Sharp	9d221fac7e	zebra: When processing route_entries ignore unusable routes When zebra is processing routes to determine what to send to the rib, suppose we have two routes (a) a route processed earlier for which none of its nexthops were active and (b) a route that has good nexthops but has a worse admin distance. rib_process would not relook at (a)'s nexthops because the ROUTE_ENTRY_CHANGED flag was not true and it would win when compared to (b) because its admin distance was better, leaving us with a state where we would attempt and fail to install route (a) because it was not valid. Modify the code to consider the number of nexthops we have as a determiner if we can use the route. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-01 15:17:06 -04:00
Donald Sharp	5c18e66208	zebra: Prevent uninstall attempts when new entry is not happy In rib_process_update_fib, the function is sent two route entries: the old ( previously installed ) and the new ( the one to install ). When the function detects that the new is unusable because the number of nexthops that are usable for that route is 0, then we uninstall the old route. The problem here is that we should not attempt to uninstall any route that is not owned by FRR. Modify the code to not attempt this behavior. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-30 17:26:44 -04:00
Quentin Young	fb3bc7a74b	Merge pull request #7215 from mjstapp/fix_z_mlag_read zebra: don't touch mlag read event pointer	2020-09-30 16:27:01 -04:00
Mark Stapp	f5d8487244	zebra: don't touch mlag read event pointer Don't touch the mlag read event pointer, it's not safe. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-09-30 13:24:54 -04:00
Mark Stapp	4fdfda2e34	Merge pull request #7167 from donaldsharp/mlag_rd_killer zebra: the mlag_rd_buf_offset variable was write only	2020-09-30 11:40:40 -04:00
Donald Sharp	dbbae374d4	Merge pull request #7192 from deastoe/zebra-fpm-blackhole-abort zebra: fix FPM abort for unreach/prohibit routes	2020-09-29 13:47:38 -04:00
Patrick Ruddy	aa1f6a8795	Merge pull request #7188 from chiragshah6/evpn_dev zebra: EVPN avoid duplicate list-node in l3vni's l2vni-list	2020-09-29 16:33:19 +01:00
Duncan Eastoe	94f7786375	zebra: fix FPM abort for unreach/prohibit routes `b0e9567ed1` fixed an issue whereby zebra would abort while building an update for a blackhole route. The same issue, `assert(data_len)` failing in `zfpm_build_route_updates()`, can be observed when building updates for unreachable and prohibit routes. To address this `netlink_route_info_fill()` is updated to not indicate failure, due to lack of nexthops, for any blackhole routes. Signed-off-by: Duncan Eastoe <duncan.eastoe@att.com>	2020-09-29 12:59:30 +01:00
Donald Sharp	a24d04f4db	zebra: Make nexthop_active check use the same debug When debugging why a route was not successfully installed into the rib, it would be preferable that the end user only have to turn on `debug zebra rib detail` as that is what we have been telling people to do for the last couple of years. Consolidate back to this. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:54:35 -04:00
Donald Sharp	81194feec9	zebra: Add missing reason we could not make an active_nexthop check Add a missing reason as to why we are unable to make an active nexthop check be successful. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-29 07:45:19 -04:00
Chirag Shah	c7e83a4efe	zebra: avoid duplication node in l3vni l2vni-list With l2vni flap leading to duplicate entry creation in l3vni's l2vni-list. Use list sorted add with no duplicates. root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1000 1000 0 0 1002 1002 Ticket:CM-31545 Reviewed By: Testing Done: With Fix: Multiple time flaps vni counts remained the same. root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# ip link set down vx-1002 root@TORC11:mgmt:~# ip link set up vx-1002 root@TORC11:mgmt:~# net show evpn vni 4001 VNI: 4001 Type: L3 Tenant VRF: vrf1 State: Up ... L2 VNIs: 1000 1002 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2020-09-28 21:44:30 -07:00
Stephen Worley	66c28560ba	zebra: set NHG/backup NHG pointers on success zapi read Only set the NHG/backup NHG pointers of the caller if the read of the nexthops was successful. Otherwise, we might free when not necessary or double free. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2173535298	lib,zebra,sharpd: add code for backup proto-NHs but disabled Add the zapi code for encoding/decoding of backup nexthops for when we are ready for it, but disable it for now so that we revert to the old way with them. When zebra gets a proto-NHG with a backup in it, we fail early and tell the upper level proto, in this case sharpd. Sharpd then reverts to the old way of installation with the route. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	aaa42e056f	zebra: add type to nhg_prot_del API for sanity check Add type to the nhg_proto_del API params for sanity checking that the types of the route sent by the proto matches the type found with the ID. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	841f77ff04	zebra: free ctx if we skip replace for NHG PROTO routes Free the ctx if we decide we don't need to do anything with this route update. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	3d3a9dc8a7	zebra: limit no re-install to NHG PROTO using routes Limit the not re-installation of routes with the same NHG ID to routes that are using the new NHG PROTO API. This would only include sharpd and EVPN-MH for now. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8f830b8c64	zebra: use list to mark for removal when scoring In scoring our NHEs during shutdown there is a chance we could release multiple NHEs at the same time during one iteration. This can cause memory corruption if the two being released are directly next to each other in the hash table. hash_iterate accounts for releasing one during the iteration but not two by setting hbnext before release but if hbnext is also freed, we obviously can have a problem. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70f3cda6c1	zebra: reject proto NHGs of blackhole/interface Reject proto NHGs of type blackhole/interface for now. We need to think a bit more about how to resolve these given the linux kernel needs to know the Address Family of the routes that will use them and install it with them. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	73937edb73	zebra,sharpd: checkpatch fixes Checkpatch fixes for NHG API patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	ff9aca4f8d	lib,zebra,sharpd: clang format Clang format for NHG API and sharpd patches. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8b2d3a0fb6	zebra: clean up the NHG proto zapi code a bit Clean up the function names and remove some TODOs that are no longer needed/hacks we used for testing. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	e270f004ae	zebra: multipath number checks with NHG proto Get the multipath number checks working with proto-based NHG message decoding in zapi_msg.c Modify the function that checks this for routes to work without being passed a prefix as is the case with NHG creates. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	8155e8c592	zebra: add flag track released state of proto NHGS Add a flag to track the released state of a proto-based NHG. This flag is used to know whether the upper level proto has called the *_del API. Typically, the NHG would just get removed and uninstalled at this point but there is a chance we are being sent it while routes are still being owned or we were sent it multiple times. This flag and associated code handles that. Ticket: CM-30369 Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	70347b7ad6	zebra: reply fail on NHG add if not ifindex/onlink We currently don't support ADD/DEL/REPLACE with proto-based NHGs that are not already fully resolved and ifindex/onlink based. If we are handed one that doesn't have ifindex set, i.e. recursive, gracefully fail with a notification. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Stephen Worley	2c7819b9d4	lib,zebra: fixup NHG notify zapi messaging Make the message parameters align better with other zapi notifications and change the ID to correctly be a uint32. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:41:00 -04:00
Donald Sharp	27805e74f0	zebra: Properly set NEXTHOP_FLAG_FIB when skipping install When the dataplane detects that we have no need to reinstall the same route, setup the NEXTHOP_FLAG_FIB appropriately. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	e3b9c0f2f6	zebra: Only install a minimal amount of times The code was installing the nexthop group again using the NLM_F_REPLACE function causing extremely large route installation times. This reduces the time for installing 1 million routes from sharpd with a nhg from > 200 seconds ( where I gave up ) to ~15 seconds on my machine for 32 x ecmp. As a side note 1 million routes using master sharpd takes ~50 seconds to do the same thing. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	72938edfbc	zebra: add logging for NHG ignoring in netlink Add some logging for when we choose to ignore a NHG install for one reason or another. Also, cleanup some of the code using the same accessor functions for the context object. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	9c6c48bc10	zebra: return the proto nhe on del even with refs Return the proto nhe on del even if there are still possible route references. We may get a del before the routes are removed. So we still need to return this to the caller so they can decrement the ref. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	3bccc0f5eb	zebra: fix releasing proto-owned singletons Fix the releasing of proto-owned singletons from the attribute hashed table. Proto-owned singleton nexthops are hashed so they can still be shared therefore they are present in this table and need to be released when the time comes. This check was only matching on zebra proto before. Changed to match IDs in zebra allocated range. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	f651b708e0	zebra: increment the nhg proto score iterator Increment the nhg proto score iterator we used to count leftover NHGs after client disconnect and log. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	1f65568046	zebra: fix refcnt/rib issues in NHG replace/delete Fix some reference counting issues seen when replacing a NHG and deleting one. For replacement, we should end with the same refcnt on the new one. For delete, it's the caller's job to decrement its ref after it's done with it. Further, update routes in the rib with the new pointer after replace. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	68671c7439	zebra: warn if zapi NHG add has no nexthops Log a warning and return if we receive a NHG add via zapi that has no nexthops. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	24db1a7b9a	zebra: handle proto NHG uninstall client disconnect Add code to handle proto-based NHG uninstalling after the owning client disconnects. This is handled the same way as rib_score_proto() but for now we are ignoring instance. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6fae63d2ba	zebra: inc/dec refcount on add/del NHG proto When we add a proto NHG, increment the refcount, when we del a proto NHG, decrement the refcount rather than deleting it explicitly. If the upper level proto is handling it properly, it should get decremented to zero when we receive a NHG del. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2d8a9c544b	zebra: remove unneeded nhg replace boilerplate Remove some leftover boilerplate from the old replace code path. That code ended up in the add API so it's no longer needed. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	df3cef24c5	zebra: Prevent duplicate re-install If we have received a route that is exactly the same as the already existing route, just note that it happened and move on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2b5ecd4ca6	zebra: fix route validity check with NHG ID Fix check in zread where we determine validity of a route based on reading in nexthops/checking ID is present. We had a bad conditional that was determining a route is bad if it's not NHG ID based. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	cd53e3a6e6	zebra: use the passed proto from zapi We were hard coding proto bgp for use with the NHG creation. Use the actual passed one from zapi now that it exists. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	ac5d1091dc	zebra: make NHG ID allocation smarter Make NHG ID allocation smarter so it wraps once it hits the lower bound for protos and performs a lookup to make sure we don't already have that ID in use. It's pretty unlikely we would wrap since the ID space is somewhere around 24 million for Zebra at this point in time. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	54c89c9377	zebra: NHG ID bounds macros Determine the NHG ID spacing and lower bound with ZEBRA_ROUTE_MAX in macros. Directly set the upperbound to be the lower 28bits of the uint32_t ID space (the top 4 are reserved for l2-NHGs). Round that number down a bit to make it more even. Convert all former lower_bound calls to just use the macro. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	16b20ad062	zebra: don't update counter if outside of zebra ID range When we receive a NHG from the kernel, we set the ID counter to that to avoid using IDs owned by the kernel. If we get one outside of zebra's range, let's not update it since it's probably one we created and never deleted anyway. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	2c41ef8c17	zebra: special handling for proto-NHG-based routes For now let's assume proto-NHG-based routes are good to go (we assume they are onlink/interface based anyway) and bypass route resolution altogether. Once we determine how to handle recursive nexthop-resolution for proto-NHGs we will revisit this. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	50db3f2f1d	zebra: handle zapi routes with NHG ID set Add code to properly handle routes sent with NHG ID rather than a nexthop_group. For now, we separate this from backup nexthop handling since that should probably be added to the nhg_proto_add calls. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	dd1e105fe3	zebra: implement NHG proto replace Implement the ability to replace an NHG sent down from an upper level proto. With proto-owned NHGs, we make the assumption they are ecmp and always treat them as a group to make the replace from 1 -> 2 and 2 -> 1 quite a bit easier. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	08da8bbc22	zebra: hash proto-created but zebra ID spaced NHGS To prevent duplication of singleton NHGs, let's hash any zebra-ID spaced NHGs sent from an upper level proto. These would be singleton NHGs anyway and should prevent duplication of dataplane installs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	6c67f41f9e	zebra,lib: command to only install proto-based nexthops Add a command/functionality to only install proto-based nexthops. That is nexthops owned/created by upper level protocols, not ones implicitly created by zebra. There are some scenarios where you would not want zebra to be arbitrarily installing nexthop groups but you still want to use ones you have control over via lib/nexthop_group config and an upper level protocol. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	0885b1e3d9	zebra: implement protocol NHG Add/Del Implement the underlying zebra functionality to Add/Del an internal zebra and kernel NHG. These NHGs are managed by the upper level protocols that send them down via zapi messaging. They are not put into the overall zebra NHG hash table and only put into the ID table. Therefore, different protos cannot and will not share NHGs. The proto is also set appropriately when sent to the kernel. Expand the separation of Zebra hashed/shared/created NHGs and proto created and managed NHGs. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Stephen Worley	5b27c09d4e	zebra: remove NHG unhashable flag and its code Remove the code for setting a NHG as unhashable. Originally this was to prevent us from attempting to put duplicates from the kernel in our hashtable. Now I think it's better to not use them in the hashtable at all and only track them in the ID table. Routes will still be able to use them if they specify the ID explicitly when sending Zebra the route, but 'normal' routes we hash the nexthop group on will not. Signed-off-by: Stephen Worley <sworley@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	27141ea94e	lib, zebra: Add ability to send down a nhgid over route install Modify the send down of a route to use the nexthop group id if we have one associated with the route. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	2f35a820bf	lib, zebra: Add ZAPI_NHG_ADD\|DELETE Add the ability to send a NHG from an upper level protocol down to zebra. ZAPI_NHG_ADD encompasses both the addition and replace semantics ( If the id passed down does not exist yet, it's Add, else it's a replace ). Effectively zebra will take this nhg passed down save the nhg in the id hash for nhg's and then create the appropriate nhg's and finally install them into the linux kernel. Notification will be the ZAPI_NHG_NOTIFY_OWNER zapi message for normal success/failure messaging to the installing protocol. This work is being done to allow us to work with EVPN MH which needs the ability to modify NHG's that BGP will own and operate on. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	f70da2a390	zebra: Refactor nexthop reading from zapi messages Take the zebra code that reads nexthops and combine it into one function so that when we add zapi messages to send/receive nexthops we can take advantage of this function. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donald Sharp	786a9bd9eb	zebra: Convert zserv_nexthop_num_warn to return bool Allow us to key off the warning if we have one. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-28 12:40:59 -04:00
Donatas Abraitis	b1f476731a	Merge pull request #7169 from donaldsharp/some_code_cleanup Some code cleanup	2020-09-25 10:19:34 +03:00
Sri Mohana Singamsetty	46dd92c522	Merge pull request #7164 from AnuradhaKaruppiah/mh-misc-fixes evpn-mh: miscellaneous cleanup/fixes	2020-09-24 08:37:45 -07:00
Donald Sharp	9781e6a047	zebra: Don't ignore setsockopt return When attempting to limit the amount of data sent from the kernel to FRR, some kernels we can run against may not have this ability in which case the setsockopt will fail. Notice that in the log. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-24 07:42:51 -04:00
Rafael Zalamena	eead0bc46b	zebra: human readable netlink dumps Add new compile option to enable human readable netlink dumps with `debug zebra kernel msgdump`. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2020-09-23 23:07:02 -03:00
Donald Sharp	00e0d113e5	zebra: the mlag_rd_buf_offset variable was write only The mlag_rd_buf_offset variable was only ever being set to 0 in the mlag_read function and only written in that function. There is no need for this global variable. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 20:36:51 -04:00
Mark Stapp	ccda0eadac	Merge pull request #7155 from donaldsharp/TRAP Offload/Trap	2020-09-23 16:06:37 -04:00
Mark Stapp	4020564a3c	Merge pull request #7163 from donaldsharp/zebra_mlag_bugs Zebra mlag bugs	2020-09-23 15:32:31 -04:00
Anuradha Karuppiah	e378f5020d	zebra: fix use of freed es during zebra shutdown This problem was reported by the sanitizer - ================================================================= ==24764==ERROR: AddressSanitizer: heap-use-after-free on address 0x60d0000115c8 at pc 0x55cb9cfad312 bp 0x7fffa0552140 sp 0x7fffa0552138 READ of size 8 at 0x60d0000115c8 thread T0 #0 0x55cb9cfad311 in zebra_evpn_remote_es_flush zebra/zebra_evpn_mh.c:2041 #1 0x55cb9cfad311 in zebra_evpn_es_cleanup zebra/zebra_evpn_mh.c:2234 #2 0x55cb9cf6ae78 in zebra_vrf_disable zebra/zebra_vrf.c:205 #3 0x7fc8d478f114 in vrf_delete lib/vrf.c:229 #4 0x7fc8d478f99a in vrf_terminate lib/vrf.c:541 #5 0x55cb9ceba0af in sigint zebra/main.c:176 #6 0x55cb9ceba0af in sigint zebra/main.c:130 #7 0x7fc8d4765d20 in quagga_sigevent_process lib/sigevent.c:103 #8 0x7fc8d4787e8c in thread_fetch lib/thread.c:1396 #9 0x7fc8d4708782 in frr_run lib/libfrr.c:1092 #10 0x55cb9ce931d8 in main zebra/main.c:488 #11 0x7fc8d43ee09a in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x2409a) #12 0x55cb9ce94c09 in _start (/usr/lib/frr/zebra+0x8ac09) ================================================================= Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:20:13 -07:00
Anuradha Karuppiah	4d8b658c8c	zebra: evpn-mh: add error logs on ES processing failures Cleanup some of the XXX added during development of MH. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 11:15:08 -07:00
Donatas Abraitis	5fde152be6	Merge pull request #7112 from AnuradhaKaruppiah/mac-neigh-ht evpn-mh: mac-ip sync hold timers	2020-09-23 21:11:56 +03:00
Patrick Ruddy	a3b5e4fdf7	Merge pull request #7157 from donaldsharp/nhg_speeds zebra: Move debug information gathering to inside guard	2020-09-23 18:42:00 +01:00
Donald Sharp	c19808acad	zebra: Increase the read/write mlag buffer sizes The read/write mlag buffer sizes of 2k were sufficient for ~100 S,G notifications at one go. Increase to 32k to give us 16 times the space. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:13:03 -04:00
Donald Sharp	7692744f2c	zebra: Ensure that message received from mlag will fit If we receive a message that is greater than our buffer size we are in a situation where both the read and write buffers are fubar'ed beyond the end. Assert when we notice this fact. Ticket: CM-31576 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:26 -04:00
Donald Sharp	f24d9ab667	zebra: modify mlag code to only need 1 stream when generating data The normal pattern of writing the type/length at the beginning of the packet was not being quite followed. Modify the mlag code to respect the proper way of doing things and get rid of a stream_new and copy. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-23 13:12:20 -04:00
Anuradha Karuppiah	2b9e207e0e	zebra: stop neigh hold timer when the neigh is deleted The neigh hold timer was firing after the neigh was deleted resulting in the following crash - [ at ./zebra/zebra_evpn_neigh.h:155 at zebra/zebra_evpn_neigh.c:447 at lib/thread.c:1578 at zebra/main.c:488 ] Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Don Slice	f9f0463fb9	zebra: fix deletion of evpn mh neigh-holdtime Found that the command "evpn mh neigh-holdtime" can be set but not deleted. This fix solves the delete process. Signed-off-by: Don Slice <dslice@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Anuradha Karuppiah	41c809b2a8	zebra: changes for configuring mac and neigh holdtime When an ES peer withdraws a MAC-IP route we hold the entry for N seconds to allow an external daemon (neighmgr) to establish host reachability independent of the peer. Add config commands to allow the user to set this holdtime (N). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-23 06:46:19 -07:00
Donald Sharp	aa178efd49	Merge pull request #7148 from pguibert6WIND/fix_fd_not_closed zebra: fix fd going out of scope leaks the handle	2020-09-23 07:40:14 -04:00
Donatas Abraitis	0ce5baaab1	Merge pull request #7018 from gouault6wind/show_ip_route Clean up in vrf management	2020-09-23 08:45:09 +03:00
Donald Sharp	bed74d178e	zebra: Move debug information gathering to inside guard Let's not make the entire `depend_finds` function pay for the data gathering needed for the debug. There are numerous other places in the code that check the NEXTHOP_FLAG_RECURSIVE and do the same output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 20:47:33 -04:00
Sri Mohana Singamsetty	efdd997dad	Merge pull request #7116 from AnuradhaKaruppiah/mh-neigh-fixes evpn-mh: changes for programming synced neighs as static in the dataplane	2020-09-22 15:45:09 -07:00
Mark Stapp	b6033bd1c1	Merge pull request #7067 from donaldsharp/remove_solaris Remove solaris	2020-09-22 17:04:19 -04:00
Donald Sharp	5a3cf85391	lib, zebra: Add ability to read kernel notice of TRAP/OFFLOAD The linux kernel is getting RTM_F_TRAP and RTM_F_OFFLOAD for kernel routes that have an underlying asic offload. Write the code to receive these notifications from the linux kernel and to store that data for display about the routes. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-09-22 15:57:43 -04:00
Donald Sharp	4c56ce1cea	zebra: Add basic knowledge of asic offload available Some linux kernels are starting to support the idea of knowledge about the underlying asic. Add a boolean that we can set/unset to track whether or not we think the router has this functionality available. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-22 15:57:43 -04:00
Philippe Guibert	7529bf8f05	zebra: fix fd going out of scope leaks the handle The file descriptor is closed if it has been locally created. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-22 21:09:13 +02:00
Donald Sharp	cae8bc967c	*: Remove solaris from FRR The Solaris code has gone through a deprecation cycle. No-one has said anything to us and worst of all we don't have any test systems running Solaris to know if we are making changes that are breaking on Solaris. Remove it from the system so we can clean up a bit. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-21 10:02:20 -04:00
Christophe Gouault	1eb92f06c4	vrf: VRF_DEFAULT must be 0, remove useless code Code was added in the past to support a value of VRF_DEFAULT different from 0. This option was abandoned, the default vrf id is always 0. Remove this code, this will simplify the code and improve performance (use a constant value instead of a function that performs tests). Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:17:35 +02:00
Christophe Gouault	d58b6f7568	zebra: always display vrf in show ip route json In route json outputs, always display the vrf even if it is the default vrf. Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:07:56 +02:00
Christophe Gouault	6ac9404177	zebra: simplify and optimize vrf display in show ip route In all outputs (text and json): simplify and optimize the vrf name display, use the vrf_id_to_name() handler. Note: vrf_id_to_name() has a safeguard system that prevents from crashing when the vrf cannot be found because it changed in some (unexpected) manner, it returns "n/a". Note: "vrf n/a" will now be displayed instead of "vrf UNKNOWN" in this case, like in most other frr components. This safeguard was missing for show ip route json, so this optimization also fixes a potential crash. Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:07:56 +02:00
Christophe Gouault	8277d4b796	zebra: fix show ip route output Variable "show ip route" commands invoke the same helper (do_show_ip_route), potentially several times. When asking to dump a non-default vrf, all vrfs or all tables, the output is messy, the header summarizing abbreviations is repeated several times, excess line feeds appear, the default table of default VRF is concatenated to the previous table output... Normalize the output: - whatever the case, display the common header at most once, if there is at least an entry to dump. - when using a "vrf all" or "table all" command, prepend a line with the VRF and table (even for the default vrf or table). - when dumping a specific vrf or table, prepend a line with the VRF and table. Example (vrf all) ================= router# show ip route vrf all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main: C>* 10.0.2.0/24 is directly connected, mgmt0, 00:24:09 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:24:09 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:00:26 VRF private: S>* 1.1.1.0/24 [1/0] via 10.125.0.2, loop0, 00:00:29 C>* 10.125.0.0/24 is directly connected, loop0, 00:00:42 Example (main vrf) ================== router# show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route C>* 10.0.2.0/24 is directly connected, mgmt0, 00:24:41 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:24:41 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:00:58 Example (specific vrf) ====================== router# show ip route vrf private Codes: K - kernel route, C - 
connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF private: S>* 1.1.1.0/24 [1/0] via 10.125.0.2, loop0, 00:01:23 C>* 10.125.0.0/24 is directly connected, loop0, 00:01:36 Example (all tables) ==================== router# show ip route table all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main table 200: S>* 4.4.4.4/32 [1/0] via 10.125.0.3, ntfp2, 00:01:51 VRF main table 254: C>* 10.0.2.0/24 is directly connected, mgmt0, 00:25:34 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:25:34 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:01:51 Example (all vrf, all table) ============================ router# show ip route table all vrf all Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main table 200: S>* 4.4.4.4/32 [1/0] via 10.125.0.3, ntfp2, 00:02:15 VRF main table 254: C>* 10.0.2.0/24 is directly connected, mgmt0, 00:25:58 K>* 10.0.2.2/32 [0/100] is directly connected, mgmt0, 00:25:58 C>* 10.125.0.0/24 is directly connected, ntfp2, 00:02:15 VRF private table 200: S>* 2.2.2.0/24 [1/0] via 10.125.0.2, loop0, 00:02:18 VRF private table 254: S>* 1.1.1.0/24 [1/0] via 10.125.0.2, loop0, 00:02:18 C>* 10.125.0.0/24 is directly connected, loop0, 00:02:31 Example (specific table) ======================== router# show ip route table 200 Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - 
NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF main table 200: S>* 4.4.4.4/32 [1/0] via 10.125.0.3, ntfp2, 00:05:26 Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>	2020-09-21 10:07:56 +02:00
Donald Sharp	8c36640b7c	zebra: Allow FreeBSD to set and delete addresses from an interface This series of events: $ sudo ifconfig lo0 add 4.4.4.4/32 $ sudo ifconfig lo0 inet 4.4.4.4/32 delete would end up leaving the 4.4.4.4/32 address on the interface under freebsd. This all boils down to the fact that the interface is not considered connected yet we have a destination. If the destination is the same and we are not connected ignore it on freebsd. I am sure there are other fun scenarios that someone will have to squirrel out. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-20 20:53:27 -04:00
Russ White	0a6e6613d7	Merge pull request #7089 from pguibert6WIND/netns-refactor Netns refactor	2020-09-18 11:02:30 -04:00
Anuradha Karuppiah	fb8f609d48	zebra: fixup a neigh del bug introduced by a previous commit for MAC-IP sync Problem commit - [ `b169fd6fd5` zebra: support for MAC-IP sync routes ] That commit had accidentally replaced a mac-ip del to bgp with a mac del (consequence of a bad cut-paste). Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-16 17:32:33 -07:00
Anuradha Karuppiah	ccd187cdaa	zebra: setup static neigh flag in the dataplane Changes to setup peer-synced as static in the dataplane. This prevents them from being flushed out when the local switch cannot establish their reachability. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-16 17:24:18 -07:00
Patrick Ruddy	f6374bd9b7	Merge pull request #7107 from AnuradhaKaruppiah/mh-cleanup zebra: re-name some mh functions to make the code more readable	2020-09-16 15:42:49 +01:00
Anuradha Karuppiah	945ee7b272	zebra: re-name some mh functions to make the code more readable As a part of the re-factoring some of the evpn_vni_es apis got re-named as evpn_evpn_es. Changed them to evpn_es_evi to make it common to vxlan and mpls. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-09-15 16:24:59 -07:00
Chirag Shah	839dfe29a6	zebra: dup detected mac avoid update to bgpd When a MAC is detected duplicate on a local learn event (with freeze action), do not send update to bgp to advertise into evpn control plane. With evpn mh, inform_client flag is set and sends notification to bgp albeit dup detect is set. Check mac are detected as duplicate before setting inform_client to true. Ticket:CM-29817 Reviewed By:CCR-10329 Testing Done: Enable DAD with freeze action Upon local learn MAC detected as duplica Signed-off-by: Chirag Shah <chirag@cumulusnetworks.com>	2020-09-15 16:12:44 -07:00
Donald Sharp	58a1d24924	bgpd, lib, pbrd, zebra: Pass by ifname When installing rules pass by the interface name across zapi. This is being changed because we have a situation where if you quickly create/destroy ephemeral interfaces under linux the upper level protocol may be trying to add a rule for an interface that does not quite exist at the moment. Since ip rules actually want the interface name ( to handle just this sort of situation ) convert over to passing the interface name and storing it and using it in zebra. Ticket: CM-31042 Signed-off-by: Stephen Worley <sworley@nvidia.com> Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-09-11 20:04:45 -04:00
Renato Westphal	2525fc315a	Merge pull request #6974 from liron-ze/high-cpu-usage zebra: Solve the problem of high CPU and memory usage when creating t…	2020-09-11 15:59:27 -03:00
Philippe Guibert	a1ce03e1fd	zebra: support for macvlan interfaces mac vlan interfaces are supported in different network namespaces. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	289b0f0dfd	zebra: dynamically detect vxlan link interfaces in other netns this is used when parsing the new network namespaces. actually, to track the link of some interfaces like vxlan interfaces, both link index and link nsid are necessary. if a vxlan interface is moved to a new netns, the link information is in the default network namespace, then LINK_NSID is the value of the netns by default in the new netns. That value of the default netns in the new netns is not known, because the system does not automatically assign an NSID of default network namespace in the new netns. Now a new NSID of default netns, seen from that new netns, is created. This makes it possible to store, at netns creation, the default netns relative value for further usage. Because the default netns value is set from the new netns perspective, it is no longer necessary to use the NETNSA_TARGET_NSID attribute only available in recent kernels. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	2961d0601c	lib, zebra: reuse and adapt ns_list walk functionality the walk routine is used by vxlan service to identify some contexts in each specific network namespace, when vrf netns backend is used. that walk mechanism is extended with some additional parameters to the walk routine. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	b5fde6fd5c	zebra: local mac entries populated in correct netnamespace MAC address neighbours are read on the same netnamespace where the vxlan interface is. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	d6bf8f136d	zebra: when parsing local entry against dad, retrieve config when duplicate address detection is observed, some incrementation, some timing mechanisms need to be done. For that the main evpn configuration is retrieved. Until now, the VRF that was storing the dad config parameters was the same VRF that hosted the VXLAN interface. With netns backend, this is not true, as the VXLAN interface is in the same VRF as the bridge interface. The modification takes same definition as in BGP, that is to say that there is a single bgp evpn instance, and this is that instance that will give the correct config settings. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:26:23 +02:00
Philippe Guibert	9609fab712	zebra: zvni_map_to_vlan() adaptation for all namespaces this change is needed when a MAC/IP entry is learned by zebra, and the entry happens to be in a different namespace. So that the entry be active, the correct vni match has to be found. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-09-11 18:02:50 +02:00
Jakub Urbańczyk	97f8514460	zebra: increase netlink receive buffer size Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-09-01 11:35:16 +02:00
Anuradha Karuppiah	4cd940502d	zebra: fix misc problems with ES management 1. MAC ref of a zero ESI was accidentally creating a new ES with zero ES id. 2. When an ES was deleted and re-added the ES was not being sent to BGP because of a stale flag that suppressed the update as a dup. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-28 15:10:18 -07:00
Donald Sharp	9271987f1e	zebra: When we get a rib deletion event be smarter When we get a rib deletion event and we already have that particular route node in the queue to be reprocessed, just note that someone from kernel land has done us dirty and allow it to be cleaned up by normal processing Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-28 14:45:59 -04:00
Donald Sharp	0aaa722883	zebra: When shutting down an interface immediately notify about rnh Imagine a situation where an interface is bouncing up/down. The interface comes up and daemons like pbr will get a nht tracking callback for a connected interface up and will install the routes down to zebra. At this same time the interface can go down. But since zebra is busy handling route changes ( from pbr ) it has not read the netlink message and can get into a situation where the route resolves properly and then we attempt to install it into the kernel( which is rejected ). If the interface bounces back up fast at this point, the down then up netlink message will be read and create two route entries off the connected route node. Zebra will then enqueue both route entries for future processing. After this processing happens the down/up is collapsed into an up and nexthop tracking sees no changes and does not inform any upper level protocol( in this case pbr ) that nexthop tracking has changed. So pbr still believes the nexthops are good but the routes are not installed since pbr has taken no action. Fix this by immediately running rnh when we signal a connected route entry is scheduled for removal. This should cause upper level protocols to get a rnh notification for the small amount of time that the connected route was bouncing around like a madman. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-28 14:45:59 -04:00
Mark Stapp	5c2bb61763	Merge pull request #7017 from xThaid/netlink_fix zebra: fix netlink batching	2020-08-28 08:20:28 -04:00
Quentin Young	3d9d84e42e	Merge pull request #6989 from xThaid/remove_fuzzing zebra: remove fuzzing stuff	2020-08-27 17:33:43 -04:00
Jakub Urbańczyk	2f9dbd3ab3	zebra: fix netlink batching It was wrongly assumed that the kernel is replying in batches when multiple requests fail. The kernel sends one error message at a time, so we can simply keep reading data from the socket as long as possible. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-27 22:06:14 +02:00
Donald Sharp	b96f64f76f	zebra: When we fail, actually note the failure During testing it was noticed that routes were considered installed by zebra, but the kernel did not have the route. Upon close debugging of the rib it was noticed that FRR was turning a dplane_ctx_route_init into a success and FRR was now in a bad state. 2020/08/26 17:55:53.897436 PBR: route_notify_owner: [0.0.0.0/0] Route Removed succeeded for table: 10012 2020/08/26 17:55:53.897572 ZEBRA: 0.0.0.0/0: uptime == 432033, type == 24, instance == 0, table == 10012 2020/08/26 17:55:53.897622 ZEBRA: rib_meta_queue_add: (0:10012):0.0.0.0/0: queued rn 0x5566b0ea7680 into sub-queue 5 2020/08/26 17:55:53.907637 ZEBRA: default(0:10012):0.0.0.0/0: Processing rn 0x5566b0ea7680 2020/08/26 17:55:53.907665 ZEBRA: default(0:10012):0.0.0.0/0: Examine re 0x5566b0d01200 (pbr) status 2 flags 1 dist 200 metric 0 2020/08/26 17:55:53.907702 ZEBRA: default(0:10012):0.0.0.0/0: After processing: old_selected 0x0 new_selected 0x5566b0d01200 old_fib 0x0 new_fib 0x5566b0d01200 2020/08/26 17:55:53.907713 ZEBRA: default(0:10012):0.0.0.0/0: Adding route rn 0x5566b0ea7680, re 0x5566b0d01200 (pbr) 2020/08/26 17:55:53.907879 ZEBRA: default(0:10012):0.0.0.0/0: rn 0x5566b0ea7680 dequeued from sub-queue 5 2020/08/26 17:55:53.907943 ZEBRA: netlink_route_multipath: RTM_NEWROUTE 0.0.0.0/0 vrf 0(10012) 2020/08/26 17:55:53.910756 ZEBRA: default(0:10012):0.0.0.0/0 Processing dplane result ctx 0x5566b0ea82f0, op ROUTE_INSTALL result SUCCESS 2020/08/26 17:55:53.910769 ZEBRA: update_from_ctx: default(0:10012):0.0.0.0/0: SELECTED, re 0x5566b0d01200 2020/08/26 17:55:53.910785 ZEBRA: default(0:10012):0.0.0.0/0 update_from_ctx(): no fib nhg 2020/08/26 17:55:53.910793 ZEBRA: default(0:10012):0.0.0.0/0 update_from_ctx(): rib nhg matched, changed 'true' 2020/08/26 17:55:53.910802 ZEBRA: (0:10012):0.0.0.0/0: Redist update re 0x5566b0d01200 (pbr), old 0x0 (None) 2020/08/26 17:55:53.910812 ZEBRA: Notifying Owner: 24 about prefix 0.0.0.0/0(10012) 2 
vrf: 0 2020/08/26 17:55:53.910912 PBR: route_notify_owner: [0.0.0.0/0] Route installed succeeded for table: 10012 2020/08/26 17:55:55.400516 ZEBRA: RTM_DELROUTE 0.0.0.0/0 vrf default(0) table_id: 10012 metric: 20 Admin Distance: 0 2020/08/26 17:55:55.400527 ZEBRA: rib_delete: (0:10012):0.0.0.0/0: rn 0x5566b0ea7680, re 0x5566b0d01200 (pbr) was deleted from kernel, adding We were receiving a notification from the kernel that the route was deleted and deciding that we needed to reinstall it. At that point in time when it got into the dplane handlers to convert it to the dplane pthread, the dplane decided to drop the request, convert it to a success and not do anything. This code change removes the conversion from this failure to success and notifies the upper level about it. After this change the default route to table 10012 is now properly marked as rejected: root@mlx-2700-07:mgmt:/var/log/frr# vtysh -c "show ip route table 10012" Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, v - VNC, V - VNC-Direct, A - Babel, D - SHARP, F - PBR, f - OpenFabric, > - selected route, * - FIB route, q - queued route, r - rejected route VRF default table 10012: F>r 0.0.0.0/0 [200/0] via 172.168.1.164, isp2-uplink (vrf PUBLIC), weight 1, 00:24:48 Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-26 21:51:54 -04:00
Donald Sharp	1a39fdc218	zebra: Short circuit test for nexthop groups when not using When we are not using nexthop groups, there is no need to test for whether or not they are installed correctly or not Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-26 21:50:32 -04:00
Donald Sharp	c6d41e93e0	Merge pull request #5799 from pguibert6WIND/flowspec_ipv6 Flowspec ipv6	2020-08-26 08:26:46 -04:00
liuze	9212d1e196	zebra: Solve the problem of high CPU and memory usage when creating thousands of vrf Signed-off-by: liuze <liuze@asterfusion.com>	2020-08-26 17:41:29 +08:00
Jakub Urbańczyk	4efd721d1e	zebra: remove fuzzing stuff The fuzzing code that is in the master branch is outdated and unused, so it is worth removing it to improve readability of the code. All the code related to the fuzzing is in the `fuzz` branch. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-25 17:31:07 +02:00
Mark Stapp	f515871207	zebra: fix SA warning in rib_process() Fix an SA warning about a possible NULL pointer deref in rib_process(). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-08-21 09:39:02 -04:00
Philippe Guibert	c6423c3153	bgp, zebra: add some alignments with remarks from community align the code to remarks from community. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-21 13:37:08 +02:00
Philippe Guibert	c9b1139ace	zebra: add icmpv6 table of type / code this table contains the list of icmp type/code for icmpv6. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-21 13:37:08 +02:00
Philippe Guibert	a60b7031f9	bgp, zebra: add family attribute to ipset and iptable context in order to create appropriate policy route, family attribute is stored in ipset and iptable zapi contexts. This commit also adds the flow label attribute in iptables, for further usage. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-21 13:37:08 +02:00
Rafael Zalamena	f07254d0f5	Merge pull request #6951 from donaldsharp/zebra_table zebra: Add table id to debug output	2020-08-20 14:25:14 +00:00
Donald Sharp	ec64a77b33	zebra: Limit packet dumps for received data to `detail` level When turning on `debug zebra packet detail` or `debug zebra packet recv detail` only display the detailed packet dump when `detail` is added. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-19 18:33:05 -04:00
Donald Sharp	c2c02b76bc	zebra: Add table id to debug output There are a bunch of places where the table id is not being output in debug messages for routing changes. Add in the table id we are operating on. This is especially useful for the case where pbr is working. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-19 13:59:29 -04:00
Philippe Guibert	6265fbe55c	zebra: fdb and neighbor table are read for all zns all network namespaces are read so as to collect interesting fdb and neighbor tables for EVPN. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	47c5892955	zebra: add ns_id attribute to mac structure this information is necessary for local information, because the interface associated to the mac address is stored with its ifindex, and the ifindex may not be enough to get to the right interface when it comes with multiple network namespaces. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	a6e0edf280	zebra: bridge layer2 information records ns_id where bridge is when working with vrf netns backend, two bridge interfaces may have the same bridge interface index, but not the same namespace. because in vrf netns backend mode, a bridge slave always belongs to the same network namespace, a check with the namespace id and the ns id of the bridge interface makes it possible to resolve the interface pointer correctly. The problem could occur if the same index is found for two bridge interfaces in two different namespaces. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	b1cc23b2cd	zebra, lib: new API to get absolute netns val from relative netns val when receiving a netlink API for an interface in a namespace, this interface may come with LINK_NSID value, which means that the interface has its link in another namespace. Unfortunately, the link_nsid value is local to that namespace, and there is a need to know what is its associated nsid value from the default namespace point of view. The information collected previously on each namespace can then be compared with that value to check if the link belongs to the default namespace or not. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	20f4b2b071	zebra, lib: store relative default ns id in each namespace to be able to retrieve the network namespace identifier for each namespace, the ns id is stored in each ns context. For default namespace, the netns id is the same as that value. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	2d4e4d39b7	zebra, lib: add an internal API to get relative default nsid in other ns as a reminder, the netns identifiers are local to a namespace. that is to say that for instance, a vrf <vrfx> will have a netns id value in one netns, and have another netns id value in another netns. There is a need for zebra daemon to collect some cross information, like the LINK_NETNSID information from interfaces having link layer in another network namespace. For that, it is needed to have a global overview instead of a relative overview per namespace. The first brick of this change is an API that sticks to netlink API, that uses NETNSA_TARGET_NSID. from a given vrf vrfX, and a new vrf created vrfY, the API returns the value of nsID from vrfX, inside the new vrf vrfY. The brick also gets the ns id value of default namespace in each other namespace. An additional value in ns.h is offered, that makes it possible to retrieve the default namespace context. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	9771da714e	zebra: map vxlan interface to bridge interface with correct ns id an incoming bridge index has been found, that is linked with vxlan interface, and the search for that bridge interface is done. In vrf-lite, the search is done across the same default namespace, because bridge and vxlan may not be in the same vrf. But this behaviour is wrong when using vrf netns backend, as the bridge and the vxlan have to be in the same vrf ( hence in the same network namespace). To comply with that, use the netnamespace of the vxlan interface. Like that, the appropriate nsid is passed as parameter, and consequently, the search is correct, and the mac address passed to BGP will be ok too. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	d0738ba2bb	zebra: display interface slave type interface slave type is displayed. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	a237058f59	zebra: zvni_map_to_svi() adaptation for other network namespaces the function is called with all the network namespaces. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	9d277b8c52	zebra: zvni_from_svi() adaptation for other network namespaces other network namespaces are parsed because bridge interface can be bridged with vxlan interfaces with a link in the default vrf that hosts l2vpn. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Philippe Guibert	07509878e3	zebra: importation of bgp evpn rt5 from vni with other netns With vrf-lite mechanisms, it is possible to create layer 3 vnis by creating a bridge interface in default vrf, by creating a vxlan interface that is attached to that bridge interface, then by moving the vxlan interface to the desired vrf. With vrf-netns mechanism, it is slightly different since bridged interfaces can not be separated in different network namespaces. To make it work, the setup consists in : - creating a vxlan interface on default vrf. - move the vxlan interface to the desired vrf ( with another netns) - create a bridge interface in the desired vrf - attach the vxlan interface to that bridged interface from that point, if BGP is enabled to advertise vnis in default vrf, then vxlan interfaces are discovered appropriately in other vrfs, provided that the link interface still resides in the vrf where l2vpn is advertised. to import ipv4 entries from a separate vrf, into the l2vpn, the configuration of vni in the dedicated vrf + the advertisement of ipv4 entries in bgp vrf will import the entries in the bgp l2vpn. the modification consists in parsing the vxlan interfaces in all network namespaces, where the link resides in the same network namespace as the bgp core instance where bgp l2vpn is enabled. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-08-18 09:25:06 -07:00
Donald Sharp	22c9bfb824	Merge pull request #6883 from pjdruddy/evpn-refactor Evpn refactor	2020-08-14 09:02:25 -04:00
Donald Sharp	806e504063	Merge pull request #6892 from opensourcerouting/feature/sr-te-staticd staticd: add support for SR Policies	2020-08-13 19:19:37 -04:00
Jakub Urbańczyk	d68e74b41c	lib, zebra: add support for sending ARP requests We can make the Linux kernel send an ARP/NDP request by adding a neighbour with the 'NUD_INCOMPLETE' state and the 'NTF_USE' flag. This commit adds new dataplane operation as well as new zapi message to allow other daemons send ARP/NDP requests. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-12 23:19:58 +02:00
Pat Ruddy	2bdd4461c1	zebra: clean up SA warning in EVPN code Use asserts rather than tests where the values should definitely not be NULL. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 18:03:21 +01:00
Chirag Shah	707b76d795	zebra: Revert "zebra: probe local inactive neigh" Reverting probing of neigh entry. There is a timing window where the probe and the remote macip add request come at the same time, resulting in the neigh remaining in local state even though it should be remote. In the mobility case, the host moves to the remote VTEP, and first a MAC-only type-2 route is received, which triggers a PROBE of neighs (associated to MAC). The PROBE request can go via the network port to the remote VTEP. The PROBE request picks up the local neigh while the MAC entry's outgoing port is the remote VTEP tunnel port. The PROBE reply and the MAC-IP (containing IP) arrive at almost the same time at the DUT. The DUT first processes the remote macip and installs the neigh as remote. It then receives the neigh as REACHABLE, which marks the neigh as LOCAL. FRR does have a BPF filter which does not allow it to receive its own netlink requests. Otherwise frr's request to program the neigh as remote can move the neigh from local to remote. However, ordering can not be guaranteed: REACHABLE (the PROBE's response) can come at any time and move it to LOCAL. This fix would not suffice the needs of converging LOCAL inactive neighs to remove from DB. The mobility draft suggests to PROBE the local neigh when the MAC moves to remote, but it is not working with the current framework. Ticket:CM-22864 This reverts commit `44bc8ae550` Signed-off-by: Chirag Shah <chirag@cumulusnetworks.com>	2020-08-12 12:39:34 +01:00
Anuradha Karuppiah	1718bc783a	zebra: fix SA NULL ptr access warning in evpn_mh Warning logs - Logic error: Dereference of null pointer in zebra_evpn_mh.c, function zebra_evpn_es_evi_show_vni, line 360 See https://ci1.netdef.org/browse/FRR-FRRPULLREQ-13544/artifact/shared/static_analysis/report-b1eb72.html#EndPath Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-12 12:39:34 +01:00
Pat Ruddy	8b5fdf2e66	zebra: extract core EVPN functions from zebra_vxlan.c extract the core EVPN functions from zebra_vxlan.c and put them in a new file zebra_evpn.c. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	6006414dc9	zebra: clone zebra_vxlan.c to zebra_evpn.c clone zebra_vxlan.c to create a file zebra_evpn.c for core EVPN functions whilst retaining the history of zebra_vxlan.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	33064a6289	zebra: extract neighbor processing from kernel_neigh_del extract the neighbor uninstall part of zebra_vxlan_handle_kernel_neigh_del into a new function zebra_evpn_neigh_del_ip in zebra_evpn_neigh.c. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	32fe7dfd9b	zebra: extract neighbor processing from process_remote_macip_del extract the neighbor uninstall part of process_remote_macip_add into a new function zebra_evpn_neigh_remote_uninstall in zebra_evpn_neigh.c. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	224315f3e4	zebra: extract neighbor processing from zevpn_gw_macip_add extract the neighbor part of process_remote_macip_add into a new function zebra_evpn_neigh_gw_macip_add in zebra_evpn_neigh.c. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	036daaca3e	zebra: extract neighbor processing from remote_macip_add extract the neighbor part of process_remote_macip_add into a new function process_neigh_remote_macip_add in zebra_evpn_neigh.c. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	7cbae20ade	zebra: extract neighbor functions from zebra_vxlan.c Move neighbor processing functions to new zebra_evpn_neigh.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	6336e12b94	zebra: clone zebra_vxlan.c to zebra_evpn_neigh.c clone zebra_vxlan.c to create a file zebra_evpn_neigh.c for neighbor dB functions whilst retaining the history of zebra_vxlan.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	7bce35354f	zebra: extract gateway mac add from zebra_vxlan.c extract mac_gateway add code from zevi_gw_macip_add and move it to a new generic function zebra_evpn_mac_gw_macip_add in zebra_evpn_mac.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	ad6ca5f423	zebra: extract local mac del from zebra_vxlan.c extract generic local mac add code from zebra_vxlan_local_mac_del into a new function zebra_evpn_del_local_mac in zebra_evpn_mac.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	d9d3455e09	zebra: extract local mac add code from vxlan extract the local mac add code from zebra_vxlan_local_mac_add_update and create a new generic local mac add function zebra_evpn_add_update_local_mac Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:34 +01:00
Pat Ruddy	19fdd1be29	zebra: split out mac_add code from process_remote_macip_add Move MAC add code from process_remote_macip_add in zebra_vxlan.c to a generic function process_mac_remote_macip_add in zebra_evpn_mac.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:33 +01:00
Pat Ruddy	b299808662	zebra: extract evpn mac functions from zebra_vxlan.c Move MAC dB specific functions to zebra_evpn_mac.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:33 +01:00
Pat Ruddy	24268cd07e	zebra: clone zebra_vxlan.c to zebra_evpn_mac.c clone zebra_vxlan.c to create a file zebra_evpn_mac.c for MAC dB functions whilst retaining the history of zebra_vxlan.c Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:33 +01:00
Pat Ruddy	87d76d5403	zebra: rename vni to evpn where appropriate The main zebra_vni_t hash structure has been renamed to zebra_evpn_t to allow for other transport underlays. Rename functions and variables to reflect this change. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2020-08-12 12:39:33 +01:00
Sebastien Merle	065276ae1f	staticd: add support for SR Policies Configuration example: ip route 9.9.9.9/32 6.6.6.6 color 123 The SR Policy to be chosen is uniquely identified by the policy endpoint (6.6.6.6) and the SR-TE color (123). Traffic will be augmented with an MPLS label stack according to the active candidate path of that particular policy. Co-authored-by: GalaxyGorilla <sascha@netdef.org> Signed-off-by: Sebastien Merle <sebastien@netdef.org>	2020-08-12 13:28:48 +02:00
Renato Westphal	4b991739c1	Merge pull request #6874 from chiragshah6/mdev zebra: fix crash in vrf-vni mapping	2020-08-11 20:27:00 -03:00
Mark Stapp	240e9a57fd	Merge pull request #6820 from donaldsharp/random_stuff Some ad-hoc work that is independent of actual bug fixes/feature work I am doing	2020-08-11 16:07:58 -04:00
Donald Sharp	659d56e13f	Merge pull request #6755 from xThaid/dplane_batching zebra: dataplane batching	2020-08-11 13:59:12 -04:00
Donald Sharp	39feaef849	Merge pull request #6894 from mjstapp/fix_sa_vxlan zebra: clean up SA warning in vxlan code	2020-08-11 13:12:08 -04:00
Mark Stapp	ceb71630ed	Merge pull request #6870 from donaldsharp/test_fixups Test fixups	2020-08-11 12:40:52 -04:00
Donald Sharp	469d627747	zebra: Only note time of first nht registration We were noticing registration time of the last nht time. Let's just store the original time, although I am a bit dubious about the usefulness of this. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-11 12:37:35 -04:00
Mark Stapp	b7895aad82	zebra: clean up SA warning in vxlan code Resolve an SA warning in the vxlan code. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-08-11 08:39:25 -04:00
Jakub Urbańczyk	531c92b82f	tests: zebra netlink testcases Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:57:12 +02:00
Jakub Urbańczyk	f6feb48b48	zebra: design changes in netlink batching code Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:57:12 +02:00
Jakub Urbańczyk	18f60fe999	zebra: remove old kernel one-update-at-a-time api The old one is replaced by the api that is suitable for the batching. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:57:04 +02:00
Jakub Urbańczyk	67e3369ed4	zebra: netlink message batching Integrate existing functions with batching infrastructure. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:42:43 +02:00
Jakub Urbańczyk	e63c76228c	zebra: add netlink message batching infrastructure Add helpers to batch several messages into one single netlink datagram. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:33:00 +02:00
Jakub Urbańczyk	86d5622362	zebra: remove "PENDING" dplane request state This request state is redundant with new message batching interface. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:33:00 +02:00
Jakub Urbańczyk	fef24b0339	zebra: prepare dplane for batching Extend kernel interface to allow the data plane to send many kernel updates at once. Signed-off-by: Jakub Urbańczyk <xthaid@gmail.com>	2020-08-10 21:32:59 +02:00
Russ White	271ac28499	Merge pull request #6783 from opensourcerouting/feature/sr-te lib, zebra: Add SR-TE policy infrastructure to zebra	2020-08-10 15:31:35 -04:00
Donald Sharp	8e3ac40d2c	Merge pull request #6483 from sylane/router-id-v6 zebra: add IPv6 router-id	2020-08-10 08:39:51 -04:00
Chirag Shah	a446dc589b	zebra: fix crash in vrf-vni mapping As part of PR 6758, vrf vni was converted to transactional CLI. Handle a scenario where vrf is not created yet (inactive) and vni is mapped to the inactive vrf. Testing Done: bharat(config-vrf)# do show vrf vrf vrf1 id 11 table 1001 vrf vrf5 inactive (configured) bharat(config)# vrf vrf5 bharat(config-vrf)# vni 5005 bharat(config-vrf)# do show vrf vni VRF VNI VxLAN IF L3-SVI State Rmac vrf5 5005 None None Down None bharat(config-vrf)# no vni 5005 bharat(config-vrf)# do show vrf vni VRF VNI VxLAN IF L3-SVI State Rmac Signed-off-by: Chirag Shah <chirag@cumulusnetworks.com>	2020-08-07 15:23:27 -07:00
Sebastien Merle	31f937fb43	lib, zebra: Add SR-TE policy infrastructure to zebra For the sake of Segment Routing (SR) and Traffic Engineering (TE) Policies there's a need for additional infrastructure within zebra. The infrastructure in this PR is supposed to manage such policies in terms of installing binding SIDs and LSPs. It is also capable of managing MPLS labels using the label manager, keeping track of nexthops (for resolving labels) and notifying interested parties about changes of a policy/LSP state. Further, it enables a route map mechanism for BGP and SR-TE colors such that learned BGP routes can be mapped onto SR-TE Policies. This PR does not yet introduce any usable features; it is just infrastructure for other upcoming PRs which will introduce 'pathd', a new SR-TE daemon. Co-authored-by: Renato Westphal <renato@opensourcerouting.org> Co-authored-by: GalaxyGorilla <sascha@netdef.org> Signed-off-by: Sebastien Merle <sebastien@netdef.org>	2020-08-07 11:08:49 +02:00
GalaxyGorilla	7658c2e560	zebra: Fix selection of label chunks in label manager For allocating a new label range the label manager will loop over the existing label chunks and compare the start and end labels with the label range in question. In case a label range should be re-allocated to the existing label chunk, the end label of the chunk is not honored correctly, i.e. the new label range has to be a true subset of the existing label chunk. This is very easily reproducible by re-allocating a single label, i.e. a label range of size 1. This problem is fixed by allowing the mentioned 'end' labels to be equal. Signed-off-by: GalaxyGorilla <sascha@netdef.org>	2020-08-07 11:08:49 +02:00
Donald Sharp	a5897c41e4	zebra: Fix off by one error in no vni command Commit `e2bcfa3946` introduced an off-by-one error in the `no vni XXX` command: @@ -2337,12 +2336,10 @@ DEFUN (no_vrf_vni_mapping, "VNI-ID\n" "prefix-routes-only\n") { - int ret = 0; int filter = 0; - char err[ERR_STR_SZ]; - vni_t vni = strtoul(argv[2]->arg, NULL, 10); ZEBRA_DECLVAR_CONTEXT(vrf, zvrf); + vni_t vni = strtoul(argv[1]->arg, NULL, 10); assert(vrf); assert(zvrf); Fix it. Fixes: #6867 Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-06 16:31:06 -04:00
Rafael Zalamena	084cb75254	Merge pull request #6587 from AnuradhaKaruppiah/evpn-mh-pr-1 EVPN Multihoming part-1	2020-08-06 19:49:14 +00:00
Donald Sharp	3a2bc867f8	Merge pull request #6859 from mjstapp/fix_dplane_sa_again zebra: fix coverity SA warning in dataplane	2020-08-06 09:42:33 -04:00
Anuradha Karuppiah	4a7d61409d	zebra: add ESI to the "show evpn mac vni <> mac <> json" output Sample output - ============= anuradhak-VirtualBox# anuradhak-VirtualBox# show evpn mac vni 1000 mac 00:00:00:00:00:22 json { "00:00:00:00:00:22":{ "type":"remote", "remoteVtep":"0.0.0.0", "localSequence":0, "remoteSequence":0, "detectionCount":0, "isDuplicate":false, "syncNeighCount":0, "esi":"03:44:38:39:ff:ff:02:00:00:02", >>>>>>>>>>>>>>>> "neighbors":"none" } } Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-05 06:46:13 -07:00
Anuradha Karuppiah	9e0c2fd182	bgpd, zebra: remove strcpy, strlen and sprintf calls Replace with safe copy functions - strlcpy, strlcat, strnlen and snprintf. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-05 06:46:13 -07:00
Anuradha Karuppiah	d87ed8d74a	zebra: remove linux/ifbridge.h from rt_netlink.c It is causing build failures because of conflicts with netinet. Instead I have re-defined the MAC-SYNC UAPIs in rt_netlink.c. This is clearly a hack that needs to be re-visited. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-05 06:46:13 -07:00
Anuradha Karuppiah	b169fd6fd5	zebra: support for MAC-IP sync routes MAC-IP routes are used for syncing local entries across redundant switches in an EVPN-MH setup. A path from a peer that has a local ES as destination is tagged as a SYNC path. The SYNC path results in the addition of local MAC and/or local neigh entry in zebra and in the dataplane. Implementation overview ======================= 1. Three new flags "local-inactive", "peer-active" and "peer-proxy" are maintained per-local-MAC and per-local-Neigh entry. 2. The "peer-XXX" flags are set and cleared via SYNC path updates from BGP. Proxy sync paths result in the setting of "peer-proxy" flag (and non-proxies result in the "peer-active"). 3. A neigh entry that has a "peer-XXX" flag set is programmed as "static" in the dataplane. 4. A MAC entry that has a "peer-XXX" flag set or is referenced by a sync-neigh entry (that has a "peer-XXX" flag set) is programmed as "static" in the dataplane. 5. The sync-seq number is used to normalize the MM seq number across all the redundant switches i.e. the max MM seq number across all switches is used by each of the switches. This commit also includes the changes needed for extended MM seq syncing. 6. A MAC/neigh entry has to be local-active or peer-active to be sent to BGP. An entry that is NOT local-active is sent with the proxy flag (so BGP can "proxy" advertise it). 7. The "peer-active" flag is aged out by zebra by using a hold_timer (this is instead of being abruptly dropped on SYNC path delete). This age-out is needed to handle peer-switch restart (procedures are specified in draft-rbickhart-evpn-ip-mac-proxy-adv). The holdtime needs to be sufficiently long to allow an external neighmgr daemon or the dataplane component to independently probe and establish local reachability of a host. The MAC and neigh hold time values are configurable. PS: In the future this probing may happen in FRR itself. 
CLI changes to display sync info ================================ MAC === >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> root@torm-11:mgmt:~# net show evpn mac vni 1000 Number of MACs (local and remote) known for this VNI: 6 Flags: N=sync-neighs, I=local-inactive, P=peer-active, X=peer-proxy MAC Type Flags Intf/Remote ES/VTEP VLAN Seq #'s 00:02:00:00:00:25 local vlan1000 1000 0/0 02:02:00:00:00:02 local PI hostbond1 1000 0/0 02:02:00:00:00:06 remote 03:00:00:00:00:02:11:00:00:01 0/0 02:02:00:00:00:01 local X hostbond1 1000 0/0 00:00:00:00:00:11 local PI hostbond1 1000 0/0 02:02:00:00:00:05 remote 03:00:00:00:00:02:11:00:00:01 0/0 root@torm-11:mgmt:~# root@torm-11:mgmt:~# net show evpn mac vni 1000 mac 00:00:00:00:00:11 MAC: 00:00:00:00:00:11 ESI: 03:00:00:00:00:01:11:00:00:01 Intf: hostbond1(58) VLAN: 1000 Sync-info: neigh#: 0 local-inactive peer-active >>>>>>>>>>>> Local Seq: 0 Remote Seq: 0 Neighbors: No Neighbors root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> neigh ===== >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> root@torm-11:mgmt:~# net show evpn arp vni 1003 Number of ARPs (local and remote) known for this VNI: 4 Flags: I=local-inactive, P=peer-active, X=peer-proxy Neighbor Type Flags State MAC Remote ES/VTEP Seq #'s 2001:fee1:0:3::6 local active 00:02:00:00:00:25 0/0 45.0.3.66 local P active 00:02:00:00:00:66 0/0 45.0.3.6 local active 00:02:00:00:00:25 0/0 fe80::202:ff:fe00:25 local active 00:02:00:00:00:25 0/0 root@torm-11:mgmt:~# root@torm-11:mgmt:~# net show evpn arp vni 1003 ip 45.0.3.66 IP: 45.0.3.66 Type: local State: active MAC: 00:02:00:00:00:66 Sync-info: peer-active >>>>>>>>>>>>>>>> Local Seq: 0 Remote Seq: 0 root@torm-11:mgmt:~# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-05 06:46:13 -07:00
Anuradha Karuppiah	f188e68e5c	zebra: debug flags for MAC-IP sync Filters for zebra debug logs. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2020-08-05 06:46:13 -07:00
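
The flag rules listed in the implementation overview of commit b169fd6fd5 (points 3, 4 and 6) can be sketched as a small model. This is an illustrative Python sketch, not FRR code: `SyncEntry`, `program_static` and `bgp_advertisement` are hypothetical names, and it assumes that "local-active" simply means the "local-inactive" flag is not set.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SyncEntry:
    """Hypothetical model of the sync flags kept per local MAC/neigh entry."""
    local_inactive: bool = False
    peer_active: bool = False
    peer_proxy: bool = False
    # MAC entries only: how many sync-neigh entries (with a peer-XXX flag
    # set) reference this MAC. Always 0 for a neigh entry.
    sync_neigh_count: int = 0

    @property
    def has_peer_flag(self) -> bool:
        # True if either "peer-XXX" flag is set.
        return self.peer_active or self.peer_proxy


def program_static(entry: SyncEntry) -> bool:
    """Points 3 and 4: an entry is programmed as "static" in the dataplane
    if a peer-XXX flag is set, or (for a MAC) a sync-neigh references it."""
    return entry.has_peer_flag or entry.sync_neigh_count > 0


def bgp_advertisement(entry: SyncEntry) -> Optional[str]:
    """Point 6: an entry must be local-active or peer-active to be sent to
    BGP; an entry that is not local-active is sent with the proxy flag.

    Assumption: local-active is the absence of the local-inactive flag.
    Returns None when the entry is not sent to BGP at all.
    """
    local_active = not entry.local_inactive
    if not (local_active or entry.peer_active):
        return None
    return "normal" if local_active else "proxy"
```

Under these assumptions, an entry with only "peer-proxy" set and "local-inactive" set is static in the dataplane but is not sent to BGP, while a "local-inactive" entry with "peer-active" set is sent with the proxy flag.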

4314 Commits