mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-10-31 15:11:11 +00:00

Author	SHA1	Message	Date
Donatas Abraitis	65baedcade	bgpd: bgp_packet_set_size int to void stream size is never checked anywhere in the code, just convert to void. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-29 21:23:53 +02:00
Donatas Abraitis	d08c0c8077	bgpd: Implement rfc9072 Related: https://datatracker.ietf.org/doc/html/rfc9072 Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-11-22 15:34:46 +02:00
Donald Sharp	e1a32ec1c5	bgpd: bgp_announce_route should know if we should force the update or not When calling bgp_announce_route allow it to properly set the flag to force an update to go out or not. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-10-04 07:59:18 -04:00
Donald Sharp	0e5cdd59eb	bgpd: Don't lookup paf structure get straight to the point The paf data structure is stored based upon an internal bgp enum. The code is looking over all AFI/SAFI's and doing a paf_af_find which then calls afindex to find the right paf structure. Let's just loop over the peer->peer_af_array[] and cut straight to the chase. Under some loads the paf_af_find was taking up 6% of the run time. This removes it entirely. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-26 20:17:40 -04:00
Philippe Guibert	046bb34781	bgpd: swap bgp error value with file descriptor value the values were swapped by mistake. fix it. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-09-23 14:23:30 +02:00
Takemasa Imada	b042667a3d	bgpd: minimum-holdtime knob to prevent session establishment with BGP peer with low holdtime. Signed-off-by: Takemasa Imada <takemasa.imada@gmail.com>	2021-08-15 06:08:08 +09:00
Donald Sharp	feb1723846	bgpd: Convert to using peer_established(peer) function We are inconsistently using peer_establiahed(peer) with sometimes using `peer->status == Established`. Just Convert over to using the function for consistency. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-06-07 10:48:36 -04:00
Quentin Young	556beacf10	bgpd: rework BGP_MAX_PACKET_SIZE & friends BGP_MAX_PACKET_SIZE no longer represented the absolute maximum BGP packet size as it did before, instead it was defined as 4096 bytes, which is the maximum unless extended message capability is negotiated, in which case the maximum goes to 65k. That introduced at least one bug - last_reset_cause was undersized for extended messages, and when sending an extended message > 4096 bytes back to a peer as part of NOTIFY data would trigger a bounds check assert. This patch redefines the macro to restore its previous meaning, introduces a new macro - BGP_STANDARD_MESSAGE_MAX_PACKET_SIZE - to represent the 4096 byte size, and renames the extended size to BGP_EXTENDED_MESSAGE_MAX_PACKET_SIZE for consistency. Code locations that definitely should use the small size have been updated, locations that semantically always need whatever the max is, no matter what that is, use BGP_MAX_PACKET_SIZE. BGP_EXTENDED_MESSAGE_MAX_PACKET_SIZE should only be used as a constant when storing what the negotiated max size is for use at runtime and to define BGP_MAX_PACKET_SIZE. Unless there is a future standard that introduces a third valid size it should not be used for any other purpose. Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2021-05-06 11:54:02 -04:00
Donald Sharp	7a75470fe1	bgpd: Delay setting peer data until after decision to allow open Delay setting local data about a remote peer until after BGP has decided to allow an open connection to proceed. Modifying local peer data structures based upon what is received from a peer should not be done until after BGP has decided that the open is allowed to proceed. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-02 07:31:54 -04:00
Donald Sharp	f88221f3b4	bgpd: Cleanup bgp_collision_detect indentation The bgp_collision_detect function is heavily indented. Perform some cleanup to make it easier to read. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-05-02 07:31:45 -04:00
Donatas Abraitis	37916b2b11	Merge pull request #8121 from opensourcerouting/macro-cleanup *: require ISO C11 + semicolons after file-scope macros	2021-03-22 11:00:34 +02:00
Mark Stapp	e0d550dfea	bgpd: use add_event instead of add_timer with zero timeout Just use events in a few places where timers with zero timeout were being used. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-03-17 16:10:13 -04:00
David Lamparter	8451921b70	*: require semicolon after DEFINE_HOOK & co. See previous commit. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:17 +01:00
Mark Stapp	6af96fa383	bgpd: handle socket read errors in the main pthread Add a handler for socket errors that runs in the main pthread, rather than the io pthread. When the io pthread encounters a read error, capture the error and schedule a task for the main pthread. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-03-09 11:13:41 -05:00
Donatas Abraitis	ef56aee47c	bgpd: Add BGP Extended message support Implement https://www.rfc-editor.org/rfc/rfc8654.txt ``` > \| jq '."192.168.10.25".neighborCapabilities.extendedMessage' "advertisedAndReceived" ``` Another side is Bird: ``` BIRD 2.0.7 ready. Name Proto Table State Since Info v4 BGP --- up 19:39:15.689 Established BGP state: Established Neighbor address: 192.168.10.123 Neighbor AS: 65534 Local AS: 65025 Neighbor ID: 192.168.100.1 Local capabilities Multiprotocol AF announced: ipv4 Route refresh Extended message Graceful restart 4-octet AS numbers Enhanced refresh Long-lived graceful restart Neighbor capabilities Multiprotocol AF announced: ipv4 Route refresh Extended message Graceful restart 4-octet AS numbers ADD-PATH RX: ipv4 TX: Enhanced refresh Session: external AS4 Source address: 192.168.10.25 Hold timer: 140.139/180 Keepalive timer: 9.484/60 Channel ipv4 State: UP Table: master4 Preference: 100 Input filter: ACCEPT Output filter: ACCEPT Routes: 9 imported, 3 exported, 8 preferred Route change stats: received rejected filtered ignored accepted Import updates: 9 0 0 0 9 Import withdraws: 2 0 --- 2 0 Export updates: 11 8 0 --- 3 Export withdraws: 0 --- --- --- 0 BGP Next hop: 192.168.10.25 ``` Tested at least as well with to make sure it works with backward compat.: ExaBGP 4.0.2-1c737d99. Arista vEOS 4.21.14M Testing by injecint 10k routes with: ``` sharp install routes 172.16.0.1 nexthop 192.168.10.123 10000 ``` Before extended message support: ``` 2021/03/01 07:18:51 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:51 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:51 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:51 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:51 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:51 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:52 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:52 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:52 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:52 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:52 BGP: u1:s1 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/03/01 07:18:52 BGP: u1:s1 send UPDATE len 2186 (max message len: 4096) numpfx 427 2021/03/01 07:18:53 BGP: u1:s1 send UPDATE len 3421 (max message len: 4096) numpfx 674 ``` After extended message support: ``` 2021/03/01 07:20:11 BGP: u1:s1 send UPDATE len 50051 (max message len: 65535) numpfx 10000 ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-03-04 21:32:36 +02:00
Donatas Abraitis	c051ad7054	bgpd: Initialize bgp_notify.raw_data before passing to bgp_notify_receive() ``` 2523558-==2523558== 2523558-==2523558== Conditional jump or move depends on uninitialised value(s) 2523558:==2523558== at 0x47F242: bgp_notify_admin_message (bgp_debug.c:505) 2523558-==2523558== by 0x47F242: bgp_notify_print (bgp_debug.c:534) 2523558-==2523558== by 0x4BA9BC: bgp_notify_receive (bgp_packet.c:1905) 2523558-==2523558== by 0x4BA9BC: bgp_process_packet (bgp_packet.c:2602) 2523558-==2523558== by 0x4904B7E: thread_call (thread.c:1681) 2523558-==2523558== by 0x48CAA27: frr_run (libfrr.c:1126) 2523558-==2523558== by 0x474B1A: main (bgp_main.c:540) 2523558-==2523558== Uninitialised value was created by a stack allocation 2523558:==2523558== at 0x4BA33D: bgp_process_packet (bgp_packet.c:2529) ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-01-31 16:20:36 +02:00
Donatas Abraitis	bcbeb3f967	bgpd: Use neighbor_events instead of debug_update for route-refresh msg This was somewhy under bgp_debug_udpate() guard and others are under bgp_debug_neighbor_events(). Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-01-05 20:19:42 +02:00
Donatas Abraitis	9af52ccf81	bgpd: Implement enhanced route refresh capability 16:40:49 BGP: 192.168.0.2: sending route-refresh (BoRR) for IPv4/unicast 16:40:51 BGP: 192.168.0.2: sending route-refresh (EoRR) for IPv4/unicast Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-01-05 20:19:41 +02:00
Russ White	ee117a8bd6	Merge pull request #7507 from ton31337/fix/bgpd_do_not_send_update_if_path_really_did_not_change bgpd: Do not send BGP UPDATE if the route actually not changed	2021-01-05 10:26:18 -05:00
Donatas Abraitis	2adac2562a	bgpd: Do not send BGP UPDATE if the route actually not changed Reference: https://www.cmand.org/communityexploration --y2-- / \| \ c1 ---- x1 ---- y1 \| z1 \ \| / --y3-- 1. z1 announces 192.168.255.254/32 to y2, y3. 2. y2 and y3 tags this prefix at ingress with appropriate communities 65004:2 (y2) and 65004:3 (y3). 3. x1 filters all communities at the egress to c1. 4. Shutdown the link between y1 and y2. 5. y1 will generate a BGP UPDATE message regarding the next-hop change. 6. x1 will generate a BGP UPDATE message regarding community change. To avoid sending duplicate BGP UPDATE messages we should make sure we send only actual route updates. In this example, x1 will skip BGP UPDATE to c1 because the actual route is the same (filtered communities - nothing changes). Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-12-11 14:51:05 +02:00
Donatas Abraitis	c386cdd8c9	bgpd: Print afi/safi as strings when handling capability in zlog_debug Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-12-11 11:41:30 +02:00
Donald Sharp	50121ac041	bgpd: Remove restriction on certain connection types under HAVE_CUMULUS Current code when we are establishing a peering relationship when under the HAVE_CUMULUS block will dissallow v4/v6 connections if we do not have v4/v6 addresses applied. This restriction is a bit harsh and should be allowed but warned against. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-20 13:47:14 -05:00
Donald Sharp	f18ba3cd18	bgpd, lib, staticd, tests: Convert to using FOREACH_AFI_SAFI Move the FOREACH_AFI_SAFI macro from bgpd.h to zebra.h( GLOBAL's YOUALL ) Then convert all the places that have the two level for loop to iterate over all afi/safis Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-11-14 18:53:03 -05:00
Madhuri Kuruganti	c385f82af3	bgpd: conditional advertisement - other match rules support Sample Configuration with prefix-list and community match rules --------------------------------------------------------------- R1 ------- R2(DUT) ------- R3 Router2# show running-config Building configuration... Current configuration: ! frr version 7.6-dev-MyOwnFRRVersion frr defaults traditional hostname router log file /var/log/frr/bgpd.log log syslog informational hostname Router2 service integrated-vtysh-config ! debug bgp updates in debug bgp updates out ! debug route-map ! ip route 20.20.0.0/16 blackhole ipv6 route 2001:db8::200/128 blackhole ! interface enp0s9 ip address 10.10.10.2/24 ! interface enp0s10 ip address 10.10.20.2/24 ! interface lo ip address 2.2.2.2/32 ! router bgp 2 bgp log-neighbor-changes no bgp ebgp-requires-policy neighbor 10.10.10.1 remote-as 1 neighbor 10.10.20.3 remote-as 3 ! address-family ipv4 unicast neighbor 10.10.10.1 soft-reconfiguration inbound neighbor 10.10.20.3 soft-reconfiguration inbound neighbor 10.10.20.3 advertise-map ADV-MAP non-exist-map EXIST-MAP exit-address-family ! ip prefix-list DEFAULT seq 5 permit 1.1.1.5/32 ip prefix-list DEFAULT seq 10 permit 1.1.1.1/32 ip prefix-list EXIST seq 5 permit 10.10.10.10/32 ip prefix-list DEFAULT-ROUTE seq 5 permit 0.0.0.0/0 ip prefix-list IP1 seq 5 permit 10.139.224.0/20 ip prefix-list T2 seq 5 permit 1.1.1.5/32 ! bgp community-list standard DC-ROUTES seq 5 permit 64952:3008 bgp community-list standard DC-ROUTES seq 10 permit 64671:501 bgp community-list standard DC-ROUTES seq 15 permit 64950:3009 bgp community-list standard DEFAULT-ROUTE seq 5 permit 65013:200 ! route-map ADV-MAP permit 10 match ip address prefix-list IP1 ! route-map ADV-MAP permit 20 match community DC-ROUTES ! route-map EXIST-MAP permit 10 match community DEFAULT-ROUTE match ip address prefix-list DEFAULT-ROUTE ! line vty ! end Router2# Router2# show ip bgp 0.0.0.0 BGP routing table entry for 0.0.0.0/0 Paths: (1 available, best #1, table default) Advertised to non peer-group peers: 10.10.10.1 10.10.20.3 1 10.10.10.1 from 10.10.10.1 (10.139.224.1) Origin IGP, metric 0, valid, external, best (First path received) Community: 64848:3011 65011:200 65013:200 Last update: Tue Oct 6 02:39:42 2020 Router2# Sample output with non-exist-map when default route present in table -------------------------------------------------------------------- Router2# show ip bgp BGP table version is 4, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, * valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 0.0.0.0/0 10.10.10.1 0 0 1 i > 1.1.1.1/32 10.10.10.1 0 0 1 i > 1.1.1.5/32 10.10.10.1 0 0 1 i > 10.139.224.0/20 10.10.10.1 0 0 1 ? Displayed 4 routes and 4 total paths Router2# show ip bgp neighbors 10.10.20.3 advertised-routes BGP table version is 4, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, * valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 0.0.0.0/0 0.0.0.0 0 1 i > 1.1.1.5/32 0.0.0.0 0 1 i <<<<<<<<< non-exist-map : 0.0.0.0/0 is present so, 10.139.224.0/20 not advertised Total number of prefixes 2 Sample output with non-exist-map when default route not present in table ------------------------------------------------------------------------ Router2# show ip bgp BGP table version is 5, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, * valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 1.1.1.1/32 10.10.10.1 0 0 1 i > 1.1.1.5/32 10.10.10.1 0 0 1 i > 10.139.224.0/20 10.10.10.1 0 0 1 ? Displayed 3 routes and 3 total paths Router2# Router2# Router2# show ip bgp neighbors 10.10.20.3 advertised-routes BGP table version is 5, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 1.1.1.1/32 0.0.0.0 0 1 i > 1.1.1.5/32 0.0.0.0 0 1 i > 10.139.224.0/20 0.0.0.0 0 1 ? <<<<<<<<< non-exist-map : 0.0.0.0/0 is not present so, 10.139.224.0/20 advertised Total number of prefixes 3 Router2# Sample output with exist-map when default route present in table -------------------------------------------------------------------- Router2# show ip bgp BGP table version is 8, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 0.0.0.0/0 10.10.10.1 0 0 1 i > 1.1.1.1/32 10.10.10.1 0 0 1 i > 1.1.1.5/32 10.10.10.1 0 0 1 i > 10.139.224.0/20 10.10.10.1 0 0 1 ? Displayed 4 routes and 4 total paths Router2# Router2# Router2# Router2# Router2# show ip bgp neighbors 10.10.20.3 advertised-routes BGP table version is 8, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, * valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 0.0.0.0/0 0.0.0.0 0 1 i > 1.1.1.1/32 0.0.0.0 0 1 i > 1.1.1.5/32 0.0.0.0 0 1 i > 10.139.224.0/20 0.0.0.0 0 1 ? <<<<<<<<< exist-map : 0.0.0.0/0 is present so, 10.139.224.0/20 advertised Total number of prefixes 4 Router2# Sample output with exist-map when default route not present in table -------------------------------------------------------------------- Router2# show ip bgp BGP table version is 9, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, * valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path > 1.1.1.1/32 10.10.10.1 0 0 1 i > 1.1.1.5/32 10.10.10.1 0 0 1 i > 10.139.224.0/20 10.10.10.1 0 0 1 ? Displayed 3 routes and 3 total paths Router2# Router2# Router2# Router2# show ip bgp neighbors 10.10.20.3 advertised-routes BGP table version is 9, local router ID is 2.2.2.2, vrf id 0 Default local pref 100, local AS 2 Status codes: s suppressed, d damped, h history, valid, > best, = multipath, i internal, r RIB-failure, S Stale, R Removed Nexthop codes: @NNN nexthop's vrf id, < announce-nh-self Origin codes: i - IGP, e - EGP, ? - incomplete Network Next Hop Metric LocPrf Weight Path *> 1.1.1.5/32 0.0.0.0 0 1 i <<<<<<<<< exist-map : 0.0.0.0/0 is not present so, 10.139.224.0/20 not advertised Total number of prefixes 1 Router2# Signed-off-by: Madhuri Kuruganti <k.madhuri@samsung.com>	2020-10-27 16:15:36 +05:30
Don Slice	f4d2dd841d	bgpd: delay local routes until update-delay is over Problem found that turning an update-delay would only delay prefixes learned from peers by delaying bestpath, but would allow local routes (network statements or redistributed) to be immediately advertised, followed by an End of Rib indicator. This fix delays sending local routes until the update-delay process is completed, which matches what testing shows other vendors do.. Ticket: CM-31743 Signed-off-by: Don Slice <dslice@nvidia.com>	2020-10-26 04:06:25 -07:00
Quentin Young	c7bb4f006b	lib, bgpd: convert lttng tracepoints to frrtrace() - tracepoint() -> frrtrace() - tracelog() -> frrtracelog() - tracepoint_enabled() -> frrtrace_enabled() Also removes copypasta'd #ifdefs for those LTTng macros, those are handled in lib/trace.h Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2020-10-23 15:13:51 -04:00
Quentin Young	d9a03c5736	bgpd: add basic packet-related tracepoints Add tracepoints for: - packet pushed to internal rx queue - packet dequeued from rx queue and processed Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2020-10-23 15:13:51 -04:00
Donatas Abraitis	23d0a75356	bgpd: Convert inet_ntoa to %pI4/inet_ntop Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-10-18 11:22:30 +03:00
Trey Aspelund	7aa4fd5ba7	bgpd: Use bgp instance's default keepalive interval if < (holdtime/3) bgp->default_keepalive was not considered when setting peer->v_keepalive, causing the effective keepalive interval to always be (holdtime/3), even when default_keepalive < (holdtime/3). This ensures that the default_keepalive is used when it's set and is < (holdtime/3). Signed-off-by: Trey Aspelund <taspelund@cumulusnetworks.com> (cherry picked from commit d8bf8c6128f2e493d473148213bd663a500c7f73)	2020-09-25 09:46:54 -04:00
Quentin Young	765b07d9ff	bgpd: remove extra hold-timer reset Handler function doesn't need to reset the hold timer, this is done during the FSM update. Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2020-09-15 20:15:08 -04:00
Donatas Abraitis	8336c896fd	bgpd: Add `neighbor <neigh> shutdown rtt` command This would be useful in cases with lots of peers and shutdown them automatically if RTT goes above the specified limit. A host with 512 or more IPv6 addresses has a higher latency due to ipv6_addr_label(). This method tries to pick the best candidate address fo outgoing connection and literally increases processing latency. ``` Samples: 28 of event 'cycles', Event count (approx.): 22131542 Children Self Command Shared Object Symbol + 100.00% 0.00% ping6 [kernel.kallsyms] [k] entry_SYSCALL_64_fastpath + 100.00% 0.00% ping6 [unknown] [.] 0x0df0ad0b8047022a + 100.00% 0.00% ping6 libc-2.17.so [.] __sendto_nocancel + 100.00% 0.00% ping6 [kernel.kallsyms] [k] sys_sendto + 100.00% 0.00% ping6 [kernel.kallsyms] [k] SYSC_sendto + 100.00% 0.00% ping6 [kernel.kallsyms] [k] sock_sendmsg + 100.00% 0.00% ping6 [kernel.kallsyms] [k] inet_sendmsg + 100.00% 0.00% ping6 [kernel.kallsyms] [k] rawv6_sendmsg + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ip6_dst_lookup_flow + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ip6_dst_lookup_tail + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ip6_route_get_saddr + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ipv6_dev_get_saddr + 100.00% 0.00% ping6 [kernel.kallsyms] [k] __ipv6_dev_get_saddr + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ipv6_get_saddr_eval + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ipv6_addr_label + 100.00% 100.00% ping6 [kernel.kallsyms] [k] __ipv6_addr_label + 0.00% 0.00% ping6 [kernel.kallsyms] [k] schedule ``` This is how it works: ``` ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".estimatedRttInMsecs' 9 ~# tc qdisc add dev eth1 root netem delay 120ms ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".estimatedRttInMsecs' 89 ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".estimatedRttInMsecs' null ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".lastResetDueTo' "Admin. shutdown" ``` Warning message: bgpd[14807]: 192.168.0.2 shutdown due to high round-trip-time (200ms > 150ms) Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-09-07 22:30:19 +03:00
Donatas Abraitis	e410d56307	bgpd: Update RTT on KEEPALIVE message Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-09-07 17:25:57 +03:00
Russ White	e3dcd431cd	Merge pull request #6938 from opensourcerouting/bgp-instance-shutdown bgpd: BGP instance administrative shutdown	2020-08-25 10:31:01 -04:00
Renato Westphal	4fe5bc8c62	Merge pull request #6943 from ton31337/fix/replace_sizeof_instead_of_constant_for_bgp_dump_attr bgpd: Use sizeof() in bgp_dump_attr()	2020-08-19 07:36:13 -03:00
Donatas Abraitis	5022c8331d	bgpd: Use sizeof() in bgp_dump_attr() Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-08-18 21:43:07 +03:00
Donald Sharp	b5c2113e47	bgpd: Actually respect RFC 6286 for router_id The RFC states: The BGP Identifier is a 4-octet, unsigned, non-zero integer that should be unique within an AS. The value of the BGP Identifier for a BGP speaker is determined on startup and is the same for every local interface and every BGP peer. We were going slightly beyond this and ensuring that the address was a specific range of addresses which is no longer relevant. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-08-17 13:52:19 -04:00
David Schweizer	cb9196e77a	bgpd: bgp instance administrative shutdown. * Fixed integration in FSM and packet handling. * Added CLI "show" output, incl. JSON. * For review and testing only. Signed-off-by: David Schweizer <dschweizer@opensourcerouting.org>	2020-08-14 10:23:34 +02:00
Donatas Abraitis	deee0dd830	Merge pull request #6519 from RichardWu-Hebut/master bgpd: Fix the bug that BGP MRAI does not work.	2020-07-16 16:49:08 +03:00
David Lamparter	3efd0893d0	*: un-split strings across lines Remove mid-string line breaks, cf. workflow doc: .. [#tool_style_conflicts] For example, lines over 80 characters are allowed for text strings to make it possible to search the code for them: please see `Linux kernel style (breaking long lines and strings) <https://www.kernel.org/doc/html/v4.10/process/coding-style.html#breaking-long-lines-and-strings>`_ and `Issue #1794 <https://github.com/FRRouting/frr/issues/1794>`_. Scripted commit, idempotent to running: ``` python3 tools/stringmangle.py --unwrap `git ls-files \| egrep '\.[ch]$'` ``` Signed-off-by: David Lamparter <equinox@diac24.net>	2020-07-14 10:37:25 +02:00
Richard Wu	b10b6d5272	bgpd: Fix the bug that BGP MRAI does not work. Issue: bgp_process_writes will be called when the fd is writable. And it will bgp_generate_updgrp_packets to generate the update packets no matter MRAI is set or not. Fix: bgp_generate_updgrp_packets thread will return without sending any update when MRAI timer is still running. Signed-off-by: Richard Wu <wutong23@baidu.com>	2020-06-24 16:30:12 +08:00
Quentin Young	772270f3b6	*: sprintf -> snprintf Replace sprintf with snprintf where straightforward to do so. - sprintf's into local scope buffers of known size are replaced with the equivalent snprintf call - snprintf's into local scope buffers of known size that use the buffer size expression now use sizeof(buffer) - sprintf(buf + strlen(buf), ...) replaced with snprintf() into temp buffer followed by strlcat Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2020-04-20 19:14:33 -04:00
David Lamparter	cd05906c41	Merge pull request #6071 from ton31337/feature/rfc6286 bgpd: Add support for Autonomous-System-Wide Unique BGP Identifier	2020-04-03 15:16:59 +02:00
Donatas Abraitis	036937f042	bgpd: Correct two comments typos for bgp_collision_detect() Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-31 17:54:40 +03:00
Donatas Abraitis	787c30209f	bgpd: Add support for Autonomous-System-Wide Unique BGP Identifier Implement https://tools.ietf.org/html/rfc6286 Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-31 17:14:56 +03:00
Donatas Abraitis	3dc339cdc2	bgpd: Convert lots of int type functions to bool/void Some were converted to bool, where true/false status is needed. Converted to void only those, where the return status was only false or true. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-21 14:59:18 +02:00
Quentin Young	27f83b0b18	Merge pull request #6028 from mjstapp/fix_func_macros bgpd,zebra: replace some more FUNCTION macros with __func__	2020-03-18 11:53:58 -04:00
Mark Stapp	0767b4f34e	bgpd,zebra: replace some more FUNCTION macros Replace some remaining __FUNCTION__ macros with __func__, now that we're trying to converge that way. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-03-18 08:13:32 -04:00
Russ White	047315df42	Merge pull request #5954 from ton31337/feature/rfc7607 bgpd: Proscribe the use of AS 0 (zero)	2020-03-17 10:27:35 -04:00
Donatas Abraitis	33d022bcf6	bgpd: Proscribe the use of AS 0 (zero) Implements https://tools.ietf.org/html/rfc7607 Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-17 13:31:23 +02:00
Donatas Abraitis	3893aeeea3	bgpd: Add subcodes for BGP Finite State Machine Error Implement https://tools.ietf.org/html/rfc6608 I used python scapy library to send a notification message in OpenSent state: ``` send(IP(dst="192.168.0.1")/TCP(sport=sp, dport=179, seq=rec.ack, ack=rec.seq + 1, flags=0x18)/BGPHeader(type=3)/BGPNotification(error_code=4, error_subcode=0)) ``` Logs from FRR: ``` %NOTIFICATION: sent to neighbor 192.168.0.2 5/1 (Neighbor Events Error/Receive Unexpected Message in OpenSent State) 0 bytes ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-16 09:22:22 +02:00
Donatas Abraitis	15569c58f8	*: Replace __PRETTY_FUNCTION__/__FUNCTION__ to __func__ Just keep the code cool. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-05 20:23:23 +02:00
Donald Sharp	5ca840a3e1	bgpd: Cleanup indentation in bgp_route_refresh_receive Some code in bgp_route_refresh_receive was spread across several lines because of an end of line commit. Move comment to a place to allow better formating. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-02-27 10:04:37 -05:00
Donald Sharp	1bb379bf4e	bgpd: Cleanup set but unused variables There existed some variables set but never used. Clean this up. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-02-27 09:41:58 -05:00
Donald Sharp	3dbe2b6061	bgpd: Add a better breadcrumb for when bgp is missconfiged Currently During bgp open collision resolution if both the router-id's are the same, we correctly follow the RFC and close the connection. The problem is of course that there is no notification of the error in configuration to the end user other than a subtle open debug message. Explicitly call out the miss-configuration as an error message as that this miss-config took several hours of debugging to notice. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-02-19 10:52:14 -05:00
Renato Westphal	4b08a72ed1	Merge pull request #5763 from ton31337/fix/return_without_parent *: Remove parenthesis on return for constants	2020-02-10 18:49:06 -03:00
Donatas Abraitis	95f7965d09	*: Remove parenthesis on return for constants Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-02-09 14:21:56 +02:00
Donatas Abraitis	975a328e2e	*: Replace s_addr 0 => INADDR_ANY Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-02-06 09:00:12 +02:00
Donatas Abraitis	85c58de773	Merge pull request #5761 from qlyoung/fix-bgp-gr-cruft Fix bgp gr style	2020-02-06 08:16:25 +02:00
Quentin Young	362353195a	bgpd, lib: fix style from BGP GR code This patch fixes the noncompliant style for the following commit range: `4a6e80fbf` `2ba1fe695` `efcb2ebbb` `8c48b3b69` `dc95985fe` `0f0444fbd` `85ef4179a` `eb451ee58` `2d3dd828d` `9e3b51a7f` `d6e3c15b6` `34aa74486` `6102cb7fe` `d7b3cda6f` `2bb5d39b1` `5f9c1aa29` `5cce3f054` `3a75afa4b` `f009ff269` `cfd47646b` `2986cac29` `055679e91` `034e185dc` `794b37d52` `b0965c44e` `949b0f24f` `63696f1d8` Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2020-02-04 15:19:04 -05:00
Quentin Young	b3ba5dc7fe	*: don't null after XFREE; XFREE does this itself Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2020-02-03 11:22:13 -05:00
bisdhdh	4a6e80fbf2	bgpd: Added bgp graceful restart additional debug logs. bgp graceful restart additional debug logs, resolved merge conflicts. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:36:33 +05:30
bisdhdh	2ba1fe6951	bgpd: BGP Garaceful Restart debug logs. Reorganizing bgp gr debug logs and code review comments. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:36:33 +05:30
bisdhdh	0f0444fbd8	bgpd: Adding helper caller hooks for BGPD-ZEBRA integration for GR. *Adding helper caller hooks function for signalling from BGPD to ZEBRA to enable or disable GR feature in ZEBRA depending on bgp per peer gr configuration. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:36:33 +05:30
bisdhdh	9e3b51a7f3	bgpd: Restarting node does not send EOR after the convergence. After a restarting router comes up and the bgp session is successfully established with the peer. If the restarting router doesn’t have any route to send, it send EOR to the peer immediately before receiving updates from its peers. Instead the restarting router should send EOR, if the selection deferral timer is not running OR count of eor received and eor required are matches then send EOR. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	d6e3c15b62	bgpd: Added hidden CLI command to disable sending of End-of-Rib. BGP disable EOR sending is a useful command for testing various scenarios of BGP graceful restart. * Added the hidden CLI command : bgp graceful-restart disable-eor * The CLI will not be displayed in "show running-config" and will not be stored in configuration file. * When enabled, EOR will not be sent to peer Signed-off-by: Biswajit Sadhu <sadhub@vmware.com> Signed-off-by: Soman K S <somanks@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	5cce3f0544	bgpd: Adding BGP GR change mode config apply on notification sent & received. * Changing GR mode on a router needs a session reset from the SAME router to negotiate new GR capability. * The present GR implementation needs a session reset after every new BGP GR mode change. * When BGP session reset happens due to sending or receiving BGP notification after changing BGP GR mode, there is no need of explicit session reset. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	f009ff2697	bgpd: Adding Selection Deferral Timer handler changes. * Selection Deferral Timer for Graceful Restart. * Added selection deferral timer handling function. * Route marking as selection defer when update message is received. * Staggered processing of routes which are pending best selection. * Fix for multi-path test case. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
Trey Aspelund	a0e89d545b	bgpd: Remove misleading 'NOTIFICATION' string from End-of-RIB log 'NOTIFICATION' string in this message incorrectly implies a BGP Notification message was the cause of this log. Removing it to reduce confusion and replacing with function name. Signed-off-by: Trey Aspelund <taspelund@cumulusnetworks.com>	2019-12-18 15:58:26 -05:00
Donatas Abraitis	0e35025eb4	bgpd: Use BGP_NOTIFY_SUBCODE_UNSPECIFIC value for bgp_notify_send() where 0 Just a code cleanup to keep the code consistent. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2019-11-10 17:54:37 +02:00
David Lamparter	00dffa8cde	lib: add frr_with_mutex() block-wrapper frr_with_mutex(...) { ... } locks and automatically unlocks the listed mutex(es) when the block is exited. This adds a bit of safety against forgetting the unlock in error paths & co. and makes the code a slight bit more readable. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-09-03 17:15:17 +02:00
David Lamparter	d35a6c2895	bgpd/bmp: use bgp packet dump hook Signed-off-by: David Lamparter <equinox@diac24.net>	2019-08-30 19:00:45 +02:00
Yasuhiro Ohara	6c29258c96	bgpd/bmp: Initial BMP implementation. This is the initial BMP skeleton from Yasuhiro Ohara. (License/Signoff note: code published on github as GPLv2+.) Signed-off-by: David Lamparter <equinox@diac24.net>	2019-08-30 19:00:45 +02:00
Dinesh G Dutt	5cb5f4d04d	bgpd: Eliminate all incorrect formulations of afi/safi in JSON In a number of places, the JSON output had invalid key names for AFI/SAFI. For example, the key name in JSON was "IPv4 Unicast" which is invalid as a JSON Key name. Many JSON tools such as those used in Ansible, jq etc. all fail to parse the output in these scenarios. The valid name is ipv4Unicast. There's already a routine afi_safi_json() defined to handle this change, but it was not consistently called. The non-JSON version was called afi_safi_print() and it merely returned the CLI version of the string, didn't print anything. This patch deals with this issue by: - Renaming afi_safi_print to get_afi_safi_str() - get_afi_safi_str takes an additional param, for_json which if true will return the JSON-valid string - Renaming afi_safi_json to get_afi_safi_json_str() - Creating a new routine get_afi_safi_vty_str() for printing to vty - Consistently using get_afi_safi_str() with the appropriate for_json value Signed-off-by: Dinesh G Dutt <5016467+ddutt@users.noreply.github.com>	2019-08-27 14:05:39 +00:00
David Lamparter	6fd04594bb	bgpd: add packet send hook Unlike MRT dumps, BMP also provides packets sent by the router. Add another hook for that. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-07-03 16:59:12 +02:00
David Lamparter	584470fb5f	bgpd: add & use bgp packet dump hook The MRT dump code is already hooked in at the right places to write out packets; the BMP code needs exactly the same access so let's make this a hook. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-07-03 16:58:26 +02:00
David Lamparter	b4d46cc9b1	bgpd: count some per-peer stats (for BMP) These counters are accessible through BMP and may be useful to monitor bgpd. A CLI to show them could also be added if people are interested. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-07-03 16:53:12 +02:00
David Lamparter	1a1f453436	bgpd: fix last_reset_cause setup last_reset_cause_size is the length used in last_reset_cause[]. It's straight up used wrong here; we're saving off a reset cause and need to check against the available size in last_reset_cause[]. This could actually have led to (hopefully rare) crashes in the assert there, since the assert condition might fail incorrectly. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-07-03 16:50:36 +02:00
Donald Sharp	7ec5e2bf70	Merge pull request #4514 from opensourcerouting/warnings-20190612 *: kill more warnings	2019-06-17 15:19:42 -04:00
David Lamparter	6dcef54cbf	bgpd: fix uninitialized & wrong endian NOTIFY notify_data_remote_as4 would contain garbage if optlen == 0, and also as4 is in host byte order while the notify needs network byte order. Signed-off-by: David Lamparter <equinox@diac24.net>	2019-06-13 20:43:13 +02:00
Donald Sharp	748a041f09	bgpd, lib: Add iana_afi2str and iana_safi2str for eye pleasing strings Modify the code such that we can auto turn the iana values of afi and safi to pleasant to read strings. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2019-06-02 14:51:52 -04:00
Quentin Young	5041dc4fbf	bgpd: suppress dead store warning Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2019-05-29 18:03:26 +00:00
Quentin Young	552d6491f0	bgpd: remove strcpy, strcat Replace with strlcpy, strlcat Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2019-05-29 18:02:57 +00:00
Quentin Young	db878db01a	bgpd: fix false compiler warning Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2019-05-29 18:02:57 +00:00
nikos	9738e9aa36	bgpd: IPv6 session flapping with MP_REACH_NLRI and 0.0.0.0 in NEXT_HOP attribute This is causing interop issues with vendors. According to the RFC, receiver should ignore the NEXT_HOP attribute with MP_REACH_NLRI present. Signed-off-by: nikos <ntriantafillis@gmail.com>	2019-05-10 12:52:17 -07:00
Donatas Abraitis	513386b57f	bgpd: Do not send UPDATE message with maximum-prefix When using maximum-prefix and count is overflow BGP sends UPDATE message: Apr 15 20:45:06 exit1-debian-9 bgpd[9818]: 192.168.0.2 [Error] Error parsing NLRI Apr 15 20:45:06 exit1-debian-9 bgpd[9818]: %NOTIFICATION: sent to neighbor 192.168.0.2 3/10 (UPDATE Message Error/Invalid Network Field) 0 bytes Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2019-04-24 14:51:06 +03:00
Donald Sharp	e82d19a3d4	bgpd: Modify End of Rib notification to INFO The End of Rib notification in BGP is useful to know no matter the circumstances. So change this from a debug message to an info and cleanup the message a bit and add vrf we are in. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2019-01-07 17:51:27 +01:00
David Lamparter	0437e10517	*: spelchek Signed-off-by: David Lamparter <equinox@diac24.net>	2018-10-25 20:10:57 +02:00
Quentin Young	1c50c1c0d6	*: style for EC replacements Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 19:38:57 +00:00
Quentin Young	450971aa99	*: LIB_[ERR\|WARN] -> EC_LIB Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 19:34:28 +00:00
Quentin Young	e50f7cfdbd	bgpd: BGP_[WARN\|ERR] -> EC_BGP Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 18:51:04 +00:00
Quentin Young	ade6974def	*: style for flog_warn conversions Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-06 20:56:41 +00:00
Donald Sharp	63d430ceee	bgpd: Convert zlog_warn to flog_warn for bgp_packet.c Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-09-06 20:50:58 +00:00
Quentin Young	09c866e34d	*: rename ferr_zlog -> flog_err_sys Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Quentin Young	af4c27286d	*: rename zlog_fer -> flog_err Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Donald Sharp	02705213b1	bgpd: Convert to using LIB_ERR_XXX where possible Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Don Slice	14454c9fdd	bgpd: implement zlog_ferr facility for enhance error messages in bgp Signed-off-by: Don Slice <dslice@cumulusnetworks.com<	2018-08-14 20:02:05 +00:00
Pascal Mathis	b90a8e13ee	bgpd: Implement group-overrides for peer timers This commit implements BGP peer-group overrides for the timer flags, which control the value of the hold, keepalive, advertisement-interval and connect connect timers. It was kept separated on purpose as the whole timer implementation is quite complex and merging this commit together with with the other flag implementations did not seem right. Basically three new peer flags were introduced, namely PEER_FLAG_ROUTEADV, PEER_FLAG_TIMER and PEER_FLAG_TIMER_CONNECT. The overrides work exactly the same way as they did before, but introducing these flags made a few conditionals simpler as they no longer had to compare internal data structures against eachother. Last but not least, the test suite has been adjusted accordingly to test the newly implemented flag overrides. Signed-off-by: Pascal Mathis <mail@pascalmathis.com>	2018-06-14 18:55:30 +02:00
Quentin Young	bd6b2706b3	bgpd: remove unused variable Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-04-13 17:17:42 -04:00
Quentin Young	e0981960cd	bgpd: double-check notify data when debugging clang-analyze complains that data may be null, and since we didn't explicitly check it (although we did check the overall packet length minus the header length) it has a point. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-04-13 17:17:42 -04:00
jaydom	7c40bf391c	bgpd: add flowspec feature This work is derived from a work done by China-Telecom. That initial work can be found in [0]. As the gap between frr and quagga is important, a reworks has been done in the meantime. The initial work consists of bringing the following: - Bringing the client side of flowspec. - the enhancement of address-family ipv4/ipv6 flowspec - partial data path handling at reception has been prepared - the support for ipv4 flowspec or ipv6 flowspec in BGP open messages, and the internals of BGP has been done. - the memory contexts necessary for flowspec has been provisioned In addition to this work, the following has been done: - the complement of adaptation for FS safi in bgp code - the code checkstyle has been reworked so as to match frr checkstyle - the processing of IPv6 FS NLRI is prevented - the processing of FS NLRI is stopped ( temporary) [0] https://github.com/chinatelecom-sdn-group/quagga_flowspec/ Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com> Signed-off-by: jaydom <chinatelecom-sdn-group@github.com>	2018-03-30 14:00:47 +02:00
Quentin Young	d7c0a89a3a	*: use C99 standard fixed-width integer types The following types are nonstandard: - u_char - u_short - u_int - u_long - u_int8_t - u_int16_t - u_int32_t Replace them with the C99 standard types: - uint8_t - unsigned short - unsigned int - unsigned long - uint8_t - uint16_t - uint32_t Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-03-27 15:13:34 -04:00
Donald Sharp	5410015a79	bgpd: peer->bgp must be non NULL We lock and set peer->bgp at peer creation and only remove it at deletion. Therefore these tests are not needed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-03-20 19:09:06 -04:00
Lou Berger	996c93142d	*: conform with COMMUNITY.md formatting rules, via 'make indent' Signed-off-by: Lou Berger <lberger@labn.net>	2018-03-06 14:04:32 -05:00
Quentin Young	a127f33b97	bgpd: fix race condition causing occasional assert If a BGP message header fails validation we send a BGP NOTIFICATION from the I/O thread. At this time we clear the output buffer, push a NOTIFICATION and then call the manual write function for errors. But in between the push and the write the main thread could have pushed some other message. Thus we need to hold the lock for the duration of the function. TOCTTOU. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-01-23 18:51:34 -05:00
Quentin Young	0112e9e0b9	bgpd: use atomic_* ops on _Atomic variables Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-01-09 15:40:48 -05:00
Quentin Young	8ec586b01b	bgpd: fix potential deadlock With the way things are set up, this bit of code would never actually cause a deadlock, but would be highly likely in the future. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-12-01 13:41:27 -05:00
Quentin Young	6ec98a2f37	bgpd: small optimization with UPDATE generation After a batch of generated UPDATEs, call bgp_writes_on() once instead of after generating each packet. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 17:17:16 -05:00
Quentin Young	c58b0f46dd	bgpd: use FOREACH_AFI_SAFI() Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:58:37 -05:00
Quentin Young	4961a5a2eb	bgpd: intelligently adjust coalesce timer The subgroup coalesce timer controls how long updates to a particular subgroup are delayed in order to allow additional peers to join the subgroup. Presently the timer value is 200 ms. Increase it to 1 second and adjust up as peers are configured, with an upper cap at 10s. This cuts convergence time by a factor of 3 at large scale (300+ peers, 1000+ prefixes per peer). Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:47:51 -05:00
Quentin Young	934af4587f	bgpd: turn off keepalives when sending NOTIFY This is necessary because otherwise between the time we wipe the output buffer and the time we push the NOTIFY onto it, the KA generation thread could have pushed a KEEPALIVE in the middle. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:07 -05:00
Quentin Young	d0ad6d8e5f	bgpd: yield more when generating UPDATEs In the same vein as the round-robin input commit, this re-adds logic for limiting the amount of time spent generating UPDATEs per generation cycle. Missed this when shifting around wpkt_quanta; prior to MT it limited both calls to write() as well as UPDATE generation.	2017-11-30 16:18:07 -05:00
Quentin Young	9773a576bd	bgpd: restore packet input limit Unfortunately, batching input processing severely impacts BGP initial convergence times. As a consequence of the way update-groups were implemented, advancing the state of the routing table based on prefixes learned from one peer prior to all (or at least most) peers establishing connections will cause us to start generating outbound UPDATEs, which is a very expensive operation at present. This intensive processing starves out bgp_accept(), delaying connection of additional peers. When additional peers do connect the problem gets worse and worse, yielding approximately exponential growth in convergence time dependent on both peering and prefix counts. This behavior is present pre-multithreading as well, but batched input exacerbates it. Round-robin input processing marginally harms convergence times for small topologies but should allow much larger topologies to function within reasonable performance thresholds. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:06 -05:00
Quentin Young	4af766600a	bgpd: schedule process packet as timer Different places scheduling the same thread should use the same semantics and thread type. Additionally providing the back reference here makes sure we only schedule the job once and avoids flooding the event queue with jobs to process an empty buffer.	2017-11-30 16:18:06 -05:00
Quentin Young	af1e1dc69e	bgpd: re-add write trigger logic Apparently I didn't fully understand how subgroup packets make their way out to individual peers. Turns out (on the base branch) we just busy poll while waiting for packets to make their way onto subgroup queues. While this needs to be fixed in the future, for now readding this logic fixes performance issues with convergence.	2017-11-30 16:18:06 -05:00
Quentin Young	becedef6c3	bgpd, tests: comment formatting Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	e3c7270d49	bgpd: fix uninitialized result code Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:04 -05:00
Quentin Young	3b73658c7c	bgpd: lift read-quanta restriction Per previous work to ensure all FSM state is updated after processing each message, read-quanta should be safe to set > 1. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:03 -05:00
Quentin Young	3735936bda	bgpd: free notify packet after writing Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:02 -05:00
Quentin Young	d815168795	bgpd: fix bgp_packet.c / bgp_fsm.c organization Despaghettification of bgp_packet.c and bgp_fsm.c Sometimes we call bgp_event_update() inline packet parsing. Sometimes we post events instead. Sometimes we increment packet counters in the FSM. Sometimes we do it in packet routines. Sometimes we update EOR's in FSM. Sometimes we do it in packet routines. Fix the madness. bgp_process_packet() is now the centralized place to: - Update message counters - Execute FSM events in response to incoming packets FSM events are now executed directly from this function instead of being queued on the thread_master. This is to ensure that the FSM contains the proper state after each packet is parsed. Otherwise there could be race conditions where two packets are parsed in succession without the appropriate FSM update in between, leading to session closure due to receiving inappropriate messages for the current FSM state. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:02 -05:00
Quentin Young	555e09d4a2	bgpd: atomize write-quanta, add read-quanta bgpd supports setting a write-quanta that serves as a hint on how many packets to write per I/O cycle. Now that input is buffered, it makes sense to add the equivalent parameter for how many packets are processed per cycle. This is not how many packets are read off the wire per I/O cycle; rather it is how many packets are processed from the input buffer in a given cycle after having been read off the wire and sanitized. Since these values must be used from multiple threads, they have also been made atomic. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:00 -05:00
Quentin Young	9eb217ff69	bgpd: batched i/o Instead of reading a packet header and the rest of the packet in two separate i/o cycles, instead read a chunk of data at one time and then parse as many packets as possible out of the chunk. Also changes bgp_packet.c to batch process packets. To avoid thrashing on useless mutex locks, the scheduling call for bgp_process_packet has been changed to always succeed at the cost of no longer being cancel-able. In this case this is acceptable; following the pattern of other event-based callbacks, an additional check in bgp_process_packet to ignore stray events is sufficient. Before deleting the peer all events are cleared which provides the requisite ordering. XXX: chunk hardcoded to 5, should use something similar to wpkt_quanta Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:00 -05:00
Quentin Young	424ab01d0f	bgpd: implement buffered reads * Move and modify all network input related code to bgp_io.c * Add a real input buffer to `struct peer` * Move connection initialization to its own thread.c task instead of piggybacking off of bgp_read() * Tons of little fixups Primary changes are in bgp_packet.[ch], bgp_io.[ch], bgp_fsm.[ch]. Changes made elsewhere are almost exclusively refactoring peer->ibuf to peer->curr since peer->ibuf is now the true FIFO packet input buffer while peer->curr represents the packet currently being processed by the main pthread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	56257a44e4	bgpd: move bgp i/o to a separate source file After implement threading, bgp_packet.c was serving the double purpose of consolidating packet parsing functionality and handling actual I/O operations. This is somewhat messy and difficult to understand. I've thus moved all code and data structures for handling threaded packet writes to bgp_io.[ch]. Although bgp_io.[ch] only handles writes at the moment to keep the noise on this commit series down, for organization purposes, it's probably best to move bgp_read() and its trappings into here as well and restructure that code so that read()'s happen in the pthread and packet processing happens on the main thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	0ca8b79f38	bgpd: use new threading infra Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	2bb745fe02	bgpd: stop pseudo-blocking in bgp_write If write() indicates that we should retry, just move along to the next peer and come back later. No need to burn write() in a loop. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	419dfe6a70	bgpd: dynamically allocate synchronization primitives Changes all synchronization primitives to be dynamically allocated. This should help catch any subtle errors in pthread lifecycles. This change also pre-initializes synchronization primitives before threads begin to run, eliminating a potential race condition that probably would have caused a segfault on startup on a very fast box. Also changes mutex and condition variable allocations to use MTYPE_PTHREAD and updates tests to do the proper initializations. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	49507a6f6a	bgpd: remove unused `struct thread` from peer * Remove t_write * Remove t_keepalive These have been replaced by pthreads and are no longer needed. Since some code looks at these values to determine if the threads are scheduled, also add a new bitfield to store the same information. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	2d4ee77490	lib, bgpd: implement pthread lifecycle management Removes the WiP shim and implements proper thread lifecycle management. * Declare necessary pthread_t's in bgp_master * Define new MTYPE in lib/thread.c for pthreads * Allocate and free BGP's pthreads appropriately * Terminate and join threads appropriately Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	07a1652682	bgpd: move bgp_connect_check() to bgp_fsm.c Prior to this change, after initiating a nonblocking connection to the remote peer bgpd would call both BGP_READ_ON and BGP_WRITE_ON on the peer's socket. This resulted in a call to select(), so that when some event (either a connection success or failure) occurred on the socket, one of bgp_read() or bgp_write() would run. At the beginning of each of those functions was a hook into bgp_connect_check(), which checked the socket status and issued the correct connection event onto the BGP FSM. This code is better suited for bgp_fsm.c. Placing it there avoids scheduling packet reads or writes when we don't know if the socket has established a connection yet, and the specific functionality is a better fit for the responsibility scope of this unit. This change also helps isolate the responsibilities of the packet-writing kernel thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	80bd61c416	bgpd: move update group processing to main thread Prior to this change, packets generated for update groups were taken off of the (independent) buffer for the update group, reformatted for the specific peer under question and sent off inline with bgp_write(). Since the operations of this code path can include the merging and pruning of subgroups and are too large to safely synchronize, this change moves that logic to execute after each tick of the write thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	d3ecc69e5f	bgpd: move packet writes into dedicated pthread * BGP_WRITE_ON() removed * BGP_WRITE_OFF() removed * peer_writes_on() added * peer_writes_off() added * bgp_write_proceed_actions() removed Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	05c7a1cc93	bgpd: use FOREACH_AFI_SAFI where possible Improves consistency and readability. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-21 13:02:06 -05:00
Renato Westphal	965a99f58a	Merge pull request #1406 from donaldsharp/bgpd_ecommunity_crash bgpd: Fix crash with ecommunity string	2017-11-06 15:08:07 -02:00
Renato Westphal	f498ca82bd	Merge pull request #1370 from dslicenc/cm18408-bgp-timers bgpd: fix various problems with hold/keepalive timers	2017-11-06 14:06:12 -02:00
Donald Sharp	d2b6417bd6	bgpd: Prevent infinite loop when reading capabilities If the user has configured the ability to override the capabilities or if the afi/safi passed as part of the _MP capability is not understood, then we can enter into an infinite loop as part of the capability parsing. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2017-11-06 10:38:05 -05:00
Don Slice	d25e4efc52	bgpd: fix various problems with hold/keepalive timers Problem reported that we weren't adjusting the keepalive timer correctly when we negotiated a lower hold time learned from a peer. While working on this, found we didn't do inheritance correctly at all. This fix solves the first problem and also ensures that the timers are configured correctly based on this priority order - peer defined > peer-group defined > global config. This fix also displays the timers as "configured" regardless of which of the three locations above is used. Ticket: CM-18408 Signed-off-by: Don Slice <dslice@cumulusnetworks.com> Reviewed-by: CCR-6807 Testing-performed: Manual testing successful, fix tested by submitter, bgp-smoke completed successfully	2017-10-26 11:55:31 -04:00
Donald Sharp	9b9df9892d	bgpd: Treat empty reachable NLRI as a EOR This issue was discovered on a live session with an extremely old cisco 7206VXR router running 12.2(33)SRE4. The sending router is sending us an empty NLRI that is MP_REACH. From RFC exploration(thanks Russ!) it appears that this was considered a 'valid' way to send EOR. Following discussion decided that we should treat this situation as a EOR marker instead of bringing down the session. Applying this fix on the FRR router seeing this issue allows it to continue it's peering relationship with the ASR. Since this is a point fix I do not see a high likelihood of further fallout. Fixes: #1258 Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2017-10-26 07:31:17 -04:00
Renato Westphal	5c5255381e	lib/bgpd: introduce the iana_safi_t enum We had afi_t/iana_afi_t for AFIs but only safi_t for SAFIs. Fix this inconsistency. Signed-off-by: Renato Westphal <renato@opensourcerouting.org>	2017-07-31 23:44:42 -03:00
David Lamparter	9d303b37d7	Revert "*: reindent pt. 2" This reverts commit `c14777c6bf`. clang 5 is not widely available enough for people to indent with. This is particularly problematic when rebasing/adjusting branches. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-22 14:52:33 +02:00
Don Slice	e5f22b3036	bgpd: fix peer startup for labeled-unicast if linklocal address not found Problem found in testing where ipv6 labeled-unicast prefixes were not received on the peers if a "service networking restart" was issued. Same problem would happen with an ifdown/ifup on the link to the peer. Found the problem to be that peers would establish for labeled-unicast even if a link-local address was not yet available on the interface toward the peer, causing updates to be sent without a nexthop value. These were then rejected by the peer. Fix is to delay peer establishment until after the link-local addresses are available. Ticket: CM-16779 Signed-off-by: Don Slice <dslice@cumulusnetworks.com> Reviewed By: Donald Sharp <sharpd@cumulusnetworks.com> Testing Done: Manual testing successful. Bgp-smoke completed with no new failures	2017-07-18 13:09:34 +00:00
whitespace / reindent	c14777c6bf	: reindent pt. 2 w/ clang 5 reflow comments * struct members go 1 per line * binpack algo was adjusted	2017-07-17 15:26:02 -04:00
whitespace / reindent	d62a17aede	*: reindent indent.py `git ls-files \| pcregrep '\.[ch]$' \| pcregrep -v '^(ldpd\|babeld\|nhrpd)/'` Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-17 14:04:07 +02:00
Donald Sharp	aadc090505	bgpd: Refactor 'struct attr_extra' into 'struct attr' Most of the attributes in 'struct attr_extra' allow for the more interesting cases of using bgp. The extra overhead of managing it will induce errors as we add more attributes and the extra memory overhead is negligible on anything but full bgp feeds. Additionally this greatly simplifies the code for the handling of data. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com> bgpd: Fix missing label set Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2017-07-12 15:23:18 -04:00
David Lamparter	21bb7c8774	Merge commit '3d22338f04d9554fa' into evpn-prep Conflicts: lib/Makefile.am Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-10 22:15:19 +02:00
Daniel Walton	4fbf55e986	Merge branch 'master' of https://github.com/dwalton76/frr into bgpd-ipv4-plus-label-misc3	2017-06-26 17:24:44 +00:00
Quentin Young	56b4067930	*: simplify log message lookup log.c provides functionality for associating a constant (typically a protocol constant) with a string and finding the string given the constant. However this is highly delicate code that is extremely prone to stack overflows and off-by-one's due to requiring the developer to always remember to update the array size constant and to do so correctly which, as shown by example, is never a good idea.b The original goal of this code was to try to implement lookups in O(1) time without a linear search through the message array. Since this code is used 99% of the time for debugs, it's worth the 5-6 additional cmp's worst case if it means we avoid explitable bugs due to oversights... Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-06-21 15:22:21 +00:00
Daniel Walton	9bedbb1e52	bgpd: Install SAFI_LABELED_UNICAST routes in SAFI_UNICAST table Signed-off-by: Daniel Walton <dwalton@cumulusnetworks.com> - All ipv4 labeled-unicast routes are now installed in the ipv4 unicast table. This allows us to do things like take routes from an ipv4 unicast peer, allocate a label for them and TX them to a ipv4 labeled-unicast peer. We can do the opposite where we take routes from a labeled-unicast peer, remove the label and advertise them to an ipv4 unicast peer. - Multipath over a labeled route and non-labeled route is not allowed. - You cannot activate a peer for both 'ipv4 unicast' and 'ipv4 labeled-unicast' - The 'tag' variable was overloaded for zebra's route tag feature as well as the mpls label. I added a 'mpls_label_t mpls' variable to avoid this. This is much cleaner but resulted in touching a lot of code.	2017-06-16 19:12:57 +00:00
vivek	3d22338f04	bgpd: Fixes related to use of L2VPN/EVPN Add checks related to AFI_L2VPN/SAFI_EVPN that were missing in some parts of the code. Fix incorrect check skipping EVPN when sending End of RIB. Signed-off-by: Vivek Venkatraman <vivek@cumulusnetworks.com>	2017-05-25 10:20:04 -07:00
Lou Berger	1ec1afd6cb	bgpd: remove encap safi vty related files bgp_encap.h\|c Signed-off-by: Lou Berger <lberger@labn.net>	2017-05-23 15:58:50 -04:00
Lou Berger	eedae49501	bgpd: remove encap_safi rx processing Signed-off-by: Lou Berger <lberger@labn.net>	2017-05-23 15:58:50 -04:00

1 2 3 4 5 ...

390 Commits