mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-10-27 23:32:47 +00:00

Author	SHA1	Message	Date
Donatas Abraitis	a7d91a8c79	bgpd: Print hostname along with IP for most useful debug messages Examples: ``` %ADJCHANGE: neighbor 192.168.0.1(exit1-debian-11) in vrf default Up 192.168.0.1(exit1-debian-11) graceful restart stalepath timer expired 192.168.0.1(exit1-debian-11) sending route-refresh (BoRR) for IPv4/unicast 192.168.0.1(exit1-debian-11) graceful restart timer started for 120 sec 192.168.0.1(exit1-debian-11) graceful restart stalepath timer started for 120 sec 192.168.0.1(exit1-debian-11) graceful restart timer stopped %MAXPFXEXCEED: No. of IPv4 Unicast prefix received from 192.168.0.1(exit1-debian-11) 9 exceed, limit 1 ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-03-22 21:59:58 +02:00
Russ White	d2dfd26697	Merge pull request #10636 from ton31337/fix/use_get_set_for_communities bgpd: Reuse get/set helpers for attr->community	2022-02-28 09:52:50 -05:00
Donatas Abraitis	9a706b42fb	bgpd: Reuse get/set helpers for attr->community Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2022-02-25 10:02:30 +02:00
Donald Sharp	cc9f21da22	*: Change thread->func to return void instead of int The int return value is never used. Modify the code base to just return a void instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-02-23 19:56:04 -05:00
Donatas Abraitis	31afff83f1	bgpd: Print function name for `(dynamic neighbor) deleted` debug messages Just sometimes to properly understand where this is coming from. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2022-01-19 15:02:08 +02:00
Donatas Abraitis	af8496af08	bgpd: Do not delete BGP dynamic peers if graceful restart kicks in ``` ~# vtysh -c 'show bgp ipv4 unicast summary' \| grep 192.168.10.17 *donatas-pc(192.168.10.17) 4 65002 8 12 0 0 0 00:01:35 2 14 N/A ``` Before shutting down 192.168.10.17: ``` ~# vtysh -c 'show bgp ipv4 unicast 100.100.100.100/32' BGP routing table entry for 100.100.100.100/32, version 7 Paths: (2 available, best #2, table default) Advertised to non peer-group peers: home-spine1.donatas.net(192.168.0.2) 65002, (stale) 192.168.10.17 from donatas-pc(192.168.10.17) (0.0.0.0) Origin incomplete, valid, external Last update: Sat Jan 15 21:45:47 2022 65001 192.168.0.2 from home-spine1.donatas.net(192.168.0.2) (2.2.2.2) Origin incomplete, metric 0, valid, external, best (Older Path) Last update: Sat Jan 15 21:25:19 2022 ``` After 192.168.10.17 is down: ``` ~# vtysh -c 'show bgp ipv4 unicast summary' \| grep 192.168.10.17 donatas-pc(192.168.10.17) 4 65002 5 9 0 0 0 00:00:12 Active 0 N/A ~# vtysh -c 'show bgp ipv4 unicast 100.100.100.100/32' BGP routing table entry for 100.100.100.100/32, version 7 Paths: (2 available, best #2, table default) Advertised to non peer-group peers: home-spine1.donatas.net(192.168.0.2) 65002, (stale) 192.168.10.17 from donatas-pc(192.168.10.17) (0.0.0.0) Origin incomplete, valid, external Community: llgr-stale Last update: Sat Jan 15 21:49:01 2022 Time until Long-lived stale route deleted: 16 65001 192.168.0.2 from home-spine1.donatas.net(192.168.0.2) (2.2.2.2) Origin incomplete, metric 0, valid, external, best (First path received) Last update: Sat Jan 15 21:25:19 2022 ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2022-01-19 15:02:07 +02:00
Donatas Abraitis	df8d723c5f	*: Add FOREACH_AFI_SAFI_NSF(afi, safi) macro to reduce nesting Used for graceful-restart mostly. Especially for bgp_show_neighbor_graceful_restart_capability_per_afi_safi() Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2022-01-13 14:29:54 +02:00
Donatas Abraitis	1479ed2fb3	bgpd: Implement LLGR helper mode Tested between GoBGP and FRR (this commit). ``` ┌───────────┐ ┌────────────┐ │ │ │ │ │ GoBGPD │ │ FRRouting │ │ (restart) │ │ │ │ │ │ │ └──────┬────┘ └───────┬────┘ │ │ │ │ │ │ │ ┌───────────┐ │ │ │ │ │ │ │ │ │ └─────┤ FRRouting ├────────┘ │ (helper) │ │ │ └───────────┘ // GoBGPD % cat /etc/gobgp/config.toml [global.config] as = 65002 router-id = "2.2.2.2" port = 179 [[neighbors]] [neighbors.config] peer-as = 65001 neighbor-address = "2a02🔤:123" [neighbors.graceful-restart.config] enabled = true restart-time = 3 long-lived-enabled = true [[neighbors.afi-safis]] [neighbors.afi-safis.config] afi-safi-name = "ipv6-unicast" [neighbors.afi-safis.mp-graceful-restart.config] enabled = true [neighbors.afi-safis.long-lived-graceful-restart.config] enabled = true restart-time = 10 [[neighbors.afi-safis]] [neighbors.afi-safis.config] afi-safi-name = "ipv4-unicast" [neighbors.afi-safis.mp-graceful-restart.config] enabled = true [neighbors.afi-safis.long-lived-graceful-restart.config] enabled = true restart-time = 20 % ./gobgp global rib add -a ipv6 2001:db8:4::/64 % ./gobgp global rib add -a ipv6 2001:db8:5::/64 community 65535:7 % ./gobgp global rib add -a ipv4 100.100.100.100/32 % ./gobgp global rib add -a ipv4 100.100.100.200/32 community 65535:7 ``` 1. When killing GoBGPD, graceful restart timer starts in FRR helper router; 2. When GR timer expires in helper router: a) LLGR_STALE community is attached to routes to be retained; b) Clear stale routes that have NO_LLGR community attached; c) Start LLGR timer per AFI/SAFI; d) Recompute bestpath and reannounce routes to peers; d) When LLGR timer expires, clear all routes on particular AFI/SAFI. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-12-28 16:07:59 +02:00
Donatas Abraitis	22472feef8	bgpd: No need to test if a thread is running for BGP_TIMER_OFF Handles that inside the macro. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-12-21 10:57:07 +02:00
Donald Sharp	e36f61b507	*: Rename quagga_timestamp with frr_timestamp Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-11-11 14:41:27 -05:00
Donatas Abraitis	d9377cb626	Merge pull request #9557 from idryzhov/bgp-view-cleanup bgpd: cleanup special checks for views	2021-09-07 10:14:30 +03:00
Igor Ryzhov	2c1eba8e84	bgpd: cleanup special checks for views bgp->vrf_id is always VRF_DEFAULT for views. All these special checks are not necessary. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2021-09-03 17:36:40 +03:00
Donald Sharp	c5fe9095fe	bgpd: Add `PEER_DOWN_SOCKET_ERROR` to the list of peer failure modes BGP can experience a bunch of errors associated with sockets being manipulated which would prevent the peer from coming up. Let's add some additional debug information here so that our operators can do a bit more for themselves. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-09-03 07:56:35 -04:00
Prerana-GB	f852eb9833	bgpd: BGP knob to teardown session immediately when peer is unreachable When BGP is notified by RIB that peer address is unreachable then BGP session must be brought down immediately and not wait for the hold-timer expiry. Today single-hop EBGP already behaves this way but need to change for iBGP and multi-hop EBGP sessions. Signed-off-by: Prerana G.B <prerana@vmware.com>, Pushpasis Sarkar <spushpasis@vmware.com>	2021-08-24 12:23:38 +00:00
Russ White	04cfc0a3a8	Merge pull request #9056 from askorichenko/test-dont-capability bgpd: Clear capabilities field when resetting a bgp neighbor	2021-08-03 06:59:56 -04:00
Donatas Abraitis	90737805d9	Merge pull request #8956 from pguibert6WIND/bgp_loop_through_itself bgpd: prevent routes loop through itself	2021-07-21 09:28:21 +03:00
Alexander Skorichenko	24f569e9cc	bgpd: Clear capabilities field when resetting a bgp neighbor Currently, the following sequence of events between peers could result in erroneous capability reports on the peer with enabled dont-capability-negotiate option: - having some of the capabilities advertised to a bgp neighbor, - then disabling capability negotiation to that neighbor, - then resetting connection to it, - and no capabilities are actually sent to the neighbor, - but "show bgp neighbors" on the host still displays them as advertised to the neighbor. There are two possibilities for establishing a new connection - the established connection was initiated by us with bgp_start(), - the connection was initiated on the neighbor side and processed by us via bgp_accept() in bgp_network.c. The former case results in "show bgp neighbors" displaying only "received" in capabilities, as the peer's cap is initiated to zero in bgp_start(). In the latter case, if bgp_accept() happens before bgp_start() is called, then new peer capabilities are being transferred from its previous record before being zeroed in bgp_start(). This results in "show bgp neighbors" still displaying "advertised and received" in capabilities. Following the logic of a similar af_cap field clearing, treated correctly in both cases, we - reset peer's capability during bgp_stop() - don't pass it over to a new peer structure in bgp_accept(). This fix prevents transferring of the previous capabilities record to a new peer instance in arbitrary reconnect scenario. Signed-off-by: Alexander Skorichenko <askorichenko@netgate.com>	2021-07-14 16:43:37 -04:00
Philippe Guibert	654a5978f6	bgpd: prevent routes loop through itself Some BGP updates received by BGP invite local router to install a route through itself. The system will not do it, and the route should be considered as not valid at the earliest. This case is detected on the zebra, and this detection prevents from trying to install this route to the local system. However, the nexthop tracking mechanism is called, and acts as if the route was valid, which is not the case. By detecting in BGP that use case, we avoid installing the invalid routes. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2021-07-12 13:57:36 +02:00
prerana	3f98a750c2	bgpd: Inconsistency in Local BGP GR state. Problem: Sometimes the configured Local GR state is not reflected in show command and peer node. This is causing failures in few of the BGP-GR topotests. RCA: This problem is seen when the configuration of local GR state happens when the BGP session is in OpenSent state and moves to Established after the configuration is complete. When the session gets established, we move the GR state value from stub peer to the config peer. This will result in overriding the GR state to previous value. Fix: The local GR state is modified only through CLI configuration and does not change during BGP FSM transition. In this case it is not necessary to transfer the GR state value from stub peer to config peer. This way we can ensure that always the most recent config value is present in peer datastructure. Signed-off-by: Prerana-GB <prerana@vmware.com>	2021-07-09 00:20:15 -07:00
Donatas Abraitis	0db06e3785	bgpd: Set 4096 instead of 65535 as new max packet size for a new peer New peers should be initialized with a usual max packet size and later determined on OPEN messages. Testing with different peers supporting/not supporting extended support. 2021/07/02 13:48:00 BGP: [WEV7K-2GAQ5] u2:s2 send UPDATE len 8991 (max message len: 65535) numpfx 1788 2021/07/02 13:48:03 BGP: [WEV7K-2GAQ5] u3:s3 send UPDATE len 4096 (max message len: 4096) numpfx 809 2021/07/02 13:48:03 BGP: [WEV7K-2GAQ5] u3:s3 send UPDATE len 4096 (max message len: 4096) numpfx 809 Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-07-03 11:17:37 +03:00
Donald Sharp	feb1723846	bgpd: Convert to using peer_established(peer) function We are inconsistently using peer_establiahed(peer) with sometimes using `peer->status == Established`. Just Convert over to using the function for consistency. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-06-07 10:48:36 -04:00
Donald Sharp	53aabbe192	bgpd: Prevent race condition loss of config If we have a situation where BGP is partially reading in a config file for a neighbor, and the neighbor is coming up and we have a doppelganger. There exists a race condition when we transfer the config from the doppelganger to the config peer that we will overwrite later config because we are copying the config data from the doppelganger peer( which was captured at the start of initiation of the peering ). From what I can tell the peer->af_flags variable is to hold configuration flags for the local peer. The doppelganger should never overwrite this. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-04-23 23:53:51 -04:00
Donald Sharp	996319e63d	bgpd: Address LL peer not NHT when receiving connection attempt The new LL code in: `8761cd6ddb` Introduced the idea of the bgp unnumbered peers using interface up/down events to track the bgp peers nexthop. This code was not properly working when a connection was received from a peer in some circumstances. Effectively the connection from a peer was immediately skipping state transitions and FRR was never properly tracking the peers nexthop. When we receive the connection attempt, let's track the nexthop now. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-04-15 13:16:28 -04:00
Rafael Zalamena	21bfce9827	bgpd: rework BFD integration Remove old BFD API usage and replace it with the new one. Highlights: - More shared code: the daemon gets notified with callbacks instead of having to roll its own code to find the notified sessions. - Less code to integrate with BFD. - Remove hidden commands to configure single / multi hop. Use protocol data instead. BGP can determine if a peer is single/multi hop according to the following criteria: a. If the IP address is a link-local address (single hop) b. The network is shared with peer (single hop) c. BGP is configured for eBGP multi hop / TTL security (multi hop) - Respect the configuration hierarchy: a. Peer configuration take precendence over peer-group configuration. b. When peer group configuration is removed, reset peer BFD configurations to defaults (unless peer had specific configs). Example: neighbor foo peer-group neighbor foo bfd profile X neighbor 192.168.0.2 peer-group foo neighbor 192.168.0.2 bfd ! If peer-group is removed the profile configuration gets ! removed from peer 192.168.0.2, but BFD will still enabled ! because of the neighbor specific bfd configuration. Signed-off-by: Rafael Zalamena <rzalamena@opensourcerouting.org>	2021-03-23 12:40:10 -03:00
Donatas Abraitis	37916b2b11	Merge pull request #8121 from opensourcerouting/macro-cleanup *: require ISO C11 + semicolons after file-scope macros	2021-03-22 11:00:34 +02:00
Mark Stapp	e0d550dfea	bgpd: use add_event instead of add_timer with zero timeout Just use events in a few places where timers with zero timeout were being used. Signed-off-by: Mark Stapp <mjs@voltanet.io>	2021-03-17 16:10:13 -04:00
David Lamparter	8451921b70	*: require semicolon after DEFINE_HOOK & co. See previous commit. Signed-off-by: David Lamparter <equinox@diac24.net>	2021-03-17 06:18:17 +01:00
Donald Sharp	c0d72166ee	bgpd: Convert remaining string output to our internal types Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-03-09 19:50:42 -05:00
Donald Sharp	8761cd6ddb	bgpd: Switch LL nexthop tracking to be interface based bgp is currently registering v6 LL as nexthops to be tracked from zebra. This presents several problems. a) zebra does not properly track multiple prefixes that match the same route properly at this point in time. b) BGP was receiving nexthops that were just incorrect because of (a). c) When a nexthop changed that really didn't affect the v6 LL we were responding incorrectly because of this Modify the code such that bgp nexthop tracking notices that we are trying to register a v6 LL. When we do so, shortcut and watch interface up/down events for this v6 LL and do the work when an interface goes up / down for this type of tracking. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-02-17 08:14:45 -05:00
Donald Sharp	62e0464d73	bgpd: Remove #if 0 code Remove all dead #if 0 code from bgpd. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2021-01-28 13:57:49 -05:00
Pat Ruddy	4053e9520a	bgpd: make sure nh is valid for MPLS vpn routes If we are using a nexthop for a MPLS VPN route make sure the nexthop is over a labeled path. This new check mirrors the one in validate_paths (where routes are enabled when a nexthop becomes reachable). The check is introduced to the code path where routes are added and the nexthop is looked up. Signed-off-by: Pat Ruddy <pat@voltanet.io>	2021-01-27 13:56:45 +00:00
Donatas Abraitis	9af52ccf81	bgpd: Implement enhanced route refresh capability 16:40:49 BGP: 192.168.0.2: sending route-refresh (BoRR) for IPv4/unicast 16:40:51 BGP: 192.168.0.2: sending route-refresh (EoRR) for IPv4/unicast Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2021-01-05 20:19:41 +02:00
Donald Sharp	3742de8d68	bgpd: Use the header Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-12-17 16:42:33 -05:00
David Schweizer	6c537a18cf	bgpd: RFC 4271 DelayOpenTimer Changes implement the DelayOpenTimer functionality proposed in RFC 4271. Signed-off-by: David Schweizer <dschweizer@opensourcerouting.org>	2020-10-20 16:49:58 +02:00
Soman K S	a77e2f4bab	bgpd: Advertise FIB installed routes to bgp peers (Part 3) * Process FIB update in bgp_zebra_route_notify_owner() and call group_announce_route() if route is installed * When bgp update is received for a route which is not installed earlier (flag BGP_NODE_FIB_INSTALLED is not set) and suppress fib is enabled set the flag BGP_NODE_FIB_INSTALL_PENDING to indicate fib install is pending for the route. The route will be advertised when zebra send ZAPI_ROUTE_INSTALLED status. * The advertisement delay (BGP_DEFAULT_UPDATE_ADVERTISEMENT_TIME) is added to allow more routes to be sent in single update message. This is required since zebra sends route notify message for each route. The delay will be applied to update group timer which advertises routes to peers. Signed-off-by: kssoman <somanks@gmail.com>	2020-11-06 08:55:56 +05:30
Mark Stapp	5047884528	*: unify thread/event cancel macros Replace all lib/thread cancel macros, use thread_cancel() everywhere. Only the THREAD_OFF macro and thread_cancel() api are supported. Also adjust thread_cancel_async() to NULL caller's pointer (if present). Signed-off-by: Mark Stapp <mjs@voltanet.io>	2020-10-23 12:16:52 -04:00
Donald Sharp	b6c386bbbd	bgpd: Make the process_queue per bgp process We currently have a global process queue for handling route updates in bgp. This is fine, in general, except there are places and times where we plug the queue for no new work during certain peer states of bgp update delay. If we happen to be processing multiple bgp instances on startup why do we want to stop processing in vrf A when vrf B is in a bit of a pickle? Also this separation will allow us to start forward thinking about how to fully integrate pthreads into route processing in bgp. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2020-10-21 15:34:47 -04:00
Quentin Young	f8dcd38ddb	bgpd: rename bgp_fsm_event_update This function is poorly named; it's really used to allow the FSM to decide the next valid state based on whether a peer has valid / reachable nexthops as determined by NHT or BFD. Signed-off-by: Quentin Young <qlyoung@nvidia.com>	2020-09-17 12:45:37 -04:00
Donatas Abraitis	8336c896fd	bgpd: Add `neighbor <neigh> shutdown rtt` command This would be useful in cases with lots of peers and shutdown them automatically if RTT goes above the specified limit. A host with 512 or more IPv6 addresses has a higher latency due to ipv6_addr_label(). This method tries to pick the best candidate address fo outgoing connection and literally increases processing latency. ``` Samples: 28 of event 'cycles', Event count (approx.): 22131542 Children Self Command Shared Object Symbol + 100.00% 0.00% ping6 [kernel.kallsyms] [k] entry_SYSCALL_64_fastpath + 100.00% 0.00% ping6 [unknown] [.] 0x0df0ad0b8047022a + 100.00% 0.00% ping6 libc-2.17.so [.] __sendto_nocancel + 100.00% 0.00% ping6 [kernel.kallsyms] [k] sys_sendto + 100.00% 0.00% ping6 [kernel.kallsyms] [k] SYSC_sendto + 100.00% 0.00% ping6 [kernel.kallsyms] [k] sock_sendmsg + 100.00% 0.00% ping6 [kernel.kallsyms] [k] inet_sendmsg + 100.00% 0.00% ping6 [kernel.kallsyms] [k] rawv6_sendmsg + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ip6_dst_lookup_flow + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ip6_dst_lookup_tail + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ip6_route_get_saddr + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ipv6_dev_get_saddr + 100.00% 0.00% ping6 [kernel.kallsyms] [k] __ipv6_dev_get_saddr + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ipv6_get_saddr_eval + 100.00% 0.00% ping6 [kernel.kallsyms] [k] ipv6_addr_label + 100.00% 100.00% ping6 [kernel.kallsyms] [k] __ipv6_addr_label + 0.00% 0.00% ping6 [kernel.kallsyms] [k] schedule ``` This is how it works: ``` ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".estimatedRttInMsecs' 9 ~# tc qdisc add dev eth1 root netem delay 120ms ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".estimatedRttInMsecs' 89 ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".estimatedRttInMsecs' null ~# vtysh -c 'show bgp neigh 192.168.0.2 json' \| jq '."192.168.0.2".lastResetDueTo' "Admin. shutdown" ``` Warning message: bgpd[14807]: 192.168.0.2 shutdown due to high round-trip-time (200ms > 150ms) Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-09-07 22:30:19 +03:00
Donatas Abraitis	5266cab359	Merge pull request #7037 from volta-networks/fix_traps_bgp Fix bgpBackwardTransition traps	2020-09-05 08:28:19 +03:00
Donatas Abraitis	08194f561e	Merge pull request #6589 from NaveenThanikachalam/gr_fixes bgpd: GR fixes	2020-09-04 18:39:26 +03:00
Babis Chalios	05e68acc75	bgpd: fix invocation of bgpTrapBackwardTransition The bgpTrapBackwardTransition callback was being called only during bgp_stop and only under the condition that peer status was Established. The MIB defines that the event should be generated for every transition of the BGP FSM from a higher to a lower state. Signed-off-by: Babis Chalios <mail@bchalios.io>	2020-09-02 15:30:22 +02:00
Russ White	e3dcd431cd	Merge pull request #6938 from opensourcerouting/bgp-instance-shutdown bgpd: BGP instance administrative shutdown	2020-08-25 10:31:01 -04:00
Sarita Patra	6c4d8732e9	bgpd: Fix BGP session stuck in OpenConfirm state Issue: 1. Initially BGP start listening to socket. 2. Start timer expires and BGP tries to connect to peer and moved to Idle->connect (lets say peer datastructre X) 3. Connect for X succeeds and hence moved from idle ->connect with FD-x. 4. A incoming connection is accepted and a new peer datastructure Y is created with FD-y moves from idle->Active state. 5. Peer datastercture Y FD-y sends out OPEN and moves to Active->Opensent state. 6. Peer datastrcture Y FD-y receives OPEN and moved from Opensent-> Openconfirm state. 7. Meanwhile on peer datastrcture X FD-x sends out a OPEN message and moved from connect->Opensent. 8. For peer datastrcture Y FD-y keep alive is received and it is moved from OpenConfirm->Established. 9. In this case peer datastructure Y FD-y is a accepted connection so we try to copy all its parameter to peer datastructure X and delete Y. 10. During this process TCP connection for the accepted connection (FD-y) goes down and hence get remote address and port fails. 11. With this failure bgp_stop function for both peer datastrure X and peer datastructure Y is called. 12. By this time all the parameters include state for datastrcture for X and Y are exchanged. Peer Y FD-y when it entered this function had state OpenConfirm still which has been moved to peer datastrcture X. 13. In bgp_stop it will stop all the timers and take action only if peer is in established state. Now that peer datastrcture X and Y are not in established state (in this function) it will simply close all timers and close the socket and assigns socket for both the peer datastrcture to -1. 14. Peer datastrcture Y will be deleted as it is a datastrcture created due to accept of connection where as peer datastrcture X will be held as it is created with configuration. 15. Now peer datastrcture X now holds a state of OpenConfirm without any timers running. 16. With this any new incoming connection will never be able to establish as there is config connection X which is stuck in OpenConfirm. Fix: While transferring the peer datastructure Y FD-y (accepted connection) to the peer datastructure X, if TCP connection for FD-y goes down, then 1. Call fsm event bgp_stop for X (do cleanup with bgp_stop and move the state to Idle) and 2. Call fsm event bgp_stop for Y (do cleanup with bgp_stop and gets deleted since it is an accept connection). Signed-off-by: Sarita Patra <saritap@vmware.com>	2020-08-20 23:36:22 -07:00
Sarita Patra	4533dc6a4e	bgpd: Don't stop hold timer in OpenConfirm State Issue: 1. Initially BGP start listening to socket. 2. Start timer expires and BGP tries to connect to peer and moved to Idle->connect (lets say peer datastructre X) 3. Peer datastrcture Y FD-X receives OPEN and moved from Opensent-> Openconfirm state and start the hold timer. 4. In the OpenConfirm state, the hold timer is stopped. So peer X waits for Keepalive message from peer. If the Keepalive message is not received, then it will be in OpenConfirm state for indefinite time. 5. Due to this it neither close the existing connection nor it will accept any connection from peer. Fix: In the OpenConfirm state, don't stop the hold timer. 1. Upon receipt of a neighbor’s Keepalive, the state is moved to Established. 2. But If the hold timer expires, a stop event occurs, the state is moved to Idle. This is as per RFC. Signed-off-by: Sarita Patra <saritap@vmware.com>	2020-08-20 23:35:47 -07:00
David Schweizer	cb9196e77a	bgpd: bgp instance administrative shutdown. * Fixed integration in FSM and packet handling. * Added CLI "show" output, incl. JSON. * For review and testing only. Signed-off-by: David Schweizer <dschweizer@opensourcerouting.org>	2020-08-14 10:23:34 +02:00
David Schweizer	392721e8b9	bgpd: fsm legacy thread reset cleanup * Removed old timer thread resets, since this has been taken care of after execution of the threads by the thread_fetch function in lib/thread.c for quite some time now. Signed-off-by: David Schweizer <dschweizer@opensourcerouting.org>	2020-08-07 14:03:48 +02:00
Naveen Thanikachalam	77b34214ea	bgpd: GR fixes 1) When a session comes up for a peer and if the peer has not adverised the GR capabilities, BGP sends a request to Zebra to clear any stale routes that might exist from that peer. 2) When OPEN message is received from the peer, clear the previously advertised GR capability by the peer, if the lastest received OPEN message does not contain the GR capability. Signed-off-by: NaveenThanikachalam <nthanikachal@vmware.com>	2020-07-14 01:39:39 -07:00
David Lamparter	3efd0893d0	*: un-split strings across lines Remove mid-string line breaks, cf. workflow doc: .. [#tool_style_conflicts] For example, lines over 80 characters are allowed for text strings to make it possible to search the code for them: please see `Linux kernel style (breaking long lines and strings) <https://www.kernel.org/doc/html/v4.10/process/coding-style.html#breaking-long-lines-and-strings>`_ and `Issue #1794 <https://github.com/FRRouting/frr/issues/1794>`_. Scripted commit, idempotent to running: ``` python3 tools/stringmangle.py --unwrap `git ls-files \| egrep '\.[ch]$'` ``` Signed-off-by: David Lamparter <equinox@diac24.net>	2020-07-14 10:37:25 +02:00
Donald Sharp	d0874d195d	bgpd: Allow extending peer timeout in rare case Currently the I/O pthread handles incoming/outgoing data communication with all peers. There is no attempt at modifying the hold timers. It's sole goal is to read/write data to appropriate channels. All this data is handled as events on the master pthread in BGP. The problem is that if the master pthread is extremely busy then any packet read that would be treated as a keepalive event may happen after the hold timer pops, due to the way thread events are handled in lib/thread.c. In a last gap attempt, if we notice that we have incoming data to proceses on the input Queue, slightly delay the hold timer. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-06-15 10:35:50 -04:00
Quentin Young	fc746f1c01	*: manually remove some more sprintf Take care of some more complicated cases by hand Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2020-04-20 19:14:33 -04:00
Donatas Abraitis	3dc339cdc2	bgpd: Convert lots of int type functions to bool/void Some were converted to bool, where true/false status is needed. Converted to void only those, where the return status was only false or true. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-21 14:59:18 +02:00
Donald Sharp	8398b5d5d2	bgpd: Convert status defines to enum Convert some status defines for the fsm to an enum so that we cannot mix and match them in the future. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-03-20 18:07:13 -04:00
Donald Sharp	d1060698b4	bgpd: Convert #define of bgp fsm events to an enum In PR #6052 which fixes issue #5963 the bgp fsm events were confused with the bgp fsm status leading to a bug. Let's start separating those out so these types of failures cannot just easily occur. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-03-20 18:01:53 -04:00
Donatas Abraitis	b7eed4f5fd	Merge pull request #5992 from pguibert6WIND/bgp_bfd_reset_with_remote bgpd: reset bfd session when bgp comes up	2020-03-18 11:19:59 +02:00
Santosh P K	9a07d32e71	Merge pull request #5998 from donaldsharp/more_spelling More spelling	2020-03-16 23:46:53 +05:30
Donatas Abraitis	3893aeeea3	bgpd: Add subcodes for BGP Finite State Machine Error Implement https://tools.ietf.org/html/rfc6608 I used python scapy library to send a notification message in OpenSent state: ``` send(IP(dst="192.168.0.1")/TCP(sport=sp, dport=179, seq=rec.ack, ack=rec.seq + 1, flags=0x18)/BGPHeader(type=3)/BGPNotification(error_code=4, error_subcode=0)) ``` Logs from FRR: ``` %NOTIFICATION: sent to neighbor 192.168.0.2 5/1 (Neighbor Events Error/Receive Unexpected Message in OpenSent State) 0 bytes ``` Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-16 09:22:22 +02:00
Donald Sharp	2089dd80c0	bgpd: Fix spelling mistakes found by debian packaging Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-03-13 19:10:28 -04:00
Philippe Guibert	fe0c4ed7ba	bgpd: reset bfd session when bgp comes up This scenario has been seen against microtik virtual machine with bfd enabled. When remote microtik bgp reestablishes the bgp session after a bgp reset, the bgp establishment comes first, then bfd is initialising. The second point is true for microtik, but not for frrouting, as the frrouting, when receiving bfd down messages, is not at init state. Actually, bfd state is up, and sees the first bfd down packet from bfd as an issue. Consequently, the BGP session is cleared. The fix consists in resetting the BFD session, only if bfd status is considered as up, once BGP comes up. That permits to align state machines of both local and remote bfd. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-03-13 16:38:33 +01:00
Philippe Guibert	7b3ec88871	bgpd: upon reconfiguration or bgp exchange failure, stop bfd. When bgp is updated with local source, the bgp session is reset; bfd also must be reset. The bgp_stop() handler handles all kind of unexpected failures, so the placeholder to deregister from bfd should be ok, providing that when bgp establishes, a similar function in bgp will recreate bfd context. Note that the bfd session is not reset on one specific case, where BFD down event is the last reset. In that case, we must let BFD to monitor the link. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-03-12 13:42:48 +01:00
Philippe Guibert	bd540576af	bgpd: reset bfd session when bgp comes up This scenario has been seen against microtik virtual machine with bfd enabled. When remote microtik bgp reestablishes the bgp session after a bgp reset, the bgp establishment comes first, then bfd is initialising. The second point is true for microtik, but not for frrouting, as the frrouting, when receiving bfd down messages, is not at init state. Actually, bfd state is up, and sees the first bfd down packet from bfd as an issue. Consequently, the BGP session is cleared. The fix consists in resetting the BFD session, once BGP comes up. That permits to align state machines of both local and remote bfd. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-03-10 14:40:31 +01:00
Philippe Guibert	e7db872b81	bgpd: upon reconfiguration or bgp exchange failure, stop bfd. When bgp is updated with local source, the bgp session is reset; bfd also must be reset. The bgp_stop() handler handles all kind of unexpected failures, so the placeholder to deregister from bfd should be ok, providing that when bgp establishes, a similar function in bgp will recreate bfd context. Note that the bfd session is not reset on one specific case, where BFD down event is the last reset. In that case, we must let BFD to monitor the link. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2020-03-10 14:40:31 +01:00
Donatas Abraitis	15569c58f8	*: Replace __PRETTY_FUNCTION__/__FUNCTION__ to __func__ Just keep the code cool. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-03-05 20:23:23 +02:00
Donatas Abraitis	07d1e5d99d	bgpd: Show the real reason why the peer is failed If the peer was shutdown locally, it doesn't show up as admin. shutdown. Instead it's treated as "Waiting for peer OPEN". The same applies to when the peer reaches maximum-prefix count. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-02-14 23:23:52 +02:00
Russ White	8d61adbf07	Merge pull request #5785 from ton31337/fix/replace_gtsm_hops_to_readable_macros bgpd: Use readable macros for peer->gtsm_hops instead of literals	2020-02-11 10:40:35 -05:00
Donatas Abraitis	724935d5a2	Merge pull request #5789 from donaldsharp/bgp_ebgp_reason bgpd: Update failed reason to distinguish some NHT scenarios	2020-02-11 10:42:23 +02:00
Donald Sharp	1e91f1d119	bgpd: Update failed reason to distinguish some NHT scenarios Current failed reasons for bgp when you have a peer that is not online yet is `Waiting for NHT`, even if NHT has succeeded. Add some code to differentiate this. eva# show bgp ipv4 uni summ failed BGP router identifier 192.168.201.135, local AS number 3923 vrf-id 0 BGP table version 0 RIB entries 0, using 0 bytes of memory Peers 2, using 43 KiB of memory Neighbor EstdCnt DropCnt ResetTime Reason 192.168.44.1 0 0 never Waiting for NHT 192.168.201.139 0 0 never Waiting for Open to Succeed Total number of neighbors 2 eva# eva# show bgp nexthop Current BGP nexthop cache: 192.168.44.1 invalid, peer 192.168.44.1 Must be Connected Last update: Mon Feb 10 19:05:19 2020 192.168.201.139 valid [IGP metric 0], #paths 0, peer 192.168.201.139 So 192.168.201.139 is a peer for a connected route that has not been created on .139, while 44.1 nexthop tracking has not succeeded yet. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-02-10 19:46:48 -05:00
Donatas Abraitis	e2521429a6	bgpd: Use readable macros for peer->gtsm_hops instead of literals Do the same way like BGP_DEFAULT_TTL Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-02-10 16:23:09 +02:00
Donatas Abraitis	892fedb611	bgpd: Replace bgp_flag_* to [UN]SET/CHECK_FLAG macros Most of the code uses macros, thus let's keep the code unified. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-02-06 17:11:38 +02:00
Donatas Abraitis	975a328e2e	*: Replace s_addr 0 => INADDR_ANY Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2020-02-06 09:00:12 +02:00
Quentin Young	362353195a	bgpd, lib: fix style from BGP GR code This patch fixes the noncompliant style for the following commit range: `4a6e80fbf` `2ba1fe695` `efcb2ebbb` `8c48b3b69` `dc95985fe` `0f0444fbd` `85ef4179a` `eb451ee58` `2d3dd828d` `9e3b51a7f` `d6e3c15b6` `34aa74486` `6102cb7fe` `d7b3cda6f` `2bb5d39b1` `5f9c1aa29` `5cce3f054` `3a75afa4b` `f009ff269` `cfd47646b` `2986cac29` `055679e91` `034e185dc` `794b37d52` `b0965c44e` `949b0f24f` `63696f1d8` Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2020-02-04 15:19:04 -05:00
Donald Sharp	7318ae88de	bgpd: enums in switches do not need default If you have enums handled in a switch adding a default case makes it fun to fix when new stuff is added later. Remove. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-01-31 09:59:57 -05:00
Donald Sharp	13909c4fbc	bgpd: Cleanup some bad formating Some recent commits got some bad formating. Clean this up. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2020-01-31 09:59:57 -05:00
bisdhdh	4a6e80fbf2	bgpd: Added bgp graceful restart additional debug logs. bgp graceful restart additional debug logs, resolved merge conflicts. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:36:33 +05:30
bisdhdh	2ba1fe6951	bgpd: BGP Garaceful Restart debug logs. Reorganizing bgp gr debug logs and code review comments. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:36:33 +05:30
bisdhdh	8c48b3b696	bgpd: Adding bgp peer route processing and EOR state Signalling from BGPD to Zebra. * While the Deferral timer is running, signal route update pending (ZEBRA_CLIENT_ROUTE_UPDATE_PENDING) from BGPD to Zebra. * After expiry of the Deferral timer, the deferred routes are processed. When the deferred route_list becomes empty, End-of-Rib is send to the peer and route processing complete message (ZEBRA_CLIENT_ROUTE_UPDATE_COMPLETE) is sent to Zebra. So that Zebra would delete any stale routes still present in the rib. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:36:33 +05:30
bisdhdh	9e3b51a7f3	bgpd: Restarting node does not send EOR after the convergence. After a restarting router comes up and the bgp session is successfully established with the peer. If the restarting router doesn’t have any route to send, it send EOR to the peer immediately before receiving updates from its peers. Instead the restarting router should send EOR, if the selection deferral timer is not running OR count of eor received and eor required are matches then send EOR. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	d7b3cda6f7	bgpd: BGP tcp session failed to apply GR configuration on the transferred bgp tcp connection. When the BGP peer is configured between two bgp routes both routers would create peer structure , when they receive each other’s open message. In this event both speakers, open duplicate TCP sessions and send OPEN messages on each socket simultaneously, the BGP Identifier is used to resolve which socket should be closed. If BGP GR is enabled the old tcp session is dumped and the new session is retained. So while this transfer of connection is happening, if all the bgp gr config is not migrated to the new connection, the new bgp gr mode will never get applied. Fix Summary: 1. Replicate GR configuration from the old session to the new session in bgp_accept(). 2. Replicate GR configuration from stub to full-fledged peer in bgp_establish(). 3. Disable all NSF flags, clear stale routes (if present), stop restart & stale timers (if they are running) when the bgp GR mode is changed to “Disabled”. 4. Disable R-bit in cap, if it is not set the received open message. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	5cce3f0544	bgpd: Adding BGP GR change mode config apply on notification sent & received. * Changing GR mode on a router needs a session reset from the SAME router to negotiate new GR capability. * The present GR implementation needs a session reset after every new BGP GR mode change. * When BGP session reset happens due to sending or receiving BGP notification after changing BGP GR mode, there is no need of explicit session reset. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	f009ff2697	bgpd: Adding Selection Deferral Timer handler changes. * Selection Deferral Timer for Graceful Restart. * Added selection deferral timer handling function. * Route marking as selection defer when update message is received. * Staggered processing of routes which are pending best selection. * Fix for multi-path test case. Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	2986cac299	bgpd: Adding BGP GR Per Neighbor show commands. * Added new show command to show the graceful restart information for each neighbor. Cmd: show bgp [<ipv4\|ipv6>] neighbors [<A.B.C.D\|X:X::X:X\|WORD>] graceful-restart * Changes to show neighbors commands for displaying graceful restart information. Cmd :show [ip] bgp [<view\|vrf> VIEWVRFNAME] [<ipv4\|ipv6>] neighbors [<A.B.C.D\|X:X::X:X\| Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
bisdhdh	794b37d521	bgpd: Adding BGP GR Global & Per Neighbour FSM changes * Added FSM for peer and global configuration for graceful restart * Added debug option BGP_GRACEFUL_RESTART for logs specific to graceful restart processing Signed-off-by: Biswajit Sadhu <sadhub@vmware.com>	2020-01-23 09:34:25 +05:30
Donatas Abraitis	53b4aaeca0	bgpd: Send notification to the peer on FSM error We should send a NOTIFICATION message with the Error Code Finite State Machine Error if we receive NOTIFICATION in OpenSent state as defined in https://tools.ietf.org/html/rfc4271#section-8.2.2 Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2019-12-30 17:11:04 +02:00
Donatas Abraitis	e9613d32cc	Merge pull request #5429 from Spantik/bug_fix BGP: BGP assert when it tries to access peer which is closed.	2019-12-10 09:43:28 +02:00
Santosh P K	74e00a55c1	bgpd: BGP assert when it tries to access peer which is closed. Problem: BGP peer pointer is present in keepalive hash table even when socket has been closed in some race condition. When keepalive tries to access this peer it asserts. RCA: Below sequence of events causing assert. 1. Config node peer has went down due to TCP reset it's FD has been set to -1. 2. Doppelganger peer goes to established state and it has been added to peer hash table for keepalive when it was in openconfirm state. 3. Config node parameters including FD are exchanged with doppelganger. Doppelganger will not have FD -1. 4. Doppelganger will be deleted as part of this it will remove it from the keepalive peer hash table. 5. While removing from hash table it tries to acquire lock. 6. During this time keepalive thread has the lock and in a loop trying to send keepalive for peers in hash table. 7. It tries to send keepalive for doppelganger peer with fd set to -1 and asserts. Signed-off-by: Santosh P K <sapk@vmware.com>	2019-12-09 09:10:57 -08:00
David Lamparter	2b64873d24	*: generously apply const const const const your boat, merrily down the stream... Signed-off-by: David Lamparter <equinox@diac24.net>	2019-12-02 15:01:29 +01:00
Donatas Abraitis	c8d6f0d6c4	bgpd: Replace magic number 1 for TTL to BGP_DEFAULT_TTL For readability and maintainability purposes. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2019-11-27 10:48:17 +02:00
Donatas Abraitis	0e35025eb4	bgpd: Use BGP_NOTIFY_SUBCODE_UNSPECIFIC value for bgp_notify_send() where 0 Just a code cleanup to keep the code consistent. Signed-off-by: Donatas Abraitis <donatas.abraitis@gmail.com>	2019-11-10 17:54:37 +02:00
Lou Berger	ef5307f23f	Merge pull request #4861 from NaveenThanikachalam/logs BGP: Rectifying the log messages.	2019-09-17 11:33:43 -04:00
Naveen Thanikachalam	4cb5e18ba5	BGP: Rectifying the log messages. This change addresses the following: 1) Ensures logs under DEBUG macro checks are categorized as zlog_debug instead of zlog_info. 2) Error logs are categorized as zlog_err instead of zlog_info. 3) Rephrasing certain logs to make them appear more intuitive. Signed-off-by: NaveenThanikachalam <nthanikachal@vmware.com>	2019-09-09 22:59:22 -07:00
Quentin Young	1ce14168b3	Merge pull request #4809 from martonksz/master bgpd: hook for bgp peer status change events	2019-09-09 10:55:00 -04:00
Donald Sharp	11d443f591	Merge pull request #4925 from ddutt/master bgpd: Fixes to error message printed for failed peerings	2019-09-03 20:36:53 -04:00
Dinesh G Dutt	05912a17e6	bgpd: Fixes to error message printed for failed peerings There was a silly bug introduced when the command to show failed sessions was added. A missing "," caused the wrong error message to be printed. Debugging this led down a path that: - Led to discovering one more error message that needed to be added - Providing the error code along with the string in the JSON output to allow programs to key off numbers rather than strings. - Fixing the missing "," - Changing the error message to "Waiting for Peer IPv6 LLA" to make it clear that we're waiting for the link local addr. Signed-off-by: Dinesh G Dutt <5016467+ddutt@users.noreply.github.com>	2019-09-03 19:55:49 +00:00
David Lamparter	00dffa8cde	lib: add frr_with_mutex() block-wrapper frr_with_mutex(...) { ... } locks and automatically unlocks the listed mutex(es) when the block is exited. This adds a bit of safety against forgetting the unlock in error paths & co. and makes the code a slight bit more readable. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-09-03 17:15:17 +02:00
Dinesh G Dutt	3577f1c54f	bgpd: Add a new command to only show failed peerings In a data center, having 32-128 peers is not uncommon. In such a situation, to find a peer that has failed and why is several commands. This hinders both the automatability of failure detection and the ease/speed with which the reason can be found. To simplify this process of catching a failure and its cause quicker, this patch does the following: 1. Created a new function, bgp_show_failed_summary to display the failed summary output for JSON and vty 2. Created a new function to display the reset code/subcode. This is now used in the failed summary code and in the show neighbors code 3. Added a new variable failedPeers in all the JSON outputs, including the vanilla "show bgp summary" family. This lists the failed session count. 4. Display peer, dropped count, estd count, uptime and the reason for failure as the output of "show bgp summary failed" family of commands 5. Added three resset codes for the case where we're waiting for NHT, waiting for peer IPv6 addr, waiting for VRF to init. This also counts the case where only one peer has advertised an AFI/SAFI. The new command has the optional keyword "failed" added to the classical summary command. The changes affect only one existing output, that of "show [ip] bgp neighbors <nbr>". As we track the lack of NHT resolution for a peer or the lack of knowing a peer IPv6 addr, the output of that command will show a "waiting for NHT" etc. as the last reset reason. This patch includes update to the documentation too. Signed-off-by: Dinesh G Dutt <5016467+ddutt@users.noreply.github.com>	2019-09-02 14:21:44 +00:00
vivek	e2d3a90954	bgpd: Fix nexthop reg for IPv4 route exchange using GUA IPv6 peering In the case of IPv4 route exchange using GUA IPv6 peering, the route install into the FIB involves mapping the immediate next hop to an IPv4 link-local address and installing neighbor entries for this next hop address. To accomplish the latter, IPv6 Router Advertisements are exchanged (the next hop or peer must also have this enabled) and the RAs are dynamically initiated based on next hop resolution. However, in the case of a passive connection where the local system has not initiated anything, no NHT entry is created for the peer, hence RAs were not getting triggered. Address this by ensuring that a NHT entry is created even in this situation. This is done at the time the connection becomes established because the code has other assumptions that a NHT entry will be present only for the "configured" peer. The API to create the entry ensures there are no duplicates. Signed-off-by: Vivek Venkatraman <vivek@cumulusnetworks.com> Reviewed-by: Donald Sharp <sharpd@cumulusnetworks.com>	2019-08-18 22:12:06 -07:00
Marton Kun-Szabo	7d8d0eabb4	bgpd: hook for bgp peer status change events Generally available hook for plugging application-specific code in for bgp peer change events. This hook (peer_status_changed) replaces the previous, more specific 'peer_established' hook with a more general-purpose one. Also, 'bgp_dump_state' is now registered under this hook. Signed-off-by: Marton Kun-Szabo <martonk@amazon.com>	2019-08-13 11:59:27 -07:00
David Lamparter	584470fb5f	bgpd: add & use bgp packet dump hook The MRT dump code is already hooked in at the right places to write out packets; the BMP code needs exactly the same access so let's make this a hook. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2019-07-03 16:58:26 +02:00
Donald Sharp	1cfe005d0c	bgpd: Update an fsm debug message When debugging I was having a hard time correlating some data and noticed that a particular debug was not being very useful. Signed-off-by: Donald Sharp <sharpd@cumulusnstworks.com>	2019-05-28 18:10:26 -04:00
Philippe Guibert	b83a6e054c	bgpd: do not unregister bfd session when bgp session goes down This commit fixes a previous commit: "bfdd: remove operational bfd sessions from remote daemons" where the handling of unregister call triggers the deletion of bfd session. Actually, the BFD session should not be deleted, while bgp session is configured with BGP. this permits to receive BFD events up, and permit quicker reconnecion. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2019-05-14 16:50:01 +02:00
Philippe Guibert	fc04a6778e	bgpd: improve reconnection mechanism by cancelling connect timers if bfd comes back up, and a bgp reconnection is in progress, theorically it should be necessary to wait for the end of the reconnection process. however, since that reconnection process may take some time, update the fsm by cancelling the connect timer. This done, one just have to call the start timer. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2019-04-18 16:11:51 +02:00
Donald Sharp	dded74d578	bgpd: Don't prevent views from being able to connect Views are perfectly valid and should be allowed to connect. In a bgp instance scenario the vrf_id will always be UNKNOWN, so allow it. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2019-03-06 11:35:58 -05:00
root	36dc75886d	bgpd: Creating Loopback Interface Flaps BGPd (#2865 ) * The function bgp_router_id_zebra_bump() will check for active bgp peers before chenging the router ID. If there are established peers, router ID is not modified which prevents the flapping of established peer connection * Added field in bgp structure to store the count of established peers Signed-off-by: kssoman <somanks@vmware.com>	2018-11-19 04:35:32 -08:00
Don Slice	5742e42b98	bgpd: make name of default vrf/bgp instance consistent Problems were reported with the name of the default vrf and the default bgp instance being different, creating confusion. This fix changes both to "default" for consistency. Ticket: CM-21791 Signed-off-by: Don Slice <dslice@cumulusnetworks.com> Reviewed-by: CCR-7658 Testing: manual testing and automated tests before pushing	2018-10-31 06:20:37 -04:00
David Lamparter	0437e10517	*: spelchek Signed-off-by: David Lamparter <equinox@diac24.net>	2018-10-25 20:10:57 +02:00
Donald Sharp	19bd3dffc1	bgpd: Do a bit better job of tracking the bgp->peerhash When we add/remove peers we need to do a bit better job of tracking them in the bgp->peerhash. 1) When we have the doppelganger take over, make sure the winner is the one represented in the peerhash. 2) When creating the doppelganger, leave the current one in place instead of blindly replacing it. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-10-07 20:55:52 -04:00
Donald Sharp	9bf904cc8b	bgpd: Try to notice when configuration changes during startup During peer startup there exists the possibility that both locally and remote peers try to start communication at the same time. In addition it is possible for local configuration to change at the same time this is going on. When this happens try to notice that the remote peer may be in opensent or openconfirm and if so we need to restart the connection from both sides. Additionally try to write a bit of extra code in peer_xfer_conn to notice when this happens and to emit a error message to the end user about this happening so that it can be cleaned up. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-10-01 10:58:06 -04:00
Quentin Young	1c50c1c0d6	*: style for EC replacements Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 19:38:57 +00:00
Quentin Young	450971aa99	*: LIB_[ERR\|WARN] -> EC_LIB Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 19:34:28 +00:00
Quentin Young	e50f7cfdbd	bgpd: BGP_[WARN\|ERR] -> EC_BGP Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 18:51:04 +00:00
Quentin Young	09c866e34d	*: rename ferr_zlog -> flog_err_sys Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Quentin Young	af4c27286d	*: rename zlog_fer -> flog_err Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Donald Sharp	02705213b1	bgpd: Convert to using LIB_ERR_XXX where possible Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Don Slice	14454c9fdd	bgpd: implement zlog_ferr facility for enhance error messages in bgp Signed-off-by: Don Slice <dslice@cumulusnetworks.com<	2018-08-14 20:02:05 +00:00
Pascal Mathis	b90a8e13ee	bgpd: Implement group-overrides for peer timers This commit implements BGP peer-group overrides for the timer flags, which control the value of the hold, keepalive, advertisement-interval and connect connect timers. It was kept separated on purpose as the whole timer implementation is quite complex and merging this commit together with with the other flag implementations did not seem right. Basically three new peer flags were introduced, namely PEER_FLAG_ROUTEADV, PEER_FLAG_TIMER and PEER_FLAG_TIMER_CONNECT. The overrides work exactly the same way as they did before, but introducing these flags made a few conditionals simpler as they no longer had to compare internal data structures against eachother. Last but not least, the test suite has been adjusted accordingly to test the newly implemented flag overrides. Signed-off-by: Pascal Mathis <mail@pascalmathis.com>	2018-06-14 18:55:30 +02:00
Donald Sharp	c42eab4bf5	bgpd: Respect ability to reach nexthop if available When bgp is thinking about opening a connection to a peer, if we are connected to zebra, allow that to influence our decision to start the connection. Found Scenario: Both bgp and zebra are started up at the same time. Zebra is being used to create the connected route through which bgp will establish a peering relationship. The machine is a bit loaded due to other startup conditions and as such bgp gets to the connection stage here before zebra has installed the route. If bgp does not respect zebra data when it does have a connection then we will attempt to connect. The connect will fail because there is no route. At that time we will go into the connect timeout(2 minutes) and delay connection. What this does. If we have established a zebra connection and we do not have a clear path to the destination at this point do not allow the connection to proceed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-05-11 07:46:43 -04:00
Donald Sharp	54ff5e9b02	bgpd: Cleanup messages from getsockopt The handling of the return codes for getsockopt was slightly wrong. getsockopt returns -1 on error and errno is set. What to do with the return code at that point is dependent on what sockopt you are asking about. In this case status holds the error returned for SO_ERROR. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-05-11 07:34:24 -04:00
G. Paul Ziemba	960035b2d9	bgpd: nexthop tracking with labels for vrf-vpn leaking Routes that have labels must be sent via a nexthop that also has labels. This change notes whether any path in a nexthop update from zebra contains labels. If so, then the nexthop is valid for routes that have labels. If a nexthop update has no labeled paths, then any labeled routes referencing the nexthop are marked not valid. Add a route flag BGP_INFO_ANNC_NH_SELF that means "advertise myself as nexthop when announcing" so that we can track our notion of the nexthop without revealing it to peers. Signed-off-by: G. Paul Ziemba <paulz@labn.net>	2018-04-04 10:00:23 -07:00
Quentin Young	d7c0a89a3a	*: use C99 standard fixed-width integer types The following types are nonstandard: - u_char - u_short - u_int - u_long - u_int8_t - u_int16_t - u_int32_t Replace them with the C99 standard types: - uint8_t - unsigned short - unsigned int - unsigned long - uint8_t - uint16_t - uint32_t Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-03-27 15:13:34 -04:00
Donald Sharp	5410015a79	bgpd: peer->bgp must be non NULL We lock and set peer->bgp at peer creation and only remove it at deletion. Therefore these tests are not needed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-03-20 19:09:06 -04:00
Lou Berger	996c93142d	*: conform with COMMUNITY.md formatting rules, via 'make indent' Signed-off-by: Lou Berger <lberger@labn.net>	2018-03-06 14:04:32 -05:00
Philippe Guibert	f62abc7d65	bgpd: do not start BGP VRF peer connection, if VRF not unknown Upon starting a BGP VRF instance, the server socket is not created, because the VRF ID is not known, and then underlying VRF backend is not ready yet. Because of that, the peer connection attempt will not be started before. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2018-02-27 11:11:24 +01:00
Philippe Guibert	61cf4b3715	bgpd: bgp support for netns The change contained in this commit does the following: - discovery of vrf id from zebra daemon, and adaptation of bgp contexts with BGP. The list of network addresses contain a reference to the bgp context supporting the vrf. The bgp context contains a vrf pointer that gives information about the netns path in case the vrf is a netns path. Only some contexts are impacted, namely socket creation, and retrieval of local IP settings. ( this requires vrf identifier). Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2018-02-27 11:11:24 +01:00
Russ White	2ed7e4c3c3	Merge pull request #1591 from qlyoung/bgpd-ringbuf bgpd: use ring buffer for network input	2018-01-10 19:59:24 -05:00
Quentin Young	0112e9e0b9	bgpd: use atomic_* ops on _Atomic variables Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-01-09 15:40:48 -05:00
Quentin Young	74ffbfe6fe	bgpd: use ring buffer for network input The multithreading code has a comment that reads: "XXX: Heavy abuse of stream API. This needs a ring buffer." This patch makes the relevant code use a ring buffer. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-01-03 14:35:11 -05:00
Quentin Young	7a86aa5a0a	bgpd: schedule packet job after connection xfer During initial session establishment, bgpd performs a "connection transfer" to a new peer struct if the connection was initiated passively (i.e. by the remote peer). With the addition of buffered input and a reorganized packet processor, the following race condition manifests: 1. Remote peer initiates a connection. After exchanging OPEN messages, we send them a KEEPALIVE. They send us a KEEPALIVE followed by 10,000 UPDATE messages. The I/O thread pushes these onto our local peer's input buffer and schedules a packet processing job on the main thread. 2. The packet job runs and processes the KEEPALIVE, which completes the handshake on our end. As part of transferring to ESTABLISHED we transfer all peer state to a new struct, as mentioned. Upon returning from the KEEPALIVE processing routing, the peer context we had has now been destroyed. We notice this and stop processing. Meanwhile 10k UPDATE messages are sitting on the input buffer. 3. N seconds later, the remote peer sends us a KEEPALIVE. The I/O thread schedules another process job, which finds 10k UPDATEs waiting for it. Convergence is achieved, but has been delayed by the value of the KEEPALIVE timer. The racey part is that if the remote peer takes a little bit of time to send UPDATEs after KEEPALIVEs -- somewhere on the order of a few hundred milliseconds -- we complete the transfer successfully and the packet processing job is scheduled on the new peer upon arrival of the UPDATE messages. Yuck. The solution is to schedule a packet processing job on the new peer struct after transferring state. Lengthy commit message in case someone has to debug similar problems in the future... Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	7db44ec8fa	bgpd: transfer raw input buffer to new peer During initial session establishment, bgpd performs a "connection transfer" to a new peer struct if the connection was initiated passively (i.e. by the remote peer). With the addition of buffered input, I forgot to transfer the raw input buffer to the new peer. This resulted in infrequent failures during session handshaking whereby half of a packet would be thrown away in the middle of a read causing us to send a NOTIFY for an unsynchronized header. Usually the transfer coincided with a clean input buffer, hence why it only showed up once in a while.	2017-11-30 16:18:05 -05:00
Quentin Young	387f984e58	bgpd: fix bgp active open At some point when rearranging FSM code, bgpd lost the ability to perform active opens because it was only paying attention to POLLIN and not POLLOUT, when the latter is used to signify a successful connection in the active case. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	becedef6c3	bgpd, tests: comment formatting Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	bea0122657	bgpd: misc fsm fixes * Keepalive on/off calls are necessary in certain cases due to screwy fsm flow not turning them on after transferring a passive peer connection in peer_xfer_conn * Missed a case bgp_event_update() that resulted in a return code of -1 instead of BGP_Stop, which confuses the packet processing routine Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:02 -05:00
Quentin Young	d815168795	bgpd: fix bgp_packet.c / bgp_fsm.c organization Despaghettification of bgp_packet.c and bgp_fsm.c Sometimes we call bgp_event_update() inline packet parsing. Sometimes we post events instead. Sometimes we increment packet counters in the FSM. Sometimes we do it in packet routines. Sometimes we update EOR's in FSM. Sometimes we do it in packet routines. Fix the madness. bgp_process_packet() is now the centralized place to: - Update message counters - Execute FSM events in response to incoming packets FSM events are now executed directly from this function instead of being queued on the thread_master. This is to ensure that the FSM contains the proper state after each packet is parsed. Otherwise there could be race conditions where two packets are parsed in succession without the appropriate FSM update in between, leading to session closure due to receiving inappropriate messages for the current FSM state. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:02 -05:00
Quentin Young	a9794991c7	bgpd: bye bye THREAD_BACKGROUND Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:01 -05:00
Quentin Young	9eb217ff69	bgpd: batched i/o Instead of reading a packet header and the rest of the packet in two separate i/o cycles, instead read a chunk of data at one time and then parse as many packets as possible out of the chunk. Also changes bgp_packet.c to batch process packets. To avoid thrashing on useless mutex locks, the scheduling call for bgp_process_packet has been changed to always succeed at the cost of no longer being cancel-able. In this case this is acceptable; following the pattern of other event-based callbacks, an additional check in bgp_process_packet to ignore stray events is sufficient. Before deleting the peer all events are cleared which provides the requisite ordering. XXX: chunk hardcoded to 5, should use something similar to wpkt_quanta Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:00 -05:00
Quentin Young	b72b6f4fc9	bgpd: rename peer_keepalives* --> bgp_keepalives* Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	424ab01d0f	bgpd: implement buffered reads * Move and modify all network input related code to bgp_io.c * Add a real input buffer to `struct peer` * Move connection initialization to its own thread.c task instead of piggybacking off of bgp_read() * Tons of little fixups Primary changes are in bgp_packet.[ch], bgp_io.[ch], bgp_fsm.[ch]. Changes made elsewhere are almost exclusively refactoring peer->ibuf to peer->curr since peer->ibuf is now the true FIFO packet input buffer while peer->curr represents the packet currently being processed by the main pthread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	56257a44e4	bgpd: move bgp i/o to a separate source file After implement threading, bgp_packet.c was serving the double purpose of consolidating packet parsing functionality and handling actual I/O operations. This is somewhat messy and difficult to understand. I've thus moved all code and data structures for handling threaded packet writes to bgp_io.[ch]. Although bgp_io.[ch] only handles writes at the moment to keep the noise on this commit series down, for organization purposes, it's probably best to move bgp_read() and its trappings into here as well and restructure that code so that read()'s happen in the pthread and packet processing happens on the main thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	dc1188bb4d	bgpd: correctly schedule select() at session startup On TCP connection failure during session setup, bgp_stop() checks whether peer->t_read is non-null to know whether or not to unschedule select() on peer->fd before calling close() on it. Using the API exposed by thread.c instead of bgpd's wrapper macro BGP_READ_ON() results in this thread value never being set, which causes bgp_stop() to skip the cancellation of select() before calling close(). Subsequent calls to select() on that fd crash the daemon. Use the macro instead. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	727c4f870b	bgpd: transfer packets from peer stub to actual peer During transition from OpenConfirm -> Established, we wipe the peer stub's output buffer. Because thread.c prioritizes I/O operations over regular background threads and events, in a single threaded environment this ordering meant that the output buffer would be happily empty at wipe time. In MT-land, this convenient coincidence is no longer true; thus we need to make sure that any packets remaining on the peer stub get transferred over to the peer proper. Also removes misleading comment indicating that bgp_establish() sends a keepalive packet. It does not. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	03014d48f4	bgpd: put BGP keepalives in a pthread This patch, in tandem with moving packet writes into a dedicated kernel thread, fixes session flaps caused by long-running internal operations starving the (old) userspace write thread. BGP keepalives are now produced by a kernel thread and placed onto the peer's output queue. These are then consumed by the write thread. Both of these tasks are concurrent with the rest of bgpd, obviating the session flaps described above. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	07a1652682	bgpd: move bgp_connect_check() to bgp_fsm.c Prior to this change, after initiating a nonblocking connection to the remote peer bgpd would call both BGP_READ_ON and BGP_WRITE_ON on the peer's socket. This resulted in a call to select(), so that when some event (either a connection success or failure) occurred on the socket, one of bgp_read() or bgp_write() would run. At the beginning of each of those functions was a hook into bgp_connect_check(), which checked the socket status and issued the correct connection event onto the BGP FSM. This code is better suited for bgp_fsm.c. Placing it there avoids scheduling packet reads or writes when we don't know if the socket has established a connection yet, and the specific functionality is a better fit for the responsibility scope of this unit. This change also helps isolate the responsibilities of the packet-writing kernel thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	d3ecc69e5f	bgpd: move packet writes into dedicated pthread * BGP_WRITE_ON() removed * BGP_WRITE_OFF() removed * peer_writes_on() added * peer_writes_off() added * bgp_write_proceed_actions() removed Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	05c7a1cc93	bgpd: use FOREACH_AFI_SAFI where possible Improves consistency and readability. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-21 13:02:06 -05:00
Don Slice	d25e4efc52	bgpd: fix various problems with hold/keepalive timers Problem reported that we weren't adjusting the keepalive timer correctly when we negotiated a lower hold time learned from a peer. While working on this, found we didn't do inheritance correctly at all. This fix solves the first problem and also ensures that the timers are configured correctly based on this priority order - peer defined > peer-group defined > global config. This fix also displays the timers as "configured" regardless of which of the three locations above is used. Ticket: CM-18408 Signed-off-by: Don Slice <dslice@cumulusnetworks.com> Reviewed-by: CCR-6807 Testing-performed: Manual testing successful, fix tested by submitter, bgp-smoke completed successfully	2017-10-26 11:55:31 -04:00
Renato Westphal	a08ca0a7e1	lib: remove SAFI_RESERVED_4 and SAFI_RESERVED_5 SAFI values have been a major source of confusion over the last few years. That's because each SAFI needs to be represented in two different ways: * IANA's value used to send/receive packets over the network; * Internal value used for array indexing. In the second case, defining reserved values makes no sense because we don't want to index SAFIs that simply don't exist. The sole purpose of the internal SAFI values is to remove the gaps we have among the IANA values, which would represent wasted memory in C arrays. With that said, remove these reserved SAFIs to avoid further confusion in the future. Signed-off-by: Renato Westphal <renato@opensourcerouting.org>	2017-07-31 23:38:38 -03:00
David Lamparter	9d303b37d7	Revert "*: reindent pt. 2" This reverts commit `c14777c6bf`. clang 5 is not widely available enough for people to indent with. This is particularly problematic when rebasing/adjusting branches. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-22 14:52:33 +02:00
whitespace / reindent	c14777c6bf	: reindent pt. 2 w/ clang 5 reflow comments * struct members go 1 per line * binpack algo was adjusted	2017-07-17 15:26:02 -04:00
whitespace / reindent	d62a17aede	*: reindent indent.py `git ls-files \| pcregrep '\.[ch]$' \| pcregrep -v '^(ldpd\|babeld\|nhrpd)/'` Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-17 14:04:07 +02:00
David Lamparter	acd738fc7f	*: fix GCC 7 switch/case fallthrough warnings Need a comment on these. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-14 16:59:43 +02:00
Quentin Young	56b4067930	*: simplify log message lookup log.c provides functionality for associating a constant (typically a protocol constant) with a string and finding the string given the constant. However this is highly delicate code that is extremely prone to stack overflows and off-by-one's due to requiring the developer to always remember to update the array size constant and to do so correctly which, as shown by example, is never a good idea.b The original goal of this code was to try to implement lookups in O(1) time without a linear search through the message array. Since this code is used 99% of the time for debugs, it's worth the 5-6 additional cmp's worst case if it means we avoid explitable bugs due to oversights... Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-06-21 15:22:21 +00:00

1 2 3 4 5 ...

350 Commits