mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-05-28 00:27:07 +00:00

Author	SHA1	Message	Date
Donald Sharp	5160672d99	bgpd: Prevent use after free When bgp_stop finishes and it deletes the peer it is sending back a return code stating that the peer was deleted, but the code was operating like it was not deleted and continued to access the data structure. Fix. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-25 10:43:56 -04:00
Donald Sharp	d4a9b103b7	bgpd: bgp_event_update switch to a switch The return code from a event handling perspective is an enum. Let's intentionally make it a switch so that all cases are ensured to be covered now and in the future. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-25 10:28:02 -04:00
Donald Sharp	8dd97a7404	bgpd: bgp_event_update mixes enum's with a non-enum Straighten out the code to not mix the two. Especially since bgp was assigning non enum values to the enum. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-25 10:03:14 -04:00
Donald Sharp	419c5b4ef0	bgpd: Cleanup bgp_start declarations Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	26ad36e097	bgpd: Convert FSM to use `struct peer_connection` The BGP FSM was using the peer as the unit of work but the FSM is connection focused. So let's switch it over to using that. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	3e5a31b24e	bgpd: Convert `struct peer_connection` to dynamically allocated As part of the conversion to a `struct peer_connection` it will be desirable to have 2 pointers one for when we open a connection and one for when we receive a connection. Start this actual conversion over to this in `struct peer`. If this sounds confusing take a look at the bgp state machine for connections and how it resolves the processing of this router opening -vs- this router receiving an open. At some point in time the state machine decides that we are keeping one of the two connections. Future commits will allow us to untangle the peer/doppelganger duality with this abstraction. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	5d52756735	bgpd: Move t_process_packet and t_process_packet_error to connection The t_process_packet thread events should be managed by the connection. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	e20c23fa5b	bgpd: Move status and ostatus to `struct peer_connection` The status and ostatus are a function of the `struct peer_connection` move it into that data structure. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	71d72c4998	bgpd: READ and WRITE flags are a part of the connection Move PEER_THREAD_WRITES_ON and PEER_THREAD_READS_ON to be a part of the `struct peer_connection` since this is a connection oriented bit of data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	c528b3b153	bgpd: Move t_write and t_read into `struct peer_connection` Move the peer->t_write and peer->t_read into `struct peer_connection` as that these are properties of the connection. Signed-off-by: Donald Sharp <sharpd@nvidia.com> P# Please enter the commit message for your changes. Lines starting	2023-08-18 09:29:04 -04:00
Donald Sharp	ccb51e8266	bgpd: Convert bgp_io.c to take `struct peer_connection` bgp_io.c is clearly connection oriented so let's convert it over to using `struct peer_connection` Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	1f32eb30d9	bgpd: Start abstraction of `struct peer_connection` BGP tracks connections based upon the peer. But the problem with this is that the doppelganger structure for it is being created. This has introduced a bunch of fragileness in that the peer exists independently of the connections to it. The whole point of the doppelganger structure was to allow BGP to both accept and initiate tcp connections and then when we get one to a `good` state we collapse into the appropriate one. The problem with this is that having 2 peer structures for this creates a situation where we have to make sure we are configing the `right` one and also make sure that we collapse the two independent peer structures into 1 acting peer. This makes no sense let's abstract out the peer into having 2 connection one for incoming connections and one for outgoing connections then we can easily collapse down without having to do crazy stuff. In addition people adding new features don't need to have to go touch a million places in the code. This is the start of this abstraction. In this commit we'll just pull out the fd and input/output buffers into a connection data structure. Future commits will abstract further. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-08-18 09:29:04 -04:00
Donald Sharp	acf4defcd8	bgpd: Remove peer->obuf_work This is never used. Free up another 65k of stream data never used per peer. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-07-21 12:30:20 -04:00
Donatas Abraitis	c76f6146ab	bgpd: Deprecate Prestandard Outbound Route Filtering capability https://www.rfc-editor.org/rfc/rfc8810.html Not relevant anymore. Use RFC'd version of ORF. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-07-07 23:41:43 +03:00
Donatas Abraitis	58a92cb810	bgpd: Use enum bgp_fsm_state_progress for bgp_stop() ``` bgpd/bgp_fsm.c:1360:29: warning: conflicting types for ‘bgp_stop’ due to enum/integer mismatch; have ‘enum bgp_fsm_state_progress(struct peer )’ [-Wenum-int-mismatch] 1360 \| enum bgp_fsm_state_progress bgp_stop(struct peer peer) \| ^~~~~~~~ In file included from bgpd/bgp_fsm.c:29: ./bgpd/bgp_fsm.h:111:12: note: previous declaration of ‘bgp_stop’ with type ‘int(struct peer )’ 111 \| extern int bgp_stop(struct peer peer); \| ^~~~~~~~ ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-06-13 16:01:40 +03:00
Donald Sharp	907234817c	bgpd: Give more data when state machine fails to change state When a state machine transition fails, bgpd would output data about what happened, but not necessarily give the reason why. Add that data to the output. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-06-02 11:02:54 -04:00
Donald Sharp	24a58196dd	*: Convert event.h to frrevent.h We should probably prevent any type of namespace collision with something else. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-24 08:32:17 -04:00
Donald Sharp	e16d030c65	*: Convert THREAD_XXX macros to EVENT_XXX macros Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-24 08:32:17 -04:00
Donald Sharp	4f830a0799	*: Convert thread_timer_remain_XXX to event_timer_remain_XXX Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-24 08:32:17 -04:00
Donald Sharp	907a2395f4	*: Convert thread_add_XXX functions to event_add_XXX Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-24 08:32:17 -04:00
Donald Sharp	e6685141aa	*: Rename `struct thread` to `struct event` Effectively a massive search and replace of `struct thread` to `struct event`. Using the term `thread` gives people the thought that this event system is a pthread when it is not Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-24 08:32:17 -04:00
Donald Sharp	cb37cb336a	*: Rename thread.[ch] to event.[ch] This is a first in a series of commits, whose goal is to rename the thread system in FRR to an event system. There is a continual problem where people are confusing `struct thread` with a true pthread. In reality, our entire thread.c is an event system. In this commit rename the thread.[ch] files to event.[ch]. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-24 08:32:16 -04:00
Donald Sharp	aa554d4b65	bgpd: Always restart timer from scratch in OpenConfirm/Established Imagine this scenario: A peer has very large hold/keepalive timers of 600/200. This peer is using the DataCenter default time. As such the open will cause the t_holdtime to be negotiated to 600 seconds. Now also imagine that both peers are in update-delay. If we do not restart the timers and both peers are in Update Delay, we will continously reset the peer because the hold time will be hit( since the peer is not sending us any data ). Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-16 19:23:38 -04:00
David Lamparter	4b216f58dc	bgpd: fix NULL argument warning gcc 12.2.0 complains `error: ‘%s’ directive argument is null`, even though all enum values are covered with a string. Let's just go with a `???` default. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2023-03-16 11:00:02 +01:00
Donald Sharp	115ccb9acf	lib, bgpd: Add more debugs to GR Capability exchange a) Make it legible what type of message is being passed back and forth instead of having to guess it from the insufficient debugs b) Make it explicit which bgp instance is sending this data c) Cleanup bgp_zebra_update to have a cleaner api Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-03-09 08:36:51 -05:00
Donald Sharp	8383d53e43	Merge pull request #12780 from opensourcerouting/spdx-license-id *: convert to SPDX License identifiers	2023-02-17 09:43:05 -05:00
Donatas Abraitis	234f6fd4f4	bgpd: Add BGP Software Version Capability Implement: https://datatracker.ietf.org/doc/html/draft-abraitis-bgp-version-capability Tested with GoBGP: ``` % ./gobgp neighbor 192.168.10.124 BGP neighbor is 192.168.10.124, remote AS 65001 BGP version 4, remote router ID 200.200.200.202 BGP state = ESTABLISHED, up for 00:01:49 BGP OutQ = 0, Flops = 0 Hold time is 3, keepalive interval is 1 seconds Configured hold time is 90, keepalive interval is 30 seconds Neighbor capabilities: multiprotocol: ipv4-unicast: advertised and received ipv6-unicast: advertised route-refresh: advertised and received extended-nexthop: advertised Local: nlri: ipv4-unicast, nexthop: ipv6 UnknownCapability(6): received UnknownCapability(9): received graceful-restart: advertised and received Local: restart time 10 sec ipv6-unicast ipv4-unicast Remote: restart time 120 sec, notification flag set ipv4-unicast, forward flag set 4-octet-as: advertised and received add-path: received Remote: ipv4-unicast: receive enhanced-route-refresh: received long-lived-graceful-restart: advertised and received Local: ipv6-unicast, restart time 10 sec ipv4-unicast, restart time 20 sec Remote: ipv4-unicast, restart time 0 sec, forward flag set fqdn: advertised and received Local: name: donatas-pc, domain: Remote: name: spine1-debian-11, domain: software-version: advertised and received Local: GoBGP/3.10.0 Remote: FRRouting/8.5-dev-MyOwnFRRVersion-gdc92f44a45-dirt cisco-route-refresh: received Message statistics: ``` FRR side: ``` root@spine1-debian-11:~# vtysh -c 'show bgp neighbor 192.168.10.17 json' \| \ > jq '."192.168.10.17".neighborCapabilities.softwareVersion.receivedSoftwareVersion' "GoBGP/3.10.0" root@spine1-debian-11:~# ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-02-15 23:14:48 +02:00
David Lamparter	acddc0ed3c	*: auto-convert to SPDX License IDs Done with a combination of regex'ing and banging my head against a wall. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2023-02-09 14:09:11 +01:00
Donald Sharp	58cf0823bf	bgpd: Add missing enum's to case statement Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2023-01-31 12:29:08 -05:00
Donatas Abraitis	d188b08f30	bgpd: Vanish FQDN capability hostname/domainname before handling new BGP OPEN Before this, if the peer disables sending FQDN capability, the old hostname still (STALE) exists and is misleading in the outputs of `show bgp ...`. Especially when using with `bgp default show-hostname`, etc. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-01-30 23:34:41 +02:00
Donatas Abraitis	cfd01fc0ac	Revert "bgpd: optimal router reflection cli and fsm changes" This reverts commit `70cd87ca02`. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2023-01-17 18:15:28 +02:00
Donald Sharp	881979e427	bgpd: use the enum instead of an int The bgp_fsm_change_status function takes an int for the new bgp state, which is an `enum bgp_fsm_status status` let's convert over to being explicit.bgpd: use the enum instead of an int Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-12-15 07:31:55 -05:00
Donald Sharp	4da144f3d8	bgpd: Change fsm to use an enum for passing state The BGP fsm uses return codes to pass event success/fail as well as some extra data to the bgp_event_update function. Convert this to use a enum instead of an int to track the changes. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-12-05 10:10:36 -05:00
Donald Sharp	56b7d690bb	bgpd: Hash release before we change the underlying hash assumptions The bgp->peerhash is made up of the sockunion and the CONFIG_NODE flag. If the CONFIG_NODE flag is moved around or changed then we get into a situation where both the doppelganger and the peer actually hash to the exact same thing. Leading to wrongful deletion and pointers being used after freed. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-12-05 09:11:22 -05:00
Donald Sharp	69ef3f3136	bgpd: Fix several use after free's in bgp for the peer Three fixes: a) When calling bgp_fsm_change_status with `Deleted` do not add a new event to the peer's t_event because we are already in the process of deleting everything b) When bgp_stop decides to delete a peer return a notification that it is happening to bgp_event_update so that it does not set the peer state back to idle or do other processing. c) bgp_event_update can cause a peer deletion, because the peer can be deleted in the fsm function but the peer data structure is still pointed to and used after words. So lock the peer before entering and prevent a use after free. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-12-05 09:11:21 -05:00
vivek	71d276c88a	bgpd: Display FD in peer FSM state change log Signed-off-by: Vivek Venkatraman <vivek@nvidia.com>	2022-11-20 22:45:37 -05:00
Donatas Abraitis	5597214ccb	bgpd: Show the reason when the session is killed due to RTT Simulated latency with: ``` tc qdisc add dev eth3 root netem delay 100ms ``` ``` donatas-laptop# sh ip bgp summary failed IPv4 Unicast Summary (VRF default): BGP router identifier 192.0.2.252, local AS number 65000 vrf-id 0 BGP table version 28 RIB entries 0, using 0 bytes of memory Peers 1, using 724 KiB of memory Neighbor EstdCnt DropCnt ResetTime Reason 192.168.10.65 2 2 00:00:17 Admin. shutdown (RTT) Displayed neighbors 1 Total number of neighbors 1 donatas-laptop# ``` Another end received: ``` %NOTIFICATION: received from neighbor 192.168.10.17 6/2 (Cease/Administrative Shutdown) "shutdown due to high round-trip-time (104ms > 5ms, hit 21 times)" ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-11-04 15:56:23 +02:00
Donald Sharp	adf552ab6b	bgpd: Limit snmp trap for backwards state movement from established Currently the bgp mib specifies two traps: a) Into established state b) transition backwards from a state b) really is an interesting case. It means transitioning from say established to starting over. It can also mean when bgp is trying to connect and that fails and the state transitions backwards. Now let's imagine 500 peers with tight timers (say a data center) and there is network trauma you have just created an inordinately large number of traps for each peer. Let's limit FRR to changing from the old status as Established to something else. This will greatly limit the trap but it will also be something end operators are actually interested in. I actually had several operators say they had to write special code to ignore all the backward state transitions that they didn't care about. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-11-03 20:39:39 -04:00
Russ White	f4bf8f1494	Merge pull request #12159 from opensourcerouting/fix/conditional_advertisement_track_updates bgpd: Notify BGP conditional advertisement thread when the peer goes down	2022-11-01 12:24:51 -04:00
Donald Sharp	6029afbc10	bgpd: Clarify what NHT error message means When waiting on a path to reach the peer, modify the debug/show output to give a better understanding to the operator about what they should be looking for. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-10-27 09:21:41 -04:00
Donatas Abraitis	0775362306	bgpd: Notify BGP conditional advertisement thread when the peer goes down Also, make sure we check if the advertisement table changed using FROM peer, not TO peer. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-10-20 15:21:47 +03:00
Madhuri Kuruganti	70cd87ca02	bgpd: optimal router reflection cli and fsm changes Signed-off-by: Madhuri Kuruganti <maduri111@gmail.com>	2022-10-12 13:43:55 +05:30
Samanvitha B Bhargav	b96b4f1c5f	bgpd: Cleanup memory leaks associated with t_deferral_timer We are allocating temporary memory for information about what to process in this thread, which is not being cleaned up on thread cancelling. Signed-off-by: Samanvitha B Bhargav <bsmanvitha@vmware.com>	2022-09-23 00:14:12 -07:00
Donatas Abraitis	61adcf71be	bgpd: Fix memory leak for `conf_copy()` - SoO ecommunity ==1179738== 48 (40 direct, 8 indirect) bytes in 1 blocks are definitely lost in loss record 13 of 29 ==1179738== at 0x483AB65: calloc (vg_replace_malloc.c:760) ==1179738== by 0x493C8D5: qcalloc (memory.c:116) ==1179738== by 0x208F0C: ecommunity_dup (bgp_ecommunity.c:267) ==1179738== by 0x2B300C: conf_copy (bgp_updgrp.c:170) ==1179738== by 0x2B35BF: peer2_updgrp_copy (bgp_updgrp.c:277) ==1179738== by 0x2B5189: update_group_find (bgp_updgrp.c:826) ==1179738== by 0x2B70D0: update_group_adjust_peer (bgp_updgrp.c:1769) ==1179738== by 0x23DB7D: update_group_adjust_peer_afs (bgp_updgrp.h:519) ==1179738== by 0x243B21: bgp_establish (bgp_fsm.c:2129) ==1179738== by 0x244B94: bgp_event_update (bgp_fsm.c:2597) ==1179738== by 0x26B0E6: bgp_process_packet (bgp_packet.c:2895) ==1179738== by 0x498F5FD: thread_call (thread.c:2008) ==1179738== by 0x49253DA: frr_run (libfrr.c:1198) ==1179738== by 0x1EEC38: main (bgp_main.c:520) Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-09-10 18:13:45 +03:00
Donatas Abraitis	5761ca2faa	Merge pull request #11894 from sri-mohan1/sri-bgp-dbg1 bgpd: changes for code maintainability	2022-09-02 13:21:34 +03:00
sri-mohan1	a10c287292	bgpd: changes for code maintainability these changes are for improving the code maintainability Signed-off-by: sri-mohan1 <sri.mohan@samsung.com>	2022-09-02 00:31:40 +05:30
Russ White	d72c279d08	Merge pull request #11833 from opensourcerouting/feature/bgp_neighbor_soo bgpd: Add `neighbor soo` command	2022-08-30 11:17:53 -04:00
Donald Sharp	083ec940ab	bgpd: Convert from bgp_clock() to monotime() Let's convert to our actual library call instead of using yet another abstraction that makes it fun for people to switch daemons. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2022-08-24 08:23:40 -04:00
Donatas Abraitis	01da2d2691	bgpd: Add `neighbor soo` command BGP SoO is a tag that is appended on BGP updates to allow a peer to mark a particular peer as belonging to a particular site. In certain MPLS L3 VPN configurations, the BGP AS-Path may not provide the granularity needed prevent a loop in the control-plane. With this in mind, BGP SoO is designed to fill this gap and prevent a routing loop that may occur. If we configure for example, `neighbor soo 65000:1` at PEs, routes won't be announced between CPEs if soo matches. This is especially needed when using as-override or allowas-in. Also, this is the automated way of the same behavior as configuring route-maps for each peer like: ``` bgp extcommunity-list cpe permit soo 65000:1 ! route-map cpe permit 10 set extcommunity soo 65000:1 ... route-map cpe deny 10 match extcommunity cpe route-map cpe permit 20 ... ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2022-08-20 21:22:45 +03:00
Donatas Abraitis	eed5b70f94	Merge pull request #11657 from donaldsharp/why_timer convert thread_cancel to THREAD_OFF	2022-07-22 08:26:08 +03:00

1 2 3 4 5 ...

315 Commits