mirror_frr

mirror of https://git.proxmox.com/git/mirror_frr synced 2025-05-28 19:18:43 +00:00

Author	SHA1	Message	Date
root	36dc75886d	bgpd: Creating Loopback Interface Flaps BGPd (#2865) * The function bgp_router_id_zebra_bump() will check for active bgp peers before changing the router ID. If there are established peers, router ID is not modified which prevents the flapping of established peer connection * Added field in bgp structure to store the count of established peers Signed-off-by: kssoman <somanks@vmware.com>	2018-11-19 04:35:32 -08:00
Don Slice	5742e42b98	bgpd: make name of default vrf/bgp instance consistent Problems were reported with the name of the default vrf and the default bgp instance being different, creating confusion. This fix changes both to "default" for consistency. Ticket: CM-21791 Signed-off-by: Don Slice <dslice@cumulusnetworks.com> Reviewed-by: CCR-7658 Testing: manual testing and automated tests before pushing	2018-10-31 06:20:37 -04:00
David Lamparter	0437e10517	*: spelchek Signed-off-by: David Lamparter <equinox@diac24.net>	2018-10-25 20:10:57 +02:00
Donald Sharp	19bd3dffc1	bgpd: Do a bit better job of tracking the bgp->peerhash When we add/remove peers we need to do a bit better job of tracking them in the bgp->peerhash. 1) When we have the doppelganger take over, make sure the winner is the one represented in the peerhash. 2) When creating the doppelganger, leave the current one in place instead of blindly replacing it. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-10-07 20:55:52 -04:00
Donald Sharp	9bf904cc8b	bgpd: Try to notice when configuration changes during startup During peer startup there exists the possibility that both local and remote peers try to start communication at the same time. In addition it is possible for local configuration to change at the same time this is going on. When this happens try to notice that the remote peer may be in opensent or openconfirm and if so we need to restart the connection from both sides. Additionally try to write a bit of extra code in peer_xfer_conn to notice when this happens and to emit an error message to the end user about this happening so that it can be cleaned up. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-10-01 10:58:06 -04:00
Quentin Young	1c50c1c0d6	*: style for EC replacements Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 19:38:57 +00:00
Quentin Young	450971aa99	*: LIB_[ERR|WARN] -> EC_LIB Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 19:34:28 +00:00
Quentin Young	e50f7cfdbd	bgpd: BGP_[WARN|ERR] -> EC_BGP Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-09-13 18:51:04 +00:00
Quentin Young	09c866e34d	*: rename ferr_zlog -> flog_err_sys Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Quentin Young	af4c27286d	*: rename zlog_fer -> flog_err Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Donald Sharp	02705213b1	bgpd: Convert to using LIB_ERR_XXX where possible Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Don Slice	14454c9fdd	bgpd: implement zlog_ferr facility for enhanced error messages in bgp Signed-off-by: Don Slice <dslice@cumulusnetworks.com>	2018-08-14 20:02:05 +00:00
Pascal Mathis	b90a8e13ee	bgpd: Implement group-overrides for peer timers This commit implements BGP peer-group overrides for the timer flags, which control the value of the hold, keepalive, advertisement-interval and connect timers. It was kept separated on purpose as the whole timer implementation is quite complex and merging this commit together with the other flag implementations did not seem right. Basically three new peer flags were introduced, namely PEER_FLAG_ROUTEADV, PEER_FLAG_TIMER and PEER_FLAG_TIMER_CONNECT. The overrides work exactly the same way as they did before, but introducing these flags made a few conditionals simpler as they no longer had to compare internal data structures against each other. Last but not least, the test suite has been adjusted accordingly to test the newly implemented flag overrides. Signed-off-by: Pascal Mathis <mail@pascalmathis.com>	2018-06-14 18:55:30 +02:00
Donald Sharp	c42eab4bf5	bgpd: Respect ability to reach nexthop if available When bgp is thinking about opening a connection to a peer, if we are connected to zebra, allow that to influence our decision to start the connection. Found Scenario: Both bgp and zebra are started up at the same time. Zebra is being used to create the connected route through which bgp will establish a peering relationship. The machine is a bit loaded due to other startup conditions and as such bgp gets to the connection stage here before zebra has installed the route. If bgp does not respect zebra data when it does have a connection then we will attempt to connect. The connect will fail because there is no route. At that time we will go into the connect timeout(2 minutes) and delay connection. What this does. If we have established a zebra connection and we do not have a clear path to the destination at this point do not allow the connection to proceed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-05-11 07:46:43 -04:00
Donald Sharp	54ff5e9b02	bgpd: Cleanup messages from getsockopt The handling of the return codes for getsockopt was slightly wrong. getsockopt returns -1 on error and errno is set. What to do with the return code at that point is dependent on what sockopt you are asking about. In this case status holds the error returned for SO_ERROR. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-05-11 07:34:24 -04:00
G. Paul Ziemba	960035b2d9	bgpd: nexthop tracking with labels for vrf-vpn leaking Routes that have labels must be sent via a nexthop that also has labels. This change notes whether any path in a nexthop update from zebra contains labels. If so, then the nexthop is valid for routes that have labels. If a nexthop update has no labeled paths, then any labeled routes referencing the nexthop are marked not valid. Add a route flag BGP_INFO_ANNC_NH_SELF that means "advertise myself as nexthop when announcing" so that we can track our notion of the nexthop without revealing it to peers. Signed-off-by: G. Paul Ziemba <paulz@labn.net>	2018-04-04 10:00:23 -07:00
Quentin Young	d7c0a89a3a	*: use C99 standard fixed-width integer types The following types are nonstandard: - u_char - u_short - u_int - u_long - u_int8_t - u_int16_t - u_int32_t Replace them with the C99 standard types: - uint8_t - unsigned short - unsigned int - unsigned long - uint8_t - uint16_t - uint32_t Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-03-27 15:13:34 -04:00
Donald Sharp	5410015a79	bgpd: peer->bgp must be non NULL We lock and set peer->bgp at peer creation and only remove it at deletion. Therefore these tests are not needed. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com>	2018-03-20 19:09:06 -04:00
Lou Berger	996c93142d	*: conform with COMMUNITY.md formatting rules, via 'make indent' Signed-off-by: Lou Berger <lberger@labn.net>	2018-03-06 14:04:32 -05:00
Philippe Guibert	f62abc7d65	bgpd: do not start BGP VRF peer connection, if VRF is unknown Upon starting a BGP VRF instance, the server socket is not created, because the VRF ID is not known, and the underlying VRF backend is not ready yet. Because of that, the peer connection attempt will not be started before. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2018-02-27 11:11:24 +01:00
Philippe Guibert	61cf4b3715	bgpd: bgp support for netns The change contained in this commit does the following: - discovery of vrf id from zebra daemon, and adaptation of bgp contexts with BGP. The list of network addresses contain a reference to the bgp context supporting the vrf. The bgp context contains a vrf pointer that gives information about the netns path in case the vrf is a netns path. Only some contexts are impacted, namely socket creation, and retrieval of local IP settings. ( this requires vrf identifier). Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2018-02-27 11:11:24 +01:00
Russ White	2ed7e4c3c3	Merge pull request #1591 from qlyoung/bgpd-ringbuf bgpd: use ring buffer for network input	2018-01-10 19:59:24 -05:00
Quentin Young	0112e9e0b9	bgpd: use atomic_* ops on _Atomic variables Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-01-09 15:40:48 -05:00
Quentin Young	74ffbfe6fe	bgpd: use ring buffer for network input The multithreading code has a comment that reads: "XXX: Heavy abuse of stream API. This needs a ring buffer." This patch makes the relevant code use a ring buffer. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2018-01-03 14:35:11 -05:00
Quentin Young	7a86aa5a0a	bgpd: schedule packet job after connection xfer During initial session establishment, bgpd performs a "connection transfer" to a new peer struct if the connection was initiated passively (i.e. by the remote peer). With the addition of buffered input and a reorganized packet processor, the following race condition manifests: 1. Remote peer initiates a connection. After exchanging OPEN messages, we send them a KEEPALIVE. They send us a KEEPALIVE followed by 10,000 UPDATE messages. The I/O thread pushes these onto our local peer's input buffer and schedules a packet processing job on the main thread. 2. The packet job runs and processes the KEEPALIVE, which completes the handshake on our end. As part of transferring to ESTABLISHED we transfer all peer state to a new struct, as mentioned. Upon returning from the KEEPALIVE processing routine, the peer context we had has now been destroyed. We notice this and stop processing. Meanwhile 10k UPDATE messages are sitting on the input buffer. 3. N seconds later, the remote peer sends us a KEEPALIVE. The I/O thread schedules another process job, which finds 10k UPDATEs waiting for it. Convergence is achieved, but has been delayed by the value of the KEEPALIVE timer. The racy part is that if the remote peer takes a little bit of time to send UPDATEs after KEEPALIVEs -- somewhere on the order of a few hundred milliseconds -- we complete the transfer successfully and the packet processing job is scheduled on the new peer upon arrival of the UPDATE messages. Yuck. The solution is to schedule a packet processing job on the new peer struct after transferring state. Lengthy commit message in case someone has to debug similar problems in the future... Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	7db44ec8fa	bgpd: transfer raw input buffer to new peer During initial session establishment, bgpd performs a "connection transfer" to a new peer struct if the connection was initiated passively (i.e. by the remote peer). With the addition of buffered input, I forgot to transfer the raw input buffer to the new peer. This resulted in infrequent failures during session handshaking whereby half of a packet would be thrown away in the middle of a read causing us to send a NOTIFY for an unsynchronized header. Usually the transfer coincided with a clean input buffer, hence why it only showed up once in a while.	2017-11-30 16:18:05 -05:00
Quentin Young	387f984e58	bgpd: fix bgp active open At some point when rearranging FSM code, bgpd lost the ability to perform active opens because it was only paying attention to POLLIN and not POLLOUT, when the latter is used to signify a successful connection in the active case. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	becedef6c3	bgpd, tests: comment formatting Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:05 -05:00
Quentin Young	bea0122657	bgpd: misc fsm fixes * Keepalive on/off calls are necessary in certain cases due to screwy fsm flow not turning them on after transferring a passive peer connection in peer_xfer_conn * Missed a case in bgp_event_update() that resulted in a return code of -1 instead of BGP_Stop, which confuses the packet processing routine Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:02 -05:00
Quentin Young	d815168795	bgpd: fix bgp_packet.c / bgp_fsm.c organization Despaghettification of bgp_packet.c and bgp_fsm.c Sometimes we call bgp_event_update() inline packet parsing. Sometimes we post events instead. Sometimes we increment packet counters in the FSM. Sometimes we do it in packet routines. Sometimes we update EOR's in FSM. Sometimes we do it in packet routines. Fix the madness. bgp_process_packet() is now the centralized place to: - Update message counters - Execute FSM events in response to incoming packets FSM events are now executed directly from this function instead of being queued on the thread_master. This is to ensure that the FSM contains the proper state after each packet is parsed. Otherwise there could be race conditions where two packets are parsed in succession without the appropriate FSM update in between, leading to session closure due to receiving inappropriate messages for the current FSM state. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:02 -05:00
Quentin Young	a9794991c7	bgpd: bye bye THREAD_BACKGROUND Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:01 -05:00
Quentin Young	9eb217ff69	bgpd: batched i/o Instead of reading a packet header and the rest of the packet in two separate i/o cycles, instead read a chunk of data at one time and then parse as many packets as possible out of the chunk. Also changes bgp_packet.c to batch process packets. To avoid thrashing on useless mutex locks, the scheduling call for bgp_process_packet has been changed to always succeed at the cost of no longer being cancel-able. In this case this is acceptable; following the pattern of other event-based callbacks, an additional check in bgp_process_packet to ignore stray events is sufficient. Before deleting the peer all events are cleared which provides the requisite ordering. XXX: chunk hardcoded to 5, should use something similar to wpkt_quanta Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:18:00 -05:00
Quentin Young	b72b6f4fc9	bgpd: rename peer_keepalives* --> bgp_keepalives* Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	424ab01d0f	bgpd: implement buffered reads * Move and modify all network input related code to bgp_io.c * Add a real input buffer to `struct peer` * Move connection initialization to its own thread.c task instead of piggybacking off of bgp_read() * Tons of little fixups Primary changes are in bgp_packet.[ch], bgp_io.[ch], bgp_fsm.[ch]. Changes made elsewhere are almost exclusively refactoring peer->ibuf to peer->curr since peer->ibuf is now the true FIFO packet input buffer while peer->curr represents the packet currently being processed by the main pthread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	56257a44e4	bgpd: move bgp i/o to a separate source file After implementing threading, bgp_packet.c was serving the double purpose of consolidating packet parsing functionality and handling actual I/O operations. This is somewhat messy and difficult to understand. I've thus moved all code and data structures for handling threaded packet writes to bgp_io.[ch]. Although bgp_io.[ch] only handles writes at the moment to keep the noise on this commit series down, for organization purposes, it's probably best to move bgp_read() and its trappings into here as well and restructure that code so that read()'s happen in the pthread and packet processing happens on the main thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:59 -05:00
Quentin Young	dc1188bb4d	bgpd: correctly schedule select() at session startup On TCP connection failure during session setup, bgp_stop() checks whether peer->t_read is non-null to know whether or not to unschedule select() on peer->fd before calling close() on it. Using the API exposed by thread.c instead of bgpd's wrapper macro BGP_READ_ON() results in this thread value never being set, which causes bgp_stop() to skip the cancellation of select() before calling close(). Subsequent calls to select() on that fd crash the daemon. Use the macro instead. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	727c4f870b	bgpd: transfer packets from peer stub to actual peer During transition from OpenConfirm -> Established, we wipe the peer stub's output buffer. Because thread.c prioritizes I/O operations over regular background threads and events, in a single threaded environment this ordering meant that the output buffer would be happily empty at wipe time. In MT-land, this convenient coincidence is no longer true; thus we need to make sure that any packets remaining on the peer stub get transferred over to the peer proper. Also removes misleading comment indicating that bgp_establish() sends a keepalive packet. It does not. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:58 -05:00
Quentin Young	03014d48f4	bgpd: put BGP keepalives in a pthread This patch, in tandem with moving packet writes into a dedicated kernel thread, fixes session flaps caused by long-running internal operations starving the (old) userspace write thread. BGP keepalives are now produced by a kernel thread and placed onto the peer's output queue. These are then consumed by the write thread. Both of these tasks are concurrent with the rest of bgpd, obviating the session flaps described above. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	07a1652682	bgpd: move bgp_connect_check() to bgp_fsm.c Prior to this change, after initiating a nonblocking connection to the remote peer bgpd would call both BGP_READ_ON and BGP_WRITE_ON on the peer's socket. This resulted in a call to select(), so that when some event (either a connection success or failure) occurred on the socket, one of bgp_read() or bgp_write() would run. At the beginning of each of those functions was a hook into bgp_connect_check(), which checked the socket status and issued the correct connection event onto the BGP FSM. This code is better suited for bgp_fsm.c. Placing it there avoids scheduling packet reads or writes when we don't know if the socket has established a connection yet, and the specific functionality is a better fit for the responsibility scope of this unit. This change also helps isolate the responsibilities of the packet-writing kernel thread. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	d3ecc69e5f	bgpd: move packet writes into dedicated pthread * BGP_WRITE_ON() removed * BGP_WRITE_OFF() removed * peer_writes_on() added * peer_writes_off() added * bgp_write_proceed_actions() removed Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-30 16:17:57 -05:00
Quentin Young	05c7a1cc93	bgpd: use FOREACH_AFI_SAFI where possible Improves consistency and readability. Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-11-21 13:02:06 -05:00
Don Slice	d25e4efc52	bgpd: fix various problems with hold/keepalive timers Problem reported that we weren't adjusting the keepalive timer correctly when we negotiated a lower hold time learned from a peer. While working on this, found we didn't do inheritance correctly at all. This fix solves the first problem and also ensures that the timers are configured correctly based on this priority order - peer defined > peer-group defined > global config. This fix also displays the timers as "configured" regardless of which of the three locations above is used. Ticket: CM-18408 Signed-off-by: Don Slice <dslice@cumulusnetworks.com> Reviewed-by: CCR-6807 Testing-performed: Manual testing successful, fix tested by submitter, bgp-smoke completed successfully	2017-10-26 11:55:31 -04:00
Renato Westphal	a08ca0a7e1	lib: remove SAFI_RESERVED_4 and SAFI_RESERVED_5 SAFI values have been a major source of confusion over the last few years. That's because each SAFI needs to be represented in two different ways: * IANA's value used to send/receive packets over the network; * Internal value used for array indexing. In the second case, defining reserved values makes no sense because we don't want to index SAFIs that simply don't exist. The sole purpose of the internal SAFI values is to remove the gaps we have among the IANA values, which would represent wasted memory in C arrays. With that said, remove these reserved SAFIs to avoid further confusion in the future. Signed-off-by: Renato Westphal <renato@opensourcerouting.org>	2017-07-31 23:38:38 -03:00
David Lamparter	9d303b37d7	Revert "*: reindent pt. 2" This reverts commit `c14777c6bf`. clang 5 is not widely available enough for people to indent with. This is particularly problematic when rebasing/adjusting branches. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-22 14:52:33 +02:00
whitespace / reindent	c14777c6bf	*: reindent pt. 2 w/ clang 5 * reflow comments * struct members go 1 per line * binpack algo was adjusted	2017-07-17 15:26:02 -04:00
whitespace / reindent	d62a17aede	*: reindent indent.py `git ls-files | pcregrep '\.[ch]$' | pcregrep -v '^(ldpd|babeld|nhrpd)/'` Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-17 14:04:07 +02:00
David Lamparter	acd738fc7f	*: fix GCC 7 switch/case fallthrough warnings Need a comment on these. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-07-14 16:59:43 +02:00
Quentin Young	56b4067930	*: simplify log message lookup log.c provides functionality for associating a constant (typically a protocol constant) with a string and finding the string given the constant. However this is highly delicate code that is extremely prone to stack overflows and off-by-one's due to requiring the developer to always remember to update the array size constant and to do so correctly which, as shown by example, is never a good idea. The original goal of this code was to try to implement lookups in O(1) time without a linear search through the message array. Since this code is used 99% of the time for debugs, it's worth the 5-6 additional cmp's worst case if it means we avoid exploitable bugs due to oversights... Signed-off-by: Quentin Young <qlyoung@cumulusnetworks.com>	2017-06-21 15:22:21 +00:00
David Lamparter	57463530f3	Merge branch 'stable/3.0' Conflicts: ospf6d/ospf6_lsa.c ospfd/ospf_vty.c zebra/interface.c Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-05-18 12:28:12 +02:00
David Lamparter	92eedda1fb	Merge branch stable/2.0 into stable/3.0 Conflicts: bgpd/bgp_fsm.c ospf6d/ospf6_lsa.c ospfd/ospf_vty.c zebra/redistribute.c Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2017-05-18 12:23:13 +02:00

148 Commits