Commit Graph

29287 Commits

Author SHA1 Message Date
Louis Scalbert
bac0d1e606 topotests: isis-tilfa add a switchover test after linkdown
Add a switchover test that consists in shutting down an interface.

Check that the switchover between primary and backup happens before the
SPF re-computation.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-24 10:06:05 +02:00
Louis Scalbert
d526476184 isisd: avoid fast-reroute from running twice on a down interface event
isis_ifp_down() may in some circumstances be called twice on a down
interface event.

Avoid applying fast-reroute on an already down interface.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-24 10:06:05 +02:00
Louis Scalbert
08e4960ab4 isisd: avoid fast-reroute on down adjacency when the interface is down
When an IS-IS interface is coming down, fast-reroute may be triggered
twice: a first time after the detection of the interface down event and
a second time after the detection of the adjacency down (because of the
expiration of the ISIS Hello or BFD timers).

Prevent a BFD down event from running fast-reroute a second time if the
interface was already detected down.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-24 10:05:49 +02:00
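
The guard described in the commit above can be pictured with a small Python model. This is only an illustrative sketch, not isisd's C code: the `Circuit` class, its `interface_up` flag, and the method names are hypothetical. It shows that a later adjacency-down event is ignored once the interface-down event has already triggered fast-reroute.

```python
class Circuit:
    """Toy model of an IS-IS circuit (hypothetical, not isisd's data structure)."""

    def __init__(self) -> None:
        self.interface_up = True
        self.frr_runs = 0  # number of times fast-reroute was applied

    def _apply_fast_reroute(self) -> None:
        # Stand-in for switching primary routes over to their backups.
        self.frr_runs += 1

    def on_interface_down(self) -> None:
        # A repeated interface-down event must not re-run fast-reroute.
        if not self.interface_up:
            return
        self.interface_up = False
        self._apply_fast_reroute()

    def on_adjacency_down(self) -> None:
        # Hello/BFD expiry after the interface already went down: nothing to do.
        if not self.interface_up:
            return
        self._apply_fast_reroute()


circuit = Circuit()
circuit.on_interface_down()  # first event: fast-reroute runs
circuit.on_adjacency_down()  # ignored, the interface is already down
circuit.on_interface_down()  # ignored, duplicate interface-down event
print(circuit.frr_runs)  # prints 1
```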
Donatas Abraitis
41b1d7a3a4 bgpd: Don't increment twice notification sent stats counter
Before:

```
$ vtysh -c 'clear ip bgp 192.168.10.17'
$ vtysh -c 'show bgp neighbor 192.168.10.17 json' | jq '."192.168.10.17".messageStats.notificationsSent'
2
```

After:

```
$ vtysh -c 'clear ip bgp 192.168.10.17'
$ vtysh -c 'show bgp neighbor 192.168.10.17 json' | jq '."192.168.10.17".messageStats.notificationsSent'
1
```

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
2022-05-24 10:20:09 +03:00
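
The `jq` filter used in the commit above can also be expressed in Python, which may be handy inside a test. This is a minimal sketch: the `notifications_sent` helper is hypothetical, and the sample string is a hand-trimmed stand-in that contains only the keys named in the `jq` path, not real `vtysh` output.

```python
import json


def notifications_sent(raw: str, peer: str) -> int:
    """Return messageStats.notificationsSent for one peer.

    `raw` is assumed to be the text of `show bgp neighbor <peer> json`.
    """
    return json.loads(raw)[peer]["messageStats"]["notificationsSent"]


# Trimmed, hand-written sample with only the keys used by the jq filter.
sample = '{"192.168.10.17": {"messageStats": {"notificationsSent": 1}}}'
print(notifications_sent(sample, "192.168.10.17"))  # prints 1
```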
Kuldeep Kashyap
fb943df10b tests: [Topojson] Enhance CLI logging
Currently only the JSON output of CLI commands is logged
in topotests (topojson) runs, and the same output is
printed twice, which is of no use.

Enhance the code to log both plain and JSON output
of CLI commands and remove the duplicate logging.

This reduces the size of the execution logs and helps
with verification when the plain and JSON outputs of a
CLI command do not match.

Signed-off-by: Kuldeep Kashyap <kashyapk@vmware.com>
2022-05-23 23:34:19 -07:00
Donatas Abraitis
ae872c2f88 bgpd: Add JSON output for show rpki cache-connection
```
spine1-debian-11# sh rpki cache-connection
Connected to group 1
rpki tcp cache 192.168.10.17 8283 pref 1 (connected)
rpki tcp cache 192.168.10.17 8282 pref 2
spine1-debian-11# sh rpki cache-connection json
{
  "connectedGroup":1,
  "connections":[
    {
      "mode":"tcp",
      "host":"192.168.10.17",
      "port":"8283",
      "preference":1,
      "state":"connected"
    },
    {
      "mode":"tcp",
      "host":"192.168.10.17",
      "port":"8282",
      "preference":2,
      "state":"disconnected"
    }
  ]
}
```

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
2022-05-24 00:06:07 +03:00
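
As a rough illustration of how the new JSON output could be consumed, the sketch below lists the caches reported as connected. The `connected_caches` helper is hypothetical; the sample is copied from the commit message above, where `port` is a string, so it is converted to an integer here.

```python
import json

# Sample copied from the `sh rpki cache-connection json` output above.
SAMPLE = """
{
  "connectedGroup":1,
  "connections":[
    {"mode":"tcp", "host":"192.168.10.17", "port":"8283",
     "preference":1, "state":"connected"},
    {"mode":"tcp", "host":"192.168.10.17", "port":"8282",
     "preference":2, "state":"disconnected"}
  ]
}
"""


def connected_caches(raw: str) -> list:
    """Return (host, port) pairs for every connection in state "connected"."""
    data = json.loads(raw)
    return [
        (conn["host"], int(conn["port"]))
        for conn in data["connections"]
        if conn["state"] == "connected"
    ]


print(connected_caches(SAMPLE))  # prints [('192.168.10.17', 8283)]
```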
Jafar Al-Gharaibeh
778e0cef38 debian, redhat: update changelog for new release
Signed-off-by: Jafar Al-Gharaibeh <jafar@atcorp.com>
2022-05-23 14:56:00 -05:00
Donald Sharp
2ca908755c tests: pbr_topo1 sometimes fails under really heavy load
This test is sometimes failing under severe load.  Give some time
for the linux rule installation to actually be registered by the
system before declaring failure.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-23 15:28:46 -04:00
Jafar Al-Gharaibeh
ac425bfc2b
Merge pull request #11150 from opensourcerouting/fix/use_librtr_higher_0.8.0
packaging: Require librtr >= 0.8.0
2022-05-23 11:11:10 -04:00
Donatas Abraitis
3da33b248a github: Add base branch label to every PR to distinguish easily
Makes it faster to notice and filter backports by per-release labels.

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
2022-05-23 16:58:07 +03:00
Louis Scalbert
62e458278b isisd: apply fast-reroute when an interface falls down
Backup routes are sent to zebra by routing daemons such as isisd so that
the dataplane can pre-install them with a lower priority. When an
interface comes down, the associated primary routes are discarded by the
dataplane and the backup ones take over.

However, some dataplanes (e.g. Netlink ones) do not pre-install the
backup routes. Associated prefixes have no next-hop until SPF is
recomputed.

Apply fast-reroute as soon as an interface falls down by sending route
UPDATEs to zebra.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-23 15:31:34 +02:00
Donatas Abraitis
23706eacc2 doc: Use full URL for FRR keyword highlight
The |PACKAGE_URL| substitution does not seem to work when referenced,
only when used directly in the text.

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
2022-05-23 11:47:16 +03:00
Louis Scalbert
a11742dc53 isisd: allow nexthop lookup on interface only
Allow the nexthop lookup function to return the first nexthop found on
the interface identified by ifindex if the IP is unspecified.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-23 10:44:50 +02:00
Louis Scalbert
a254fa54e3 isisd: remove previous labels before fast-reroute
Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-23 10:44:25 +02:00
Louis Scalbert
7125297323 isisd: apply fast-reroute on an adjacency failure
When an adjacency falls down, the primary routes are not deleted on the
dataplane until the SPF is recomputed. Even if the backup routes are
pre-installed on the dataplane, there is no fast-reroute optimization.

Reasons for an adjacency to come down are:
- BFD down
- Hello timer timeout
- User adjacency clear

Apply the backup route switchover for fast-reroute as soon as an IS-IS
adjacency falls down, before the first SPF re-computation, so that
pre-computed backup routes are applied sooner.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
2022-05-23 10:44:20 +02:00
sarita patra
2ea34a81df pim6d: Don't enable mld on pimreg interface
Signed-off-by: sarita patra <saritap@vmware.com>
2022-05-23 01:24:30 -07:00
Donatas Abraitis
4febdb6b9a
Merge pull request #10836 from anlancs/bgpd-mh-delay-esi
zebra: delay setting esi in zebra_evpn_local_es_update()
2022-05-23 07:49:08 +02:00
Donatas Abraitis
972426602a tests: Add Notification support for Graceful-Restart tests
Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
2022-05-22 22:40:22 +03:00
Donald Sharp
a830fc4c79
Merge pull request #11217 from opensourcerouting/fix/doc_overview
doc: Fix FRR/Slack keyword links in overview section
2022-05-22 15:03:05 -04:00
Donald Sharp
6e3b94d9d3
Merge pull request #11238 from opensourcerouting/rtadv-cleanup
zebra: clean up rtadv integration
2022-05-22 15:02:27 -04:00
David Lamparter
7ca9c407ed zebra: clean up rtadv integration
Move a few things into places they actually belong, and reduce the
number of places we have `#ifdef HAVE_RTADV`.  Just overall code
prettification.

... I had actually done this quite a while ago while doing some other
random hacking and thought it more useful to not be sitting on it on my
disk...

Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
2022-05-21 14:14:01 +02:00
Jafar Al-Gharaibeh
8eeadd88e9
Merge pull request #11227 from donaldsharp/verify_not_queued
tests: Ensure routes are not queued when calling verify_rib
2022-05-20 16:03:23 -04:00
David Lamparter
9807de3be3
Merge pull request #11223 from donaldsharp/ospf_shenanigans
2022-05-20 18:32:07 +02:00
Donald Sharp
197191a0ce
Merge pull request #11232 from opensourcerouting/bgp-sendhold-derp
bgpd: fix oopsie with SendHoldTime==0
2022-05-20 11:10:04 -04:00
Donald Sharp
006ef829d8 tests: Allow a bit longer for bfd topo tests to synchronize
Allowing only 4 seconds for a bfd test to synchronize is going
to run into problems on extremely loaded systems.  The test
system should value that the test actually converged over
whether it converged in a reasonable time, especially on test
systems that are loaded because many tests are running
at the same time.  If it is important to actually test
that something got done by the RFC, the CI system as it
is currently written is not the correct place for this.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-20 10:31:48 -04:00
Donald Sharp
cb8018f4c3 tests: Make verify_rib wait up to 40 seconds.
Under heavy load I am seeing verify_rib failing after 12 seconds
but succeeding after 17:

2022-05-19 18:52:54,374 DEBUG: topolog: Exiting lib API: verify_rib
2022-05-19 18:52:54,374 DEBUG: topolog: Function returned True
2022-05-19 18:52:54,374 WARNING: topolog: RETRY DIAGNOSTIC: SUCCEED after FAILED with requested timeout of 12.0s; however, succeeded in 14.7s, investigate timeout timing

There is no reason to not have the test wait a bit longer for very very
heavily loaded systems.  Change the time to 40 seconds.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-20 10:31:48 -04:00
Donald Sharp
6ec742d95f tests: Ensure routes are not queued when calling verify_rib
Lots of tests call verify_rib that takes a list of routes that
need to be verified in some fashion.  This verify_rib functionality
will try up to 12 seconds before failing the check that zebra
has the route and has installed it.

Unfortunately the verify_rib code was not looking to see if
the route was queued for installation and was then allowing
tests to immediately do subsequent steps that depended on
that route actually being installed, sometimes causing tests
to fail.

Write a bit of additional code that looks at the queued
status and allows the test to wait a bit longer for zebra
to finish processing before allowing the test to move on
to the next bit.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-20 10:31:48 -04:00
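
The waiting behaviour described in the commit above can be sketched as a polling loop. This is not the topotest library's code: `wait_until_installed`, its parameters, and the `installed`/`queued` keys of the route dictionary are assumptions made for illustration. The 12 second default mirrors the timeout mentioned in the commit message.

```python
import time
from typing import Callable


def wait_until_installed(
    get_route: Callable[[], dict],
    timeout: float = 12.0,
    interval: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll until the route is installed and no longer queued, or time out.

    `get_route` is a hypothetical callback returning the current route state
    as a dict with boolean "installed" and "queued" keys (assumed names).
    """
    deadline = clock() + timeout
    while True:
        route = get_route()
        # A route still queued for installation does not count as done.
        if route.get("installed") and not route.get("queued"):
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


# Usage example with canned states instead of a real router query.
states = iter([
    {"installed": False, "queued": True},
    {"installed": True, "queued": True},
    {"installed": True, "queued": False},
])
print(wait_until_installed(lambda: next(states), interval=0))  # prints True
```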
Donald Sharp
cfb9e0ee59 lib: Make thread_is_scheduled a static inline
For performance, yo.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-20 09:53:44 -04:00
Donald Sharp
c905f04c7c ospf6d: Clean up thread interface
a) Remove setting of thread pointer to NULL after
thread invocation, this is already done.

b) Use thread_is_scheduled()

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-20 09:52:16 -04:00
Donald Sharp
23b11ab185 ospf6d: Remove double check of default prefix
The ospf6_is_valid_summary_addr function is checking
to see if a prefix is the default and also then double
comparing it against the v6 prefix part.  No need to do this.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-20 09:52:16 -04:00
David Lamparter
cbbb1813a6
Merge pull request #11008 from patrasar/sec_addr_list_pimv6
2022-05-20 15:25:52 +02:00
David Lamparter
5a744d5c86
Merge pull request #10903 from SaiGomathiN/pimv6debug
2022-05-20 15:23:31 +02:00
David Lamparter
382268f46a bgpd: fix oopsie with SendHoldTime==0
... *duh*

Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
2022-05-20 14:16:24 +02:00
anlan_cs
c331ef1665 zebra: remove one unnecessary check for l3vni nb
The parent node of "vrf" MUST be non-NULL, so the check is unnecessary and
misleading: it implies a branch for a NULL parent node, which makes no
sense. Remove it.

Signed-off-by: anlan_cs <vic.lan@pica8.com>
2022-05-20 03:11:27 -04:00
Donald Sharp
d37879b7a2
Merge pull request #11225 from opensourcerouting/bgp-sendhold
bgpd: implement SendHoldTimer
2022-05-19 14:51:29 -04:00
Sri Mohana Singamsetty
bde51e807f
Merge pull request #11216 from chiragshah6/fdev2
zebra: netlink registry of rtm tunnel notification
2022-05-19 10:28:25 -07:00
Sri Mohana Singamsetty
0e6e6bc36e
Merge pull request #11222 from donaldsharp/bgp_zebra_stuff
Bgp zebra stuff
2022-05-19 09:41:41 -07:00
Sri Mohana Singamsetty
93480825fb
Merge pull request #11204 from anlancs/fix/bgpd-check-vnihash
bgpd: remove unnecessary check for evpn
2022-05-19 09:38:10 -07:00
Sri Mohana Singamsetty
595ebf525b
Merge pull request #11210 from anlancs/fix/zebra-leak-vtp
zebra: fix missing delete vtep during vni transition
2022-05-19 09:35:27 -07:00
David Lamparter
bd9fb6f368 bgpd: implement SendHoldTimer
As described by
https://www.ietf.org/archive/id/draft-spaghetti-idr-bgp-sendholdtimer-04.html

Since this replicates the HoldTime check on the receiver that is already
part of the protocol, I do not believe it necessary to wait for IETF
progress on this draft.  It's just replicating an existing element of
the protocol at the other side of the session.

Signed-off-by: David Lamparter <equinox@opensourcerouting.org>
2022-05-19 12:14:40 +02:00
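
The idea of mirroring the receiver's HoldTime check on the sending side can be illustrated with a toy timer. This sketch is not bgpd's implementation and does not reproduce the draft's wording: the `SendHoldTimer` class, the `output_pending` condition, and treating a hold time of 0 as "disabled" are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class SendHoldTimer:
    """Toy send-side hold timer (hypothetical model, times in seconds)."""

    hold_time: float            # assumption: 0 means the check is disabled
    last_progress: float = 0.0  # last time data was successfully sent

    def on_send_progress(self, now: float) -> None:
        # Called whenever the peer accepted some of our queued output.
        self.last_progress = now

    def expired(self, now: float, output_pending: bool) -> bool:
        if self.hold_time == 0:
            return False
        # Only a session with stuck output can expire.
        return output_pending and now - self.last_progress >= self.hold_time


timer = SendHoldTimer(hold_time=180.0)
timer.on_send_progress(now=10.0)
print(timer.expired(now=100.0, output_pending=True))   # prints False
print(timer.expired(now=190.0, output_pending=True))   # prints True
print(timer.expired(now=190.0, output_pending=False))  # prints False
```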
Donatas Abraitis
153d1426cc doc: Fix FRR/Slack keyword links in overview section
Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
2022-05-19 12:17:35 +03:00
Russ White
18028bdb9b
Merge pull request #11183 from opensourcerouting/feature/handle_bgp_gr_notification
bgpd: Activate Graceful-Restart when receiving CEASE/HOLDTIME notifications
2022-05-18 15:17:59 -04:00
Russ White
4c705792cb
Merge pull request #11214 from kuldeepkash/multicast_pim_dr_nondr
tests: Fix for test_pim_dr_nondr_with_ospf_topo2 test failure
2022-05-18 15:11:15 -04:00
Russ White
2c5d51d6f7
Merge pull request #11218 from opensourcerouting/fix/adv_routes_best_path_wrong_local_as
bgpd: Show a correct local AS for the advertised/bestpath routes
2022-05-18 15:10:18 -04:00
Russ White
0c9a3eab90
Merge pull request #11219 from pguibert6WIND/doc_show_isis
doc: use real wording for the show isis segment-routing node command
2022-05-18 15:09:25 -04:00
Donald Sharp
1b3cf91b0c zebra: Fix newline in log message
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-18 14:42:22 -04:00
Donald Sharp
cea8b6556f bgpd: Allow nht to work when connection succeeds
Commit 8761cd6ddb originally
implemented the ability for LL nexthop tracking to be
interface based.  During some more testing the code
to make the interface event happen was noticed to
not properly run in some cases.  This was because
it was originally assumed that the connect could
not succeed.  Testing has shown that the connect
can succeed and FRR needs to handle the nht tracking
in this case too.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
2022-05-18 14:42:20 -04:00
Russ White
c1e2a1eae3
Merge pull request #11205 from chiragshah6/fdev1
zebra: new netlink parse utility for rta used to send nhg msg
2022-05-18 11:13:22 -04:00
Chirag Shah
42ed3bd77f zebra: add netlink tunnel msg to dump routine
This patch parses the vxlan vnifilter rtm tunnel
message, which contains the vni mapping to the vxlan device.
The new notifications are RTM_NEWTUNNEL,
RTM_DELTUNNEL, and RTM_GETTUNNEL.

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v5.18-rc7&id=7b8135f4df98b155b23754b6065c157861e268f1

Testing Done:

2022/05/18 00:34:25 ZEBRA: netlink_recv_msg: << netlink message dump
[recv]
2022/05/18 00:34:25 ZEBRA: nlmsghdr [len=36 type=(120) NEWTUNNEL
flags=(0x0000) {} seq=0 pid=0]
2022/05/18 00:34:25 ZEBRA:   tnlm [family=(7) AF_BRIDGE ifindex=46
2022/05/18 00:34:25 ZEBRA:   vni_start 4001, vni_end 0

Signed-off-by: Chirag Shah <chirag@nvidia.com>
2022-05-18 07:56:44 -07:00
Chirag Shah
47e2eb270d zebra: netlink registry rtm tunnel notif
The kernel supports a (l3vni) vni filter on an l3vxlan
device, similar to vlan filtering on a bridge device.

To receive the netlink notifications, FRR needs to register
for the new netlink RTNLGRP_TUNNEL multicast group.
This group has to be registered via an additional
socket option as it is beyond the bitmap size.

kernel patches:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v5.18-rc7&id=7b8135f4df98b155b23754b6065c157861e268f1

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v5.18-rc7&id=f9c4bb0b245cee35ef66f75bf409c9573d934cf9

Ticket:#3073812
Testing Done:

Signed-off-by: Chirag Shah <chirag@nvidia.com>
2022-05-18 07:56:35 -07:00