mirror_corosync

mirror of https://git.proxmox.com/git/mirror_corosync synced 2026-01-12 22:16:52 +00:00

Author	SHA1	Message	Date
Christine Caulfield	fc8580bdbf	totem: Use nodeid ONLY in srp_addr This shrinks the srp_addr (and consequently every packet sent by corosync) so that instead of containing loads of IP addresses to identify a node, it just sends the nodeid. This then allows us to make ring0 optional and replaceable when running knet. It also means that we need some other way of identifying the local node in corosync.conf, so the nodelist.node.name entry is now mandatory and is mapped to the local host using the same algorithm as used in cman. This code needs LOTS of testing as it touches a huge amount of totemsrp and totemconfig. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-03-01 14:18:51 +01:00
Rytis Karpuška	105f3ae98c	totempg: Fix corrupted messages Commit `899cb29983` changed copy_len to iovec[i].iov_len, assuming, copy_len is always the same as iovec[i].iov_len under those circumstances, but it missed the possability of small message being partly put at the end of packet, which cuts this message in two parts and therefore making copy_len not equal to iovec[i].iov_len. This is revert of `899cb29983` Signed-off-by: Rytis Karpuška <rytisk@neurotechnology.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-02-09 17:38:05 +01:00
Rytis Karpuška	899cb29983	totempg: use iovec[i].iov_len instead of copy_len To be more explicit that we are copying whole message. Related to `0ebae6b47d`. Signed-off-by: Rytis Karpuška <rytisk@neurotechnology.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-02-08 09:30:07 +01:00
Rytis Karpuška	0ebae6b47d	totempg: Fix fragmentation segfault The problem was that two or more messages were concatenated together during fragmentation in mcast_msg() function. In specific case, message of just short of 1MB was provided for mcast_msg() and it happened so, that the remainder (212 bytes to be exact) left some free space in packet, therefore branch if ((copy_len + fragment_size) < (max_packet_size - sizeof (unsigned short))) { ... was selected and this was the last mesage in provided iovec. Then, on the second call, came another big message (about 300KB ) and during fragmentation mcast.fragmented was set to 1. On the other end, while receiving messages, due to missing mcast.fragmentation==0 those two messages were concatenated and therefore assembly->data array overflowed overwriting linked list pointers and offset (which happened to be set to 0 and that 300KB message was being copied from the beginning again). After whole 300KB message has been sent, mcast.fragmentation==0 arrived and totempg_deliver_fn() tried to move assembly structure to assembly_list_free list, but as linked list pointers has been overriden, segfault occured. Signed-off-by: Rytis Karpuška <rytisk@neurotechnology.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-02-08 09:29:22 +01:00
Fabio M. Di Nitto	1411608a81	[build] fix build with non-standard knet location Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-02-05 15:57:12 +01:00
Jan Friesse	11fa527ed4	logging: Close before and open blackbox after fork Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-30 13:21:52 +01:00
Jan Friesse	79dba9c51f	logging: Make blackbox configurable Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-30 13:21:48 +01:00
Jan Friesse	1fba1b83aa	build: Replace -lknet with autoconf generated vars Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-25 16:08:09 +01:00
Jan Friesse	589ed92505	build: Remove rdma/ibverbs Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-25 16:08:07 +01:00
Christine Caulfield	31ddba64a2	config: Don't fudge port numbers When I was adding knet I wanted the port numbers to default to the base port number + the linknumber. However I seem to have messed this up such that any port number specified in the config file has the link number added to it. Which is almost certainly not what people would expect. This patch sets it right. If a port number is not specified then 5405+linknumber is used. If a port number IS specified then that actual number is used. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-18 16:31:24 +01:00
Christine Caulfield	22ae4cacda	knet: Allow ping_timers to be auto-configured knet ping_timers are auto-configured according to token value. This patch also fixes some knet config bugs that resulted in defaults not being applied when values were removed from corosync.conf. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-15 15:08:19 +01:00
yuskiida	e7734fab70	build: Add the headers necessary for RPM build Signed-off-by: yuskiida <yusk.iida@gmail.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-11 14:47:46 +01:00
Christine Caulfield	236032f7b5	config: if local node addr is wrong, fail with a sensible message If no valid local address is found in corosync.conf then corosync exits with: "parse error in config: No multicast port specified" This is because of the config change for knet that always populates the interfaces. The old error of "no interfaces found" was only slightly better anyway IMHO. This patch adds an explicit check that local_node_pos has been set in icmap and uses that to determine if a valid local address has been found. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-09 17:50:12 +01:00
Jan Friesse	96cb977880	totemknet: Drop truncated packets on receive This is backport of part of "totemudpu: Scale receive buffer" patch. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-09 17:46:31 +01:00
Jan Friesse	0f1813adff	totemudp: Make use of UDP_RECEIVE_FRAME_SIZE_MAX Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-09 17:46:28 +01:00
Jan Friesse	32535b842c	totemudpu: Export and rename UDPU_FRAME_SIZE_MAX Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-09 17:46:25 +01:00
Jan Friesse	3982f795d5	totemconfig: Fix UDP autogeneration of mcast addr Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-09 17:46:21 +01:00
Jan Friesse	155c0d4052	totemudpu: Scale receive buffer Receive buffer should be based on PROCESSOR_COUNT_MAX and not static buffer. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-09 17:46:04 +01:00
Christine Caulfield	98bb0c78c8	config: Allow selection of crypto_model KNET has options for nss or openssl crpyto libraries, make this available to corosync. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-05 15:25:17 +01:00
Christine Caulfield	2a6a571c06	config: Allow links to have different ip_versions knet allows links to have different IP versions - proivided they all match per link. So don't force them all to be the same. I've added a check here to make sure that all nodes on the same link are using the same IP version. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-12-22 17:15:19 +01:00
Bin Liu	b1d3eca448	wd: fix snprintf warnings When running ./configure --enable-watchdog, gcc 7.2.1 will report warnings for snprintf. This patch fixes the warnings. Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-12-01 17:23:54 +01:00
Christine Caulfield	1ca72a1154	totemsrp: Revert totemsrp_get_ifaces() changes In my enthusiasm for removing code while integrating knet I also deleted the correct code for returning IP address for a node, so that only the IP addres of the local node was ever returned. This commit restores the the previous code. Also, because we always return INTERFACE_MAX interfaces now (they don't have to be contiguous) set ss_family to zero if that interface is not in use so that downstream apps know and don't display a lot of 0.0.0.0 Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-30 16:59:05 +01:00
Bin Liu	af21baf0ff	totemconfig: remove duplicate aes256 test Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-29 18:18:52 +01:00
Jan Friesse	154895dfbe	sync: Call sync_init of all services at once This patch solves situation which can happen very rearly: - Node B is running - Node A is started and tries to create singleton membership. It also initialize service S which tries to send message during initialization - Just before node A finished move to operational state, it gets Node B multicast message so moves to gather state - Node A and B creates membership and moves to operational state and sync is started - Node A and B receives message sent by node A during initialization of service S - Node A exits before sync of service is finished In this situation, node B may never execute sync_init for service S. So node B service S is not aware of existence of node A but it received message from it. Similar situation can theoretically also happen during merge. Solution is to change flow of sync, so now it looks like: - Build service_list - Call sync_init for all local services - Send service_list - Receive service_list from all members and send barier - For all services: - Receive barier - Call sync_activate if this is not first service - Call sync_process for next service or finish sync if previous this service is the last one - Send barier Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2017-11-16 15:22:19 +01:00
Jan Friesse	499eaac80f	sync: Remove unneeded determine sync code Code was used for compatibility with old sync v1 (in needle this was deleted and previous version 2 became v1), and it's no longer needed. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2017-11-16 15:22:14 +01:00
Christine Caulfield	1df7eca5ad	stats: Add some missing knet stats Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-16 08:35:50 +01:00
Ferenc Wágner	09b0123d58	Send corosync startup notification to systemd This enables starting the daemon directly in the service file, because dependent units won't be started until initialization is complete. Signed-off-by: Ferenc Wágner <wferi@debian.org> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-09 09:49:18 +01:00
Jan Friesse	f05d1c9293	coroparse: Do not convert empty uid, gid to 0 When uid (or gid) value was empty string it was incorrectly converted to 0. Solution is to check input string emptines. Thanks Bin Liu <bliu@suse.com> for reporting the bug. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Bin Liu <bliu@suse.com>	2017-11-06 09:37:54 +01:00
Christine Caulfield	45fe19ed86	stats: Don't display errors when reading knet stat Only add the knet handle stat keys if we are actually running knet. This prevents errors occurring when iterating through all of the stats keys Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-03 13:40:41 +01:00
Christine Caulfield	d9dfd41e4e	stats: Add cmap key to clear the various stats. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-31 17:39:14 +01:00
Bin Liu	cf339c20c3	totemconfig: generate mcast icmap items for UDP Generating mcastaddr and mcastport in icmap make sense only for UDP transport. Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-30 14:14:48 +01:00
Bin Liu	99567f0e65	totemconfig: add nodeid check for knet Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-30 13:02:03 +01:00
Christine Caulfield	396bca4739	config: Fix memory leak totem_volatile_config_set_string_value was not properly freeing memory. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-23 17:31:14 +02:00
Christine Caulfield	16f616b65d	knet: Add support for knet compression Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-23 17:30:25 +02:00
Jan Friesse	165d748c04	cmap: Remove noop highest config version check Signed-off-by: Jan Friesse <jfriesse@redhat.com>	2017-10-11 17:11:33 +02:00
Jonathan Davies	2d0e8114ba	cmap: don't shutdown highest config_version node Scenario: 1. node A starts corosync with config_version = 2, nodelist = {A, B} 2. node B starts corosync with config_version = 1, nodelist = {A, B} corosync.conf(5) says the config_version option is "used to prevent joining old nodes with not up-to-date configuration." So expected outcome is: * corosync on node A remains alive * corosync on node B exits Actual outcome is: * corosync on node A exits * corosync on node B exits Explanation of actual behaviour: * Host A will have cmap_my_config_version = 2 but cmap_highest_config_version_received = 1, so will shutdown in cmap_sync_activate because these are not equal. * Host B will have cmap_my_config_version = 1 but cmap_highest_config_version_received = 2, so will shutdown in cmap_sync_activate because these are not equal. Instead, node A should consider its own config_version in the calculation of the highest config_version, i.e. cmap_highest_config_version_received = 2, and so not shutdown in cmap_sync_activate. Signed-off-by: Jonathan Davies <jonathan.davies@citrix.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-11 17:07:35 +02:00
Kazunori INOUE	576a493d1e	totemudp: Remove memb_join discarding This is already implemented in totemsrp in much cleaner way (added by commit `ab8942f626`). Signed-off-by: Kazunori INOUE <inouekazu@intellilink.co.jp> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-02 11:33:58 +02:00
Edwin Torok	15383b3eb3	votequorum: make atb consistent on nodelist reload When the cluster changes from even sized to odd sized corosync disables auto-tie-breaker if wait_for_all is not enabled. However when changing from odd sized to even sized it doesn't reenable it, causing auto_tie_breaker to be inconsistent across the cluster: the newly added node and any nodes that restart corosync will have it, but all the previously running nodes won't. Signed-off-by: Edwin Torok <edvin.torok@citrix.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-26 18:05:17 +02:00
Fabio M. Di Nitto	76591baa4a	totem: Remove unnecessary NSS headers Also fix corosync.spec.in to depend on libknet. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-22 10:27:01 +02:00
Christine Caulfield	294a629fb5	config: Allow dynamic link configuration Now we are using knet, it's possible to dynamically add, remove and reconfigure links on the fly. Also print 'n' for non-existant knet links. This will show up only on loopback links >0. But it looks better than 'status =' Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-21 17:16:21 +02:00
Masse Nicolas	5b38aa721a	totemudp: Retry if bind fails If bind call fails it's retried for BIND_MAX_RETRIES. If it's still unsuccessful, corosync exists instead of working incorrectly. Slightly modified by reviewer. Signed-off-by: Masse Nicolas <nicolas.masse@stormshield.eu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-19 12:44:26 +02:00
Ferenc Wágner	b7b318b86f	wd: default to not using a watchdog Signed-off-by: Ferenc Wágner <wferi@debian.org> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-14 17:40:48 +02:00
Ferenc Wágner	151ed9dfe5	wd: remove extra capitalization typo Signed-off-by: Ferenc Wágner <wferi@debian.org> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-12 14:23:04 +02:00
Jonathan Davies	3296a0d41a	totemknet: fix debug message typo Signed-off-by: Jonathan Davies <jonathan.davies@citrix.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-11 11:51:16 +02:00
Ferenc Wágner	0f33464531	wd: fix typo Signed-off-by: Ferenc Wágner <wferi@debian.org> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-11 11:40:12 +02:00
Christine Caulfield	ed235edfe3	stats: add knet 'handle' stats knet handle stats show compression and crypto statistics. With these you can see the effectiveness of compression and the overheads of both crypto and compression. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-08-23 14:18:59 +02:00
Christine Caulfield	01495f650c	main: use syslog & printf directly for early log messages libqb seems funny about logging things before its fully configured. This corosync commit didn't help either: `8b6bd86a55` So to make sure that messages about the config file not being opened get delivered to the user/syslog we send them directly. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-08-22 09:51:09 +01:00
Christine Caulfield	9898fc8760	totempg: Allow space for incoming overflow totempg needs to store the current message + any overflow for the next message which can be up to (nearly) the MTU size. in knet that's large, but for UDP it's just 1500. The reason we've never seen it before is because the actual max message size is 1024 less than 1MB and after all the headers are stripped out the overflow is usually 1024 bytes or less. The 1024*1024 size of the assembly buffer is large enough to hold a max message (1047552) + 1024 bytes of a new UDP message. So we never saw any problems. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-08-14 14:04:31 +01:00
Chrissie Caulfield	f4a7e54d45	totemknet: Use knet's LOOPBACK transport (#236 ) knet now has a built-in LOOPBACK transport so use that rather than special-casing it for ourself. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-08-04 12:59:16 +01:00
Christine Caulfield	9da89f32c2	CFG: Remove ring-reenable code RRP doesn't exist any more so all the ring re-enable code is redundant. I've removed it from the library and all the code that does anything, but I've left the hole in the IPC just in case old libraries are hanging around. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-08-03 14:32:02 +02:00

1 2 3 4 5 ...

1959 Commits