mirror_corosync

mirror of https://git.proxmox.com/git/mirror_corosync synced 2026-01-14 12:34:51 +00:00

Author	SHA1	Message	Date
Christine Caulfield	1ba03a3816	icmap: fix the icmap_get__r functions Make the icmap_r functions read from the specified map rather than the global map. Also include icmap_get_string_r() which seems to have been missed out. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2019-11-18 16:29:57 +01:00
Jan Friesse	5731af2782	logging: Add CS_PRI_NODE_ID and CS_PRI_RING_ID Previously node id was logged either as a %d (most often), %u, %x or PRI.32 and ring id either as %lld, %llx with various separators (., :, /) between rep nodeid and seq. This seems to cause confusion. This patch adds macros CS_PRI_NODE_ID, CS_PRI_RING_ID and CS_PRI_RING_ID_SEQ (CS prefix = corosync, PRI modeled in spirit of inttypes.h PRIx32) and makes code use them. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2019-07-03 10:53:52 +02:00
Jan Friesse	72737d3929	udpu: Drop packets from unlisted IPs This feature allows corosync to block packets received from unknown nodes (nodes with IP address which is not in the nodelist). This is mainly for situations when "forgotten" node is booted and tries to join cluster which already removed such node from configuration. Another use case is to allow atomic reconfiguration and rejoin of two separate clusters. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2019-05-29 16:30:10 +02:00
Jan Friesse	41f9e966bb	cpg: Add CPG_REASON_UNDEFINED Previously the reason field for the member_list items in cpg_totem_confchg_fn was unset, which may be a little confusing. Solution is to add a special value CPG_REASON_UNDEFINED and use it for the member_list items. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2019-04-16 14:49:10 +02:00
Jan Friesse	2ab4d41886	totemip: Use AF_UNSPEC for ipv4-6 and ipv6-4 AF_UNSPEC returns different results than AF_INET/AF_INET6, because the nsswitch.conf search runs in order and stops asking other modules once the current module succeeds. Example of difference between previous and new code when ipv6-4 is used: - /etc/hosts contains test_name with an ipv4 - previous code called AF_INET6 where /etc/hosts failed so other methods were used which may return IPv6 addr -> result was either fail or IPv6 address. - new code calls AF_UNSPEC returning IPv4 defined in /etc/hosts -> result is IPv4 address New code behavior should solve problems caused by nss-myhostname. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2019-01-11 09:37:30 +01:00
Jan Friesse	a84ade701c	totemconfig: Enhance totem.ip_version Originally totem.ip_version was used to force ip version used by totem. With Knet this variable didn't make too much sense so it was not used. Sadly rely only on DNS resolver order doesn't always work (RFC is quite complicated, but if IPv6 is not configured then IPv4 is preferred), what we tried to solve by forcing IPv6 and only if that fails, use IPv4. Sadly this collides with nss_myhostname which is able to return every local address and today system usually have at least one autogenerated link-local IPv6 address so it is able to "overwrite" /etc/hosts. Solution is to enhance totem.ip_version and use it also for Knet. totem.ip_version is now just a flag for resolver and can have four states: ipv4 (only IPv4 is used), ipv6 (only IPv6 is used), ipv4-6 (ask IPv4 first and if it fails ask for IPv6) and ipv6-4 (ask IPv6 first and if it fails ask for IPv4). Default for Knet and UDPU transports is ipv6-4, for UDP it's ipv4, because autogenerated mcast addr doesn't play too well with ipv6-4. So everywhere where nss_myhostname becomes problem, it's just possible to set totem.ip_version to ipv4-6. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-12-14 10:56:06 +01:00
Jan Friesse	82f35f1720	log: Implement support for reopening log files Feature depends on existence of libqb function qb_log_file_reopen. New function call is added into CFG service API. This function is used by corosync-cfgtool which now accepts -L parameter. Finally, logrotate "postrotate" script is calling corosync-cfgtool -L to notify corosync, instead of using copytruncate option. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-10-16 14:46:52 +02:00
Jan Friesse	40a13843d1	build: Remove totempg shared library leftovers Because totempg is not distributed it doesn't make sense to distribute totem header files. Also pkgconfig file should not be created any more. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-09-27 13:02:05 +02:00
Chris Walker	51989b4a0a	Add option to force cluster into GATHER state Signed-off-by: Chris Walker <cwalker@cray.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-09-07 13:27:36 +02:00
Chris Walker	3f7d2cf6aa	Add token_warning configuration option Token_warning is used to present information about when the token was last received. Signed-off-by: Chris Walker <cwalker@cray.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-08-14 10:34:49 +02:00
Jan Friesse	f286388275	cmap: Fix strncpy warning in cmap_iter_next cmap_iter_next, in contrast to its icmap counterpart, copies key name into user preallocated space. In the worst case, key name may be CMAP_KEYNAME_MAXLEN, so cmap_iter_next then needs CMAP_KEYNAME_MAXLEN + additional byte to store zero. strncpy was copying only CMAP_KEYNAME_MAXLEN characters so there was possibility of unterminated string. Patch solves this by using memcpy and always adding trailing zero. Documentation was improved suggesting minimum size of keyname buffer to be CMAP_KEYNAME_MAXLEN + 1. Also sam and quorumtool were using too short buffer so they are fixed too. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-08-13 09:00:41 +02:00
Jan Friesse	69857efb5b	totem: Display IP of sender To make finding victim of incompatible messages easier, IP of sender is logged. Propagating IP in layers makes patch slightly larger. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-03-16 13:58:15 +01:00
Jan Friesse	0c509a25a7	totemsrp: Add magic and version into header Magic number (0xC070) together with version in every packet is used for detecting that other node is really Corosync 3.x. Endian_detector field is removed and magic number is now used instead. If received packet magic number differs, guessing is used to show more about the source (Corosync 2.3+, 2.2 are quite reliable, Knet and unencrypted Corosync 2.1/2.0/1.x/OpenAIS are semi-reliable and encrypted Corosync 2.1/2.0/1.x/OpenAIS are quite unreliable). Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-03-16 13:57:55 +01:00
Christine Caulfield	2c20590d16	knet: Always use link0 for loopback Even if it's not used for anything else. Also, make cfgtool show the correct link ID when links are not contiguous Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-03-01 14:23:20 +01:00
Christine Caulfield	fc8580bdbf	totem: Use nodeid ONLY in srp_addr This shrinks the srp_addr (and consequently every packet sent by corosync) so that instead of containing loads of IP addresses to identify a node, it just sends the nodeid. This then allows us to make ring0 optional and replaceable when running knet. It also means that we need some other way of identifying the local node in corosync.conf, so the nodelist.node.name entry is now mandatory and is mapped to the local host using the same algorithm as used in cman. This code needs LOTS of testing as it touches a huge amount of totemsrp and totemconfig. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-03-01 14:18:51 +01:00
Fabio M. Di Nitto	1411608a81	[build] fix build with non-standard knet location Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-02-05 15:57:12 +01:00
Jan Friesse	11fa527ed4	logging: Close before and open blackbox after fork Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-30 13:21:52 +01:00
Jan Friesse	79dba9c51f	logging: Make blackbox configurable Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-30 13:21:48 +01:00
yuskiida	e7734fab70	build: Add the headers necessary for RPM build Signed-off-by: yuskiida <yusk.iida@gmail.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-11 14:47:46 +01:00
Jan Friesse	32535b842c	totemudpu: Export and rename UDPU_FRAME_SIZE_MAX Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2018-01-09 17:46:25 +01:00
Christine Caulfield	98bb0c78c8	config: Allow selection of crypto_model KNET has options for nss or openssl crypto libraries, make this available to corosync. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2018-01-05 15:25:17 +01:00
Christine Caulfield	45fe19ed86	stats: Don't display errors when reading knet stat Only add the knet handle stat keys if we are actually running knet. This prevents errors occurring when iterating through all of the stats keys Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-03 13:40:41 +01:00
Bin Liu	250750b829	cfg: nodeid should be unsigned int nodeid in struct req_lib_cfg_get_node_addrs is "unsigned int", so the function corosync_cfg_get_node_addrs should have its param "nodeid" to be unsigned int. Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-11-01 17:34:04 +01:00
Christine Caulfield	d9dfd41e4e	stats: Add cmap key to clear the various stats. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-31 17:39:14 +01:00
Jan Pokorný	5ed5282cb5	logsys: Avoid redundant callsite section checking Previously, corosync executable was repeatedly (proportionally to the count of LOGSYS_DECLARE_SUBSYS macro applications involved in the constituent source files) checking the same for no gain in the pre-main startup. This is not needed since nothing changes with static data shared within the same program space (it may have been a different story once upon a time if loadable modules were in use), so make that happen in (one-off per executable) LOGSYS_DECLARE_SYSTEM instead. Libqb offers its own ready-made macro to that effect, simply to isolate the inner peculiarities from the library user (that should not be required to understand anything about the orphan sections and respective autocreated symbols to denote their boundaries). As it is compile-time conditionalized in the same way, just use it directly instead. As a value added, corosync will be kept up to date about the possibly growing set of the logging-sanity checks as it gets compiled with newer and newer libqb versions (their header files, for that matter). Signed-off-by: Jan Pokorný <jpokorny@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-23 18:09:32 +02:00
Christine Caulfield	16f616b65d	knet: Add support for knet compression Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-10-23 17:30:25 +02:00
Christine Caulfield	294a629fb5	config: Allow dynamic link configuration Now we are using knet, it's possible to dynamically add, remove and reconfigure links on the fly. Also print 'n' for non-existent knet links. This will show up only on loopback links >0. But it looks better than 'status =' Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-21 17:16:21 +02:00
Masse Nicolas	5b38aa721a	totemudp: Retry if bind fails If bind call fails it's retried for BIND_MAX_RETRIES. If it's still unsuccessful, corosync exits instead of working incorrectly. Slightly modified by reviewer. Signed-off-by: Masse Nicolas <nicolas.masse@stormshield.eu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-09-19 12:44:26 +02:00
Christine Caulfield	ed235edfe3	stats: add knet 'handle' stats knet handle stats show compression and crypto statistics. With these you can see the effectiveness of compression and the overheads of both crypto and compression. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-08-23 14:18:59 +02:00
Christine Caulfield	9da89f32c2	CFG: Remove ring-reenable code RRP doesn't exist any more so all the ring re-enable code is redundant. I've removed it from the library and all the code that does anything, but I've left the hole in the IPC just in case old libraries are hanging around. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-08-03 14:32:02 +02:00
Christine Caulfield	55c3dcb76d	stats: Add map with on-demand statistics Icmap is factored out so it's possible to add other maps for cmap. API call to switch maps from application end is added. Corosync-cmapctl is enhanced with -m option. Stats contains all statistics previously found in runtime.connections, runtime.services and runtime.totem prefixes together with new knet related. All stats are read only. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-07-27 15:53:04 +02:00
Jan Friesse	cf18736d52	totemconfig: Make crypto work again Knet needs longer key and supports various key lengths. Split TOTEM_PRIVATE_KEY_LEN into TOTEM_PRIVATE_KEY_LEN_MIN and TOTEM_PRIVATE_KEY_LEN_MAX (both using KNET_*_KEY_LEN). Fix incorrect "Could only read..." message. Make sure key is properly initialized/zeroed. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2017-07-03 13:19:02 +02:00
Michael Jones	afd97d7884	coroapi: Use size_t for private_data_size Unsigned int and size_t represent two different concepts. Same problem was present in ipc_glue. Signed-off-by: Michael Jones <jonesmz@jonesmz.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-05-29 17:23:37 +02:00
Christine Caulfield	16770a4153	totem: Fix buffer sizes knet needs buffers to be KNET_MAX_PACKET_SIZE or messages will get lost or corrupted. UDPU packets shouldn't be that big so I introduced UDP_FRAME_SIZE_MAX for that transport. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-03-02 14:57:39 +00:00
Christine Caulfield	c0f1d576d6	knet: Fix MTU sizes & allow transport config in corosync.conf Corosync layers don't need to know the knet MTU size - this way corosync fragments buffers only when they get larger than the KNET buffer size (64K) and knet fragments below that based on the actual MTU and transport considerations. It is also now possible to configure knet to use UDP or SCTP transports in corosync.conf. This is currently done per-link so if you have more than 1 link you need several interface{} stanzas inside totem{} to make it use other than the default of UDP. if it's useful I might add the option of a global default. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-02-13 16:54:30 +00:00
Christine Caulfield	7cec6a131d	knet: Allow configuration of more params knet_pmtud_interval & knet_pong_count Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-11-15 09:32:09 +00:00
Jan Friesse	7a8732d85a	list: Remove list.h List.h is no longer needed. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2016-10-27 14:56:52 +02:00
Michael Jones	b4c06e52f3	list: Replace uses of list.h with qblist.h Signed-off-by: Michael Jones <jonesmz@jonesmz.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-10-27 14:56:52 +02:00
Christine Caulfield	268cde6ee4	totem: Add Kronosnet transport. This is a big update that removes RRP & MRP from the codebase and makes knet the default transport for corosync. UDP & UDPU are still (currently) supported but are deprecated. Also crypto and multiple interfaces are only supported over knet. To compile this codebase you will need to install libknet from https://github.com/fabbione/kronosnet The corosync.conf(5) man page has been updated with info on the new options. Older config files should still work but many options have changed because of the knet implementation so configs should be checked carefully. In particular any cluster using RRP over UDP or UDPU will not start as RRP is no longer present. If you need multiple interface support then you should be using the knet transport. Knet brings many benefits to the corosync codebase, it provides support for more interfaces than RRP (up to 8), will be more reliable in the event of network outages and allows dynamic reconfiguration of interfaces. It also fixes the ifup/ifdown and 127.0.0.1 binding problems that have plagued corosync/openais from day 1 Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-10-11 10:09:42 +01:00
Christine Caulfield	3a5d51fca7	votequorum: Fix up quorum/nodelist callbacks This patch tidies the two state change callbacks and explains them in the man page: The difference between votequorum_nodelist_notification_t and votequorum_quorum_notification_t is subtle but important. The 'nodelist' callback is sent at the start of a cluster state transition and contains the new ring_id and only the list of nodes that are included in the sync state - ie only active nodes. No quorum information is included this callback because it is not available at that time. The 'quorum' callback is sent after the cluster state transition has completed and does contain quorum information. In addition, the nodelist contains a list of all nodes known to votequorum (whether up or down) and their state as well as information about the quorum device attached (if any). quorum callbacks will not be sent for qdevice up and down events unless they affect quorum. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:39 +02:00
Christine Caulfield	cf0028c86e	votequorum: split callbacks into nodelist and quorum This split is needed for qdevice, so that it gets the ring_id and nodelist as part of the sync process and not afterwards - when quorum has been calculated. As this is an unsupported API I'm not too worried about breaking existing code - all the clients I know of are using the quorum API anyway as they should be. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:38 +02:00
bliu	7e36b89664	low:typo fix in sam.h Signed-off-by: bliu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-06-27 12:47:34 +02:00
Michael Jones	dfae95cce9	Adds doxygen stubs to include directory Signed-off-by: Michael Jones <jonesmz@jonesmz.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-05-12 15:59:48 +02:00
Ferenc Wágner	aff57f9996	votequorum: Make sure cs_error_t is defined Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2015-08-28 08:59:08 +02:00
Ferenc Wágner	c52933a29f	Close Doxygen group in include/corosync/cmap.h This avoids warning: end of file while inside a group. Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2015-08-26 09:26:26 +02:00
Ferenc Wágner	2250d812c1	Doxygen fix for cmap_iter_next() Remove the extra cmap_ prefix of the iter_handle parameter. Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2015-08-26 09:26:26 +02:00
Christine Caulfield	8cc8e51363	cpg: Add support for messages larger than 1Mb If a cpg client sends a message larger than 1Mb (actually slightly less to allow for internal buffers) cpg will now fragment that into several corosync messages before sending it around the ring. cpg_mcast_joined() can now return CS_ERR_INTERRUPT which means that the cpg membership was disrupted during the send operation and the message needs to be resent. The new API call cpg_max_atomic_msgsize_get() returns the maximum size of a message that will not be fragmented internally. New test program cpghum was written to stress test this functionality, it checks message integrity and order of receipt. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2015-03-05 16:45:15 +00:00
Jan Friesse	03f95ddaa1	Adjust MTU for IPv6 correctly The IPv6 header is 20 bytes larger than the IPv4 header. This fact was not taken into account so IPv6 packets were larger than MTU resulting in fragmentation. Solution is to subtract correct IP header size. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2014-10-01 14:20:21 +02:00
Jan Friesse	17488909d4	votequorum: Make qdev timeout in sync configurable Configuration option quorum.device.sync_timeout is available for setting qdevice poll timeout for synchronization phase. Default value is 30 sec. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2014-08-05 17:22:52 +02:00
Jan Friesse	b8902464d1	votequorum: Add ring id to poll call If votequorum service receives incorrect (not current) ringid, call is ignored and CS_ERR_MESSAGE_ERROR is returned. This and previous commits makes incompatible changes in votequorum API/ABI, so library version is increased. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2014-08-05 17:22:41 +02:00
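
Commit 5731af2782 above describes format macros modeled on the inttypes.h PRIx32 family. A minimal sketch of that idea follows. The EX_* names, their expansions, and the format_ring_id helper are hypothetical stand-ins for illustration; the real CS_PRI_* definitions live in the corosync headers and may differ.

```c
#include <inttypes.h>
#include <stdio.h>

/* Hypothetical stand-ins; NOT the actual corosync CS_PRI_* definitions. */
#define EX_PRI_NODE_ID     "u"     /* node id printed as unsigned decimal */
#define EX_PRI_RING_ID_SEQ PRIx64  /* ring sequence printed as hex */
/* One format string with a single, fixed separator between the parts. */
#define EX_PRI_RING_ID     "%" EX_PRI_NODE_ID ".%" EX_PRI_RING_ID_SEQ

/* Format a ring id (representative node id plus sequence) into buf.
 * Returns what snprintf returns: the length the full string needs. */
static int format_ring_id(char *buf, size_t len,
                          unsigned int rep_nodeid, uint64_t seq)
{
    return snprintf(buf, len, EX_PRI_RING_ID, rep_nodeid, seq);
}
```

With these assumed expansions, format_ring_id(buf, sizeof buf, 1, 0x2a) writes "1.2a". Keeping the conversion specifiers in one place means every log call prints node and ring ids the same way.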

589 Commits
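
The buffer-size issue described in commit f286388275 (a key name of maximum length leaves no room for the terminator when strncpy copies exactly that many characters) can be sketched as below. EX_KEYNAME_MAXLEN and copy_key_name are hypothetical names for illustration, not the cmap API. The sketch only shows the general pattern the commit message names: a bounded memcpy followed by an explicit trailing zero.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for CMAP_KEYNAME_MAXLEN; the real value is
 * defined in the corosync headers. */
#define EX_KEYNAME_MAXLEN 255

/* Copy a key name into dst, which must hold at least
 * EX_KEYNAME_MAXLEN + 1 bytes. At most EX_KEYNAME_MAXLEN characters are
 * copied and dst is always zero-terminated. Returns the copied length. */
static size_t copy_key_name(char *dst, const char *src)
{
    size_t len = 0;

    /* Bounded length scan: never look past EX_KEYNAME_MAXLEN characters. */
    while (len < EX_KEYNAME_MAXLEN && src[len] != '\0') {
        len++;
    }
    memcpy(dst, src, len);
    dst[len] = '\0'; /* the extra byte is what the +1 in the buffer is for */
    return len;
}
```

A caller would declare the destination as char name[EX_KEYNAME_MAXLEN + 1], matching the minimum buffer size the commit message says the documentation now suggests.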