mirror_corosync

mirror of https://git.proxmox.com/git/mirror_corosync synced 2026-01-20 10:43:04 +00:00

Author	SHA1	Message	Date
Bin Liu	d2a5e1442e	logconfig: Do not overwrite logger_subsys priority logfile_priority and syslog_priority could be modified by logging.logger_subsys.{logfile_priority\|syslog_priority}. which could lead to the following output(which are at notice level): corosync[21419]: [QUORUM] Using quorum provider corosync_votequorum corosync[21419]: [QUORUM] Members[1]: 1084777643 corosync[21419]: [QUORUM] This node is within the primary component and will provide service. corosync[21419]: [QUORUM] Members[3]: 1084777563 1084777584 1084777643 even the syslog_priority is warning. This patch could avoid the overwrite. Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-03-10 09:09:42 +01:00
Christine Caulfield	16770a4153	totem: Fix buffer sizes knet needs buffers to be KNET_MAX_PACKET_SIZE or messages will get lost or corrupted. UDPU packets shouldn't be that big so I introduced UDP_FRAME_SIZE_MAX for that transport. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-03-02 14:57:39 +00:00
Christine Caulfield	30771a39a8	main: Don't ask libqb to handle segv, it doesn't work segv should be handled by corosync, libqb is not the place to be handling emergency signals. This currently requires the head of libqb git tree to generate a blackbox & coredump in the event of a segfault, but it's better than the write() spin that currently happens. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2017-02-27 15:14:41 +00:00
Jan Friesse	8b6bd86a55	Logsys: Change logsys syslog_priority priority LibQB adds default "*" syslog filter so we have to set syslog_priority as low as possible so filters applied later in _logsys_config_apply_per_file takes effect. Signed-off-by: Jan Friesse <jfriesse@redhat.com>	2017-02-24 16:23:50 +01:00
Fabio M. Di Nitto	36ef2af5a7	knet: improve logging messages by adding knet subsystem Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2017-02-24 09:41:35 +01:00
Fabio M. Di Nitto	19232f6052	knet: Change nodeids to knet_node_id_t for new knet compatibility after some feedback on github, people prefers to have the option to support up to 64K node_id's. libknet added knet_node_id_t to mask the size and type, currently set to uint16_t. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2017-02-14 06:08:45 +01:00
Christine Caulfield	c0f1d576d6	knet: Fix MTU sizes & allow transport config in corosync.conf Corosync layers don't need to know the knet MTU size - this way corosync fragments buffers only when they get larger than the KNET buffer size (64K) and knet fragments below that based on the actual MTU and transport considerations. It is also now possible to configure knet to use UDP or SCTP transports in corosync.conf. This is currently done per-link so if you have more than 1 link you need several interface{} stanzas inside totem{} to make it use other than the default of UDP. if it's useful I might add the option of a global default. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-02-13 16:54:30 +00:00
Fabio M. Di Nitto	970549ddfc	knet: PMTUd data_mtu already accounts for IP and knet header overheads provide some more space for data and small (+1% perf boost) Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2017-02-11 06:41:38 +01:00
Fabio M. Di Nitto	18fef0ae7f	knet: switch from write to sendto() this provides another 9.6% performance boost on 2 node clusters Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2017-02-11 06:24:12 +01:00
Christine Caulfield	d9df98ceba	knet: Change nodeids to 8 bit for new knet compatibility I've also put an assert in totemknet_member_add() to check for invalid nodeids. Later on we need to fix the rest of the corosync code to only use 8bit nodeids (or force people to use UDPU if they want large nodeids). Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-02-03 09:38:32 +00:00
Christine Caulfield	2d478505e5	knet: Fix member_remove to shut down existing links first Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2017-01-16 13:16:15 +00:00
Christine Caulfield	029b8ebad6	knet: Reduce default pong count to 2 for faster startup The default PONG_COUNT of 5 made corosync slow to connect to other nodes. This helps.	2017-01-03 13:30:26 +00:00
Christine Caulfield	950cca886e	totemknet: Make it compile with kronosnet git master Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-12-22 10:25:11 +00:00
Takeshi MIZUTA	4939c75629	Remove redundant header file inclusion Signed-off-by: Takeshi MIZUTA <miz.take4@gmail.com> Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-12-05 09:59:08 +01:00
Bin Liu	819d66ca1c	Totempg: remove duplicate memcpy in mcast_msg func In function mcast_msg of totempg.c, line 923, there is a memcpy call in "else" branch, and also another memcpy out of the "else" branch, while the two calls have the same parameters. It is possibleto remove the memcpy in "else" branch. Signed-off-by: Bin Liu <bliu@suse.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-12-05 09:40:55 +01:00
Takeshi MIZUTA	034553c080	man: Modify man-page according to command usage Signed-off-by: Takeshi MIZUTA <miz.take4@gmail.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-12-01 16:32:42 +01:00
Takeshi MIZUTA	9c5b39d438	totempg: totempg_groups_join return valid error totempg_groups_join() is called by sync_init(). sync_init() judge that totempg_groups_join() failed if return code of totempg_groups_join() is -1. Therefore, the return code should return in -1 when totempg_groups_join() fails. Signed-off-by: Takeshi MIZUTA <miz.take4@gmail.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-11-23 09:22:21 +01:00
Christine Caulfield	401f483cce	knet: Support reload of link parameters Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-11-17 11:41:54 +00:00
Takeshi MIZUTA	f5dcc4a5f2	list: Unify the list processing with qb_list func Signed-off-by: Takeshi MIZUTA <miz.take4@gmail.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-11-15 12:19:13 +01:00
Christine Caulfield	7cec6a131d	knet: Allow configuration of more params knet_pmtud_interval & knet_pong_count Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-11-15 09:32:09 +00:00
Chrissie Caulfield	65219a6300	knet: Don't lose log messages when knet gets busy (#165 ) Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-11-14 15:01:34 +00:00
Jan Friesse	1f90c31ba7	list: Replace for_each by safe version where need Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2016-10-27 14:56:52 +02:00
Michael Jones	b4c06e52f3	list: Replace uses of list.h with qblist.h Signed-off-by: Michael Jones <jonesmz@jonesmz.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-10-27 14:56:52 +02:00
Christine Caulfield	86de6ce1e6	totem: add totemknet.[ch] it seems git is better at deleting files than adding them Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-10-13 08:46:34 +01:00
Michael Jones	a24d26c46a	cfg: Prevents use of uninitialized buffer Signed-off-by: Michael Jones <jonesmz@jonesmz.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-10-12 16:19:05 +02:00
Christine Caulfield	268cde6ee4	totem: Add Kronosnet transport. This is a big update that removes RRP & MRP from the codebase and makes knet the default transport for corosync. UDP & UDPU are still (currently) supported but are deprecated. Also crypto and mutiple interfaces are only supported over knet. To compile this codebase you will need to install libknet from https://github.com/fabbione/kronosnet The corosync.conf(5) man page has been updated with info on the new options. Older config files should still work but many options have changed because of the knet implementation so configs should be checked carefully. In particular any cluster using using RRP over UDP or UDPU will not start as RRP is no longer present. If you need multiple interface support then you should be using the knet transport. Knet brings many benefits to the corosync codebase, it provides support for more interfaces than RRP (up to 8), will be more reliable in the event of network outages and allows dynamic reconfiguration of interfaces. It also fixes the ifup/ifdown and 127.0.0.1 binding problems that have plagued corosync/openais from day 1 Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-10-11 10:09:42 +01:00
HideoYamauchi	f1ffe31ce5	coropase: Set a poll_period value for wd monitor Signed-off-by: HideoYamauchi <renayama19661014@ybb.ne.jp> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-10-06 15:48:38 +02:00
Christine Caulfield	c4683be9b0	votequorum: simplify reconfigure message handling As we now have update_node_expected_votes(), we can use that when receiving a new EXPECTED_VOTES value from another node rather than having our own loop. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-09-13 15:55:58 +01:00
Christine Caulfield	bd2e6b5d9d	votequorum: Don't update expected_votes display if value is too high If expected_votes was set via the library but the calculation decides it's too high, then an error is correctly returned but the value is still set in the nodes' expected_votes field and turns up in the corosync-quorumtool display. This patch separates out the quorum calculation from the updating of expected_votes per node to prevent this from happening. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-09-13 14:28:56 +01:00
Ferenc Wágner	cf10a754e9	Fix various typos occured -> occurred parantheses -> parentheses configuraton -> configuration aquire -> acquire retrive -> retrieve prefered -> preferred Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-09-12 09:50:11 +02:00
Jan Friesse	f837f95dfe	Config: Flag config uidgid entries Uidgid entries parsed from configuration files now has prefix (uidgid.config.) so they are distinguishable from dynamically added entries. Entries added from config file are pruned on reload if no longer exists in config file (dynamic one stays unaffected). Also whole uidgid.config. prefix is made read only. This make PCMK work again after configuration reload is called. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2016-08-04 16:13:48 +02:00
HideoYamauchi	71c9035c27	Low: totemsrp: Addition of the log. Signed-off-by: HideoYamauchi <renayama19661014@ybb.ne.jp> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-08-01 10:11:45 +02:00
Jan Friesse	1925074909	Fix few bugs found by coverity Signed-off-by: Jan Friesse <jfriesse@redhat.com>	2016-06-28 13:58:43 +02:00
Christine Caulfield	0665aca9e1	quorum: revert patch that adds qdevice (node 0) to quorum callback Revert patch 9f54f0a1fad7dad42c55562a50dfb9d773e6a660 as it causes more troubles than it solves. Code that uses the quorum nodelist to get a list of actual nodes in the cluster for communication break using this as well as the display from corosync-quorumtool Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:43 +02:00
Christine Caulfield	c9c6d9e30f	quorum: Return qdevice nodeid in the quorum callbacks (if active). Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:41 +02:00
Christine Caulfield	e41b256c67	votequorum: Allow wait_for_all with qdevice Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:39 +02:00
Christine Caulfield	98548e1880	qnetd: lms: Fix search for node/ring_id check We were looking for us in other node lists, rather than others in our nodelist. Also, remove debug print in votequorum.c Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:39 +02:00
Christine Caulfield	3a5d51fca7	votequorum: Fix up quorum/nodelist callbacks This patch tidies the two state change callbacks and explains them in the man page: The difference between votequorum_nodelist_notification_t and votequorum_quorum_notification_t is subtle but important. The 'nodelist' callback is sent at the start of a cluster state transition and contains the new ring_id and only the list of nodes that are included in the sync state - ie only active nodes. No quorum information is included this callback because it is not available at that time. The 'quorum' callback is sent after the cluster state transition has completed and does contain quorum information. In addition, the nodelist contains a list of all nodes known to votequorum (whether up or down) and their state as well as information about the quorum device attached (if any). quorum callbacks will not be sent for qdevice up and down events unless they affect quorum. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:39 +02:00
Christine Caulfield	cf0028c86e	votequorum: split callbacks into nodelist and quorum This split is needed for qdevice, so that it gets the ring_id and nodelist as part of the sync process and not afterwards - when quorum has been calculated. As this is and unsupported API I'm not too worried about breaking existing code - all the clients I know of are using the quorum API anyway as they should be. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com>	2016-06-28 13:58:38 +02:00
Jan Friesse	44df76a7ee	config: get_cluster_mcast_addr error is not fatal Signed-off-by: Jan Friesse <jfriesse@redhat.com>	2016-06-28 13:57:14 +02:00
Ferenc Wágner	c76ee39f61	Fix typo: Diabled -> disabled Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-06-22 14:26:48 +02:00
Ferenc Wágner	b1de8efd15	Fix typo: aquire -> acquire Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-06-22 14:26:28 +02:00
Ferenc Wágner	841f48e253	Fix typo: Uknown -> Unknown Signed-off-by: Ferenc Wágner <wferi@niif.hu> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-06-22 14:26:22 +02:00
Christine Caulfield	f2a1fcc5bf	logconfig: Fix logging reload disabling logfiles In my previous logconfig patch, adding a subsys so the logging stanzas could disable logging to a file, because the subsys closed the file used by the main logging. This patch only applies defaults to higher-level logging and non-deprecated keys. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-05-27 17:36:30 +02:00
yuusuke	2ef086bd9b	wd: Warn if values are out of range Signed-off-by: yuusuke <yusk.iida@gmail.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2016-05-27 10:38:30 +02:00
yuusuke	39cd6b3d1d	parser: WD Read type correctly from corosync.conf Signed-off-by: yuusuke <yusk.iida@gmail.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2016-05-27 10:36:24 +02:00
Christine Caulfield	571b1621e9	Add some more RO keys Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-05-24 12:33:55 +02:00
Christine Caulfield	125848d80a	Reapply config defaults corosync.conf reload There were several places where defaults were not restored if the keys were removed from corosync.conf and the file reloaded. This patch adds those back so that reloading corosync.conf has the expected effect when keys are deleted. Signed-off-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-05-24 12:33:35 +02:00
Jan Friesse	b93d75abc4	schedwrk: Cleanup and make it work on PPC BE Schedwrk is passing hdb handle (64-bit) to totempg_callback_token_create as a context. Context is defined to be pointer, so there is conversion function which stores 64-bit hdb_handle into pointer. Potentially, pointer can be 32-bit. This means, check part of hdb is discarded (and have to get special no_check value in schedwrk_do) later. This works quite well on 32-bit Little-Endian system. Sadly on Big-Endian system, check partition of hdb is stored instead of value. Result is error of hdb_handle_get call. Proposed solution is to pass handle pointer to totempg_callback_token_create as context. This means full hdb (check + value) can be used in schedwrk_do (easier detection of memory corruption). Main reason for this patch is to remove usage of pointer as integer value. Small drawback of given solution is that handle pointer must be memory allocated on heap or static memory, making API more bug-prone. Current usage of schedwrk API across corosync always use memory in .text section (safe), so it's not a problem. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2016-05-17 16:29:25 +02:00
Valentin Vidic	8d8d4a936a	wd: make watchdog device configurable Add configuration option resources.watchdog_device allowing runtime selection of watchdog device. Useful for newer servers having more than one watchdog available (IPMI and iTCO). Special value "off" disables watchdog in configuration rather than just using build options. Useful when watchdog device is needed elsewhere (SBD cluster stonith service). Signed-off-by: Valentin Vidic <Valentin.Vidic@CARNet.hr> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2016-05-03 15:47:15 +02:00

1 2 3 4 5 ...

1888 Commits