mirror_corosync

mirror of https://git.proxmox.com/git/mirror_corosync synced 2025-10-31 16:25:29 +00:00

Author	SHA1	Message	Date
Fabio M. Di Nitto	cc7bfeb462	votequorum: drop votequorum_qdevice_getinfo and collapse data into getinfo it's really pointless to have basically a duplicated API call to transfer one value and one name. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:17 +02:00
Fabio M. Di Nitto	65a6c29a31	votequorum: external defines should all be prefixed with VOTEQUORUM_ Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:17 +02:00
Fabio M. Di Nitto	2a37b56c49	votequorum: drop _FLAG_ from defines those are all info flags.. it's redundant and inconsistent Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:17 +02:00
Fabio M. Di Nitto	3416eacbec	votequorum: fix define name to match reality Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:17 +02:00
Fabio M. Di Nitto	2dae49e54a	votequorum: remove last instance of state and rename it to cast_vote also align naming of vote to cast_vote for info calls Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:17 +02:00
Fabio M. Di Nitto	43d1439600	votequorum: add qdevice CAST_VOTE status/flag this is a preparation commit for the next changes. right now it is no more than an alias to ALIVE. CAST_VOTE is required to support master/slave feature from qdevice. Effectively a quorum device can be: Not registered / registered (connected to API but nothing else is happening) if registered: Not alive / alive (quorum device is petting the API via poll and timer is running) if alive: Not voting (slave) / voting (master) Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:16 +02:00
Fabio M. Di Nitto	987e26f8d1	votequorum: rename NODE_FLAGS_QDEVICE_STATE to NODE_FLAGS_QDEVICE_ALIVE STATE is confusing and overloaded term in votequorum as it's used for nodes and other bits. make the name unique and ALIVE means that the qdevice is heartbeating to votequorum. improve display of the status in tools and tests. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:16 +02:00
Fabio M. Di Nitto	4621a6cd02	votequorum: rename NODE_FLAGS_QDEVICE to NODE_FLAGS_QDEVICE_REGISTERED make the flag name explicit Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:16 +02:00
Fabio M. Di Nitto	06e75d0b22	votequorum: re-enable qdevice api Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-08-07 11:07:16 +02:00
Jan Friesse	9fb7979370	Introduce SERVICES_COUNT_MAX macro Sync/service was using the maximal number of services either in numeric form (magic constant) or inconsistently, that is, via SERVICE_HANDLER_MAXIMUM_COUNT, which is the maximal number of handlers. The new macro solves this. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-08-02 09:32:05 +02:00
Angus Salkeld	775f71591b	LOG: drop the number of logging subsystems from 64 to 32 Currently 14 are used, 64 seems like a waste of memory. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-05-29 14:02:42 +10:00
Fabio M. Di Nitto	1dcb2d43d9	icmap: fix valgrind errors (pass 1)
clean up a lot of allocated blocks at exit. those changes have no runtime effects, but they make valgrind output a bit more useful by dropping over 700 errors/warnings to skip over every single run. there are still a few icmap related valgrind errors but those need some more complex and time-consuming investigation.
pre patch:
==21844== HEAP SUMMARY:
==21844== in use at exit: 1,229,321 bytes in 1,516 blocks
==21844== total heap usage: 7,191 allocs, 5,675 frees, 3,819,853 bytes allocated
==21844== LEAK SUMMARY:
==21844== definitely lost: 3,617 bytes in 11 blocks
==21844== indirectly lost: 21,960 bytes in 11 blocks
==21844== possibly lost: 1,080,101 bytes in 131 blocks
==21844== still reachable: 123,643 bytes in 1,363 blocks
==21844== suppressed: 0 bytes in 0 blocks
==21844== ERROR SUMMARY: 136 errors from 136 contexts (suppressed: 0 from 0)
post patch:
==25793== HEAP SUMMARY:
==25793== in use at exit: 1,185,870 bytes in 808 blocks
==25793== total heap usage: 9,427 allocs, 8,619 frees, 4,156,841 bytes allocated
==25793== LEAK SUMMARY:
==25793== definitely lost: 3,697 bytes in 12 blocks
==25793== indirectly lost: 22,248 bytes in 13 blocks
==25793== possibly lost: 1,079,655 bytes in 113 blocks
==25793== still reachable: 80,270 bytes in 670 blocks
==25793== suppressed: 0 bytes in 0 blocks
==25793== ERROR SUMMARY: 119 errors from 119 contexts (suppressed: 0 from 0)
Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-04-24 09:28:23 +02:00
Angus Salkeld	353e223377	Check before making a reference to __start___verbose Signed-off-by: Angus Salkeld <asalkeld@redhat.com>	2012-04-05 23:49:47 +10:00
Jan Friesse	e925f42165	Make ifaces_get work with dynamic no_rings The commit that added the number of addresses to the srp_address structure didn't account for totemsrp_ifaces_get, where the whole structure was copied instead of the addresses only. This is now fixed. Also, to make the totempg API forward compatible, the size of the interfaces array must be passed to ifaces_get-like functions to prevent memory overwrite. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-03-26 11:54:26 +02:00
Jan Friesse	3b7c2f0588	Update crypto_set API Also a few leftovers from cfg are removed and the version of totempg is increased to 5 to reflect all the changes we made Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-03-15 17:33:53 +01:00
Fabio M. Di Nitto	0a6a6bbcfa	crypto: drop secauth and make crypto none work again keep totem.secauth config key for compatibility if the key is NOT set, crypto will default to aes256/sha1 if the key is set to "off", crypto is disabled. this reflects pretty much old behavior keywords totem.crypto_cipher and totem.crypto_hash can override secauth individually. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-03-14 11:28:36 +01:00
Jan Friesse	ab1675f0fe	Parse and use hash and crypto from config file Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-03-13 17:38:59 +01:00
Fabio M. Di Nitto	55e8476697	crypto: mask the crypto operations from totem packet size management totem doesn't need to understand what crypto does. totem needs to be able to tell crypto: "those are data, play with them" and crypto needs to return: "here are your scrambled data and the new size" similar to decrypt/verify. this way we add enough dynamic within crypto to change header size and all at any given time (for different hash algorithm for example) without affecting on wire compat. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-03-13 15:50:58 +01:00
Jan Friesse	8cdd2fc493	Remove libtomcrypt Tomcrypt in corosync has not been updated for a long time. Because we have support for libnss, libtomcrypt can be removed. Also a few leftovers (AES is 256 bits, not 128, ...) are removed. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-03-13 09:19:47 +01:00
Fabio M. Di Nitto	20a5289074	drop evs service there are several reasons for this: 1) evs is only partially implemented with no plans to complete it typedef enum { EVS_TYPE_UNORDERED, /* not implemented */ EVS_TYPE_FIFO, /* same as agreed */ EVS_TYPE_AGREED, EVS_TYPE_SAFE /* not implemented */ } evs_guarantee_t; 2) evs has no users in any upstream distribution and no search engine can find any other upstream using it. 3) the only reason (I was told) to carry around evs was that evs receives the full ring_id struct from totem. This is only partially correct because while the structures are prepared to carry around those data, they are never transmitted from corosync engine down the IPC line to the user. CPG ring_id contains the exact same information and it's actually less buggy (due to prototyping of the info). worst case scenario where a user really absolutely needs libevs, it can be easily reimplemented as libcpg wrapper and avoid lots of code duplication. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-03-12 15:51:50 +01:00
Fabio M. Di Nitto	b654661b4c	build: drop obsoleted SOCKETDIR option yet another leftover from the past that can go away Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-03-12 07:12:48 +01:00
Fabio M. Di Nitto	eb3d49ef7d	pload: make it a test service and not a public one pload is a performance benchmark that measures the onwire speed of corosync. problem is that once pload has been executed, the cluster is basically dead. turn pload into a test tool, by removing corosync-pload tool and user library. cleanup pload code to make it more readable and drop lots of unnecessary stuff. add test/ploadstart tool that can configure and start pload via cmap calls. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-03-12 07:11:51 +01:00
Fabio M. Di Nitto	142ce8c3a1	totem: drop crypt_accept: concept/option this was another old onwire compat mode that is not useful any longer. we can safely move to the new model by default. According to Honza (real hardware 1 node testing) there is no performance impact. In my tests (8 nodes VM cluster), there is up to 10/12% performance improvement up to 1M packet size, where old and new models are equal. As a side note, nss still shows to be a performance loss on both real and virtual hw (without any kind of nss hw acceleration). Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-03-10 07:08:30 +01:00
Fabio M. Di Nitto	8f6e5ff530	sync: kill evil and syncv1 in one shot this change breaks onwire compatibility. cpg is the only user of sync_* interface and it's the only service that will require extra testing. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-03-09 11:15:08 +01:00
Fabio M. Di Nitto	03c76be696	votequorum: fix votequorum_getinfo man page and align struct name Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-27 12:41:04 +01:00
Fabio M. Di Nitto	cb5fd77501	votequorum: major rework to fix qdevice API and integration with core
qdevice is a very special node in the cluster and it adds a certain amount of complexity and special cases across the code. most of the qdevice data are shared across the cluster (name/votes) but effectively each node has a different view of the qdevice (registered/unregistered/voting/etc.)
with this change, we align the qdevice view across the nodes, exchanging more data between nodes, and we fix how qdevice behaves and how it is configured. The only side effect is that the amount of data transmitted on wire is slightly higher.
The qdevice API is still disabled by default. This means that the amount of real changes in current code is a lot smaller than it appears by this patch.
TODO: documentation/man pages need to be updated once this change is in (and behavior finalized).
User visible changes:
- configuration (coroparse, exec/votequorum): the quorum device section is now standalone within the quorum.
quorum {
    provider: corosync_votequorum
    device {
        model: (name)
        timeout: (millisec)
        votes:
    }
}
the keyword "model:" is mandatory to enable qdevice in configuration and should express the name of the script/daemon that will provide the qdevice. Looking into the future, an init script or systemd service will look for that name in /path/to/be/decided/name and start/stop qdevice.
timeout: defines the maximum interval the qdevice implementation has available between poll (see votequorum_qdevice_poll.3) before the device is considered dead and votes discarded
votes: is now a configuration parameter and not an API call. quorum devices don't care what they need to vote. votes is autocalculated when a nodelist is available and all nodes in the list vote 1. Otherwise this parameter is mandatory.
- configuration (exec/votequorum): startup and runtime configuration changes have been improved. errors at startup are considered fatal. errors at runtime have different exit paths.
startup:
* quorum.two_node and qdevice are incompatible.
* quorum.expected_votes requires quorum.device.votes.
* quorum.expected_votes - quorum.device.votes cannot be lower than 2.
* qdevice and last_man_standing are mutually exclusive.
* qdevice and auto_tie_breaker are mutually exclusive.
runtime config changes:
* quorum.two_node and qdevice are incompatible: if quorum device is alive, two_node is disabled. if quorum device is not alive and node count is 2, two_node is enabled, and quorum device cannot be registered
* if either last_man_standing or auto_tie_breaker were enabled at startup, and at runtime quorum device is configured, quorum device registration will be blocked.
* if quorum.expected_votes is configured but not quorum.device.votes, quorum device registration will be blocked.
* if quorum.device.votes is not configured and we cannot automatically calculate it, quorum device registration will be blocked.
* An error in configuring quorum.expected_votes and quorum.device.votes will block quorum device registration.
blocking quorum device registration also means dropping the votes.
quorum.device.votes (either set or automatically calculated) is now used to determine current expected_votes in the cluster.
- logging (exec/votequorum): all errors from configuration are treated as WARNING/CRITICAL. lots of extra DEBUG output is added (see internal changes too).
- corosync-quorumtool (tools/corosync-quorumtool):
* added option to forcefully kick out a quorum device from the local node. This is for emergency recovery only and it is only available when qdevice API is built-in.
* Improved status output, specifically add node state and qdevice information
[root@fedora-master-node2 coro]# corosync-quorumtool -s
Version: 1.99.4.12-9c7d-dirty
Quorum type: corosync_votequorum
Nodes: 2
Ring ID: 132
Quorate: Yes
Node votes: 1
Node state: Member
Expected votes: 3
Highest expected: 3
Total votes: 3
Quorum: 2
Flags: Quorate Qdevice
Nodeid Votes Name
1 1 fedora-master-node1.int.fabbione.net
2 1 fedora-master-node2.int.fabbione.net
0 1 QDEVICE (Voting)
* allow printing the status for any node in the cluster known to the local node.
[root@fedora-master-node1 coro]# corosync-quorumtool -s
Version: 1.99.4.12-9c7d-dirty
Quorum type: corosync_votequorum
Nodes: 2
Ring ID: 144
Quorate: Yes
Node votes: 1
Node state: Member
Expected votes: 3
Highest expected: 3
Total votes: 2
Quorum: 2
Flags: Quorate
Nodeid Votes Name
1 1 fedora-master-node1.int.fabbione.net
2 1 fedora-master-node2.int.fabbione.net
[root@fedora-master-node1 coro]# corosync-quorumtool -s -n 2
Version: 1.99.4.12-9c7d-dirty
Quorum type: corosync_votequorum
Nodes: 2
Ring ID: 144
Quorate: Yes
Node votes: 1
Node state: Member
Expected votes: 3
Highest expected: 3
Total votes: 3
Quorum: 2
Flags: Quorate Qdevice
Nodeid Votes Name
1 1 fedora-master-node1.int.fabbione.net
2 1 fedora-master-node2.int.fabbione.net
0 1 QDEVICE (Voting)
Internal changes:
- change qdevice timer to not run all the time, but only when necessary.
- change votequorum_nodeinfo on wire data to use flags instead of uint8_t and add QDEVICE status.
- allocate nodeid 0 to qdevice since it's the only real nodeid that can be reserved.
- change send_nodeinfo to allow sending nodeinfo for any node so that we can share qdevice info across the cluster (and this might be useful in future if we need to sync internal cluster view).
- add votequorum api call to update qdevice name
- add runtime data if quorum device has been forcefully disabled by config error
- add qdevice votes to expected_votes calculation (this is probably the biggest difference vs cman)
- change votequorum_read_nodelist_configuration so that we can autocalculate votes for qdevice (we need the nodecount vs votes).
- add all checks for startup/runtime config (see above).
- do not make qdevice part of the membership_list received from totem. None of our users care about it and it is not a real node.
- change onwire message handlers to deal with "data for this node from any node" case and understand nodeid 0 for qdevice info
- always allocate qdevice at startup. this simplifies code a lot.
- dispatch qdevice nodeinfo on membership changes.
- inform libvotequorum users when a qdevice is registered
- improve substantially qdevice api and add a simple barrier based on qdevice name.
- add qdevice API barrier at cluster level. This feature allows only one qdevice name to be active in the cluster at any time.
- qdevice getinfo can now report status for qdevice on any node.
- change slightly the way the qdevice API is built-in/out: only the libvotequorum calls are #ifdef'ed out now. Doing so in the core is too complex and would make the code unreadable with the risk of missing a bit or two effectively introducing an on-wire incompatibility if we will ever turn the API on.
- probably added some bugs on the way...
TODO: update qdevice_* API once the above is settled and test qdevice integration with other features.
Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com> (only second part)	2012-02-27 09:30:26 +01:00
Jan Friesse	27e9988486	Add generic implementation of getifaddrs Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-16 10:47:56 +01:00
Angus Salkeld	1877d3b6f5	Change the IPC TIMEOUT to block. This is to make sure that we properly wait for responses from corosync. I have made a fix to libqb to properly handle the case when corosync exits/crashes between a send and receive. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-14 21:27:02 +11:00
Angus Salkeld	023c4fa0cc	Move hdb_error_to_cs to corotypes.h Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-14 11:10:14 +11:00
Steven Dake	a7b4e7e045	Update copyright dates on include/totem files Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-02-13 17:05:04 -07:00
Steven Dake	4ee9550f80	Remove jhash.h since it is not used We would use libqb for hashing now if we needed hashing. cpg no longer uses jhash.h. Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-02-13 17:05:04 -07:00
Steven Dake	2514fc59b1	Updated copyright dates in include directory Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-02-13 17:05:04 -07:00
Steven Dake	815375411e	Remove unused or unimplemented CFG apis Remove: cfg_statetrack cfg_statetrackstop cfg_administrativestateste cfg_administrativestateget cfg_serviceload cfg_serviceunload Rev SO to 5.0.0 Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2012-02-13 17:04:49 -07:00
Fabio M. Di Nitto	225ee49c9f	cpg: drop dead code not used/referenced anywhere Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-09 16:54:42 +01:00
Fabio M. Di Nitto	20dd9ba36d	quorum: drop dead code spotted while writing man pages. There are no users for this struct Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-09 16:45:46 +01:00
Angus Salkeld	10faac6509	move cs_strerror() to common_lib Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-09 10:45:56 +11:00
Angus Salkeld	da483b8121	Add a common library that can be shared between libs and corosync We have always had this problem and worked around it by copying code or using inline functions. Both not good IMO. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-09 10:45:56 +11:00
Steven Dake	9a2eb5d521	Remove cs_config.h from global header install Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-08 08:31:10 -07:00
Steven Dake	7592e3b61e	Remove include/engine/quorum and integrate it into exec/engine.h Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-08 08:31:10 -07:00
Steven Dake	113e8d6ed3	Remove swab.h from global headers Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-08 08:31:10 -07:00
Steven Dake	d9a2110769	Remove list.h from global header install Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-08 08:31:09 -07:00
Steven Dake	0031919a3f	Remove mar_gen.h from global header install since it is not needed Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-02-08 08:31:09 -07:00
Jan Friesse	9260efdf47	Add CS_DISPATCH_ONE_NONBLOCKING dispatch type Add missing option for dispatch, which fills gap in combination of block/nonblock and one/all dispatch types. New type doesn't mask CS_ERR_TRY_AGAIN, and it means "no message was processed". Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-08 16:03:46 +01:00
Fabio M. Di Nitto	62bbe076a8	corotype: drop deprecated CPG_ defines the only user of those obsoleted defines is dlm master (already ported) to use CS_ and cmirror (that needs full porting to new corosync either way). Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-08 13:37:46 +01:00
Fabio M. Di Nitto	e9f9eb9c3d	corotypes: drop deprecated QUORUM_ defines neither corosync nor any of the dependencies use it. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-08 13:37:46 +01:00
Fabio M. Di Nitto	4120a2c1cb	corotypes: drop deprecated EVS_ defines none of our current dependencies use it. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-02-08 13:37:46 +01:00
Angus Salkeld	db70e14fcd	Make sure ipc functions return CS_ERR_TRY_AGAIN and not CS_ERR_TIMEOUT This is because most applications that use corosync do not test for TIMEOUT but only for TRY_AGAIN. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-and-Tested-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-02-07 20:21:08 +11:00
Fabio M. Di Nitto	46b7b155a4	votequorum: add leave_remove option this also cleanup NODESTATE for good. JOINING was never used Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2012-01-31 16:58:08 +01:00
Steven Dake	007e5c9458	Honor exec_init_fn call exec_init_fn now either returns NULL (success) or a string which indicates the error that occurred during service engine initialization. If an error occurs, corosync will exit. This patch adds ykd and makes other suggestions from Fabio Di Nitto. Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-01-30 14:05:09 -07:00
Fabio M. Di Nitto	ccd36af00e	votequorum: rename qdisk to qdevice a quorum device is not necessarily a disk and this also aligns various names to be generic Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-By: Christine Caulfield <ccaulfie@redhat.com>	2012-01-27 11:17:02 +01:00
Fabio M. Di Nitto	40aa40ed84	votequorum: drop NODESTATE_LEAVING this is another leftover from cman compatibility layer Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2012-01-26 14:32:54 +01:00
Angus Salkeld	3131601ce2	Remove all unnecessary "\n" from log messages These look ugly, are inconsistently done and just have to be removed later in libqb before calling syslog. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-23 13:08:23 +11:00
Fabio M. Di Nitto	2cd6ad9922	votequorum: ifdef qdiskd API out as agreed, the API has not been tested yet. Adding later is better than removing it. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-18 14:23:06 +01:00
Steven Dake	bb849be586	Get rid of external config loader in include/engine/config.h Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-01-16 09:30:50 -07:00
Steven Dake	75bc06d916	Remove lcr directory, files, and references since it is no longer needed Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-01-16 09:30:40 -07:00
Steven Dake	08b635f8da	Move cs_error into global header so that third party applications can use it Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Andrew Beekhof <abeekhof@redhat.com>	2012-01-16 07:32:40 -07:00
Fabio M. Di Nitto	2003a87eb0	votequorum/quorum-tools: drop unnecessary includes Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Christine Caulfield <ccaulfie@redhat.com>	2012-01-16 15:28:56 +01:00
Jan Friesse	a1df899d35	icmap: Add fast version of inc and dec operation The biggest difference between the fast and standard inc/dec operations is that the fast version doesn't do malloc/memcpy, which also means that tracking events don't have the old value set. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-13 11:13:38 +01:00
Fabio M. Di Nitto	23ea4f0f11	votequorum: drop votequorum_leave this was a compatibility function for cman_tool only. Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-13 09:25:47 +01:00
Fabio M. Di Nitto	1cf165e776	votequorum: display flags for all features Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-13 09:25:47 +01:00
Fabio M. Di Nitto	f464038b17	votequorum: drop HASSTATE/SETSTATE this is a leftover from killing DISALLOWED Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-13 09:25:47 +01:00
Steven Dake	7e1c9771f2	unshare exec/icmap.so Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Fabio Di Nitto <fdinitto@redhat.com>	2012-01-12 07:29:41 -07:00
Jan Friesse	bb6bbd01e6	Store rrp faulty status of ring in cmap New key with faulty status of ring is created in cmap as name runtime.totem.pg.mrp.rrp.$ring_number.faulty Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-11 14:12:06 +01:00
Fabio M. Di Nitto	0a2ae35584	votequorum: drop kill_reason leftovers (part of disallowed) Reviewed-by: Steven Dake <sdake@redhat.com> Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-01-10 15:48:37 +01:00
Fabio M. Di Nitto	b3949957f3	votequorum: clean up coding style first pass to bring votequorum in line with corosync coding style. fix whitespaces, add missing {}, fix comments, be consistent with ENTER/LEAVE usage, be consistent with some functions variable names and some more cosmetic changes Reviewed-by: Steven Dake <sdake@redhat.com> Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-01-10 15:48:29 +01:00
Fabio M. Di Nitto	9589611dc4	votequorum: drop concept of DISALLOWED this is a very old leftover from the RHEL5 timeframe, not used in RHEL6. Also change votequorum soname since this change implies an ABI change. Reviewed-by: Steven Dake <sdake@redhat.com> Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-01-10 15:48:10 +01:00
Fabio M. Di Nitto	e34d509df7	quorum: change API to return quorum type at initialization time corosync internal theory of operation is that without a quorum provider the cluster is always quorate. This is fine for membership free clusters but it does pose a problem for applications that need membership and "real" quorum. this change adds quorum_type to the quorum_initialize call to return QUORUM_FREE or QUORUM_SET. Applications can then make their own decisions to error out or continue operating. The only other way to know if a quorum provider is enabled/configured is to poke at confdb/objdb, but that adds an unnecessary burden to applications that really don't need to use an entire library for a boolean value. Reviewed-by: Steven Dake <sdake@redhat.com> Signed-off-by: Fabio M. Di Nitto <fdinitto@redhat.com>	2012-01-10 15:47:24 +01:00
Angus Salkeld	9e36255b8e	IPC: don't block forever on a recv msg as corosync might be gone. This at least will not make the client hang forever. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2012-01-10 08:32:31 +11:00
Steven Dake	e5aba30a49	Move coroapi out of external headers Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Angus Salkled <asalkeld@redhat.com>	2012-01-07 17:47:45 -07:00
Steven Dake	8ad583a54c	Move logsys.c into corosync binary instead of a shared object Our preferred shared logging system is exported via the libqb library. As a result, the corosync project no longer needs to export logsys.so and the code can be directly included in the binary. The header file can also be removed. Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2012-01-06 18:19:59 -07:00
Jan Friesse	7c250a5147	Remove objdb and confdb Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-12-15 09:19:18 +01:00
Jan Friesse	120531cddb	Move SAM to use CMAP service Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-12-15 09:19:18 +01:00
Jan Friesse	8a45e2b152	Move corosync core to use icmap Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-12-15 09:19:17 +01:00
Jan Friesse	b3c99977de	Add user library to use cmap service Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-12-15 09:19:17 +01:00
Jan Friesse	a2824073c7	Add cmap service Cmap service is application developer interface to icmap and it is direct replacement for confdb. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-12-15 09:19:17 +01:00
Jan Friesse	525e6a6ebe	Add icmap Icmap is replacement for objdb, based on libqb map (trie). Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-12-15 09:19:17 +01:00
Angus Salkeld	2ba4ebe09e	Fix cpgbench (large message sizes) To allow async cpg messages of 1M we need to: 1) increase the totem queue size 4 times 2) align the critical level to one large message free There are a number of reasons for doing this: We can't let cpg_mcast_joined() fail because the user will not see it and will assume it has succeeded. The reason we are getting good performance is by providing a negative feedback loop from the totem q to the IPC/poll system. This relies on 4 q states low/med/high/crit. With messages of size 1M you now have a q of size one and now go from level low to crit instantly then back to low as messages are put on and taken off. I don't think this is the best behaviour. Having a q size of 4 allows the system to utilize the q better and gives us time to respond to changes in the q level. To effectively achieve flow control with a q of size 1 would require all the clients to request the space on the q like is done in totempg_groups_joined_reserve() but probably in shared memory. This would take quite a bit of re-work. Signed-off-by: Angus Salkeld <asalkeld@redhat.com>	2011-12-15 10:43:11 +11:00
Angus Salkeld	a6729003a6	OBJDB: free up resources on exit Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-11-11 09:06:50 +11:00
Angus Salkeld	0fc51c40fd	LOG: cleanup logging resources at exit Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-11-11 09:05:08 +11:00
Jan Friesse	26db8b21b2	api: Change some of totempg definitions Recent changes in patch "Get rid of hdb usage in totempg.h interface" caused incompatibility between corosync API and totempg. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-10-24 17:43:36 +02:00
Jan Friesse	1711aea72f	Allow compilation of totempg without warnings Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-10-24 17:43:28 +02:00
Jan Friesse	99bbf4cc78	logsys.h: Properly define LEAVE macro Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2011-10-24 14:24:52 +02:00
Angus Salkeld	1b63c3cf57	LOG: update the log defines Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-10-22 10:51:47 +11:00
Angus Salkeld	78a5260c06	LOG: use libqb facility conversion functions Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-10-21 19:34:43 +11:00
Jan Friesse	752239eaa1	rrp: Higher threshold in passive mode for mcast There were too many false positives with passive mode rrp when a high number of messages were received. Patch adds new configurable variable rrp_problem_count_mcast_threshold which is by default 10 times rrp_problem_count_threshold and this is used as threshold for multicast packets in passive mode. Variable is unused in active mode. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed by: Steven Dake <sdake@redhat.com>	2011-09-01 11:21:09 +02:00
Steven Dake	e920fef7e9	Get rid of hdb usage in totempg.h interface hdb has some expense and is not necessary in the totempg.so runtime. This patch removes the dependence on hdb and instead uses a direct pointer. Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2011-08-23 22:29:01 -07:00
Steven Dake	bb42020f9a	Use qb_hdb instead of mutex based hdb code Rid ourselves of the mutex usage still in the code base Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2011-08-23 12:48:21 -07:00
Steven Dake	9f36a892a8	Move cs_queue.h from include directory to exec directory This file is only used by totemsrp.c. Move out of general include directory. Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2011-08-22 19:31:33 -07:00
Jan Friesse	99852ab203	Allow compile master on RHEL 6 corosync_timer_handle_t is now conditionally defined to prevent double definition causing compile fault on RHEL 6 systems. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2011-08-09 11:29:48 +02:00
Angus Salkeld	37e17e7a94	libqb: logging & trace Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-08-09 10:37:16 +10:00
Angus Salkeld	f717bc60e1	libqb: make timer api a wrapper around qb_loop timers. - change timeout value to nano seconds - fix timer handles (don't alloc on stack) Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-08-09 10:37:14 +10:00
Angus Salkeld	c6895faa05	libqb: change ipc -> qb_ipc IPC: return 0/-ENOBUFS from message handler IPC: use the new rate_limit API to improve perf. CPG: add send_async API & hook up flow control IPC: Fix flow control getting stuck. IPC: Port the remaining libs to use libqb IPC IPC: remove libqb flowcontrol API TEST: put cpg_dispatch() in it's own thread IPC: cleanup ipc_glue.c name everything cs_ipcs_() IPC: add back statistics IPC: remove coroipcc_ symbols from lib.versions IPC: init each se's IPC as it is loaded. IPC: use the new connection_closed() event to free the context. IPC: re-add zero copy functionality back IPC: remove cpg_mcast_joined_async() and make it the default -> now cpg_mcast_joined() == cpg_mcast_joined_async() libqb: expose a libqb error converter libqb: add missing error conversions libqb: remove repeat try loop in lib/cpg.c CPG: fix zero copy mcast CPG: use newer return codes Add ENOTCONN to qb_to_cs_error() libqb: fix error conversion from errno to cs_error_t in confdb libqb: change errno_to_cs to qb_to_cs_error libqb: add a cs_strerror() to get a more meaningful message libqb: fix some confusing error conversions. libqb: set the timeout on recv's to -1 (wait forever) Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-08-09 10:37:14 +10:00
Angus Salkeld	fce8a3c3b6	libqb: convert coropoll calls to qb_loop calls. Signed-off-by: Angus Salkeld <asalkeld@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-08-09 10:37:14 +10:00
Tim Beale	04f37df2f7	Add some more stats for debugging + overload - number of times client is told to try again + invalid_request - message contained invalid parameter, e.g. invalid size + msg_queue_avail - messages currently available at the Totem layer + msg-queue_reserved - messages currently reserved at the Totem layer Signed-off-by: Tim Beale <tim.beale@alliedtelesis.co.nz> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-07-19 08:58:41 -07:00
Steven Dake	c544e87bb0	Correct missing poll functions from service handler struct needed for confdb APIs Signed-off-by: Steven Dake <sdake@redhat.com> Reviewed-by: Jan Friesse <jfriesse@redhat.com>	2011-07-15 13:30:41 -07:00
Tim Beale	77f7e5b0fe	Fix compile/runtime issues for _POSIX_THREAD_PROCESS_SHARED < 1 For the case where _POSIX_THREAD_PROCESS_SHARED < 1, the code doesn't compile for corosync v1.3.1. And when it does compile, it crashes on our system - our version of uClibc seems to always expect a 4th arg. The man page suggests the 4th arg is optional, but does say: 'For greater portability it is best to always call semctl() with four arguments', which is what this patch does. Also removed semop as it's an unused variable. Signed-off-by: Tim Beale <tim.beale@alliedtelesis.co.nz> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-07-06 06:44:22 -07:00
Jiaju Zhang	5dc33c2824	RRP: redundant ring automatic recovery This patch automatically recovers redundant ring failures. Please note that this patch introduced rrp_autorecovery_check_timeout in totem config hence breaks internal ABI. The internal ABI users of totem.h need to rebuild their binaries. Signed-off-by: Jiaju Zhang <jjzhang@suse.de> Signed-off-by: Steven Dake <sdake@redhat.com> Tested-by: Jan Friesse <jfriesse@redhat.com> Tested-by: Florian Haas <florian.haas@linbit.com> Tested-by: Jiaju Zhang <jjzhang@suse.de>	2011-07-05 09:13:48 -07:00
Jan Friesse	8c717c22b2	Remove spinlocks Spinlocks are now removed, because even though spinlocks can improve speed in some special cases, in most cases they make corosync CPU usage much more intensive and corosync less responsive than if only mutexes are used. What we were doing is: pthread_mutex_lock pthread_spin_lock pthread_spin_unlock pthread_mutex_unlock, which is not safe. Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-06-29 12:01:54 +02:00
Jerome Flesch	00434a4f10	Fix usage of strerror_r()/perror() Signed-off-by: Jerome Flesch <jerome.flesch@netasq.com> Reviewed-by: Angus Salkeld <asalkeld@redhat.com>	2011-06-28 09:56:58 +02:00
Jan Friesse	afa0398ca4	mainconfig: Check retval of logsys_format_set Signed-off-by: Jan Friesse <jfriesse@redhat.com> Reviewed-by: Steven Dake <sdake@redhat.com>	2011-06-06 10:02:34 +02:00


561 Commits