qemu-server

mirror of https://git.proxmox.com/git/qemu-server synced 2025-10-14 10:54:34 +00:00

Author	SHA1	Message	Date
Fabian Ebner	ea5b400812	sync_disks: log output of storage_migrate Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Fabian Ebner	49a5a0d84b	sync_disks: be more verbose if storage_migrate fails If storage_migrate dies, the error message might not include the volume ID or the target storage ID, but those might be good to know. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Fabian Ebner	cc1a3820db	sync_disks: use allow_rename to avoid collisions on the target storage This makes it possible to migrate a VM with volumes store1:vm-123-disk-0 store2:vm-123-disk-0 to some targetstorage. Also prevents migration failure when there is an orphaned disk with the same volid on the target. To avoid confusion, the name should not change for 'vmstate'-volumes. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Fabian Ebner	97ece9ddce	Update volume IDs in one go Use 'update_volume_ids' for the live-migrated disks as well. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Fabian Ebner	37666e4caa	Take note of changes to the volume IDs when migrating and update the config Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Fabian Ebner	1f726e0a85	Use new storage_migrate interface Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Fabian Ebner	912792e245	Switch to using foreach_volume instead of foreach_drive It was necessary to move foreach_volid back to QemuServer.pm In VZDump/QemuServer.pm and QemuMigrate.pm the dependency on QemuConfig.pm was already there, just the explicit "use" was missing. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-04-08 22:11:54 +02:00
Stefan Reiter	58c64ad5d9	Include "-cpu" parameter with live-migration This is required to support custom CPU models, since the "cpu-models.conf" file is not versioned, and can be changed while a VM using a custom model is running. Changing the file in such a state can lead to a different "-cpu" argument on the receiving side. This patch fixes this by passing the entire "-cpu" option (extracted from /proc/.../cmdline) as a "qm start" parameter. Note that this is only done if the VM to migrate is using a custom model (which we can check just fine, since the <vmid>.conf is versioned with pending changes), thus not breaking any live-migration directionality. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-04-07 17:27:58 +02:00
Fabian Grünbichler	bf8fc5a307	migrate: allow arbitrary source->target storage maps the syntax is backwards compatible, providing a single storage ID or '1' works like before. the new helper ensures consistent behaviour at all call sites. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-04-02 17:47:14 +02:00
Stefan Reiter	c05f1b33ea	migration: fix downtime limit auto-increase `485449e37` ("qmp: use migrate-set-parameters in favor of deprecated options") changed the initial "migrate_set_downtime" QMP call to the more recent "migrate-set-parameters", but forgot to do so for the auto-increase code further below. Since the units of the two calls don't match, this would have caused the auto-increase to increase the limit to absurd levels as soon as it kicked in (ms treated as s). Update the second call to the new version as well, and while at it remove the unnecessary "defined()" check for $migrate_downtime, which is always initialized from the defaults anyway. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-04-02 16:48:51 +02:00
Fabian Grünbichler	6a039d06e9	migrate: improve cleanup_remotedisks to also handle cases where disk allocation failed in the remote vm_start, and we only have a bitmap but no target drive information. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-04-01 17:41:07 +02:00
Dominik Csapak	818ce80ec1	fix efidisks on storages with minimum sizes bigger than OVMF_VARS.fd on storages where the minimum size of images is bigger than the real OVMF_VARS.fd file, they get padded to their minimum size when using such an image, qemu maps it fully to the vm, but the efi does not find the vars region and creates a file on the first efi partition it finds this breaks some settings in the ovmf, such as resolution to fix this, we have to specify the size for the pflash, so that qemu only maps the first n bytes in the vm (this only works for raw files, not for qcow2) we also have to use the correct size when converting between storages in 'clone_disk' (used for move disk and cloning vms) and when live migrating to different storages when we now expect that the source image is always correctly used/created (e.g. raw with size=x in pflash argument) then we always create the target correctly when encountering users which have a non-valid image (e.g. a efidisk moved from zfs to qcow2 before this patch), we have to tell them to recreate the efidisk and the settings on it we have to version_guard it to 4.1+pve2 (since we haven't bumped yet since the change to pve2) also add 2 tests, one for the old version and one for the new Signed-off-by: Dominik Csapak <d.csapak@proxmox.com> Tested-by: Stefan Reiter <s.reiter@proxmox.com> Reviewed-by: Stefan Reiter <s.reiter@proxmox.com> [ Thomas: rebased to master ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-03-30 09:41:55 +02:00
Fabian Ebner	5c50a84f23	migration with targetstorage: check if target storage supports images This makes sure that live migration also respects content types. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-03-27 14:32:42 +01:00
Thomas Lamprecht	2cd808d331	migrate sync disks: split long line Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-03-27 10:17:54 +01:00
Thomas Lamprecht	b10afa311d	migrate sync_disks: use own variable for often referenced storage config also fix two places where we used $self->{vmid} even if $vmid was in scope (and the same). Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-03-27 10:13:10 +01:00
Fabian Grünbichler	9b3f5a5c99	migrate: cleanup disk/bitmaps if 'qm start' failed since bitmaps are set early on, and 'qm start' potentially has allocated the disks but still failed. we can only clean up what we know about anyway, so the disk part is still only best effort. also use replicated_volumes instead of bitmap existence to check for replicated volumes, since 'qm start' on an old node that does not understand replicated volumes might have allocated a new volume that we DO want to clean up, and not skip. also cleanup disks after stopping target VM, otherwise we might end up in a situation where the target VM is still running and using the disks, thus blocking the disk cleanup. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-03-27 07:54:44 +01:00
Fabian Grünbichler	7f5fb49a7c	migrate: fix auto-vivification in cleanup_bitmaps this does not currently trigger since nothing uses $self->{target_drive} afterwards. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-03-27 07:54:44 +01:00
Fabian Grünbichler	88126be3f7	migrate: fix replication false-positives by only checking for replicatable volumes when a replication job is defined, and passing only actually replicated volumes to the target node via STDIN, and back via STDOUT. otherwise this can pick up theoretically replicatable, but not actually replicated volumes and treat them wrong. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-03-27 07:54:44 +01:00
Fabian Ebner	47250f03ef	Fix calls to get_replicateable_volumes There is a need to set $noerr, because otherwise migration for a VM with a non-replicatable volume fails with: missing replicate feature on volume 'myfs:107/vm-107-disk-2.raw' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-03-25 14:53:17 +01:00
Thomas Lamprecht	6d7450cbec	qemu migrate: sort and split module usage Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-03-25 10:05:58 +01:00
Thomas Lamprecht	28e6e180bc	add basic version check for live-migration with replicated disks as we need at least pve-qemu in 4.2 for this to work, the target side is implicitly checked with "to old version" check for migrate or the mirror will fail anyway. Just use the simple "qemu binary version check", as we could stil live migrate an older snapshot with older machine versions if both sides have a recent enough qemu. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-03-25 10:02:36 +01:00
Fabian Grünbichler	9b6efe436d	migrate: add live-migration of replicated disks with incremental drive-mirror and dirty-bitmap tracking. 1.) get replicated disks that are currently referenced by running VM 2.) add a block-dirty-bitmap to each of them 3.) replicate ALL replicated disks 4.) pass bitmaps from 2) to drive-mirror for disks from 1) 5.) skip replicated disks when cleaning up volumes on either source or target added error handling is just removing the bitmaps if an error occurs at any point after 2, except when the handover to the target node has already happened, since the bitmaps are cleaned up together with the source VM in that case. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com> Tested-by: Stefan Reiter <s.reiter@proxmox.com>	2020-03-24 12:22:32 +01:00
Fabian Grünbichler	b9f44d2773	migrate: add replication info to disk overview to make migration logs a bit easier to grasp with a quick glance. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com> Tested-by: Stefan Reiter <s.reiter@proxmox.com>	2020-03-24 11:54:32 +01:00
Thomas Lamprecht	1e0074c437	migrate phase3: add to comment why a blockjob cancel is OK here Clarify why a cancel is actually not really canceling here, because we're already finished with storage migration and the block jobs are all in ready state and we (source) are going to stop soon to hand over to target. > Note that if you issue 'block-job-cancel' after 'drive-mirror' has > indicated (via the event BLOCK_JOB_READY) that the source and > destination are synchronized, then the event triggered by this > command changes to BLOCK_JOB_COMPLETED, to indicate that the > mirroring has ended and the destination now has a point-in-time > copy tied to the time of the cancellation -- qapi/block-core.json (QEMU 4.2) Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2020-03-20 11:08:23 +01:00
Mira Limbeck	ff09c795ed	revert spice_ticket prefix change in `7827de4` The change to the prefixed version broke migration from new to old qemu-server version. This reverts the change and adds a TODO comment for 7.0 to change it to the prefixed version then. Signed-off-by: Mira Limbeck <m.limbeck@proxmox.com>	2020-03-20 10:37:33 +01:00
Fabian Grünbichler	db1f8b39e1	drive_mirror: rename variables and values and add some more details to comments. Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-03-18 08:21:29 +01:00
Mira Limbeck	7827de41a2	add unix socket support for NBD storage migration The reuse of the tunnel, which we're opening to communicate with the target node and to forward the unix socket for the state migration, for the NBD unix socket requires adding support for an array of sockets to forward, not just a single one. We also have to change the $sock_addr variable to an array for the cleanup of the socket file as SSH does not remove the file. To communicate to the target node the support of unix sockets for NBD storage migration, we're specifying an nbd_protocol_version which is set to 1. This version is then passed to the target node via STDIN. Because we don't want to be dependent on the order of arguments being passed via STDIN, we also prefix the spice ticket with 'spice_ticket: '. The target side handles both the spice ticket and the nbd protocol version with a fallback for old source nodes passing the spice ticket without a prefix. All arguments are line based and require a newline in between. When the NBD server on the target node is started with a unix socket, we get a different line containing all the information required to start the drive-mirror. This contains the unix socket path used on the target node which we require for forwarding and cleanup. Signed-off-by: Mira Limbeck <m.limbeck@proxmox.com>	2020-03-18 08:03:44 +01:00
Mira Limbeck	e02fb12620	add qemu_drive_mirror_monitor completion modes With Qemu 4.2 we encountered a problem with unix sockets and SSH socket forwarding for drive-mirror. It seems the socket gets reopened again and again after it closes for some reason. This can be worked around by specifying 'block-job-cancel' instead of 'block-job-complete' when we're not interested in swapping the disks again from NBD to their original protocol. This is always the case when we use drive-mirror for live migrating a VM. qemu_drive_mirror is used for migration and for clone_disk. All in all we have 3 cases to handle. Either the 'skip' case which skips the completion of the job. The 'wait' case which was the default before and still is when $completion is undefined. And the new 'wait_noswap' case which is used for the live migration. If 'wait_noswap' is specified, we issue a 'block-job-cancel' once the block job is in 'ready' state. This completes the block job without swapping the disks. clone_disk always uses 'block-job-cancel' via the qemu_blockjobs_cancel sub. Signed-off-by: Mira Limbeck <m.limbeck@proxmox.com>	2020-03-18 08:03:44 +01:00
Fabian Ebner	0ad295f9fb	Consistently use format determined in 'PVE::Storage::foreach_volid' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> LGTM-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-03-09 19:36:58 +01:00
Fabian Ebner	5eca0c3643	sync_disks: Always set 'snapshots' for qcow2 and vmdk volumes This fixes an issue when migrating a VM with an unused volume with format qcow2 or vmdk. Since 'snapshots' wasn't set, storage_migrate wanted to export/import with format raw+size instead. Therefore it used (instead of just 'dd') 'qemu-img convert', which fails when its output leaves through a pipe. Upon importing, a second error is present, because the format from the volume ID doesn't match the format of the stream and there is no conversion yet. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> LGTM-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-03-09 19:36:45 +01:00
Fabian Ebner	e0fd2b2f84	Create Drive.pm and move drive-related code there The initialization for the drive keys in $confdesc is changed to be a single for-loop iterating over the keys of $drivedesc_hash and the initialization of the unusedN keys is move to directly below it. To avoid the need to change all the call sites, functions with more than a few callers are exported from the submodule and imported into QemuServer.pm. For callers of the now imported functions within QemuServer.pm, the prefix PVE::QemuServer is dropped, because it is unnecessary and now even confusing. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-03-07 18:23:57 +01:00
Stefan Reiter	29eb909ee0	fix #2611 : use correct operation in get_bandwidth_limit Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-03-03 11:47:13 +01:00
Stefan Reiter	485449e37b	qmp: use migrate-set-parameters in favor of deprecated options migrate_set_downtime, migrate_set_speed and migrate-set-cachesize have all been deprecated since 2.8 or 2.11 [0]. They still work, but no reason not to use the correct version. Note that the downtime-limit parameter switched from seconds to milliseconds, so convert to that. Slightly improve log output with units while at it. [0] https://qemu.weilnetz.de/doc/qemu-doc.html#Deprecated-features Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2020-02-06 13:50:33 +01:00
Fabian Grünbichler	683ab65491	migrate: re-order lines to improve readability Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2020-02-05 09:43:09 +01:00
Fabian Ebner	1764fa05d0	Extract volume ID before calling 'parse_volume_id' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-02-05 08:41:05 +01:00
Fabian Ebner	8b02e56870	rename 'volid' to 'drivestr' where it's not only a volume ID Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-02-05 08:41:05 +01:00
Fabian Ebner	c96173968a	Remove unused 'sharedvm' variable AFAICT this one hasn't been in use since commit '4530494bf9f3d45c4a405c53ef3688e641f6bd8e' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2020-01-09 17:43:51 +01:00
Stefan Reiter	8bf30c2a72	fix #2493 : show QEMU errors in migration log QEMU usually only prints warnings and errors and stays silent otherwise, so it makes sense to just log all of it's output. Prefix it with '[<target_hostname>]' to indicate that the output is coming from the remote node, so users know where to search for the error. Side effect is that the 'VM start' task created by the migration will now show the "QEMU:" prefix, but it's still very readable IMHO. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2019-12-12 13:36:19 +01:00
Stefan Reiter	6e0216d862	hide long commandline on vm_start/migrate failure By default run_command prints the entire commandline executed when an error occurs, but QEMU and our migrate command are not only uninteresting to the user[] but also annoyingly long. Hide them and only print the exit code. [] Especially our migrate command, since it can't be manually executed anyway. QEMU's commandline might contain something interesting, but is so long that it's tricky to parse anyway, any a user can always call 'qm showcmd --pretty'. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2019-12-12 13:35:40 +01:00
Stefan Reiter	68b108ee3a	update disk size before local disk migration Split out 'update_disksize' from the renamed 'update_disk_config' to allow code reuse in QemuMigrate. Remove dots after messages to keep style consistent for migration log. After updating in sync_disks (phase1) of migration, write out updated config. This means that even if migration fails or is aborted in later stages, we keep the fixed config - this is not an issue, as it would have been fixed on the next attempt anyway, and it can't hurt to have the correct size instead of a wrong one either way. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2019-12-11 10:42:56 +01:00
Stefan Reiter	71c58bb7ed	remove $vmid param from print_drive It isn't used in the sub, but suggest it is needed. No users outside qemu-server found. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2019-12-09 11:44:13 +01:00
Thomas Lamprecht	dad06e2068	refactor storage whitelist in sync_disks to regex Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2019-12-04 18:40:03 +01:00
Thomas Lamprecht	40a572f7e8	migrate phase 3 cleanup: add error into error propagation message Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2019-11-30 17:27:14 +01:00
Stefan Reiter	3392d6cacf	refactor: extract QEMU machine related helpers to package ...PVE::QemuServer::Machine. qemu_machine_feature_enabled is exported since it has a lot of users in PVE::QemuServer and a long enough name as it is. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2019-11-20 16:29:23 +01:00
Stefan Reiter	0a13e08ec2	refactor: create QemuServer::Monitor for high-level QMP access QMP and monitor helpers are moved from QemuServer.pm. By using only vm_running_locally instead of check_running, a cyclic dependency to QemuConfig is avoided. This also means that the $nocheck parameter serves no more purpose, and has thus been removed along with vm_mon_cmd_nocheck. Care has been taken to avoid errors resulting from this, and occasionally a manual check for a VM's existance inserted on the callsite. Methods have been renamed to avoid redundant naming: * vm_qmp_command -> qmp_cmd * vm_mon_cmd -> mon_cmd * vm_human_monitor_command -> hmp_cmd mon_cmd is exported since it has many users. This patch also changes all non-package users of vm_qmp_command to use the mon_cmd helper. Includes mocking for tests. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2019-11-20 16:29:23 +01:00
Thomas Lamprecht	e85d01f282	migration: fix false-positive log for copying local images Only log that if we actually have local disks. Add also an explicit log for replication. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2019-11-20 16:01:35 +01:00
Fabian Ebner	9270672e67	fix typo in migration cleanup error message Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2019-10-28 11:30:10 +01:00
Mira Limbeck	9860fe4ef9	close #2263 : die on live migration with local cloudinit disk Live migration with a local cloudinit disk was never intended to work. It did however work to an extent that the migration completed but the disk on the source node could not be deleted. Now die if a live migration is started with a local cloudinit disk. With the GUI changes live migration is already disabled as it recognizes the cloudinit disk as a local resource. Signed-off-by: Mira Limbeck <m.limbeck@proxmox.com>	2019-08-26 12:13:07 +02:00
Dominik Csapak	ccab68c22c	fix remote viewer live migration for some reason not setting port results in a port of '65535' which triggers an execption in http-server anyevent, so we set the port to 0 also, we have to read the ticket from stdin even for 'unix' type secure migration Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2019-08-20 11:49:24 +02:00
Tim Marx	ca6abacf6b	migrate: log which local resource causes error Signed-off-by: Tim Marx <t.marx@proxmox.com>	2019-05-07 10:22:12 +00:00

188 Commits