Commit Graph

70 Commits

Author SHA1 Message Date
Rob Rozestraten via pve-devel
07e56cc9dd fix unexpected EOF for client when closing TLS session
When pve-http-server initiates the closure of a TLS session, it does not
send a TLS close notify, resulting in an unexpected EOF error on systems
with recent crypto policies. This can break functionality with other
applications, such as Foreman[0].

This behavior can be observed in the following cases:

 * client uses HTTP/1.0 (no keepalive; server closes connection)
 * client sends no data for 5 sec (timeout; server closes connection)
 * server responds with 400 (no keepalive; server closes connection)

This patch sends the TLS close notify prior to socket teardown,
resulting in clean closure of TLS connections and no client error.

It also moves shutdown() to after the clearing of handlers. The reason
is that stoptls() must come before shutdown(), but stoptls() also
triggers on_drain(), which calls client_do_disconnect() again. The extra call to
client_do_disconnect() is avoided inside accept_connections() by commit
f737984, but perhaps clearing the handlers prior to shutdown() will
avoid it in all cases.

[0]: https://github.com/theforeman/foreman_fog_proxmox/issues/325
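
A minimal sketch of the resulting teardown order, assuming the
AnyEvent::Handle is kept in $reqstate->{hdl} (the sub name and body are
illustrative, not the actual patch):

```perl
use AnyEvent::Handle;

# clear handlers first so the on_drain() triggered by stoptls() cannot
# re-enter the disconnect logic, then notify, then shut down
sub client_do_disconnect_sketch {
    my ($self, $reqstate) = @_;

    my $hdl = delete $reqstate->{hdl} or return;

    $hdl->on_drain(undef);   # clear handlers before any shutdown step
    $hdl->on_read(undef);
    $hdl->on_eof(undef);

    $hdl->stoptls();         # sends the TLS close notify
    shutdown($hdl->fh, 1);   # only then shut down the write side
}
```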

Signed-off-by: Rob Rozestraten <admin@truthsolo.net>
Link: https://lore.proxmox.com/mailman.798.1741211145.293.pve-devel@lists.proxmox.com
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2025-04-08 14:49:50 +02:00
Dominik Csapak
2650923a42 fix #6230: increase allowed post size
In some situations, e.g., when one has a large resource mapping, the
UI can generate a request that is bigger than the current limit of
64KiB.

Our files in pmxcfs can grow up to 1 MiB, so theoretically, a single
mapping can grow to that size. In practice, a single entry will be
much smaller. In #6230, a user has a mapping of about 130KiB.

Increase the limit to 512KiB so we have a bit of headroom left.

We also have to increase the 'rbuf_max' size here, otherwise the
request will fail (since the buffer is too small for the request).
Since the post limit and the rbuf_max are tightly coupled, let the
code reflect that by summing the post size and the maximum header
size there.
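
A sketch of the coupling, with illustrative names and sizes:

```perl
# illustrative values; the real constant names/sizes may differ
my $limit_max_headers = 64 * 1024;    # maximum accepted header size
my $limit_max_post    = 512 * 1024;   # the new POST body limit

# the read buffer must fit the headers plus the largest allowed body,
# otherwise a maximum-size request could never be buffered completely
my $rbuf_max = $limit_max_post + $limit_max_headers;
```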

A short benchmark shows that it only slightly impacts performance for
the same amount of data (but that could be runtime variance too):

I used a 4 node virtualized cluster, benchmarked with oha[0] with
these options:

ADDR=<IP> oha --insecure -H $COOKIE -H $CSRFTOKEN -D bodyfile \
  -m "PUT" -T "application/x-www-form-urlencoded" -n 3000 -c 50 \
  --disable-keepalive --latency-correction \
  "https://$ADDR:8006/api2/json/cluster/mapping/pci/test"

So 3000 requests with 50 in parallel. I also restarted pveproxy and
pvedaemon between runs, and took the RSS values at around 50% of the
benchmark's runtime.

                    average time  requests/s  pvedaemon rss   pveproxy rss
old with 60k body   3.0067s       16.3487     140M-155M       141M-170M
new with 60k body   3.0865s       15.7623     140M-155M       141M-171M
new with 180k body  8.3834s       5.8934      140M-158M       141M-181M

Using a bigger body size had a large impact on the time, but that's
IMHO expected. Also, RSS is not that much impacted, only when using
many requests with larger request size, but this should also be
expected.

0: https://github.com/hatoo/oha

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
 [TL: fix wrapping the benchmark command here]
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2025-04-03 18:04:45 +02:00
Dominik Csapak
169d42e0f6 use HTTP_INTERNAL_SERVER_ERROR where appropriate instead of '501'
The HTTP status code 501 means 'Not Implemented'[0], but that clearly
does not fit as the default error when we encounter a problem while
handling an API request or upload.

So instead use '500' (HTTP_INTERNAL_SERVER_ERROR) which we already use
in other places where it fits.

0: https://datatracker.ietf.org/doc/html/rfc9110#name-501-not-implemented
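
For reference, the constant comes from HTTP::Status:

```perl
use HTTP::Status qw(:constants status_message);

# HTTP_INTERNAL_SERVER_ERROR is the named constant for 500
print HTTP_INTERNAL_SERVER_ERROR, ' ',
    status_message(HTTP_INTERNAL_SERVER_ERROR), "\n";
# prints: 500 Internal Server Error
```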

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2025-01-28 15:52:10 +01:00
Dominik Csapak
9b59e1f033 add error message into http body
In our Rust client, we can't access the HTTP reason phrases[0], so let's
put them into the body itself if we don't specify explicit content.

Our proxmox-client code in Rust already uses the body as the message if
there is one[1], so we get that automatically.

0: https://github.com/hyperium/http/issues/737
1: https://git.proxmox.com/?p=proxmox.git;a=blob;f=proxmox-client/src/client.rs;h=9b078a9820405b22ca54c17ea4da4c586e0649b4;hb=refs/heads/master#l237
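
A hedged sketch of the idea using HTTP::Response (the variable names
are assumptions):

```perl
use HTTP::Response;

# fall back to the reason phrase as body when no explicit content is
# given, so clients that cannot access the phrase still see a message
my ($code, $msg, $content) = (403, 'Permission check failed', undef);

my $resp = HTTP::Response->new($code, $msg);
$resp->content(defined($content) ? $content : "$msg\n");
```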

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2025-01-28 15:52:10 +01:00
Fabian Grünbichler
f737984826 fix #4816: do not disconnect twice if client sends no data
client_do_disconnect expects to be called exactly once per connection, since it
takes care of closing and unsetting the handle corresponding to the connection.
to find bugs in our connection handling, it will log "detected empty handle" if
it is called for a request/connection that no longer has a handle.

the edge case of opening a connection without sending any data leads to the
error callback being called twice:

Dec 04 09:37:02 xxx pveproxy[175235]: err (): Connection timed out

this is the (5 second) timeout triggering

Dec 04 09:37:02 xxx pveproxy[175235]: err (1): Broken pipe

this is AnyEvent trying to drain the buffer while the connection is already
closed

as soon as a single byte of traffic is sent, only the timeout will trigger.

there is no guarantee that the on_error callback is only called once (in fact,
it's possible to return from it for non-fatal errors and continue processing
the connection).

if there are further reports of empty handles with this in place, other
on_error callbacks might need similar logic - but it should only be added if
the triggering conditions are clear and deemed safe. the additional logging is
only cosmetic after all, but might point out an actual issue in our connection
handling code.
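
a hedged sketch of such a once-only guard (the 'disconnected' flag and
the surrounding variables are assumptions, not the actual patch):

```perl
# illustrative guard: run the disconnect logic at most once, even if
# AnyEvent calls on_error twice (timeout, then broken pipe)
$hdl->on_error(sub {
    my ($hdl, $fatal, $message) = @_;   # AnyEvent::Handle callback signature
    warn "error ($fatal): $message\n";
    return if $reqstate->{disconnected}++;   # assumed flag
    $self->client_do_disconnect($reqstate);
});
```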

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2025-01-28 15:28:47 +01:00
Thomas Skinner
fe9d61a83d fix #5699: pveproxy: add library methods for real IP support
Signed-off-by: Thomas Skinner <thomas@atskinner.net>
2025-01-24 09:36:22 +01:00
Friedrich Weber
f5ac409d20 fix #5391: proxy request: avoid HTTP 599 Too many redirections
The API server proxies HTTP requests in two cases:

- between cluster nodes (pveproxy->pveproxy)
- between daemons on one node for protected API endpoints
  (pveproxy->pvedaemon)

The API server uses AnyEvent::HTTP for proxying, with unfortunate
settings for connection reuse (details below). With these settings,
long-running synchronous API requests on the proxy destination's side
can cause unrelated proxied requests to fail with a misleading HTTP
599 "Too many redirections" error response. In order to avoid these
errors, improve the connection reuse settings.

In more detail:

Per default, AnyEvent::HTTP reuses previously-opened connections for
requests with idempotent HTTP verbs, e.g. GET/PUT/DELETE [1]. However,
when trying to reuse a previously-opened connection, it can happen
that the destination unexpectedly closes the connection. In case of
idempotent requests, AnyEvent::HTTP's http_request will retry by
recursively calling itself. Since the API server disallows recursion
by passing `recurse => 0` to http_request initially, the recursive
call fails with "HTTP 599 Too many redirections".

This can happen both for pveproxy->pveproxy and pveproxy->pvedaemon,
as connection reuse is enabled in both cases. Connection reuse being
enabled in the pveproxy->pvedaemon case was likely not intended: A
comment mentions that "keep alive for localhost is not worth it", but
only sets `keepalive => 0` and not `persistent => 0`. This setting
switches from HTTP/1.1 persistent connections to HTTP/1.0-style
keep-alive connections, but still allows connection reuse.

The destination unexpectedly closing the connection can be due to
unfortunate timing, but it becomes much more likely in case of
long-running synchronous requests. An example sequence:

1) A pveproxy worker P1 handles a protected request R1 and proxies it
   to a pvedaemon worker D1, opening a pveproxy worker->pvedaemon
   worker connection C1. The pvedaemon worker D1 is relatively fast
   (<1s) in handling R1. P1 saves connection C1 for later reuse.
2) A different pveproxy worker P2 handles a protected request R2 and
   proxies it to the same pvedaemon worker D1, opening a new pveproxy
   worker->pvedaemon connection C2. Handling this request takes a long
   time (>5s), for example because it queries a slow storage. While
   the request is being handled, the pvedaemon worker D1 cannot do
   anything else.
3) Since pvedaemon worker D1 sets a timeout of 5s when accepting
   connections and it did not see anything on connection C1 for >5s
   (because it was busy handling R2), it closes the connection C1.
4) pveproxy worker P1 handles a protected idempotent request R3. Since
   the request is idempotent, it tries to reuse connection C1. But C1
   was just closed by D1, so P1 fails request R3 with HTTP 599 as
   described above.

In addition, AnyEvent::HTTP's default of reusing connections for all
idempotent HTTP verbs is problematic in our case, as not all PUT
requests of the PVE API are actually idempotent, e.g. /sendkey [2].

To fix the issues above, improve the connection reuse settings (see
the sketch after this list):

a) Actually disable connection reuse for pveproxy->pvedaemon requests,
   by passing `persistent => 0`.
b) For pveproxy->pveproxy requests, enable connection reuse for GET
   requests only, as these should be actually idempotent.
c) If connection reuse is enabled, allow one retry by passing `recurse
   => 1`, to avoid the HTTP 599 errors.
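
A sketch of the three changes (the AnyEvent::HTTP option names are
real; the surrounding variables are assumptions):

```perl
use AnyEvent::HTTP;

my ($method, $url) = ('GET', 'https://127.0.0.1:8006/api2/json/version');
my $is_local_daemon = 0;   # assumption: true for pveproxy->pvedaemon

# a) local daemon: disable reuse entirely;
# b) remote pveproxy: reuse connections for GET requests only
my %reuse_opts = $is_local_daemon
    ? (keepalive => 0, persistent => 0)
    : (persistent => ($method eq 'GET' ? 1 : 0));

http_request(
    $method => $url,
    # c) allow one retry on a reused connection to avoid HTTP 599
    recurse => ($reuse_opts{persistent} ? 1 : 0),
    %reuse_opts,
    sub {
        my ($body, $hdr) = @_;
        # forward the response to the original client here
    },
);
```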

With a) and b), the API server will reuse connections less often,
which can theoretically result in a performance drop. To gain
confidence that the performance impact is tolerable, here are the
results of a simple benchmark.

The benchmark runs hey [3] against a virtual 3-node PVE cluster, with
or without the patch applied. It performs 10000 requests in 2 worker
threads to `PUT $HTTP_NODE:8006/api2/json/nodes/$PROXY_NODE/config`
with a JSON payload that sets a 32KiB ASCII `description`. The
shortened hey invocation:

    hey -H "$TOKEN" -m PUT -T application/json -D payload.json \
        --disable-keepalive -n 10000 -c 2 "$URL"

The endpoint was chosen because it performs little work (locks and
writes a config file), it is protected (to test behavior change a)),
and it is a PUT endpoint (to test behavior change b)).

The command is run two times:

- With $HTTP_NODE == $PROXY_NODE for pveproxy->pvedaemon proxying
- With $HTTP_NODE != $PROXY_NODE for pveproxy->pveproxy->pvedaemon
  proxying

For each invocation, we record the response times.

Without this patch:

  $HTTP_NODE == $PROXY_NODE

  Slowest:      0.0215 secs
  Fastest:      0.0061 secs
  Average:      0.0090 secs
  0.006 [1]     |
  0.008 [2409]  |■■■■■■■■■■■■■■■■■■■■■■■■
  0.009 [4065]  |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■
  0.011 [1781]  |■■■■■■■■■■■■■■■■■■
  0.012 [1024]  |■■■■■■■■■■
  0.014 [414]   |■■■■
  0.015 [196]   |■■
  0.017 [85]    |■
  0.018 [21]    |
  0.020 [2]     |
  0.022 [2]     |

  $HTTP_NODE != $PROXY_NODE

  Slowest:      0.0584 secs
  Fastest:      0.0075 secs
  Average:      0.0105 secs
  0.007 [1]     |
  0.013 [8445]  |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■
  0.018 [1482]  |■■■■■■■
  0.023 [56]    |
  0.028 [5]     |
  0.033 [1]     |
  0.038 [0]     |
  0.043 [0]     |
  0.048 [0]     |
  0.053 [5]     |
  0.058 [5]     |

With this patch:

  $HTTP_NODE == $PROXY_NODE

  Slowest:      0.0194 secs
  Fastest:      0.0062 secs
  Average:      0.0088 secs
  0.006 [1]     |
  0.007 [1980]  |■■■■■■■■■■■■■■■■■■■
  0.009 [4134]  |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■
  0.010 [1874]  |■■■■■■■■■■■■■■■■■■
  0.011 [1406]  |■■■■■■■■■■■■■■
  0.013 [482]   |■■■■■
  0.014 [93]    |■
  0.015 [16]    |
  0.017 [5]     |
  0.018 [4]     |
  0.019 [5]     |

  $HTTP_NODE != $PROXY_NODE

  Slowest:      0.0369 secs
  Fastest:      0.0091 secs
  Average:      0.0121 secs
  0.009 [1]     |
  0.012 [5711]  |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■
  0.015 [3392]  |■■■■■■■■■■■■■■■■■■■■■■■■
  0.017 [794]   |■■■■■■
  0.020 [79]    |■
  0.023 [16]    |
  0.026 [3]     |
  0.029 [2]     |
  0.031 [0]     |
  0.034 [1]     |
  0.037 [1]     |

Comparing the averages, there is

- little difference when $HTTP_NODE == $PROXY_NODE (0.009s vs
  0.0088s). So for pveproxy->pvedaemon proxying, the effect of
  disabling connection reuse seems negligible.
- ~15% overhead when $HTTP_NODE != $PROXY_NODE (0.0105s vs 0.0121s).
  Such an increase for pveproxy->pveproxy->pvedaemon proxying is not
  nothing, but in real-world workloads I'd expect the response time
  for non-idempotent requests to be dominated by other factors.

[1] https://metacpan.org/pod/AnyEvent::HTTP#persistent-=%3E-$boolean
[2] https://pve.proxmox.com/pve-docs/api-viewer/index.html#/nodes/{node}/qemu/{vmid}/sendkey
[3] https://github.com/rakyll/hey

Suggested-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
Signed-off-by: Friedrich Weber <f.weber@proxmox.com>
2024-10-04 12:44:08 +02:00
Fabian Grünbichler
c7ce508372 download handling: adapt to method schema field rename
check for both variants for now, and remove (old) alias with next major release

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-09-23 17:25:22 +02:00
Fabian Grünbichler
af9d2fe4f6 handler: remove support for directly returned download info
this was only used by PMG's HttpServer and for non-API file responses. all of
those got dropped there in favour of always returning an object like

{
    data => {
        download => {
            [download info here]
        },
        [..],
    },
    [..],
}

in case of PMG, or passing in a download hash in case of APIServer internal
calls.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-09-23 17:25:22 +02:00
Fabian Grünbichler
e1f830d1e3 handler: only allow downloads for annotated endpoints
only a few API endpoints should allow downloads, mark them explicitly and
forbid downloading for the rest.

Fixes: 6d832db ("allow 'download' to be passed from API handler")

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-09-23 17:25:22 +02:00
Maximiliano Sandoval
589c8c6cd4 http: support Content-Encoding=deflate
Add support for compressing the body of responses with
`Content-Encoding: deflate` following [RFC9110]. Note that in this
context `deflate` is actually a "zlib" data format as defined in
[RFC1950].

To preserve the current behavior we prefer `Content-Encoding: gzip`
whenever `gzip` is listed as one of the encodings in the
`Accept-Encoding` header and the data should be compressed.

[RFC9110] https://www.rfc-editor.org/rfc/rfc9110#name-deflate-coding
[RFC1950] https://www.rfc-editor.org/rfc/rfc1950
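
A sketch of the encoding choice with Compress::Zlib, whose compress()
emits exactly this zlib format (the header handling around it is
assumed):

```perl
use Compress::Zlib;   # exports compress() and memGzip()

my $accept  = 'gzip, deflate';     # from the Accept-Encoding header
my $content = 'response body...';
my ($encoding, $encoded);

if ($accept =~ /\bgzip\b/) {
    # prefer gzip whenever listed, preserving the current behavior
    ($encoding, $encoded) = ('gzip', memGzip($content));
} elsif ($accept =~ /\bdeflate\b/) {
    # "deflate" means zlib-wrapped (RFC 1950) data per RFC 9110
    ($encoding, $encoded) = ('deflate', compress($content));
}
# set "Content-Encoding: $encoding" and send $encoded
```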

Suggested-by: Lukas Wagner <l.wagner@proxmox.com>
Signed-off-by: Maximiliano Sandoval <m.sandoval@proxmox.com>
Tested-by: Folke Gleumes <f.gleumes@proxmox.com>
2024-04-18 14:36:20 +02:00
Friedrich Weber
a8dd7b668e access control: avoid "uninitialized value" warning if using IP ranges
ALLOW_FROM/DENY_FROM accept any syntax understood by Net::IP. However,
if an IP range like "10.1.1.1-10.1.1.3" is configured, a confusing
Perl warning is printed to the syslog on a match:

  Use of uninitialized value in concatenation (.) or string at [...]

The reason is that we use Net::IP::prefix to prepare a debug message,
but this returns undef if a range was specified. To avoid the warning,
use Net::IP::print to obtain a string representation instead.
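
For illustration:

```perl
use Net::IP;

my $ip = Net::IP->new('10.1.1.1-10.1.1.3') or die Net::IP::Error();

my $prefix = $ip->prefix();   # undef for a plain range -> caused the warning
my $string = $ip->print();    # always a printable representation
```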

Signed-off-by: Friedrich Weber <f.weber@proxmox.com>
2024-01-30 11:11:34 +01:00
Fabian Grünbichler
365c5d1d48 fix #4859: properly configure TLSv1.3 only mode
set_min/max_proto_version is recommended upstream nowadays, and it seems to be
required for some reason if *only* TLS v1.3 is supposed to be enabled.

querying via get_options gives us the union of
- system-wide openssl defaults
- our internal SSL defaults
- flags configured by the user via /etc/default/pveproxy

note that by default only 1.2 and 1.3 are enabled in the first place, so
disabling either leaves a single version being set as min and max.
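
a sketch of the recommended calls (the Net::SSLeay functions and
constants are real, but need a recent OpenSSL; the context setup is
illustrative):

```perl
use Net::SSLeay;

Net::SSLeay::initialize();
my $ctx = Net::SSLeay::CTX_new();

# pin both ends of the protocol range when only TLS v1.3 is enabled
Net::SSLeay::CTX_set_min_proto_version($ctx, Net::SSLeay::TLS1_3_VERSION());
Net::SSLeay::CTX_set_max_proto_version($ctx, Net::SSLeay::TLS1_3_VERSION());
```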

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-07-20 12:42:58 +02:00
Fabian Grünbichler
91787a2003 fix #4802: reduce CA lookups while proxying
OpenSSL as packaged in Debian bookworm now ships a compat symlink for
the "combined" CA certificates file (CAfile) as managed by
update-ca-certificates. This symlink is in addition to the CApath
one that has been around for a while. The new symlink in turn gets
picked up by openssl-using code that uses the default values for the
trust store.

Every TLS context initialization now reads the full combined file,
even if no TLS is actually employed on a connection. We do such an
initialization for every proxied connection (where our HTTP server is
the client).

By specifying an explicit CA path (that is identical to the default
one), the old behaviour of looking up each CA certificate
individually iff needed is enabled again.

For an API endpoint where HTTP request handling is the bottleneck
(as opposed to the actual API handler), this improves performance of
proxied requests to be back in line with unproxied ones handled
directly by the unprivileged daemon. For all proxied requests, CPU
usage is decreased as well.

The default CAfile and CApath contain the same certificates, so there
should be no change in trusted certificates. Additionally,
certificate fingerprints are pinned in this context and verified
against the cache of pinned fingerprints.
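
a sketch of the idea with AnyEvent::TLS (the path shown is the Debian
default; the surrounding context setup is assumed):

```perl
use AnyEvent::TLS;

# an explicit ca_path makes OpenSSL look up each CA certificate by its
# hashed symlink on demand instead of parsing the combined CAfile on
# every TLS context initialization
my $tls = AnyEvent::TLS->new(
    ca_path => '/etc/ssl/certs',   # same contents as the default CAfile
    verify  => 1,
);
```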

Reported-by: Roland Kletzing <roland.kletzing@cybercon.de>
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-07-03 09:35:30 +02:00
Dominik Csapak
9a43feac82 avoid AnyEvent::AIO to fix CPU spinning if pure-perl lib is installed
when installing AnyEvent::AIO (via the package libanyevent-aio-perl),
the worker forks of our daemons using AnyEvent would consume 100% CPU
cycles in an epoll_wait on an fd that no one read from. It was not
really clear which part of the code set that fd up.

Reading the documentation of the related perl modules, it became
clear that the issue was with AnyEvent::IO. By default this uses
AnyEvent::AIO (if installed) which in turn uses IO::AIO which
explicitly says it uses pthreads and is not really fork compatible
(which we rely heavily upon).

It seems that IO::AIO sets up some fds with epoll in the END handler
of its library (or earlier, but sends data to it in the END handler),
so that using 'exit' instead of 'POSIX::_exit' (which we do in
PVE::Daemon) creates the observed behavior.

Interestingly, we did not use any of AnyEvent::IO's functionality, so
we can safely remove it. Even if we had used it in the past, without
AnyEvent::AIO the IO would not have been async anyway (the pure-Perl
implementation doesn't do async IO). My best guess is that we wanted
to use it, noticed that we can't, and forgot to remove the use
statement. (This is indicated by a comment that says aio_load is not
async unless IO::AIO is used.)

This only occurs now, since bookworm is the first debian release to
package the library.

if we ever wanted to use AnyEvent::AIO, there are probably two other
ways that could fix it:
* replace our 'exit()' calls with 'POSIX::_exit()', which seems to
  fix it, but other side effects are currently unknown (see the
  illustration after this list)
* use 'IO::AIO::reinit()' after forking, which also seems to fix it,
  but perldoc says it 'is not an operation supported by any
  standards, but happens to work on GNU/LINUX and some newer BSD
  systems'
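
An illustration of the first alternative (not the applied fix, which
simply drops the use statement):

```perl
use POSIX ();

# POSIX::_exit() terminates immediately: no END blocks and no global
# destruction, so IO::AIO's END-time epoll traffic never happens
POSIX::_exit(0);   # plain exit(0) would run END blocks first
```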

With this fix, one can safely install 'libanyevent-aio-perl' and
'libperl-languageserver-perl' (the only user of it AFAICS) on a
Proxmox VE or Proxmox Mail Gateway system.

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-07-03 09:34:50 +02:00
Stoiko Ivanov
81f0f40ea1 proxy request: handle missing content-type header
In case the actual request body is empty, it seems browsers set no
Content-Type header.

Tested on a VM by stopping and starting a container via the GUI
(/api2/extjs/nodes/<nodename>/lxc/<vmid>/status/stop)

fixes f398a3d94b

Reported-by: Friedrich Weber <f.weber@proxmox.com>
Reported-by: Fiona Ebner <f.ebner@proxmox.com>
Signed-off-by: Stoiko Ivanov <s.ivanov@proxmox.com>
2023-06-09 18:57:28 +02:00
Dominik Csapak
bd9d641690 use proper arrays for array parameter
since there is no other way to get an array parameter when using
x-www-form-urlencoded content type

the previous format with \0 separated strings (known as '-alist' format)
should not be used anymore (in favor of the now supported arrays)
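
a rough sketch of the idea, not the actual parser:

```perl
use URI::Escape qw(uri_unescape);

my $body = 'names=a&names=b&comment=hello%20world';   # example input

# repeated keys become a proper array reference instead of the old
# "\0"-joined '-alist' string
my $params = {};
for my $pair (split(/&/, $body)) {
    my ($k, $v) = split(/=/, $pair, 2);
    $v //= '';
    ($k, $v) = map { uri_unescape($_ =~ tr/+/ /r) } ($k, $v);
    if (exists($params->{$k})) {
        $params->{$k} = [$params->{$k}] if ref($params->{$k}) ne 'ARRAY';
        push @{$params->{$k}}, $v;
    } else {
        $params->{$k} = $v;
    }
}
# $params is now { names => ['a', 'b'], comment => 'hello world' }
```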

Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com>
Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2023-06-07 13:16:57 +02:00
Dominik Csapak
f398a3d94b proxy request: forward json content type and parameters
instead of always trying to encode them as x-www-form-urlencoded

Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com>
Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2023-06-07 13:16:54 +02:00
Thomas Lamprecht
1e9befeb80 multiline parameter style nit fix
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-06-06 17:12:50 +02:00
Thomas Lamprecht
148dc08e90 replace junior semicolon with actual one
commas can be used in two ways, quoting Perl Best Practices (PBP):

> The comma actually has two distinct roles in Perl. In a scalar
> context, it is (as those former C programmers expect) a sequencing
> operator: “do this, then do that”. But in a list context, such as
> the argument list of a print, the comma is a list separator, not
> technically an operator at all.
-- PBP, page 69

And the separating variant is called a "junior semicolon" by PBP.
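
An illustrative example:

```perl
my ($count, $total, $len) = (0, 0, 10);

# comma as "junior semicolon": a sequencing operator in scalar context
$count += 1, $total += $len;

# the same statements with actual semicolons, as intended
$count += 1; $total += $len;
```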

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-04-14 16:38:35 +02:00
Thomas Lamprecht
cb53bd6861 explicitly disallow tmpfilename parameter in query URL
This is an internal parameter and we pass the actual internal one
around via the $reqstate variable, so avoid confusion and return a
clear error if a POST request sets this query parameter.

Reported-by: Friedrich Weber <f.weber@proxmox.com>
Suggested-by: Friedrich Weber <f.weber@proxmox.com>
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-04-14 16:22:40 +02:00
Matthias Heiserer
a2a3d17be8 file upload: don't calculate MD5, log file name instead
Until now, we calculated the MD5 hash of any uploaded file during the
upload, regardless of whether the user chose to provide a hash sum
and algorithm. The hash was only logged in the syslog.

As the user can provide a hash algorithm and a checksum when
uploading a file, which gets automatically checked (after the
upload), this is not needed anymore. Instead, the file name is
logged.

Depending on the speed of the network and the cpu, upload speed or
CPU usage might improve: All tests were made by uploading a 3.6GB iso
from the PVE host to a local VM. First line is with md5, second
without.

no networklimit
multipart upload complete (size: 3826831360B time: 20.310s rate: 179.69MiB/s md5sum: 8c651682056205967d530697c98d98c3)
multipart upload complete (size: 3826831360B time: 16.169s rate: 225.72MiB/s filename: ubuntu-22.04.1-desktop-amd64.iso)

125MB/s network
In this test, the pveproxy worker used x% CPU during the upload. As you can see, the reduced CPU usage is noticeable in slower networks.
~75% CPU: multipart upload complete (size: 3826831360B time: 30.764s rate: 118.63MiB/s md5sum: 8c651682056205967d530697c98d98c3)
~60% CPU: multipart upload complete (size: 3826831360B time: 30.763s rate: 118.64MiB/s filename: ubuntu-22.04.1-desktop-amd64.iso)

qemu64 cpu, no network limit
multipart upload complete (size: 3826831360B time: 46.113s rate: 79.14MiB/s md5sum: 8c651682056205967d530697c98d98c3)
multipart upload complete (size: 3826831360B time: 41.492s rate: 87.96MiB/s filename: ubuntu-22.04.1-desktop-amd64.iso)

qemu64, -aes, 1 core, 0.7 cpu
multipart upload complete (size: 3826831360B time: 79.875s rate: 45.69MiB/s md5sum: 8c651682056205967d530697c98d98c3)
multipart upload complete (size: 3826831360B time: 66.364s rate: 54.99MiB/s filename: ubuntu-22.04.1-desktop-amd64.iso)

Signed-off-by: Matthias Heiserer <m.heiserer@proxmox.com>
 [ T: reflow text-width and slightly add to subject ]
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-04-13 12:51:18 +02:00
Friedrich Weber
602eb8aabd multipart upload: properly parse file parts without Content-Type
As reported in the forum, multipart requests are parsed incorrectly if
the file part header contains *only* Content-Disposition, but no other
fields (in particular, no Content-Type). As a result, uploaded files
are mangled: In most cases, an additional carriage return and line
feed (\r\n) is prepended to the file contents.

As an example, consider the following file part (with explicit \r\n
for clarity):

  Content-Disposition: form-data; name=...; filename=...\r\n
  Content-Type: application/x-iso9660-image\r\n
  \r\n
  file contents...

The current parsing code for file parts roughly works as follows:

1) Consume the Content-Disposition field including the trailing \r\n
2) Consume and ignore everything up to and including the next \r\n\r\n
3) Read the file contents

This works fine in the example above. However, it has a bug in case
Content-Disposition is the *only* header field:

  Content-Disposition: form-data; name=...; filename=...\r\n
  \r\n
  file contents...

Now, step 1 already consumes the first half of the \r\n\r\n sequence
that marks the end of the part headers. As a result, step 3 starts
reading the file at a wrong offset:

- If the remaining contents of the read buffer (currently sized 16KiB)
  contain \r\n\r\n, step 2 consumes everything up to and including
  this marker and step 3 starts reading file contents there. As a
  result, the uploaded file is truncated at its beginning.
- Otherwise, step 2 is a noop and step 3 considers the remaining
  second half of the \r\n\r\n marker to be part of the file contents.
  As a result, the uploaded file is prepended with an extra \r\n.

To fix this, modify step 1 to *not* consume the trailing \r\n. This
keeps the \r\n\r\n marker intact, no matter whether additional header
fields are present or not.
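
A sketch of the fixed step 1 (the regex shown is illustrative, not the
actual patch):

```perl
my $rbuf = qq{Content-Disposition: form-data; name="content"; filename="a.iso"\r\n\r\nfile data...};

# match the Content-Disposition line but leave the trailing \r\n in the
# buffer via a lookahead, so the \r\n\r\n end-of-headers marker stays
# intact even when no other header fields follow
if ($rbuf =~ s/^Content-Disposition:\s*form-data;\s*name="([^"]*)"(?:;\s*filename="([^"]*)")?(?=\r\n)//) {
    my ($name, $filename) = ($1, $2);
    # the next step can now reliably consume up to and including \r\n\r\n
}
```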

Fixes: 3e3faddb4a
Link: https://forum.proxmox.com/threads/125411/
Signed-off-by: Friedrich Weber <f.weber@proxmox.com>
2023-04-11 14:38:22 +02:00
Fabian Grünbichler
4737252f60 header processing: add comments
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-03-07 11:20:09 +01:00
Fabian Grünbichler
14f39f121c header processing: explicit return 0
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-03-07 11:20:09 +01:00
Fabian Grünbichler
b636292c6c header processing: style fixups
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-03-07 11:20:09 +01:00
Max Carrara
d8898f5e20 fix whitespace
Signed-off-by: Max Carrara <m.carrara@proxmox.com>
2023-03-07 11:19:59 +01:00
Max Carrara
933a4dbbaf fix #4494: redirect HTTP to HTTPS
Allow HTTP connections up until the request's header has been
parsed and processed. If no TLS handshake has been completed
beforehand, the server now responds with either a
'301 Moved Permanently' or a '308 Permanent Redirect' as noted in the
MDN web docs[1].

This is done after the header was parsed; for the redirect to work,
the `Host` header field of the request is used to create the
`Location` field of the response. This makes redirections independent
of how the server is accessed (e.g. via IP, localhost, FQDN, ...)
possible.

Upon redirection the client is immediately disconnected; otherwise,
they would have to wait for the connection to time out until
they may reconnect via TLS again.

[1] https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/301
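
A sketch of the response construction (the variables and their values
are illustrative assumptions):

```perl
use HTTP::Status qw(status_message);
use HTTP::Response;

my ($method, $host, $path) = ('POST', 'pve.example.com:8006', '/api2/json/version');

# 301 for GET/HEAD; 308 otherwise, so the request method is preserved
my $code = ($method eq 'GET' || $method eq 'HEAD') ? 301 : 308;

my $resp = HTTP::Response->new($code, status_message($code));
$resp->header(Location => "https://$host$path");
# send $resp, then disconnect the client right away
```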

Signed-off-by: Max Carrara <m.carrara@proxmox.com>
2023-03-07 11:19:32 +01:00
Max Carrara
f2e54bb78a header processing: factor out auth and request handling
The part responsible for authentication and subsequent request
handling is moved into the new `authenticate_and_handle_request`
subroutine.

If `authenticate_and_handle_request` doesn't return early, it returns
`1` for further control flow purposes.

Some minor things are formatted or renamed for readability's sake.

Signed-off-by: Max Carrara <m.carrara@proxmox.com>
2023-03-07 11:18:41 +01:00
Max Carrara
bda4864145 header processing: extract into separate subroutine
The code concerned with processing the request's header in
`unshift_read_header` is moved into the new `process_header`
subroutine.

If `process_header` doesn't return early, it returns `1` for further
control flow purposes.

Some minor things are formatted or renamed for readability's sake.

Signed-off-by: Max Carrara <m.carrara@proxmox.com>
2023-03-07 11:18:10 +01:00
Thomas Lamprecht
435dbe0c06 multipart upload: code cleanup/reuse
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-03-06 13:01:16 +01:00
John Hollowell
0b6b3b372b multipart upload: remove ignore-whitespace flag from regex
the flag made the regex rather harder to read and is now unnecessary

Signed-off-by: John Hollowell <jhollowe@johnhollowell.com>
 [ T: resolve merge conflict and add commit message ]
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-03-06 13:00:17 +01:00
John Hollowell
3e3faddb4a fix #4344: http-server: ignore unused multipart headers
In commit 0fbcbc2 ("fix #3990: multipart upload: rework to fix
uploading small files") a breaking change was added which now
requires the file's multipart part to have a `Content-Type` even
though the content type is never used. It is just included to consume
those bytes so phase 2 (dumping the file contents into the file) can
continue.

Avoid this overly strict and unused requirement.

Signed-off-by: John Hollowell <jhollowe@johnhollowell.com>
 [ T: resolve merge conflict, add telling commit message ]
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-03-06 12:54:09 +01:00
Matthias Heiserer
44791210d7 multipart upload: ignore trailing-newline requirement from spec
Allow upload without trailing newline, even though this is not
compliant with RFC 1521.

RFC 1521 mandates that the close-delimiter ends in a newline:
'close-delimiter := "--" boundary "--" CRLF'

However, some software (e.g. postman) sends their request without a
trailing newline, which resulted in failing uploads (see the sketch
below).
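
A sketch of the relaxed check (illustrative, not the actual patch):

```perl
my $boundary = 'EPIHyQJFC5ftgoXHMe8-Jc6E7FqA4oMb0QBfOTz';   # example value
my $rbuf     = "--$boundary--";   # close-delimiter without trailing CRLF

# accept the close-delimiter with or without the trailing CRLF that
# RFC 1521 mandates
if ($rbuf =~ /^--\Q$boundary\E--(?:\r\n)?/m) {
    # end of the multipart body reached
}
```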

Signed-off-by: Matthias Heiserer <m.heiserer@proxmox.com>
Reviewed-by: Daniel Tschlatscher <d.tschlatscher@proxmox.com>
Tested-by:  Daniel Tschlatscher <d.tschlatscher@proxmox.com>
2022-12-13 13:17:41 +01:00
Matthias Heiserer
e3295acc78 fix multipart upload: ignore additional headers
Reported in the forum:
https://forum.proxmox.com/threads/image-upload-fails-after-upgrading-from-7-1-to-7-3.119051/#post-516517

When additional headers existed in the request body, the upload failed.
With this patch, all additional headers get ignored.

Example: The following upload would fail because no headers were
expected after Content-Disposition.

```
--EPIHyQJFC5ftgoXHMe8-Jc6E7FqA4oMb0QBfOTz
Content-Disposition: form-data; name="content"
Content-Type: text/plain; charset=ISO-8859-1

iso
```
These headers now also get ignored, as we don't use them.

Also, upload now works when the Content-Disposition header isn't the
first, i.e.:
```
--XVH95dt1-A3J8mWiLCmHCW4roSC7-gBntjATBy--
Content-Type: text/plain; charset=ISO-8859-1
Content-Disposition: form-data; name="content"
```

Fixed upload was tested using
* Curl
* GUI
* Apache HttpClient 5

Signed-off-by: Matthias Heiserer <m.heiserer@proxmox.com>
Reviewed-by: Daniel Tschlatscher <d.tschlatscher@proxmox.com>
Tested-by:  Daniel Tschlatscher <d.tschlatscher@proxmox.com>
2022-12-13 13:16:31 +01:00
Matthias Heiserer
26ea294acd multipart upload: fix upload of files starting with newlines
Currently, if a file starts with a newline, it gets removed and the
upload succeeds (provided no hash is given).

Signed-off-by: Matthias Heiserer <m.heiserer@proxmox.com>
Reviewed-by: Daniel Tschlatscher <d.tschlatscher@proxmox.com>
Tested-by:  Daniel Tschlatscher <d.tschlatscher@proxmox.com>
2022-12-13 13:16:19 +01:00
Dominik Csapak
5a08cdbedf remove dead code 'parse_content_disposition'
our recent change to parsing the upload headers made that code
unnecessary, so remove it

Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2022-11-07 16:42:32 +01:00
Dominik Csapak
a66b77d850 upload: re-allow white space in filenames
Some fields (e.g. filename) can contain spaces, but our 'trim'
function would only return the value up to the first whitespace
character instead of removing leading/trailing white space. This led
to passing the wrong filename to the API call (e.g. 'foo' instead of
'foo (1).iso'), which would then reject it because of the 'wrong'
extension.

Fix this by just using the battle-proven trim from pve-common.
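
For illustration:

```perl
use PVE::Tools;

# removes only leading/trailing whitespace; inner spaces survive
my $filename = PVE::Tools::trim(' foo (1).iso ');   # 'foo (1).iso'
```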

Fixes: 0fbcbc2 ("fix #3990: multipart upload: rework to fix uploading small files")
Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>
2022-11-07 16:40:15 +01:00
Thomas Lamprecht
5339ae14d9 unshift_read_header: minor code style improvement
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-29 17:05:34 +02:00
Thomas Lamprecht
32163b8e11 multipart upload: report duration with millisecond precision in syslog
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-29 17:05:34 +02:00
Thomas Lamprecht
aad755eb35 multipart upload: avoid some extra lines and general code style fixes
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-29 17:05:34 +02:00
Thomas Lamprecht
dafe441609 multipart upload: avoid code duplication in writing data to tmp file
Separate the flow into first getting the length and data reference
and only then handle writing/digest in a common way

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-29 17:05:34 +02:00
Thomas Lamprecht
59128f6b5a multipart upload: factor out content-disposition extraction
and improve the boundary variable helper name by adding a _re postfix
and using snake case everywhere.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-29 17:05:34 +02:00
Thomas Lamprecht
42ec24969f multipart upload: drop unused variables
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-09-29 17:05:34 +02:00
Matthias Heiserer
0fbcbc2628 fix #3990: multipart upload: rework to fix uploading small files
== The problem
Upload of files smaller than ~16kb failed.
This was because the code assumed that it would be called
several times, and each time would do a certain action.
When the whole file was in the buffer, this failed because
the function would try parsing the first part in the payload and
then return, instead of parsing the rest of the available data.

== Why not just modify the current code a bit?
The code had a lot of nested control statements and a
non-intuitive control flow (phase 0->2->1->1->1 and so on).

The way the phases and buffer content were checked made it
rather difficult to just fix a few lines.

== What was changed
* Part headers are parsed with a single regex line each,
 which improves code readability.

* Parsing the content is done in order, so even if the whole data is in the buffer,
 it can be read in one go. Files of arbitrary sizes can be uploaded.

== Tested with
* Uploaded 0B, 1B, 14KB, 16KB, 1GB, 10GB, 20GB files

* Tested with all checksums and without

* Tested on firefox, chromium, and pvesh

I didn't do any fuzzing or automated upload testing.

== Drawbacks & Potential issues
* Part headers are hardcoded; adding new ones requires modifying this file

== does not fix
* upload can still time out

Signed-off-by: Matthias Heiserer <m.heiserer@proxmox.com>
Tested-by: Daniel Tschlatscher <d.tschlatscher@proxmox.com>
2022-09-29 17:04:34 +02:00
Matthias Heiserer
91b86f4e2d AnyEvent: whitespace fix
and remove unnecessary parentheses.

Signed-off-by: Matthias Heiserer <m.heiserer@proxmox.com>
2022-09-29 14:40:46 +02:00
Daniel Tschlatscher
9c1388daf1 acknowledge content-disposition header
Acknowledging the Content-Disposition header makes it possible for the
backend to tell the browser whether a file should be downloaded,
rather than displayed inline, and what its default name should be.

Signed-off-by: Daniel Tschlatscher <d.tschlatscher@proxmox.com>
2022-09-29 14:35:32 +02:00
Thomas Lamprecht
4099febef5 request: add missing early return to complete error check
While $self->error will immediately send out a 4xx or 5xx response
anyhow, it's still good to guard against possible side effects (e.g.,
from future code in that branch) on the server and return directly.

Note that this is mostly for completeness' sake; we already have
another check that covers this one for relevant cases in commit
580d540ea9.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2022-07-04 11:08:19 +02:00
Thomas Lamprecht
c2bd69c7b5 requests: assert that there's no @ in the URL's authority
We don't expect any userinfo in the authority, and to avoid this
allowing some leverage for doing weird things later, it's better to
error out early on such requests.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Originally-by: Wolfgang Bumiller <w.bumiller@proxmox.com>
2022-07-02 08:27:13 +02:00
Thomas Lamprecht
e9df8a6e76 pass through streaming: only allow from privileged local pvedaemon
Ensures that no external request can control streaming on proxied
requests, as a safety net for when we'd have another issue in the
request handling part.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
Originally-by: Wolfgang Bumiller <w.bumiller@proxmox.com>
2022-07-02 07:59:53 +02:00