haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-05-27 03:33:36 -04:00

Author	SHA1	Message	Date
Olivier Houchard	de3f245df0	BUG/MEDIUM: servers: Store the connection hash with the parameter cache When we store the negociated server parameters, such as the ALPN, also store the calculated hash with the connection. If it is different, as can happen because the IP address is different because set-dst was used, we certainly do not want to reuse the information in the cache, otherwise we could end up using the wrong ALPN and mux. That means we already have to calculate the hash in connect_server() now, while before we would not do it for Websockets, if we could not do connection reuse, as that's all the hash was used for. This should fix Github issue #3386 This should be backported as far as 3.2.	2026-05-20 10:29:22 +02:00
Amaury Denoyelle	e139dd90e3	MAJOR: mux_quic: support stream elasticity during connection lifetime Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details qcc_release_remote_stream() is called each time a remote stream is closed. Flow control accounting is updated and when necessary, a MAX_STREAMS_BIDI frame is prepared to allow the peer to initiate new streams. This patch extends stream elasticity features with the QUIC bidirection stream flow control mechanism. The announced value can now be possibly reduced depending on conn_calc_max_streams(). The first step is to decrement closed streams from the global committed extra streams total. This must be performed conn_calc_max_streams() to ensure the calculation will be valid. Then, there is two cases depending on conn_calc_max_streams() result. If the value is less than the peer still remaining stream window, nothing more is performed. If the opposite case, flow control must be increased and a MAX_STREAMS_BIDI frame is prepared, with the value adjusted to not exceed the stream elasticity limit. Global extra streams total is then finally incremented. This calcul also ensures that when all streams are closed, global extra streams accounting operations are decremented by 1, as a connection always has access to one stream which is excluded from the global total. Note that if stream elasticity is not active, flow control increases principle is unchanged and remains statically performed. This patch is labelled as major as it complexifies bidirectional stream flow control mechanisme. This is a sensitive operation as there is a risk of connection freeze if flow control updates are inadvertently skipped.	2026-05-20 09:52:50 +02:00
Amaury Denoyelle	89f3975acc	MINOR: mux_quic: define ms_bidi_rel QCC member Add a new QCC member <ms_bidi_rel>. This represents the number of concurrent streams advertised similarly to ms_bidi, but as a relative value. This patch does not introduce any functional change. For now, <ms_bidi_rel> will be equal to <ms_bidi_init>. However, with the implementation of stream elasticity and dynamic adjustment for concurrent max-streams-bidi, the former will be required to keep the last advertised value.	2026-05-20 09:52:50 +02:00
Amaury Denoyelle	d21ec4c707	MINOR: quic: use stream elasticity value for initial advertisement When stream elasticity is active, the maximum number of concurrent bidi streams advertised via transport parameters is now reduced depending on the connection load. This is implemented via conn_calc_max_streams() which returns the value to use. This is not applied on listeners with enabled 0-RTT. Indeed, for such connections, clients are expected to reuse the previously seen transport parameters. The server on the other hand must not decrease several values on the newly advertised params, in particular for the maximum number of concurrent bidi streams. The simplest way to prevent 0-RTT failure is to not mix stream elasticity with it. Note that the 0-RTT limitation is only applied for the initial value : during the connection lifetime, stream elasticity can still be used by the MUX to dynamically reduce the stream window. This will be implemented in a future patch.	2026-05-20 09:52:50 +02:00
Amaury Denoyelle	e4adba6e64	MINOR: mux_quic: implement basic committed_extra_streams accounting Account QUIC frontend connections into committed_extra_streams when stream elasticity setting is active. This is performed in QCC init and release functions. This patch has no impact on QUIC subsystem for now. Connections will still allow a static number of concurrent streams based on tune.quic.fe.stream.max-concurrent. However, this has a direct repercussion on H2 subsystem, as a higher count of QUIC connections will reduce the concurrent streams allowed there.	2026-05-20 09:52:50 +02:00
Amaury Denoyelle	33c8270903	OPTIM: h2: do not update committed streams if elasticity disabled When streams-elasticity is enabled in the configuration, H2 mux is responsible to update the global committed_extra_streams value. Adjust these operations to ensure they are skipped if streams-elasticity is disabled, which is the current default. This prevents unnecessary atomic operations in this case. No need to backport unless streams-elasticity feature is picked in older releases.	2026-05-20 09:52:50 +02:00
Amaury Denoyelle	ad3562fea1	MINOR: h2: explain committed_extra_streams dec on h2_init() error h2_init() is now responsible to increment committed_extra_streams for new frontend connections, in relation to the newly implemented stream-elasticity feature. In case of an early error, a mirroring decrement is executed on fail_stream label. However, for now this error label can only be selected via BE conns. In fact, it's not yet possible for h2_init() to fail after the extra streams increment. However, the decrement operation is kept to prevent any omissions in case of future evolutions of h2_init() error path. To prevent reporting of a possible dead code, add an extra comment which summarizes the situation.	2026-05-20 09:52:50 +02:00
Maxime Henrion	641fe4f119	MEDIUM: startup: add automatic chroot feature It is now possible to use "chroot auto" in the configuration. This lets haproxy create an anonymous (cleaned up after the process terminates) and read-only directory for chroot. This directory is created in /tmp; we might want to support creating it in a different directory in the future, either by respecting $TMPDIR or by allowing an optional directory after the "auto" keyword.	2026-05-20 08:34:24 +02:00
Maxime Henrion	2d2980408f	MINOR: startup: support unprivileged chroot if possible Try to use unshare(CLONE_NEWUSER) if available so we can have a chroot as an unprivileged user. This is a Linux-only mechanism.	2026-05-20 08:34:17 +02:00
Willy Tarreau	7004bb3b8c	MINOR: backend: support hash-key guid for a stabler distribution Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details When server fleets are constantly updated, using a stable distribution across a bunch of load balancers can be convenient. The addr and port already provide a bit of this but for situations were addresses might differ between sites or change dynamically this does not work. The guid is perfect for this because by definition it's supposed to designate a single server and be unique. So when two servers anywhere have the same, the tool that provisionned them promises that they are the same server. So here we introduce "hash-key guid" which performs a 32-bit hash on the GUID value. When no guid is provided, a fallback is performed on ID, as is done for other keys.	2026-05-19 19:11:25 +02:00
Willy Tarreau	a59e6e5efd	MINOR: server: support hash-key id32 for a cleaner distribution The "id" hash-key scales the ID by a factor of 16 that tries to leave room between the nodes on the 32-bit space to permit smooth weight variations (e.g. during slowstart). However this does not deal well with overlaps between server IDs. For example, assigning IDs that are only multiples of 256 million to 16 servers yields traffic only on one since in practice they all have the same 28 lower bits. The new "id32" hash key bridges this gap by using the full 32-bit ID of the server as the key. On the other hand, the user must be careful not to switch the hash function to "none" when using incremental IDs because in this case they might be very poorly distributed. But this can be convenient for automated provisionning systems which assign IDs themselves, as the full 32 bits are used now.	2026-05-19 19:11:25 +02:00
Willy Tarreau	cb5d98c495	BUG/MINOR: backend: fix balance hash calculation when using hash-type none The "hash-type xxx none" is broken for keys that are not in type string because the sample fetch call casts them to SMP_T_BIN, that tends to preserve the original format (integers, IP addresses etc), but the gen_hash() function in case of BE_LB_HFCN_NONE expects to read a string representing a number, that it parses to retrieve the value, and just fails on many binary types. For example, the following just always returns key 0: balance hash rand() hash type consistent none An ugly workaround is to make sure the expression returns a string, for example this: balance hash rand(),concat() hash type consistent none In order to fix most cases here, we force the conversion to type string when using BE_LB_HFCN_NONE, but a better approach would require a larger rework and split gen_hash() or change it to accept an integer as well, so that the caller could cast to SMP_T_INT for BE_LB_HFCN_NONE and pass the resulting number already parsed with the least information loss. In this case even IPv4 addresses would be preserved. The current approach at least addresses the initially envisioned use cases, and the limitations have been added to the doc. This can be backported to 3.0 though it's not really important.	2026-05-19 19:11:25 +02:00
Willy Tarreau	f2bf3483ba	BUG/MINOR: server: accept server IDs above 2^31 and clarify error message Due to the check of the stored value instead of the parsed one, it was not permitted to use server IDs above 2^31 while they are perfectly possible. Let's refine the parsing and also update the error message to indicate the range. The doc was also refined to reflect the relation with hash-key. This may be backported though it wouldn't have any effect on working configs.	2026-05-19 19:11:25 +02:00
Amaury Denoyelle	f2b152c95e	MEDIUM: ssl: allow h3/QMux negotiation without explicit proto Implements automatic selection of QMux MUX if "h3" ALPN has been negotiated on top of TCP/SSL. The first part of this change is to define "alpn" member of mux_proto_list. This is necessary so that conn_get_best_mux_entry() can select it when "h3" has been chosen. As a side-effect, this also automatically sets a default ALPN to "h3" for bind lines with "proto qmux". The most important change is to adapt the SSL layer. On handshake completion, the eligible MUX is retrieved via conn_select_mux_fe/be() functions. If xprt_qmux is required by it, MUX init is delayed and QMux handshake is started first. This last change is necessary as connection flags CO_FL_QMUX_RECV/SEND are only set if "proto qmux" is explicitely set. In case xprt_qmux is activated via pure ALPN negotiation, these flags are also set on xprt_qmux_init(). This is mandatory to ensure emission/reception of QMux transport parameters will be performed as expected.	2026-05-19 18:40:50 +02:00
Amaury Denoyelle	e30bcfe6cd	MINOR: proxy/server: reject TCP ALPN h3 without experimental Add a postparsing check on TCP ALPN bind and server setting. An error is reported if the token "h3" is present and expose-experimental-directives is not globally activated. This ensures that QMux protocol won't be selected if experimental features are not explicitely requested. The check is not performed though if "proto qmux" is explicitely defined, as this setting already checks for experimental support. Currently, it's not possible to activate QMux without any explicit "proto qmux" config. However, this will be implemented in a next patch, so this check will become necessary.	2026-05-19 18:40:50 +02:00
Amaury Denoyelle	879c78c909	MINOR: connection/mux_quic: add MUX <init_xprt> field for QMux handshake The first part of this patch defines a new mux_proto_list field named <xprt_init>. This allows to define an extra XPRT layer which should be activated first prior to the MUX creation both on frontend and backend sides. This is immediately used for QMux mux_proto_list to require XPRT_QMUX handshake. With this change, activation of QMux connection flags in session_accept_fd() and connect_server() are adjusted to take into account <init_xprt> field. This approach is much more evolutive than relying on the previous MUX name. Change in connect_server() will also be necessary to support QMux activation on a TCP server with h3 ALPN without explicit "proto qmux". This guarantees that MUX initialization is delayed after QMux handshake.	2026-05-19 18:40:50 +02:00
Amaury Denoyelle	356f1ab5d7	MINOR: connection: define conn_select_mux_be() This patch is similar to the previous one but this time for backend connections. The MUX selection code is directly extracted from conn_install_mux_chk() and conn_install_mux_be().	2026-05-19 18:40:46 +02:00
Amaury Denoyelle	86ffbaa0f5	MINOR: connection: define conn_select_mux_fe() Define a new function conn_select_mux_fe(). The objective is to have a preliminary function to determine the MUX which will be used without initializing it. This will be useful for MUX which relies on a specific XPRT handshake prior to its startup, which is the case for QMux protocol. The code of conn_select_mux_fe() is identical to the beginning of conn_install_mux_fe() with a similar MUX selection logic. However, connection MUX initialization is not performed in this case. In a future patch, both functions should be merged together to reduce code duplication.	2026-05-19 18:33:54 +02:00
Olivier Houchard	6aab6d4e98	MEDIUM: connections: Use both mux_proto and alpn to pick a mux In conn_get_best_mux() and conn_get_best_mux_entry(), the mux name was provided sometimes based on the "proto" directive, sometimes based on the ALPN, but in any case, it was compared again the mux_proto_list mux_proto field. This is not correct, as ALPN can be different from the internal mux_proto. So enhance those functions so that they wll accept an ALPN as well. If a mux_proto is provided, that will be used, if not, and if an ALPN is provided, then that will be used, and compared against the ALPN provided by the mux, if any.	2026-05-19 18:33:54 +02:00
Olivier Houchard	022681eca2	MINOR: mux: Rename the "token" from mux_proto_list to mux_proto In struct mux_proto_list, rename the "token" field to "mux_proto". That field should only be used to match the name provided in the "proto" directive, and it will be soon. This should be a no-op.	2026-05-19 18:33:54 +02:00
Amaury Denoyelle	50354f929d	BUG/MINOR: httpclient-cli: fix uninit variable in error label The following patch fixes a leak in case of httpclient_start() failure in the httpclient_cli code by adding httpclient_destroy() call on error path. `c53256adbc` BUG/MINOR: httpclient-cli: Destroy http-client context if failing to start it However, error label may be selected prior to httpclient allocation if CLI arguments are incorrect. This can cause a crash due to a deferencing of an uninitialized variable. This has been detected via a compilation error : src/httpclient_cli.c: In function 'hc_cli_parse': src/httpclient_cli.c:162:2: error: 'hc' may be used uninitialized in this function [-Werror=maybe-uninitialized] 162 \| httpclient_destroy(hc); \| ^~~~~~~~~~~~~~~~~~~~~~ This must be backported along the above patch, which is scheduled up to the 2.6 release.	2026-05-19 18:33:13 +02:00
Christopher Faulet	6f6bf3fecc	CLEANUP: haterm: Remove "(too old kernel)" from warning message during init During initialization of the haterm master pipe, If its size is limited (lower than the configured one * 5/4), a warning is emitted. In this warning, it is specified this happened because the kernel is too old. But it is unrelated. So let's remove this part.	2026-05-19 17:50:50 +02:00
Christopher Faulet	1279bd80e9	MINOR: haterm: Don't init haterm master pipe if not used There is no reason to initialize the haterm master pipe if haterm is not used. So now, it is only performed if a non-disabled haterm frontend is found. To do so, in addition to test the proxy's flags and capabilities, we also check if "stream_new_from_sc" points on "hstream_new".	2026-05-19 17:50:50 +02:00
Christopher Faulet	b74b5289c8	BUG/MINOR: h1: Don't mask websocket protocol if multiple protocols used During H1 message parsing, the Upgrade header values are checked to detect "websocket" prototol, to properly handle websocket upgrades between H1 and H2 and to possibly reject messages if mandatory headers are missing. However, the flag is reset for each new Upgrade header and the information may be lost. So never reset it. This patch must be backported as far as 2.4.	2026-05-19 17:50:50 +02:00
Christopher Faulet	8dd49dfaba	BUG/MEDIUM: h1: Skip all h2c values from Upgrade headers during parsing During the H1 message parsing, the Upgrade header values are checked to detect "h2c" and "h2" tokens and skip them. To do so, we rely on H1_MF_UPG_H2C flag, set during the parsing. And during the request post-parsing, if this flag was set, all Upgrade headers are removed. This was fixed by the commit `7b89aa5b1` ("BUG/MINOR: h1: do not forward h2c upgrade header token"). However, there are two issues here and the commit above must be refined. First, the flag is reset for each new Upgrade header. So "h2c" or "h2" tokens will be properly detected if all tokens are set on the same Upgrade header. But if splitted on several headers, previously detected tokens will be hidden by a next ones. Concretly, the following will be properly caught Connection: upgrade Upgrade: foo, h2c, bar But then following not: Connection: upgrade: Upgrade: foo, h2c Upgrade: bar Then, when a "h2c" or "h2" token is finally reported, all Upgrade headers are removed, regardless other tokens. So, to fix the both issues, everything is now handled during the message parsing by skipping "h2c" and "h2" tokens, rebuilding the Upgrade header value without then offending tokens. The same was already performed for the Connection header, to skip "keep-alive" and "close" value. So it is not a so fancy change. Thanks to this change, it is no longer necessary to handle H1_MF_UPG_H2C during the request post-parsing. And in fact, this flag is no longer necessary. So let's remove it too. Thanks to Vincent55 for finding and reporting this. This patch must be backported as far as 2.4.	2026-05-19 17:50:50 +02:00
Christopher Faulet	c53256adbc	BUG/MINOR: httpclient-cli: Destroy http-client context if failing to start it When the call to httpclient_start() failed, it is the caller responsibilty to destroy the http-client context by calling httpclient_destroy(). It is performed at several places but it was missing in the httpclient_cli code. So let's fix it. This patch must be backported as far as 2.6. On 3.2 and lower, it must be applied on http_client.c.	2026-05-19 17:50:50 +02:00
Christopher Faulet	18c5cd6674	BUG/MINOR: server: Properly handle init-state value during haproxy startup Unlike stated in the configuration manual, the server 'init-state' parameter was not evaluated during haproxy startup/reload. After a review, it appeared there were also issues if combined with the 'track' parameter. In addtition, this parameter was only evaluated when health-checks were enabled for the server, leading to unexpected behavior if the serve settings are dynamically changed via the CLI. To fix those issues, behavior of the 'init-state' parameter was slightly adapted. It is always evaluated, even when there is no running health-checks for the server. An error is reported if the 'track' parameter is also defined. Both cannot work together. In addition, the "none" state was introduced to be able to restore the default behavior. It will be especially useful when the parameter is inherited from a 'default-server' directive. This patch should fix the issue #3298. It must be backported as far as 3.2.	2026-05-19 17:50:50 +02:00
Remi Tricot-Le Breton	b786eaf1b1	BUG/MINOR: jws: Add missing return value check (EVP_PKEY_get_bn_param) Two calls of 'EVP_PKEY_get_bn_param' did not have their return value checked. This patch can be backported up to 3.2.	2026-05-19 15:21:26 +02:00
Willy Tarreau	307294b30a	BUG/MINOR: jws: fix OpenSSL 3.0 version check from > to >= Three #if directives used > 0x30000000L which excluded OpenSSL 3.0.0 exactly from the modern code path, treating it as pre-3.0. Changed all three to >= 0x30000000L to match jwe.c and openssl-compat.h conventions. This affects EC key thumbprint generation, RSA JWK generation, and JWS algorithm detection for OpenSSL 3.0.0.	2026-05-19 15:21:24 +02:00
Willy Tarreau	0284be5456	BUG/MEDIUM: limits: properly account for global.maxpipes in compute_ideal_maxconn() Starting a config with maxpipes and no maxconn always ended up in error because the number of FDs needed for pipes was not deduced from the total number of FDs when calculating maxconn, and was later found to exceed the total number of allocatable FD during final checks. When global.maxpipes is set, it must be used during compute_ideal_maxconn() so that it's properly deduced. Without this, just having "maxpipes 500" in a config prevents it from starting. With the fix, it properly starts with a maxconn adjusted depending on the number of splice-enabled proxies. This should be backported, theoretically everywhere, but preferably progressively. The following config should fail on affected versions and load with fixed ones: global maxpipes 500 frontend srv1 bind :8001	2026-05-19 15:19:23 +02:00
Willy Tarreau	11bad01760	MINOR: proxy: remove the experimental status on dynamic backends As initially planned, if no trouble was reported on dynamic backend commands on the CLI, the experimental status could be dropped before the release. The feedback was not very broad, but was conclusive in that the operations work as expected and the current syntax can be preserved even for future evolutions. So we can drop the experimental status.	2026-05-19 14:56:45 +02:00
Willy Tarreau	b59fe471a5	DOC: config: further clarify that resolvers "default" exists It was explained in the general presentation of resolvers but not in the "resolvers" keyword description itself, which might be where users could be looking for that info, so let's quickly repeat that info there.	2026-05-19 14:48:27 +02:00
Willy Tarreau	29b9da7821	CLEANUP: jwe: fix theoretical overflow in AAD length calculation Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The expression items[JWE_ELT_JOSE].length << 3 performs the shift on an unsigned int (32-bit) before being cast to uint64_t instead of after. This means that we don't cover for a possible overflow (which would never happen as it would need a header length beyond 512MB). At least fixing it will avoid code check reports.	2026-05-18 18:52:28 +02:00
Willy Tarreau	d4a4be6c34	BUG/MINOR: jwt: fix possible memory leak in convert_ecdsa_sig() error path The allocated ec_R and ec_S were not released in case one of the two would fail to be allocated/created, and would cause a memory leak. Let's add the missing BN_free(). This may be backported to 2.4.	2026-05-18 18:50:30 +02:00
Willy Tarreau	bbc41785d9	CLEANUP: tcpcheck: mention that we're a bit far for a sync errno The collection of errno in tcpcheck_eval_connect() and tcpcheck_main() is quite far from the production location, and the risk of having a zero errno is definitely not null. Tests show that this works, so better not try to fix something not broken, but at least place a comment there indicating that it's not necessarily super-reliable. This would need to be revisited the day we finally store errno in the connection.	2026-05-18 18:47:41 +02:00
Willy Tarreau	3b825d2745	BUG/MINOR: check: properly report errno in chk_report_conn_err() When in 2.2, with commit `c8dc20a825` ("BUG/MINOR: checks: refine which errno values are really errors."), errno reporting was refined, an extra check was added before calling retrieve_errno_from_socket(), and by mistake the test on !errno got inverted so that we only call the function to retrieve the error from the socket when errno is set! The first test in the function detects it and returns without changing anything, so this didn't have much effect, however when errno is not set (certain call places purposely pass zero so that getsockopt() is used), this wasn't called so the error wasn't reported. Apparently it only happened when called from process_chk_conn() after an async error was detected, so probably just cases where POLLERR is reported, which remains infrequent. Let's fix the direction of this flag. It can be backported if needed but it's unlikely anyone really noticed.	2026-05-18 18:40:37 +02:00
Willy Tarreau	3da2b63274	BUG/MINOR: sock: store the connection error status When an async connect() fails in sock_conn_check(), it returns an errno that will not be retrieved later by a subsequent getsockopt(SO_ERROR). The problem is that this errno is then definitely lost. This is visible in the 4be_1srv_smtpchk_httpchk_layer47errors regtest that fails on certain systems (e.g. glibc 2.31 on arm32 running Linux 6.1), where the connect() error is systematically lost and the "Connection refused" is never seen in the check status. It also matches a few random reports of the past indicating that the connection error was sometimes not reported in the stats page in front of a down server. Ideally we should store errno in connections as soon as the error is seen. However this would require significant changes that are not acceptable yet for 3.4 nor stable releases. A more acceptable fix is to make use of the extra CO_ER_* flags set by conn_set_errno() as soon as the error is detected. This will recognize a sufficiently large number of errors and the check status will report them (here we'll have "ECONNREFUSED" in the check). Note that on systems where the error is seen synchronously, we can have "ECONNREFUSED (Connection refused)", but this is not a problem. This fix adds the missing conn_set_errno() call to sock_conn_check(), that is thus sufficient to catch this error. In addition, the two affected regtests were updated to search for ECONNREFUSED here. This might be backported to older releases if users request it, but it is probably not necessary.	2026-05-18 18:16:25 +02:00
Willy Tarreau	fdb569c2ea	REGTESTS: quic/issuers_chain_path: do not forget to enable QUIC compat mode This test is compatible with QUIC_OPENSSL_COMPAT but the "limited-quic" directive was not set, making it fail on older libs with no QUIC support despite being declared as compatible.	2026-05-18 18:01:53 +02:00
Willy Tarreau	fd31df765f	REGTESTS: do not run quic/tls13_ssl_crt-list_filters in quic openssl compat mode This test uses the the backend, it fails in QUIC_OPENSSL_COMPAT so let's disable it in this case, like other similar tests.	2026-05-18 18:01:53 +02:00
Willy Tarreau	b44d60eb42	CLEANUP: stick-table: uniformize the different action_inc_gpc*() Some checks failed Contrib / admin/halog/ (push) Has been cancelled Details Contrib / dev/flags/ (push) Has been cancelled Details Contrib / dev/haring/ (push) Has been cancelled Details Contrib / dev/hpack/ (push) Has been cancelled Details Contrib / dev/poll/ (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details While action_inc_gpc1() explicitly checks if s->stkctr or sess->stkctr are set since 2.8 with commit `6c0117168` ("MEDIUM: stick-table: set the track-sc limit at boottime via tune.stick-counters"), action_inc_gpc0() and the generic action_inc_gpc() still stuck to the old approach of not checking them, causing confusion when reviewing the code. Upon closer inspection, the only case where the pointer may be NULL is when global.tune.nb_stk_ctr is zero, which happens when the global section contains "tune.stick-counters 0". However in this case, the config parser "parse_inc_gpc()" will reject any reference to any stick counter, so in theory there is no problem. Regardless, the difference of treatment between sibling functions remains confusing and the check is cheap, so let's generalize it, it will save a future reader from the need to inspect stream_new() and session_new().	2026-05-17 23:10:27 +02:00
Willy Tarreau	015933794e	BUG/MINOR: session/trace: use distinct flags for SESS_EV_END and _ERR Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Session traces were brought in 3.1 by commit `abb07af67` ("MINOR: session/trace: enable very minimal session tracing") though there was an issue, because SESS_EV_END and SESS_EV_ERR have the same value (it's a copy-paste mistake). This can be backported to 3.2.	2026-05-16 20:29:40 +02:00
Willy Tarreau	4519906c70	DOC: internal: add a few rules about internal core principles The new file core-principles.txt quickly enumerates a number of rules and invariants across the project. These can be used as quick reminders as well as basic rules for reviews. It's still lacking a lot of info but should be a good start.	2026-05-16 20:12:32 +02:00
Willy Tarreau	2f88b4bc4b	CLEANUP: address a few typos and copy-paste errors in httpclient and dns Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details These are either typos or copy-paste mistakes (mostly mouse-induced spaces instead of tabs for dns.c).	2026-05-15 18:25:13 +02:00
Willy Tarreau	9ebb00e673	CLEANUP: proxy: fix duplicate declaration of cli_find_frontend in proxy.h The function cli_find_frontend was declared twice identically at lines 98-99 of include/haproxy/proxy.h. The second declaration should have been for cli_find_backend, which is defined in src/proxy.c and used in several places but was missing from the header's exported symbols. This is a simple copy-paste mistake where line 99 duplicated line 98 verbatim instead of declaring cli_find_backend.	2026-05-15 18:24:57 +02:00
Willy Tarreau	3460626148	BUG/MINOR: resolvers: fix missing task_idle destruction in resolvers_destroy() When destroying a stream-based DNS nameserver, task_req and task_rsp were destroyed but task_idle was missed, causing a task object leak. This doesn't necessarily have to be backported since it's only upon exit that it is visible.	2026-05-15 18:19:41 +02:00
Willy Tarreau	6cbcb4f9db	BUG/MINOR: resolvers: fix leaked fields on cfg_parse_resolvers() error paths cfg_parse_resolvers() has many error paths on allocation failure when parsing "nameserver". These paths handle their own cleanup instead of centralizing it. The result is that some errors paths leak some fields. The most complex ones are the strdup() failures which require to check for stream or dgram to figure what to free. These can be detected via ASAN on a dummy strdup() allocation failure: Indirect leak of 131080 byte(s) in 1 object(s) allocated from: #0 0x7f0b7ed1f0ab in malloc (/usr/lib64/libasan.so.8+0x11f0ab) #1 0x000000c73e19 in dns_ring_new src/dns_ring.c:59 #2 0x000000af1848 in dns_dgram_init src/dns.c:480 #3 0x000000922005 in cfg_parse_resolvers src/resolvers.c:3792 #4 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #5 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #6 0x000000447e8c in main src/haproxy.c:3474 #7 0x7f0b7e02ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #8 0x7ffd35f1531c ([stack]+0x2031c) Indirect leak of 304 byte(s) in 1 object(s) allocated from: #0 0x7f0b7ed1ea23 in calloc (/usr/lib64/libasan.so.8+0x11ea23) #1 0x000000af1681 in dns_dgram_init src/dns.c:468 #2 0x000000922005 in cfg_parse_resolvers src/resolvers.c:3792 #3 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #4 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #5 0x000000447e8c in main src/haproxy.c:3474 #6 0x7f0b7e02ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #7 0x7ffd35f1531c ([stack]+0x2031c) Indirect leak of 104 byte(s) in 1 object(s) allocated from: #0 0x7f0b7ed1ea23 in calloc (/usr/lib64/libasan.so.8+0x11ea23) #1 0x000000921f83 in cfg_parse_resolvers src/resolvers.c:3772 #2 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #3 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #4 0x000000447e8c in main src/haproxy.c:3474 #5 0x7f0b7e02ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #6 0x7ffd35f1531c ([stack]+0x2031c) Indirect leak of 64 byte(s) in 1 object(s) allocated from: #0 0x7f0b7ed1f0ab in malloc (/usr/lib64/libasan.so.8+0x11f0ab) #1 0x000000c73e09 in dns_ring_new src/dns_ring.c:55 #2 0x000000af1848 in dns_dgram_init src/dns.c:480 #3 0x000000922005 in cfg_parse_resolvers src/resolvers.c:3792 #4 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #5 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #6 0x000000447e8c in main src/haproxy.c:3474 #7 0x7f0b7e02ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #8 0x7ffd35f1531c ([stack]+0x2031c) Indirect leak of 15 byte(s) in 1 object(s) allocated from: #0 0x7f0b7ed18e20 in strdup (/usr/lib64/libasan.so.8+0x118e20) #1 0x00000092203b in cfg_parse_resolvers src/resolvers.c:3798 #2 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #3 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #4 0x000000447e8c in main src/haproxy.c:3474 #5 0x7f0b7e02ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #6 0x7ffd35f1531c ([stack]+0x2031c) This should be completely reworked so that the cleanup is performed in a central place, as the risk to get it wrong remains high. This patch does the minimal changes to clean this up. It does not need to be backported since it only triggers on boot OOM.	2026-05-15 18:07:50 +02:00
Willy Tarreau	677fdfe126	BUG/MINOR: resolvers: fix leaked dgram and dns_ring struct in parse_resolve_conf() Some strdup() failures in parse_resolve_conf() do not release everything due to the way the function is built, resulting in leaks on error that are caught by ASAN: Direct leak of 304 byte(s) in 1 object(s) allocated from: #0 0x7fe74231ea23 in calloc (/usr/lib64/libasan.so.8+0x11ea23) #1 0x000000af1681 in dns_dgram_init src/dns.c:468 #2 0x00000091cbbf in parse_resolve_conf src/resolvers.c:3559 #3 0x00000092179e in cfg_parse_resolvers src/resolvers.c:3815 #4 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #5 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #6 0x000000447e8c in main src/haproxy.c:3474 #7 0x7fe74162ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #8 0x7ffc0a43e31f ([stack]+0x2031f) Indirect leak of 131080 byte(s) in 1 object(s) allocated from: #0 0x7fe74231f0ab in malloc (/usr/lib64/libasan.so.8+0x11f0ab) #1 0x000000c73e19 in dns_ring_new src/dns_ring.c:59 #2 0x000000af1848 in dns_dgram_init src/dns.c:480 #3 0x00000091cbbf in parse_resolve_conf src/resolvers.c:3559 #4 0x00000092179e in cfg_parse_resolvers src/resolvers.c:3815 #5 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #6 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #7 0x000000447e8c in main src/haproxy.c:3474 #8 0x7fe74162ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #9 0x7ffc0a43e31f ([stack]+0x2031f) Indirect leak of 64 byte(s) in 1 object(s) allocated from: #0 0x7fe74231f0ab in malloc (/usr/lib64/libasan.so.8+0x11f0ab) #1 0x000000c73e09 in dns_ring_new src/dns_ring.c:55 #2 0x000000af1848 in dns_dgram_init src/dns.c:480 #3 0x00000091cbbf in parse_resolve_conf src/resolvers.c:3559 #4 0x00000092179e in cfg_parse_resolvers src/resolvers.c:3815 #5 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #6 0x0000009e0a39 in read_cfg src/haproxy.c:1142 #7 0x000000447e8c in main src/haproxy.c:3474 #8 0x7fe74162ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #9 0x7ffc0a43e31f ([stack]+0x2031f) SUMMARY: AddressSanitizer: 131448 byte(s) leaked in 3 allocation(s). Let's free the dgram and the dns ring. This can be backported though it's not important as it only happens on OOM condition during boot.	2026-05-15 18:00:04 +02:00
Willy Tarreau	b15e9b1b29	BUG/MINOR: resolvers: report the expression error in the do-resolve() action parser When an expression is used for do-resolve(), an error may be reported. Unfortunately it was scratched and replaced by the do-resolve() error, leaving no chance to know exactly what was wrong. Let's report the contents of the error when available. It will indicate identifiers that are not found or invalid ranges or types being used. This can be backported to all versions.	2026-05-15 17:53:00 +02:00
Willy Tarreau	0c8c9b1c2a	CLEANUP: resolvers: properly initialize the sample in resolv_action_do_resolve() The sample used to pass the IP address only had its data, px, sess and strm fields initialized before being passed to vars_set_by_name(). It turns out that this latter one doesn't seem to touch ctx, flags nor opt but nothing guarantees it. Let's at least initialize the fields properly to avoid passing random garbage. No backport is needed.	2026-05-15 17:51:58 +02:00
Willy Tarreau	bed842390f	BUG/MINOR: proxy: use proxy_drop() in parse_new_proxy() error path In parse_new_proxy(), when proxy_defproxy_cpy() fails, the error path used ha_free(&curproxy) to release the partially constructed proxy. However, the proxy was allocated via alloc_new_proxy() which performs significant setup: - setup_new_proxy() inserts it into the proxy_by_name tree (proxy_store_name) - It appends to the global proxies list (LIST_APPEND) - proxy_take() increments its refcount Additionally, proxy_defproxy_cpy() may have allocated further resources (strdup'd strings, compression structures, email alert fields, etc). Using ha_free() only freed the proxy struct itself, leaving: - The proxy still registered in the name tree (dangling pointer) - The proxy still linked in the global proxies list - All strdup'd strings and other allocations leaked This is visible with ASAN when causing random allocation errors: [NOTICE] (27033) : haproxy version is 3.4-dev12-b15468-11 [NOTICE] (27033) : path to executable is ./haproxy [ALERT] (27033) : config : parsing [/dev/stdin:5015] : proxy 'bk3': failed to duplicate tcpcheck preset-vars [ALERT] (27033) : config : Error(s) found in configuration file : /dev/stdin ================================================================= ==27033==ERROR: LeakSanitizer: detected memory leaks Direct leak of 4 byte(s) in 1 object(s) allocated from: #0 0x7f113e518e20 in strdup (/usr/lib64/libasan.so.8+0x118e20) #1 0x000000955410 in setup_new_proxy src/proxy.c:3178 #2 0x000000955816 in alloc_new_proxy src/proxy.c:3221 #3 0x000000956c33 in parse_new_proxy src/proxy.c:3554 #4 0x000000a24d03 in cfg_parse_listen src/cfgparse-listen.c:495 #5 0x00000089d33e in parse_cfg src/cfgparse.c:2202 #6 0x0000009e0bb9 in read_cfg src/haproxy.c:1142 #7 0x000000447e8c in main src/haproxy.c:3474 #8 0x7f113d82ad13 in __libc_start_call_main (/lib64/libc.so.6+0x2ad13) #9 0x7fff65b4e320 ([stack]+0x20320) SUMMARY: AddressSanitizer: 4 byte(s) leaked in 1 allocation(s). The fix replaces ha_free(&curproxy) with proxy_drop(curproxy), which properly calls deinit_proxy() to release all internal resources, removes the proxy from trees and lists, decrements the refcount, and frees the struct. No backport is needed since proxy_drop() is only in 3.4.	2026-05-15 17:39:25 +02:00

1 2 3 4 5 ...

27215 commits