haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-06-23 08:21:33 -04:00

Author	SHA1	Message	Date
Olivier Houchard	aaee6c463c	MINOR: tasks: Remove wq_lock and the per-thread group wait queues Now that they are no longer used, remove wq_lock and the per-thread group wait queues.	2026-06-12 11:49:09 +02:00
Olivier Houchard	45a64123d6	BUG/MEDIUM: threads: Fix build when using no thread In thread_detect_count(), avoid any usage of thread_cpu_enable_at_boot if we're building without thread support. That variable is only defined when building with threads, and those tests make little sense when building with no thread, anyway. This was submitted by: ririnto <ririnto@kakao.com> This should fix github issue #3408. This should be backported to 3.4.	2026-06-08 01:16:49 +02:00
Willy Tarreau	4185bc2cb8	MINOR: thread: report when thread-groups or nbthread results in less threads Some setups where the number of threads is forced without any binding (no cpu-map), are quite suspicious if they result in less threads than available CPUs, and not even predictably bound, so we want to notify the user that this might be an oversight. Similarly, when thread-groups is forced and not nbthread (and no cpu-map), and the final number of threads is lower than the hard-limit or the number of CPUs we also indicate the impact and how to remedy it. This can happen for example when starting on a machine with more than 64 CPUs and thread-groups forced to 1, or on more than 128 CPUs and thread-groups forced to 2 (e.g. when moving an older config to a new platform). It is possible that some of these conditions might need to be readjusted in the future to catch other traps or to relax certain commonly used, valid cases, so for now it is preferable not to backport this patch.	2026-05-28 18:49:47 +02:00
Willy Tarreau	f91b1ce9af	CLEANUP: thread: indicate when max-threads-per-group is ignored Since it's easy to get caught by some parameters being ignored, let's detect when mtpg was explicitly set and report a notice if it is ignored due to thread-groups being set. For this we need to avoid presetting the value in the global section and only set it when entering function thread_detect_count(), which is OK since the value cannot be used before.	2026-05-28 18:49:47 +02:00
Willy Tarreau	f5847d11f7	BUG/MEDIUM: threads: ignore max-threads-per-group when thread-groups is set As documented, max-threads-per-group is the default number of threads to arrange in a group before creating another group, and is only meant to be used when thread-groups is not set. However it was always enforced, so configs like: global thread-groups 2 which were sufficient in 3.2 and above to start with 64-128 threads are now suddenly limited to 32 threads! Let's relax the limit when thread-groups is set! No backport is needed since this is only 3.4.	2026-05-28 18:49:47 +02:00
Willy Tarreau	617df441d6	BUG/MINOR: threads: set at least grp_max when mtpg is too small When starting, say, 128 threads with max-threads-per-group set to 2 and MAX_TGROUPS set to the default 32, instead of setting the resulting number of groups to 32 and threads to 64, they're set to 1 and 32 respectively because the condition to raise grp_min is not satisfied. Let's cut the condition in two parts to also permit to raise it at least to grp_max. This should be backported to 3.2.	2026-05-28 18:49:47 +02:00
Willy Tarreau	d7c747b572	BUG/MINOR: threads: properly set the number of tgroups when not using policy When nbthread is set, the CPU policies are not used and do not set nbthread nor nbtgroups. When back into thread_detect_count(), these are set respectively to thr_max and 1. The problem which becomes very visible with max-threads-per-group, is that setting this one in combination with nbthread results in only one group with the calculated number of threads per group. And there's not even a warning. So basically a configuration having: global nbthread 64 max-threads-per-group 8 would only start 8 threads. In this case, grp_min remains valid and should be used, so let's just change the assignment so that the number of groups is always correct. A few ifdefs had to move because the calculations were only made for the USE_CPU_AFFINITY case. Now these parts have been refined so that all the logic continues to apply even without USE_CPU_AFFINITY. One visible side effect is that setting nbthread above 64 will automatically create the associated number of groups even when USE_CPU_AFFINITY is not set. Previously it was silently changed to match the per-group limit. Ideally this should be backported to 3.2 where the issue was introduced, though it may change the behavior of configs that were silently being ignored (e.g. "nbthread 128"), so the backport should be considered with care. At least 3.3 should have it because it uses cpu-policy by default so it's only for failing cases that it would be involved.	2026-04-15 17:47:26 +02:00
Willy Tarreau	5fe0579d49	MEDIUM: threads: start threads by groups Till now, threads were all started one at a time from thread 1. This will soon cause us limitations once we want to reduce shared stuff between thread groups. Let's slightly change the startup sequence so that the first thread starts one initial thread for each group, and that each of these threads then starts all other threads from their group before switching to the final task. Since it requires an intermediary step, we need to store that threads' start function to access it from the group, so it was put into the tgroup_info which still has plenty of room available. It could also theoretically speed up the boot sequence, though in practice it doesn't change anything because each thread's initialization is made one at a time to avoid races during the early boot. However there is now a function in charge of starting all extra threads of a group, and which is called from this group.	2026-04-15 15:53:56 +02:00
Amaury Denoyelle	f58b2698ce	MEDIUM: proxy: add lock for global accesses during proxy free Define a new lock with label PROXIES_DEL_LOCK. Its purpose is to protect operations performed on global lists or trees while a proxy is freed. Currently, this lock is unneeded as proxies are only freed on single-thread init or deinit. However, with the incoming dynamic backend deletion, this operation will be also performed at runtime, outside of thread isolation.	2026-03-02 14:09:25 +01:00
Olivier Houchard	bf7a2808fc	BUG/MEDIUM: threads: Defer checking the max threads per group number Defer checking the max threads per group number until we're done parsing the configuration file, as it may be set after a "thread-group" directive. Otherwise the default value of 64 will be used, even if there is a max-threads-per-group directive. This should be backported to 3.3.	2026-02-06 03:01:50 +01:00
Olivier Houchard	9766211cf0	BUG/MINOR: threads: Initialize maxthrpertgroup earlier. Give global.maxthrpertgroup its default value at global creation, instead of later when we're trying to detect the thread count. It is used when verifying the configuration file validity, and if it was not set in the config file, in a few corner cases, the value of 0 would be used, which would then reject perfectly fine configuration files. This should be backported to 3.3.	2026-02-06 03:01:36 +01:00
Olivier Houchard	a209c35f30	MEDIUM: thread: Turn the group mask in thread set into a group counter If we want to be able to have more than 64 thread groups, we can no longer use thread group masks as long. One remaining place where it is done is in struct thread_set. However, it is not really used as a mask anywhere, all we want is a thread group counter, so convert that mask to a counter.	2026-01-15 05:24:53 +01:00
Olivier Houchard	1397982599	MINOR: threads: Eliminate all_tgroups_mask. Now that it is unused, eliminate all_tgroups_mask, as we can't use 64-bit masks to represent thread groups if we want to be able to have more than 64 thread groups.	2026-01-15 03:46:57 +01:00
Olivier Houchard	82196eb74e	BUG/MEDIUM: threads: Fix binding thread on bind. The code to parse the "thread" keyword on bind lines was changed to check if the thread numbers were correct against the value provided with max-threads-per-group, if any were provided, however, at the time those thread keywords have been set, it may not yet have been set, and that breaks the feature, so revert to check against MAX_THREADS_PER_GROUP instead, it should have no major impact.	2026-01-13 11:45:46 +01:00
Olivier Houchard	7e22d9c484	MEDIUM: cpu-topo: Add a new "max-threads-per-group" global keyword Add a new global keyword, max-threads-per-group. It sets the maximum number of threads a thread group can contain. Unless the number of thread groups is fixed with "thread-groups", haproxy will just create more thread groups as needed. The default and maximum value is 64.	2025-12-18 18:52:52 +01:00
Willy Tarreau	c365e47095	BUG/MEDIUM: threads/config: drop absent threads from thread groups Thread groups can be assigned arbitrary thread ranges, but if the mentioned threads do not exist, this causes crashes in listener_accept() or some connections to be ignored. The reason is that the calculated mask is derived from the thread group's enabled threads count. Examples: global nbthread 2 thread-groups 2 thread-group 1 1-64 thread-group 2 65-128 frontend f-crash bind :8001 thread 1/all frontend f-freeze bind :8002 thread 2/all This commit removes missing threads, emits a warning when the thread group just has less threads than requested, and an error when it is left with no threads at all. This must be backported to 3.1 since the issue is present there already.	2025-10-17 20:36:00 +02:00
Remi Tricot-Le Breton	c3c0597a34	MEDIUM: jwt: Remove certificate support in jwt_verify converter The jwt_verify converter will not take full-on certificates anymore in favor of a new soon to come jwt_verify_cert. We might end up with a new jwt_verify_hmac in the future as well which would allow to deprecate the jwt_verify converter and remove the need for a specific internal tree for public keys. The logic to always look into the internal jwt tree by default and resolve to locking the ckch tree as little as possible will also be removed. This allows to get rid of the duplicated reference to EVP_PKEYs, the one in the jwt tree entry and the one in the ckch_store.	2025-10-13 10:38:52 +02:00
Willy Tarreau	4b23b2ed32	MINOR: thread: add a lock level information in the thread_ctx The new lock_level field indicates the number of cumulated locks that are held by the current thread. It's fed as soon as DEBUG_THREAD is at least 1. In addition, thread_isolate() adds 128, so that it's even possible to check for combinations of both. The value is also reported in thread dumps (warnings and panics).	2025-09-11 16:32:34 +02:00
Olivier Houchard	f8e9545f70	BUG/MEDIUM: threads: Disable the workaround to load libgcc_s on macOS Don't use the workaround to load libgcc_s on macOS. It is not needed there, and it causes issues, as recent macOS versions dislike processes that fork after threads were created (and the workaround creates a temporary thread). This fixes crashes on macOS at least when using master-worker, and using the system resolver. This should fix Github issue #3035. This should be backported up to 2.8.	2025-07-21 13:56:29 +02:00
Valentine Krasnobaeva	9e11c852fe	MINOR: cpu-topo: write thread-cpu bindings into trash buffer Write thread-cpu bindings and cluster summary into provided trash buffer. Like this we can call this function in any place, when this info is needed.	2025-07-17 19:07:58 +02:00
Valentine Krasnobaeva	2405283230	MINOR: cpu-topo: split cpu_dump_topology() to show its summary in show dev cpu_dump_topology() prints details about each enabled CPU and a summary with clusters info and thread-cpu bindings. The latter is often useful for debugging and we want to add it in the 'show dev' output. So, let's split cpu_dump_topology() in two parts: cpu_topo_debug() to print the details about each enabled CPU; and cpu_topo_dump_summary() to print only the summary. In the next commit we will modify cpu_topo_dump_summary() to write into local trash buffer and it could be easily called from debug_parse_cli_show_dev().	2025-07-17 19:07:46 +02:00
Remi Tricot-Le Breton	522bca98e1	MAJOR: jwt: Allow certificate instead of public key in jwt_verify converter The 'jwt_verify' converter could only be passed public keys as second parameter instead of full-on public certificates. This patch allows proper certificates to be used. Those certificates can be loaded in ckch_stores like any other certificate which means that all the certificate-related operations that can be made via the CLI can now benefit JWT validation as well. We now have two ways JWT validation can work, the legacy one which only relies on public keys which could not be stored in ckch_stores without some in depth changes in the way the ckch_stores are built. In this legacy way, the public keys are fully stored in a cache dedicated to JWT only which does not have any CLI commands and any way to update them during runtime. It also requires that all the public keys used are passed at least once explicitly to the 'jwt_verify' converter so that they can be loaded during init. The new way uses actual certificates, either already stored in the ckch_store tree (if predefined in a crt-store or already used previously in the configuration) or loaded in the ckch_store tree during init if they are explicitly used in the configuration like so: var(txn.bearer),jwt_verify(txn.jwt_alg,"cert.pem") When using a variable (or any other way that can only be resolved during runtime) in place of the converter's <key> parameter, the first time we encounter a new value (for which we don't have any entry in the jwt tree) we will lock the ckch_store tree and try to perform a lookup in it. If the lookup fails, an entry will still be inserted into the jwt tree so that any following call with this value avoids performing the ckch_store tree lookup.	2025-06-30 17:59:55 +02:00
Aurelien DARRAGON	5694a98744	MAJOR: mailers: remove native mailers support As mentioned in 2.8 announce on the mailing list [1] and on the wiki [2] native mailers were deprecated and planned for removal in 3.3. Now is the time to drop the legacy code for native mailers which is based on a tcpcheck "hack" and cannot be maintained. Lua mailers should be used as a drop in replacement. Indeed, "mailers" and associated config directives are preserved because mailers config is exposed to Lua, which helps smoothing the transition from native mailers to Lua based ones. As a reminder, to keep mailers configuration working as before without making changes to the config file, simply add the line below to the global section: lua-load examples/lua/mailers.lua mailers.lua script (provided in the git repository, adjust path as needed) may be customized by users familiar with Lua, by default it emulates the behavior of the native (now removed) mailers. [1]: https://www.mail-archive.com/haproxy@formilux.org/msg43600.html [2]: https://github.com/haproxy/wiki/wiki/Breaking-changes	2025-06-24 10:55:58 +02:00
Willy Tarreau	23371b3e7c	MINOR: threads: turn the full lock debugging to DEBUG_THREAD=2 At level 1 it now does nothing. This is reserved for some subsequent patches which will implement lighter debugging.	2025-04-28 16:50:34 +02:00
Willy Tarreau	903a6b14ef	MINOR: threads: prepare DEBUG_THREAD to receive more values We now default the value to zero and make sure all tests properly take care of values above zero. This is in preparation for supporting several degrees of debugging.	2025-04-28 16:50:34 +02:00
Willy Tarreau	1af592c511	MINOR: stick-table: use a separate lock label for updates Too many locks were sharing STK_TABLE_LOCK making it hard to analyze. Let's split the already heavily used update lock.	2025-04-24 14:02:22 +02:00
Valentine Krasnobaeva	08efe8cd24	BUG/MINOR: thread: protect thread_cpus_enabled_at_boot with USE_THREAD Following error is triggered at linker invocation, when we try to compile with USE_THREAD=0 and -O0. make -j 8 TARGET=linux-glibc USE_LUA=1 USE_PCRE2=1 USE_LINUX_CAP=1 \ USE_MEMORY_PROFILING=1 OPT_CFLAGS=-O0 USE_THREAD=0 /usr/bin/ld: src/thread.o: warning: relocation against `thread_cpus_enabled_at_boot' in read-only section `.text' /usr/bin/ld: src/thread.o: in function `thread_detect_count': /home/vk/projects/haproxy/src/thread.c:1619: undefined reference to `thread_cpus_enabled_at_boot' /usr/bin/ld: /home/vk/projects/haproxy/src/thread.c:1619: undefined reference to `thread_cpus_enabled_at_boot' /usr/bin/ld: /home/vk/projects/haproxy/src/thread.c:1620: undefined reference to `thread_cpus_enabled_at_boot' /usr/bin/ld: warning: creating DT_TEXTREL in a PIE collect2: error: ld returned 1 exit status make: *** [Makefile:1044: haproxy] Error 1 thread_cpus_enabled_at_boot is only available when we compile with USE_THREAD=1, which is the default for most targets now. In some cases, we need to recompile in mono-thread mode, thus thread_cpus_enabled_at_boot should be protected with USE_THREAD in thread_detect_count(). thread_detect_count() is always called during the process initialization, regardless of multi-thread support. It sets some defaults in global.nbthread and global.nbtgroups. This patch is related to GitHub issue #2916. No need to be backported as it was added in 3.2-dev9 version.	2025-04-14 16:03:21 +02:00
Olivier Houchard	ba521a1d88	MINOR: threads: Add HA_RWLOCK_TRYRDTOWR() Add HA_RWLOCK_TRYRDTOWR(), that tries to upgrade a lock from reader to writer, and fails if any seeker or writer already holds it.	2025-04-01 18:05:30 +02:00
Willy Tarreau	e17512c3b2	MINOR: thread: dump the CPU topology in thread_map_to_groups() It was previously done in thread_detect_count() but that's not quite handy because we still don't know about the groups setting. Better do it slightly later and have all the relevant info instead.	2025-03-31 15:42:13 +02:00
Willy Tarreau	aaa4080b8b	CLEANUP: thread: now remove the temporary CPU node binding code This is now superseded by the default "safe" cpu-policy, and every time it's used, that code was bypassed anyway since global.nbthread was set. We can now safely remove it. Note that for other policies which do not set a thread count nor further restrict CPUs (such as "none", or even "safe" when finding a single node), we continue to go through the fallback code that automatically assigns CPUs to threads and counts them.	2025-03-14 18:33:16 +01:00
Willy Tarreau	156430ceb6	MINOR: cpu-topo: add a CPU policy setting to the global section We'll need to let the user decide what's best for their workload, and in order to do this we'll have to provide tunable options. For that, we're introducing struct ha_cpu_policy which contains a name, a description and a function pointer. The purpose will be to use that function pointer to choose the best CPUs to use and now to set the number of threads and thread-groups, that will be called during the thread setup phase. The only supported policy for now is "none" which doesn't set/touch anything (i.e. all available CPUs are used).	2025-03-14 18:33:16 +01:00
Willy Tarreau	18133a054d	MINOR: cpu-topo: implement a sorting mechanism for CPU index CPU selection will be performed by sorting CPUs according to various criteria. For dumps however, that's really not convenient and we'll need to reorder the CPUs according to their index only. This is what the new function cpu_reorder_by_index() does. It's called in thread_detect_count() before dumping the CPU topology.	2025-03-14 18:30:31 +01:00
Willy Tarreau	e3aef4c9a4	MEDIUM: thread: reimplement first numa node detection Let's reimplement automatic binding to the first NUMA node when thread count is not forced. It's the same thing as is already done in check_config_validity() except that this time it's based on the collected CPU information. The threads are automatically counted and CPUs from non-first node(s) are evicted.	2025-03-14 18:30:30 +01:00
Willy Tarreau	4a525e8d27	MEDIUM: cpu-topo: make sure to properly assign CPUs to threads as a fallback If no cpu-map is done and no cpu-policy could be enforced, we still need to count the number of usable CPUs, assign them to all threads and set the nbthread value accordingly. This already handles the part that was done in check_config_validity() via thread_cpus_enabled_at_boot.	2025-03-14 18:30:30 +01:00
Willy Tarreau	1af4942c95	MEDIUM: thread: start to detect thread groups and threads min/max By mutually refining the thread count and group count, we can try to detect the most suitable setup for the current machine. Taskset is implicitly handled correctly. tgroups automatically adapt to the configured number of threads. cpu-map manages to limit tgroups to the smallest supported value. The thread-limit is enforced. Just like in cfgparse, if the thread count was forced to a higher value, it's reduced and a warning is emitted. But if it was not set, the thr_max value is bound to this limit so that further calculations respect it. We continue to default to the max number of available threads and 1 tgroup by default, with the limit. This normally allows to get rid of that test in check_config_validity().	2025-03-14 18:30:30 +01:00
Willy Tarreau	ac1db9db7d	MINOR: thread: turn thread_cpu_mask_forced() into an init-time variable The function is not convenient because it doesn't allow us to undo the startup changes, and depending on where it's being used, we don't know whether the values read have already been altered (this is not the case right now but it's going to evolve). Let's just compute the status during cpu_detect_usable() and set a variable accordingly. This way we'll always read the init value, and if needed we can even afford to reset it. Also, placing it in cpu_topo.c limits cross-file dependencies (e.g. threads without affinity etc).	2025-03-14 18:30:30 +01:00
Willy Tarreau	8c524c7c9d	REORG: cpu-topo: move bound cpu detection from cpuset to cpu-topo The cpuset files are normally used only for cpu manipulations. It happens that the initial CPU binding detection was initially placed there since there was no better place, but in practice, being OS-specific, it should really be in cpu-topo. This simplifies cpuset which doesn't need to know about the OS anymore.	2025-03-14 18:30:30 +01:00
Willy Tarreau	05a4efb102	MINOR: thread: rely on the cpuset functions to count bound CPUs let's just clean up the thread_cpus_enabled() code a little bit by removing the OS-specific code and rely on ha_cpuset_detect_bound() instead. On macos we continue to use sysconf() for now.	2025-03-14 18:30:30 +01:00
Willy Tarreau	12383fd9f5	BUG/MEDIUM: thread: use pthread_self() not ha_pthread[tid] in set_affinity A bug was uncovered by the work on NUMA. It only triggers in the CI with libmusl due to a race condition. What happens is that the call to set_thread_cpu_affinity() is done very early in the polling loop, and that it relies on ha_pthread[tid] instead of pthread_self(). The problem is that ha_pthread[tid] is only set by the return from pthread_create(), which might happen later depending on the number of CPUs available to run the starting thread. Let's just use pthread_self() here. ha_pthread[] is only used to send signals between threads, there's no point in using it here. This can be backported to 2.6.	2025-03-12 15:59:23 +01:00
Willy Tarreau	eced1d6d8a	DEBUG: thread: reduce the struct lock_stat to store only 30 buckets Storing only 30 buckets means we only keep 256 bytes per label. This further simplifies address calculation and reduces the memory used without complicating the locking code. It means we won't measure wait times larger than a second but we're not supposed to face this as it would trigger the watchdog anyway. It may become a little bit tight if measuring using rdtsc() instead of now_mono_time() though (typically the limit would be around 350ms for a 3 GHz CPU).	2025-02-10 18:34:43 +01:00
Willy Tarreau	c2f2d6fd3c	DEBUG: thread: make lock_stat per operation instead of for all operations It's more convenient (and more readable) to have the lock stats arranged by operation type (read, seek, write). It will also allow to later simplify the structure format and the bucket address calculation. Now lock_stat[] got split into lock_stats_rd[], lock_stats_sk[], lock_stats_wr[].	2025-02-10 18:34:43 +01:00
Willy Tarreau	4168d1278c	DEBUG: thread: don't keep the redundant _locked counter Now that we have our sums by bucket, the _locked counter is redundant since it's always equal to the sum of all entries. Let's just get rid of it and replace its consumption with a loop over all buckets, this will reduce the overhead of taking each lock at the expense of a tiny extra effort when dumping all locks, which we don't care about.	2025-02-10 18:34:43 +01:00
Willy Tarreau	a22550fbd7	DEBUG: thread: report the wait time buckets for lock classes In addition to the total/average wait time, we now also store the wait time in 2^N buckets. There are 32 buckets for each type (read, seek, write), allowing to store wait times from 1-2ns to 2.1-4.3s, which is quite sufficient, even if we'd want to switch from NS to CPU cycles in the future. The counters are only reported for non- zero buckets so as not to visually pollute the output. This significantly inflates the lock_stat struct, which is now aligned to 256 bytes and rounded up to 1kB. But that's not really a problem, given that there's only one per lock label.	2025-02-10 18:34:43 +01:00
Willy Tarreau	0b849c59fb	DEBUG: thread: make lock time computation more consistent The lock time computation was a bit inconsistent between functions, particularly those using a try_lock. Some of them would count the lock as taken without counting the time, others would simply not count it. This is essentially due to the way the time is retrieved, as it was done inside the atomic increment. Let's instead always use start_time to carry the elapsed time, by presetting it to the negative time before the event and adding the positive time after, so that it finally contains the duration. Then depending on the try lock's success, we add the result or not. This was generalized to all lock functions for consistency, and because this will be handy for future changes.	2025-02-10 18:34:43 +01:00
Willy Tarreau	99a88ee904	DEBUG: thread: report the spin lock counters as seek locks Technically speaking, spin locks use a seek lock, not a write lock, so better count them appropriately for consistency (lock time, or function calls count).	2025-02-10 18:34:43 +01:00
Amaury Denoyelle	8e0e7d9d1a	BUG/MINOR: guid/server: ensure thread-safety on GUID insert/delete Since 3.0, it is possible to assign a GUID to proxies, listeners and servers. These objects are stored in a global tree guid_tree. Proxies and listeners are static. However, servers may be added or deleted at runtime, which implies that guid_tree must be protected. Fix this by declaring a read-write lock to protect tree access. For now, only guid_insert() and guid_remove() are protected using a write lock. Outside of these, GUID tree is not accessed at runtime. If server CLI commands are extended to support GUID as server identifier, lookup operation should be extended with a read lock protection. Note that during stat-file preloading, GUID tree is accessed for lookup. However, as it is performed on startup which is single threaded, there is no need for lock here. A BUG_ON() has been added to ensure this precondition remains true. This bug could cause a segfault when using dynamic servers with GUID. However, it was never reproduced for now. This must be backported up to 3.0. To avoid a conflict issue, the previous cleanup patch can be merged before it.	2024-11-07 18:17:03 +01:00
Willy Tarreau	381ed2a4dd	MINOR: config: add thread-hard-limit to set an upper bound to nbthread On today's large systems, it's not always desired to run on all threads for light loads, and usually users enforce nbthread to a lower value (e.g. 8). The problem is that this is a fixed value, and moving such configs to smaller machines continues to enforce the value and this becomes extremely unproductive due to having more threads than CPUs. This also happens quite a bit in VMs, containers, or cloud instances of various sizes. This commit introduces the thread-hard-limit setting that allows to only set an upper bound to the number of threads without raising a lower value. This means that using "thread-hard-limit 8" will make sure that no more than 8 threads will be used when available, but it will remain two when run on a dual-core machine.	2024-05-24 09:46:49 +02:00
Remi Tricot-Le Breton	ac9c49b40d	MEDIUM: cache: Use dedicated cache tree lock alongside shctx lock Every use of the cache tree was covered by the shctx lock even when no operations were performed on the shared_context lists (avail and hot). This patch adds a dedicated RW lock for the cache so that blocks of code that work on the cache tree only can use this lock instead of the superseding shctx one. This is useful for operations during which the concerned blocks are already in the hot list. When the two locks need to be taken at the same time, in http_action_req_cache_use and in shctx_row_reserve_hot, the shctx one must be taken first. A new parameter needed to be added to the shared_context's free_block callback prototype so that cache_free_block can take the cache lock and release it afterwards.	2023-11-16 19:35:10 +01:00
Willy Tarreau	cbbee15462	CLEANUP: ring: rename the ring lock "RING_LOCK" instead of "LOGSRV_LOCK" The ring lock was initially mostly used for the logs and used to inherit its name in lock stats. Now that it's exclusively used by rings, let's rename it accordingly.	2023-09-20 21:38:33 +02:00
Willy Tarreau	86854dd032	MEDIUM: threads: detect excessive thread counts vs cpu-map This detects when there are more threads bound via cpu-map than CPUs enabled in cpu-map, or when there are more total threads than the total number of CPUs available at boot (for unbound threads) and configured for bound threads. In this case, a warning is emitted to explain the problems it will cause, and explaining how to address the situation. Note that some configurations will not be detected as faulty because the algorithmic complexity to resolve all arrangements grows in O(N!). This means that having 3 threads on 2 CPUs and one thread on 2 CPUs will not be detected as it's 4 threads for 4 CPUs. But at least configs such as T0:(1,4) T1:(1,4) T2:(2,4) T3:(3,4) will not trigger a warning since they're valid.	2023-09-04 19:39:17 +02:00
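
Note on the lock wait-time buckets (commits a22550fbd7 and eced1d6d8a): the messages describe 2^N buckets, first 32 per operation type (1-2ns up to 2.1-4.3s), later reduced to 30 (just over one second, or roughly 350ms when counting cycles on a 3 GHz CPU). The arithmetic behind those figures can be checked with the minimal Python sketch below. It is only an illustrative model, not HAProxy's C code: the names wait_bucket and max_measurable_seconds are hypothetical, and clamping oversized or zero waits into the last or first bucket is an assumption that the commit messages do not state.

```python
NUM_BUCKETS = 30  # per eced1d6d8a; a22550fbd7 originally used 32


def wait_bucket(wait_ticks: int, buckets: int = NUM_BUCKETS) -> int:
    """Return N such that 2**N <= wait_ticks < 2**(N+1).

    Assumption (not from the commit messages): waits below one tick go to
    bucket 0 and waits beyond the last bucket are clamped into it.
    """
    if wait_ticks < 1:
        return 0
    # bit_length() - 1 is floor(log2(wait_ticks)) for positive integers
    return min(wait_ticks.bit_length() - 1, buckets - 1)


def max_measurable_seconds(buckets: int, ticks_per_second: float) -> float:
    """Upper bound of the last bucket, converted to seconds."""
    return 2 ** buckets / ticks_per_second


if __name__ == "__main__":
    print(wait_bucket(1), wait_bucket(1023), wait_bucket(1024))
    print(max_measurable_seconds(32, 1e9))  # 32 buckets, nanoseconds
    print(max_measurable_seconds(30, 1e9))  # 30 buckets, nanoseconds
    print(max_measurable_seconds(30, 3e9))  # 30 buckets, cycles at 3 GHz
```

With nanosecond ticks, 30 buckets top out at 2^30 ns (about 1.07 s) and 32 buckets at 2^32 ns (about 4.3 s); with 3 GHz cycle counts, 30 buckets top out at about 0.36 s, which matches the orders of magnitude quoted in the two commit messages.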


132 commits