bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-02-25 10:59:35 -05:00

Author	SHA1	Message	Date
Ondřej Surý	a8e6c3b8f7	Make isc_result tables smaller The isc_result_t enum was too sparse when each library code would skip to the next << 16 as a base. Remove the huge holes in the isc_result_t enum to make the isc_result tables more compact. This change required a rewrite of how we map dns_rcode_t to isc_result_t and back, so we never return an isc_result_t value or a dns_rcode_t outside the defined range.	2023-06-15 15:32:04 +02:00
Mark Andrews	e6e4ac05b8	Fix typo in synchronize_rcu macro (add h) synchronize_rcu has not been used until now in BIND9 and there was a typo in the define (a 'h' was missing).	2023-06-06 08:10:09 +10:00
Ondřej Surý	f760ee3f8c	Disable URCU inlining if inlined rcu_dereference() fails to compile In some cases, the inlined version rcu_dereference() would not compile when working on pointer to opaque struct (namely Ubuntu Jammy). Detect such condition in the autoconf and disable the inlining of the small functions if it breaks the build.	2023-06-01 16:51:38 +02:00
Michał Kępień	6029010dd2	Remove <isc/cmocka.h> The last use of the cmocka_add_test_byname() helper macro was removed in commit `63fe9312ff`. Remove the <isc/cmocka.h> header that defines it.	2023-05-18 15:12:23 +02:00
Tony Finch	c319ccd4c9	Fixes for liburcu-qsbr Move registration and deregistration of the main thread from `isc_loopmgr_run()` into `isc__initialize()` / `isc__shutdown()`: liburcu-qsbr fails an assertion if we try to use it from an unregistered thread, and we need to be able to use it when the event loops are not running. Use `rcu_assign_pointer()` and `rcu_dereference()` in qp-trie transactions so that they properly mark threads as online. The RCU-protected pointer is no longer declared atomic because liburcu does not (yet) use standard C atomics. Fix the definition of `isc_qsbr_rcu_dereference()` to return the referenced value, and to call the right function inside liburcu. Change the thread sanitizer suppressions to match any variant of `rcu_*_barrier()`	2023-05-15 20:49:42 +00:00
Tony Finch	fc770a8bd0	Remove the now-unused ISC_STACK We are using the liburcu concurrent data structures instead.	2023-05-12 20:49:43 +01:00
Tony Finch	c377e0a9e3	Help thread sanitizer to cope with liburcu All the places the qp-trie code was using `call_rcu()` needed `__tsan_release()` and `__tsan_acquire()` annotations, so add a couple of wrappers to encapsulate this pattern. With these wrappers, the tests run almost clean under thread sanitizer. The remaining problems are due to `rcu_barrier()` which can be suppressed using `.tsan-suppress`. It does not suppress the whole of `liburcu`, because we would like thread sanitizer to detect problems in `call_rcu()` callbacks, which are called from `liburcu`. The CI jobs have been updated to use `.tsan-suppress` by default, except for a special-case job that needs the additional suppressions in `.tsan-suppress-extra`. We might be able to get rid of some of this after liburcu gains support for thread sanitizer. Note: the `rcu_barrier()` suppression is not entirely effective: tsan sometimes reports races that originate inside `rcu_barrier()` but tsan has discarded the stack so it does not have the information required to suppress the report. These "races" can be made much easier to reproduce by adding `atexit_sleep_ms=1000` to `TSAN_OPTIONS`. The problem with tsan's short memory can be addressed by increasing `history_size`: when it is large enough (6 or 7) the `rcu_barrier()` stack usually survives long enough for suppression to work.	2023-05-12 20:48:31 +01:00
Tony Finch	05ca11e122	Remove isc_qsbr (we are using liburcu instead) This commit breaks the qp-trie code.	2023-05-12 20:48:31 +01:00
Tony Finch	4f97a679f0	A macro for the size of a struct with a flexible array member It can be fairly long-winded to allocate space for a struct with a flexible array member: in general we need the size of the struct, the size of the member, and the number of elements. Wrap them all up in a STRUCT_FLEX_SIZE() macro, and use the new macro for the flexible arrays in isc_ht and dns_qp.	2023-05-12 20:48:31 +01:00
Ondřej Surý	00f1823366	Change the isc_quota API to use cds_wfcqueue internally The isc_quota API was using locked list of isc_job_t objects to keep the waiting TCP accepts. Change the isc_quota implementation to use cds_wfcqueue internally - the enqueue is wait-free and only dequeue needs to be locked.	2023-05-12 14:16:25 +02:00
Ondřej Surý	7b1d985de2	Change the isc_async API to use cds_wfcqueue internally The isc_async API was using lock-free stack (where enqueue operation was not wait-free). Change the isc_async to use cds_wfcqueue internally - enqueue and splice (move the queue members from one list to another) is nonblocking and wait-free.	2023-05-12 14:16:25 +02:00
Ondřej Surý	7220851f67	Replace glue_cache hashtable with direct link in rdatasetheader Instead of having a global hashtable with a global rwlock for the GLUE cache, move the glue_list directly into rdatasetheader and use Userspace-RCU to update the pointer when the glue_list is empty. Additionally, the cached glue_lists need to be stored in the RBTDB version for early cleaning, otherwise the circular dependencies between nodes and glue_lists will prevent nodes from ever being cleaned up.	2023-05-12 13:25:39 +02:00
Michal Nowak	31935a3537	Disable ASAN in nsupdate for fatal cases Clang 16 LeakSanitizer reports a memory leak when dns_request_create() returned a TLS error in the nsupdate system test. While technically a memory leak on error handling, it's not a problem because the program is immediately terminated; nsupdate is not expected to run for a prolonged time.	2023-05-11 13:39:51 +02:00
Tony Finch	7d1ceaf35d	Move per-thread RCU setup into isc_thread All the per-loop `libuv` setup remains in `isc_loop`, but the per-thread RCU setup is moved to `isc_thread` alongside the other per-thread setup. This avoids repeating the per-thread setup for `call_rcu()` helpers, and explains a little better why some parts of the per-thread setup are missing for `call_rcu()` helpers. This also removes the per-loop `call_rcu()` helpers as we refactored the isc__random_initialize() in the previous commit.	2023-04-27 12:38:53 +02:00
Tony Finch	e0248bf60f	Simplify isc_thread a little Remove the `isc_threadarg_t` and `isc_threadresult_t` typedefs which were unhelpful disguises for `void *`, and free the dummy jemalloc allocation sooner.	2023-04-27 12:38:53 +02:00
Tony Finch	06f534fa69	Avoid spurious compilation failures in liburcu headers When liburcu is not installed from a system package, its headers are not treated as system headers by the compiler, so BIND's -Werror and other warning options take effect. The liburcu headers have a lot of inline functions, some of which do not use all their arguments, which BIND's build treats as an error.	2023-04-27 12:38:53 +02:00
Ondřej Surý	c2c907d728	Improve the Userspace RCU integration This commit allows BIND 9 to be compiled with different flavours of Userspace RCU, and improves the integration between Userspace RCU and our event loop: - In the RCU QSBR, the thread is put offline when polling and online when rcu_dereference, rcu_assign_pointer (or friends) are called. - In other RCU modes, we check that we are not reading when reaching the quiescent callback in the event loop. - We register the thread before uv_work_run() callback is called and after it has finished. The rcu_(un)register_thread() has a large overhead, but that's fine in this case.	2023-04-27 12:38:53 +02:00
Ondřej Surý	ae997d9e21	Add ISC_LIST_FOREACH(_SAFE) macros There's a recurring pattern walking the ISC_LISTs that just repeats over and over. Add two macros: * ISC_LIST_FOREACH(list, elt, link) - walk the static list * ISC_LIST_FOREACH_SAFE(list, elt, link, next) - walk the list in a manner that's safe against list member deletions	2023-04-25 08:16:46 +02:00
Evan Hunt	0393b54afb	add a result code for ENOPROTOOPT, EPROTONOSUPPORT there was no isc_result_t value for invalid protocol errors that could be returned from libuv.	2023-04-21 12:42:10 +02:00
Ondřej Surý	b497e90179	Add isc_spinlock unit with shim pthread_spin implementation The spinlock is a small (atomic_uint_fast32_t at most), lightweight synchronization primitive and should only be used for short-lived critical sections; most of the time an isc_mutex should be used. Add an isc_spinlock unit which is either (most of the time) a thin wrapper around the pthread_spin API or an efficient shim implementation of the simple spinlock.	2023-04-21 12:10:02 +02:00
Ondřej Surý	3b10814569	Fix the streaming read callback shutdown logic When shutting down TCP sockets, the read callback calling logic was flawed: it would call either one less callback or one extra. Fix the logic in the following way: 1. When isc_nm_read() has been called but isc_nm_read_stop() hasn't on the handle, the read callback will be called with ISC_R_CANCELED to cancel active reading from the socket/handle. 2. When isc_nm_read() has been called and isc_nm_read_stop() has been called on the handle, the read callback will be called with ISC_R_SHUTTINGDOWN to signal that the dormant (not-reading) socket is being shut down. 3. The .reading and .recv_read flags are a little bit tricky. The .reading flag indicates if the outer layer is reading the data (that would be uv_tcp_t for TCP and isc_nmsocket_t (TCP) for TLSStream), the .recv_read flag indicates whether somebody is interested in the data read from the socket. Usually, you would expect that the .reading should be false when .recv_read is false, but it gets even more tricky with TLSStream as the TLS protocol might need to read from the socket even when sending data. Fix the usage of the .recv_read and .reading flags in the TLSStream to their true meaning - which mostly consists of using .recv_read everywhere and then wrapping isc_nm_read() and isc_nm_read_stop() with the .reading flag. 4. The TLS failed read helper has been modified to resemble the TCP code as much as possible, clearing and re-setting the .recv_read flag in the TCP timeout code has been fixed and .recv_read is now cleared when isc_nm_read_stop() has been called on the streaming socket. 5. The use of Network Manager in the named_controlconf, isccc_ccmsg, and isc_httpd units has been greatly simplified due to the improved design. 6. More unit tests for TCP and TLS testing the shutdown conditions have been added. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Artem Boldariev <artem@isc.org>	2023-04-20 12:58:32 +02:00
Ondřej Surý	1715cad685	Refactor the isc_quota code and fix the quota in TCP accept code In `e185412872`, the TCP accept quota code became broken in a subtle way - the quota would get initialized on the first accept for the server socket and then deleted from the server socket, so it would never get applied again. Properly fixing this required a bigger refactoring of the isc_quota API code to make it much simpler. The new code decouples the ownership of the quota and acquiring/releasing the quota limit. After (during) the refactoring it became more clear that we need to use the callback from the child side of the accepted connection, and not the server side.	2023-04-12 14:10:37 +02:00
Ondřej Surý	0a468e7c9e	Make isc_tid() a header-only function The isc_tid() function is often called on the hot path and its only function is to return a thread_local variable. Make the isc_tid() function header-only to save several function calls during query-response processing.	2023-04-12 14:10:37 +02:00
Artem Boldariev	2b3a3c21dc	Stream DNS: avoid memory copying/buffer resizing when reading data This commit optimises isc_dnsstream_assembler_t in such a way that memory copying and reallocation are avoided when receiving one or more complete DNS messages at once. We try to handle the data from the messages directly, without storing them in an intermediate memory buffer.	2023-04-03 13:31:46 +00:00
Tony Finch	cd0e7f853a	Simplify histogram quantiles The `isc_histosummary_t` functions were written in the early days of `hg64` and carried over when I brought `hg64` into BIND. They were intended to be useful for graphing cumulative frequency distributions and the like, but in practice whatever draws charts is better off with a raw histogram export. Especially because of the poor performance of the old functions. The replacement `isc_histo_quantiles()` function is intended for providing a few quantile values in BIND's stats channel, when the user does not want the full histogram. Unlike the old functions, the caller provides all the query fractions up-front, so that the values can be found in a single scan instead of a scan per value. The scan is from larger values to smaller, since larger quantiles are usually more interesting, so the scan can bail out early.	2023-04-03 12:08:05 +01:00
Tony Finch	bc2389b828	Add per-thread sharded histograms for heavy loads Although an `isc_histo_t` is thread-safe, it can suffer from cache contention under heavy load. To avoid this, an `isc_histomulti_t` contains a histogram per thread, so updates are local and low-contention.	2023-04-03 12:08:05 +01:00
Tony Finch	82213a48cf	Add isc_histo for histogram statistics This is an adaptation of my `hg64` experiments for use in BIND. As well as renaming everything according to ISC style, I have written some more extensive tests that ensure the edge cases are correct and the fenceposts are in the right places. I have added utility functions for working with precision in terms of decimal significant figures as well as this code's native binary.	2023-04-03 12:08:05 +01:00
Ondřej Surý	3a6a0fa867	Replace DE_CONST(k, v) with v = UNCONST(k) macro Replace the complicated DE_CONST macro that required a union with the much simpler reference-dereference trick in the UNCONST() macro.	2023-04-03 10:25:56 +00:00
Ondřej Surý	4ec9c4a1db	Cleanup the last Windows / MSC ifdefs and comments Cleanup the remnants of MS Compiler bits from <isc/refcount.h>, printing the information in named/main.c, and cleanup some comments about Windows that no longer apply. The bits in picohttpparser.{h,c} were left out, because it's not our code.	2023-04-03 09:06:20 +00:00
Mark Andrews	e029803704	Handle fatal and FIPS provider interactions When fatal is called we may be holding memory allocated by OpenSSL. This may result in the reference count for the FIPS provider not going to zero and the shared library not being unloaded during OPENSSL_cleanup. When the shared library is ultimately unloaded, when all remaining dynamically loaded libraries are freed, we have already destroyed the memory context we were using to track memory leaks / late frees, resulting in INSIST being called. Disable triggering the INSIST when fatal has been called.	2023-04-03 12:44:27 +10:00
Mark Andrews	5a2e82557e	Define isc_fips_mode() and isc_fips_set_mode() isc_fips_mode() determines if the process is running in FIPS mode; isc_fips_set_mode() sets the process into FIPS mode.	2023-04-03 12:05:28 +10:00
Tony Finch	555690a3c9	Simplify thread spawning The `isc_trampoline` module had a lot of machinery to support stable thread IDs for use by hazard pointers. But the hazard pointer code is gone, and the `isc_loop` module now has its own per-loop thread IDs. The trampoline machinery seems over-complicated for its remaining tasks, so move the per-thread initialization into `isc/thread.c`, and delete the rest.	2023-03-31 17:21:52 +01:00
Ondřej Surý	a5f5f68502	Refactor isc_time_now() to return time, and not result The isc_time_now() and isc_time_now_hires() were used inconsistently through the code - either with status check, or without status check, or via TIME_NOW() macro with RUNTIME_CHECK() on failure. Refactor the isc_time_now() and isc_time_now_hires() to always fail when getting current time has failed, and return the isc_time_t value as return value instead of passing the pointer to result in the argument.	2023-03-31 15:02:06 +02:00
Ondřej Surý	263d232c79	Replace isc_fsaccess API with more secure file creation The isc_fsaccess API was created to hide the implementation details between POSIX and Windows APIs. As we are not supporting the Windows APIs anymore, it's better to drop this API used in the DST part. Moreover, the isc_fsaccess was setting the permissions in an insecure manner - it operated on the filename, and not on the file descriptor, which can lead to all kinds of attacks if an unprivileged user has read (or even worse, write) access to the key directory. Replace the code that operates on the private keys with code that uses mkstemp(), fchmod() and atomic rename() at the end, so at no time do the private key files have insecure permissions.	2023-03-31 12:52:59 +00:00
Ondřej Surý	aca7dd3961	Add isc_os_umask() function to get current umask As it's impossible to get the current umask without modifying it at the same time, initialize the current umask at the program start and keep the loaded value internally. Add isc_os_umask() function to access the starttime umask.	2023-03-31 12:52:59 +00:00
Ondřej Surý	4bd6096d4b	Remove isc_stdtime_get() macro Now that isc_stdtime_get() macro is unused, remove it from the header file.	2023-03-31 13:33:16 +02:00
Ondřej Surý	c11af0448a	Provide isc_stdtime_now(void) that returns value As isc_stdtime_get() cannot fail, the API seems to be too complicated, add new isc_stdtime_now() that returns the unixtime as a return value.	2023-03-31 13:16:28 +02:00
Ondřej Surý	2c0a9575d7	Replace __attribute__((unused)) with ISC_ATTR_UNUSED attribute macro Instead of marking the unused entities with UNUSED(x) macro in the function body, use an `ISC_ATTR_UNUSED` attribute macro that expands to C23 [[maybe_unused]] or __attribute__((__unused__)) as fallback.	2023-03-30 23:29:25 +02:00
Ondřej Surý	1176bf0552	Use C23 attributes if available, add ISC_ATTR_UNUSED Use C23 attribute styles if available: * Add new ISC_ATTR_UNUSED attribute macro that either expands to C23's [[maybe_unused]] or __attribute__((__unused__)); * Add default expansion of the `noreturn` to [[noreturn]] if available; * Move the FALLTHROUGH from <isc/util.h> to <isc/attributes.h>	2023-03-30 22:43:39 +02:00
Ondřej Surý	f5fc224af3	Add isc_async_current() macro to run job on current loop Previously, isc_job_run() could have been used to run the job on the current loop and the isc_job_run() would take care of allocating and deallocating the job. After the change in this MR, the isc_job_run() is more complicated to use, so we introduce the isc_async_current() macro to supplement isc_async_run() when we need to run the job on the current loop.	2023-03-30 16:07:41 +02:00
Ondřej Surý	1844590ad9	Refactor isc_job_run to not make any allocations Change the isc_job_run() to not make any allocations. The caller must make sure that it allocates isc_job_t - usually as part of the argument passed to the callback. For simple jobs, using isc_async_run() is advised as it allocates its own separate isc_job_t.	2023-03-30 16:00:52 +02:00
Tony Finch	295e7c80e8	Ad-hoc backtrace logging with isc_backtrace_log() It's sometimes helpful to get a quick idea of the call stack when debugging. This change factors out the backtrace logging from named's fatal error handler so that it's easy to use in other places too.	2023-03-29 10:47:53 +00:00
Evan Hunt	fe7ed2ba24	update stream sockets with bound address/port when isc_nm_listenstreamdns() is called with a local port of 0, a random port is chosen. call uv_getsockname() to determine what the port is as soon as the socket is bound, and add a function isc_nmsocket_getaddr() to retrieve it, so that the caller can connect to the listening socket. this will be used in cases where the same process is acting as both client and server.	2023-03-28 12:38:28 -07:00
Artem Boldariev	719343348e	Delete old TLS DNS and TCP DNS dead code This commit removes old, unused TLS DNS and TCP DNS definitions from the code. They should have been deleted earlier, but that was missed.	2023-03-15 18:40:58 +02:00
Tony Finch	7e565a87a7	Apply adjusted clang-format The headers were slightly reordered when liburcu was added.	2023-03-10 17:31:28 +01:00
Aram Sargsyan	fce68da460	Fix ISC_REFCOUNT_TRACE_IMPL usage ISC_REFCOUNT_TRACE_IMPL uses isc_tid(), but the corresponding header file is not included, which breaks, for example, compiling BIND with DNS_CATZ_TRACE defined in lib/dns/include/dns/catz.h. Add '#include <isc/tid.h>' in lib/isc/include/isc/refcount.h.	2023-03-09 21:38:04 +00:00
Tony Finch	c43668f031	Remove some lingering references to libbind9 Clean up the `.clang-format` #include priority list and the `\file` declaration in `isc/getaddresses.h`.	2023-03-08 10:06:22 +00:00
Tony Finch	9b7aa536ba	QSBR: safe memory reclamation for lock-free data structures This "quiescent state based reclamation" module provides support for the qp-trie module in dns/qp. It is a replacement for liburcu, written without reference to the urcu source code, and in fact it works in a significantly different way. A few specifics of BIND make this variant of QSBR somewhat simpler: * We can require that wait-free access to a qp-trie only happens in an isc_loop callback. The loop provides a natural quiescent state, after the callbacks are done, when no qp-trie access occurs. * We can dispense with any API like rcu_synchronize(). In practice, it takes far too long to wait for a grace period to elapse for each write to a data structure. * We use the idea of "phases" (aka epochs or eras) from EBR to reduce the amount of bookkeeping needed to track memory that is no longer needed, knowing that the qp-trie does most of that work already. I considered hazard pointers for safe memory reclamation. They have more read-side overhead (updating the hazard pointers) and it wasn't clear to me how to nicely schedule the cleanup work. Another alternative, epoch-based reclamation, is designed for fine-grained lock-free updates, so it needs some rethinking to work well with the heavily read-biased design of the qp-trie. QSBR has the fastest read side of the basic SMR algorithms (with no barriers), and fits well into a libuv loop. More recent hybrid SMR algorithms do not appear to have enough benefits to justify the extra complexity.	2023-02-23 15:57:53 +00:00
Tony Finch	63cd73d43e	Include thread ID in refcount trace output	2023-02-23 14:28:27 +00:00
Evan Hunt	dc27552c30	remove isc_glob the isc_glob module was originally needed to support posix-style glob processing on Windows, but is now just an unnecessary wrapper around glob(3). this commit removes it.	2023-02-22 17:35:29 +00:00
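
Commit `4f97a679f0` describes a macro that wraps up the three quantities needed to allocate a struct with a flexible array member: the size of the struct, the size of one element, and the number of elements. A minimal sketch of that idea follows. The macro name `FLEX_SIZE`, its argument order, and the `bucket_t` type are assumptions made for illustration; they are not taken from the BIND source, and the real `STRUCT_FLEX_SIZE()` may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/*
 * Hypothetical macro (name and argument order are assumptions):
 * bytes needed for *ptr plus `count` elements of its flexible
 * array `member`. Both sizeof operands are unevaluated, so `ptr`
 * may be NULL or uninitialized.
 */
#define FLEX_SIZE(ptr, member, count) \
	(sizeof(*(ptr)) + sizeof((ptr)->member[0]) * (count))

/* Illustrative struct; not a BIND type. */
typedef struct bucket {
	size_t count;
	uint32_t values[]; /* flexible array member */
} bucket_t;

/* Allocate a zeroed bucket with room for `count` values, or return NULL. */
static bucket_t *
bucket_new(size_t count) {
	bucket_t *b = NULL;

	/* Guard against overflow in the size computation. */
	assert(count <= (SIZE_MAX - sizeof(bucket_t)) / sizeof(uint32_t));

	b = calloc(1, FLEX_SIZE(b, values, count));
	if (b != NULL) {
		b->count = count;
	}
	return b;
}
```

A caller would write `bucket_t *b = bucket_new(4);` and later `free(b);`. The point of the macro is that the allocation site names the pointer and the member once instead of spelling out three `sizeof` terms.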

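Commit `263d232c79` replaces filename-based permission changes with `mkstemp()`, `fchmod()` and an atomic `rename()`. The sequence can be sketched as below. The helper name `write_private_file`, the `.XXXXXX` temporary suffix, and the fixed-size path buffer are assumptions for illustration only; this is not the code from the DST module.

```c
#define _POSIX_C_SOURCE 200809L

#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper: write `len` bytes to `path` so that the file
 * is never visible under its final name with permissions looser than
 * `mode`. Returns 0 on success, -1 on failure.
 */
static int
write_private_file(const char *path, const void *data, size_t len,
		   mode_t mode) {
	char tmpl[4096];
	int fd, n;

	assert(path != NULL && data != NULL);

	/* Temporary name in the same directory, so rename() stays atomic. */
	n = snprintf(tmpl, sizeof(tmpl), "%s.XXXXXX", path);
	if (n < 0 || (size_t)n >= sizeof(tmpl)) {
		return -1;
	}

	fd = mkstemp(tmpl); /* creates and opens a uniquely named file */
	if (fd < 0) {
		return -1;
	}

	/*
	 * Set permissions on the descriptor, not on the name. A short
	 * write is treated as a failure to keep the sketch small.
	 */
	if (fchmod(fd, mode) != 0 ||
	    write(fd, data, len) != (ssize_t)len || fsync(fd) != 0)
	{
		close(fd);
		unlink(tmpl);
		return -1;
	}

	/* Publish the finished file under its final name in one step. */
	if (close(fd) != 0 || rename(tmpl, path) != 0) {
		unlink(tmpl);
		return -1;
	}
	return 0;
}
```

Because every permission change goes through the open descriptor, another user who can manipulate names in the directory cannot redirect the `chmod` to a different file, which is the weakness the commit message attributes to the old API.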

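Commit `ae997d9e21` adds two list-walking macros, one plain and one that is safe against deleting the current element. The difference can be illustrated with a simplified singly linked list. The macro names `LIST_FOREACH` and `LIST_FOREACH_SAFE`, their argument lists, and the `item_t` type are assumptions for this sketch; the real `ISC_LIST_FOREACH` macros operate on `ISC_LIST` and take a link-member argument.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Illustrative element type; not a BIND type. */
typedef struct item {
	int value;
	struct item *next;
} item_t;

/* Hypothetical macro: plain walk; the body must not unlink or free `elt`. */
#define LIST_FOREACH(head, elt) \
	for (item_t *elt = (head); elt != NULL; elt = elt->next)

/*
 * Hypothetical macro: the successor is saved in `nxt` before the body
 * runs, so the body may free `elt`.
 */
#define LIST_FOREACH_SAFE(head, elt, nxt)                                   \
	for (item_t *elt = (head), *nxt = (elt != NULL) ? elt->next : NULL; \
	     elt != NULL;                                                   \
	     elt = nxt, nxt = (elt != NULL) ? elt->next : NULL)

/* Prepend a new element and return the new head. */
static item_t *
list_push(item_t *head, int value) {
	item_t *it = malloc(sizeof(*it));
	assert(it != NULL);
	it->value = value;
	it->next = head;
	return it;
}

/* Sum all values using the plain walk. */
static int
list_sum(item_t *head) {
	int sum = 0;
	LIST_FOREACH(head, it) {
		sum += it->value;
	}
	return sum;
}

/* Free every element; only the safe variant can be used here. */
static void
list_free(item_t *head) {
	LIST_FOREACH_SAFE(head, it, nxt) {
		free(it);
	}
}
```

Using the plain macro in `list_free()` would read `it->next` after `free(it)`, which is exactly the bug the `_SAFE` variant exists to prevent.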
1713 commits