bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-06-05 11:12:05 -04:00

Author	SHA1	Message	Date
Aram Sargsyan	6539f73e3a	Introduce the concept of broken catalog zones The DNS catalog zones draft version 5 document describes various situations when a catalog zone must be considered as "broken" and not be processed. Implement those checks in catz.c and add corresponding system tests. (cherry picked from commit `a8228d5f19`)	2022-04-28 12:48:41 +00:00
Artem Boldariev	4637b72da6	Change X509_STORE_up_ref() shim return value X509_STORE_up_ref() must return 1 on success, while the previous implementation would return the references count. This commit fixes that.	2022-04-28 13:39:22 +03:00
Artem Boldariev	26feac0c61	Implement shim for SSL_CTX_set1_cert_store() (affects Debian 9) This commit implements a shim for SSL_CTX_set1_cert_store() for OpenSSL/LibreSSL versions where it is not available.	2022-04-28 13:39:22 +03:00
Artem Boldariev	6c05fb09c3	Add support for Strict/Mutual TLS into BIND This commit adds support for Strict/Mutual TLS into BIND. It does so by implementing the backing code for 'hostname' and 'ca-file' options of the 'tls' statement. The commit also updates the documentation accordingly.	2022-04-28 13:39:21 +03:00
Artem Boldariev	05091f0095	Restore disabled unused 'tls' options: 'ca-file' and 'hostname' This commit restores the 'tls' options disabled in `78b73d0865`.	2022-04-28 13:39:21 +03:00
Artem Boldariev	9b0eb3e5a3	Add ISC_R_TLSBADPEERCERT error code to the TLS related code This commit adds support for the ISC_R_TLSBADPEERCERT error code, which is intended to signal TLS peer certificate verification failures in dig and other code. The support for this error code is added to our TLS and TLS DNS implementations. This commit also adds the isc_nm_verify_tls_peer_result_string() function, which is intended to provide a textual description of the reason for an ISC_R_TLSBADPEERCERT error.	2022-04-28 13:39:21 +03:00
Artem Boldariev	e2a3ec2ba5	Extend TLS context cache with CA certificates store This commit adds support for keeping CA certificate stores associated with TLS contexts. The intention is to keep one reusable store per set of related TLS contexts.	2022-04-28 13:39:21 +03:00
Artem Boldariev	6fdc03102a	Add foundational functions to implement Strict/Mutual TLS This commit adds a set of functions that can be used to implement Strict and Mutual TLS: * isc_tlsctx_load_client_ca_names(); * isc_tlsctx_load_certificate(); * isc_tls_verify_peer_result_string(); * isc_tlsctx_enable_peer_verification().	2022-04-28 13:39:21 +03:00
Artem Boldariev	b975ee7be4	Add utility functions to manipulate X509 certificate stores This commit adds a set of high-level utility functions to manipulate the certificate stores. The stores are needed to implement TLS certificates verification efficiently.	2022-04-28 13:39:21 +03:00
Matthijs Mekking	125603a543	Add stale answer extended errors Add DNS extended errors 3 (Stale Answer) and 19 (Stale NXDOMAIN Answer) to responses. Add extra text with the reason why the stale answer was returned. To test, we need to change the configuration such that for the first set of tests the stale-refresh-time window does not interfere with the expected extended errors. (cherry picked from commit `c66b9abc0b`)	2022-04-28 11:21:22 +02:00
Evan Hunt	af37c6f60c	prevent a deadlock in the shutdown system test The shutdown test sends 'rndc status' commands in parallel with 'rndc stop'. A new rndc connection arriving will reference the ACL environment to see whether the client is allowed to connect. Commit `c0995bc380` added a mutex lock to ns_interfacemgr_getaclenv(), but if the new connection arrives while the interfaces are being purged during shutdown, that lock is already being held. If the connection event slips in ahead of one of the netmgr's "stop listening" events on a worker thread, a deadlock can occur. The fix is not to hold the interfacemgr lock while shutting down interfaces; only while actually traversing the interface list to identify interfaces needing shutdown. (cherry picked from commit `5c4cf3fcc4`)	2022-04-27 23:56:59 -07:00
Evan Hunt	9296be2130	refactor resume_dsfetch() clean up resume_dsfetch() so that the fctx reference counting is saner and easier to follow. (cherry picked from commit `7b2ea97e46`)	2022-04-27 16:12:55 -07:00
Evan Hunt	759e0b030e	refactor validated() minor changes to ensure that fctx reference counting is clear and correct. (cherry picked from commit `d2f407cca3`)	2022-04-27 16:12:55 -07:00
Evan Hunt	50ef4a8cae	rename maybe_destroy() to maybe_cancel_validators() the maybe_destroy() function no longer destroys the fctx, so rename it and update the comments. (cherry picked from commit `7c5afebcdc`)	2022-04-27 16:12:55 -07:00
Evan Hunt	8a90a62d0f	refactor fctx_done() to set fctx to NULL previously fctx_done() detached the fctx but did not clear the pointer passed into it from the caller. in some conditions, when rctx_done() was reached while waiting for a validator to complete, fctx_done() could be called twice on the same fetch, causing a double detach. fctx_done() now clears the fctx pointer, to reduce the chances of such mistakes. (cherry picked from commit `b4592d02a1`)	2022-04-27 16:12:55 -07:00
Artem Boldariev	f39041ee65	Replace listener TLS contexts on reconfiguration This commit makes use of isc_nmsocket_set_tlsctx(). Now, instead of recreating TLS-enabled listeners (including the underlying TCP listener sockets), only the TLS context in use is replaced.	2022-04-27 23:58:38 +03:00
Artem Boldariev	3a75b33287	Add isc_nmsocket_set_tlsctx() This commit adds isc_nmsocket_set_tlsctx() - an asynchronous function that replaces the TLS context within a given TLS-enabled listener socket object. It is based on the newly added reference counting functionality. The intention of adding this function is to add functionality to replace a TLS context without recreating the whole socket object, including the underlying TCP listener socket, as a BIND process might not have enough permissions to re-create it fully on reconfiguration.	2022-04-27 23:58:38 +03:00
Artem Boldariev	f52c06054b	Maintain a per-thread TLS ctx reference in TLS stream code This commit changes the generic TLS stream code to maintain a per-worker thread TLS context reference.	2022-04-27 23:58:38 +03:00
Artem Boldariev	28460151ca	Use isc_tlsctx_attach() in TLS DNS code This commit adds proper reference counting for TLS contexts into generic TLS DNS (DoT) code.	2022-04-27 23:58:38 +03:00
Artem Boldariev	ff987957e7	Use isc_tlsctx_attach() in TLS stream code This commit adds proper reference counting for TLS contexts into generic TLS stream code.	2022-04-27 23:58:38 +03:00
Artem Boldariev	677819d22d	Add isc_tlsctx_attach() The implementation is done on top of the reference counting functionality found in OpenSSL/LibreSSL, which allows for avoiding wrapping the object. Adding this function allows using reference counting for TLS contexts in BIND 9's codebase.	2022-04-27 23:58:38 +03:00
Aram Sargsyan	4de1f65e4d	Handle ISC_R_SUCCESS on a deactivated response in udp_recv() There is a possibility for `udp_recv()` to be called with `eresult` being `ISC_R_SUCCESS`, but nevertheless with already deactivated `resp`, which can happen when the request has been canceled in the meantime. (cherry picked from commit `e3a88862c0`)	2022-04-27 18:08:42 +00:00
Artem Boldariev	8b19f62ac5	TLSDNS: call send callbacks after only the data was sent This commit ensures that write callbacks are called only after the data has been sent via the network. Without this fix, a write callback could be called before the encrypted data had actually been sent to the network; instead, it would be called right after the data had been passed to OpenSSL (i.e. encrypted). Most likely, the issue does not reveal itself often because the callback call was asynchronous, so in most cases it should have been called after the data has been sent, but that was not guaranteed by the code logic. Also, this commit removes one memory allocation (netievent) from a hot path, as there is no need to call this callback asynchronously anymore.	2022-04-27 17:57:11 +03:00
Ondřej Surý	192df8d2f1	The route socket and its storage was detached while still reading The interfacemgr and the .route were being detached while the network manager had a pending read from the socket. Instead of detaching from the socket, we need to cancel the read, which in turn will detach the route socket and the associated interfacemgr. (cherry picked from commit `9ae34a04e8`)	2022-04-26 16:41:24 +02:00
Ondřej Surý	8beaee0b08	Remove task exclusive mode from ns_clientmgr The .lock, .exiting and .excl members were not used for anything other than starting task exclusive mode, setting .exiting to true and ending exclusive mode. Remove all the stray members and dead code, eliminating the task exclusive mode use from ns_clientmgr. (cherry picked from commit `4f74e1010e`)	2022-04-26 15:56:30 +02:00
Ondřej Surý	ce8ffdda69	Remove exclusive mode from ns_interfacemgr Now that dns_aclenv_t has properly rwlocked .localhost and .localnets members, we can remove the task exclusive mode use from the ns_interfacemgr. Some light related cleanup has also been done. (cherry picked from commit `c0995bc380`)	2022-04-26 14:21:57 +02:00
Ondřej Surý	ab528a0fcb	Add isc_rwlock around dns_aclenv .localhost and .localnets member In order to modify the .localhost and .localnets members of the dns_aclenv, all other processing on the netmgr loops needed to be stopped using the task exclusive mode. Add the isc_rwlock to the dns_aclenv, so any modifications to the .localhost and .localnets can be done under the write lock. (cherry picked from commit `8138a595d9`)	2022-04-26 14:21:57 +02:00
Ondřej Surý	2a648b9078	Abort when libuv at runtime mismatches libuv at compile time When we compile against a libuv that exposes some capabilities via flags passed to e.g. uv_udp_listen() or uv_udp_bind(), a call with such flags fails with an invalid-argument error if an older libuv that doesn't understand the flag is linked at runtime. Enforce a minimal libuv version at runtime when such flags were available at compile time. This check is less strict than requiring the runtime libuv version to be the same as or higher than the compile-time libuv version.	2022-04-26 12:11:51 +02:00
Ondřej Surý	7e72c55ff9	Run resume_dslookup() from the correct task The rctx_chaseds() function calls dns_resolver_createfetch(), passing fctx->task as the target task to run resume_dslookup() from. This breaks task-based serialization of events as fctx->task is the task that the dns_resolver_createfetch() caller wants to receive its fetch completion event in; meanwhile, intermediate fetches started by the resolver itself (e.g. related to QNAME minimization) must use res->buckets[bucketnum].task instead. This discrepancy may cause trouble if the resume_dslookup() callback happens to be run concurrently with e.g. fctx_doshutdown(). Fix by passing the correct task to dns_resolver_createfetch() in rctx_chaseds(). (cherry picked from commit `741a7096fc`)	2022-04-22 15:57:22 +02:00
Michał Kępień	4ac4640c40	Fix loading plugins using just their filenames BIND 9 plugins are installed using Automake's pkglib_LTLIBRARIES stanza, which causes the relevant shared objects to be placed in the $(libdir)/@PACKAGE@/ directory, where @PACKAGE@ is expanded to the lowercase form of the first argument passed to AC_INIT(), i.e. "bind". Meanwhile, NAMED_PLUGINDIR - the preprocessor macro that the ns_plugin_expandpath() function uses for determining the absolute path to a plugin for which only a filename has been provided (rather than a path) - is set to $(libdir)/named. This discrepancy breaks loading plugins using just their filenames. Fix the issue (and also prevent it from reoccurring) by setting NAMED_PLUGINDIR to $(pkglibdir). (cherry picked from commit `5065c4686e`)	2022-04-22 13:29:10 +02:00
Michał Kępień	2da371d005	Prevent memory bloat caused by a jemalloc quirk Since version 5.0.0, decay-based purging is the only available dirty page cleanup mechanism in jemalloc. It relies on so-called tickers, which are simple data structures used for ensuring that certain actions are taken "once every N times". Ticker data (state) is stored in a thread-specific data structure called tsd in jemalloc parlance. Ticks are triggered when extents are allocated and deallocated. Once every 1000 ticks, jemalloc attempts to release some of the dirty pages hanging around (if any). This allows memory use to be kept in check over time. This dirty page cleanup mechanism has a quirk. If the first allocator-related action for a given thread is a free(), a minimally-initialized tsd is set up which does not include ticker data. When that thread subsequently calls *alloc(), the tsd transitions to its nominal state, but due to a certain flag being set during minimal tsd initialization, ticker data remains unallocated. This prevents decay-based dirty page purging from working, effectively enabling memory exhaustion over time. [1] The quirk described above has been addressed (by moving ticker state to a different structure) in jemalloc's development branch [2], but not in any numbered jemalloc version released to date (the latest one being 5.2.1 as of this writing). Work around the problem by ensuring that every thread spawned by isc_thread_create() starts with a malloc() call. Avoid immediately calling free() for the dummy allocation to prevent an optimizing compiler from stripping away the malloc() + free() pair altogether. An alternative implementation of this workaround was considered that used a pair of isc_mem_create() + isc_mem_destroy() calls instead of malloc() + free(), enabling the change to be fully contained within isc__trampoline_run() (i.e. to not touch struct isc__trampoline), as the compiler is not allowed to strip away arbitrary function calls. However, that solution was eventually dismissed as it triggered ThreadSanitizer reports when tools like dig, nsupdate, or rndc exited abruptly without waiting for all worker threads to finish their work. [1] https://github.com/jemalloc/jemalloc/issues/2251 [2] `c259323ab3` (cherry picked from commit `7aa7b6474b`)	2022-04-21 14:22:13 +02:00
Mark Andrews	40bfb70d6a	Update the rdataset->trust field in ncache.c:rdataset_settrust Both the trust recorded in the slab structure and the trust on rdataset need to be updated. (cherry picked from commit `d043a41499`)	2022-04-19 09:44:09 +10:00
Aram Sargsyan	546732546f	Do not use REQUIRE in dns_catz_entry_detach() after other code The REQUIRE checks should be at the top of the function before any assignments or code. Move the REQUIRE check to the top. (cherry picked from commit `99d1ec6c4b`)	2022-04-14 20:53:59 +00:00
Aram Sargsyan	5037aeb5d2	Replace CATZ_OPT_MASTERS with CATZ_OPT_PRIMARIES Update the enum entry in the continued effort of replacing some DNS terminology. (cherry picked from commit `59c486391d`)	2022-04-14 20:53:53 +00:00
Aram Sargsyan	c37a75df5d	Implement catalog zones change of ownership (coo) support Catalog zones change of ownership is a special mechanism to facilitate controlled migration of a member zone from one catalog to another. It is implemented using a catalog zones property named "coo" and is documented in the DNS catalog zones draft version 5 document. Implement the feature using a new hash table in the catalog zone structure, which holds the added "coo" properties for the catalog zone (containing the target catalog zone's name), and the key for the hash table being the member zone's name for which the "coo" property is being created. Change some log messages to have consistent zone name quoting types. Update the ARM with change of ownership documentation and usage examples. Add tests which check the newly added features. (cherry picked from commit `bb837db4ee`)	2022-04-14 20:53:31 +00:00
Aram Sargsyan	581d7bece0	Do not cancel processing record datasets in catalog zone after an error When there are multiple record datasets in a database node of a catalog zone, and BIND encounters a soft error during processing of a dataset, it breaks from the loop and doesn't process the other datasets in the node. There are cases when this is not desired. For example, the catalog zones draft version 5 states that there must be a TXT RRset named `version.$CATZ` with exactly one RR, but it doesn't set a limitation on possible non-TXT RRsets named `version.$CATZ` existing alongside with the TXT one. In case when one exists, we will get a processing error and will not continue the loop to process the TXT RRset coming next. Remove the "break" statement to continue processing all record datasets. (cherry picked from commit `0b2d5490cd`)	2022-04-14 19:51:45 +00:00
Aram Sargsyan	d8e1f51a04	Process the 'version' record of the catalog zone first When processing a new or updated catalog zone, the record datasets from the database are being processed in order. This creates a problem because we need to know the version of the catalog zone schema to process some of the records differently, but we do not know the version until the 'version' record gets processed. Find the 'version' record and process it first, only then iterate over the database to process the rest, making sure not to process the 'version' record twice. (cherry picked from commit `6035980bb1`)	2022-04-14 19:51:37 +00:00
Aram Sargsyan	f75c39811d	Implement catalog zones options new syntax based on custom properties According to DNS catalog zones draft version 5 document, catalog zone custom properties must be placed under the "ext" label. Make necessary changes to support the new custom properties syntax in catalog zones with version "2" of the schema. Change the default catalog zones schema version from "1" to "2" in ARM to prepare for the new features and changes which come starting from this commit in order to support the latest DNS catalog zones draft document. Make some restructuring in ARM and rename the term catalog zone "option" to "custom property" to better reflect the terms used in the draft. Change the version of 'catalog1.zone.' catalog zone in the "catz" system test to "2", and leave the version of 'catalog2.zone.' catalog zone at version "1" to test both versions. Add tests to check that the new syntax works only with the new schema version, and that the old syntax works only with the legacy schema version catalog zones. (cherry picked from commit `cedfebc64a`)	2022-04-14 19:51:22 +00:00
Matthijs Mekking	c3ab09deb5	Update dns_dnssec_syncdelete() function Update the function that synchronizes the CDS and CDNSKEY DELETE records. It now allows for the possibility that the CDS DELETE record is published and the CDNSKEY DELETE record is not, and vice versa. Also update how 'dns_dnssec_syncdelete()' is called in zone.c. With KASP, we still maintain the DELETE records ourselves. Otherwise, we publish the CDS and CDNSKEY DELETE records only if they are added to the zone. We do still check if these records can be signed by a KSK. This change will allow users to add a CDS and/or CDNSKEY DELETE record manually, without BIND removing them on the next zone sign. Note that this commit removes the check whether the key is a KSK; this check is redundant because it is also made in 'dst_key_is_signing()' when the role is set to DST_BOOL_KSK. (cherry picked from commit `3d05c99abb`)	2022-04-13 14:43:40 +02:00
Tony Finch	4191fd01be	Ensure that dns_request_createvia() has a retry limit There are a couple of problems with dns_request_createvia(): a UDP retry count of zero means unlimited retries (it should mean no retries), and the overall request timeout is not enforced. The combination of these bugs means that requests can be retried forever. This change alters calls to dns_request_createvia() to avoid the infinite retry bug by providing an explicit retry count. Previously, the calls specified infinite retries and relied on the limit implied by the overall request timeout and the UDP timeout (which did not work because the overall timeout is not enforced). The `udpretries` argument is also changed to be the number of retries; previously, zero was interpreted as infinity because of an underflow to UINT_MAX, which appeared to be a mistake. And `mdig` is updated to match the change in retry accounting. The bug could be triggered by zone maintenance queries, including NOTIFY messages, DS parental checks, refresh SOA queries and stub zone nameserver lookups. It could also occur with `nsupdate -r 0`. (But `mdig` had its own code to avoid the bug.) (cherry picked from commit `71ce8b0a51`)	2022-04-06 18:17:55 +01:00
Ondřej Surý	a1f3ff0dd1	Rename the configuration option to load balance sockets to reuseport After some back and forth, it was decided to match the configuration option with unbound ("so-reuseport"), PowerDNS ("reuseport") and/or nginx ("reuseport"). (cherry picked from commit `7e71c4d0cc`)	2022-04-06 17:24:13 +02:00
Ondřej Surý	25bc461446	Revert "General cleanup of dns_rpz implementation" This reverts commit `bfee462403`.	2022-04-06 10:42:29 +02:00
Ondřej Surý	242500909a	Revert "Refactor the dns_rpz_add/delete to use local rpz copy" This reverts commit `f4cba0784e`.	2022-04-06 10:42:22 +02:00
Ondřej Surý	9b78612e7d	Revert "Run the RPZ update as offloaded work" This reverts commit `e128b6a951`.	2022-04-06 10:30:06 +02:00
Ondřej Surý	df91d61dc7	Rename shutdown() to test_shutdown() in timer_test.c shutdown() is part of the standard library (POSIX.1), so don't use that name in timer_test.c; rename it to test_shutdown() instead. (cherry picked from commit `7868d8145b`)	2022-04-05 01:56:09 +02:00
Ondřej Surý	cd24556e14	Enable the load-balance-sockets configuration Previously, HAVE_SO_REUSEPORT_LB was defined only in the private netmgr-int.h header file, making the configuration of load balanced sockets inoperable. Move the missing HAVE_SO_REUSEPORT_LB define to isc/netmgr.h and add the missing isc_nm_getloadbalancesockets() implementation. (cherry picked from commit `142c63dda8`)	2022-04-05 01:38:49 +02:00
Ondřej Surý	64265f1c0e	Add option to configure load balance sockets Previously, the option to enable kernel load balancing of the sockets was always enabled when supported by the operating system (SO_REUSEPORT on Linux and SO_REUSEPORT_LB on FreeBSD). It was reported that in scenarios where the networking threads are also responsible for processing long-running tasks (like RPZ processing, CATZ processing or large zone transfers), this could lead to intermittent brownouts for some clients, because the thread assigned by the operating system might be busy. In such scenarios, the overall performance would be better served by threads competing over the sockets because the idle threads can pick up the incoming traffic. Add a new configuration option (`load-balance-sockets`) to allow enabling or disabling the load balancing of the sockets. (cherry picked from commit `85c6e797aa`)	2022-04-04 23:59:59 +02:00
Ondřej Surý	e128b6a951	Run the RPZ update as offloaded work Previously, the RPZ updates ran quantized on the main nm_worker loops. As the quantum was set to 1024, this might lead to service interruptions when a large RPZ update was processed. Change the RPZ update process to run as offloaded work. The update and cleanup loops were refactored to do as little locking of the maintenance lock as possible for the shortest periods of time, and the db iterator is paused on every iteration, so we don't hold the rbtdb tree lock for prolonged periods of time. (cherry picked from commit `f106d0ed2b`)	2022-04-04 22:59:59 +02:00
Ondřej Surý	f4cba0784e	Refactor the dns_rpz_add/delete to use local rpz copy Previously dns_rpz_add() was passed dns_rpz_zones_t and an index into the .zones array. Because we actually attach to dns_rpz_zone_t, we should be using the local pointer instead of passing the index and "finding" the dns_rpz_zone_t again. Additionally, dns_rpz_add() and dns_rpz_delete() were used only inside rpz.c, so make them static. (cherry picked from commit `b6e885c97f`)	2022-04-04 22:59:59 +02:00
Ondřej Surý	bfee462403	General cleanup of dns_rpz implementation Do a general cleanup of lib/dns/rpz.c style: * Removed deprecated and unused functions * Unified dns_rpz_zone_t naming to rpz * Unified dns_rpz_zones_t naming to rpzs * Add and use rpz_attach() and rpz_attach_rpzs() functions * Shuffled variables to be more local (cppcheck cleanup) (cherry picked from commit `840179a247`)	2022-04-04 22:59:59 +02:00
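
The fctx_done() change in `8a90a62d0f` describes a general pattern: detach through a pointer-to-pointer and clear the caller's copy, so that a second detach on the same variable is caught instead of silently corrupting the reference count. A minimal sketch of that pattern follows. The `fetch_t` type and the `fetch_new()` and `fetch_detach()` helpers are hypothetical names chosen for illustration; they are not BIND's fetch context code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical reference-counted object; not BIND's fetch context. */
typedef struct fetch {
	unsigned int references;
} fetch_t;

/* Allocate an object holding a single reference. */
static fetch_t *
fetch_new(void) {
	fetch_t *f = malloc(sizeof(*f));
	assert(f != NULL);
	f->references = 1;
	return f;
}

/*
 * Drop one reference and clear the caller's pointer. A second call on
 * the same variable then trips the assertion rather than decrementing
 * the counter twice.
 */
static void
fetch_detach(fetch_t **fp) {
	assert(fp != NULL && *fp != NULL);
	fetch_t *f = *fp;
	*fp = NULL;
	if (--f->references == 0) {
		free(f);
	}
}
```

After `fetch_detach(&f)`, the caller's `f` is NULL, so any later code path that tries to detach it again fails the assertion immediately.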

13825 commits
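
The retry-limit fix in `4191fd01be` mentions that a UDP retry count of zero was treated as infinity because of an underflow to UINT_MAX. The sketch below illustrates how that kind of wraparound can arise with unsigned arithmetic and how counting retries directly avoids it. Both helper functions are hypothetical and only model the arithmetic; they are not taken from dns_request_createvia().

```c
#include <assert.h>
#include <limits.h>

/*
 * Hypothetical "before" behaviour: the argument is treated as a total
 * attempt count and one is subtracted to get the retries left. With an
 * unsigned type, 0 - 1 wraps around to UINT_MAX.
 */
static unsigned int
retries_left_buggy(unsigned int udpcount) {
	return udpcount - 1;
}

/*
 * Hypothetical "after" behaviour: the argument is the number of
 * retries, so zero means "send once, never retry".
 */
static unsigned int
attempts_allowed(unsigned int udpretries) {
	assert(udpretries < UINT_MAX); /* keep retries + 1 from wrapping */
	return udpretries + 1;
}
```

In this model, `retries_left_buggy(0)` yields `UINT_MAX`, which is effectively unlimited, while `attempts_allowed(0)` yields a single attempt.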