bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-03-11 02:30:44 -04:00

Author	SHA1	Message	Date
Ondřej Surý	86f1ec34dc	Silence all warnings that stem from the default config As we now setup the logging very early, parsing the default config would always print warnings about experimental (and possibly deprecated) options in the default config. This would even mess with commands like `named -V` and it is also wrong to warn users about using experimental options in the default config, because they can't do anything about this. Add CFG_PCTX_NODEPRECATED and CFG_PCTX_NOEXPERIMENTAL options that we can pass to cfg parser and silence the early warnings caused by using experimental options in the default config.	2024-08-14 12:50:31 +00:00
Aydın Mercan	596903a6b7	use deterministic ecdsa for openssl >= 3.2 OpenSSL has added support for deterministic ECDSA (RFC 6979) with version 3.2. Use it by default as derandomization doesn't pose a risk for DNS usecases and is allowed by FIPS 186-5.	2024-08-14 14:34:44 +03:00
Aram Sargsyan	730fd32ee6	Reconfigure catz member zones during named reconfiguration During a reconfiguration named doesn't reconfigure catalog zones member zones. Implement the necessary code to reconfigure catz member zones.	2024-08-13 16:22:58 +02:00
Ondřej Surý	8e86e55af1	Don't skip the counting if fcount_incr() is called with force==true (v2) The fcount_incr() was not increasing counter->count when force was set to true, but fcount_decr() would try to decrease the counter leading to underflow and assertion failure. Swap the order of the arguments in the condition, so the !force is evaluated after incrementing the .count.	2024-08-13 12:51:22 +02:00
Ondřej Surý	39aef50b9b	Move the dst__openssl_toresult to isc_tls unit Since the enable_fips_mode() now resides inside the isc_tls unit, BIND 9 would fail to compile when FIPS mode was enabled as the DST subsystem logging functions were missing. Move the crypto library logging functions from the openssl_link unit to isc_tls unit and enhance it, so it can now be used from both places keeping the old dst__openssl_toresult* macros alive.	2024-08-08 11:59:41 +02:00
Evan Hunt	104f3b82fb	implement 'max-query-restarts' implement, document, and test the 'max-query-restarts' option which specifies the query restart limit - the number of times we can follow CNAMEs before terminating resolution.	2024-08-07 13:20:05 -07:00
Evan Hunt	7e3b425dc2	reduce the max-recursion-queries default to 32 the number of iterative queries that can be sent to resolve a name now defaults to 32 rather than 100.	2024-08-07 13:19:57 -07:00
Evan Hunt	c5588babaf	make "max_restarts" a configurable value MAX_RESTARTS is no longer hard-coded; ns_server_setmaxrestarts() and dns_client_setmaxrestarts() can now be used to modify the max-restarts value at runtime. in both cases, the default is 11.	2024-08-07 13:03:08 -07:00
Evan Hunt	05d78671bb	reduce MAX_RESTARTS to 11 the number of steps that can be followed in a CNAME chain before terminating the lookup has been reduced from 16 to 11. (this is a hard-coded value, but will be made configurable later.)	2024-08-07 13:00:42 -07:00
Evan Hunt	825f3d68c5	add debug logging when creating or attaching to a query counter fctx_create() now logs at debug level 9 when the fctx attaches to an existing counter or creates a new one.	2024-08-07 11:21:44 -07:00
Evan Hunt	af7db89513	apply max-recursion-queries quota to validator queries previously, validator queries for DNSKEY and DS records were not counted toward the quota for max-recursion-queries; they are now.	2024-08-07 11:21:44 -07:00
Evan Hunt	d3b7e92783	attach query counter to NS fetches there were cases in resolver.c when queries for NS records were started without passing a pointer to the parent fetch's query counter; as a result, the max-recursion-queries quota for those queries started counting from zero, instead of sharing the limit for the parent fetch, making the quota ineffective in some cases.	2024-08-07 11:21:44 -07:00
Aydın Mercan	f58ed932d8	use only c23 or c11 noreturn specifiers Since we require C11 or greater, we can depend on using either _Noreturn or [[noreturn]].	2024-08-07 18:27:40 +03:00
Ondřej Surý	e6f2f2a5e6	Initialize the DST subsystem implicitly Instead of calling dst_lib_init() and dst_lib_destroy() explicitly by all the programs, create a separate memory context for the DST subsystem and use the library constructor and destructor to initialize the DST internals.	2024-08-07 17:03:27 +02:00
Ondřej Surý	c11b736e44	Disassociate the SSL object from the cached SSL_SESSION When the SSL object was destroyed, it would invalidate all SSL_SESSION objects including the cached, but not yet used, TLS session objects. Properly disassociate the SSL object from the SSL_SESSION before we store it in the TLS session cache, so we can later destroy it without invalidating the cached TLS sessions. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Artem Boldariev <artem@isc.org> Co-authored-by: Aram Sargsyan <aram@isc.org>	2024-08-07 14:25:11 +00:00
Ondřej Surý	684f3eb8e6	Attach/detach to the listening child socket when accepting TLS When TLS connection (TLSstream) connection was accepted, the children listening socket was not attached to sock->server and thus it could have been freed before all the accepted connections were actually closed. In turn, this would cause us to call isc_tls_free() too soon - causing cascade errors in pending SSL_read_ex() in the accepted connections. Properly attach and detach the children listening socket when accepting and closing the server connections.	2024-08-07 14:17:43 +00:00
Ondřej Surý	495cf18c75	Remove checks for OPENSSL_API_LEVEL define Since the support for OpenSSL Engines has been removed, we can now also remove the checks for OPENSSL_API_LEVEL; The OpenSSL 3.x APIs will be used when compiling with OpenSSL 3.x, and OpenSSL 1.1.xx APIs will be used only when OpenSSL 1.1.x is used.	2024-08-06 15:17:48 +02:00
Ondřej Surý	ef7aba7072	Remove OpenSSL Engine support The OpenSSL 1.x Engines support has been deprecated in the OpenSSL 3.x and is going to be removed. Remove the OpenSSL Engine support in favor of OpenSSL Providers.	2024-08-06 15:17:48 +02:00
Ondřej Surý	5beae5faf9	Fix the glue table in the QP and RBT zone databases When adding glue to the header, we add header to the wait-free stack to be cleaned up later which sets wfc_node->next to non-NULL value. When the actual cleaning happens we would only cleanup the .glue_list, but since the database isn't locked for the time being, the headers could be reused while cleaning the existing glue entries, which creates a data race between database versions. Revert the code back to use per-database-version hashtable where keys are the node pointers. This allows each database version to have independent glue cache table that doesn't affect nodes or headers that could already "belong" to the future database version.	2024-08-05 15:36:54 +02:00
Evan Hunt	6b720bfe1a	minor findnode optimization when searching the cache for a node so that we can delete an rdataset, it is not necessary to set the 'create' flag. if the node doesn't exist yet, we then we won't be able to delete anything from it anyway.	2024-08-05 13:36:41 +00:00
Evan Hunt	a68a77ca86	dns_difftuple_create() cannot fail dns_difftuple_create() could only return success, so change its type to void and clean up all the calls to it. other functions that only returned a result value because of it have been cleaned up in the same way.	2024-08-05 13:31:38 +00:00
Evan Hunt	a84d54c6ff	raise the log level of priming failures when a priming query is complete, it's currently logged at level ISC_LOG_DEBUG(1), regardless of success or failure. we are now raising it to ISC_LOG_NOTICE in the case of failure.	2024-08-05 13:56:13 +02:00
Aydın Mercan	2a76352b37	fix the rsa exponent to 65537 There isn't a realistic reason to ever use e = 4294967297. Fortunately its codepath wasn't reachable to users and can be safetly removed. Keep in mind the `dns_key_generate` header comment was outdated. e = 3 hasn't been used since 2006 so there isn't a reason to panic. The toggle was the public exponents between 65537 and 4294967297.	2024-08-05 11:21:59 +00:00
Aydın Mercan	5dbb560747	remove the crc64 implementation CRC-64 has been added for map files. Now that the map file format has been removed, there isn't a reason to keep the implementation.	2024-08-05 11:21:25 +00:00
Ondřej Surý	13941c8ca7	Call rcu_barrier() in the isc_mem_destroy() just once The previous work in this area was led by the belief that we might be calling call_rcu() from within call_rcu() callbacks. After carefully checking all the current callback, it became evident that this is not the case and the problem isn't enough rcu_barrier() calls, but something entirely else. Call the rcu_barrier() just once as that's enough and the multiple rcu_barrier() calls will not hide the real problem anymore, so we can find it.	2024-08-05 10:24:47 +00:00
Ondřej Surý	8ccfbcfe72	Remove no longer needed OpenSSL shims and checks Since the minimal OpenSSL version is now OpenSSL 1.1.1, remove all kind of OpenSSL shims and checks for functions that are now always present in the OpenSSL libraries. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Aydın Mercan <aydin@isc.org>	2024-08-05 10:23:59 +00:00
Ondřej Surý	37dbd57c16	Fix the assertion failure when putting 48-bit number to buffer When putting the 48-bit number into a fixed-size buffer that's exactly 6 bytes, the assertion failure would occur as the 48-bit number is internally represented as 64-bit number and the code was checking if there is enough space for `sizeof(val)`. This causes assertion failure when otherwise valid TSIG signature has a bad timing information. Specify the size of the argument explicitly, so the 48-bit number doesn't require 8-byte long buffer.	2024-08-05 09:55:18 +02:00
Ondřej Surý	a513d4c07f	Don't skip the counting if fcount_incr() is called with force==true The fcount_incr() was incorrectly skipping the accounting for the fetches-per-zone if the force argument was set to true. We want to skip the accounting only when the fetches-per-zone is completely disabled, but for individual names we need to do the accounting even if we are forcing the result to be success.	2024-08-05 07:33:20 +00:00
Ondřej Surý	827a153d99	Remove superfluous memset() in isc_nmsocket_init() The tlsstream part of the isc_nmsocket_t gets initialized via designater initializer and doesn't need the extra memset() later; just remove it.	2024-08-05 07:32:12 +00:00
Ondřej Surý	cc4f99bc6d	Fix PTHREAD_MUTEX_ADAPTIVE_NP and PTHREAD_MUTEX_ERRORCHECK_NP usage The PTHREAD_MUTEX_ADAPTIVE_NP and PTHREAD_MUTEX_ERRORCHECK_NP are usually not defines, but enum values, so simple preprocessor check doesn't work. Check for PTHREAD_MUTEX_ADAPTIVE_NP from the autoconf AS_COMPILE_IFELSE block and define HAVE_PTHREAD_MUTEX_ADAPTIVE_NP. This should enable adaptive mutex on Linux and FreeBSD. As PTHREAD_MUTEX_ERRORCHECK actually comes from POSIX and Linux glibc does define it when compatibility macros are being set, we can just use PTHREAD_MUTEX_ERRORCHECK instead of PTHREAD_MUTEX_ERRORCHECK_NP.	2024-08-05 07:31:39 +00:00
Ondřej Surý	f158884344	Remove ISC_MUTEX_INITIALIZER It's hard to get it right on different platforms and it's unused in BIND 9 anyway.	2024-08-05 07:31:39 +00:00
Ondřej Surý	b26079fdaf	Don't open route socket if we don't need it When automatic-interface-scan is disabled, the route socket was still being opened. Add new API to connect / disconnect from the route socket only as needed. Additionally, move the block that disables periodic interface rescans to a place where it actually have access to the configuration values. Previously, the values were being checked before the configuration was loaded.	2024-08-05 07:31:02 +00:00
Ondřej Surý	912eaf6cb9	Clarify that cds_wfcq_dequeue_blocking() doesn't block if empty	2024-08-05 07:30:10 +00:00
Mark Andrews	47338c2c87	Remove unnecessary operations Decrementing optlen immediately before calling continue is unneccesary and inconsistent with the rest of dns_message_pseudosectiontoyaml and dns_message_pseudosectiontotext. Coverity was also reporting an impossible false positive overflow of optlen (CID 499061). 4176 } else if (optcode == DNS_OPT_CLIENT_TAG) { 4177 uint16_t id; 4178 ADD_STRING(target, "; CLIENT-TAG:"); 4179 if (optlen == 2U) { 4180 id = isc_buffer_getuint16(&optbuf); 4181 snprintf(buf, sizeof(buf), " %u\n", id); 4182 ADD_STRING(target, buf); CID 499061: (#1 of 1): Overflowed constant (INTEGER_OVERFLOW) overflow_const: Expression optlen, which is equal to 65534, underflows the type that receives it, an unsigned integer 16 bits wide. 4183 optlen -= 2; 4184 POST(optlen); 4185 continue; 4186 } 4187 } else if (optcode == DNS_OPT_SERVER_TAG) {	2024-08-02 03:44:04 +00:00
Aram Sargsyan	5f47c2b567	Allow shorter resolver-query-timeout configuration There are use cases for which shorter timeout values make sense. For example if there is a load balancer which sets RD=1 and forwards queries to a BIND resolver which is then configured to talk to backend servers which are not visible in the public NS set. WIth a shorter timeout value the frontend can give back SERVFAIL early when backends are not available and the ultimate client will not penalize the BIND-frontend for non-response.	2024-08-01 18:30:35 +00:00
Aram Sargsyan	63b8a75de9	Rename dns_zone_forcereload() to dns_zone_forcexfr() The new name describes the function more accurately.	2024-08-01 11:01:17 +00:00
Aram Sargsyan	3d1179501a	Make dns_xfrin_shutdown() safe to run from a different loop If the current loop is different than the zone transfer's loop then run the shutdown operation asynchronously.	2024-08-01 10:43:47 +00:00
Aram Sargsyan	402ca316ae	Implement rndc retransfer -force With this new optional argument if there is an ongoing zone transfer it will be aborted before a new zone transfer is scheduled.	2024-08-01 10:43:47 +00:00
Aram Sargsyan	b156531b29	Do not automatically restart a canceled zone transfer If a zone transfer is canceled there is no need to try the next primary or retry with AXFR.	2024-08-01 10:43:47 +00:00
Mark Andrews	bca63437a1	Add missing period to generated IPv4 6to4 name The period between the most significant nibble of the IPv4 address and the 2.0.0.2.IP6.ARPA suffix was missing resulting in the wrong name being checked.	2024-08-01 15:17:30 +10:00
Mark Andrews	6d1c7beb15	Cleanup old clang-format string splitting	2024-08-01 14:17:57 +10:00
Mark Andrews	f78beca942	Remove false positive qname minimisation error Don't report qname minimisation NXDOMAIN errors when the result is NXDOMAIN.	2024-08-01 14:17:57 +10:00
Mark Andrews	393d7fa78e	Fix yaml output In yaml mode we emit a string for each question and record. Certain names and data could result in invalid yaml being produced. Use single quote string for all questions and records. This requires that single quotes get converted to two quotes within the string.	2024-08-01 12:30:57 +10:00
Mark Andrews	b51c9eb797	Properly reject zero length ALPN in commatxt_fromtext ALPN are defined as 1*255OCTET in RFC 9460. commatxt_fromtext was not rejecting invalid inputs produces by missing a level of escaping which where later caught be dns_rdata_fromwire on reception. These inputs should have been rejected svcb in svcb 1 1.svcb alpn=\,abc svcb1 in svcb 1 1.svcb alpn=a\,\,abc and generated 00 03 61 62 63 and 01 61 00 02 61 62 63 respectively. The correct inputs to include commas in the alpn requires double escaping. svcb in svcb 1 1.svcb alpn=\\,abc svcb1 in svcb 1 1.svcb alpn=a\\,\\,abc and generate 04 2C 61 62 63 and 06 61 2C 2C 61 62 63 respectively.	2024-08-01 10:20:55 +10:00
Aram Sargsyan	cb5238cc62	Replace #define DNS_GETDB_ with struct of bools This makes it easier to pretty-print the attributes in a debugger.	2024-07-31 11:52:52 +00:00
Aram Sargsyan	b621f1d88e	Return SERVFAIL for a too long CNAME chain Due to the maximum query restart limitation a long CNAME chain it is cut after 16 queries but named still returns NOERROR. Return SERVFAIL instead and the partial answer.	2024-07-31 10:54:10 +00:00
Mark Andrews	48d39f7c30	Check that FILE_STREAM(channel) is not already closed isc_log_closefilelogs can also close log files. isc_log_doit failed to check if the file handle was still valid before closing it.	2024-07-31 17:36:38 +10:00
Mark Andrews	e8dbc5db92	Properly compute the physical memory size On a 32 bit machine casting to size_t can still lead to an overflow. Cast to uint64_t. Also detect all possible negative values for pages and pagesize to silence warning about possible negative value. 39#if defined(_SC_PHYS_PAGES) && defined(_SC_PAGESIZE) 1. tainted_data_return: Called function sysconf(_SC_PHYS_PAGES), and a possible return value may be less than zero. 2. assign: Assigning: pages = sysconf(_SC_PHYS_PAGES). 40 long pages = sysconf(_SC_PHYS_PAGES); 41 long pagesize = sysconf(_SC_PAGESIZE); 42 3. Condition pages == -1, taking false branch. 4. Condition pagesize == -1, taking false branch. 43 if (pages == -1 \|\| pagesize == -1) { 44 return (0); 45 } 46 5. overflow: The expression (size_t)pages * pagesize might be negative, but is used in a context that treats it as unsigned. CID 498034: (#1 of 1): Overflowed return value (INTEGER_OVERFLOW) 6. return_overflow: (size_t)pages * pagesize, which might have underflowed, is returned from the function. 47 return ((size_t)pages * pagesize); 48#endif /* if defined(_SC_PHYS_PAGES) && defined(_SC_PAGESIZE) */	2024-07-31 05:55:30 +00:00
Mark Andrews	53a5f50e9d	Do not update find.result_v4 and find.result_v6 These values are supposed to be static for the life of the find and clean_finds_at_name was updating them resulting in TSAN error reports. WARNING: ThreadSanitizer: data race Write of size 4 at 0x000000000001 by thread T1 (mutexes: write M1, write M2): #0 clean_finds_at_name lib/dns/adb.c:1537 #1 fetch_callback lib/dns/adb.c:4009 #2 task_run lib/isc/task.c:815 #3 isc_task_run lib/isc/task.c:896 #4 isc__nm_async_task netmgr/netmgr.c:848 #5 process_netievent netmgr/netmgr.c:920 #6 process_queue netmgr/netmgr.c:1013 #7 process_all_queues netmgr/netmgr.c:767 #8 async_cb netmgr/netmgr.c:796 #9 uv__async_io /usr/src/libuv-v1.44.1/src/unix/async.c:163 #10 isc__trampoline_run lib/isc/trampoline.c:189 Previous read of size 4 at 0x000000000001 by thread T2: #0 findname lib/dns/resolver.c:3749 #1 fctx_getaddresses lib/dns/resolver.c:3993 #2 fctx_try lib/dns/resolver.c:4390 #3 rctx_nextserver lib/dns/resolver.c:10356 #4 rctx_done lib/dns/resolver.c:10503 #5 resquery_response lib/dns/resolver.c:8511 #6 udp_recv lib/dns/dispatch.c:638 #7 isc__nm_async_readcb netmgr/netmgr.c:2885 #8 isc__nm_readcb netmgr/netmgr.c:2858 #9 udp_recv_cb netmgr/udp.c:650 #10 isc__nm_udp_read_cb netmgr/udp.c:1057 #11 uv__udp_recvmsg /usr/src/libuv-v1.44.1/src/unix/udp.c:303 #12 isc__trampoline_run lib/isc/trampoline.c:189	2024-07-31 14:46:45 +10:00
Mark Andrews	14a76ae498	Log key calculation overflows	2024-07-30 10:58:54 +02:00
Mark Andrews	25845a866e	Check for overflow when adding lifetime	2024-07-30 10:58:54 +02:00
Matthijs Mekking	129973ebb0	No longer update key lifetime if key is retired The key lifetime should no longer be adjusted if the key is being retired earlier, for example because a manual rollover was started. This would falsely be seen as a dnssec-policy lifetime reconfiguration, and would adjust the retire/removed time again. This also means we should update the status output, and the next rollover scheduled is now calculated using (retire-active) instead of key lifetime.	2024-07-30 10:57:14 +02:00
Matthijs Mekking	1cec0b0448	Update key lifetime and metadata after reconfig If dnssec-policy is reconfigured and the key lifetime has changed, update existing keys with the new lifetime and adjust the retire and removed timing metadata accordingly. If the key has no lifetime yet, just initialize the lifetime. It may be that the retire/removed timing metadata has already been set. Skip keys which goal is not set to omnipresent. These keys are already in the progress of retiring, or still unused.	2024-07-30 10:57:14 +02:00
Artem Boldariev	5781ff3a93	Drop expired but not accepted TCP connections This commit ensures that we are not attempting to accept an expired TCP connection as we are not interested in any data that could have been accumulated in its internal buffers. Now we just drop them for good.	2024-07-03 15:03:02 +03:00
Ondřej Surý	bf9fd2a6ff	Reset the TCP connection on a failed send When sending fails, the ns__client_request() would not reset the connection and continue as nothing is happening. This comes from the model that we don't care about failed UDP sends because datagrams are unreliable anyway, but it greatly affects TCP connections with keep-alive. The worst case scenario is as follows: 1. the 3-way TCP handshake gets completed 2. the libuv calls the "uv_connection_cb" callback 3. the TCP connection gets queue because of the tcp-clients quota 4. the TCP client sends as many DNS messages as the buffers allow 5. the TCP connection gets dropped by the client due to the timeout 6. the TCP connection gets accepted by the server 7. the data already sent by the client gets read 8. all sending fails immediately because the TCP connection is dead 9. we consume all the data in the buffer in a very tight loop As it doesn't make sense to trying to process more data on the TCP connection when the sending is failing, drop the connection immediately on the first sending error.	2024-07-03 09:07:20 +02:00
Ondřej Surý	1c0564d715	Remove ns_query_init() cannot fail, remove the error paths As ns_query_init() cannot fail now, remove the error paths, especially in ns__client_setup() where we now don't have to care what to do with the connection if setting up the client could fail. It couldn't fail even before, but now it's formal.	2024-07-03 09:05:51 +02:00
Ondřej Surý	bc3e713317	Throttle the reading when writes are asynchronous Be more aggressive when throttling the reading - when we can't send the outgoing TCP synchronously with uv_try_write(), we start throttling the reading immediately instead of waiting for the send buffers to fill up. This should not affect behaved clients that read the data from the TCP on the other end.	2024-07-03 08:45:39 +02:00
Ondřej Surý	57cd34441a	Be smarter about refusing to add many RR types to the database Instead of outright refusing to add new RR types to the cache, be a bit smarter: 1. If the new header type is in our priority list, we always add either positive or negative entry at the beginning of the list. 2. If the new header type is negative entry, and we are over the limit, we mark it as ancient immediately, so it gets evicted from the cache as soon as possible. 3. Otherwise add the new header after the priority headers (or at the head of the list). 4. If we are over the limit, evict the last entry on the normal header list.	2024-07-01 12:48:51 +02:00
Ondřej Surý	b27c6bcce8	Expand the list of the priority types and move it to db_p.h Add HTTPS, SVCB, SRV, PTR, NAPTR, DNSKEY and TXT records to the list of the priority types that are put at the beginning of the slabheader list for faster access and to avoid eviction when there are more types than the max-types-per-name limit.	2024-07-01 12:47:30 +02:00
Artem Boldariev	55b1a093ea	Do not un-throttle TCP connections on isc_nm_read() Due to omission it was possible to un-throttle a TCP connection previously throttled due to the peer not reading back data we are sending. In particular, that affected DoH code, but it could also affect other transports (the current or future ones) that pause/resume reading according to its internal state.	2024-06-12 13:44:37 +03:00
Mark Andrews	e52c2a654b	Clear qctx->zversion Clear qctx->zversion when clearing qctx->zrdataset et al in lib/ns/query.c:qctx_freedata. The uncleared pointer could lead to an assertion failure if zone data needed to be re-saved which could happen with stale data support enabled.	2024-06-10 17:45:38 +02:00
Petr Špaček	9370acd3a7	Require local KEYs for SIG(0) verification This is additional hardening. There is no known use-case for KEY RRs from DNS cache and it potentially allows attackers to put weird keys into cache.	2024-06-10 17:36:45 +02:00
Aram Sargsyan	d69fab1530	Mark SIG(0) quota settings as experimantal A different solution in the future might be adopted depending on feedback and other new information, so it makes sense to mark these options as EXPERIMENTAL until we have more data.	2024-06-10 17:36:45 +02:00
Aram Sargsyan	54ddd848fe	Avoid running get_matching_view() asynchronously on an error path Also create a new ns_client_async_reset() static function to decrease code duplication.	2024-06-10 17:35:40 +02:00
Aram Sargsyan	7ca9bd6014	Limit the number of keys for SIG(0) message verification Check at most two KEY RRs agains a SIG(0) signature. This should limit potential abuse and at the same time allow key rollover.	2024-06-10 17:33:11 +02:00
Aram Sargsyan	70ff4a3f85	Run resolver message signature checking asynchronously	2024-06-10 17:33:11 +02:00
Aram Sargsyan	ad489c44df	Remove sig0checks-quota-maxwait-ms support Waiting for a quota to appear complicates things and wastes rosources on timer management. Just answer with REFUSE if there is no quota.	2024-06-10 17:33:11 +02:00
Aram Sargsyan	f0cde05e06	Implement asynchronous view matching for SIG(0)-signed queries View matching on an incoming query checks the query's signature, which can be a CPU-heavy task for a SIG(0)-signed message. Implement an asynchronous mode of the view matching function which uses the offloaded signature checking facilities, and use it for the incoming queries.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	710bf9b938	Implement asynchronous message signature verification Add support for using the offload threadpool to perform message signature verifications. This should allow check SIG(0)-signed messages without affecting the worker threads.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	7f013ad05d	Remove dns_message_rechecksig() This is a tiny helper function which is used only once and can be replaced with two function calls instead. Removing this makes supporting asynchronous signature checking less complicated.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	c7f79a0353	Add a quota for SIG(0) signature checks In order to protect from a malicious DNS client that sends many queries with a SIG(0)-signed message, add a quota of simultaneously running SIG(0) checks. This protection can only help when named is using more than one worker threads. For example, if named is running with the '-n 4' option, and 'sig0checks-quota 2;' is used, then named will make sure to not use more than 2 workers for the SIG(0) signature checks in parallel, thus leaving the other workers to serve the remaining clients which do not use SIG(0)-signed messages. That limitation is going to change when SIG(0) signature checks are offloaded to "slow" threads in a future commit. The 'sig0checks-quota-exempt' ACL option can be used to exempt certain clients from the quota requirements using their IP or network addresses. The 'sig0checks-quota-maxwait-ms' option is used to define a maximum amount of time for named to wait for a quota to appear. If during that time no new quota becomes available, named will answer to the client with DNS_R_REFUSED.	2024-06-10 17:33:08 +02:00
Matthijs Mekking	c1ac8b6ad0	Log rekey failure as error if too many records By default we log a rekey failure on debug level. We should probably change the log level to error. We make an exception for when the zone is not loaded yet, it often happens at startup that a rekey is run before the zone is fully loaded.	2024-06-10 16:55:12 +02:00
Matthijs Mekking	82635e56d8	Log error when update fails The new "too many records" error can make an update fail without the error being logged. This commit fixes that.	2024-06-10 16:55:12 +02:00
Evan Hunt	7dd6b47ace	fix a memory leak that could occur when signing when signatures were not added because of too many types already existing at a node, the diff was not being cleaned up; this led to a memory leak being reported at shutdown.	2024-06-10 16:55:12 +02:00
Ondřej Surý	52b3d86ef0	Add a limit to the number of RR types for single name Previously, the number of RR types for a single owner name was limited only by the maximum number of the types (64k). As the data structure that holds the RR types for the database node is just a linked list, and there are places where we just walk through the whole list (again and again), adding a large number of RR types for a single owner named with would slow down processing of such name (database node). Add a configurable limit to cap the number of the RR types for a single owner. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-types-per-name configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:09 +02:00
Ondřej Surý	32af7299eb	Add a limit to the number of RRs in RRSets Previously, the number of RRs in the RRSets were internally unlimited. As the data structure that holds the RRs is just a linked list, and there are places where we just walk through all of the RRs, adding an RRSet with huge number of RRs inside would slow down processing of said RRSets. Add a configurable limit to cap the number of the RRs in a single RRSet. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-records-per-type configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:07 +02:00
Ondřej Surý	e28266bfbc	Remove the extra memory context with own arena for sending The changes in this MR prevent the memory used for sending the outgoing TCP requests to spike so much. That strictly remove the extra need for own memory context, and thus since we generally prefer simplicity, remove the extra memory context with own jemalloc arenas just for the outgoing send buffers.	2024-06-10 16:48:54 +02:00
Ondřej Surý	4c2ac25a95	Limit the number of DNS message processed from a single TCP read The single TCP read can create as much as 64k divided by the minimum size of the DNS message. This can clog the processing thread and trash the memory allocator because we need to do as much as ~20k allocations in a single UV loop tick. Limit the number of the DNS messages processed in a single UV loop tick to just single DNS message and limit the number of the outstanding DNS messages back to 23. This effectively limits the number of pipelined DNS messages to that number (this is the limit we already had before).	2024-06-10 16:48:54 +02:00
Ondřej Surý	452a2e6348	Replace the tcp_buffers memory pool with static per-loop buffer As a single thread can process only one TCP send at the time, we don't really need a memory pool for the TCP buffers, but it's enough to have a single per-loop (client manager) static buffer that's being used to assemble the DNS message and then it gets copied into own sending buffer. In the future, this should get optimized by exposing the uv_try API from the network manager, and first try to send the message directly and allocate the sending buffer only if we need to send the data asynchronously.	2024-06-10 16:48:53 +02:00
Aram Sargsyan	982eab7de0	ns_client: reuse TCP send buffers Constantly allocating, reallocating and deallocating 64K TCP send buffers by 'ns_client' instances takes too much CPU time. There is an existing mechanism to reuse the ns_clent_t structure associated with the handle using 'isc_nmhandle_getdata/_setdata' (see ns_client_request()), but it doesn't work with TCP, because every time ns_client_request() is called it gets a new handle even for the same TCP connection, see the comments in streamdns_on_complete_dnsmessage(). To solve the problem, we introduce an array of available (unused) TCP buffers stored in ns_clientmgr_t structure so that a 'client' working via TCP can have a chance to reuse one (if there is one) instead of allocating a new one every time.	2024-06-10 16:48:53 +02:00
Ondřej Surý	4e7c4af17f	Throttle reading from TCP if the sends are not getting through When TCP client would not read the DNS message sent to them, the TCP sends inside named would accumulate and cause degradation of the service. Throttle the reading from the TCP socket when we accumulate enough DNS data to be sent. Currently this is limited in a way that a single largest possible DNS message can fit into the buffer.	2024-06-10 16:48:52 +02:00
Artem Boldariev	d80dfbf745	Keep the endpoints set reference within an HTTP/2 socket This commit ensures that an HTTP endpoints set reference is stored in a socket object associated with an HTTP/2 stream instead of referencing the global set stored inside a listener. This helps to prevent an issue like follows: 1. BIND is configured to serve DoH clients; 2. A client is connected and one or more HTTP/2 stream is created. Internal pointers are now pointing to the data on the associated HTTP endpoints set; 3. BIND is reconfigured - the new endpoints set object is created and promoted to all listeners; 4. The old pointers to the HTTP endpoints set data are now invalid. Instead referencing a global object that is updated on re-configurations we now store a local reference which prevents the endpoints set objects to go out of scope prematurely.	2024-06-10 16:40:12 +02:00
Artem Boldariev	c41fb499b9	DoH: avoid potential use after free for HTTP/2 session objects It was reported that HTTP/2 session might get closed or even deleted before all async. processing has been completed. This commit addresses that: now we are avoiding using the object when we do not need it or specifically check if the pointers used are not 'NULL' and by ensuring that there is at least one reference to the session object while we are doing incoming data processing. This commit makes the code more resilient to such issues in the future.	2024-06-10 16:40:10 +02:00
Ondřej Surý	086b63f56d	Use isc_queue to implement wait-free deadnodes queue Replace the ISC_LIST based deadnodes implementation with isc_queue which is wait-free and we don't have to acquire neither the tree nor node lock to append nodes to the queue and the cleaning process can also copy (splice) the list into a local copy without acquiring the list. Currently, there's little benefit to this as we need to hold those locks anyway, but in the future as we move to RCU based implementation, this will be ready. To align the cleaning with our event loop based model, remove the hardcoded count for the node locks and use the number of the event loops instead. This way, each event loop can have its own cleaning as part of the process. Use uniform random numbers to spread the nodes evenly between the buckets (instead of hashing the domain name).	2024-06-05 09:19:56 +02:00
Ondřej Surý	a9b4d42346	Add isc_queue implementation on top of cds_wfcq Add an isc_queue implementation that hides the gory details of cds_wfcq into more neat API. The same caveats as with cds_wfcq. TODO: Add documentation to the API.	2024-06-05 09:19:56 +02:00
Mark Andrews	56c3dcc5d7	Update resquery_senddone handling of ISC_R_TIMEDOUT Treat timed out as an address specific error.	2024-06-04 00:15:48 +10:00
Mark Andrews	4e3dd85b8d	Update resquery_senddone handling of ISC_R_CONNECTIONRESET Treat connection reset as an address specific error.	2024-06-04 00:15:48 +10:00
Mark Andrews	180b1e7939	Handle ISC_R_HOSTDOWN and ISC_R_NETDOWN in resolver.c These error codes should be treated like other unreachable error codes.	2024-06-04 00:15:48 +10:00
Mark Andrews	05472e63e8	Don't do DS checks over disabled address families	2024-06-03 18:34:31 +10:00
Mark Andrews	d026dbe536	Don't forward UPDATE messages over disabled address families	2024-06-03 18:34:31 +10:00
Mark Andrews	5d99625515	Don't send NOTIFY over disabled address families	2024-06-03 18:34:31 +10:00
Mark Andrews	2cd4303249	Report non-effective primaries When named is started with -4 or -6 and the primaries for a zone do not have an IPv4 or IPv6 address respectively issue a log message.	2024-06-03 18:34:31 +10:00
Mark Andrews	ecdde04e63	Zone transfers should honour -4 and -6 options Check if the address family has been disabled when transferring zones.	2024-06-03 18:34:31 +10:00
Mark Andrews	9be1873ef3	Add helper function isc_sockaddr_disabled	2024-06-03 18:34:31 +10:00
Matthijs Mekking	c40e5c8653	Call reset_shutdown if uv_tcp_close_reset failed If uv_tcp_close_reset() returns an error code, this means the reset_shutdown callback has not been issued, so do it now.	2024-06-03 10:14:47 +02:00
Matthijs Mekking	5b94bb2129	Do not runtime check uv_tcp_close_reset When we reset a TCP connection by sending a RST packet, do not bother requiring the result is a success code.	2024-06-03 10:14:47 +02:00
Mark Andrews	87e3b9dbf3	Pass a memory context in to dns_cache_create	2024-05-31 15:40:32 +10:00
Mark Andrews	5e77edd074	Use a new memory context when flushing the cache When the cache's memory context was in over memory state when the cache was flushed it resulted in LRU cleaning removing newly entered data in the new cache straight away until the old cache had been destroyed enough to take it out of over memory state. When flushing the cache create a new memory context for the new db to prevent this.	2024-05-31 15:40:32 +10:00
Ondřej Surý	3310cac2b0	Create the new database for AXFR from the dns_zone API The `axfr_makedb()` didn't set the loop on the newly created database, effectively killing delayed cleaning on such database. Move the database creation into dns_zone API that knows all the gory details of creating new database suitable for the zone.	2024-05-29 08:30:19 +02:00
Aram Sargsyan	4d3c31b928	fixup! Merge branch 'ondrej/light-cleanup-of-rdataslab' into 'main'	2024-05-25 11:47:33 +02:00
Ondřej Surý	3feabc8a22	Cleanup the dns_cache unit Remove duplicate code and use ISC_REFCOUNT_{DECL,IMPL} macros.	2024-05-25 11:47:33 +02:00
Ondřej Surý	03ed19cf71	Refactor the common buffer manipulation in rdataslab.c in macros The rdataslab.c was full of code like this: length = raw[0] * 256 + raw[1]; and count2 = current2++ 256; count2 += *current2++; Refactor code like this into peek_uint16() and get_uint16 macros to prevent code repetition and possible mistakes when copy and pasting the same code over and over. As a side note for an entertainment of a careful reader of the commit messages: The byte manipulation was changed from multiplication and addition to shift with or. The difference in the assembly looks like this: MUL and ADD: movzx eax, BYTE PTR [rdi] movzx edi, BYTE PTR [rdi+1] sal eax, 8 or edi, eax SHIFT and OR: movzx edi, WORD PTR [rdi] rol di, 8 movzx edi, di If the result and/or buffer is then being used after the macro call, there's more differences in favor of the SHIFT+OR solution.	2024-05-24 09:52:45 +02:00
Aydın Mercan	03a59cbb04	reinsert accidentally removed + in db trace It only affects development when using `DNS_DB_TRACE`.	2024-05-17 18:11:23 -07:00
Aydın Mercan	49e62ee186	fix typing mistakes in trace macros The detach function declaration in `ISC__REFCOUNT_TRACE_DECL` had an returned an accidental implicit int. While not allowed since C99, it became an error by default in GCC 14. `ISC_REFCOUNT_TRACE_IMPL` and `ISC_REFCOUNT_STATIC_TRACE_IMPL` expanded into the wrong macros, trying to declare it again with the wrong number of parameters.	2024-05-17 18:11:23 -07:00
Mark Andrews	b7de2c7cb9	Clang-format header file changes	2024-05-17 16:03:21 -07:00
Mark Andrews	6e9ed4983e	add test cases for several FORMERR code paths: - duplicated question - duplicated answer - qtype as an answer - two question types - question names - nsec3 bad owner name - short record - short question - mismatching question class - bad record owner name - mismatched class in record - mismatched KEY class - OPT wrong owner name - invalid RRSIG "covers" type - UPDATE malformed delete type - TSIG wrong class - TSIG not the last record	2024-05-17 13:39:22 +10:00
Evan Hunt	9c882f1e69	replace qpzone node attriutes with atomics there were TSAN error reports because of conflicting uses of node->dirty and node->nsec, which were in the same qword. this could be resolved by separating them, but we could also make them into atomic values and remove some node locking.	2024-05-17 00:33:35 +00:00
Matthijs Mekking	f882101265	Rewrite qp fix_iterator() The fix_iterator() function had a lot of bugs in it and while fixing them, the number of corner cases and the complexity of the function got out of hand. Rewrite the function with the following modifications: The function now requires that the iterator is pointing to a leaf node. This removes the cases we have to deal when the iterator was left on a dead branch. From the leaf node, pop up the iterator stack until we encounter the branch where the offset point is before the point where the search key differs. This will bring us to the right branch, or at the first unmatched node, in which case we pop up to the parent branch. From there it is easier to retrieve the predecessor. Once we are at the right branch, all we have to do is find the right twig (which is either the twig for the character at the position where the search key differs, or the previous twig) and walk down from there to the greatest leaf or, in case there is no good twig, get the previous twig from the successor and get the greatest leaf from there. If there is no previous twig to select in this branch, because every leaf from this branch node is greater than the one we wanted, we need to pop up the stack again and resume at the parent branch. This is achieved by calling prevleaf().	2024-05-16 09:49:41 +00:00
Matthijs Mekking	8b8c16d7a4	Get anyleaf when qp lookup is on a dead end branch Move the fix_iterator out of the loop and only call it when we found a leaf node. This leaf node may be the wrong leaf node, but fix_iterator should correct that. Also, when we don't need to set the iterator, just get any leaf. We only need to have a leaf for the qpkey_compare and the end result does not matter if compare was against an ancestor leaf or any leaf below that point.	2024-05-16 09:49:41 +00:00
Mark Andrews	ec3c624814	Properly build the NSEC/NSEC3 type bit map DNSKEY was incorrectly being added to the NESC/NSEC3 type bit map when it was obscured by the delegation. This lead to zone verification failures.	2024-05-16 10:27:49 +10:00
Mark Andrews	e84615629f	Properly update 'maxtype' 'maxtype' should be checked to see if it should be updated whenever a type is added to the type map.	2024-05-16 10:20:49 +10:00
Ondřej Surý	eb862ce509	Properly attach/detach isc_httpd in case read ends earlier than send An assertion failure would be triggered when sending the TCP data ends after the TCP reading gets closed. Implement proper reference counting for the isc_httpd object.	2024-05-15 12:22:10 +02:00
Evan Hunt	b6815de316	Fix QP chain on partial match When searching for a requested name in dns_qp_lookup(), we may add a leaf node to the QP chain, then subsequently determine that the branch we were on was a dead end. When that happens, the chain can be left holding a pointer to a node that is not an ancestor of the requested name. We correct for this by unwinding any chain links with an offset value greater or equal to that of the node we found.	2024-05-14 12:58:46 -07:00
Matthijs Mekking	91de4f6490	Refactor fix_iterator The code below the if/else construction could only be run if the 'if' code path was taken. Move the code into the 'if' code block so that it is more easier to read.	2024-05-14 12:58:46 -07:00
Aydın Mercan	e037520b92	Keep track of the recursive clients highwater The high-water allows administrators to better tune the recursive clients limit without having to to poll the statistics channel in high rates to get this number.	2024-05-10 12:08:52 +03:00
Aydın Mercan	09e4fb2ffa	Return the old counter value in `isc_stats_increment` Returning the value allows for better high-water tracking without running into edge cases like the following: 0. The counter is at value X 1. Increment the value (X+1) 2. The value is decreased multiple times in another threads (X+1-Y) 3. Get the value (X+1-Y) 4. Update-if-greater misses the X+1 value which should have been the high-water	2024-05-10 12:08:52 +03:00
Mark Andrews	88c48dde5e	Stop processing catalog zone changes when shutting down Abandon catz_addmodzone_cb and catz_delzone_cb processing if the loop is shutting down.	2024-05-09 08:17:44 +10:00
Mark Andrews	307e3ed9a6	catzs->view should maintain a view reference Use dns_view_weakattach and dns_view_weakdetach to maintain a reference to the view referenced through catzs->view.	2024-05-09 08:17:44 +10:00
Mark Andrews	799046929c	Only check SVBC alias forms at higher levels Allow SVBC (HTTPS) alias form with parameters to be accepted from the wire and when transfered. This is for possible future extensions.	2024-05-07 11:20:49 +10:00
Mark Andrews	efd27bb82d	Remove infinite loop on ISC_R_NOFILE When parsing a zonefile named-checkzone (and others) could loop infinitely if a directory was $INCLUDED. Record the error and treat as EOF when looking for multiple errors. This was found by Eric Sesterhenn from X41.	2024-05-07 10:01:12 +10:00
Mark Andrews	371824f078	Address infinite loop when processing $GENERATE In nibble mode if the value to be converted was negative the parser would loop forever. Process the value as an unsigned int instead of as an int to prevent sign extension when shifting. This was found by Eric Sesterhenn from X41.	2024-05-07 09:19:43 +10:00
Matthijs Mekking	5d7e613e81	RPZ response's SOA record is incorrectly set to 1 An RPZ response's SOA record TTL is set to 1 instead of the SOA TTL, a boolean value is passed on to query_addsoa, which is supposed to be a TTL value. I don't see what value is appropriate to be used for overriding, so we will pass UINT32_MAX.	2024-05-06 11:38:36 +02:00
Aram Sargsyan	8052848d50	Fix a bug in expireheader() call arguments order The expireheader() call in the expire_ttl_headers() function is erroneous as it passes the 'nlocktypep' and 'tlocktypep' arguments in a wrong order, which then causes an assertion failure. Fix the order of the arguments so it corresponds to the function's prototype.	2024-05-02 08:38:35 +00:00
Evan Hunt	f81bf6bafd	handle QP lookups involving escaped characters better in QP keys, characters that are not common in DNS names are encoded as two-octet sequences. this caused a glitch in iterator positioning when some lookups failed. consider the case where we're searching for "\009" (represented in a QP key as {0x03, 0x0c}) and a branch exists for "\000" (represented as {0x03, 0x03}). we match on the 0x03, and continue to search down. at the point where we find we have no match, we need to pop back up to the branch before the 0x03 - which may be multiple levels up the stack - before we position the iterator.	2024-05-01 00:36:51 -07:00
Evan Hunt	4b02246130	fix more ambiguous struct names there were some structure names used in qpcache.c and qpzone.c that were too similar to each other and could be confusing when debugging. they have been changed as follows: in qcache.c: - changed_t was unused, and has been removed - search_t -> qpc_search_t - qpdb_rdatasetiter_t -> qpc_rditer_t - qpdb_dbiterator_t -> qpc_dbiter_t in qpzone.c: - qpdb_changed_t -> qpz_changed_t - qpdb_changedlist_t -> qpz_changedlist_t - qpdb_version_t -> qpz_version_t - qpdb_versionlist_t -> qpz_versionlist_t - qpdb_search_t -> qpz_search_t - qpdb_load_t -> qpz_search_t	2024-04-30 12:50:01 -07:00
Evan Hunt	e300dfce46	use dns_qp_getname() where possible some calls to dns_qp_lookup() do not need partial matches, QP chains or QP iterators. in these cases it's more efficient to use dns_qp_getname().	2024-04-30 12:50:01 -07:00
Evan Hunt	2789e58473	get foundname from the node when calling dns_qp_lookup() from qpcache, instead of passing 'foundname' so that a name would be constructed from the QP key, we now just use the name field in the node data. this makes dns_qp_lookup() run faster. the same optimization has also been added to qpzone. the documentation for dns_qp_lookup() has been updated to discuss this performance consideration.	2024-04-30 12:50:01 -07:00
Evan Hunt	04d319afe4	include the nodenames when calculating memory to purge when the cache is over memory, we purge from the LRU list until we've freed the approximate amount of memory to be added. this approximation could fail because the memory allocated for nodenames wasn't being counted. add a dns_name_size() function so we can look up the size of nodenames, then add that to the purgesize calculation.	2024-04-30 12:50:01 -07:00
Evan Hunt	a8bda6ff1e	simplify qpcache iterators in a cache database, unlike zones, NSEC3 records are stored in the main tree. it is not necessary to maintain a separate 'nsec3' tree, nor to have code in the dbiterator implementation to traverse from one tree to another. (if we ever implement synth-from-dnssec using NSEC3 records, we'll need to revert this change. in the meantime, simpler code is better.)	2024-04-30 12:50:01 -07:00
Evan Hunt	7ff43befb7	clean up unnecessary dbiterator code related to origin the QP database doesn't support relative names as the RBTDB did, so there's no need for a 'new_origin' flag or to handle `DNS_R_NEWORIGIN` result codes.	2024-04-30 12:42:32 -07:00
Evan Hunt	85ab92b6e0	more cleanups in qpcache.c - remove unneeded struct members and misleading comments. - remove unused parameters for static functions. - rename 'find_callback' to 'delegating' for consistency with qpzone; the find callback mechanism is not used in QP databases.	2024-04-30 12:42:31 -07:00
Evan Hunt	3acab71d46	rename QPDB_HEADERNODE to HEADERNODE this makes the macro consistent between qpcache.c and qpzone.c. also removed a redundant definition of HEADERNODE in qpzone.c.	2024-04-30 12:42:31 -07:00
Evan Hunt	46d40b3dca	fix structure names in qpcache.c and qpzone.c - change dns_qpdata_t to qpcnode_t (QP cache node), and dns_qpdb_t to qpcache_t, as these types are only accessed locally. - also change qpdata_t in qpzone.c to qpznode_t (QP zone node), for consistency. - make the refcount declarations for qpcnode_t and qpznode_t static, using the new ISC_REFCOUNT_STATIC macros.	2024-04-30 12:42:07 -07:00
Evan Hunt	20d32512ca	clean up unnecessary requirements in qpcache.c qpcache can only support cache semantics now, so there's no longer any need to check for that internally.	2024-04-30 12:31:48 -07:00
Evan Hunt	a5d0e6c4ba	add static macros for ISC_REFCOUNT_DECL/IMPL this commit adds a mechanism to statically declare attach/detach and ref/unref methods, for objects that are only accessed within a single C file.	2024-04-30 12:31:48 -07:00
Ondřej Surý	c13a1d8b01	Improve the reference counting checks in newref() In qpcache (and rbtdb), there are some functions that acquire neither the tree lock nor the node lock when calling newref(). In theory, this could lead to a race in which a new reference is added to a node that was about to be deleted. We now detect this condition by passing the current tree and node lock status to newref(). If the node was previously unreferenced and we don't hold at least one read lock, we will assert.	2024-04-30 08:41:56 +02:00
Aydın Mercan	f30008a71c	Provide an early escape hatch for ns_client_transport_type Because some tests don't have a legtimate handle, provide a temporary return early that should be fixed and removed before squashing. This short circuiting is still correct until DoQ/DoH3 support is introduced.	2024-04-26 16:12:29 +03:00
Aydın Mercan	b5478654a2	Add fallback to ns_client_get_type despite unreachable GCC might fail to compile because it expects a return after UNREACHABLE. It should ideally just work anyway since UNREACHABLE is either a noreturn or UB (__builtin_unreachable / C23 unreachable). Either way, it should be optimized almost always so the fallback is free or basically free anyway when it isn't optimized out.	2024-04-26 16:12:29 +03:00
Aydın Mercan	4a3f7fe1ef	Emit and read correct DoT and DoH dnstap entries Other protocols still pretend to be TCP/UDP. This only causes a difference when using dnstap-read on a file with DoQ or DNSCrypt entries	2024-04-26 16:12:29 +03:00
Aydın Mercan	9d1a8a98c6	Update the dnstap protobuf definition The new definition includes the missing protocol definitions and specifies the protobuf version.	2024-04-26 16:08:46 +03:00
Ondřej Surý	6c54337f52	avoid a race in the qpzone getsigningtime() implementation the previous commit introduced a possible race in getsigningtime() where the rdataset header could change between being found on the heap and being bound. getsigningtime() now looks at the first element of the heap, gathers the locknum, locks the respective lock, and retrieves the header from the heap again. If the locknum has changed, it will rinse and repeat. Theoretically, this could spin forever, but practically, it almost never will as the heap changes on the zone are very rare. we simplify matters further by changing the dns_db_getsigningtime() API call. instead of passing back a bound rdataset, we pass back the information the caller actually needed: the resigning time, owner name and type of the rdataset that was first on the heap.	2024-04-25 15:48:43 -07:00
Evan Hunt	7e6be9f1b5	simplify qpzone database by using only one heap for resigning in RBTDB, the heap was used by zone databases for resigning, and by the cache for TTL-based cache cleaning. the cache use case required very frequent updates, so there was a separate heap for each of the node lock buckets. qpzone is for zones only, so it doesn't need to support the cache use case; the heap will only be touched when the zone is updated or incrementally signed. we can simplify the code by using only a single heap.	2024-04-25 15:41:39 -07:00
Evan Hunt	237123e500	simplify code by removing return values where possible fix_iterator() and related functions are quite difficult to read. perhaps it would be a little clearer if we didn't assign values to variables that won't subsequently be used, or unnecessarily pop the stack and then push the same value back onto it. also, in dns_qp_lookup() we previously called fix_iterator(), removed the leaf from the top of the iterator stack, and then added it back on. this would be clearer if we just push the leaf onto the stack when we need to, but leave the stack alone when it's already complete.	2024-04-25 10:29:07 -07:00
Evan Hunt	66dbff596b	clean up fix_iterator() arguments the value passed as 'start' was redundant; it's always the same as the current top of the iterator stack.	2024-04-25 10:29:07 -07:00
Evan Hunt	2dff926624	yet another fix_iterator() bug under some circumstances it was possible for the iterator to be set to the first leaf in a set of twigs, when it should have been set to the last. a unit test has been added to test this scenario. if there is a a tree containing the following values: {".", "abb.", "abc."}, and we query for "acb.", previously the iterator would be positioned at "abb." instead of "abc.". the tree structure is: branch (offset 1, ".") branch (offset 3, ".ab") leaf (".abb") leaf (".abc") we find the branch with offset 3 (indicating that its twigs differ from each other in the third position of the label, "abB" vs "abC"). but the search key differs from the found keys at position 2 ("aC" vs "aB"). we look up the bit value in position 3 of the search key ("B"), and incorrectly follow it onto the wrong twig ("abB"). to correct for this, we need to check for the case where the search key is greater than the found key in a position earlier than the branch offset. if it is, then we need to pop from the current leaf to its parent, and get the greatest leaf from there. a further change is needed to ensure that we don't do this twice; when we've moved to a new leaf and the point of difference between it and the search key even earlier than before, then we're definitely at a predecessor node and there's no need to continue the loop.	2024-04-25 10:29:07 -07:00
Michal Nowak	f454fa6dea	Update sources to Clang 18 formatting	2024-04-23 13:11:52 +02:00
Ondřej Surý	141e4c9805	Change the ADB_ENTRY_WINDOW to 60 seconds The previous value of 30 minutes used to cache the ADB names and entries was quite long. Change the value to 60 seconds for faster recovery after cached intermittent failure of the remote nameservers.	2024-04-22 10:36:36 +02:00
Ondřej Surý	6708da3112	Unify the expiration time handling for all ADB expiration The algorithm from the previous commit[1] is now used to calculate all the expiration values through the code (ncache results, cname/dname targets). 1. ISC_MIN(cur, ISC_MAX(now + ADB_ENTRY_WINDOW, now + rdataset->ttl))	2024-04-22 10:36:36 +02:00
Ondřej Surý	53cc00ee3f	Fix the expire_v4 and expire_v6 logic Correct the logic to set the expiration period of expire_{v4,v6} as follows: 1. If the trust is ultimate (local entry), immediately set the entry as expired, so the changes to the local zones have immediate effect. 3. If the expiration is already set and smaller than the new value, then leave the expiration value as it is. 2. Otherwise pick larger of `now + ADB_ENTRY_WINDOW` and `now + TTL` as the new expiration value.	2024-04-22 10:36:36 +02:00
Ondřej Surý	932665410d	Always set ADB entry expiration to now + ADB_ENTRY_WINDOW When ADB entry was created it was set to never expire. If we never called any of the functions that adjust the expiration, it could linger in the ADB forever. Set the expiration (.expires) to now + ADB_ENTRY_WINDOW when creating the new ADB entry to ensure the ADB entry will always expire.	2024-04-22 10:36:36 +02:00
Mark Andrews	26375bdcf2	Break out of the switch if we have already reached the quota This prevents consume_validation_fail being called and causing an INSIST.	2024-04-22 12:32:36 +10:00
Matthijs Mekking	a3915e535a	Move kasp key match function to kasp header The dnssec-ksr tool needs to check if existing key files match lines in the keys section of a dnssec-policy, so make this function publicly available.	2024-04-19 10:41:04 +02:00
Dominik Thalhammer	24ae1157e8	Rework isccc_ccmsg to support multiple messages per tcp read Previously, only a single controlconf message would be processed from a single TCP read even if the TCP read buffer contained multiple messages. Refactor the isccc_ccmsg unit to store the extra buffer in the internal buffer and use the already read data first before reading from the network again. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Dominik Thalhammer <dominik@thalhammer.it>	2024-04-18 20:08:44 +02:00
Ondřej Surý	3b9ea189b2	Don't count expired / future RRSIG against quota These don't trigger a public key verification unless dnssec-accept-expired is set.	2024-04-18 16:05:31 +02:00
Ondřej Surý	23835c4afe	Use xmlMemSetup() instead of xmlGcMemSetup() Since we don't have a specialized function for "atomic" allocations, it's better to just use xmlMemSetup() instead of xmlGcMemSetup() according to this: https://mail.gnome.org/archives/xml/2007-August/msg00032.html	2024-04-18 10:53:31 +02:00
Ondřej Surý	950f828cd2	Offload the isc_http response processing to worker thread Prepare the statistics channel data in the offloaded worker thread, so the networking thread is not blocked by the process gathering data from various data structures. Only the netmgr send is then run on the networkin thread when all the data is already there.	2024-04-18 10:53:00 +02:00
Matthijs Mekking	c3d8932f79	Add checkconf check for signatures-jitter Having a value higher than signatures-validity does not make sense and should be treated as a configuration error.	2024-04-18 09:50:33 +02:00
Matthijs Mekking	67f403a423	Implement signature jitter When calculating the RRSIG validity, jitter is now derived from the config option rather than from the refresh value.	2024-04-18 09:50:10 +02:00
Matthijs Mekking	0438d3655b	Refactor code that calculates signature validity There are three code blocks that are (almost) similar, refactor it to one function.	2024-04-18 09:50:10 +02:00
Matthijs Mekking	2a4daaedca	Add signatures-jitter option Add an option to speficy signatures jitter.	2024-04-18 09:50:10 +02:00
Mark Andrews	bf70d4840c	dns_qpkey_toname failed to reset name correctly This could lead to a mismatch between name->length and the rest of the name structure.	2024-04-18 00:17:48 +00:00
Ondřej Surý	eb1829b970	Use atomic operations to access the trust byte in ncache data Protect the access to the trust byte in the ncache data with relaxed atomic operation to mimick the current behaviour. This will teach TSAN that the concurrent access is fine.	2024-04-17 17:14:34 +02:00
Mark Andrews	4ef755ffb0	Only copy the name data after we know its actual length This prevents TSAN errors with the ncache code where the trust byte access needs to be protected by a lock. The old code copied the entire region before determining where the name ended. We now determine where the name ends then copy just that data and in doing so avoid reading the trust byte.	2024-04-17 17:14:34 +02:00
Mark Andrews	40fd4cd407	Wrong source address used for IPv6 notify messages The source address field of 'newnotify' was not updated from the default (0.0.0.0) when the destination address was an IPv6 address. This resulted in the messages failing to be sent. Set the source address to :: when the destination address is an IPv6 address.	2024-04-11 18:05:25 +00:00
Evan Hunt	2c88946590	dns_name_dupwithoffsets() cannot fail this function now always returns success; change it to void and clean up its callers.	2024-04-10 22:51:07 -04:00
Ondřej Surý	304b5ec1ad	Deprecate fixed value for the rrset-order option Mark the "fixed" value for the "rrset-order" option deprecated, so we can remove it in the future.	2024-04-02 15:21:00 +00:00
Ondřej Surý	7c96bf3e71	Deprecate sortlist option Mark the sortlist option deprecated, so we can remove it in the future.	2024-04-02 16:26:39 +02:00
Aram Sargsyan	a5ea7bcd25	Rename and fix dns_validator_destroy() to dns_validator_shutdown() Since the dns_validator_destroy() function doesn't guarantee that it destroys the validator, rename it to dns_validator_shutdown() and require explicit dns_validator_detach() to follow. Enforce the documented function requirement that the validator must be completed when the function is called. Make sure to set val->name to NULL when the function is called, so that the owner of the validator may destroy the name, even if the validator is not destroyed immediately. This should be safe, because the name can be used further only for logging by the offloaded work callbacks when they detect that the validator is already canceled/complete, and the logging function has a condition to use the name only when it is non-NULL.	2024-04-02 16:21:54 +02:00
Aram Sargsyan	a6c6ad048d	Remove a redundant log message and a comment If val->result is not ISC_R_SUCCESS, a similar message is logged further down in the function. Remove the redundant log message. Also remove an unnecessary code comment line.	2024-04-02 10:34:31 +00:00
Evan Hunt	63659e2e3a	complete removal of isc_loop_current() isc_loop() can now take its place. This also requires changes to the test harness - instead of running the setup and teardown outside of th main loop, we now schedule the setup and teardown to run on the loop (via isc_loop_setup() and isc_loop_teardown()) - this is needed because the new the isc_loop() call has to be run on the active event loop, but previously the isc_loop_current() (and the variants like isc_loop_main()) would work even outside of the loop because it needed just isc_tid() to work, but not the full loop (which was mainly true for the main thread).	2024-04-02 10:35:56 +02:00
Evan Hunt	c47fa689d4	use a thread-local variable to get the current running loop if we had a method to get the running loop, similar to how isc_tid() gets the current thread ID, we can simplify loop and loopmgr initialization. remove most uses of isc_loop_current() in favor of isc_loop(). in some places where that was the only reason to pass loopmgr, remove loopmgr from the function parameters.	2024-04-02 10:35:56 +02:00
Evan Hunt	ea6659a5e9	update foundname when detecting a zonecut above qname an assertion could be triggered in the QPDB cache if a DNAME was found above a queried NS, because the 'foundname' value was not correctly updated to point to the zone cut. the same mistake existed in qpzone and has been fixed there as well.	2024-04-02 10:00:03 +02:00
Matthijs Mekking	77d4bb9751	Fix fix_iterator hang If there are no more previous leaves, it means the queried name precedes the entire range of names in the database, so we should just move the iterator one step back and return, instead of continuing our search for the predecessor. This is similar to an earlier bug fixed in an earlier commit: `ea9a8cb392`	2024-03-25 10:40:23 +01:00
Mark Andrews	4d2d80f534	Remove remenants of cache support from qpzone.c These where leading to Coverity errors being reported.	2024-03-19 22:04:10 +00:00
Evan Hunt	17186e06bb	reduce memory consumption of the remaining QP databases use dynamically allocated names instead of fixednames in forward.c, keytable.c, nametree.c, and nta.c	2024-03-14 10:25:07 -07:00
Evan Hunt	c0fcc2899e	reduce memory consumption of rpz summary database use dynamically allocated names instead of fixednames in rpz.c	2024-03-14 10:20:52 -07:00
Evan Hunt	8b67476249	reduce memory consumption of qpcache database as with qpzone, use a dynamically-allocated dns_name instead of a dns_fixedname object to store node names in the QP database.	2024-03-14 10:20:52 -07:00
Evan Hunt	f908d358c4	reduce memory consumption of qpzone database every node of a QP database contains a copy of the nodename, which is used as the key for the QP-trie. previously, the name was stored as a dns_fixedname object, which has room for up to 255 characters. we can reduce the space consumed by dynamically allocating a dns_name object that's just long enough for the name to be stored.	2024-03-14 10:20:52 -07:00
Matthijs Mekking	ad33a73f83	Fix Coverity CID 487882: Error handling issues The dns_qpiter_next() was called without checking the return value. If we cannot move the iterator forward, there is no use in calling the step() function. /lib/dns/qpzone.c: 2804 in activeempty() 2798 * of the name we were searching for. Step the iterator 2799 * forward, then step() will continue forward until it 2800 * finds a node with active data. If that node is a 2801 * subdomain of the one we were looking for, then we're 2802 * at an active empty nonterminal node. 2803 */ >>> CID 487882: Error handling issues (CHECKED_RETURN) >>> Calling "dns_qpiter_next" without checking return value (as is done elsewhere 26 out of 27 times). 2804 dns_qpiter_next(it, NULL, NULL, NULL); 2805 return (step(search, it, FORWARD, next) && 2806 dns_name_issubdomain(next, current)); 2807 }	2024-03-14 14:01:23 +01:00
Matthijs Mekking	659fa0cbc3	Fix Coverity CID 487884: Dead code in qpcache.c Adding a changed record is zonedb related and does not belong in the cache code. This is a leftover dead code and can be safely removed. /lib/dns/qpcache.c: 3459 in add() 3453 } 3454 newheader->next = topheader->next; 3455 newheader->down = topheader; 3456 topheader->next = newheader; 3457 qpnode->dirty = 1; 3458 if (changed != NULL) { >>> CID 487884: (DEADCODE) >>> Execution cannot reach this statement: "changed->dirty = true;". 3459 changed->dirty = true; 3460 } 3461 } else { 3462 /* 3463 * No rdatasets of the given type exist at the node. 3464 */ /lib/dns/qpcache.c: 3409 in add() 3403 } 3404 newheader->next = topheader->next; 3405 newheader->down = topheader; 3406 topheader->next = newheader; 3407 qpnode->dirty = 1; 3408 if (changed != NULL) { >>> CID 487884: (DEADCODE) >>> Execution cannot reach this statement: "changed->dirty = true;". 3409 changed->dirty = true; 3410 } 3411 mark_ancient(header); 3412 if (sigheader != NULL) { 3413 mark_ancient(sigheader); 3414	2024-03-14 10:42:30 +00:00
Matthijs Mekking	e39de45adc	Detect invalid durations Be stricter in durations that are accepted. Basically we accept ISO 8601 formats, but fail to detect garbage after the integers in such strings. For example, 'P7.5D' will be treated as 7 days. Pass 'endptr' to 'strtoll' and check if the endptr is at the correct suffix.	2024-03-14 08:51:46 +01:00
Mark Andrews	40816e4e35	Don't use static stub when returning best NS If we find a static stub zone in query_addbestns look for a parent zone which isn't a static stub.	2024-03-14 11:39:27 +11:00
Evan Hunt	b3c8b5cfb2	remove dead code in rbtdb.c dns_db_addrdataset() enforces a requirement that version can only be NULL for a cache database. code that checks for zone semantics and version == NULL can never be reached.	2024-03-13 17:15:18 -07:00
Evan Hunt	29f1c93734	support nodefullname in rbt-zonedb.c this enables the 'dyndb' system test to pass when we build using --with-zonedb=rbt.	2024-03-13 17:15:18 -07:00
Evan Hunt	f0b164430a	remove dead code in qpzone.c qpzone does not support cache semantics, so dns_db_addrdataset(), _deleterdataset() and _subtractrdataset() can't be run with version == NULL; there's no need to check for it. we can also clean up free_qpdb() a bit since current_version is always non-NULL.	2024-03-13 17:15:18 -07:00
Mark Andrews	228cc557fe	Only call memmove if the rdata length is non zero This avoids undefined behaviour on zero length rdata where the data pointer is NULL.	2024-03-13 23:04:56 +00:00
Matthijs Mekking	0aac81cf80	Fix bug in keymgr Depends function The Depends relation refers to types of rollovers in which a certain record type is going to be swapped. Specifically, the Depends relation says there should be no dependency on the predecessor key (the set Dep(x, T) must be empty). But if the key is phased out (all its states are in HIDDEN), there is no longer a dependency. Since the relationship is still maintained (Predecessor and Successor metadata), the keymgr_dep function still returned true. In other words, the set Dep(x, T) is not considered empty. This slows down key rollovers, only retiring keys when the successor key has been fully propagated.	2024-03-13 10:58:24 +01:00
Matthijs Mekking	fb2f0c8168	Fix validate_dnskey_dsset when KSK is not signing When there is a secure chain of trust with a KSK that is not actively signing the DNSKEY RRset, the code for validating the DNSKEY RRset against the DS RRset could potentially skip DS records, thinking the chain of trust is broken while there is a valid DS with corresponding DNSKEY record present. This is because we pass the result ISC_R_NOMORE on when we are done checking for signatures, but then treat it as "no more DS records". Chaning the return value to something else (DNS_R_NOVALIDSIG seems the most appropriate here) fixes the issue.	2024-03-12 09:10:41 +01:00
Evan Hunt	5709f7bad9	rename qpdb to qpcache move qpdb.c to qpcache.c and rename the "qp" database implementation to "qpcache", in order to make it more clearly distinguishable from "qpzone".	2024-03-08 15:36:56 -08:00
Evan Hunt	e14a116ced	collapse qpdb implementation down to one file the code in qpdb.c was previously shared by qp-cachedb.c and qp-zonedb.c. since qp-zonedb.c no longer exists, it's not necessary to keep these separate any longer. the two files have been merged, and functions that were previously globally accessible have been changed to static and renamed.	2024-03-08 15:36:56 -08:00
Evan Hunt	ab084d8c4f	remove qp-zonedb.c and associated code now that "qpzone" databases are available for use in zones, we no longer need to retain the zone semantics in the "qp" database. all zone-specific code has been removed from QPDB, and "configure --with-zonedb" once again takes two values, rbt and qp. some database API methods that are never used with a cache have been removed from qpdb.c and qp-cachedb.c; these include newversion, closeversion, subtractrdataset, and nodefullname.	2024-03-08 15:36:56 -08:00
Evan Hunt	ac2c454f4f	add a nodefullname implementation for the qpzone database this enables the 'dyndb' system test to use a qpzone database.	2024-03-08 15:36:56 -08:00
Evan Hunt	3512cf5654	add setup/commit functions to rdatacallbacks because dns_qpmulti_commit() can be time consuming, it's inefficient to open and commit a qpmulti transaction for each rdataset being loaded into a database. we can improve load time by opening a qpmulti transaction before adding a group of rdatasets and then committing it afterward. this commit adds 'setup' and 'commit' functions to dns_rdatacallbacks_t, which can be called before and after the loops in which 'add' is called in dns_master_load() and axfr_apply().	2024-03-08 15:36:56 -08:00
Evan Hunt	2e45866715	use DNS_DB_NONSEC3 flag when copying non-dnssec records when copying the non-dnssec records in receive_secure_db(), use DNS_DB_NONSEC3 so we don't accidentally create nodes in the main tree for NSEC3 records. this was a long-standing error in the code, but was harmless in the RBTDB.	2024-03-08 15:36:56 -08:00
Evan Hunt	55f38e34dc	improve node reference counting QP database node data is not reference counted the same way RBT nodes were: in the RBT, node->references could be zero if the node was in the tree but was not in use by any caller, whereas in the QP trie, the database itself uses reference counting of nodes internally. this caused some subtle errors. in RBTDB, when the newref() function is called and the node reference count was zero, the node lock reference counter would also be incremented. in the QP trie, this can never happen - because as long as the node is in the database its reference count cannot be zero - and so the node lock reference counter was never incremented. this has been addressed by maintaining a separate "erefs" counter for external references to the node. this is the same approach used in the "qpdb-lite" database in commit `e91fbd8dea`. while troubleshooting this issue, some compile errors were discovered when building with DNS_DB_NODETRACE; those have also been fixed.	2024-03-08 15:36:56 -08:00
Evan Hunt	2b4133a32c	switch default zone database from "qp" to "qpzone" use the dns_qpmulti-based "qpzone" by default throughout BIND, instead of the existing dns_qp-based "qp", when creating zone databases. (cache databases still use "qp".) the "--with-zonedb" option has been updated in configure.ac to permit the use of both "qp" and "qpzone" databases. in zone.c there was a test that prevented any database type other than "qp" from hosting an RPZ. this was outdated, and has been removed.	2024-03-08 15:36:56 -08:00
Evan Hunt	2222728a4f	release RCU in dns_qpmulti_snapshot() previously, an RCU critical section was held open for the duration of a snapshot. this should not be necessary, as the snapshot makes local copies of QP trie metadata, and it causes problems when a DB iterator is held open between two loop events. we now call rcu_read_unlock() after setting up the snapshot.	2024-03-08 15:36:56 -08:00
Evan Hunt	6e167724e7	complete the qpzone database API implementation finish importing the database API methods from RBTDB to qpzone: issecure, nodecount, getnsec3parameters, findnsec3node, setsigningtime, getsigningtime, getsize, setgluecachestats, locknode, unlocknode, and addglue.	2024-03-08 15:36:56 -08:00
Evan Hunt	f46455cfcb	allow updating of records in a qpzone database add database API methods needed to apply updates to an existing zone database (newversion, addrdataset, subtractrdataset and deleterdataset). it is now possible to apply journals to zone databases after loading, so named-checkzone -J works correctly.	2024-03-08 15:36:56 -08:00
Evan Hunt	60b5422cda	make the qpzone database dumpable add database API method implementations needed to iterate and dump a qpzone database to a file (createiterator, allrdatasets and attachversion, plus dbiterator and rdatasetiter methods). named-checkzone -D can now dump the contents of most zones, but zone cuts are not correctly detected.	2024-03-08 15:36:56 -08:00
Evan Hunt	628fa8a3d6	make the qpzone database loadable add database API methods needed for loading rdatasets into memory (currentversion, beginload, endload), plus the methods used by zone_postload() for zone consistency checks (getoriginnode, find, findnode, findrdataset, attachnode, detachnode, deletedata). the QP trie doesn't support the find callback mechanism available in dns_rbt_findnode() which allows examination of intermediate nodes while searching, so the detection of wildcard and delegation nodes is now done by scanning QP chains after calling dns_qp_lookup(). Note that the lookup in previous_closest_nsec() cannot return ISC_R_NOTFOUND. In RBTDB, we checked for this return value and ovewrote the result with ISC_R_NOMORE if it occurred. In the qpzone implementation, we insist that this return value cannot happen. dns_qp_lookup() would only return ISC_R_NOTFOUND if we asked for a name outside the zone's authoritative domain, and we never do that when looking up a predecessor NSEC record. named-checkzone is now able to load a zone and check it for errors, but cannot dump it.	2024-03-08 15:36:49 -08:00
Evan Hunt	be24feb252	stub dns_qpmulti-based zone database implementation created files for a dns_qpmulti-based zone database, "qpzone". currently this only has create and destroy functions.	2024-03-06 20:57:31 -08:00
Mark Andrews	926d2e4cf2	dns_db_setloop called at wrong place on wrong db In cache_create_db, dns_db_setloop should be called on the newly created db only if the database creation succeeded.	2024-03-07 13:10:23 +11:00
Ondřej Surý	d492d676ef	Move the dns_db_setloop into cache_create_db() The dns_cache_flush() drops the old database and creates a new one, but it forgets to pass the loop that runs the node pruning and cleaning the rbtdb when flushing it next time. This causes the cleaning to skip cleaning the parent nodes (with .down == NULL) leading to increased memory usage over time until the database is unable to keep up and just stays overmem all the time.	2024-03-06 18:33:33 +01:00
Ondřej Surý	454c75a33a	Restore the parent cleaning logic in prune_tree() Reconstruct the variant of the prune_tree() parent cleaning to consider all elibible parents in a single loop as we were doing before all the changes that led to this commit. Update code comments so that they more precisely describe what the relevant bits of code actually do.	2024-03-06 13:03:17 +01:00
Evan Hunt	92b305be4b	add a compile-time option to select default zone and cache DB by default, QPDB is the database used by named and all tools and unit tests. the old default of RBTDB can now be restored by using "configure --with-zonedb=rbt --with-cachedb=rbt". some tests have been fixed so they will work correctly with either database. CHANGES and release notes have been updated to reflect this change.	2024-03-06 10:49:02 +01:00
Matthijs Mekking	3facc5b51d	Fix race condition crash When running resolver benchmark pipeline, a crash occurred: https://gitlab.isc.org/isc-projects/bind9-shotgun-ci/-/pipelines/163946 In the code we are doing a lookup, it fails (meaning there is no node with lookup name), we create the node and insert it and it fails. But dns_qp_insert can only return ISC_R_SUCCESS or ISC_R_EXISTS. So it must have been inserted in between. This is a race condition bug. The first lookup only requires a write lock and if the lookup failed the lock gets upgraded to a write lock and we insert the missing data. To fix the race condition bug, we need to do a lookup again after we have upgraded the lock to make sure it wasn't inserted in the mean time.	2024-03-06 10:49:02 +01:00
Matthijs Mekking	7db974b240	Remove pruning tree code Since qp-tries does not store interior nodes, we can remove all code related to pruning the tree.	2024-03-06 10:49:02 +01:00
Matthijs Mekking	78fd4e2b5c	Update qpdb.c to make coccinelle happy Applying semantic patch cocci/isc_mem_cget.spatch... 150 files match diff -u -p a/lib/dns/qpdb.c b/lib/dns/qpdb.c --- a/lib/dns/qpdb.c +++ b/lib/dns/qpdb.c @@ -3801,16 +3801,15 @@ dns__qpdb_create(isc_mem_t mctx, const goto cleanup_tree_lock; } INSIST(qpdb->node_lock_count < (1 << DNS_RBT_LOCKLENGTH)); - qpdb->node_locks = isc_mem_get(mctx, qpdb->node_lock_count - sizeof(db_nodelock_t)); + qpdb->node_locks = isc_mem_cget(mctx, qpdb->node_lock_count, + sizeof(db_nodelock_t)); qpdb->common.update_listeners = cds_lfht_new(16, 16, 0, 0, NULL); if (IS_CACHE(qpdb)) { dns_rdatasetstats_create(mctx, &qpdb->rrsetstats); - qpdb->lru = isc_mem_get(mctx, - qpdb->node_lock_count * - sizeof(dns_slabheaderlist_t)); + qpdb->lru = isc_mem_cget(mctx, qpdb->node_lock_count, + sizeof(dns_slabheaderlist_t)); for (i = 0; i < (int)qpdb->node_lock_count; i++) { ISC_LIST_INIT(qpdb->lru[i]); } @@ -3819,8 +3818,8 @@ dns__qpdb_create(isc_mem_t mctx, const / * Create the heaps. / - qpdb->heaps = isc_mem_get(hmctx, qpdb->node_lock_count - sizeof(isc_heap_t )); + qpdb->heaps = isc_mem_cget(hmctx, qpdb->node_lock_count, + sizeof(isc_heap_t )); for (i = 0; i < (int)qpdb->node_lock_count; i++) { qpdb->heaps[i] = NULL; } @@ -3834,8 +3833,8 @@ dns__qpdb_create(isc_mem_t mctx, const / * Create deadnode lists. / - qpdb->deadnodes = isc_mem_get(mctx, qpdb->node_lock_count - sizeof(dns_qpdatalist_t)); + qpdb->deadnodes = isc_mem_cget(mctx, qpdb->node_lock_count, + sizeof(dns_qpdatalist_t)); for (i = 0; i < (int)qpdb->node_lock_count; i++) { ISC_LIST_INIT(qpdb->deadnodes[i]); }	2024-03-06 10:49:02 +01:00
Evan Hunt	89c4c1aa87	add dns_db_nodefullname() the dyndb test requires a mechanism to retrieve the name associated with a database node, and since the database no longer uses RBT for its underlying storage, dns_rbt_fullnamefromnode() doesn't work. addressed this by adding dns_db_nodefullname() to the database API.	2024-03-06 10:49:02 +01:00
Matthijs Mekking	cdf62a18e7	Rework dbiterator implementation If the iterator is paused, the tree is unlocked and may change. In an RBT tree it's always possible to resume iteration as long as a valid node pointer was still held, but now that the underlying database structure is a QP trie, the iterator needs to be initialized based on the existing structure of the trie or it will return inconsistent results. We now call dns_qp_lookup() to reinitialize the QP iterator whenever dbiterator_next() or dbiterator_prev() is called on a paused iterator.	2024-03-06 10:49:02 +01:00
Matthijs Mekking	e91fbd8dea	Improve node reference counting QP database node data is not reference counted the same way RBT nodes were: in the RBT, node->references could be zero if the node was in the tree but was not in use by any caller, whereas in the QP trie, the database itself uses reference counting of nodes internally. this caused some subtle errors. in RBTDB, when the newref() function is called and the node reference count was zero, the node lock reference counter would also be incremented. in the QP trie, this can never happen - because as long as the node is in the database its reference count cannot be zero - and so the node lock reference counter was never incremented. reference counting will probably need to be refactored in more detail later; the node lock reference count may not be needed at all. but for now, as a temporary measure, we add a third reference counter, 'erefs' (external references), to the dns_qpdata structure. this is counted separately from the main reference counter, and should match the node reference count as it would have been in RBTDB. this change revealed a number of places where the node reference counter was being incremented on behalf of a caller without newref() being called; those were cleaned up as well. This is an adaptation of commit 3dd686261d2c4bcd15a96ebfea10baffa277732b	2024-03-06 10:49:02 +01:00
Matthijs Mekking	91a2755433	No special logic for relative names Nodes in a QP-trie contain the full domain name, while nodes in a red-black tree only contain names relative to a parent.	2024-03-06 10:49:02 +01:00
Matthijs Mekking	1a068c9656	Change free_gluetable Fixes a crash at shutdown.	2024-03-06 09:57:25 +01:00
Matthijs Mekking	10efb6fdc2	Calculating hashsize is obsolete We don't have hash tables for qp.	2024-03-06 09:57:25 +01:00
Matthijs Mekking	820abdb80a	Add proper qp cleanup Fix reference counting: unreference nodes that are succesfully inserted in the tree, detach created nodes, and cleanup the interior data in dns_qpdata_destroy().	2024-03-06 09:57:25 +01:00
Matthijs Mekking	fe97aa59b9	Replace dns_rbtnode_t with dns_qpdata_t This for now has almost the same structure contents except for dns_qpdata_t has 'fn' and 'name' to store the domain name.	2024-03-06 09:57:25 +01:00
Matthijs Mekking	cc3a40dafa	Replace dns_rbt_nodecount with dns_qp_memusage We now count the nodes by getting the memory usage and return the number of leaves.	2024-03-06 09:57:25 +01:00
Matthijs Mekking	e95dfc0119	Replace dns_rbt_namefromnode with dns_name_copy The name will be stored inside the node now so we can just copy it. These are leftovers, most of the namefromnode code has been replaced already in previous commits.	2024-03-06 09:57:24 +01:00
Matthijs Mekking	6a5de6390f	Replace rbtnodechain with qpchain and qpiter The qp approach pulled apart the chain and iterator into two separate things. Replace the rbtnodechain with qpchain and qpiter. Most of the times we are interested in the iterator only, the rbtnodechain was mainly used as an an iterator to get the previous and next name in the DNS canonical order. Since dns_qpiter_prev() and dns_qpiter_next() store the name, origin, and node in the provided parameters, often there is no need to call a current() function anymore. Getting the first or last item from the iterator is done by re-initializing the iterator and then call dns_qpiter_next() or dns_qpiter_prev() respectively. The dbiterator no longer needs to maintain a chain, only an iterator.	2024-03-06 09:57:24 +01:00
Matthijs Mekking	8572435a31	Replace rbt_findnode with qp_lookup All dns_qp_lookup() calls assume it is okay to find empty data, so we don't need to do anything special for the DNS_RBTFIND_EMPTYDATA. You can pass a callback function to dns_rbt_findnode(), something that qp does not support. Instead, call the function afterwards. This has the drawback that we do more lookup work if there was a zonecut. With dns_qp_lookup() we also don't pass any options. In this case, when DNS_RBTFIND_NOEXACT was set, we adapt the result after the lookup.	2024-03-06 09:57:24 +01:00
Matthijs Mekking	8fcfa36660	Replace rbt_deletenode with qp_deletename Replace dns_rbt_deletenode calls with dns_qp_deletename. For removing the name from the nsec tree, we no longer first have to find it: we can just remove the key (retrieved by name).	2024-03-06 09:57:24 +01:00
Matthijs Mekking	c53b95e134	Replace rbt_addnode with qp_insert Replace dns_rbt_addnode calls with dns_qp_insert. With QP, it sometimes makes more sense to first lookup the name and see if there is an existing node (rather than create new data, insert, find out a node already exists, and destroy the data again). This is done with dns_qp_getname(), which is more lightweight than dns_qp_lookup(), and we are only interested in if there is already a leaf node for this name or not.	2024-03-06 09:57:24 +01:00
Evan Hunt	bb4464181a	switch database defaults from "rbt" to "qp" replace the string "rbt" throughout BIND with "qp" so that qpdb databases will be used by default instead of rbtdb. rbtdb databases can still be used by specifying "database rbt;" in a zone statement.	2024-03-06 09:57:24 +01:00
Evan Hunt	845f832308	rename dns_rbtdb to dns_qpdb this commit renames all variables and macros with the string "rbtdb" or "RBDTB" to "qpdb" or "QPDB".	2024-03-06 09:57:24 +01:00
Matthijs Mekking	2edf73dc05	Begin replacement of rbt with qp in rbtdb - Copy rbtdb.c, rbt-zonedb.c and rbt-cachedb.c to qp-*. - Added qpmethods. - Added a new structure dns_qpdata that will replace dns_rbtnode. - Replaced normal, nsec, and nsec3 dns_rbt trees with dns_qp tries. - Replaced dns_rbt_create() calls with dns_qp_create(). - Replaced the dns_rbt_destroy() call with dns_qp_destroy(). - Create a dns_qpdata struct and create/destroy methods. This commit will not build.	2024-03-06 09:57:24 +01:00
Mark Andrews	5ff55e13e8	Restore the disassociate call to before the fetch [GL #3709] reordered the dns_rdataset_disassociate call to after the dns_resolver_createfetch call resulting in qctx->nsrrset still being associated when dns_resolver_createfetch is called in resume_dslookup (`7e4e125e`). Revert that part of the change and add comments as to why the multiple dns_rdataset_disassociate calls are where they are.	2024-03-06 10:08:30 +11:00
Ondřej Surý	e74c7dcf51	Always call the TCP dispatch connected callbacks asynchronously The TCP dispatch connected callbacks could be called synchronously which in turn could destroy xfrin before we return from dns_xfrin_create(). Delay the calling the callback called from tcp_dispatch_connect() by calling it always asynchronously.	2024-03-04 16:34:14 +01:00
Ondřej Surý	98d59bdf62	Pin the xfr to a specific loop Instead of getting the loop from the zone every time, attach the xfrin directly to the loop. This also allows to remove the extra safety tid checks from the dns_xfrin unit.	2024-03-04 16:34:14 +01:00
Ondřej Surý	d8220ca4ca	Make the TTL-based cleaning more aggressive It was discovered that the TTL-based cleaning could build up a significant backlog of the rdataset headers during the periods where the top of the TTL heap isn't expired yet. Make the TTL-based cleaning more aggressive by cleaning more headers from the heap when we are adding new header into the RBTDB.	2024-02-29 12:57:06 +01:00
Ondřej Surý	a9383e4b95	Remove expired rdataset headers from the heap It was discovered that an expired header could sit on top of the heap a little longer than desireable. Remove expired headers (headers with rdh_ttl set to 0) from the heap completely, so they don't block the next TTL-based cleaning.	2024-02-29 12:56:36 +01:00
Ondřej Surý	0b32d323e0	Simplify the parent cleaning in the prune_tree() mechanism Instead of juggling with node locks in a cycle, cleanup the node we are just pruning and send any the parent that's also subject to the pruning to the prune tree via normal way (e.g. enqueue pruning on the parent). This simplifies the code and also spreads the pruning load across more event loop ticks which is better for lock contention as less things run in a tight loop.	2024-02-29 11:23:03 +01:00
Ondřej Surý	eed17611d8	Reduce lock contention during RBTDB tree pruning The log message for commit `24381cc36d` explained: In some older BIND 9 branches, the extra queuing overhead eliminated by this change could be remotely exploited to cause excessive memory use. Due to architectural shift, this branch is not vulnerable to that issue, but applying the fix to the latter is nevertheless deemed prudent for consistency and to make the code future-proof. However, it turned out that having a single queue for the nodes to be pruned increased lock contention to a level where cleaning up nodes from the RBTDB took too long, causing the amount of memory used by the cache to grow indefinitely over time. This commit reverts the change to the pruning mechanism introduced by commit `24381cc36d` as BIND branches newer than 9.16 were not affected by the excessive event queueing overhead issue mentioned in the log message for the above commit.	2024-02-29 11:23:03 +01:00
Mark Andrews	0651063658	Add RESINFO record type This is a TXT clone using code point 261.	2024-02-26 12:02:40 +11:00
Mark Andrews	7ce2e86024	Do not use header_prev in expire_lru_headers dns__cacherbt_expireheader can unlink / free header_prev underneath it. Use ISC_LIST_TAIL after calling dns__cacherbt_expireheader instead to get the next pointer to be processed.	2024-02-23 12:00:12 +01:00
Artem Boldariev	f8812d4184	Do not lock workers when using -T transferslowly/transferstuck This commit ensures that worker threads are not sleeping (by using select()) when '-T transferslowly/transferstuck' test options are used. This commit converts synchronous implementation of the code into an asynchronous one based on timers.	2024-02-22 00:09:04 +02:00
Artem Boldariev	4cbe1eb368	DoT: do not crash resolver on TLS context creation failure The resolver's code was not ready to failures when trying to establish a connection via TCP-based transports (e.g. when creating TLS contexts before establishing a TLS connection). This commit fixes that.	2024-02-21 21:05:21 +02:00
Aram Sargsyan	9e38d0e3af	Clean up fetch_answered After the changes in [GL #4447] the 'fetch_answered' variable is always false now. Delete the unnecessary code.	2024-02-20 10:46:40 +00:00
Aram Sargsyan	03b68b8c38	Address scan-build warnings The warnings (see below) seem to be false-positives. Address them by adding runtime checks. resolver.c:1627:10: warning: Access to field 'tid' results in a dereference of a null pointer (loaded from variable 'fctx') [core.NullDereference] 1627 \| REQUIRE(fctx->tid == isc_tid()); \| ^~~~~~~~~ ../../lib/isc/include/isc/util.h:332:34: note: expanded from macro 'REQUIRE' 332 \| #define REQUIRE(e) ISC_REQUIRE(e) \| ^ ../../lib/isc/include/isc/assertions.h:45:11: note: expanded from macro 'ISC_REQUIRE' 45 \| ((void)((cond) \|\| \ \| ^~~~ resolver.c:10335:6: warning: Access to field 'depth' results in a dereference of a null pointer (loaded from variable 'fctx') [core.NullDereference] 10335 \| if (fctx->depth > depth) { \| ^~~~~~~~~~~ 2 warnings generated.	2024-02-16 08:42:48 +00:00
Aram Sargsyan	bd7463914f	Disallow stale-answer-client-timeout non-zero values Remove all the code and tests which support non-zero stale-answer-client-timeout values, and adjust the documentation.	2024-02-16 08:41:52 +00:00
Evan Hunt	e40fd4ed06	fix several bugs in the RBTDB dbiterator implementation - the DNS_DB_NSEC3ONLY and DNS_DB_NONSEC3 flags are mutually exclusive; it never made sense to set both at the same time. to enforce this, it is now a fatal error to do so. the dbiterator implementation has been cleaned up to remove code that treated the two as independent: if nonsec3 is true, we can be certain nsec3only is false, and vice versa. - previously, iterating a database backwards omitted NSEC3 records even if DNS_DB_NONSEC3 had not been set. this has been corrected. - when an iterator reaches the origin node of the NSEC3 tree, we need to skip over it and go to the next node in the sequence. the NSEC3 origin node is there for housekeeping purposes and never contains data. - the dbiterator_test unit test has been expanded, several incorrect expectations have been fixed. (for example, the expected number of iterations has been reduced by one; we were previously counting the NSEC3 origin node and we should not have been doing so.)	2024-02-15 10:15:50 -08:00
Evan Hunt	7d59a0ed81	prevent a possible race in setting up zone->xfr the call to dns_xfrin_create() wrote to zone->xfr with the zone unlocked.	2024-02-14 18:53:17 +00:00
Evan Hunt	3e683a9ed5	test for SIGTYPE correctly a comparison was incorrectly removed during a previous merge.	2024-02-14 09:32:20 -08:00
Michał Kępień	8610799317	BIND 9.19.21 -----BEGIN SSH SIGNATURE----- U1NIU0lHAAAAAQAAARcAAAAHc3NoLXJzYQAAAAMBAAEAAAEBANamVSTMToLcHCXRu1f52e tTJWV3T1GSVrPYXwAGe6EVC7m9CTl06FZ9ZG/ymn1S1++dk4ByVZXf6dODe2Mu0RuqGmyf MUEMKXVdj3cEQhgRaMjBXvIZoYAsQlbHO2BEttomq8PhrpLRizDBq4Bv2aThM0XN2QqSGS ozwYMcPiGUoMVNcVrC4ZQ+Cptb5C4liqAcpRqrSo8l1vcNg5b1Hk6r7NFPdx542gsGMLae wZrnKn3LWz3ZXTGeK2cRmBxm/bydiVSCsc9XjB+tWtIGUpQsfaXqZ7Hs6t+1f1vsnu88oJ oi1dRBo3YNRl49UiCukXWayQrPJa8wwxURS9W28JMAAAADZ2l0AAAAAAAAAAZzaGE1MTIA AAEUAAAADHJzYS1zaGEyLTUxMgAAAQBSREyaosd+mY8kovqAvGYR8pOui/7gOi6pBprPGw RlOB5z6YOx5FOjbVL/YvBhKk2gbox++o8jCMEmdNNbWeO3U3uBvxCa+8QGARbuMV6vdoR4 qjnOgOfryXyaRw7PQX0ZH0gPw1B1036y5bnW7WPkqrTvGgxW34O1q6j0EumE0vh90E24/l PAWKDCTqDR/+slGDuWgtPcCZuClljw1Mh0dAliKkGhp0l80qMQSr6O/p66A44UxzKwtnnt lagtO0j4nZ+BxC/hyaFc/FlCzeoc48qFQRIt0ZjYKU+XK0CUr2RTpYFdi/n7y3BNd7bDkD nIkEDddn/lXP5rkAdkmDCa -----END SSH SIGNATURE----- gpgsig -----BEGIN SSH SIGNATURE----- U1NIU0lHAAAAAQAAADMAAAALc3NoLWVkMjU1MTkAAAAg25GGAuUyFX1gxo7QocNm8V6J/8 frHSduYX7Aqk4iJLwAAAADZ2l0AAAAAAAAAAZzaGE1MTIAAABTAAAAC3NzaC1lZDI1NTE5 AAAAQEGqBHXwCtEJxRzHbTp6CfBNjqwIAjRD9G+HC4M7q77KBEBgc6dRf15ZRRgiWJCk5P iHMZkEMyWCnELMzhiTzgE= -----END SSH SIGNATURE----- Merge tag 'v9.19.21' BIND 9.19.21	2024-02-14 13:24:56 +01:00
Evan Hunt	ac9bd03a0d	clean up dns_rbt - create_node() in rbt.c cannot fail - the dns_rbt_*name() functions, which are wrappers around dns_rbt_[add\|find\|delete]node(), were never used except in tests. this change isn't really necessary since RBT is likely to go away eventually anyway. but keeping the API as simple as possible while it persists is a good thing, and may reduce confusion while QPDB is being developed from RBTDB code.	2024-02-14 01:36:44 -08:00
Evan Hunt	78d173b548	move DNS_RBT_NSEC_* to db.h these values pertain to whether a node is in the main, nsec, or nsec3 tree of an RBTDB. they need to be moved to a more generic location so they can also be used by QPDB. (this is in db.h rather than db_p.h because rbt.c needs access to it. technically, that's a layer violation, but it's a long-existing one; refactoring to get rid of it would be a large hassle, and eventually we expect to remove rbt.c anyway.)	2024-02-14 01:13:44 -08:00
Evan Hunt	27c862d953	separate generic DB helpers into db_p.h when the QPDB is implemented, we will need to have both qpdb_p.h and rbtdb_p.h. in order to prevent name collisions or code duplication, this commit adds a generic private header file, db_p.h, containing structures and macros that will be used by both databases. some functions and structs have been renamed to more specifically refer to the RBT database, in order to avoid namespace collision with similar things that will be needed by the QPDB later.	2024-02-14 09:00:27 +01:00
Evan Hunt	d1acc987e9	refactor wildcard matching refactor the wildcard matching code to make it a bit easier to understand, in hopes that it will reduce the difficulty of converting from RBTDB to QPDB later. there are also some minor optimizations: previously, after stepping backward to find the predecessor, we stepped back foward from the predecessor to find the successor. we now reset the rbtnode chain to its original starting point before stepping forward; this eliminates some unnecessary processing. and, if neither predecessor nor successor is found, we return early rather than carrying on with an unnecessary effort to match labels.	2024-02-13 22:14:17 +00:00
Mark Andrews	dc94f42209	Dissassociate rdatasets returned from dns_ncache_current lib/dns/validator.c:findnsec3proofs failed to disassociate the temporary rdataset returned by dns_ncache_current on all paths.	2024-02-13 11:42:56 +00:00
Mark Andrews	371defc357	Address CID 486326: Memory - corruptions (OVERRUN) Coverity detected that address->type.sa was too small when copying a struct sockaddr_sin6, use the alterative union element address->type.sin6 instead.	2024-02-13 09:21:49 +11:00
Mark Andrews	dd57db2274	Remove duplicate unreachable code block This was accidentially left in during the developement of !8299.	2024-02-12 15:18:46 +11:00
Ondřej Surý	175655b771	Fix case insensitive matching in isc_ht hash table implementation The case insensitive matching in isc_ht was basically completely broken as only the hashvalue computation was case insensitive, but the key comparison was always case sensitive.	2024-02-11 09:36:56 +01:00
Aydın Mercan	a911949ebc	Convert rwlock in isc_log_t to RCU The isc_log_t contains a isc_logconfig_t that is swapped, dereferenced or accessed its fields through a mutex. Instead of protecting it with a rwlock, use RCU.	2024-02-09 13:11:48 +03:00
Ondřej Surý	315aa3135a	Fix UAF in ccmsg.c when reading stopped before sending When shutting down the whole server, the reading could stop and detach from controlconnection before sending is done. If send callback then detaches from the last controlconnection handle, the ccmsg would be invalidated after the send callback and thus we must not access ccmsg after calling the send_cb().	2024-02-08 17:24:11 +01:00
Ondřej Surý	88a14985db	Add isc_nm_read_stop() and remove .reading member from ccmsg We need to stop reading when calling isc_ccmsg_disconnect() as the reading handle doesn't have to be last because sending might be in progress. After that, we can safely remove .reading member because the reading would not be called after the disconnect has been called. The ccmsg_senddone() should also not call the recv callback if the sending failed, that's the job of the caller's send callback - in fact it already does that, so the code in ccmsg_senddone() was superfluous.	2024-02-08 17:23:39 +01:00
Ondřej Surý	15329d471e	Add memory pools for isc_nmsocket_t structures To reduce memory pressure, we can add light per-loop (netmgr worker) memory pools for isc_nmsocket_t structures. This will help in situations where there's a lot of churn creating and destroying the nmsockets.	2024-02-08 15:13:47 +01:00
Ondřej Surý	750bd364b5	Reduce the isc_nmsocket_t size from 1840 to 1208 bytes Embedding isc_nmsocket_h2_t directly inside isc_nmsocket_t had increased the size of isc_nmsocket_t to 1840 bytes. Making the isc_nmsocket_h2_t to be a pointer to the structure and allocated on demand allows us to reduce the size to 1208 bytes. While there are still some possible reductions in the isc_nmsocket_t (embedded tlsstream, streamdns structures), this was the far biggest drop in the memory usage.	2024-02-08 15:13:47 +01:00
Ondřej Surý	eada7b6e13	Reduce struct isc__nm_uvreq size from 1560 to 560 bytes The uv_req union member of struct isc__nm_uvreq contained libuv request types that we don't use. Turns out that uv_getnameinfo_t is 1000 bytes big and unnecessarily enlarged the whole structure. Remove all the unused members from the uv_req union.	2024-02-08 15:13:47 +01:00
Ondřej Surý	2367b6a2e1	Reduce sizeof isc_sockaddr from 152 to 48 bytes After removing sockaddr_unix from isc_sockaddr, we can also remove sockaddr_storage and reduce the isc_sockaddr size from 152 bytes to just 48 bytes needed to hold IPv6 addresses.	2024-02-08 15:13:47 +01:00
Ondřej Surý	2463e5232d	Use proper padding instead of using alignas() As it was pointed out, the alignas() can't be used on objects larger than `max_align_t` otherwise the compiler might miscompile the code to use auto-vectorization on unaligned memory. As we were only using alignas() as a way to prevent false memory sharing, we can use manual padding in the affected structures.	2024-02-08 10:54:35 +01:00
Ondřej Surý	3f774c2a8a	Optimize cname_and_other_data to stop as earliest as possible Stop the cname_and_other_data processing if we already know that the result is true. Also, we know that CNAME will be placed in the priority headers, so we can stop looking for CNAME if we haven't found CNAME and we are past the priority headers.	2024-02-08 08:33:36 +01:00
Ondřej Surý	3ac482be7f	Optimize the slabheader placement for certain RRTypes Mark the infrastructure RRTypes as "priority" types and place them at the beginning of the rdataslab header data graph. The non-priority types either go right after the priority types (if any).	2024-02-08 08:33:36 +01:00
Ondřej Surý	5070c7f5c7	Fix missing RRSIG for CNAME with different slabheader order The cachedb was missing piece of code (already found in zonedb) that would make lookups in the slabheaders to miss the RRSIGs for CNAME if the order of CNAME and RRSIG(CNAME) was reversed in the node->data.	2024-02-08 08:02:48 +01:00
Ondřej Surý	0c18ed7ec6	Remove isc__tls_setfatalmode() function and the calls With _exit() instead of exit() in place, we don't need isc__tls_setfatalmode() mechanism as the atexit() calls will not be executed including OpenSSL atexit hooks.	2024-02-08 08:01:58 +01:00
Ondřej Surý	76997983fd	Use EXIT_SUCCESS and EXIT_FAILURE Instead of randomly using -1 or 1 as a failure status, properly utilize the EXIT_FAILURE define that's platform specific (as it should be).	2024-02-08 08:01:58 +01:00
Ondřej Surý	e140743e6a	Improve the rcu_barrier() call when destroying the mem context Instead of crude 5x rcu_barrier() call in the isc__mem_destroy(), change the mechanism to call rcu_barrier() until the memory use and references stops decreasing. This should deal with any number of nested call_rcu() levels. Additionally, don't destroy the contextslock if the list of the contexts isn't empty. Destroying the lock could make the late threads crash.	2024-02-08 08:01:58 +01:00
Ondřej Surý	2c98ccbdba	Use error checking mutex in developer mode on Linux When developer mode is enabled, use error checking mutex type, so we can discover wrong use of mutexes faster.	2024-02-07 20:54:05 +01:00
Ondřej Surý	01038d894f	Always use adaptive mutexes on Linux When adaptive mutexes are available (with glibc), always use them. Remove the autoconf switch and also fix the static initializer.	2024-02-07 20:54:05 +01:00
Ondřej Surý	cb1d2e57e9	Remove unused mutex from netmgr The netmgr->lock was dead code, remove it.	2024-02-07 20:54:05 +01:00
Mark Andrews	2f87c429a2	cleanup isc_symtab_define with isc_symexists_replace	2024-02-07 13:52:10 +11:00
Mark Andrews	1fb61494a8	Add RUNTIME_CHECK	2024-02-07 13:40:03 +11:00
Mark Andrews	95de7f829c	Ensure keyname buffer is big enough Use a temporary string rather than a fixed buffer to construct the keyname.	2024-02-07 13:39:51 +11:00
Mark Andrews	7cced1732d	cleanup isc_symtab_undefine callers isc_symtab_undefine now only return ISC_R_SUCCESS and ISC_R_EXISTS. Cleanup callers looking for other values.	2024-02-07 12:56:39 +11:00
Mark Andrews	4b93ae74c7	Restore dns_requestmgr_shutdown re-entrancy In the conversion to rcu the ability to call dns_requestmgr_shutdown multiple times was lost. nsupdate depended on this. Restore support for that.	2024-02-07 09:52:32 +11:00
Aram Sargsyan	2ec041b719	Expose the 'first refresh' zone flag in rndc status Expose the newly added 'first refresh' flag in the information provided by the 'rndc staus' command, by showing the number of zones, which are not yet fully ready, and their first refresh is pending or is in-progress.	2024-02-05 17:41:14 +00:00
Aram Sargsyan	0a1f05987f	Expose 'first refresh' zone flag in stats channel Add a new zone flag to indicate that a secondary type zone is not yet fully ready, and a first time refresh is pending or is in progress. Expose this new flag in the statistics channel's "Incoming Zone Transfers" section.	2024-02-05 17:41:14 +00:00
Aram Sargsyan	4cdef214d2	Require trust anchors for 'dnnsec-validation yes' Using the 'dnssec-validation yes' option now requires an explicitly confgiured 'trust-anchors' statement (or 'managed-keys' or 'trusted-keys', both deprecated).	2024-02-02 19:53:45 +00:00
Aram Sargsyan	0d7c7777da	Improve the definition of the DNS_GETDB_* flags Use the (1 << N) form for defining the flags, in order to avoid errors like the one fixed in the previous commit. Also convert the definitions to an enum, as done in some of our recent refactoring work.	2024-02-02 14:15:31 +00:00
Aram Sargsyan	be7d8fafe2	Fix the DNS_GETDB_STALEFIRST flag The DNS_GETDB_STALEFIRST flag is defined as 0x0C, which is the combination of the DNS_GETDB_PARTIAL (0x04) and the DNS_GETDB_IGNOREACL (0x08) flags (0x04 \| 0x08 == 0x0C) , which is an obvious error. All the flags should be power of two, so they don't interfere with each other. Fix the DNS_GETDB_STALEFIRST flag by setting it to 0x10.	2024-02-02 13:50:57 +00:00
Ondřej Surý	15096aefdf	Make the dns_validator validations asynchronous and limit it Instead of running all the cryptographic validation in a tight loop, spread it out into multiple event loop "ticks", but moving every single validation into own isc_async_run() asynchronous event. Move the cryptographic operations - both verification and DNSKEY selection - to the offloaded threads (isc_work_enqueue), this further limits the time we spend doing expensive operations on the event loops that should be fast. Limit the impact of invalid or malicious RRSets that contain crafted records causing the dns_validator to do many validations per single fetch by adding a cap on the maximum number of validations and maximum number of validation failures that can happen before the resolving fails.	2024-02-01 21:45:06 +01:00
Matthijs Mekking	07c2acf15d	Don't also skip keymgr run if checkds is skipped Checking the DS at the parent only happens if dns_zone_getdnsseckeys() returns success. However, if this function somehow fails, it can also prevent the keymgr from running. Before adding the check DS functionality, the keymgr should only run if 'dns_dnssec_findmatchingkeys()' did not return an error (either ISC_R_SUCCESS or ISC_R_NOTFOUND). After this change the correct result code is used again.	2024-02-01 12:06:08 +01:00
Evan Hunt	86fdc66ed3	check range of fetch-quota-param parameters the 'low', 'high' and 'discount' parameters to 'fetch-quota-param' are meant to be ratios with values between zero and one, but higher values can be assigned. this could potentially lead to an assertion in maybe_adjust_quota().	2024-01-31 18:19:38 -08:00
Aram Sargsyan	510f1de8a6	fix another message parsing regression The fix for CVE-2023-4408 introduced a regression in the message parser, which could cause a crash if an rdata type that can only occur in the question was found in another section. Use 'dns__message_putassociatedrdataset()' instead of 'dns__message_puttemprdataset()', because after calling the 'dns_rdatalist_tordataset()' function earlier the 'rdataset' is associated.	2024-01-31 15:52:46 +01:00
Evan Hunt	4c19d35614	fix a message parsing regression the fix for CVE-2023-4408 introduced a regression in the message parser, which could cause a crash if duplicate rdatasets were found in the question section. this commit ensures that rdatasets are correctly disassociated and freed when this occurs.	2024-01-31 15:52:46 +01:00
Matthijs Mekking	8602beecd1	Replace keystore attach/detach with ISC_REFCOUNT_IMPL/ISC_REFCOUNT_DECL This is now the default way to implement attaching to/detaching from a pointer. Also update cfg_keystore_fromconfig() to allow NULL value for the keystore pointer. In most cases we detach it immediately after the function call.	2024-01-25 15:37:40 +01:00
Matthijs Mekking	daaa70f48b	Refactor dns_keystore_directory() Add a default key-directory parameter to the function that can be returned if there is no keystore, or if the keystore directory is NULL (the latter is also true for the built-in keystore).	2024-01-25 15:37:40 +01:00
Matthijs Mekking	cb12b42839	Rename "uri" to "pkcs11-uri" The name "uri" was considered to be too generic and could potentially clash with a future URI configuration option. Renamed to "pkcs11-uri". Note that this option name was also preferred over "pkcs11uri", the dash is considered to be the more clearer form.	2024-01-25 15:37:40 +01:00
Matthijs Mekking	934d17255e	Better PKCS#11 label creation When using the same PKCS#11 URI for a zone that uses different DNSSEC policies, the PKCS#11 label could collide, i.e. the same label could be used for different keys. Add the policy name to the label to make it more unique. Also, the zone name could contain characters that are interpreted as special characters when parsing the PKCS#11 URI string. Mangle the zone name through 'dns_name_tofilenametext()' to make it PKCS#11 safe. Move the creation to a separate function for clarity. Furthermore, add a log message whenever a PKCS#11 object has been successfully created.	2024-01-25 15:37:40 +01:00
Matthijs Mekking	1ac02b0f1d	The use of isc_dir_t in keymgr is not needed The internal keymgr used 'isc_dir_open(&dir)' and 'isc_dir_close(&dir)', but was not using the variable 'dir`, other than checking if the directory can be opened. Errors like these will be be caught already in the dst_api function calls.	2024-01-25 15:37:40 +01:00
Matthijs Mekking	750536f74d	No longer need to get generated key from label The pkcs11-provider did not yet support getting X/Y coordinates on newly generated EC PKEY keys, thus we attempted to get the key from the label after it was generated in the keystore. This has been fixed in: https://github.com/latchset/pkcs11-provider/pull/293 Thus now we should be able to use the generated key structure immediately.	2024-01-25 15:37:40 +01:00
Matthijs Mekking	62e7cc66d0	Specify key usage to be digital signature If not set, the created keys allows signing plus decrypt which is bad practice. Setting the key usage explicitly will generate keys that allow only signing.	2024-01-25 14:48:07 +01:00
Matthijs Mekking	1e88bb0186	Create keys with PKCS#11 URI instead of object The pkcs11-provider has changed to take a PKCS#11 URI instead of an object identifier. Change the BIND 9 code accordingly to pass through the label instead of just the object identifier. See: https://github.com/latchset/pkcs11-provider/pull/284	2024-01-25 14:48:07 +01:00
Matthijs Mekking	3dff3eac0a	Fix tsan errors When working internally on the zone, we can access the zone's variables directly.	2024-01-25 14:48:07 +01:00
Matthijs Mekking	18b566ccea	Refactor findzonekeys Move dns_dnssec_findzonekeys from the dnssec.{c,h} source code to zone.{c,h} (the header file already commented that this should be done inside dns_zone_t). Alter the function in such a way, that keys are searched for in the key stores if a 'dnssec-policy' (kasp) is attached to the zone, otherwise keep using the zone's key-directory.	2024-01-25 14:48:07 +01:00
Matthijs Mekking	80387532cd	Use dst_key's directory when writing key files When writing key files to disk, use the internally stored directory. Add an access function 'dst_key_directory()'. Most calls to keymgr functions no longer need to provide the key-directory value. Only 'dns_keymgr_run' still needs access to the zone's key-directory in case the key-store is set to the built-in key-directory.	2024-01-25 14:47:43 +01:00
Matthijs Mekking	0701a140d3	Add directory to dst_key structure Store key directory when reading the key from file. This is the directory it was read from and can be used when saving the key back to disk.	2024-01-25 14:41:25 +01:00
Matthijs Mekking	9081426313	Refactor findmatchingkeys and keylistfromrdataset Refactor dns_dnssec_findmatchingkeys and dns_dnssec_keylistfromrdataset to take into account the key store directories in case the zone is using dnssec-policy (kasp). Add 'kasp' and 'keystores' parameters. This requires the keystorelist to be stored inside the zone structure. The calls to these functions in the DNSSEC tools can use NULL as the kasp value, as dnssec-signzone does not (yet) support dnssec-policy, and key collision is checked inside the directory where it is created.	2024-01-25 14:41:25 +01:00
Matthijs Mekking	f096472eb4	Create private keys with PKCS#11 object If there is a keystore configured with a PKCS#11 URI, zones that are using a dnssec-policy that uses such a keystore should create keys via the PKCS#11 interface. Those keys are generally stored inside an HSM. Some changes to the code are required, to store the engine reference into the keystore.	2024-01-25 14:41:25 +01:00
Matthijs Mekking	d795710541	Add object parameter to dst_key_generate() Add a parameter to store a possible PKCS#11 object that can later be used to identify a key with a PKCS#11 URI string (RFC 7512).	2024-01-25 14:41:25 +01:00
Matthijs Mekking	ffc41d1b14	Store key store reference instead of name When creating the kasp structure, instead of storing the name of the key store on keys, store a reference to the key store object instead. This requires to build the keystore list prior to creating the kasp structures, in the dnssec tools, the check code and the server code. We will create a builtin keystore called "key-directory" which means use the zone's key-directory as the key store. The check code changes, because now the keystore is looked up before creating the kasp structure (and if the keystore is not found, this is an error). Instead of looking up the keystore after all 'dnssec-policy' clauses have been read.	2024-01-25 14:41:25 +01:00
Matthijs Mekking	792670c991	Check if key-store directory is not reused Similar to key-directory, check for zones in different views and different key and signing policies. Zones must be using different key directories to store key files on disk. Now that a key directory can be linked with a dnssec-policy key, the 'keydirexist' checking needs to be reshuffled. Add tests for bad configuration examples, named-checkconf should catch those. Also add test cases for a mix of key-directory and key-store directory.	2024-01-25 14:41:24 +01:00
Matthijs Mekking	22d1fde1a5	Check if key-store directory exists Similar to key-directory, check if the key-store directory exists and if it is an actual directory. This commit fixes an accidental test bug in checkconf where if the "warn key-dir" test failed, the result was ignored.	2024-01-25 14:38:12 +01:00
Matthijs Mekking	594d4a81f1	Check if key-store exists Add checkconf check to ensure that the used key-store in the keys section exists. Error if that is not the case. We also don't allow the special keyword 'key-directory' as that is internally used to signal that the zone's key-directory should be used.	2024-01-25 14:38:12 +01:00
Matthijs Mekking	f837bb2af8	Parse key-store config Add the code that actually stores the key-store configuration into structures, also store the reference into the kasp key.	2024-01-25 14:38:11 +01:00
Matthijs Mekking	3a86c07422	Add code for creating keystore from config Add code for configuring keystore objects. Add this to the "kaspconf" code, as it is related to 'dnssec-policy' and it is too small to create a separate file for it.	2024-01-25 14:38:11 +01:00
Matthijs Mekking	0284482687	Add code to store key-stores New files to define a structure and functions for dealing with key-stores.	2024-01-25 14:38:11 +01:00
Matthijs Mekking	a035f3b10e	Add configuration for key-store Add new configuration for setting key stores. The new 'key-store' statement allows users to configure key store backends. These can be of type 'file' (that works the same as 'key-directory') or of type 'pkcs11'. In the latter case, keys should be stored in a HSM that is accessible through a PKCS#11 interface. Keys configured within 'dnssec-policy' can now also use the 'key-store' option to set a specific key store. Update the checkconf test to accomodate for the new configuration.	2024-01-25 14:38:11 +01:00
Mark Andrews	1b6f70076a	Extend dns_message_setopt to clear the opt record Use NULL to signal that the opt record, if any, set on the message be removed.	2024-01-23 10:47:31 +11:00
Mark Andrews	8f0f6d05e9	Add minimal EDNS UL option support This is defined in draft-ietf-dnssd-update-lease. This adds the ability to display the option and teaches dig about the name 'UL'.	2024-01-23 10:47:31 +11:00
Aydın Mercan	197de93bdc	Forward declare mallocx in isc/mem.h cmocka.h and jemalloc.h/malloc_np.h has conflicting macro definitions. While fixing them with push_macro for only malloc is done below, we only need the non-standard mallocx interface which is easy to just define by ourselves.	2024-01-18 09:34:36 +01:00
Ondřej Surý	41a0ee1071	Add workaround for jemalloc linking order Because we don't use jemalloc functions directly, but only via the libisc library, the dynamic linker might pull the jemalloc library too late when memory has been already allocated via standard libc allocator. Add a workaround round isc_mem_create() that makes the dynamic linker to pull jemalloc earlier than libc.	2024-01-18 09:34:36 +01:00
Artem Boldariev	20d5a805e2	TLS: improve framing by assembling DNS message in one buffer This commit improves TLS messages framing by avoiding an extra call to SSL_write_ex(). Before that we would use an extra SSL_write_ex() call to pass DNS message length to OpenSSL. That could create an extra TLS frame, increasing number of bytes sent due to frame header and padding. This commit fixes that by making the code pass both DNS message length and data at once, just like old TLS code did. It should improve compatibility with some buggy clients that expect both DNS message length and data to be in one TLS frame. Older TLS DNS code worked like this, too.	2024-01-17 17:09:41 +02:00
Aydın Mercan	2690dc48d3	Expose the TCP client count in statistics channel The statistics channel does not expose the current number of TCP clients connected, only the highwater. Therefore, users did not have an easy means to collect statistics about TCP clients served over time. This information could only be measured as a seperate mechanism via rndc by looking at the TCP quota filled. In order to expose the exact current count of connected TCP clients (tracked by the "tcp-clients" quota) as a statistics counter, an extra, dedicated Network Manager callback would need to be implemented for that purpose (a counterpart of ns__client_tcpconn() that would be run when a TCP connection is torn down), which is inefficient. Instead, track the number of currently-connected TCP clients separately for IPv4 and IPv6, as Network Manager statistics.	2024-01-17 11:11:12 +03:00
Artem Boldariev	dffb11f2c0	TCP: remove wrong INSIST(csock->recv_cb != NULL) This commit removes wrong INSIST() condition as the assumption that if 'csock->recv_cb != NULL' iff 'csock->statichandle != NULL' is wrong. There is no direct relation between 'csock->statichandle' and 'csock->recv_cb', as 'csock->statichandle' gets set when allocating a handle regardless of 'csock->recv_cb' not being NULL, as it is possible to attach to the handle without starting a read operation (at the very least, it is correct to start writing before reading). That condition made `cipher-suites` system test fail with crash on some platforms in FIPS mode (namely, Oracle Linux 9) despite not being related to FIPS at all.	2024-01-16 15:01:26 +02:00
Artem Boldariev	8ae661048d	Fix flawed logic when detecting same listener type The older version of the code was reporting that listeners are going to be of the same type after reconfiguration when switching from DoT to HTTPS listener, making BIND abort its executions. That was happening due to the flaw in logic due to which the code could consider a current listener and a configuration for the new one to be of the same type (DoT) even when the new listener entry is explicitly marked as HTTP. The checks for PROXY in between the configuration were masking that behaviour, but when porting it to 9.18 (when there is no PROXY support), the behaviour was exposed. Now the code mirrors the logic in 'interface_setup()' closely (as it was meant to).	2024-01-12 17:59:53 +02:00
Mark Andrews	2cf6cf967d	Report the type being filtered from an UPDATE When processing UPDATE request DNSKEY, CDNSKEY and CDS record that are managed by named are filtered out. The log message has been updated to report the actual type rather that just DNSKEY.	2024-01-12 14:06:58 +00:00
Artem Boldariev	d59cf5e0ce	Recreate listeners on DNS transport change This commit ensures that listeners are recreated on reconfiguration in the case when their type changes (or when PROXY protocol type changes, too). Previously, if a "listen-on" statement was modified to represent a different transport, BIND would not pick-up the change on reconfiguration if listener type changes (e.g. DoH -> DoT) for a given interface address and port combination. This commit fixes that by recreating the listener. Initially, that worked for most of the new transports as we would recreate listeners on each reconfiguration for DoH and DoT. But at some point we changed that in such a way that listeners were not recreated to avoid rebinding a port as on some platforms only root can do that for port numbers <1000, making some ports binding possible only on start-up. We chose to asynchronously update listener socket settings (like TLS contexts, HTTP settings) instead. Now, we both avoid recreating the sockets if unnecessary and recreate listeners when listener type changes.	2024-01-12 14:55:12 +02:00
Artem Boldariev	eb924e460b	Integrate TLS cipher suites support into BIND This commit makes BIND use the new 'cipher-suites' option from the 'tls' statement.	2024-01-12 13:27:59 +02:00
Artem Boldariev	3818c58bf6	Add TLS cipher suites configuration option to BIND This commit extends the 'tls' statement with 'cipher-suites' option.	2024-01-12 13:27:59 +02:00
Artem Boldariev	9d052522a0	Add TLS cipher-suites related low-level functionality This commits adds low-level wrappers on top of 'SSL_CTX_set_ciphersuites()'. These are going to be a foundation behind the 'cipher-suites' option of the 'tls' statement.	2024-01-12 13:27:59 +02:00
Mark Andrews	d5103b742b	Defer control channel message invalidation The conn_shutdown() function is called whenever a control channel connection is supposed to be closed, e.g. after a response to the client is sent or when named is being shut down. That function calls isccc_ccmsg_invalidate(), which resets the magic number in the structure holding the messages exchanged over a given control channel connection (isccc_ccmsg_t). The expectation here is that all operations related to the given control channel connection will have been completed by the time the connection needs to be shut down. However, if named shutdown is initiated while a control channel message is still in flight, some netmgr callbacks might still be pending when conn_shutdown() is called and isccc_ccmsg_t invalidated. This causes the REQUIRE assertion checking the magic number in ccmsg_senddone() to fail when the latter function is eventually called, resulting in a crash. Fix by splitting up isccc_ccmsg_invalidate() into two separate functions: - isccc_ccmsg_disconnect(), which initiates TCP connection shutdown, - isccc_ccmsg_invalidate(), which cleans up magic number and buffer, and then: - replacing all existing uses of isccc_ccmsg_invalidate() with calls to isccc_ccmsg_disconnect(), - only calling isccc_ccmsg_invalidate() when all netmgr callbacks are guaranteed to have been run. Adjust function comments accordingly.	2024-01-10 15:48:25 +01:00
Michał Kępień	24381cc36d	Limit isc_async_run() overhead for tree pruning Instead of issuing a separate isc_async_run() call for every RBTDB node that triggers tree pruning, maintain a list of nodes from which tree pruning can be started from and only issue an isc_async_run() call if pruning has not yet been triggered by another RBTDB node. In some older BIND 9 branches, the extra queuing overhead eliminated by this change could be remotely exploited to cause excessive memory use. Due to architectural shift, this branch is not vulnerable to that issue, but applying the fix to the latter is nevertheless deemed prudent for consistency and to make the code future-proof.	2024-01-05 12:33:14 +01:00
Mark Andrews	1fcc483df1	Restore dns64 state during serve-stale processing If we are in the process of looking for the A records as part of dns64 processing and the server-stale timeout triggers, redo the dns64 changes that had been made to the orignal qctx.	2024-01-05 12:17:00 +01:00
Mark Andrews	9d0fa07c5e	Save the correct result value to resume with nxdomain-redirect The wrong result value was being saved for resumption with nxdomain-redirect when performing the fetch. This lead to an assert when checking that RFC 1918 reverse queries where not leaking to the global internet.	2024-01-05 12:01:28 +01:00
Ondřej Surý	b8a9631754	Use hashmap when parsing a message When parsing messages use a hashmap instead of a linear search to reduce the amount of work done in findname when there's more than one name in the section. There are two hashmaps: 1) hashmap for owner names - that's constructed for each section when we hit the second name in the section and destroyed right after parsing that section; 2) per-name hashmap - for each name in the section, we construct a new hashmap for that name if there are more than one rdataset for that particular name.	2024-01-05 11:35:25 +01:00
Mark Andrews	d2ba96488e	Address races in dns_tsigkey_find() 1) Restart the process with a write lock if we discover an expired key while holding the read lock. 2) Move incrementing the key reference inside the lock block of code.	2024-01-05 11:16:12 +01:00
Aydın Mercan	ca9a05f9ce	Check for atomic operations consistency in checklibs.sh isc/atomic.h and its defined macros should be preferred over stdatomic.h and explicit atomic operations. Fix the redundant stdatomic.h header in histo.c found by the introduced check.	2024-01-03 17:04:31 +00:00
Aydın Mercan	294329da3a	Use <isc/atomic.h> instead of <stdatomic.h> directly in <isc/types.h>	2024-01-03 17:04:31 +00:00
Matthijs Mekking	b770740b44	Write new DNSKEY TTL to key file When the current DNSKEY TTL does not match the one from the policy, write the new TTL to disk.	2024-01-03 12:09:11 +11:00
Mark Andrews	27e74b2e4b	Only create private records for DNSKEYs that have changed We don't need to create private records for DNSKEY records that have only had their TTL's changed.	2024-01-03 12:09:11 +11:00
Mark Andrews	d601a90ea3	sync_secure_db failed to handle some TTL changes If the DNSKEY, CDNSKEY or CDS RRset had different TTLs then the filtering of these RRset resulted in dns_diff_apply failing with "not exact". Identify tuple pairs that are just TTL changes and allow them through the filter.	2024-01-03 12:09:11 +11:00
Mark Andrews	21be35c54e	Use the current CDS and CDNSKEY TTLs When adding new CDS and CDNSKEY records use the existing RRset TTL if they already exist.	2024-01-03 12:09:11 +11:00
Mark Andrews	dcb7799061	Update the DNSKEY, CDNSKEY and CDS TTLs to match dnskey-ttl If the TTLs of the DNSKEY, CDNSKEY and CDS do not match the dnskey-ttl update them by removing all records and re-adding them with the correct TTL.	2024-01-03 12:09:11 +11:00
Michał Kępień	9cf1f39b54	Silence a scan-build warning in dns_rbt_addname() Clang Static Analyzer is unable to grasp that when dns_rbt_addnode() returns ISC_R_EXISTS, it always sets the pointer passed to it via its 'nodep' parameter to a non-NULL value. Add an extra safety check in the conditional expression used in dns_rbt_addname() to silence that warning.	2023-12-22 19:27:37 +01:00
Evan Hunt	ea9a8cb392	prevent an infinite loop in fix_iterator() it was possible for fix_iterator() to get stuck in a loop while trying to find the predecessor of a missing node. this has been fixed and a regression test has been added.	2023-12-21 09:18:30 -08:00
Evan Hunt	84f79cd164	fix_iterator() could produce incoherent iterator stacks the fix_iterator() function moves an iterator so that it points to the predecessor of the searched-for name when that name doesn't exist in the database. the tests only checked the correctness of the top of the stack, however, and missed some cases where interior branches in the stack could be missing or duplicated. in these cases, the iterator would produce inconsistent results when walked. the predecessors test case in qp_test has been updated to walk each iterator to the end and ensure that the expected number of nodes are found.	2023-12-21 09:18:30 -08:00
Mark Andrews	0509351e92	Update the NSEC3PARAM TTL to match the SOA minimum When building NSEC3 chains update the NSEC3PARAM TTL to match the SOA minimum. Delete all records using the old TTL then re-add them using the new TTL.	2023-12-21 20:12:09 +11:00
Mark Andrews	f3ae88d84e	Don't delete the NSEC3PARAM immediately Wait until the new NSEC or NSEC3 chain is generated then it should be deleted.	2023-12-21 20:12:09 +11:00
Mark Andrews	a3d0476d17	Don't look for KSK status here and squash memory leak Just remove the key from consideration as it is being removed. The old code could leak a key reference as dst_free_key was not called every time we continued. This simplification will address this as well.	2023-12-21 09:18:45 +11:00
Mark Andrews	6ccb93884d	dns_request_cancel needs to be callable from any thread Check the tid and cancel the request immediately or pass it to the appropriate loop for processing. Call request->cb directly from req_sendevent as it is now always called with the correct tid.	2023-12-21 08:11:59 +11:00
Michał Kępień	efcba4dd23	Do not destroy IXFR journal in xfrin_end() The xfrin_end() function is run when a zone transfer is finished or canceled. One of the actions it takes for incremental transfers (IXFR) is calling dns_journal_destroy() on the zone journal structure that is stored in the relevant zone transfer context (xfr->ixfr.journal). That immediately invalidates that structure as it is not reference-counted. However, since the changes present in the IXFR stream are applied to the journal asynchronously (via isc_work_enqueue()), it is possible that some zone changes may still be in the process of being written to the journal by the time xfrin_end() destroys the relevant structure. Such a scenario leads to crashes. Fix by not destroying the zone journal structure until the entire zone transfer context is destroyed. xfrin_destroy() already conditionally calls dns_journal_destroy() and when the former is called, all asynchronous work for a given zone transfer process is guaranteed to be complete.	2023-12-20 17:21:14 +01:00
Matthijs Mekking	16f2c811e3	Revert "Remove kasp mutex lock" This reverts commit `634c80ea12`.	2023-12-20 08:30:44 +00:00
Mark Andrews	7ab4e1537a	Obtain a client->handle reference when calling async_restart otherwise client may be freed before async_restart is called.	2023-12-20 02:50:48 +11:00
Mark Andrews	c896e07277	Log what change generated a 'not exact' error	2023-12-20 01:56:38 +11:00
Matthijs Mekking	634c80ea12	Remove kasp mutex lock Multiple zones should be able to read the same key and signing policy at the same time. Since writing the kasp lock only happens during reconfiguration, and the complete kasp list is being replaced, there is actually no need for a lock. Reference counting ensures that a kasp structure is not destroyed when still being attached to one or more zones. This significantly improves the load configuration time.	2023-12-19 14:53:51 +01:00
Mark Andrews	6066e41948	Use 'now' rather than 'inception' in 'add_sigs' When kasp support was added 'inception' was used as a proxy for 'now' and resulted in signatures not being generated or the wrong signatures being generated. 'inception' is the time to be set in the signatures being generated and is usually in the past to allow for clock skew. 'now' determines what keys are to be used for signing.	2023-12-19 11:21:46 +11:00
Michał Kępień	b1baf7af3a	"trust-anchor-telemetry" is no longer experimental Remove the CFG_CLAUSEFLAG_EXPERIMENTAL flag from the "trust-anchor-telemetry" statement as the behavior of the latter has not been changed since its initial implementation and there are currently no plans to do so. This silences a relevant log message that was emitted even when the feature was explicitly disabled.	2023-12-18 15:11:39 +01:00
Michał Kępień	2a3b6d1406	Fix reference counting in do_nsfetch() Each function queuing a do_nsfetch() call using isc_async_run() is expected to increase the given zone's internal reference count (zone->irefs), which is then correspondingly decreased in either do_nsfetch() itself (when the dns_resolver_createfetch() fails) or in nsfetch_done() (when recursion is finished). However, do_nsfetch() can also return early if either the zone itself or the relevant view's resolver object is being shut down. In that case, do_nsfetch() simply returns without decreasing the internal reference count for the zone. This leaves a dangling zone reference around, which leads to hangs during named shutdown. Fix by executing the same cleanup code for early returns from do_nsfetch() as for a failed dns_resolver_createfetch() call in that function as the reference count will not be decreased in nsfetch_done() in any of these cases.	2023-12-18 11:33:43 +01:00
Aram Sargsyan	791a046cc7	Use atomic store operations instead of atomic initialize The atomic_init() function makes sense to use with structure's members when creating a new instance of a strucutre. In other places, use atomic store operations instead, in order to avoid data races.	2023-12-15 09:56:44 +00:00
Aydın Mercan	9c4dd863a6	Move atomic statscounter next to the non-atomic definition	2023-12-14 09:11:48 +01:00
Aydın Mercan	bb96142a17	Use a non-atomic counter when passing to stats dumper	2023-12-14 09:11:48 +01:00
Petr Špaček	7b0115e331	Avoid overflow during statistics dump Related: !1493 Fixes: #4467	2023-12-14 09:11:02 +01:00
Mark Andrews	fd077c2661	NetBSD has added 'hmac' to libc so rename out uses of hmac	2023-12-13 22:27:38 +00:00
Matthijs Mekking	21867f200a	Refactor getpred code Move the code to find the predecessor into one function, as it is shares quite some similarities: In both cases we first need to find the immediate predecessor/successor, then we need to find the immediate predecessor if the iterator is not already pointing at it.	2023-12-11 21:01:29 +00:00
Matthijs Mekking	ab8a0c4b5a	and fix yet another dns_qp_lookup() iterator bug This one is similar to the bug when searching for a key, reaching a dead-end branch that doesn't match, because the branch offset point is after the point where the search key differs. This fixes the case where we are multiple levels deep. In other words, we had a more-than-one matches after the point where the search key differs. For example, consider the following qp-trie: branch: "[e]", "[m]": - leaf: "a.b.c.d.e" - branch: "moo[g]", "moo[k]", "moo[n]": - leaf: "moog" - branch: "mook[e]", "mook[o]" - leaf: "mooker" - leaf: "mooko" - leaf: "moon" If searching for a key "monky", we would reach the branch with twigs "moo[k]" and "moo[n]". The key matches on the 'k' on offset=4, and reaches the branch with twigs "mook[e]" and "mook[o]". This time we cannot find a twig that matches our key at offset=5, there is no twig for 'y'. The closest name we found was "mooker". Note that on a branch it can't detect it is on a dead branch because the key is not encapsulated in a branch node. In the previous code we considered "mooker" to be the successor of "monky" and so we needed to the predecessor of "mooker" to find the predecessor for "monky". However, since the search key alread differed before entering this branch, this is not enough. We would be left with "moog" as the predecessor of "monky", while in this example "a.b.c.d.e" is the actual predecessor. Instead, we need to go up a level, find the predecessor and check again if we are on the right branch, and repeat the process until we are. Unit tests to cover the scenario are now added.	2023-12-11 21:01:29 +00:00
Matthijs Mekking	276bdcf5cf	and fix another dns_qp_lookup() iterator bug There was yet another edge case in which an iterator could be positioned at the wrong node after dns_qp_lookup(). When searching for a key, it's possible to reach a leaf that matches at the given offset, but because the offset point is after the point where the search key differs from the leaf's contents, we are now at the wrong leaf. In other words, the bug fixed the previous commit for dead-end branches must also be applied on matched leaves. For example, if searching for the key "monpop", we could reach a branch containing "moop" and "moor". the branch offset point - i.e., the point after which the branch's leaves differ from each other - is the fourth character ("p" or "r"). The search key matches the fourth character "p", and takes that twig to the next node (which can be a branch for names starting with "moop", or could be a leaf node for "moop"). The old code failed to detect this condition, and would have incorrectly left the iterator pointing at some successor, and not at the predecessor of the "moop". To find the right predecessor in this case, we need to get to the previous branch and get the previous from there. This has been fixed and the unit test now includes several new scenarios for testing search names that match and unmatch on the offset but have a different character before the offset.	2023-12-11 21:01:29 +00:00
Tom Krizek	059a63793a	Remove obsolete check for resolver-nonbackoff-tries With the resolver-nonbackoff-tries statement being removed in #4405, this check can no longer be reached and can be safely removed.	2023-12-07 13:10:58 +01:00
Mark Andrews	7e462c2b26	Also cleanup the space for the rbt nodes As we are in overmem state we want to free more memory than we are adding so we need to add in an allowance for the rbtnodes that may have been added and the names stored with them. There is the node for the owner name and a possible ENT node if there was a node split.	2023-12-07 02:59:04 +00:00
Mark Andrews	5e8f0e9ceb	Process the combined LRU lists in LRU order Only cleanup headers that are less than equal to the rbt's last_used time. Adjust the rbt's last_used time when the target cleaning was not achieved to the oldest value of the remaining set of headers. When updating delegating NS and glue records last_used was not being updated when it should have been. When adding zero TTL records to the tail of the LRU lists set last_used to rbtdb->last_used + 1 rather than now. This appoximately preserves the lists LRU order.	2023-12-07 02:59:04 +00:00
Evan Hunt	7d05590a6f	clean up client.c - make dns_client_startresolve() static since it's only used internally - remove outdated comments	2023-12-06 17:31:38 -08:00
Evan Hunt	50dd6aad34	remove unused functions in dns_master dns_master_dumpnode() and dns_master_dumpnodetostream() were never used and can be removed.	2023-12-06 17:31:38 -08:00
Evan Hunt	66496d550b	remove resolver-retry-interval and resolver-nonbackoff-tries fully remove these options and mark them as ancient.	2023-12-06 11:54:59 -08:00
Evan Hunt	4aaa4f7dca	deprecate resolver-retry-interval and resolver-nonbackoff-tries these options control default timing of retries in the resolver for experimental purposes; they are not known to useful in production environments. they will be removed in the future; for now, we only log a warning if they are used.	2023-12-06 11:51:22 -08:00
Evan Hunt	60a33ae6bb	fix another dns_qp_lookup() iterator bug there was another edge case in which an iterator could be positioned at the wrong node after dns_qp_lookup(). when searching for a key, it's possible to reach a dead-end branch that doesn't match, because the branch offset point is after the point where the search key differs from the branch's contents. for example, if searching for the key "mop", we could reach a branch containing "moon" and "moor". the branch offset point - i.e., the point after which the branch's leaves differ from each other - is the fourth character ("n" or "r"). however, both leaves differ from the search key at position three ("o" or "p"). the old code failed to detect this condition, and would have incorrectly left the iterator pointing at some lower value and not at "moor". this has been fixed and the unit test now includes this scenario.	2023-12-06 11:03:30 -08:00
Evan Hunt	8612902476	fix dns_qp_lookup() iterator bug in some cases it was possible for the iterator to be positioned in the wrong place by dns_qp_lookup(). previously, when a leaf node was found which matched the search key at its parent branch's offset point, but did not match after that point, the code incorrectly assumed the leaf it had found was a successor to the searched-for name, and stepped the iterator back to find a predecessor. however, it was possible for the non-matching leaf to be the predecessor, in which case stepping the iterator back was wrong. (for example: a branch contains "aba" and "abcd", and we are searching for "abcde". we step down to the twig matching the letter "c" in position 3. "abcd" is the predecessor of "abcde", so the iterator is already correctly positioned, but because the twig was an exact match, we would have moved it back one step to "aba".) this previously went unnoticed due to a mistake in the qp_test unit test, which had the wrong expected result for the test case that should have detected the error. both the code and the test have been fixed.	2023-12-06 11:03:30 -08:00
Evan Hunt	947bc0a432	add an iterator argument to dns_qp_lookup() the 'predecessor' argument to dns_qp_lookup() turns out not to be sufficient for our needs: the predecessor node in a QP database could have become "empty" (for the current version) because of an update or because cache data expired, and in that case the caller would have to iterate more than one step back to find the predecessor node that it needs. it may also be necessary for a caller to iterate forward, in order to determine whether a node has any children. for both of these reasons, we now replace the 'predecessor' argument with an 'iter' argument. if set, this points to memory with enough space for a dns_qpiter object. when an exact match is found by the lookup, the iterator will be pointing to the matching node. if not, it will be pointing to the lexical predecessor of the nae that was searched for. a dns_qpiter_current() method has been added for examining the current value of the iterator without moving it in either direction.	2023-12-06 11:03:30 -08:00
Artem Boldariev	b109fa9192	Fix TLS certs store deletion on concurrent access During initialisation or reconfiguration, it is possible that multiple threads are trying to create a TLS context and associated data (like TLS certs store) concurrently. In some cases, a thread might be too late to add newly created data to the TLS contexts cache, in which case it needs to be discarded. In the code that handles that case, it was not taken into account that, in some cases, the TLS certs store could not have been created or should not be deleted, as it is being managed by the TLS contexts cache already. Deleting the store in such cases might lead to crashes. This commit fixes the issue.	2023-12-06 16:01:20 +02:00
Artem Boldariev	5ed3a76f9d	BIND: Add 'allow-proxy' and 'allow-proxy-on' options The main intention of PROXY protocol is to pass endpoints information to a back-end server (in our case - BIND). That means that it is a valid way to spoof endpoints information, as the addresses and ports extracted from PROXYv2 headers, from the point of view of BIND, are used instead of the real connection addresses. Of course, an ability to easily spoof endpoints information can be considered a security issue when used uncontrollably. To resolve that, we introduce 'allow-proxy' and 'allow-proxy-on' ACL options. These are the only ACL options in BIND that work with real PROXY connections addresses, allowing a DNS server operator to specify from what clients and on which interfaces he or she is willing to accept PROXY headers. By default, for security reasons we do not allow to accept them.	2023-12-06 15:15:25 +02:00
Artem Boldariev	6725d36cfd	Avoid using sock->iface and sock->peer from the lower transport This commit modifies TLS Stream and DNS-over-HTTPS transports so that they do not use the "sock->iface" and "sock->peer" of the lower level transport directly. That did not cause any problems before, as things worked as expected, but with the introduction of PROXYv2 support we use handles to store the information in both PROXY Stream and UDP Proxy transports. Therefore, in order to propagate the information (like addresses), extracted from PROXYv2 headers, from the lower level transports to the higher-level ones, we need to get that information from the lower-level handles rather than sockets. That means that we should get the peer and interface addresses using the intended APIs ("isc_nmhandle_peeraddr()" and "isc_nmhandle_localaddr()").	2023-12-06 15:15:25 +02:00
Artem Boldariev	f650d3eb63	Add 'proxy' option to 'listen-on' statement This commit extends "listen-on" statement with "proxy" options that allows one to enable PROXYv2 support on a dedicated listener. It can have the following values: - "plain" to send PROXYv2 headers without encryption, even in the case of encrypted transports. - "encrypted" to send PROXYv2 headers encrypted right after the TLS handshake.	2023-12-06 15:15:25 +02:00
Artem Boldariev	3c45dd59cb	Add a utility function to dump all active sockets on a NM instance Add the new isc__nm_dump_active_manager() function that can be used for debugging purposes: it dumps all active sockets withing the network manager instance.	2023-12-06 15:15:25 +02:00
Artem Boldariev	4a88fc9d5b	PROXYv2 over UDP transport This commit adds a new transport that supports PROXYv2 over UDP. It is built on top of PROXYv2 handling code (just like PROXY Stream). It works by processing and stripping the PROXYv2 headers at the beginning of a datagram (when accepting a datagram) or by placing a PROXYv2 header to the beginning of an outgoing datagram. The transport is built in such a way that incoming datagrams are being handled with minimal memory allocations and copying.	2023-12-06 15:15:25 +02:00
Artem Boldariev	07531d102c	TLS: detect ISC_R_SHUTTINGDOWN and ISC_R_CANCELED cases properly In the previous versions of the NM, detecting the case when worker is shutting down was not that important and actual status code did not matter much. However, that might be not the case all the time. This commit makes necessary modifications to the code.	2023-12-06 15:15:25 +02:00
Artem Boldariev	9d7343cd7d	DoH: add PROXY over TLS support This commit extends DNS over HTTP(S) transport with PROXY over TLS support.	2023-12-06 15:15:25 +02:00
Artem Boldariev	eb52015db1	Stream DNS: add PROXY over TLS support This commit extends Stream DNS with PROXY over TLS support.	2023-12-06 15:15:25 +02:00
Artem Boldariev	999923c423	Fix TLS Stream in accordance with PROXY Stream over TLS support This commit makes TLS Stream code to take PROXY Stream over TLS support into account.	2023-12-06 15:15:24 +02:00
Artem Boldariev	3d1b6c48ab	Add PROXY over TLS support to PROXY Stream This commit makes it possible to use PROXY Stream not only over TCP, but also over TLS. That is, now PROXY Stream can work in two modes as far as TLS is involved: 1. PROXY over (plain) TCP - PROXYv2 headers are sent unencrypted before TLS handshake messages. That is the main mode as described in the PROXY protocol specification (as it is clearly stated there), and most of the software expects PROXYv2 support to be implemented that way (e.g. HAProxy); 2. PROXY over (encrypted) TLS - PROXYv2 headers are sent after the TLS handshake has happened. For example, this mode is being used (only ?) by "dnsdist". As far as I can see, that is, in fact, a deviation from the spec, but I can certainly see how PROXYv2 could end up being implemented this way elsewhere.	2023-12-06 15:15:24 +02:00
Artem Boldariev	eccc3fe0a0	Add PROXYv2 support to DNS over HTTP(S) transport This commit extends DNS over HTTP(S) transport with PROXYv2 support.	2023-12-06 15:15:24 +02:00
Artem Boldariev	e97903ca14	Add PROXY support to Stream DNS This commit makes it possible to use Stream DNS on top of PROXY Stream either directly or indirectly (in the case when TLS is involved).	2023-12-06 15:15:24 +02:00
Artem Boldariev	4437096ba0	Make it possible to use TLS Stream on top of PROXY Stream This commit modifies TLS Stream to make it possible to use over PROXY Stream. That is required to add PROVYv2 support into TLS-based transports (DNS over HTTP, DNS over TLS).	2023-12-06 15:15:24 +02:00
Artem Boldariev	d119d666b3	PROXY Stream transport This commit adds a new stream-based transport with an interface compatible with TCP. The transport is built on top of TCP transport and the new PROXYv2 handling code. Despite being built on top of TCP, it can be easily extended to work on top of any TCP-like stream-based transport. The intention of having this transport is to add PROXYv2 support into all existing stream-based DNS transport (DNS over TCP, DNS over TLS, DNS over HTTP) by making the work on top of this new transport. The idea behind the transport is simple after accepting the connection or connecting to a remote server it enters PROXYv2 handling mode: that is, it either attempts to read (when accepting the connection) or send (when establishing a connection) a PROXYv2 header. After that it works like a mere wrapper on top of the underlying stream-based transport (TCP).	2023-12-06 15:15:24 +02:00
Artem Boldariev	2c76717881	Add PROXYv2 header utilities This commit adds a set of utilities for dealing with PROXYv2 headers, both parsing and generating them. The code has no dependencies from the networking code and is (for the most part) a "separate library". The part responsible for handling incoming PROXYv2 headers is structured as a state machine which accepts data as input and calls a callback to notify the upper-level code about the data processing status. Such a design, among other things, makes it easy to write a thorough unit test suite for that, as there are fewer dependencies as well as will not stand in the way of any changes in the networking code.	2023-12-06 15:15:24 +02:00
Matthijs Mekking	d08f293f11	CID 469729: Remove leftover return call This 'return (ret);' call can never be reached and should have been removed as part of commit `75e0d394dd`.	2023-12-06 10:51:15 +01:00
Matthijs Mekking	ff4201e388	Lower the maximum allowed NSEC3 iterations to 50 BIND 9 will now treat the response as insecure when processing NSEC3 records with iterations larger than 50. Earlier, we limited the number of iterations to 150 (in #2445). RFC 9276 says: Because there has been a large growth of open (public) DNSSEC validating resolvers that are subject to compute resource constraints when handling requests from anonymous clients, this document recommends that validating resolvers reduce their iteration count limits over time. Specifically, validating resolver operators and validating resolver software implementers are encouraged to continue evaluating NSEC3 iteration count deployment trends and lower their acceptable iteration limits over time. After evaluation, we decided that the next major BIND release should lower the maximum allowed NSEC3 iterations to 50, which should be fine for 99,87% of the domain names.	2023-12-05 14:58:58 +00:00
Matthijs Mekking	75e0d394dd	dnssec-policy: refuse to load non-zero iterations According to RFC 9276, if NSEC3 must be used, then an iterations count of 0 MUST be used to alleviate computational burdens.	2023-12-05 14:58:58 +00:00
Mark Andrews	7ee20d7d10	Destroy the message before detaching the view With shared name memory pools (`f5af981831`) the message needs to be destroyed before the view is detached which in turn detaches the resolver which checks that all resources have been returned.	2023-12-04 22:00:25 +00:00
Ondřej Surý	3383331d06	Cleanup unused stats_bucket() macro	2023-11-29 14:16:20 +01:00
Ondřej Surý	14bdd21e0a	Refactor the handling of isc_mem overmem condition Previously, there were two methods of working with the overmem condition: 1. hi/lo water callback - when the overmem condition was reached for the first time, the water callback was called with HIWATER mark and .is_overmem boolean was set internally. Similarly, when the used memory went below the lo water mark, the water callback would be called with LOWATER mark and .is_overmem was reset. This check would be called every time memory was allocated or freed. 2. isc_mem_isovermem() - a simple getter for the internal .is_overmem flag This commit refactors removes the first method and move the hi/lo water checks to the isc_mem_isovermem() function, thus we now have only a single method of checking overmem condition and the check for hi/lo water is removed from the hot path for memory contexts that doesn't use overmem checks.	2023-11-29 14:16:20 +01:00
Mark Andrews	decc17d3b0	Ineffective DbC protections Dereference before NULL checks. Thanks to Eric Sesterhenn from X41 D-Sec GmbH for reporting this.	2023-11-21 14:48:43 +11:00
Matthijs Mekking	71f023a1c3	Recognize escapes when reading the public key Escapes are valid in DNS names, and should be recognized when reading the public key from disk.	2023-11-20 08:31:39 +01:00
Evan Hunt	9643281453	set loadtime during initial transfer of a secondary zone when transferring in a non-inline-signing secondary for the first time, we previously never set the value of zone->loadtime, so it remained zero. this caused a test failure in the statschannel system test, and that test case was temporarily disabled. the value is now set correctly and the test case has been reinstated.	2023-11-15 17:23:25 -08:00
Mark Andrews	560c245971	Adjust comment to have correct message limit value	2023-11-16 11:22:47 +11:00
Mark Andrews	a069513234	Check that buffer length in dns_message_renderbegin The maximum DNS message size is 65535 octets. Check that the buffer being passed to dns_message_renderbegin does not exceed this as the compression code assumes that all offsets are no bigger than this.	2023-11-16 11:15:49 +11:00
Ondřej Surý	17da9fed58	Remove AES algorithm for DNS cookies The AES algorithm for DNS cookies was being kept for legacy reasons, and it can be safely removed in the next major release. Remove both the AES usage for DNS cookies and the AES implementation itself.	2023-11-15 10:31:16 +01:00
Aram Sargsyan	c584899b1a	Fix catz db update callback registration logic error (take two) Please see the `998765fea5` commit for the description of the original issue. The commit had fixed the logic error, but it was reintroduced again later with the `a1afa31a5a` commit, where the check of the 'db_registered' flag was removed in dns__catz_update_cb(). The check was removed, because the registration function was made idempotent, so double registration is not an issue, but the check also prevented from unneeded registration, on which the original fix relied. This commit just removes the update callback registration code from the dns__catz_update_cb() function instead of bringing back the check, because after code flow analysis, it is now clear that it's not required at all. The "call onupdate() artificially" comment (which was mentioned by the removed code) is speaking about the dns_catz_dbupdate_callback() function, which is called by server.c on (re)configuration, and that function already takes care of update callback's registration since the `998765fea5` commit was applied, so there is no need to do that here again.	2023-11-14 08:59:48 +00:00
Aram Sargsyan	2826f885d5	Use atomics for the iterators number in isc_hashmap_t Concurrent threads can access a hashmap for reading by creating and then destroying an iterator, in which case the integer number of the active iterators is increased or decreased from different threads, introducing a data race. Use atomic operations to protect the variable.	2023-11-14 08:56:41 +00:00
Ondřej Surý	79d9360011	Reformat sources with up-to-date clang-format-17	2023-11-13 16:52:35 +01:00
Ondřej Surý	67d14b0ee5	Deprecate AES algorithm for DNS cookies The AES algorithm for DNS cookies was being kept for legacy reasons, and it can be safely removed in the next major release. Mark is as deprecated, so the `named-checkconf` prints a warning when in use.	2023-11-13 14:59:43 +01:00
Aram Sargsyan	6687de854f	Use a read lock when iterating over a hashmap The 'dns_tsigkeyring_t' structure has a read/write lock to protect its 'keys' member, which is a 'isc_hashmap_t' pointer and needs to be protected. The dns_tsigkeyring_dump() function, however, doesn't use the lock, which can introduce a race with another thread, if the other thread tries to modify the hashmap. Add a read lock around the code, which iterates over the hashmap.	2023-11-13 12:06:26 +00:00
Evan Hunt	461b9a0442	if GLUEOK is set, and glue is found in a zone DB, don't check the cache EXPERIMENT: when DNS_DB_GLUEOK is set, dns_view_find() will now return glue if it is found it a local zone database, without checking to see if a better answer has been cached previously.	2023-11-01 16:49:08 +01:00
Mark Andrews	9227b82e71	Also look for additional records in dns_adb_find If a child zone is served by the same servers as a parent zone and a NS query is made for the zone name then the addresses of the nameservers are returned in the additional section are tagged as trust additional.	2023-11-01 16:49:08 +01:00
Mark Andrews	578da93581	Turn on QNAME minimisation when fetching nameserver addresses	2023-11-01 16:49:08 +01:00
Evan Hunt	b12f709f05	restore isc_mem_setwater() call in the cache Commit `4db150437e` incorrectly removed the call to isc_mem_setwater() from dns_cache_setcachesize(). The water() function is a no-op, but we still need to set high- and low-water marks in the memory context, otherwise overmem conditions will not be detected.	2023-11-01 15:18:02 +00:00
Matthijs Mekking	2322425016	Don't ignore auth zones when in serve-stale mode When serve-stale is enabled and recursive resolution fails, the fallback to lookup stale data always happens in the cache database. Any authoritative data is ignored, and only information learned through recursive resolution is examined. If there is data in the cache that could lead to an answer, and this can be just the root delegation, the resolver will iterate further, getting closer to the answer that can be found by recursing down the root, and eventually puts the final response in the cache. Change the fallback to serve-stale to use 'query_getdb()', that finds out the best matching database for the given query.	2023-10-30 20:07:01 +01:00
Ondřej Surý	c855ed6a0b	Bump the mempool sizes in dns_message Increasing the initial and freemax sizes for dns_message memory pools restores the root zone performance. The former sizes were suited for per-dns_message memory pools and we need to bump the sizes up for per-thread memory pools.	2023-10-27 15:28:27 +02:00
Ondřej Surý	f8e264ba6d	Remove the lock-file configuration and -X argument to named The lock-file configuration (both from configuration file and -X argument to named) has better alternatives nowadays. Modern process supervisor should be used to ensure that a single named process is running on a given configuration. Alternatively, it's possible to wrap the named with flock(1).	2023-10-26 22:42:37 +02:00
Ondřej Surý	d3f2766a79	Mark the lock-file configuration option as deprecated This is first step in removing the lock-file configuration option, it marks both the `lock-file` configuration directive and -X option to named as deprecated.	2023-10-26 22:41:45 +02:00
Evan Hunt	03183baa6d	Prevent a possible race in dns_qpmulti_query() and _snapshot() The `.reader` member of dns_qpmulti_t was accessed without RCU protection; reader_open() calls rcu_dereference() on it, and this call needs to be inside an RCU critical section. A similar problem was identified in the dns_qpmulti_snapshot() - the RCU critical section was completely missing. These are relicts of the isc_qsbr - in the QSBR mode the rcu_read_lock() and rcu_read_unlock() are no-ops and whole event loop is a critical section.	2023-10-26 00:32:22 -07:00
Ondřej Surý	6bb42939cf	Refactor dns_message using ISC_LIST_FOREACH macros Do a light refactoring and cleanups that replaces common list walking patterns with ISC_LIST_FOREACH macros and split some nested loops into separate static functions to reduce the nesting depth.	2023-10-25 12:36:37 +02:00
Ondřej Surý	d2e84a4b97	Add ISC_LIST_FOREACH_REV(_SAFE) macros Add complementary macros to ISC_LIST_FOREACH(_SAFE) that walk the lists in reverse. * ISC_LIST_FOREACH_REV(list, elt, link) - walk the static list from tail to head * ISC_LIST_FOREACH_REV_SAFE(list, elt, link, next) - walk the list from tail to head in a manner that's safe against list member deletions	2023-10-25 12:36:13 +02:00
Ondřej Surý	fd732a7fb5	Add dns__message_putassociatedrdataset() to deduplicate code There was a lot of internal code looking like this: INSIST(dns_rdataset_isassociated(rdataset)); dns_rdataset_disassociated(rdataset) isc_mempool_put(msg->rdspool, rdataset); Deduplicate the code into local dns__message_puttemprdataset() routine, and drop the INSIST() which is checked in dns_rdataset_disassociate().	2023-10-25 12:36:08 +02:00
Ondřej Surý	5fca0fb519	Remove unused dns_message_movename() method Since dns_message_movename() was unused, it could be removed from the code based to declutter the API.	2023-10-25 11:43:10 +02:00
Ondřej Surý	f213f644ed	Add option to mark TCP dispatch as unshared The current dispatch code could reuse the TCP connection when dns_dispatch_gettcp() would be used first. This is problematic as the dns_resolver doesn't use TCP connection sharing, but dns_request could get the TCP stream that was created outside of the dns_request. Add new DNS_DISPATCHOPT_UNSHARED option to dns_dispatch_createtcp() that would prevent the TCP stream to be reused. Use that option in the dns_resolver call to dns_dispatch_createtcp() to prevent dns_request from reusing the TCP connections created by dns_resolver. Additionally, the dns_xfrin unit added TCP connection sharing for incoming transfers. While interleaving *xfr streams on a TCP connection should work this should be a deliberate change and be property of the server that can be controlled. Additionally some level of parallel TCP streams is desirable. Revert to the old behaviour by removing the dns_dispatch_gettcp() calls from dns_xfrin and use the new option to prevent from sharing the transfer streams with dns_request.	2023-10-24 13:07:03 +02:00
Ondřej Surý	e0e089f106	Don't set the offloaded work result from main thread The xfrin_recv_done() was accessing xfr->result where we stored the result of the offloaded work from a thread that could receive data while processing the transfer on the offloaded thread. Completely remove the offloaded result from the dns_xfrin_t structure and keep it local for xfr_apply() and xfr_apply_done() as the failure is already recorded in .shutdown_result and we now that the processing has failed because .shuttingdown has been already set.	2023-10-24 11:14:54 +02:00
Aram Sargsyan	4eb4fa288c	Fix shutdown races in catzs The dns__catz_update_cb() does not expect that 'catzs->zones' can become NULL during shutdown. Add similar checks in the dns__catz_update_cb() and dns_catz_zone_get() functions to protect from such a case. Also add an INSIST in the dns_catz_zone_add() function to explicitly state that such a case is not expected there, because that function is called only during a reconfiguration.	2023-10-23 08:21:39 +00:00
Evan Hunt	aacea440c3	handle pre-existing disp/dispentry when retrying when xfrin_start() is called to retry a transfer, close the existing dispatch entry and reuse the existing dispatch.	2023-10-20 18:16:25 +11:00
Mark Andrews	b69100b747	Suppress reporting upcoming changes in root hints To reduce the amount of log spam when root servers change their addresses keep a table of upcoming changes by expected date and time and suppress reporting differences for them until then. Add initial entry for B.ROOT-SERVERS.NET, Nov 27, 2023.	2023-10-20 14:05:56 +11:00
Mark Andrews	2ca2f7e985	Update b.root-servers.net IP addresses This covers both root hints and the default primaries for the root zone mirror. The official change date is Nov 27, 2023.	2023-10-20 14:05:56 +11:00
Ondřej Surý	3737ea592b	Offload AXFR and IXFR processing Instead of processing received data synchronously, store the incoming differences in the list and process them asynchronously when we need to commit the data into the database and/or journal.	2023-10-19 14:57:25 +02:00
Ondřej Surý	e5c79261c0	Remove all locking from XFR Instead of locking the struct dns_xfrin members that get accessed from the statistics, convert those into atomic types and use atomic accesses to prevent ThreadSanitizer from blowing up. In fact, even the atomic operations are not really needed here, because all writes are done from a single thread and we don't really require consistency from the statistics. It's easier to use atomics here, but it is slightly confusing as it suggests there might be multithreaded accesses to those variables while in fact, the only off-thread access happens when collecting the statistics.	2023-10-19 14:57:25 +02:00
Ondřej Surý	109dc883e7	Cleanup wrong whitespace in dns/diff.h	2023-10-19 14:57:25 +02:00
Ondřej Surý	e3892805d6	Remove the logic that applies differences when over limit The ixfr_putdata() and axfr_putdata() had a logic to apply dns_diff when the number of pending tuples went over 100. Since we are going to offload the XFR data processing, we don't need to do that anymore.	2023-10-19 14:57:25 +02:00
Ondřej Surý	8a590d1605	Cleanup the FAIL() macro in the dns_xfrin The FAIL() macro was just setting the result and jumping to failure, unobfuscate the code by removing the macro.	2023-10-19 14:57:25 +02:00
Ondřej Surý	74f9f5f821	Disable OpenSSL memory contexts for OpenSSL < 3.0.0 OpenSSL 1.1 has already reached end-of-life and since we are experiencing a weird memory leak in the mirror system test on just Ubuntu 20.04 (Focal) with OpenSSL 1.1, we disable the legacy code for enabling memory contexts for OpenSSL < 3.0.0 in this commit.	2023-10-19 12:54:40 +02:00
Mark Andrews	29f399797d	Adjust UDP timeouts used in zone maintenance Drop timeout before resending a UDP request from 15 seconds to 5 seconds and add 1 second to the total time to allow for the reply to the third request to arrive. This will speed up the time it takes for named to recover from a lost packet when refreshing a zone and for it to determine that a primary is down.	2023-10-18 13:06:28 +11:00
Michal Nowak	dd234c60fe	Update the source code formatting using clang-format-17	2023-10-17 17:47:46 +02:00
Matthijs Mekking	741ce2d07a	Don't resign raw version of the zone Update the function 'set_resigntime()' so that raw versions of inline-signing zones are not scheduled to be resigned. Also update the check in the same function for zone is dynamic, there exists a function 'dns_zone_isdynamic()' that does a similar thing and is more complete. Also in 'zone_postload()' check whether the zone is not the raw version of an inline-signing zone, preventing calculating the next resign time.	2023-10-16 09:26:56 +02:00
Ondřej Surý	96bbf95b83	Convert rwlock in dns_acl to RCU The dns_aclenv_t contains two dns_acl_t - localhost and localnets that can be swapped with a different ACLs as we configure BIND 9. Instead of protecting those two pointers with heavyweight read-write lock, use RCU mechanism to dereference and swap the pointers.	2023-10-13 14:44:40 +02:00
Ondřej Surý	546c327349	Convert manual dns_{acl,aclenv}_{attach,detach} to ISC_REFCOUNT_IMPL Instead of having a manual set of functions, use ISC_REFCOUNT_IMPL macro to implement the attach, detach, ref and unref functions.	2023-10-13 14:44:40 +02:00
Ondřej Surý	b3a8f0048f	Refactor dns_{acl,aclenv}_create to return void The dns_{acl,aclenv}_create() can't fail, so change it to return void.	2023-10-13 14:44:40 +02:00
Ondřej Surý	f5b0bd9b1b	Convert manual dns_iptable_{attach,detach} to ISC_REFCOUNT_IMPL Instead of having a manual set of functions, use ISC_REFCOUNT_IMPL macro to implement the attach, detach, ref and unref functions.	2023-10-13 14:44:40 +02:00
Ondřej Surý	613ada72b6	Refactor dns_iptable_create() to return void The dns_iptable_create() cannot fail now, so change it to return void.	2023-10-13 14:44:40 +02:00
Ondřej Surý	d46d51be78	Refactor isc_radix_create to return void The isc_radix_create() can't fail, so change it to return void.	2023-10-13 14:44:40 +02:00
Aram Sargsyan	20fdab8667	Fix undefined behaviour occurrences The undefined behaviour was detected by LLVM 17. Fix the affected functions definitions to match the expected function type.	2023-10-13 09:57:28 +00:00
Ondřej Surý	6afa961534	Don't undef <unit>_TRACE, instead add comment how to enable it In units that support detailed reference tracing via ISC_REFCOUNT macros, we were doing: /* Define to 1 for detailed reference tracing / #undef <unit>_TRACE This would prevent using -D<unit>_TRACE=1 in the CFLAGS. Convert the above mentioned snippet with just a comment how to enable the detailed reference tracing: / Add -D<unit>_TRACE=1 to CFLAGS for detailed reference tracing */	2023-10-13 11:40:16 +02:00
Evan Hunt	3a206da456	check chain length is nonzero before examining last entry It was possible to reach add_link() without visiting an intermediate node first, and the check for a duplicate entry could then cause a crash. Credit to OSS-Fuzz for discovering this error.	2023-10-12 11:31:32 -07:00
Ondřej Surý	91f3b0edee	Use mul and div instead of bitshifts to calculate srtt There was a microoptimization for smoothing srtt with bitshifts. Revert the code to use * 98 / 100, it doesn't really make that difference on modern CPUs, for comparison here: muldiv: imul eax, edi, 98 imul rax, rax, 1374389535 shr rax, 37 ret shift: mov eax, edi sal eax, 9 sub eax, edi shr eax, 9 ret	2023-10-12 12:35:00 +02:00
Ondřej Surý	0635bd01cb	Skip the no-op code in adjustsrtt() If factor == DNS_ADB_RTTADJAGE and addr->entry->lastage == now we would load value into new_srtt and then immediatelly store it back which triggers the synchronization between threads using .srtt values.	2023-10-12 12:35:00 +02:00
Ondřej Surý	cb0db600e7	Replace some ADB entry locking with atomics to reduce ADB contention Use atomics on couple of ADB entry members (.srtt, .flags, .expires, and .lastage) to remove ADB entry locking from couple of hot spots. The most prominent place is copy_namehook_lists() that gets called under ADB name lock and if the namehook list is long it acquires-releases quite a few ADB entry locks. Changing those ADB entry members to atomics allowed us to new_adbaddrinfo() not require locked ADB entry and since adbentry_overquota() already used atomics and handling lame information was dropped in the previous commit, we could not make the copy_namehook_lists() lockless. The other hotspot is dns_adb_adjustsrtt() and dns_adb_agesrtt() that can now use atomics because .srtt is already atomic_uint. And the last place that could now use atomics is dns_adb_changeflags().	2023-10-12 12:35:00 +02:00
Ondřej Surý	2b20db05e3	Remove dns_adblameinfo from dns_adb Keeping the information about lame server in the ADB was done in !322 to fix following security issue: [CVE-2021-25219] Disable "lame-ttl" cache The handling of the lame servers needs to be redesigned and it is not going to be enabled any time soon, and the current code is just dead code that takes up space, code and stands in the way of making ADB work faster. Remove all the internals needed for handling the lame servers in the ADB for now. It might get reintroduced later if and when we redesign ADB.	2023-10-12 12:35:00 +02:00
Matthijs Mekking	746e9809a8	Fix build error related to USDT The trace.h file is listed twice in the Makefile. This incidentally caused an error where the build refused to replace an earlier placed trace.h file.	2023-10-10 16:57:18 +02:00
Evan Hunt	bf81ef3fc0	reduce search_lock coverage now that we're using qpmulti for the summary database, we no longer need to hold search_lock for it. we do still need it for the radix tree and the trigger counts.	2023-10-09 13:29:02 -07:00
Evan Hunt	feea05d5c4	convert the RPZ summary database to to use a QP trie now that we have the QP chain mechanism, we can convert the RPZ summary database to use a QP trie instead of an RBT. also revised comments throughout the file accordingly, and incidentally cleaned up calls to new_node(), which can no longer fail.	2023-10-09 13:29:02 -07:00
Evan Hunt	86fbfc22b4	fix build bug with DNS_RPZ_TRACE nonstardard naming of ref/unref and attach/detach functions caused build errors when using DNS_RPZ_TRACE; this has been fixed.	2023-10-09 13:29:02 -07:00
Evan Hunt	8f6a3f47db	fix a QP chain bug depending on how the QP trie is traversed during a lookup, it is possible for a search to terminate on a leaf which is a partial match, without that leaf being added to the chain. to ensure the chain is correct in this case, when a partial match condition is detected via qpkey_compare(), we will call add_link() again, just in case. (add_link() will check for a duplicated node, so it will be harmless if it was already done.)	2023-10-09 13:29:02 -07:00
Mark Andrews	d97dc03b8e	Detect duplicate use of control sockets in named.conf Specifying duplicate control sockets can lead to hard to diagnose rndc connection failures.	2023-10-05 11:32:01 +11:00
Aram Sargsyan	b970556f21	Remove unnecessary NULL-checks in ns__client_setup() All these pointers are guaranteed to be non-NULL. Additionally, update a comment to remove obviously outdated information about the function's requirements.	2023-09-28 13:43:18 +00:00
Aram Sargsyan	fb7bbbd1be	Don't use an uninitialized link on an error path Move the block on the error path, where the link is checked, to a place where it makes sense, to avoid accessing an unitialized link when jumping to the 'cleanup_query' label from 4 different places. The link is initialized only after those jumps happen. In addition, initilize the link when creating the object, to avoid similar errors.	2023-09-28 08:14:05 +00:00
Evan Hunt	03016902dd	rename dns_qp_findname_ancestor() to dns_qp_lookup() I am weary of typing so long a name. (plus, the name has become slightly misleading now that the DNS_QPFIND_NOEXACT option no longer exists.)	2023-09-28 00:32:44 -07:00
Evan Hunt	6231fd66af	rename QP-related types to use standard BIND nomenclature changed type names in QP trie code to match the usual convention: - qp_node_t -> dns_qpnode_t - qp_ref_t -> dns_qpref_t - qp_shift_t -> dns_qpshift_t - qp_weight_t -> dns_qpweight_t - qp_chunk_t -> dns_qpchunk_t - qp_cell_t -> dns_qpcell_t	2023-09-28 00:32:39 -07:00

... 7 8 9 10 11 ...

15876 commits