bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-04-28 17:46:40 -04:00

Author	SHA1	Message	Date
Mark Andrews	b25a1943ee	Be more precise with the stopping conditions in zone_resigninc If there happens to be a RRSIG(SOA) that is not at the zone apex for any reason it should not be considered as a stopping condition for incremental zone signing. (cherry picked from commit `b7cdc3583e`)	2021-04-26 12:45:28 +02:00
Matthijs Mekking	f82d4f0474	Check for keyid conflicts between new keys When the keymgr needs to create new keys, it is possible it needs to create multiple keys. The keymgr checks for keyid conflicts with already existing keys, but it should also check against the keys that it just created. (cherry picked from commit `668301f138`)	2021-04-26 10:48:06 +02:00
Ondřej Surý	4bae6d8d73	Fix lock-order-inversion (potential deadlock) in dns_resolver_createfetch There's a lock-order-inversion when running `zone_maintenance()` from the timer while shutting down the server `shutdown_server()`. This only happens when the taskmgr scheduling is more relaxed and parallelized, but the issue is real nevertheless. The associated ThreadSanitizer warning: WARNING: ThreadSanitizer: lock-order-inversion (potential deadlock) Cycle in lock order graph: M1 (0x000000000001) => M2 (0x000000000000) => M1 Mutex M2 acquired here while holding mutex M1 in thread T1: #0 pthread_mutex_lock <null> #1 dns_view_findzonecut lib/dns/view.c:1326:2 #2 fctx_create lib/dns/resolver.c:5144:13 #3 dns_resolver_createfetch lib/dns/resolver.c:10977:12 #4 zone_refreshkeys lib/dns/zone.c:10830:13 #5 zone_maintenance lib/dns/zone.c:11065:5 #6 zone_timer lib/dns/zone.c:14652:2 #7 task_run lib/isc/task.c:857:5 #8 isc_task_run lib/isc/task.c:944:10 #9 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #10 process_netievent lib/isc/netmgr/netmgr.c #11 process_queue lib/isc/netmgr/netmgr.c:885:8 #12 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #13 process_queues lib/isc/netmgr/netmgr.c:772:7 #14 async_cb lib/isc/netmgr/netmgr.c:671:2 #15 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #16 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #17 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #18 nm_thread lib/isc/netmgr/netmgr.c:597:11 #19 isc__trampoline_run lib/isc/trampoline.c:184:11 Mutex M1 previously acquired by the same thread here: #0 pthread_mutex_lock <null> #1 zone_refreshkeys lib/dns/zone.c:10717:2 #2 zone_maintenance lib/dns/zone.c:11065:5 #3 zone_timer lib/dns/zone.c:14652:2 #4 task_run lib/isc/task.c:857:5 #5 isc_task_run lib/isc/task.c:944:10 #6 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #7 process_netievent lib/isc/netmgr/netmgr.c #8 process_queue lib/isc/netmgr/netmgr.c:885:8 #9 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #10 process_queues lib/isc/netmgr/netmgr.c:772:7 #11 async_cb lib/isc/netmgr/netmgr.c:671:2 #12 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #13 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #14 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #15 nm_thread lib/isc/netmgr/netmgr.c:597:11 #16 isc__trampoline_run lib/isc/trampoline.c:184:11 Mutex M1 acquired here while holding mutex M2 in thread T2: #0 pthread_mutex_lock <null> #1 dns_zone_flush lib/dns/zone.c:11443:2 #2 view_flushanddetach lib/dns/view.c:657:5 #3 dns_view_flushanddetach lib/dns/view.c:690:2 #4 shutdown_server bin/named/server.c:10056:4 #5 task_run lib/isc/task.c:857:5 #6 isc_task_run lib/isc/task.c:944:10 #7 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #8 process_netievent lib/isc/netmgr/netmgr.c #9 process_queue lib/isc/netmgr/netmgr.c:885:8 #10 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #11 process_queues lib/isc/netmgr/netmgr.c:772:7 #12 async_cb lib/isc/netmgr/netmgr.c:671:2 #13 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #14 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #15 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #16 nm_thread lib/isc/netmgr/netmgr.c:597:11 #17 isc__trampoline_run lib/isc/trampoline.c:184:11 Mutex M2 previously acquired by the same thread here: #0 pthread_mutex_lock <null> #1 view_flushanddetach lib/dns/view.c:645:3 #2 dns_view_flushanddetach lib/dns/view.c:690:2 #3 shutdown_server bin/named/server.c:10056:4 #4 task_run lib/isc/task.c:857:5 #5 isc_task_run lib/isc/task.c:944:10 #6 isc__nm_async_task lib/isc/netmgr/netmgr.c:730:24 #7 process_netievent lib/isc/netmgr/netmgr.c #8 process_queue lib/isc/netmgr/netmgr.c:885:8 #9 process_tasks_queue lib/isc/netmgr/netmgr.c:756:10 #10 process_queues lib/isc/netmgr/netmgr.c:772:7 #11 async_cb lib/isc/netmgr/netmgr.c:671:2 #12 uv__async_io /home/ondrej/Projects/tsan/libuv/src/unix/async.c:163:5 #13 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 #14 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:392:5 #15 nm_thread lib/isc/netmgr/netmgr.c:597:11 #16 isc__trampoline_run lib/isc/trampoline.c:184:11 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:79:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:303:3 #3 create_managers bin/named/main.c:957:15 #4 setup bin/named/main.c:1267:11 #5 main bin/named/main.c:1558:2 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create lib/isc/pthreads/thread.c:79:8 #2 isc_nm_start lib/isc/netmgr/netmgr.c:303:3 #3 create_managers bin/named/main.c:957:15 #4 setup bin/named/main.c:1267:11 #5 main bin/named/main.c:1558:2 SUMMARY: ThreadSanitizer: lock-order-inversion (potential deadlock) in __interceptor_pthread_mutex_lock (cherry picked from commit `25d27851d8`)	2021-04-19 22:31:37 +02:00
Ondřej Surý	97a5559ae3	Cleanup the isc_<*>mgr_createinc() constructors Previously, the taskmgr, timermgr and socketmgr had a constructor variant that would create the mgr on top of an existing appctx. This was no longer true and isc_<*>mgr_createinc() was just calling isc_<*>mgr_create() directly without any extra code. This commit just cleans up the extra function. (cherry picked from commit `3388ef36b3`)	2021-04-19 15:57:40 +02:00
Ondřej Surý	08055b742c	Cleanup the public vs private ISCAPI remnants Since all the libraries are internal now, just cleanup the ISCAPI remnants in isc_socket, isc_task and isc_timer APIs. This means there is one less layer, as the following changes have been made: * struct isc_socket and struct isc_socketmgr have been removed * struct isc__socket and struct isc__socketmgr have been renamed to struct isc_socket and struct isc_socketmgr * struct isc_task and struct isc_taskmgr have been removed * struct isc__task and struct isc__taskmgr have been renamed to struct isc_task and struct isc_taskmgr * struct isc_timer and struct isc_timermgr have been removed * struct isc__timer and struct isc__timermgr have been renamed to struct isc_timer and struct isc_timermgr * All the associated code that dealt with typing isc_<foo> to isc__<foo> and back has been removed. (cherry picked from commit `16fe0d1f41`)	2021-04-19 15:24:10 +02:00
Mark Andrews	9d85c56772	properly initialise resarg->lock	2021-04-19 14:32:53 +02:00
Evan Hunt	28511bfcfd	move samples/resolve.c to bin/tests/system "resolve" is used by the resolver system tests, and I'm not certain whether delv exercises the same code, so rather than remove it, I moved it to bin/tests/system. (cherry picked from commit `d0ec7d1f33`)	2021-04-19 14:32:53 +02:00
Evan Hunt	65777fcdf4	remove sample-async sample code for export libraries is no longer needed and this code is not used for any internal tests. also, sample-gai.c had already been removed but there were some dangling references. (cherry picked from commit `056afe7bdc`)	2021-04-19 13:25:48 +02:00
Evan Hunt	2f7f47bd99	rename dns_client_createx() to dns_client_create() there's no longer a need to use an alternate name. (cherry picked from commit `568d455c99`)	2021-04-19 13:25:48 +02:00
Evan Hunt	b4aaf6b83d	remove dns_client_request() and related code continues the cleanup of dns_client started in the previous commit. (cherry picked from commit `1beb05f3e2`)	2021-04-19 13:25:48 +02:00
Evan Hunt	131bbb9bbe	remove dns_client_update() and related code the libdns client API is no longer being maintained for external use, we can remove the code that isn't being used internally, as well as the related tests. (cherry picked from commit `fb2a352e7c`)	2021-04-19 13:25:48 +02:00
Ondřej Surý	cb6bfd1e9c	Fix task timing race in setnsec3param() When setnsec3param() is scheduled from zone_postload() there's no guarantee that `zone->db` is not `NULL` yet. Thus when setnsec3param() is called, we need to check for `zone->db` existence and reschedule the task, because calling `rss_post()` on a zone with empty `.db` ends up as a no-op (the function just returns). (cherry picked from commit `0127ba6472`)	2021-04-19 11:48:39 +02:00
Michał Kępień	648ef3a2b4	Fix handling undefined GSS_SPNEGO_MECHANISM macro BIND 9 attempts to look up GSSAPI OIDs for the Kerberos 5 and SPNEGO mechanisms in the relevant header files provided by the Kerberos/GSSAPI library used. Due to the differences between various Kerberos/GSSAPI implementations, if any of the expected preprocessor macros (GSS_KRB5_MECHANISM, GSS_SPNEGO_MECHANISM) is not defined in the header files provided by the library used, the code in lib/dns/gssapictx.c defines its own version of each missing macro, so that BIND 9 can attempt to use the relevant security mechanisms anyway. Commit `a875dcc669`, which contains a partial backport of the changes introduced in commit `978c7b2e89`, left a block of code in the lib/dns/dst_internal.h header which defines the GSS_SPNEGO_MECHANISM preprocessor macro to NULL if it is not defined by any header file provided by the Kerberos/GSSAPI library used. This causes the gss_add_oid_set_member() call in the mech_oid_set_create() helper function to always return an error. This in turn causes the dst_gssapi_acquirecred() function to also always return an error, which ultimately prevents any named instance whose configuration includes the "tkey-gssapi-credential" option from starting. Remove the offending conditional definition of the GSS_SPNEGO_MECHANISM preprocessor macro from lib/dns/dst_internal.h, so that a proper GSSAPI OID is assigned to that macro in lib/dns/gssapictx.c when the Kerberos/GSSAPI library used does not define it.	2021-04-16 14:40:06 +02:00
Ondřej Surý	83c79a0b1e	Refactor dns_journal_rollforward() to work over opened journal Too much logic was crammed inside dns_journal_rollforward(), which made it harder to follow. The dns_journal_rollforward() was refactored to work over an already opened journal and some of the previous logic was moved to a new static zone_journal_rollforward() that separates the journal "rollforward" logic from the "zone" logic. (cherry picked from commit `55b942b4a0`)	2021-04-16 13:50:20 +02:00
Mark Andrews	875366565c	Fixing a recoverable journal should not result in the zone being written When dns_journal_rollforward returned ISC_R_RECOVERABLE the distinction between 'up to date' and 'success' was lost; as a consequence zone_needdump() was called, writing out the zone file when it shouldn't have been. This change restores that distinction. Adjust system test to reflect visible changes. (cherry picked from commit `ec7a9af381`)	2021-04-16 13:50:20 +02:00
Matthijs Mekking	3e7c6a6fe8	Small refactor lib/dns/zone.c Introduce some macros that can be reused in 'zone_load_soa_rr()' and 'zone_get_from_db()' to make those functions more readable. (cherry picked from commit `8fcbef2423`)	2021-04-13 14:19:52 +02:00
Matthijs Mekking	b0fb734079	Use designated initializer in dns_zone_create Shorten the code and make it less prone to initialisation errors (it is still easy to forget adding an initializer, but it now defaults to 0). (cherry picked from commit `032110bd2e`)	2021-04-13 14:19:32 +02:00
Matthijs Mekking	e5736de60d	Implement draft-vandijk-dnsop-nsec-ttl The draft says that the NSEC(3) TTL must have the same TTL value as the minimum of the SOA MINIMUM field and the SOA TTL. This was always the intended behaviour. Update the zone structure to also track the SOA TTL. Whenever we use the MINIMUM value to determine the NSEC(3) TTL, use the minimum of MINIMUM and SOA TTL instead. There is no specific test for this, however two tests need adjusting because otherwise they failed: They were testing for NSEC3 records including the TTL. Update these checks to use 600 (the SOA TTL), rather than 3600 (the SOA MINIMUM). (cherry picked from commit `9af8caa733`)	2021-04-13 14:18:33 +02:00
Matthijs Mekking	0d47f9f20f	Use stale TTL as RRset TTL in dumpdb It is more intuitive to have the countdown 'max-stale-ttl' as the RRset TTL, instead of 0 TTL. This information was already available in a comment "; stale (will be retained for x more seconds", but Support suggested to put it in the TTL field instead. (cherry picked from commit `a83c8cb0af`)	2021-04-13 10:59:17 +02:00
Matthijs Mekking	7b17cc080e	Check staleness in bind_rdataset Before binding an RRset, check the time and see if this record is stale (or perhaps even ancient). Marking a header stale or ancient happens only when looking up an RRset in cache, but binding an RRset can also happen on other occasions (for example when dumping the database). Check the time and compare it to the header. If according to the time the entry is stale, but not ancient, set the STALE attribute. If according to the time it is ancient, set the ANCIENT attribute. We could mark the header stale or ancient here, but that requires locking, so that's why we only compare the current time against the rdh_ttl. Adjust the test to check the dump-db before querying for data. In the dumped file the entry should be marked as stale, despite no cache lookup having happened since the initial query. (cherry picked from commit `debee6157b`)	2021-04-13 10:59:10 +02:00
Matthijs Mekking	dcf6e3e58a	Fix nonsensical stale TTL values in cache dump When introducing change 5149, "rndc dumpdb" started to print a line above a stale RRset, indicating how long the data will be retained. At that time, I thought it should also be possible to load a cache from file. But if a TTL has a value of 0 (because it is stale), stale entries wouldn't be loaded from file. So, I added the 'max-stale-ttl' to TTL values, and adjusted the $DATE accordingly. Since we actually don't have a "load cache from file" feature, this is premature and is causing confusion among operators. This commit changes the 'max-stale-ttl' adjustments. A check in the serve-stale system test is added for a non-stale RRset (longttl.example) to make sure the TTL in cache is sensible. Also, the comment above stale RRsets could have nonsensical values. A possible reason why this may happen is when the RRset was marked as stale but the 'max-stale-ttl' has passed (and it is actually an RRset awaiting cleanup). This would lead to the "will be retained" value being negative (but since it is stored in a uint32_t, you would get a nonsensical value, e.g. 4294362497). To mitigate against this, we now also check if the header is not ancient. In addition we check if the stale_ttl would be negative, and if so we set it to 0. Most likely this will not happen because the header would already have been marked ancient, but there is a possible race condition where the 'rdh_ttl + serve_stale_ttl' has passed, but the header has not been checked for staleness. (cherry picked from commit `2a5e0232ed`)	2021-04-13 10:59:00 +02:00
Mark Andrews	f4331a48fa	Make calling generic rdata methods consistent add matching macros to pass arguments from called methods to generic methods. This will reduce the amount of work required when extending methods. Also cleanup unnecessary UNUSED declarations. (cherry picked from commit `a88d3963e2`)	2021-04-13 01:54:29 +00:00
Michał Kępień	363902ce2c	Free resources when gss_accept_sec_context() fails Even if a call to gss_accept_sec_context() fails, it might still cause a GSS-API response token to be allocated and left for the caller to release. Make sure the token is released before an early return from dst_gssapi_acceptctx(). (cherry picked from commit `d954e152d9`)	2021-04-08 10:41:08 +02:00
Mark Andrews	7b93ff93d6	Rewrite managed-key journal immediately Both managed keys and regular zone journals need to be updated immediately when a recoverable error is discovered. (cherry picked from commit `0fbdf189c7`)	2021-04-07 21:29:07 +02:00
Mark Andrews	511ea2d3f3	Update dns_journal_compact() to handle bad transaction headers Previously, dns_journal_begin_transaction() could reserve the wrong amount of space. We now check that the transaction is internally consistent when upgrading / downgrading a journal and we also handle the bad transaction headers. (cherry picked from commit `83310ffd92`)	2021-04-07 21:29:06 +02:00
Mark Andrews	6da2e05df9	Compute transaction size based on journal/transaction type previously the code assumed that it was a new transaction. (cherry picked from commit `520509ac7e`)	2021-04-07 21:29:06 +02:00
Mark Andrews	d9ad7ccf2d	Use journal_write_xhdr() to write the dummy transaction header Instead of journal_write(), use correct format call journal_write_xhdr() to write the dummy transaction header which looks at j->header_ver1 to determine which transaction header to write instead of always writing a zero filled journal_rawxhdr_t header. (cherry picked from commit `5a6112ec8f`)	2021-04-07 21:29:06 +02:00
Diego Fronza	5d391f07c0	Resolve TSAN data race in zone_maintenance Fix a race between the zone_maintenance and dns_zone_notifyreceive functions: zone_maintenance was attempting to read a zone flag by calling DNS_ZONE_FLAG(zone, flag) while dns_zone_notifyreceive was updating a flag in the same zone by calling DNS_ZONE_SETFLAG(zone, ...). The code reading the flag in zone_maintenance was not protected by the zone's lock. To avoid the race, the zone's lock is now acquired before the zone flag is read.	2021-04-07 13:22:36 +00:00
Matthijs Mekking	194a72b3f1	If RPZ config'd, bail stale-answer-client-timeout When we are recursing, RPZ processing is not allowed. But when we are performing a lookup due to "stale-answer-client-timeout", we are still recursing. This effectively means that RPZ processing is disabled on such a lookup. In this case, bail the "stale-answer-client-timeout" lookup and wait for recursion to complete, as we can't perform the RPZ rewrite rules reliably. (cherry picked from commit `3d3a6415f7`)	2021-04-02 13:29:27 +02:00
Matthijs Mekking	29bcd113ea	Rename "staleonly" The dboption DNS_DBFIND_STALEONLY caused confusion because it implies we are looking for stale data only and ignore any active RRsets in the cache. Rename it to DNS_DBFIND_STALETIMEOUT as it is more clear the option is related to a lookup due to "stale-answer-client-timeout". Rename other usages of "staleonly", instead use "lookup due to...". Also rename related function and variable names. (cherry picked from commit `839df94190`)	2021-04-02 13:29:17 +02:00
Matthijs Mekking	34dd6521b1	Restore the RECURSIONOK attribute after staleonly When doing a staleonly lookup we don't want to fall back to recursion. After all, there are obviously problems with recursion, otherwise we wouldn't do a staleonly lookup. When resuming from recursion however, we should restore the RECURSIONOK flag, allowing future required lookups for this client to recurse. (cherry picked from commit `3f81d79ffb`)	2021-04-02 13:29:09 +02:00
Matthijs Mekking	114dc7888a	Remove result exception on staleonly lookup When implementing "stale-answer-client-timeout", we decided that we should only return positive answers prematurely to clients. A negative response is not useful, and in that case it is better to wait for the recursion to complete. To do so, we check the result and if it is not ISC_R_SUCCESS, we decide that it is not good enough. However, there are more return codes that could lead to a positive answer (e.g. CNAME chains). This commit removes the exception and now uses the same logic that other stale lookups use to determine if we found a useful stale answer (stale_found == true). This means we can simplify two test cases in the serve-stale system test: nodata.example is no longer treated differently than data.example. (cherry picked from commit `aaed7f9d8c`)	2021-04-02 13:28:59 +02:00
Matthijs Mekking	06823aa255	Remove INSIST on NS_QUERYATTR_ANSWERED The NS_QUERYATTR_ANSWERED attribute is to prevent sending a response twice. Without the attribute, this may happen if a staleonly lookup found a useful answer and sends a response to the client, and later recursion ends and also tries to send a response. The attribute was also used to mask adding a duplicate RRset. This is considered harmful. When we have created a response to the client with a staleonly lookup (regardless of whether we have actually sent the response), we should clear the rdatasets that were added during that lookup. Mark such rdatasets with a new attribute, DNS_RDATASETATTR_STALE_ADDED. Set a query attribute NS_QUERYATTR_STALEOK if we may have added rdatasets during a staleonly lookup. Before creating a response on a normal lookup, check if we can expect rdatasets to have been added during a staleonly lookup. If so, clear the rdatasets from the message with the attribute DNS_RDATASETATTR_STALE_ADDED set. (cherry picked from commit `3d5429f61f`)	2021-04-02 13:28:08 +02:00
Matthijs Mekking	33d61b9651	Simplify when to detach the client With stale-answer-client-timeout, we may send a response to the client, but we may want to hold on to the network manager handle, because recursion is going on in the background, or we need to refresh a stale RRset. Simplify the setting of 'nodetach': * During a staleonly lookup we should not detach the nmhandle, so just set it prior to 'query_lookup()'. * During a staleonly "stalefirst" lookup set the 'nodetach' to true if we are going to refresh the RRset. Now there is no longer the need to clear the 'nodetach' if we go through the "dbfind_stale", "stale_refresh_window", or "stale_only" paths. (cherry picked from commit `48b0dc159b`)	2021-04-02 13:28:01 +02:00
Matthijs Mekking	b1496d19d5	Refactor stale lookups, ignore active RRsets When doing a staleonly lookup, ignore active RRsets from cache. If we don't, we may add a duplicate RRset to the message, and hit an assertion failure in query.c because adding the duplicate RRset to the ANSWER section failed. This can happen on a race condition. When a client query is received, the recursion is started. When 'stale-answer-client-timeout' triggers around the same time the recursion completes, the following sequence of events may happen: 1. Queue the "try stale" fetch_callback() event to the client task. 2. Add the RRsets from the authoritative response to the cache. 3. Queue the "fetch complete" fetch_callback() event to the client task. 4. Execute the "try stale" fetch_callback(), which retrieves the just-inserted RRset from the database. 5. In "ns_query_done()" we are still recursing, but the "staleonly" query attribute has already been cleared. In other words, the query will resume when recursion ends (it already has ended but is still on the task queue). 6. Execute the "fetch complete" fetch_callback(). It finds the answer from recursion in the cache again and tries to add the duplicate to the answer section. This commit changes the logic for finding stale answers in the cache, such that on "stale_only" lookups actually only stale RRsets are considered. It refactors the code so that code paths for "dbfind_stale", "stale_refresh_window", and "stale_only" are more clear. First we call some generic code that applies in all three cases: format the domain name for logging purposes, increment the trystale stats, and check if we actually found stale data that we can use. The "dbfind_stale" lookup will return SERVFAIL if we didn't find a usable answer, otherwise we will continue with the lookup (query_gotanswer()). This is no different from before the introduction of "stale-answer-client-timeout" and "stale-refresh-time". The "stale_refresh_window" lookup is similar to the "dbfind_stale" lookup: return SERVFAIL if we didn't find a usable answer, otherwise continue with the lookup (query_gotanswer()). Finally the "stale_only" lookup. If the "stale_only" lookup was triggered because of an actual client timeout (stale-answer-client-timeout > 0), and if the database lookup returned a stale usable RRset, trigger a response to the client. Otherwise return and wait until the recursion completes (or the resolver query times out). If the "stale_only" lookup is a "stale-answer-client-timeout 0" lookup, we prefer stale data over a lookup. In this case if there was no stale data, or the data was not a positive answer, retry the lookup with the stale options cleared, a.k.a. a normal lookup. Otherwise, continue with the lookup (query_gotanswer()) and refresh the stale RRset. This will trigger a response to the client, but will not detach the handle because a fetch will be created to refresh the RRset. (cherry picked from commit `92f7a67892`)	2021-04-02 13:27:52 +02:00
Matthijs Mekking	fcf8fb4f39	Keep track of allow client detach The stale-answer-client-timeout feature introduced a dependency on when a client may be detached from the handle. The dboption DNS_DBFIND_STALEONLY was reused to track this attribute. This overloads the meaning of this database option, and actually introduced a bug because the option was checked in other places. In particular, in 'ns_query_done()' there is a check for 'RECURSING(qctx->client) && (!QUERY_STALEONLY(&qctx->client->query) || ...' and the condition is satisfied because recursion has not completed yet and DNS_DBFIND_STALEONLY is already cleared by that time (in query_lookup()), because we found a useful answer and we should detach the client from the handle after sending the response. Add a new boolean to the client structure to keep track of whether detaching the client from the handle is allowed. It is only disallowed if we are in a staleonly lookup and we didn't find a useful answer. (cherry picked from commit `fee164243f`)	2021-04-02 13:27:43 +02:00
Ondřej Surý	565a6a5679	Move the dummy shims to single ifndef GSSAPI block Previously, every function had its own #ifdef GSSAPI #else #endif block that defined a shim function in case GSSAPI was not being used. Now the dummy shim functions have been split out into a single #else #endif block at the end of the file. This makes gssapictx.c similar to the 9.17.x code, making backports and reviews easier.	2021-04-01 10:42:32 +02:00
Mark Andrews	3fd30e1634	Add Heimdal compatibility support The Heimdal Kerberos library handles the OID sets in a different manner. Unify the handling of the OID sets between MIT and Heimdal implementations by dynamically creating the OID sets instead of using static predefined set. This is how upstream recommends to handle the OID sets.	2021-04-01 10:42:32 +02:00
Mark Andrews	a875dcc669	Remove custom ISC SPNEGO implementation The custom ISC SPNEGO mechanism implementation is no longer needed on the basis that all major Kerberos 5/GSSAPI (mit-krb5, heimdal and Windows) implementations support SPNEGO mechanism since 2006. This commit removes the custom ISC SPNEGO implementation, and removes the option from both autoconf and win32 Configure script. Unknown options are being ignored, so this doesn't require any special handling.	2021-04-01 10:42:32 +02:00
Ondřej Surý	ee7283b3ee	Merge branch 'bind-dyndb-ldap-v9.16.13' into 'main' Do not require config.h to use isc/util.h See merge request isc-projects/bind9!4840 (cherry picked from commit `19b69e9a3b`) `81eb3396` Do not require config.h to use isc/util.h	2021-03-26 18:48:06 +00:00
Matthijs Mekking	1f8c5786f8	Delete CDS/CDNSKEY records when zone is unsigned CDS/CDNSKEY DELETE records are only useful if they are signed, otherwise the parent cannot verify these RRsets anyway. So once the DS has been removed (and signaled to BIND), we can remove the DNSKEY and RRSIG records, and at this point we can also remove the CDS/CDNSKEY records. (cherry picked from commit `6f31f62d69`)	2021-03-22 13:57:10 +01:00
Matthijs Mekking	7882c7fbea	Allow CDS/CDNSKEY DELETE records in unsigned zone While not useful, having a CDS/CDNSKEY DELETE record in an unsigned zone is not an error and "named-checkzone" should not complain. (cherry picked from commit `f211c7c2a1`)	2021-03-22 13:31:02 +01:00
Matthijs Mekking	b81502f4ae	Fix keymgr key init bug The 'keymgr_key_init()' function initializes key states if they have not been set previously. It looks at the key timing metadata and determines using the given times whether a state should be set to RUMOURED or OMNIPRESENT. However, the DNSKEY and ZRRSIG states were mixed up: When looking at the Activate timing metadata we should set the ZRRSIG state, and when looking at the Published timing metadata we should set the DNSKEY state. (cherry picked from commit `27e7d5f698`)	2021-03-22 11:24:55 +01:00
Patrick McLean	c5c9c9b83f	Add isc_time_now_hires function to get current time with high resolution The current isc_time_now uses CLOCK_REALTIME_COARSE which only updates on a timer tick. This clock is generally fine for millisecond accuracy, but on servers with 100hz clocks, this clock is nowhere near accurate enough for microsecond accuracy. This commit adds a new isc_time_now_hires function that uses CLOCK_REALTIME, which gives the current time, though it is somewhat expensive to call. When microsecond accuracy is required, it may be required to use extra resources for higher accuracy. (cherry picked from commit `ebced74b19`)	2021-03-20 11:59:21 -07:00
Witold Kręcicki	a6c4702796	Fix a startup/shutdown crash in ns_clientmgr_create	2021-03-18 15:33:28 -03:00
Witold Kręcicki	dd564da286	Shutdown interface if we can't listen on it to avoid shutdown hang	2021-03-18 15:27:28 -03:00
Ondřej Surý	121641686c	Temporarily disable tlsdns_test until it gets refactored The tlsdns API is not yet used in the 9.16 branch and the tlsdns_test fails too often. Temporarily disable running the test until it is actually needed.	2021-03-18 15:42:03 +01:00
Ondřej Surý	db49ffca20	Change the isc_nm_(get|set)timeouts() to work with milliseconds RFC 7828 specifies the keepalive interval as a 16-bit value in units of 100 milliseconds, and the configuration options tcp-*-timeouts follow suit. The units of 100 milliseconds are very unintuitive and while we can't change the configuration and presentation format, we should not follow this weird unit in the API. This commit changes the isc_nm_(get|set)timeouts() functions to work with milliseconds and convert the values to milliseconds before passing them to the function, not just internally.	2021-03-18 15:16:13 +01:00
Ondřej Surý	5d0647e067	Merge the common parts between udp, tcpdns and tlsdns protocol The udp, tcpdns and tlsdns contained a lot of cut&paste code or code that was very similar, making the stack harder to maintain as any change to one would have to be copied to the other protocols. In this commit, we merge the common parts into the common functions under isc__nm_<foo> namespace and just keep the little differences based on the socket type.	2021-03-18 15:16:13 +01:00
Ondřej Surý	a017ba2615	Fix TCPDNS and TLSDNS timers After the TCPDNS refactoring the initial and idle timers were broken and only the tcp-initial-timeout was always applied on the whole TCP connection. This broke any TCP connection that took longer than tcp-initial-timeout, most often this would affect large zone AXFRs. This commit changes the timeout logic in this way: * On TCP connection accept the tcp-initial-timeout is applied and the timer is started * When we are processing and/or sending any DNS message the timer is stopped * When we stop processing all DNS messages, the tcp-idle-timeout is applied and the timer is started again	2021-03-18 15:16:13 +01:00
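
Commit dcf6e3e58a in the list above describes how a negative "will be retained" value, stored in a uint32_t, shows up as a number such as 4294362497. The following is a minimal C sketch of that wraparound and of the clamp-to-zero mitigation. It is not BIND's actual code: both helper names are hypothetical, and it assumes rdh_ttl holds an absolute expiry time in seconds, which is how the commit's comparison of 'rdh_ttl + serve_stale_ttl' against the current time reads.

```c
#include <stdint.h>

/*
 * Hypothetical illustration, not taken from the BIND 9 sources.
 * Naive computation: unsigned arithmetic wraps around when the
 * retention window has already passed.
 */
static uint32_t
stale_ttl_naive(uint32_t rdh_ttl, uint32_t serve_stale_ttl, uint32_t now) {
	return rdh_ttl + serve_stale_ttl - now;
}

/*
 * Clamped computation: do the addition in 64 bits so it cannot
 * overflow, and return 0 once the retention window has passed.
 * Assumes rdh_ttl is an absolute expiry time in seconds.
 */
static uint32_t
stale_ttl_remaining(uint32_t rdh_ttl, uint32_t serve_stale_ttl, uint32_t now) {
	uint64_t retain_until = (uint64_t)rdh_ttl + serve_stale_ttl;

	if (retain_until <= now) {
		return 0; /* would be negative: clamp to 0 */
	}
	if (retain_until - now > UINT32_MAX) {
		return UINT32_MAX; /* saturate instead of truncating */
	}
	return (uint32_t)(retain_until - now);
}
```

With a retention window that ended 604799 seconds ago, stale_ttl_naive() yields 4294362497 (2^32 - 604799), the example value quoted in the commit message, while stale_ttl_remaining() yields 0.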

12999 commits