bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-05-27 12:13:20 -04:00

Author	SHA1	Message	Date
Mark Andrews	698d9285d4	Only pick CPUs that are part of the existing CPU affinity set when assigning a thread to a CPU.	2020-12-21 15:09:57 +01:00
Mark Andrews	08df4f420a	Reorder in library dependancy order	2020-12-21 01:09:45 +00:00
Michał Kępień	2c44266a5a	Update library API versions	2020-12-16 22:05:50 +01:00
Ondřej Surý	ef685bab5c	Print warning when falling back to increment soa serial method When using the `unixtime` or `date` method to update the SOA serial, `named` and `dnssec-signzone` would silently fallback to `increment` method to prevent the new serial number to be smaller than the old serial number (using the serial number arithmetics). Add a warning message when such fallback happens.	2020-12-11 10:48:28 +01:00
Mark Andrews	c51ef23c22	Implement ipv4only.arpa forward and reverse zones as per RFC 8880.	2020-12-11 14:16:40 +11:00
Ondřej Surý	7ba18870dc	Reformat sources using clang-format-11	2020-12-08 18:36:23 +01:00
Ondřej Surý	5caf33feda	Fix HAVE_SO_REUSEPORT_LB macro name definition A typo in macro definition caused the load-balanced sockets to be disabled even on platforms with existing support for load-balanced sockets.	2020-12-04 14:45:22 +01:00
Ondřej Surý	87c5867202	Use sock->nchildren instead of mgr->nworkers when initializing NM On Windows, we were limiting the number of listening children to just 1, but we were then iterating on mgr->nworkers. That lead to scheduling more async_*listen() than actually allocated and out-of-bound read-write operation on the heap.	2020-12-03 18:03:25 +01:00
Ondřej Surý	151852f428	Fix datarace when UDP/TCP connect fails and we are in nmthread When we were in nmthread, the isc__nm_async_<proto>connect() function executes in the same thread as the isc__nm_<proto>connect() and on a failure, it would block indefinitely because the failure branch was setting sock->active to false before the condition around the wait had a chance to skip the WAIT(). This also fixes the zero system test being stuck on FreeBSD 11, so we re-enable the test in the commit.	2020-12-03 13:56:34 +01:00
Ondřej Surý	4adeaab73d	Add FreeBSD connection timeout socket option On FreeBSD, the option to configure connection timeout is called TCP_KEEPINIT, use it to configure the connection timeout there. This also fixes the dangling socket problems in the unit test, so re-enable them.	2020-12-03 09:23:24 +01:00
Ondřej Surý	1d066e4bc5	Distribute queries among threads even on platforms without lb sockets On platforms without load-balancing socket all the queries would be handle by a single thread. Currently, the support for load-balanced sockets is present in Linux with SO_REUSEPORT and FreeBSD 12 with SO_REUSEPORT_LB. This commit adds workaround for such platforms that: 1. setups single shared listening socket for all listening nmthreads for UDP, TCP and TCPDNS netmgr transports 2. Calls uv_udp_bind/uv_tcp_bind on the underlying socket just once and for rest of the nmthreads only copy the internal libuv flags (should be just UV_HANDLE_BOUND and optionally UV_HANDLE_IPV6). 3. start reading on UDP socket or listening on TCP socket The load distribution among the nmthreads is uneven, but it's still better than utilizing just one thread for processing all the incoming queries	2020-12-03 09:20:33 +01:00
Ondřej Surý	94afea9325	Don't use stack allocated buffer for uv_write() On FreeBSD, the stack is destroyed more aggressively than on Linux and that revealed a bug where we were allocating the 16-bit len for the TCPDNS message on the stack and the buffer got garbled before the uv_write() sendback was executed. Now, the len is part of the uvreq, so we can safely pass it to the uv_write() as the req gets destroyed after the sendcb is executed.	2020-12-03 08:58:16 +01:00
Michał Kępień	88f96faba8	Make netmgr initialize and cleanup Winsock itself On Windows, WSAStartup() needs to be called to initialize Winsock before any sockets are created or else socket() calls will return error code 10093 (WSANOTINITIALISED). Since BIND's Network Manager is intended to work as a reusable networking library, it should take care of calling WSAStartup() - and its cleanup counterpart, WSACleanup() - itself rather than relying on external code to do it. Add the necessary WSAStartup() and WSACleanup() calls to isc_nm_start() and isc_nm_destroy(), respectively.	2020-12-02 22:36:23 +01:00
Michał Kępień	dc2e1dea86	Extend log message for unexpected socket() errors Make sure the error code is included in the message logged for unexpected socket creation errors in order to facilitate troubleshooting on Windows.	2020-12-02 22:36:23 +01:00
Michal Nowak	8499825525	Add uv_wrap.h to libisctest_la_SOURCES uv_wrap.h is included in tcp_test.c and udp_test.c and therefore should be listed in lib/isc/tests/Makefile.am, otherwise unit test run from distribution tarball fails to compile: tcp_test.c:37:10: fatal error: uv_wrap.h: No such file or directory #include "uv_wrap.h" ^~~~~~~~~~~ udp_test.c:37:10: fatal error: uv_wrap.h: No such file or directory #include "uv_wrap.h" ^~~~~~~~~~~	2020-12-02 16:08:18 +01:00
Ondřej Surý	2e1dd56d0b	Fix the data race in accessing the isc_nm_t timers The following TSAN report about accessing the mgr timers (mgr->init, mgr->idle, mgr->keepalive and mgr->advertised) has been fixed in this commit: ================== WARNING: ThreadSanitizer: data race (pid=2746) Read of size 4 at 0x7b440008a948 by thread T18: #0 isc__nm_tcpdns_read /home/ondrej/Projects/bind9/lib/isc/netmgr/tcpdns.c:849:25 (libisc.so.1706+0x2ba0f) #1 isc_nm_read /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:1679:3 (libisc.so.1706+0x22258) #2 tcpdns_connect_connect_cb /home/ondrej/Projects/bind9/lib/isc/tests/tcpdns_test.c:363:2 (tcpdns_test+0x4bc5fb) #3 isc__nm_async_connectcb /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:1816:2 (libisc.so.1706+0x228c9) #4 isc__nm_connectcb /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:1791:3 (libisc.so.1706+0x22713) #5 tcpdns_connect_cb /home/ondrej/Projects/bind9/lib/isc/netmgr/tcpdns.c:343:2 (libisc.so.1706+0x2d89d) #6 uv__stream_connect /home/ondrej/Projects/tsan/libuv/src/unix/stream.c:1381:5 (libuv.so.1+0x27c18) #7 uv__stream_io /home/ondrej/Projects/tsan/libuv/src/unix/stream.c:1298:5 (libuv.so.1+0x25977) #8 uv__io_poll /home/ondrej/Projects/tsan/libuv/src/unix/linux-core.c:462:11 (libuv.so.1+0x2e795) #9 uv_run /home/ondrej/Projects/tsan/libuv/src/unix/core.c:385:5 (libuv.so.1+0x158ec) #10 nm_thread /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:530:11 (libisc.so.1706+0x1c94a) Previous write of size 4 at 0x7b440008a948 by main thread: #0 isc_nm_settimeouts /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:490:12 (libisc.so.1706+0x1dda5) #1 tcpdns_recv_two /home/ondrej/Projects/bind9/lib/isc/tests/tcpdns_test.c:601:2 (tcpdns_test+0x4bad0e) #2 cmocka_run_one_test_or_fixture <null> (libcmocka.so.0+0x70be) #3 __libc_start_main /build/glibc-vjB4T1/glibc-2.28/csu/../csu/libc-start.c:308:16 (libc.so.6+0x2409a) Location is heap block of size 281 at 0x7b440008a840 allocated by main thread: #0 malloc <null> (tcpdns_test+0x42864b) #1 default_memalloc /home/ondrej/Projects/bind9/lib/isc/mem.c:713:8 (libisc.so.1706+0x6d261) #2 mem_get /home/ondrej/Projects/bind9/lib/isc/mem.c:622:8 (libisc.so.1706+0x69b9c) #3 isc___mem_get /home/ondrej/Projects/bind9/lib/isc/mem.c:1044:9 (libisc.so.1706+0x6d379) #4 isc__mem_get /home/ondrej/Projects/bind9/lib/isc/mem.c:2432:10 (libisc.so.1706+0x6889e) #5 isc_nm_start /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:203:8 (libisc.so.1706+0x1c219) #6 nm_setup /home/ondrej/Projects/bind9/lib/isc/tests/tcpdns_test.c:244:11 (tcpdns_test+0x4baaa4) #7 cmocka_run_one_test_or_fixture <null> (libcmocka.so.0+0x70fd) #8 __libc_start_main /build/glibc-vjB4T1/glibc-2.28/csu/../csu/libc-start.c:308:16 (libc.so.6+0x2409a) Thread T18 'isc-net-0000' (tid=3513, running) created by main thread at: #0 pthread_create <null> (tcpdns_test+0x429e7b) #1 isc_thread_create /home/ondrej/Projects/bind9/lib/isc/pthreads/thread.c:73:8 (libisc.so.1706+0x8476a) #2 isc_nm_start /home/ondrej/Projects/bind9/lib/isc/netmgr/netmgr.c:271:3 (libisc.so.1706+0x1c66a) #3 nm_setup /home/ondrej/Projects/bind9/lib/isc/tests/tcpdns_test.c:244:11 (tcpdns_test+0x4baaa4) #4 cmocka_run_one_test_or_fixture <null> (libcmocka.so.0+0x70fd) #5 __libc_start_main /build/glibc-vjB4T1/glibc-2.28/csu/../csu/libc-start.c:308:16 (libc.so.6+0x2409a) SUMMARY: ThreadSanitizer: data race /home/ondrej/Projects/bind9/lib/isc/netmgr/tcpdns.c:849:25 in isc__nm_tcpdns_read ================== ThreadSanitizer: reported 1 warnings	2020-12-02 10:14:31 +01:00
Ondřej Surý	d6d2fbe0e9	Avoid netievent allocations when the callbacks can be called directly After turning the users callbacks to be asynchronous, there was a visible performance drop. This commit prevents the unnecessary allocations while keeping the code paths same for both asynchronous and synchronous calls. The same change was done to the isc__nm_udp_{read,send} as those two functions are in the hot path.	2020-12-02 09:45:05 +01:00
Ondřej Surý	3e5ee16eb6	Disable the new netmgr tests on non-Linux platforms The new netmgr tests are not-yet fine-tuned for non-Linux platforms. Disable them now, so we can move forward and fix the tests of *BSD in the next iteration. This commit will get reverted when we add support for netmgr multi-threading.	2020-12-01 17:24:15 +01:00
Ondřej Surý	0ba697fe8c	The cmocka.h header MUST be included before isc/util.h gets included The isc/util.h header redefine the DbC checks (REQUIRE, INSIST, ...) to be cmocka "fake" assertions. However that means that cmocka.h needs to be included after UNIT_TESTING is defined but before isc/util.h is included. Because isc/util.h is included in most of the project headers this means that the sequence MUST be: #define UNIT_TESTING #include <cmocka.h> #include <isc/_anything_.h> See !2204 for other header requirements for including cmocka.h.	2020-12-01 16:47:25 +01:00
Ondřej Surý	634bdfb16d	Refactor netmgr and add more unit tests This is a part of the works that intends to make the netmgr stable, testable, maintainable and tested. It contains a numerous changes to the netmgr code and unfortunately, it was not possible to split this into smaller chunks as the work here needs to be committed as a complete works. NOTE: There's a quite a lot of duplicated code between udp.c, tcp.c and tcpdns.c and it should be a subject to refactoring in the future. The changes that are included in this commit are listed here (extensively, but not exclusively): * The netmgr_test unit test was split into individual tests (udp_test, tcp_test, tcpdns_test and newly added tcp_quota_test) * The udp_test and tcp_test has been extended to allow programatic failures from the libuv API. Unfortunately, we can't use cmocka mock() and will_return(), so we emulate the behaviour with #define and including the netmgr/{udp,tcp}.c source file directly. * The netievents that we put on the nm queue have variable number of members, out of these the isc_nmsocket_t and isc_nmhandle_t always needs to be attached before enqueueing the netievent_<foo> and detached after we have called the isc_nm_async_<foo> to ensure that the socket (handle) doesn't disappear between scheduling the event and actually executing the event. * Cancelling the in-flight TCP connection using libuv requires to call uv_close() on the original uv_tcp_t handle which just breaks too many assumptions we have in the netmgr code. Instead of using uv_timer for TCP connection timeouts, we use platform specific socket option. * Fix the synchronization between {nm,async}_{listentcp,tcpconnect} When isc_nm_listentcp() or isc_nm_tcpconnect() is called it was waiting for socket to either end up with error (that path was fine) or to be listening or connected using condition variable and mutex. Several things could happen: 0. everything is ok 1. the waiting thread would miss the SIGNAL() - because the enqueued event would be processed faster than we could start WAIT()ing. In case the operation would end up with error, it would be ok, as the error variable would be unchanged. 2. the waiting thread miss the sock->{connected,listening} = `true` would be set to `false` in the tcp_{listen,connect}close_cb() as the connection would be so short lived that the socket would be closed before we could even start WAIT()ing * The tcpdns has been converted to using libuv directly. Previously, the tcpdns protocol used tcp protocol from netmgr, this proved to be very complicated to understand, fix and make changes to. The new tcpdns protocol is modeled in a similar way how tcp netmgr protocol. Closes: #2194, #2283, #2318, #2266, #2034, #1920 * The tcp and tcpdns is now not using isc_uv_import/isc_uv_export to pass accepted TCP sockets between netthreads, but instead (similar to UDP) uses per netthread uv_loop listener. This greatly reduces the complexity as the socket is always run in the associated nm and uv loops, and we are also not touching the libuv internals. There's an unfortunate side effect though, the new code requires support for load-balanced sockets from the operating system for both UDP and TCP (see #2137). If the operating system doesn't support the load balanced sockets (either SO_REUSEPORT on Linux or SO_REUSEPORT_LB on FreeBSD 12+), the number of netthreads is limited to 1. * The netmgr has now two debugging #ifdefs: 1. Already existing NETMGR_TRACE prints any dangling nmsockets and nmhandles before triggering assertion failure. This options would reduce performance when enabled, but in theory, it could be enabled on low-performance systems. 2. New NETMGR_TRACE_VERBOSE option has been added that enables extensive netmgr logging that allows the software engineer to precisely track any attach/detach operations on the nmsockets and nmhandles. This is not suitable for any kind of production machine, only for debugging. * The tlsdns netmgr protocol has been split from the tcpdns and it still uses the old method of stacking the netmgr boxes on top of each other. We will have to refactor the tlsdns netmgr protocol to use the same approach - build the stack using only libuv and openssl. * Limit but not assert the tcp buffer size in tcp_alloc_cb Closes: #2061	2020-12-01 16:47:07 +01:00
Mark Andrews	ab0bf49203	Adjust default value of "max-recursion-queries" Since the queries sent towards root and TLD servers are now included in the count (as a result of the fix for CVE-2020-8616), "max-recursion-queries" has a higher chance of being exceeded by non-attack queries. Increase its default value from 75 to 100.	2020-12-01 23:47:23 +11:00
Mark Andrews	49b9219bb3	Fix misplaced declaration	2020-12-01 10:46:58 +11:00
Mark Andrews	304df53991	Add comment about cookie sizes	2020-11-26 20:48:46 +00:00
Mark Andrews	0e3b1f5a25	Tighten DNS COOKIE response handling Fallback to TCP when we have already seen a DNS COOKIE response from the given address and don't have one in this UDP response. This could be a server that has turned off DNS COOKIE support, a misconfigured anycast server with partial DNS COOKIE support, or a spoofed response. Falling back to TCP is the correct behaviour in all 3 cases.	2020-11-26 20:48:46 +00:00
Diego Fronza	95add01643	Silence coverity warnings in query.c Return value of dns_db_getservestalerefresh() and dns_db_getservestalettl() functions were previously unhandled. This commit purposefully ignore those return values since there is no side effect if those results are != ISC_R_SUCCESS, it also supress Coverity warnings.	2020-11-26 14:55:14 +00:00
Matthijs Mekking	dff01583db	Add one missing check to nsec3param unit test Caught this missing check with clang-build while backporting #1620 to the v9_16 branch.	2020-11-26 12:40:22 +00:00
Michał Kępień	f440600126	Use proper cmocka macros for pointer checks Make sure pointer checks in unit tests use cmocka assertion macros dedicated for use with pointers instead of those dedicated for use with integers or booleans.	2020-11-26 13:10:40 +01:00
Michał Kępień	2bb0a5dcdb	Update library API versions	2020-11-26 12:12:17 +01:00
Matthijs Mekking	64db30942d	Add NSEC3PARAM unit test, refactor zone.c Add unit test to ensure the right NSEC3PARAM event is scheduled in 'dns_zone_setnsec3param()'. To avoid scheduling and managing actual tasks, split up the 'dns_zone_setnsec3param()' function in two parts: 1. 'dns__zone_lookup_nsec3param()' that will check if the requested NSEC3 parameters already exist, and if a new salt needs to be generated. 2. The actual scheduling of the new NSEC3PARAM event (if needed).	2020-11-26 10:43:59 +01:00
Matthijs Mekking	6b5d7357df	Detect NSEC3 salt collisions When generating a new salt, compare it with the previous NSEC3 paremeters to ensure the new parameters are different from the previous ones. This moves the salt generation call from 'bin/named/*.s' to 'lib/dns/zone.c'. When setting new NSEC3 parameters, you can set a new function parameter 'resalt' to enforce a new salt to be generated. A new salt will also be generated if 'salt' is set to NULL. Logging salt with zone context can now be done with 'dnssec_log', removing the need for 'dns_nsec3_log_salt'.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	7878f300ff	Move logging of salt in separate function There may be a desire to log the salt without losing the context of log module, level, and category.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	6f97bb6b1f	Change nsec3param salt config to saltlen Upon request from Mark, change the configuration of salt to salt length. Introduce a new function 'dns_zone_checknsec3aram' that can be used upon reconfiguration to check if the existing NSEC3 parameters are in sync with the configuration. If a salt is used that matches the configured salt length, don't change the NSEC3 parameters.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	00c5dabea3	Add check for NSEC3 and key algorithms NSEC3 is not backwards compatible with key algorithms that existed before the RFC 5155 specification was published.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	7039c5f805	Check nsec3param configuration values Check 'nsec3param' configuration for the number of iterations. The maximum number of iterations that are allowed are based on the key size (see https://tools.ietf.org/html/rfc5155#section-10.3). Check 'nsec3param' configuration for correct salt. If the string is not "-" or hex-based, this is a bad salt.	2020-11-26 10:43:27 +01:00
Matthijs Mekking	114af58ee2	Support for NSEC3 in dnssec-policy Implement support for NSEC3 in dnssec-policy. Store the configuration in kasp objects. When configuring a zone, call 'dns_zone_setnsec3param' to queue an nsec3param event. This will ensure that any previous chains will be removed and a chain according to the dnssec-policy is created. Add tests for dnssec-policy zones that uses the new 'nsec3param' option, as well as changing to new values, changing to NSEC, and changing from NSEC.	2020-11-26 10:43:27 +01:00
Matthijs Mekking	f7ca96c805	Add kasp nsec3param configuration Add configuration and documentation on how to enable NSEC3 when using dnssec-policy for signing your zones.	2020-11-26 10:43:27 +01:00
Matthijs Mekking	84a4273074	Move generate_salt function to lib/dns/nsec3 We will be using this function also on reconfig, so it should have a wider availability than just bin/named/server.	2020-11-26 10:43:27 +01:00
Michał Kępień	ea54a932d2	Convert add_quota() to a function cppcheck 2.2 reports the following false positive: lib/isc/tests/quota_test.c:71:21: error: Array 'quotas[101]' accessed at index 110, which is out of bounds. [arrayIndexOutOfBounds] isc_quota_t *quotas[110]; ^ The above is not even an array access, so this report is obviously caused by a cppcheck bug. Yet, it seems to be triggered by the presence of the add_quota() macro, which should really be a function. Convert the add_quota() macro to a function in order to make the code cleaner and to prevent the above cppcheck 2.2 false positive from being triggered.	2020-11-25 12:45:47 +01:00
Michał Kępień	0b6216d1c7	Silence cppcheck 2.2 false positive in udp_recv() cppcheck 2.2 reports the following false positive: lib/dns/dispatch.c:1239:14: warning: Either the condition 'resp==NULL' is redundant or there is possible null pointer dereference: resp. [nullPointerRedundantCheck] if (disp != resp->disp) { ^ lib/dns/dispatch.c:1210:11: note: Assuming that condition 'resp==NULL' is not redundant if (resp == NULL) { ^ lib/dns/dispatch.c:1239:14: note: Null pointer dereference if (disp != resp->disp) { ^ Apparently this version of cppcheck gets confused about conditional "goto" statements because line 1239 can never be reached if 'resp' is NULL. Move a code block to prevent the above false positive from being reported without affecting the processing logic.	2020-11-25 12:45:47 +01:00
JINMEI Tatuya	75cdd758ed	implementation of hook-based asynchronous functionality previously query plugins were strictly synchrounous - the query process would be interrupted at some point, data would be looked up or a change would be made, and then the query processing would resume immediately. this commit enables query plugins to initiate asynchronous processes and resume on a completion event, as with recursion.	2020-11-24 15:11:39 -08:00
JINMEI Tatuya	9c8dae041d	ns_query refactoring for hook-based recursion several small changes to query processing to make it easier to use hook-based recursion (and other asynchronous functionlity) later. - recursion quota check is now a separate function, check_recursionquota(), which is called by ns_query_recurse(). - pass isc_result to query_nxdomain() instead of bool. the value of 'empty_wild' will be determined in the function based on the passed result. this is similar to query_nodata(), and makes the signatures of the two functions more consistent. - pass the current 'result' value into plugin hooks.	2020-11-24 15:11:39 -08:00
Mark Andrews	38d6f68de4	add dns_dns64_findprefix	2020-11-25 08:25:29 +11:00
Mark Andrews	e980affba0	Fix DNAME when QTYPE is CNAME or ANY The synthesised CNAME is not supposed to be followed when the QTYPE is CNAME or ANY as the lookup is satisfied by the CNAME record.	2020-11-19 10:18:01 +11:00
Ondřej Surý	a49d88568f	Turn all the callback to be always asynchronous When calling the high level netmgr functions, the callback would be sometimes called synchronously if we catch the failure directly, or asynchronously if it happens later. The synchronous call to the callback could create deadlocks as the caller would not expect the failed callback to be executed directly.	2020-11-11 22:15:40 +01:00
Diego Fronza	581e2a8f28	Check 'stale-refresh-time' when sharing cache between views This commit ensures that, along with previous restrictions, a cache is shareable between views only if their 'stale-refresh-time' value are equal.	2020-11-11 12:53:24 -03:00
Diego Fronza	5e47a13fd0	Warn if 'stale-refresh-time' < 30 (default) RFC 8767 recommends that attempts to refresh to be done no more frequently than every 30 seconds. Added check into named-checkconf, which will warn if values below the default are found in configuration. BIND will also log the warning during loading of configuration in the same fashion.	2020-11-11 12:53:23 -03:00
Diego Fronza	4827ad0ec4	Add stale-refresh-time option Before this update, BIND would attempt to do a full recursive resolution process for each query received if the requested rrset had its ttl expired. If the resolution fails for any reason, only then BIND would check for stale rrset in cache (if 'stale-cache-enable' and 'stale-answer-enable' is on). The problem with this approach is that if an authoritative server is unreachable or is failing to respond, it is very unlikely that the problem will be fixed in the next seconds. A better approach to improve performance in those cases, is to mark the moment in which a resolution failed, and if new queries arrive for that same rrset, try to respond directly from the stale cache, and do that for a window of time configured via 'stale-refresh-time'. Only when this interval expires we then try to do a normal refresh of the rrset. The logic behind this commit is as following: - In query.c / query_gotanswer(), if the test of 'result' variable falls to the default case, an error is assumed to have happened, and a call to 'query_usestale()' is made to check if serving of stale rrset is enabled in configuration. - If serving of stale answers is enabled, a flag will be turned on in the query context to look for stale records: query.c:6839 qctx->client->query.dboptions \|= DNS_DBFIND_STALEOK; - A call to query_lookup() will be made again, inside it a call to 'dns_db_findext()' is made, which in turn will invoke rbdb.c / cache_find(). - In rbtdb.c / cache_find() the important bits of this change is the call to 'check_stale_header()', which is a function that yields true if we should skip the stale entry, or false if we should consider it. - In check_stale_header() we now check if the DNS_DBFIND_STALEOK option is set, if that is the case we know that this new search for stale records was made due to a failure in a normal resolution, so we keep track of the time in which the failured occured in rbtdb.c:4559: header->last_refresh_fail_ts = search->now; - In check_stale_header(), if DNS_DBFIND_STALEOK is not set, then we know this is a normal lookup, if the record is stale and the query time is between last failure time + stale-refresh-time window, then we return false so cache_find() knows it can consider this stale rrset entry to return as a response. The last additions are two new methods to the database interface: - setservestale_refresh - getservestale_refresh Those were added so rbtdb can be aware of the value set in configuration option, since in that level we have no access to the view object.	2020-11-11 12:53:23 -03:00
Michal Nowak	9088052225	Drop unused headers	2020-11-11 10:08:12 +01:00
Mark Andrews	244f84a84b	Address TSAN error between dns_rbt_findnode() and subtractrdataset(). Having dns_rbt_findnode() in previous_closest_nsec() check of node->data is a optimisation that triggers a TSAN error with subtractrdataset(). find_closest_nsec() still needs to check if the NSEC record are active or not and look for a earlier NSEC records if it isn't. Set DNS_RBTFIND_EMPTYDATA so node->data isn't referenced without the node lock being held. WARNING: ThreadSanitizer: data race Read of size 8 at 0x000000000001 by thread T1 (mutexes: read M1, read M2): #0 dns_rbt_findnode lib/dns/rbt.c:1708 #1 previous_closest_nsec lib/dns/rbtdb.c:3760 #2 find_closest_nsec lib/dns/rbtdb.c:3942 #3 zone_find lib/dns/rbtdb.c:4091 #4 dns_db_findext lib/dns/db.c:536 #5 query_lookup lib/ns/query.c:5582 #6 ns__query_start lib/ns/query.c:5505 #7 query_setup lib/ns/query.c:5229 #8 ns_query_start lib/ns/query.c:11380 #9 ns__client_request lib/ns/client.c:2166 #10 processbuffer netmgr/tcpdns.c:230 #11 dnslisten_readcb netmgr/tcpdns.c:309 #12 read_cb netmgr/tcp.c:832 #13 <null> <null> #14 <null> <null> Previous write of size 8 at 0x000000000001 by thread T2 (mutexes: write M3): #0 subtractrdataset lib/dns/rbtdb.c:7133 #1 dns_db_subtractrdataset lib/dns/db.c:742 #2 diff_apply lib/dns/diff.c:368 #3 dns_diff_apply lib/dns/diff.c:459 #4 do_one_tuple lib/dns/update.c:247 #5 update_one_rr lib/dns/update.c:275 #6 delete_if_action lib/dns/update.c:689 #7 foreach_rr lib/dns/update.c:471 #8 delete_if lib/dns/update.c:716 #9 dns_update_signaturesinc lib/dns/update.c:1948 #10 receive_secure_serial lib/dns/zone.c:15637 #11 dispatch lib/isc/task.c:1152 #12 run lib/isc/task.c:1344 #13 <null> <null> Location is heap block of size 130 at 0x000000000028 allocated by thread T3: #0 malloc <null> #1 default_memalloc lib/isc/mem.c:713 #2 mem_get lib/isc/mem.c:622 #3 mem_allocateunlocked lib/isc/mem.c:1268 #4 isc___mem_allocate lib/isc/mem.c:1288 #5 isc__mem_allocate lib/isc/mem.c:2453 #6 isc___mem_get lib/isc/mem.c:1037 #7 isc__mem_get lib/isc/mem.c:2432 #8 create_node lib/dns/rbt.c:2239 #9 dns_rbt_addnode lib/dns/rbt.c:1202 #10 dns_rbtdb_create lib/dns/rbtdb.c:8668 #11 dns_db_create lib/dns/db.c:118 #12 receive_secure_db lib/dns/zone.c:16154 #13 dispatch lib/isc/task.c:1152 #14 run lib/isc/task.c:1344 #15 <null> <null> Mutex M1 (0x000000000040) created at: #0 pthread_rwlock_init <null> #1 isc_rwlock_init lib/isc/rwlock.c:39 #2 dns_rbtdb_create lib/dns/rbtdb.c:8527 #3 dns_db_create lib/dns/db.c:118 #4 receive_secure_db lib/dns/zone.c:16154 #5 dispatch lib/isc/task.c:1152 #6 run lib/isc/task.c:1344 #7 <null> <null> Mutex M2 (0x000000000044) created at: #0 pthread_rwlock_init <null> #1 isc_rwlock_init lib/isc/rwlock.c:39 #2 dns_rbtdb_create lib/dns/rbtdb.c:8600 #3 dns_db_create lib/dns/db.c:118 #4 receive_secure_db lib/dns/zone.c:16154 #5 dispatch lib/isc/task.c:1152 #6 run lib/isc/task.c:1344 #7 <null> <null> Mutex M3 (0x000000000046) created at: #0 pthread_rwlock_init <null> #1 isc_rwlock_init lib/isc/rwlock.c:39 #2 dns_rbtdb_create lib/dns/rbtdb.c:8600 #3 dns_db_create lib/dns/db.c:118 #4 receive_secure_db lib/dns/zone.c:16154 #5 dispatch lib/isc/task.c:1152 #6 run lib/isc/task.c:1344 #7 <null> <null> Thread T1 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create pthreads/thread.c:73 #2 isc_nm_start netmgr/netmgr.c:232 #3 create_managers bin/named/main.c:909 #4 setup bin/named/main.c:1223 #5 main bin/named/main.c:1523 Thread T2 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create pthreads/thread.c:73 #2 isc_taskmgr_create lib/isc/task.c:1434 #3 create_managers bin/named/main.c:915 #4 setup bin/named/main.c:1223 #5 main bin/named/main.c:1523 Thread T3 (running) created by main thread at: #0 pthread_create <null> #1 isc_thread_create pthreads/thread.c:73 #2 isc_taskmgr_create lib/isc/task.c:1434 #3 create_managers bin/named/main.c:915 #4 setup bin/named/main.c:1223 #5 main bin/named/main.c:1523 SUMMARY: ThreadSanitizer: data race lib/dns/rbt.c:1708 in dns_rbt_findnode	2020-11-10 20:17:48 +00:00
Matthijs Mekking	b7856d2675	Cleanup duplicate definitions in query.h	2020-11-10 14:42:47 +00:00

1 2 3 4 5 ...

12931 commits