bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-03-18 00:26:11 -04:00

Author	SHA1	Message	Date
Evan Hunt	dd88a4cdfc	reduce MAX_RESTARTS to 11 the number of steps that can be followed in a CNAME chain before terminating the lookup has been reduced from 16 to 11. (this is a hard-coded value, but will be made configurable later.) (cherry picked from commit `05d78671bb`)	2024-08-07 15:36:14 -07:00
Evan Hunt	14bce7e275	add debug logging when creating or attaching to a query counter fctx_create() now logs at debug level 9 when the fctx attaches to an existing counter or creates a new one. (cherry picked from commit `825f3d68c5`)	2024-08-07 15:36:14 -07:00
Evan Hunt	18e39d989f	apply max-recursion-queries quota to validator queries previously, validator queries for DNSKEY and DS records were not counted toward the quota for max-recursion-queries; they are now. (cherry picked from commit `af7db89513`)	2024-08-07 15:36:09 -07:00
Evan Hunt	5ab4cae4ed	attach query counter to NS fetches there were cases in resolver.c when queries for NS records were started without passing a pointer to the parent fetch's query counter; as a result, the max-recursion-queries quota for those queries started counting from zero, instead of sharing the limit for the parent fetch, making the quota ineffective in some cases. (cherry picked from commit `d3b7e92783`)	2024-08-07 14:51:44 -07:00
Ondřej Surý	acabe271c5	Disassociate the SSL object from the cached SSL_SESSION When the SSL object was destroyed, it would invalidate all SSL_SESSION objects including the cached, but not yet used, TLS session objects. Properly disassociate the SSL object from the SSL_SESSION before we store it in the TLS session cache, so we can later destroy it without invalidating the cached TLS sessions. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Artem Boldariev <artem@isc.org> Co-authored-by: Aram Sargsyan <aram@isc.org> (cherry picked from commit `c11b736e44`)	2024-08-07 16:01:03 +00:00
Ondřej Surý	875755d9ea	Attach/detach to the listening child socket when accepting TLS When TLS connection (TLSstream) connection was accepted, the children listening socket was not attached to sock->server and thus it could have been freed before all the accepted connections were actually closed. In turn, this would cause us to call isc_tls_free() too soon - causing cascade errors in pending SSL_read_ex() in the accepted connections. Properly attach and detach the children listening socket when accepting and closing the server connections. (cherry picked from commit `684f3eb8e6`)	2024-08-07 17:20:03 +02:00
Ondřej Surý	9615f5b348	Don't loop indefinitely when isc_task quantum is 'unlimited' Don't run more events than already scheduled. If the quantum is set to a high value, the task_run() would execute already scheduled, and all new events that result from running event->ev_action(). Setting quantum to a number of scheduled events will postpone events scheduled after we enter the loop here to the next task_run() invocation.	2024-08-07 08:27:15 +02:00
Ondřej Surý	236de53c52	Use EXIT_SUCCESS and EXIT_FAILURE Instead of randomly using -1 or 1 as a failure status, properly utilize the EXIT_FAILURE define that's platform specific (as it should be). (cherry picked from commit76997983fde02d9c32aa23bda30b65f1ebd4178c)	2024-08-06 15:19:06 +02:00
Evan Hunt	18d7be118f	raise the log level of priming failures when a priming query is complete, it's currently logged at level ISC_LOG_INFO, regardless of success or failure. we are now changing it to ISC_LOG_NOTICE in the case of failure and ISC_LOG_DEBUG(1) in case of success. (cherry picked from commit `a84d54c6ff`)	2024-08-05 15:31:38 +02:00
JINMEI Tatuya	b9bef2cc89	add a trivial wrapper for uv_stream_get_write_queue_size	2024-08-05 10:27:37 +00:00
Mark Andrews	b0d32302b2	Remove unnecessary operations Decrementing optlen immediately before calling continue is unneccesary and inconsistent with the rest of dns_message_pseudosectiontoyaml and dns_message_pseudosectiontotext. Coverity was also reporting an impossible false positive overflow of optlen (CID 499061). 4176 } else if (optcode == DNS_OPT_CLIENT_TAG) { 4177 uint16_t id; 4178 ADD_STRING(target, "; CLIENT-TAG:"); 4179 if (optlen == 2U) { 4180 id = isc_buffer_getuint16(&optbuf); 4181 snprintf(buf, sizeof(buf), " %u\n", id); 4182 ADD_STRING(target, buf); CID 499061: (#1 of 1): Overflowed constant (INTEGER_OVERFLOW) overflow_const: Expression optlen, which is equal to 65534, underflows the type that receives it, an unsigned integer 16 bits wide. 4183 optlen -= 2; 4184 POST(optlen); 4185 continue; 4186 } 4187 } else if (optcode == DNS_OPT_SERVER_TAG) { (cherry picked from commit `47338c2c87`)	2024-08-02 06:43:03 +00:00
Mark Andrews	b8363ffff9	Add missing period to generated IPv4 6to4 name The period between the most significant nibble of the IPv4 address and the 2.0.0.2.IP6.ARPA suffix was missing resulting in the wrong name being checked. (cherry picked from commit `bca63437a1`)	2024-08-02 00:09:17 +00:00
Mark Andrews	4704c28cab	Cleanup old clang-format string splitting (cherry picked from commit `6d1c7beb15`)	2024-08-01 15:58:16 +10:00
Mark Andrews	7643a0322f	Remove false positive qname minimisation error Don't report qname minimisation NXDOMAIN errors when the result is NXDOMAIN. (cherry picked from commit `f78beca942`)	2024-08-01 15:58:16 +10:00
Mark Andrews	8fefb27f76	Fix yaml output In yaml mode we emit a string for each question and record. Certain names and data could result in invalid yaml being produced. Use single quote string for all questions and records. This requires that single quotes get converted to two quotes within the string. (cherry picked from commit `393d7fa78e`)	2024-08-01 03:48:12 +00:00
Mark Andrews	ce8356905c	Properly reject zero length ALPN in commatxt_fromtext ALPN are defined as 1*255OCTET in RFC 9460. commatxt_fromtext was not rejecting invalid inputs produces by missing a level of escaping which where later caught be dns_rdata_fromwire on reception. These inputs should have been rejected svcb in svcb 1 1.svcb alpn=\,abc svcb1 in svcb 1 1.svcb alpn=a\,\,abc and generated 00 03 61 62 63 and 01 61 00 02 61 62 63 respectively. The correct inputs to include commas in the alpn requires double escaping. svcb in svcb 1 1.svcb alpn=\\,abc svcb1 in svcb 1 1.svcb alpn=a\\,\\,abc and generate 04 2C 61 62 63 and 06 61 2C 2C 61 62 63 respectively. (cherry picked from commit `b51c9eb797`)	2024-08-01 01:10:53 +00:00
Aram Sargsyan	946931ccb7	Return SERVFAIL for a too long CNAME chain Due to the maximum query restart limitation a long CNAME chain it is cut after 16 queries but named still returns NOERROR. Return SERVFAIL instead and the partial answer. (cherry picked from commit `b621f1d88e`)	2024-07-31 15:14:43 +00:00
Mark Andrews	2994d6d700	Properly compute the physical memory size On a 32 bit machine casting to size_t can still lead to an overflow. Cast to uint64_t. Also detect all possible negative values for pages and pagesize to silence warning about possible negative value. 39#if defined(_SC_PHYS_PAGES) && defined(_SC_PAGESIZE) 1. tainted_data_return: Called function sysconf(_SC_PHYS_PAGES), and a possible return value may be less than zero. 2. assign: Assigning: pages = sysconf(_SC_PHYS_PAGES). 40 long pages = sysconf(_SC_PHYS_PAGES); 41 long pagesize = sysconf(_SC_PAGESIZE); 42 3. Condition pages == -1, taking false branch. 4. Condition pagesize == -1, taking false branch. 43 if (pages == -1 \|\| pagesize == -1) { 44 return (0); 45 } 46 5. overflow: The expression (size_t)pages * pagesize might be negative, but is used in a context that treats it as unsigned. CID 498034: (#1 of 1): Overflowed return value (INTEGER_OVERFLOW) 6. return_overflow: (size_t)pages * pagesize, which might have underflowed, is returned from the function. 47 return ((size_t)pages * pagesize); 48#endif /* if defined(_SC_PHYS_PAGES) && defined(_SC_PAGESIZE) */ (cherry picked from commit `e8dbc5db92`)	2024-07-31 07:30:39 +00:00
Mark Andrews	c399388a66	Do not update find.result_v4 and find.result_v6 These values are supposed to be static for the life of the find and clean_finds_at_name was updating them resulting in TSAN error reports. WARNING: ThreadSanitizer: data race Write of size 4 at 0x000000000001 by thread T1 (mutexes: write M1, write M2): #0 clean_finds_at_name lib/dns/adb.c:1537 #1 fetch_callback lib/dns/adb.c:4009 #2 task_run lib/isc/task.c:815 #3 isc_task_run lib/isc/task.c:896 #4 isc__nm_async_task netmgr/netmgr.c:848 #5 process_netievent netmgr/netmgr.c:920 #6 process_queue netmgr/netmgr.c:1013 #7 process_all_queues netmgr/netmgr.c:767 #8 async_cb netmgr/netmgr.c:796 #9 uv__async_io /usr/src/libuv-v1.44.1/src/unix/async.c:163 #10 isc__trampoline_run lib/isc/trampoline.c:189 Previous read of size 4 at 0x000000000001 by thread T2: #0 findname lib/dns/resolver.c:3749 #1 fctx_getaddresses lib/dns/resolver.c:3993 #2 fctx_try lib/dns/resolver.c:4390 #3 rctx_nextserver lib/dns/resolver.c:10356 #4 rctx_done lib/dns/resolver.c:10503 #5 resquery_response lib/dns/resolver.c:8511 #6 udp_recv lib/dns/dispatch.c:638 #7 isc__nm_async_readcb netmgr/netmgr.c:2885 #8 isc__nm_readcb netmgr/netmgr.c:2858 #9 udp_recv_cb netmgr/udp.c:650 #10 isc__nm_udp_read_cb netmgr/udp.c:1057 #11 uv__udp_recvmsg /usr/src/libuv-v1.44.1/src/unix/udp.c:303 #12 isc__trampoline_run lib/isc/trampoline.c:189 (cherry picked from commit `53a5f50e9d`)	2024-07-31 05:52:18 +00:00
Matthijs Mekking	8ec554e3de	No longer update key lifetime if key is retired The key lifetime should no longer be adjusted if the key is being retired earlier, for example because a manual rollover was started. This would falsely be seen as a dnssec-policy lifetime reconfiguration, and would adjust the retire/removed time again. This also means we should update the status output, and the next rollover scheduled is now calculated using (retire-active) instead of key lifetime. (cherry picked from commit `129973ebb0`)	2024-07-30 13:38:04 +02:00
Matthijs Mekking	7ab4a35820	Update key lifetime and metadata after reconfig If dnssec-policy is reconfigured and the key lifetime has changed, update existing keys with the new lifetime and adjust the retire and removed timing metadata accordingly. If the key has no lifetime yet, just initialize the lifetime. It may be that the retire/removed timing metadata has already been set. Skip keys which goal is not set to omnipresent. These keys are already in the progress of retiring, or still unused. (cherry picked from commit `1cec0b0448`)	2024-07-30 13:37:50 +02:00
Artem Boldariev	c33b3d26f6	TCP/TLS DNS: unthrottle only when all input data processing This commit ensures that we restart reading only when all DNS data in the input buffer is processed so the we will not get into the situation when the buffer is overrun.	2024-07-03 15:08:01 +02:00
Ondřej Surý	e31190e704	Reset the TCP connection on a failed send When sending fails, the ns__client_request() would not reset the connection and continue as nothing is happening. This comes from the model that we don't care about failed UDP sends because datagrams are unreliable anyway, but it greatly affects TCP connections with keep-alive. The worst case scenario is as follows: 1. the 3-way TCP handshake gets completed 2. the libuv calls the "uv_connection_cb" callback 3. the TCP connection gets queue because of the tcp-clients quota 4. the TCP client sends as many DNS messages as the buffers allow 5. the TCP connection gets dropped by the client due to the timeout 6. the TCP connection gets accepted by the server 7. the data already sent by the client gets read 8. all sending fails immediately because the TCP connection is dead 9. we consume all the data in the buffer in a very tight loop As it doesn't make sense to trying to process more data on the TCP connection when the sending is failing, drop the connection immediately on the first sending error. (cherry picked from commit `bf9fd2a6ff`)	2024-07-03 09:10:30 +02:00
Ondřej Surý	4b7c61381f	Throttle the reading when writes are asynchronous Be more aggressive when throttling the reading - when we can't send the outgoing TCP synchronously with uv_try_write(), we start throttling the reading immediately instead of waiting for the send buffers to fill up. This should not affect behaved clients that read the data from the TCP on the other end. (cherry picked from commit `bc3e713317`)	2024-07-03 09:10:20 +02:00
Ondřej Surý	7f99d7e565	Be smarter about refusing to add many RR types to the database Instead of outright refusing to add new RR types to the cache, be a bit smarter: 1. If the new header type is in our priority list, we always add either positive or negative entry at the beginning of the list. 2. If the new header type is negative entry, and we are over the limit, we mark it as ancient immediately, so it gets evicted from the cache as soon as possible. 3. Otherwise add the new header after the priority headers (or at the head of the list). 4. If we are over the limit, evict the last entry on the normal header list. (cherry picked from commit `57cd34441a`)	2024-07-01 15:04:43 +02:00
Ondřej Surý	c2650f5331	Expand the list of the priority types Add HTTPS, SVCB, SRV, PTR, NAPTR, DNSKEY and TXT records to the list of the priority types that are put at the beginning of the slabheader list for faster access and to avoid eviction when there are more types than the max-types-per-name limit. (cherry picked from commit `b27c6bcce8`)	2024-07-01 13:14:57 +02:00
Artem Boldariev	d4b1f7f239	Use smaller pools of requests and handles for sockets This commit ensures that socket objects use smaller sizes for its internal requests and handles pools. That prevents a memory allocator from thrashing.	2024-06-18 17:54:17 +03:00
Artem Boldariev	16c1d1eb2e	Avoid indefinite send re-scheduling in TLS DNS When a peer is not reading the data we are sending it was for the TLS DNS code to end up in a situation when it would indefinitely reschedule send requests, effectively turning the 'uv_loop' into a busy loop that would consume CPU cycles in endless efforts to send outgoing data. The main reason for that was only one send buffer dedicated for sends: the code would re-queue sends until it is empty - that would never happen when the remote side is not reading data. That seems like an omission from the older day of the Network Manager as it is quiet simple to make the code use multiple buffers for sends. That ultimately breaks the cycle of futile send request rescheduling. As a side effect, this commit also gets rid of one memory copying on a hot path.	2024-06-18 11:58:59 +03:00
Artem Boldariev	c71a61c44b	Introduce TCP throttling into TLS DNS code Throttling functionality was omitted from the `c6f13f12cd`. This commit fixes that, taking into account the latest developments in this area.	2024-06-18 11:58:59 +03:00
Artem Boldariev	eb4678e0b8	Do not un-throttle TCP connections on isc_nm_read() Due to omission it was possible to un-throttle a TCP connection previously throttled due to the peer not reading back data we are sending. In particular, that affected DoH code, but it could also affect other transports (the current or future ones) that pause/resume reading according to its internal state. (cherry picked from commit d228aa8bbb944fbd04baf22d151fde5c33561e26)	2024-06-18 11:58:59 +03:00
Mark Andrews	9cfd20cd90	Clear qctx->zversion Clear qctx->zversion when clearing qctx->zrdataset et al in lib/ns/query.c:qctx_freedata. The uncleared pointer could lead to an assertion failure if zone data needed to be re-saved which could happen with stale data support enabled. (cherry picked from commit 179fb3532ab8d4898ab070b2db54c0ce872ef709)	2024-06-10 19:20:06 +02:00
Petr Špaček	bef3d2cca3	Remove support for SIG(0) message verification	2024-06-10 19:02:49 +02:00
Matthijs Mekking	a97f52263b	Log rekey failure as error if too many records By default we log a rekey failure on debug level. We should probably change the log level to error. We make an exception for when the zone is not loaded yet, it often happens at startup that a rekey is run before the zone is fully loaded. (cherry picked from commit 68b840c731527e01699afaf084559152124b717a)	2024-06-10 18:51:28 +02:00
Matthijs Mekking	7bb36ae56e	Log error when update fails The new "too many records" error can make an update fail without the error being logged. This commit fixes that. (cherry picked from commit 558923e5405894cf976d102f0d246a28bdbb400c)	2024-06-10 18:51:28 +02:00
Evan Hunt	b89e516420	fix a memory leak that could occur when signing when signatures were not added because of too many types already existing at a node, the diff was not being cleaned up; this led to a memory leak being reported at shutdown. (cherry picked from commit 2825bdb1ae5be801e7ed603ba2455ed9a308f1f7)	2024-06-10 18:51:27 +02:00
Ondřej Surý	39d3e2a8ec	Add a limit to the number of RR types for single name Previously, the number of RR types for a single owner name was limited only by the maximum number of the types (64k). As the data structure that holds the RR types for the database node is just a linked list, and there are places where we just walk through the whole list (again and again), adding a large number of RR types for a single owner named with would slow down processing of such name (database node). Add a configurable limit to cap the number of the RR types for a single owner. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-types-per-name configuration option that can be configured globally, per-view and per-zone. (cherry picked from commit 00d16211d6368b99f070c1182d8c76b3798ca1db)	2024-06-10 18:50:06 +02:00
Ondřej Surý	e699ef939e	Add a limit to the number of RRs in RRSets Previously, the number of RRs in the RRSets were internally unlimited. As the data structure that holds the RRs is just a linked list, and there are places where we just walk through all of the RRs, adding an RRSet with huge number of RRs inside would slow down processing of said RRSets. Add a configurable limit to cap the number of the RRs in a single RRSet. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-records-per-type configuration option that can be configured globally, per-view and per-zone. (cherry picked from commit 3fbd21f69a1bcbd26c4c00920e7b0a419e8762fc)	2024-06-10 18:50:03 +02:00
Ondřej Surý	964891a794	Limit the number of DNS message processed from a single TCP read The single TCP read can create as much as 64k divided by the minimum size of the DNS message. This can clog the processing thread and trash the memory allocator because we need to do as much as ~20k allocations in a single UV loop tick. Limit the number of the DNS messages processed in a single UV loop tick to just single DNS message and limit the number of the outstanding DNS messages back to 23. This effectively limits the number of pipelined DNS messages to that number (this is the limit we already had before). This reverts commit `780a89012d`.	2024-06-10 18:43:46 +02:00
Ondřej Surý	0b1d70ed2a	Remove the extra memory context with own arena for sending (cherry picked from commit 8d4cc41c291f8a77a723ae8e62533538b3632d50)	2024-06-10 18:43:46 +02:00
Ondřej Surý	3f6b7f57a6	Replace the tcp_buffers memory pool with static per-loop buffer As a single thread can process only one TCP send at the time, we don't really need a memory pool for the TCP buffers, but it's enough to have a single per-loop (client manager) static buffer that's being used to assemble the DNS message and then it gets copied into own sending buffer. In the future, this should get optimized by exposing the uv_try API from the network manager, and first try to send the message directly and allocate the sending buffer only if we need to send the data asynchronously. (cherry picked from commit 297cc840fbaf34b9dfa1d02d88a023cd5bf5dc4a)	2024-06-10 18:43:46 +02:00
Aram Sargsyan	4e70342142	ns_client: reuse TCP send buffers Constantly allocating, reallocating and deallocating 64K TCP send buffers by 'ns_client' instances takes too much CPU time. There is an existing mechanism to reuse the ns_clent_t structure associated with the handle using 'isc_nmhandle_getdata/_setdata' (see ns_client_request()), but it doesn't work with TCP, because every time ns_client_request() is called it gets a new handle even for the same TCP connection, see the comments in streamdns_on_complete_dnsmessage(). To solve the problem, we introduce an array of available (unused) TCP buffers stored in ns_clientmgr_t structure so that a 'client' working via TCP can have a chance to reuse one (if there is one) instead of allocating a new one every time.	2024-06-10 18:43:45 +02:00
Ondřej Surý	c6f13f12cd	Throttle reading from TCP if the sends are not getting through When TCP client would not read the DNS message sent to them, the TCP sends inside named would accumulate and cause degradation of the service. Throttle the reading from the TCP socket when we accumulate enough DNS data to be sent. Currently this is limited in a way that a single largest possible DNS message can fit into the buffer. (cherry picked from commit 26006f7b44474819fac2a76dc6cd6f69f0d76828)	2024-06-10 18:43:44 +02:00
Artem Boldariev	998522e68e	Keep the endpoints set reference within an HTTP/2 socket This commit ensures that an HTTP endpoints set reference is stored in a socket object associated with an HTTP/2 stream instead of referencing the global set stored inside a listener. This helps to prevent an issue like follows: 1. BIND is configured to serve DoH clients; 2. A client is connected and one or more HTTP/2 stream is created. Internal pointers are now pointing to the data on the associated HTTP endpoints set; 3. BIND is reconfigured - the new endpoints set object is created and promoted to all listeners; 4. The old pointers to the HTTP endpoints set data are now invalid. Instead referencing a global object that is updated on re-configurations we now store a local reference which prevents the endpoints set objects to go out of scope prematurely. (cherry picked from commit b9b5d0c01a3a546c4a6a8b3bff8ae9dd31fee224)	2024-06-10 18:35:18 +02:00
Artem Boldariev	b601a5b781	DoH: avoid potential use after free for HTTP/2 session objects It was reported that HTTP/2 session might get closed or even deleted before all async. processing has been completed. This commit addresses that: now we are avoiding using the object when we do not need it or specifically check if the pointers used are not 'NULL' and by ensuring that there is at least one reference to the session object while we are doing incoming data processing. This commit makes the code more resilient to such issues in the future. (cherry picked from commit 0cca550dff403c6100b7c0da8f252e7967765ba7)	2024-06-10 18:35:16 +02:00
Mark Andrews	6455527830	Clear DNS_FETCHOPT_TRYSTALE_ONTIMEOUT When calling dns_resolver_createfetch in resolver.c with a callback of resume_dslookup, clear DNS_FETCHOPT_TRYSTALE_ONTIMEOUT from options as DNS_EVENT_TRYSTALE is not an expected event type and triggers a REQUIRE.	2024-06-06 07:48:49 +02:00
Mark Andrews	13be6cd991	Pass a memory context in to dns_cache_create (cherry picked from commit `87e3b9dbf3`)	2024-06-04 12:42:05 +00:00
Mark Andrews	26ad166a05	Use a new memory context when flushing the cache When the cache's memory context was in over memory state when the cache was flushed it resulted in LRU cleaning removing newly entered data in the new cache straight away until the old cache had been destroyed enough to take it out of over memory state. When flushing the cache create a new memory context for the new db to prevent this. (cherry picked from commit `5e77edd074`)	2024-06-04 12:42:05 +00:00
Mark Andrews	d3f708ba56	Update resquery_senddone handling of ISC_R_TIMEDOUT Treat timed out as an address specific error. (cherry picked from commit `56c3dcc5d7`)	2024-06-04 07:38:40 +00:00
Mark Andrews	99d2b4079f	Update resquery_senddone handling of ISC_R_CONNECTIONRESET Treat connection reset as an address specific error. (cherry picked from commit `4e3dd85b8d`)	2024-06-04 07:38:40 +00:00
Mark Andrews	e87a5e7bff	Handle ISC_R_HOSTDOWN and ISC_R_NETDOWN in resolver.c These error codes should be treated like other unreachable error codes. (cherry picked from commit `180b1e7939`)	2024-06-04 07:38:40 +00:00

1 2 3 4 5 ...

14412 commits