bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-03-10 02:01:32 -04:00

Author	SHA1	Message	Date
Evan Hunt	d4f791793e	Clarify reference counting in QP databases Change the names of the node reference counting functions and add comments to make the mechanism easier to understand: - newref() and decref() are now called qpcnode_acquire()/ qpznode_acquire() and qpcnode_release()/qpznode_release() respectively; this reflects the fact that they modify both the internal and external reference counters for a node. - qpcnode_newref() and qpznode_newref() are now called qpcnode_erefs_increment() and qpznode_erefs_increment(), and qpcnode_decref() and qpznode_decref() are now called qpcnode_erefs_decrement() and qpznode_erefs_decrement(), to reflect that they only increase and decrease the node's external reference counters, not internal.	2025-01-30 20:08:46 -08:00
Ondřej Surý	431513d8b3	Remove db_nodelock_t in favor of reference counted qpdb This removes the db_nodelock_t structure and changes the node_locks array to be composed only of isc_rwlock_t pointers. The .reference member has been moved to qpdb->references in addition to common.references that's external to dns_db API users. The .exiting member has been completely removed as it has no use when the reference counting is used correctly.	2025-01-30 16:43:02 +01:00
Ondřej Surý	36a26bfa1a	Remove origin_node from qpcache The origin_node in qpcache was always NULL, so we can remove the getoriginnode() function and origin_node pointer as the dns_db_getoriginnode() correctly returns ISC_R_NOTFOUND when the function is not implemented.	2025-01-30 16:43:02 +01:00
Ondřej Surý	814b87da64	Refactor decref() in both qpcache.c and qpzone.c Clean up the pattern in the decref() functions in both qpcache.c and qpzone.c, so that they follow a pattern similar to the one we already have in the newref() function.	2025-01-30 16:43:02 +01:00
Colin Vidal	7c5678bb03	Use DNS_EDE_OTHER instead of its literal value	2025-01-30 11:54:36 +01:00
Colin Vidal	9021f9d802	detect dup EDE with bitmap and store next pos To avoid looping to find the next position at which to store an EDE in a dns_edectx_t, add a "nextede" state which holds the next available position. Also, to avoid looping to check whether an EDE already exists in a dns_edectx_t (and so avoid a duplicate), use a bitmap to know immediately whether the EDE is there or not. Both changes apply when adding or copying EDEs. Also make the direction of dns_ede_copy more explicit, and less error-prone, by making "edectx_from" a const pointer.	2025-01-30 11:52:53 +01:00
Colin Vidal	7b01cbfb04	add lib/dns/ede.c documentation Add usage documentation for the EDE compilation unit and centralize all EDE-related macros in the same lib/dns/include/dns/ede.h header.	2025-01-30 11:52:53 +01:00
Colin Vidal	f9f41190b3	Refactor test covering dns_ede API Migrate test cases in the client_test code which were exclusively testing code that is now all wrapped inside the ede compilation unit. Those test the maximum number of EDEs, duplicate EDEs, and truncation of the text of an EDE. Also add coverage for copying EDEs from one edectx to another, as well as checking the assertion on the maximum EDE info code which can be used.	2025-01-30 11:52:53 +01:00
Ondřej Surý	2f8e0edf3b	Split and simplify the use of EDE list implementation Instead of mixing the dns_resolver and dns_validator units directly with the EDE code, split out the dns_ede functionality into its own separate compilation unit and hide the implementation details behind an abstraction. Additionally, the EDE codes are directly copied into the ns_client buffers by passing the EDE context to dns_resolver_createfetch(). This makes the dns_ede implementation simpler to use, although slightly more complicated on the inside. Co-authored-by: Colin Vidal <colin@isc.org> Co-authored-by: Ondřej Surý <ondrej@isc.org>	2025-01-30 11:52:53 +01:00
Andoni Duarte Pintado	3a64b288c1	Merge tag 'v9.21.4'	2025-01-29 17:17:18 +01:00
Michal Nowak	5dbc87730e	Use archived version of draft-icann-dnssec-keymgmt-01.txt The iana.org link is gone.	2025-01-28 12:13:57 +01:00
Colin Vidal	39c2fc4670	fix byte order in EDE logging When an EDE code is added to a message, the code is converted early to big-endian order so it can be memcpy-ed directly into the EDE buffer that will go on the wire. That earlier change forgot to update the debug logs, which still assumed the EDE code was in host byte order. Add a separate variable to distinguish the two and avoid ambiguity.	2025-01-27 11:49:44 +01:00
Colin Vidal	78274ec2b1	fix EDE 22 time out detection Extended DNS error 22 (No reachable authority) was previously detected when `fctx_expired` fired. It turns out this function is used as a "safety net" and the timeout should be caught earlier. It was working, though, because of another issue fixed by !9927. Since that change, the recursive request timeout is detected before `fctx_expired` fires, so EDE 22 is no longer added to the response message. The fix is to add the EDE 22 code in two situations: - When the dispatch code times out (rctx_timedout), the resolver code checks various properties to figure out whether it needs to make another fetch attempt. One of the parameters is the fetch expiration time. If it has expired, the whole recursion is canceled, so it now adds the EDE 22 code. - If the fetch expiration time has not expired in the case above (and the other parameters allow it), a new fetch attempt is made (fctx_query). But before the new request is actually made, the fetch expiration time is re-checked. It might have elapsed by then, in which case the whole recursion is canceled. So it now adds the EDE 22 code here as well.	2025-01-27 11:49:44 +01:00
Colin Vidal	46a58acdf5	add support for EDE code 1 and 2 Add support for EDE codes 1 (Unsupported DNSKEY Algorithm) and 2 (Unsupported DS Digest Type) which might occur during DNSSEC validation in case of an unsupported DNSKEY algorithm or DS digest type. Because DNSSEC internally kicks off various fetches, we need to copy all encountered extended errors from fetch responses to the fetch context. Upon an event, the errors from the fetch context are copied to the client response.	2025-01-24 12:26:30 +00:00
Evan Hunt	314741fcd0	deduplicate result codes ISCCC_R_SYNTAX, ISCCC_R_EXPIRED, and ISCCC_R_CLOCKSKEW have the same usage and text formats as DNS_R_SYNTAX, DNS_R_EXPIRED and DNS_R_CLOCKSKEW respectively. this was originally done because result codes were defined in separate libraries, and some tool might be linked with libisccc but not libdns. as the result codes are now defined in only one place, there's no need to retain the duplicates.	2025-01-23 15:54:57 -08:00
Evan Hunt	a19f6c6654	clean up result codes that are never used the following result codes are obsolete and have been removed from result.h and result.c: - ISC_R_NOTHREADS - ISC_R_BOUND - ISC_R_NOTBOUND - ISC_R_NOTDIRECTORY - ISC_R_EMPTY - ISC_R_NOTBLOCKING - ISC_R_INPROGRESS - ISC_R_WOULDBLOCK - DNS_R_TOOMANYHOPS - DNS_R_NOREDATA - DNS_R_BADCKSUM - DNS_R_MOREDATA - DNS_R_NOVALIDDS - DNS_R_UNKNOWNOPT - DNS_R_NOVALIDKEY - DNS_R_NTACOVERED - DST_R_COMPUTESECRETFAILURE - DST_R_NORANDOMNESS - DST_R_NOCRYPTO	2025-01-23 15:54:57 -08:00
Evan Hunt	10accd6260	clean up uses of ISC_R_NOMEMORY the isc_mem allocation functions can no longer fail; as a result, ISC_R_NOMEMORY is now rarely used: only when an external library such as libjson-c or libfstrm could return NULL. (even in these cases, arguably we should assert rather than returning ISC_R_NOMEMORY.) code and comments that mentioned ISC_R_NOMEMORY have been cleaned up, and the following functions have been changed to type void, since (in most cases) the only value they could return was ISC_R_SUCCESS: - dns_dns64_create() - dns_dyndb_create() - dns_ipkeylist_resize() - dns_kasp_create() - dns_kasp_key_create() - dns_keystore_create() - dns_order_create() - dns_order_add() - dns_peerlist_new() - dns_tkeyctx_create() - dns_view_create() - dns_zone_setorigin() - dns_zone_setfile() - dns_zone_setstream() - dns_zone_getdbtype() - dns_zone_setjournal() - dns_zone_setkeydirectory() - isc_lex_openstream() - isc_portset_create() - isc_symtab_create() (the exception is dns_view_create(), which could have returned other error codes in the event of a crypto library failure when calling isc_file_sanitize(), but that should be a RUNTIME_CHECK anyway.)	2025-01-23 15:54:57 -08:00
Matthijs Mekking	5e3aef364f	dnssec-signzone retain signature if key is offline Track inside the dns_dnsseckey structure whether we have seen the private key, or if this key only has a public key file. If the key only has a public key file, or a DNSKEY reference in the zone, mark the key 'pubkey'. In dnssec-signzone, if the key only has a public key available, consider the key to be offline. Any signatures that should be refreshed for which the key is not available, retain the signature. So in the code, 'expired' becomes 'refresh', and the new 'expired' is only used to determine whether we need to keep the signature if the corresponding key is not available (retaining the signature if it is not expired). In the 'keysthatsigned' function, we can remove: - key->force_publish = false; - key->force_sign = false; because they are redundant ('dns_dnsseckey_create' already sets these values to false).	2025-01-23 09:43:07 +00:00
Matthijs Mekking	7ae7851173	Fix possible truncation in dns_keymgr_status() If the generated status output exceeds 4096 it was silently truncated, now we output that the status was truncated.	2025-01-23 09:31:00 +01:00
Mark Andrews	89afc11389	Terminate yaml string after negative comment	2025-01-22 21:33:08 +00:00
Colin Vidal	4096f27130	add support for multiple EDE The Extended DNS Error (EDE) mechanism allows several EDEs to be raised during a DNS resolution (typically, a DNSSEC query will do multiple fetches, each of which can have an error). Add support for up to 3 EDEs in a DNS response. If duplicates occur (two EDEs with the same code; the extra text is not compared), only the first one will be part of the DNS answer. Because the maximum number of EDEs is statically fixed, the `ns_client_t` object owns a static vector of `DNS_DE_MAX_ERRORS` slots (instead of a linked list, for instance). The array can be fully filled (all slots point to an allocated `dns_ednsopt_t` object), partially filled, or empty. In the latter two cases, the first NULL slot marks the end of the EDE objects.	2025-01-22 21:07:44 +01:00
Aram Sargsyan	a6d6c3cb45	Clean up fctx->next_timeout Since the support for non-zero values of stale-answer-client-timeout was removed in `bd7463914f`, 'next_timeout' is unused. Clean it up.	2025-01-22 13:40:45 +00:00
Aram Sargsyan	87c453850c	Fix rtt calculation bug for TCP in the resolver When TCP is used, 'fctx_query()' adds one second to the rtt (round-trip time) value, but there's a bug when the decision about using TCP is made already after the calculation. Move the block of the code which looks up the peers list to decide whether to use TCP into a place that's before the rtt calculation is performed. This commit doesn't add or remove any code, it just moves the code and adds a comment block.	2025-01-22 13:40:45 +00:00
Aram Sargsyan	e61ba5865f	Use a suitable response in tcp_connected() when initiating a read When 'ISC_R_TIMEDOUT' is received in 'tcp_recv()', it times out the oldest response in the active responses queue, and only after that it checks whether other active responses have also timed out. So when setting a timeout value for a read operation after a successful connection, it makes sense to take the timeout value from the oldest response in the active queue too, because, theoretically, the responses can have different timeout values, e.g. when the TCP dispatch is shared. Currently 'resp' is always NULL. Previously when connect and read timeouts were not separated in dispatch this affected only logging, but now since we are setting a new timeout after a successful connection, we need to choose a suitable response from the active queue.	2025-01-22 13:40:45 +00:00
JINMEI Tatuya	7f4471594d	Optimize database decref by avoiding locking with refs > 1 Previously, this function always acquired a node write lock in case the reference count dropped to 0 and the node needed cleanup. In fact, the lock is unnecessary if the reference count is larger than 1, which can be handled as an "easy" case. This optimization could even be "necessary". In some extreme cases, many worker threads could repeatedly acquire and release references on the same node, resulting in severe lock contention for nothing (as the ref wouldn't decrement to 0 in most cases). This change prevents a noticeable performance drop, such as query timeouts, in such cases. Co-authored-by: JINMEI Tatuya <jtatuya@infoblox.com> Co-authored-by: Ondřej Surý <ondrej@isc.org>	2025-01-22 14:27:13 +01:00
Ondřej Surý	9f945c8b67	Shutdown the fetch context after canceling the last fetch Currently, the fetch context will continue running even when the last fetch (response) has been removed from the context, so named can process and cache the answer. This can lead to a situation where the number of outgoing recursing clients exceeds the configured number for recursive-clients. Be more stringent about the recursive-clients limit and shut down the fetch context immediately after the last fetch has been canceled from that particular fetch context.	2025-01-22 14:19:20 +01:00
Ondřej Surý	05faff6d53	Remove memory limit on ADB finds and fetches Address Database (ADB) shares the memory for the short lived ADB objects (finds, fetches, addrinfo) and the long lived ADB objects (names, entries, namehooks). This could lead to a situation where the resolver-heavy load would force evict ADB objects from the database to point where ADB is completely empty, leading to even more resolver-heavy load. Make the short lived ADB objects use the other memory context that we already created for the hashmaps. This makes the ADB overmem condition to not be triggered by the ongoing resolver fetches.	2025-01-22 14:13:35 +01:00
Aram Sargsyan	612d76b83d	Remove dispatch timeout INT16_MAX limitation In some places there was a limitation of the maximum timeout value of INT16_MAX, which is only about 32 seconds. Refactor the code to remove the limitation.	2025-01-22 11:57:53 +00:00
Aram Sargsyan	64ffbe82c0	Separate the connect and the read timeouts in dispatch The network manager layer has two different timers with their own timeout values for TCP connections: connect timeout and read timeout. Separate the connect and the read TCP timeouts in the dispatch module too.	2025-01-22 11:57:52 +00:00
Aram Sargsyan	9ccd1be482	Update the dns_dispatch_add() function's documentation The 'timedout' callback no longer exists. Remove the mentioning of the 'timedout' callback.	2025-01-22 11:52:24 +00:00
Colin Vidal	c9529c0acb	remove ISC_LINK(link) property from fetchctx Likely because of historical reasons, struct fetchctx does have a list link property but is never used as a list. Remove this link property.	2025-01-22 09:56:09 +00:00
Colin Vidal	93e6e72eb6	remove validator link from fetchctx struct fetchctx has a list of pending validators as well as a pointer to the HEAD validator. Remove the validator pointer to avoid confusion, as there is no particular reason to have it directly accessible outside of the list.	2025-01-22 09:56:09 +00:00
Artem Boldariev	937b5f8349	DoH: reduce excessive bad request logging We started using isc_nm_bad_request() more actively throughout codebase. In the case of HTTP/2 it can lead to a large count of useless "Bad Request" messages in the BIND log, as often we attempt to send such request over effectively finished HTTP/2 sessions. This commit fixes that.	2025-01-15 14:09:17 +00:00
Artem Boldariev	4ae4e255cf	Do not stop timer in isc_nm_read_stop() in manual timer mode A call to isc_nm_read_stop() would always stop the read timer, even in manual timer control mode, which was added with StreamDNS in mind. That looks like an omission that happened due to how timers are controlled in StreamDNS, where we always stop the timer before pausing reading anyway (see streamdns_on_complete_dnsmessage()). That would not work well for HTTP, though, where we might want to pause reading without stopping the timer when we want to split incoming data into multiple chunks to be processed independently. I suppose that it happened due to NM refactoring in the middle of StreamDNS development (at the time isc_nm_cancelread() and isc_nm_pauseread() were removed), as the StreamDNS code seems to be written as if timers are not stopped during a call to isc_nm_read_stop().	2025-01-15 14:09:17 +00:00
Artem Boldariev	609a41517b	DoH: introduce manual read timer control This commit introduces manual read timer control as used by StreamDNS and its underlying transports. Before that, DoH code would rely on the timer control provided by TCP, which would reset the timer any time some data arrived. Now, the timer is restarted only when a full DNS message is processed in line with other DNS transports. That change is required because we should not stop the timer when reading from the network is paused due to throttling. We need a way to drop timed-out clients, particularly those who refuse to read the data we send.	2025-01-15 14:09:17 +00:00
Artem Boldariev	3425e4b1d0	DoH: flooding clients detection This commit adds logic to make code better protected against clients that send valid HTTP/2 data that is useless from a DNS server perspective. Firstly, it adds logic that protects against clients who send too little useful (=DNS) data. We achieve that by adding a check that eventually detects such clients by their unfavorable ratio of useful to processed data after the initial grace period. The grace period is limited to processing 128 KiB of data, which should be enough for sending the largest possible DNS message in a GET request and then some. This is the main safety belt that would detect even flooding clients that initially behave well in order to fool the checks server. Secondly, in addition to the above, we introduce additional checks to detect outright misbehaving clients earlier: The code will treat clients that open too many streams (50) without sending any data for processing as flooding ones; The clients that managed to send 1.5 KiB of data without opening a single stream or submitting at least some DNS data will be treated as flooding ones. Of course, the behaviour described above is nothing more than a set of heuristic checks, so they can never be perfect. At the same time, they should be reasonable enough not to drop any valid clients, relatively easy to implement, and have negligible computational overhead.	2025-01-15 14:09:17 +00:00
Artem Boldariev	9846f395ad	DoH: process data chunk by chunk instead of all at once Initially, our DNS-over-HTTP(S) implementation would try to process as much incoming data from the network as possible. However, that might be undesirable as we might create too many streams (each effectively backed by a ns_client_t object). That is too forgiving as it might overwhelm the server and trash its memory allocator, causing high CPU and memory usage. Instead of doing that, we resort to processing incoming data using a chunk-by-chunk processing strategy. That is, we split data into small chunks (currently 256 bytes) and process each of them asynchronously. However, we can process more than one chunk at once (up to 4 currently), given that the number of HTTP/2 streams has not increased while processing a chunk. That alone is not enough, though. In addition to the above, we should limit the number of active streams: these streams for which we have received a request and started processing it (the ones for which a read callback was called), as it is perfectly fine to have more opened streams than active ones. In the case we have reached or surpassed the limit of active streams, we stop reading AND processing the data from the remote peer. The number of active streams is effectively decreased only when responses associated with the active streams are sent to the remote peer. Overall, this strategy is very similar to the one used for other stream-based DNS transports like TCP and TLS.	2025-01-15 14:09:17 +00:00
Ondřej Surý	a1982cf1bb	Limit the additional processing for large RDATA sets Limit the number of records appended to ADDITIONAL section to the names that have less than 14 records in the RDATA. This limits the number of the lookups into the database(s) during single client query. Also don't append any additional data to ANY queries. The answer to ANY is already big enough.	2025-01-14 09:57:54 +00:00
Ondřej Surý	8356179953	Rename the qpzone and qpcache methods that implement DB api All the database implementations share the same names for the methods implementing the database. That has some advantages like knowing what to expect, but it turns out that any time such method shows up in any kind of tracing - be it perf record, backtrace or anything else that uses symbol names, it is very hard to distinguish whether the find() belongs to qpcache, qpzone, builtin or sdlz implementation. Make at least the names for qpzone and qpcache unique.	2025-01-14 09:57:54 +00:00
Evan Hunt	232dac8cd5	detect when closest-encloser name is too long there was a database bug in which dns_db_find() could get a partial match for the query name, but still set foundname to match the full query name. this triggered an assertion when query_addwildcardproof() assumed that foundname would be shorter. the database bug has been fixed, but in case it happens again, we can just copy the name instead of splitting it. we will also log a warning that the closest-encloser name was invalid.	2025-01-09 17:04:08 -08:00
Evan Hunt	71e1c91695	dns_nsec3_addnsec3() can fail when iterating back when adding a new NSEC3 record, dns_nsec3_addnsec3() uses a dbiterator to seek to the newly created node and then find its predecessor. dbiterators in the qpzone use snapshots, so changes to the database are not reflected in an already-existing iterator. consequently, when we add a new node, we have to create a new iterator before we can seek to it.	2025-01-09 17:04:08 -08:00
Evan Hunt	ad4bab306c	qpzone find() function could set foundname incorrectly when a requested name is found in the QP trie during a lookup, but its records have been marked as nonexistent by a previous deletion, then it's treated as a partial match, but the foundname could be left pointing to the original qname rather than the parent. this could lead to an assertion failure in query_findclosestnsec3().	2025-01-09 17:03:51 -08:00
Aram Sargsyan	d75bdabe51	Fix a typo in dns/master.h The ISC_R_SEENINCLUDE definition does not exist, the correct one is DNS_R_SEENINCLUDE.	2025-01-08 14:00:55 +00:00
Aram Sargsyan	3d7a9fba3b	Don't disable RPZ and CATZ for zones with an $INCLUDE statement The code in zone_startload() disables RPZ and CATZ for a zone if dns_master_loadfile() returns anything other than ISC_R_SUCCESS, which makes sense, but it's an error because zone_startload() can also return DNS_R_SEENINCLUDE upon success when the zone had an $INCLUDE statement.	2025-01-08 14:00:55 +00:00
Michał Kępień	7bdf5152d6	Adjust dns_message_logpacketfrom() log prefixes Ensure the log prefixes passed to the dns_message_logpacketfrom() function by its callers do not include the word "from" as the latter is now emitted by the logfmtpacket() helper function.	2024-12-31 05:40:48 +01:00
Michał Kępień	58d38352ee	Adjust dns_message_logpacketfromto() log prefixes Ensure the log prefixes passed to the dns_message_logpacketfromto() function by its callers do not include the words "from" or "to" as those are now emitted by the logfmtpacket() helper function.	2024-12-31 05:40:48 +01:00
Michał Kępień	c5555a5ca2	Log both "from" and "to" socket in debug messages Move dns_dispentry_getlocaladdress() calls around so that they are not only invoked when dnstap support is compiled in. This function calls isc_nmhandle_localaddr(), which may issue a system call, but only if the ISC_SOCKET_DETAILS preprocessor macro is set at compile time. Pass the value extracted by dns_dispentry_getlocaladdress() to dns_message_logpacketfromto() so that it gets logged, adding useful information to the relevant debug messages.	2024-12-31 05:40:48 +01:00
Michał Kępień	4ab35f6839	Rename dns_message_logpacket() Since dns_message_logpacket() only takes a single socket address as a parameter (and it is always the sending socket's address), rename it to dns_message_logpacketfrom() so that its name better conveys its purpose and so that the difference in purpose between this function and dns_message_logpacketfromto() becomes more apparent.	2024-12-31 05:40:48 +01:00
Michał Kępień	fa073a0a63	Rename dns_message_logfmtpacket() Since dns_message_logfmtpacket() needs to be provided with both "from" and "to" socket addresses, rename it to dns_message_logpacketfromto() so that its name better conveys its purpose. Clean up the code comments for that function.	2024-12-31 05:40:48 +01:00
Michał Kępień	bafa5d3c2e	Enable logging both "from" and "to" socket Change the function prototype for dns_message_logfmtpacket() so that it takes two isc_sockaddr_t parameters: one for the sending side and another one for the receiving side. This enables debug messages to be more precise. Also adjust the function prototype for logfmtpacket() accordingly. Unlike dns_message_logfmtpacket(), this function must not require both 'from' and 'to' parameters to be non-NULL as it is still going to be used by dns_message_logpacket(), which only provides a single socket address. Adjust its log format to handle both of these cases properly. Adjust both dns_message_logfmtpacket() call sites accordingly, without actually providing the second socket address yet. (This causes the revised REQUIRE() assertion in dns_message_logfmtpacket() to fail; the issue will be addressed in a separate commit.)	2024-12-31 05:40:48 +01:00
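
The duplicate detection described in commit 9021f9d802 (a bitmap plus a "nextede" position) can be sketched in C roughly as follows. This is only an illustration under stated assumptions, not the code from lib/dns/ede.c: the type `edectx_t`, the functions `ede_add()` and `ede_copy()`, and the constants `EDE_MAX_ERRORS` and `EDE_MAX_CODE` are hypothetical names, the limit of 3 comes from commit 4096f27130, and out-of-range codes are rejected with a return value here rather than with an assertion.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical constants: up to 3 EDEs per response (per the log above),
 * and info codes small enough to index a 32-bit bitmap. */
#define EDE_MAX_ERRORS 3
#define EDE_MAX_CODE   31

/* Hypothetical context type; the real dns_edectx_t may differ. */
typedef struct {
	uint16_t codes[EDE_MAX_ERRORS]; /* stored EDE info codes */
	uint32_t seen;                  /* bit N set => code N is stored */
	size_t nextede;                 /* next free slot in codes[] */
} edectx_t;

/* Add a code; returns false on duplicate, full context, or bad code.
 * Neither check needs to loop over the stored entries. */
static bool
ede_add(edectx_t *ctx, uint16_t code) {
	assert(ctx != NULL);
	if (code > EDE_MAX_CODE) {
		return false;
	}
	uint32_t bit = UINT32_C(1) << code;
	if ((ctx->seen & bit) != 0) {
		return false; /* duplicate: only the first one is kept */
	}
	if (ctx->nextede >= EDE_MAX_ERRORS) {
		return false; /* no free slot left */
	}
	ctx->codes[ctx->nextede++] = code;
	ctx->seen |= bit;
	return true;
}

/* Copy all codes from one context to another; the source is const so
 * the direction of the copy cannot be confused. */
static void
ede_copy(edectx_t *to, const edectx_t *from) {
	assert(to != NULL && from != NULL);
	for (size_t i = 0; i < from->nextede; i++) {
		(void)ede_add(to, from->codes[i]);
	}
}
```

With a zero-initialized `edectx_t`, adding code 22 twice stores it once, and a fourth distinct code is refused because all three slots are taken.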

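The lock-avoiding decrement described in commit 7f4471594d can be illustrated with C11 atomics. This is a minimal sketch of the general technique, assuming a single reference counter and a plain mutex as a stand-in for the node write lock; `node_t` and `node_release()` are hypothetical names and the actual qpcache.c/qpzone.c code is not reproduced here.

```c
#include <assert.h>
#include <pthread.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical node type: one counter, one lock, one cleanup flag. */
typedef struct {
	atomic_uint_fast32_t refs;
	pthread_mutex_t lock; /* stand-in for the node write lock */
	bool cleaned;         /* set when the last reference is dropped */
} node_t;

/* Drop one reference. Returns true if the locked (slow) path was taken. */
static bool
node_release(node_t *node) {
	uint_fast32_t refs = atomic_load_explicit(&node->refs,
						  memory_order_acquire);
	assert(refs > 0);

	/* Easy case: while more than one reference remains, decrement
	 * with a compare-and-swap and never touch the lock. On failure
	 * the CAS reloads 'refs', so the loop re-evaluates the condition. */
	while (refs > 1) {
		if (atomic_compare_exchange_weak_explicit(
			    &node->refs, &refs, refs - 1,
			    memory_order_acq_rel, memory_order_acquire))
		{
			return false;
		}
	}

	/* Possibly the last reference: take the lock before decrementing
	 * so that cleanup is serialized with other lock holders. */
	pthread_mutex_lock(&node->lock);
	if (atomic_fetch_sub_explicit(&node->refs, 1,
				      memory_order_acq_rel) == 1)
	{
		node->cleaned = true; /* node cleanup would happen here */
	}
	pthread_mutex_unlock(&node->lock);
	return true;
}
```

Starting from three references, the first two releases return without locking and only the third takes the lock and marks the node as cleaned.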

15719 commits