opnsense-src

mirror of https://github.com/opnsense/src.git synced 2026-06-06 23:32:52 -04:00

Author	SHA1	Message	Date
Gleb Smirnoff	99e5a70046	sysent: regen for deletion of gssd_syscall and new ABI for rpctls_syscall	2025-02-01 01:00:28 -08:00
Gleb Smirnoff	030c028255	kgssapi: remove the gssd_syscall Reviewed by: brooks Differential Revision: https://reviews.freebsd.org/D48554	2025-02-01 01:00:26 -08:00
CismonX	c814172896	open.2: update description for O_PATH - Add fstatfs(), fchdir(), fchroot(), extattr_*_fd(), cap_*_get(), cap_*_limit() to the list of syscalls that can take an O_PATH fd. - Remove readlinkat() from the list, since it is already discussed in the first few lines of the paragraph. It was originally added to the list when readlinkat() adds support for non-dir fd with an empty relative path (as if with AT_EMPTY_PATH), however, such use case is also discussed in the next paragraph. - Add funlinkat() to the list, since it accepts an extra fd (of the file to be unlinked), which is worth extra mentioning. - Fix a syntax issue which causes a bogus space to be rendered before a closing parentheses. Signed-off-by: CismonX <admin@cismon.net> Reviewed by: markj, jhb MFC after: 2 weeks Pull Request: https://github.com/freebsd/freebsd-src/pull/1564	2025-01-24 20:15:09 +00:00
Mark Johnston	010ee8215f	setfib.2: Note that the number of FIBs can be adjusted after boot Reviewed by: zlei, imp MFC after: 2 weeks Sponsored by: Klara, Inc. Sponsored by: Stormshield Differential Revision: https://reviews.freebsd.org/D48545	2025-01-21 15:39:50 +00:00
Ed Maste	724e383bd4	munmap.2: Add STANDARDS and note about portability POSIX used to specify that munmap shall fail with EINVAL if the addr argument is not a multiple of the page size, but that was changed to may fail. Note that we conform to contemporary POSIX and include a brief note for portable programs. Reviewed by: brooks Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D48481	2025-01-16 12:50:47 -05:00
Ed Maste	fab411c4fd	munmap.2: Remove EINVAL for negative len len is unsigned (it is size_t), so cannot be negative. Sponsored by: The FreeBSD Foundation	2025-01-15 16:49:21 -05:00
Ed Maste	9e36aaf0c2	munmap.2: Unaligned addresses do not return error We previously claimed that non-page-aligned addresses would return EINVAL, but the address is in fact rounded down to the page boundary. Reported by: Harald Eilertsen <haraldei@anduin.net> Reviewed by: brooks Sponsored by: The FreeBSD Foundation Fixes: `dabee6fecc` ("kern_descrip.c: add fdshare()/fdcopy()") Differential Revision: https://reviews.freebsd.org/D48465	2025-01-15 13:09:37 -05:00
John Baldwin	826509a3c3	open.2: Editorial pass - Use a typical tagged list for the open flags instead of a literal block. This permits using markup in the flag descriptions. Also, drop the offset to avoid indenting the entire list. - Note that O_RESOLVE_BENEATH only applies to openat(2) - Use a clearer description of O_CLOEXEC (what it means, not the internal flag it sets) - Note that exactly one permission flag is required. - Split up a paragraph on various flags so that each flag gets its own paragraph. Some flags already had their own paragraph, so this is more consistent. It also makes it clearer which flag a sentence is talking about when a flag has more than one sentence. - Appease some errors from igor and man2ps - In the discussion about a returned directory descriptor opened with O_SEARCH, avoid the use of Fa fd since the descriptor in question is a return value and not an argument to open or openat. - Various and sundry markup and language tweaks Reviewed by: kib, emaste Differential Revision: https://reviews.freebsd.org/D48253	2025-01-03 10:48:24 -05:00
John Baldwin	9b1585384d	kqueue.2: Editorial pass - Use consistent language to describe user values unchanged by the kernel. - Replace passive language with active in a few places. - Add a history note for kqueuex() and kqueue1(). - Add an MLINK and synopsis for kqueue1(). - Various wording and markup tweaks. Reviewed by: emaste Differential Revision: https://reviews.freebsd.org/D48203	2024-12-30 14:09:48 -05:00
Gleb Smirnoff	053a988497	tcp: don't ever return ECONNRESET on close(2) The SUS doesn't mention this error code as a possible one [1]. The FreeBSD manual page specifies a possible ECONNRESET for close(2): [ECONNRESET] The underlying object was a stream socket that was shut down by the peer before all pending data was delivered. In the past it had been EINVAL (see `21367f630d`), and this EINVAL was added as a safety measure in `623dce13c6`. After conversion to ECONNRESET it had been documented in the manual page in `78e3a7fdd5`, but I bet wasn't ever tested to actually be ever returned, cause the tcp-testsuite[2] didn't exist back then. So documentation is incorrect since 2006, if my bet wins. Anyway, in the modern FreeBSD the condition described above doesn't end up with ECONNRESET error code from close(2). The error condition is reported via SO_ERROR socket option, though. This can be checked using the tcp-testsuite, temporarily disabling the getsockopt(SO_ERROR) lines using sed command [3]. Most of these getsockopt(2)s are followed by '+0.00 close(3) = 0', which will confirm that close(2) doesn't return ECONNRESET even on a socket that has the error stored, neither it is returned in the case described in the manual page. The latter case is covered by multiple tests residing in tcp-testsuite/state-event-engine/rcv-rst-*. However, the deleted block of code could be entered in a race condition between close(2) and processing of incoming packet, when connection had already been half-closed with shutdown(SHUT_WR) and sits in TCPS_LAST_ACK. This was reported in the bug 146845. With the block deleted, we will continue into tcp_disconnect() which has proper handling of INP_DROPPED. The race explanation follows. The connection is in TCPS_LAST_ACK. The network input thread acquires the tcpcb lock first, sets INP_DROPPED, acquires the socket lock in soisdisconnected() and clears SS_ISCONNECTED.
Meanwhile, the syscall thread goes through sodisconnect() which checks for SS_ISCONNECTED locklessly(!). The check passes and the thread blocks on the tcpcb lock in tcp_usr_disconnect(). Once input thread releases the lock, the syscall thread observes INP_DROPPED and returns ECONNRESET. - Thread 1: tcp_do_segment()->tcp_close()->in_pcbdrop(),soisdisconnected() - Thread 2: sys_close()...->soclose()->sodisconnect()->tcp_usr_disconnect() Note that the lockless operation in sodisconnect() isn't correct, but enforcing the socket lock there will not fix the problem. [1] https://pubs.opengroup.org/onlinepubs/9799919799/ [2] https://github.com/freebsd-net/tcp-testsuite [3] sed -i "" -Ee '/\+0\.00 getsockopt\(3, SOL_SOCKET, SO_ERROR, \[ECONNRESET\]/d' $(grep -lr ECONNRESET tcp-testsuite) PR: 146845 Reviewed by: tuexen, rrs, imp Differential Revision: https://reviews.freebsd.org/D48148	2024-12-23 10:35:49 -08:00
Olivier Certner	b6f4027ad9	setcred(2): Add manual page Reviewed by: Alexander Ziaee <concussious@runbox.com> Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D48063	2024-12-19 23:36:00 +01:00
Olivier Certner	ddb3eb4efe	New setcred() system call and associated MAC hooks This new system call allows to set all necessary credentials of a process in one go: Effective, real and saved UIDs, effective, real and saved GIDs, supplementary groups and the MAC label. Its advantage over standard credential-setting system calls (such as setuid(), seteuid(), etc.) is that it enables MAC modules, such as MAC/do, to restrict the set of credentials some process may gain in a fine-grained manner. Traditionally, credential changes rely on setuid binaries that call multiple credential system calls and in a specific order (setuid() must be last, so as to remain root for all other credential-setting calls, which would otherwise fail with insufficient privileges). This piecewise approach causes the process to transiently hold credentials that are neither the original nor the final ones. For the kernel to enforce that only certain transitions of credentials are allowed, either these possibly non-compliant transient states have to disappear (by setting all relevant attributes in one go), or the kernel must delay setting or checking the new credentials. Delaying setting credentials could be done, e.g., by having some mode where the standard system calls contribute to building new credentials but without committing them. It could be started and ended by a special system call. Delaying checking could mean that, e.g., the kernel only verifies the credentials transition at the next non-credential-setting system call (we just mention this possibility for completeness, but are certainly not endorsing it). We chose the simpler approach of a new system call, as we don't expect the set of credentials one can set to change often. 
It has the advantages that the traditional system calls' code doesn't have to be changed and that we can establish a special MAC protocol for it, by having some cleanup function called just before returning (this is a requirement for MAC/do), without disturbing the existing ones. The mac_cred_check_setcred() hook is passed the flags received by setcred() (including the version) and both the old and new kernel's 'struct ucred' instead of 'struct setcred' as this should simplify evolving existing hooks as the 'struct setcred' structure evolves. The mac_cred_setcred_enter() and mac_cred_setcred_exit() hooks are always called by pairs around potential calls to mac_cred_check_setcred(). They allow MAC modules to allocate/free data they may need in their mac_cred_check_setcred() hook, as the latter is called under the current process' lock, rendering sleepable allocations impossible. MAC/do is going to leverage these in a subsequent commit. A scheme where mac_cred_check_setcred() could return ERESTART was considered but is incompatible with proper composition of MAC modules. While here, add missing includes and declarations for standalone inclusion of <sys/ucred.h> both from kernel and userspace (for the latter, it has been working thanks to <bsm/audit.h> already including <sys/types.h>). Reviewed by: brooks Approved by: markj (mentor) Relnotes: yes Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D47618	2024-12-16 15:42:39 +01:00
Kyle Evans	74ecdf86d8	Tweak ppoll() to include 1003.1-2024 visibility, take two Note in the manpage that the 2024 edition finally added ppoll(), and also add the appropriate declarations for the correct versions of _POSIX_C_SOURCE (via __POSIX_VISIBLE). Differential Revision: https://reviews.freebsd.org/D48043	2024-12-14 22:40:16 -06:00
Kyle Evans	da5aed38d8	Revert "Tweak ppoll() to include 1003.1-2024 visibility" This reverts commit `212d7f439a`. A last minute change to remove __BSD_VISIBLE unearthed some breakage that I failed to re-test. Sigh.	2024-12-14 01:05:09 -06:00
Kyle Evans	dabf006a63	Add per-process flag to disable logsigexit I added a third value for kern.logsigexit to mean 'auto' as an abundance of caution, but I don't know how much it matters -- that can be easily consolidated back to boolean-ish. This is primarily targeted towards people running test suites under CI (e.g. buildbot, jenkins). Oftentimes tests entail segfaults that are expected, and logs get spammed -- this can be particularly high volume depending on the application. Per-process control of this behavior is desirable because they may still want to be logging legitimate segfaults, so the system-wide atomic bomb kern.logsigexit=0 is not a great option. This adds a process flag to disable it, controllable via procctl(2)/proccontrol(1); the latter knows it as "sigexitlog" due to its length, but it's referred to almost everywhere else as "sigexit_log." Reviewed by: kib (earlier version), pstef Differential Revision: https://reviews.freebsd.org/D21903	2024-12-13 23:18:30 -06:00
Kyle Evans	212d7f439a	Tweak ppoll() to include 1003.1-2024 visibility Note in the manpage that the 2024 edition finally added ppoll(), and also add the appropriate declarations for the correct versions of _POSIX_C_SOURCE. Differential Revision: https://reviews.freebsd.org/D48043	2024-12-13 22:15:19 -06:00
Brooks Davis	b9cf179622	libsys/i386/Symbol.sys.map: sort symbol names No functional change. Sponsored by: DARPA, AFRL	2024-12-11 20:31:30 +00:00
John Baldwin	8277c79017	procctl.2: Editing pass - Add some missing .Pp macros after the end of literal blocks and some lists to ensure there is a blank line before the following text. - Use an indent of Ds for nested lists to reduce excessive indentation and make the bodies of the nested list items easier to read. - Various and sundry rewordings and clarifications. Reviewed by: kib, emaste Differential Revision: https://reviews.freebsd.org/D47782	2024-12-04 09:11:56 -05:00
Edward Tomasz Napierala	60f87c7368	Regen	2024-11-29 12:10:45 +00:00
Edward Tomasz Napierala	b165e9e3ea	Add fchroot(2) This is similar to chroot(2), but takes a file descriptor instead of path. Same syscall exists in NetBSD and Solaris. It is part of a larger patch to make absolute pathnames usable in Capsicum mode, but should be useful in other contexts too. Reviewed By: brooks Sponsored by: Innovate UK Differential Revision: https://reviews.freebsd.org/D41564	2024-11-29 12:10:02 +00:00
Wolfram Schneider	fb4cdd5160	fhreadlink.2: fix old typo in the manpage PR: 282967 Approved by: kib	2024-11-25 18:38:20 +00:00
Kevin Bowling	c1e304c60c	setsockopt.2: Clarify SO_SPLICE action Reviewed by: gallatin, markj MFC after: 3 days Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D47720 Co-authored-by: Mark Johnston <markj@FreeBSD.org>	2024-11-25 11:36:00 -07:00
Wolfram Schneider	aebac84982	manpage: cross link fhreadlink(2) <-> readlink(2)	2024-11-25 09:02:34 +00:00
Ed Maste	566c039d1e	fork: Document _Fork (and fork) as POSIX 2024 Also remove some information from HISTORY that is no longer needed (and could be confusing), now that _Fork is part of a standard. Reported by: kib Reviewed by: imp, kib Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D47588	2024-11-15 23:05:40 -05:00
Ed Maste	36887e0494	sched_getcpu: Add man page Reviewed by: kib Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D47556	2024-11-13 19:32:04 -05:00
Mark Johnston	bfd03046d1	unix: Add support for atomically setting the socket mode With this patch, it is possible to call fchmod() on a unix socket prior to binding it to the filesystem namespace, so that the mode is set atomically. Without this, one has to call chmod() after bind(), leaving a window where threads can connect to the socket with the default mode. After bind(), fchmod() reverts to failing with EINVAL. This interface is copied from Linux. The behaviour of fstat() is unmodified, i.e., it continues to return the mode as set by soo_stat(). PR: 282393 Reviewed by: kib MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D47361	2024-11-03 16:46:53 +00:00
Brooks Davis	59a8b439ac	libsys: remove yield special case Reviewed by: emaste Pull Request: https://github.com/freebsd/freebsd-src/pull/1503	2024-11-01 15:45:04 +00:00
Mark Johnston	f44029e322	linker: Make linker.h more self-contained struct kld_file_stat embeds a reference to MAXPATHLEN, defined in param.h. PR: 280432 MFC after: 2 weeks	2024-10-26 14:05:56 +00:00
Li-Wen Hsu	dab59af3bc	Canonicalize the name of the FreeBSD Foundation Reviewed by: emaste Sponsored by: The FreeBSD Foundation	2024-10-24 05:03:07 +08:00
Ed Maste	92cd5abb64	membarrier: man page improvements Reported by: fernape (in D46967) Fixes: `1fc766e3b4` ("membarrier: Add manual page") Sponsored by: The FreeBSD Foundation	2024-10-19 16:18:18 -04:00
Mitchell Horne	23cb03d145	thr_kill(2): fix title Mandoc emits a STYLE warning due to the lowercase letters.	2024-10-15 17:44:52 -03:00
Graham Percival	6e1fc01180	manuals: Fix "unusual .Xr" warnings with a script These were reported by `mandoc -T lint ...` as warnings: - unusual Xr order - unusual Xr punctuation Fixes made by script in https://github.com/Tarsnap/freebsd-doc-scripts Signed-off-by: Graham Percival <gperciva@tarsnap.com> Reviewed by: mhorne, Alexander Ziaee <concussious.bugzilla@runbox.com> Sponsored by: Tarsnap Backup Inc. Pull Request: https://github.com/freebsd/freebsd-src/pull/1464	2024-10-15 17:18:14 -03:00
Simon J. Gerraty	a64729f507	Update Makefile.depend files After building packages we have a number of new and updated Makefile.depend files Reviewed by: stevek	2024-10-14 10:26:17 -07:00
Gleb Popov	e3ebc5f534	procctl(2): Clarify the ESRCH error code case Approved by: kib Differential Revision: https://reviews.freebsd.org/D47010	2024-10-08 19:58:17 +03:00
Ed Maste	8b41e693fc	libsys: connect membarrier.2 Sponsored by: The FreeBSD Foundation	2024-10-07 08:01:34 -04:00
Konstantin Belousov	3670421e21	getrlimitusage.2: add the man page Reviewed by: olce Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D46823	2024-10-07 13:50:08 +03:00
Ed Maste	1fc766e3b4	membarrier: Add manual page Add a minimal membarrier man page that documents the available cmd values and errors that can be returned. We can add more information and iterate on it in the tree. Reviewed by: kib Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D46967	2024-10-06 19:41:57 -04:00
Brooks Davis	74f6ec6fe3	lib{c,sys}: .note.GNU-stack in syscall stubs Explicitly disable executable stacks in the syscall stubs on all architectures. Previously, aarch64 and riscv64 didn't include the .note.GNU-stack note due to it being disabled by default in those ABIs. This appears to have been harmless in practice, but better to be clear in case a different compiler/linker has different defaults. This also reduces special cases in the Makefile. Reported by: jrtc27 Reviewed by: kib Differential Revision: https://reviews.freebsd.org/D44883	2024-10-04 23:01:45 +01:00
Brooks Davis	a57e881d32	sysent: regen comments	2024-10-03 18:01:30 +01:00
Brooks Davis	d9d2e3ab7c	sysent: regen comments	2024-10-01 18:46:40 +01:00
Brooks Davis	1235d276b7	lib{c,sys}: stop exposing errno symbol Officially since C11 (and in reality FreeBSD since 3.0 with commit `1b46cb523d`) errno has been defined to be a macro. Rename the symbol to __libsys_errno and move it to FBSDprivate_1.0 and confine it entirely to libsys for use by libthr. Add a FBSD_1.0 compat symbol for existing binaries that were incorrectly linked to the errno symbol during libc.so.7's lifetime. This deliberately breaks linking software that directly links to errno. Such software is broken and will fail in surprising ways if it becomes threaded (e.g., if it triggers loading of a pam or nss module that uses threads.) Reviewed by: kib Differential Revision: https://reviews.freebsd.org/D46780	2024-09-27 20:27:46 +01:00
Konstantin Belousov	927f379180	Regen	2024-09-27 18:02:23 +03:00
Konstantin Belousov	9b29fc89ae	Userspace enablement for getrlimitusage(2) Reviewed by: markj, olce Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D46747	2024-09-27 18:02:09 +03:00
Graham Percival	2878d99dfc	manuals: Misc macro typos These were reported by `mandoc -T lint` as ERROR: skipping unknown macro When these pages were rendered with `man`, the "unknown macro" meant that the entire line was omitted from the output. Obvious typos in: lib/libsys/swapon.2 lib/libsys/procctl.2 share/man/man9/firmware.9 lib/libcasper/services/cap_net/cap_net.3: 'mode' describes a function argument. lib/libsys/statfs.2: there's no .Tm command ("trademark?"), and .Tn ("tradename") is deprecated, so remove the macro entirely. usr.sbin/mfiutil/mfiutil.8: man was interpreting '/dev/' as a macro (which it didn't recognize). share/man/man4/qat.4: same issue as above, but with '0'. In this case, given the context of the previous line, rewriting as "Value '0'" seemed more appropriate. usr.sbin/mlx5tool/mlx5tool.8: typo in .Xr Signed-off-by: Graham Percival <gperciva@tarsnap.com> Sponsored by: Tarsnap Backup Inc. Reviewed by: concussious, imp Pull Request: https://github.com/freebsd/freebsd-src/pull/1417	2024-09-21 05:25:15 -06:00
Graham Percival	650056363b	manuals: Fix errors in .2 pages These were reported by `mandoc -T lint ...` as errors. fhlink.2, fhreadlink.2: remove unneeded block closing. getfh.2, procctl.2: add necessary block closing. ptrace.2: -width only takes one argument. swapon.2: <sys/vmparam.h> and <vm/swap_pager.h> weren't being displayed, because .It is for a list item whereas .In is for included files. Also, we want a blank line between <sys/ > headers and the other one. Signed-off-by: Graham Percival <gperciva@tarsnap.com> PR: 281597 Reviewed by: mhorne Sponsored by: Tarsnap Backup Inc.	2024-09-20 11:37:02 -03:00
Konstantin Belousov	54a8d1fbbf	getrlimit(2): document RLIMIT_PIPEBUF Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D46619	2024-09-20 09:46:06 +03:00
Stephen J. Kiernan	c644d3d896	libsys: Add dependencies for dirdeps build	2024-09-18 13:03:42 -04:00
Konstantin Belousov	3a2a5d6060	getrlimit(2): document RLIMIT_UMTXP Reviewed by: olce Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D46619	2024-09-15 09:30:00 +03:00
Brooks Davis	5b92737502	kcmp(2): fix whitespace in symbol list Fixes: `211bdd601e` Add kcmp(2) userspace bits	2024-09-12 12:35:04 +01:00
Mark Johnston	a1da7dc1cd	socket: Implement SO_SPLICE This is a feature which allows one to splice two TCP sockets together such that data which arrives on one socket is automatically pushed into the send buffer of the spliced socket. This can be used to make TCP proxying more efficient as it eliminates the need to copy data into and out of userspace. The interface is copied from OpenBSD, and this implementation aims to be compatible. Splicing is enabled by setting the SO_SPLICE socket option. When spliced, data that arrives on the receive buffer is automatically forwarded to the other socket. In particular, splicing is a unidirectional operation; to splice a socket pair in both directions, SO_SPLICE needs to be applied to both sockets. More concretely, when setting the option one passes the following struct: struct splice { int fd; off_t max; struct timeval idle; }; where "fd" refers to the socket to which the first socket is to be spliced, and two setsockopt(SO_SPLICE) calls are required to set up a bi-directional splice. select(), poll() and kevent() do not return when data arrives in the receive buffer of a spliced socket, as such data is expected to be removed automatically once space is available in the corresponding send buffer. Userspace can perform I/O on spliced sockets, but it will be unpredictably interleaved with splice I/O. A splice can be configured to unsplice once a certain number of bytes have been transmitted, or after a given time period. Once unspliced, the socket behaves normally from userspace's perspective. The number of bytes transmitted via the splice can be retrieved using getsockopt(SO_SPLICE); this works after unsplicing as well, up until the socket is closed or spliced again. Userspace can also manually trigger unsplicing by splicing to -1. Splicing work is handled by dedicated threads, similar to KTLS. A worker thread is assigned at splice creation time.
At some point it would be nice to have a direct dispatch mode, wherein the thread which places data into a receive buffer is also responsible for pushing it into the sink, but this requires tighter integration with the protocol stack in order to avoid reentrancy problems. Currently, sowakeup() and related functions will signal the worker thread assigned to a spliced socket. so_splice_xfer() does the hard work of moving data between socket buffers. Co-authored by: gallatin Reviewed by: brooks (interface bits) MFC after: 3 months Sponsored by: Klara, Inc. Sponsored by: Stormshield Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D46411	2024-09-10 16:51:37 +00:00


145 commits