opnsense-src

mirror of https://github.com/opnsense/src.git synced 2026-05-25 18:54:02 -04:00

Alexander Motin 558e8b6adb amd64: Stop using REP MOVSB for backward memmove()s. Enhanced REP MOVSB feature of CPUs starting from Ivy Bridge makes REP MOVSB the fastest way to copy memory in most of cases. However Intel Optimization Reference Manual says: "setting the DF to force REP MOVSB to copy bytes from high towards low addresses will experience significant performance degradation". Measurements on Intel Cascade Lake and Alder Lake, same as on AMD Zen3 show that it can drop throughput to as low as 2.5-3.5GB/s, comparing to ~10-30GB/s of REP MOVSQ or hand-rolled loop, used for non-ERMS CPUs. This patch keeps ERMS use for forward ordered memory copies, but removes it for backward overlapped moves where it does not work. This is just a cosmetic sync with kernel, since libc does not use ERMS at this time. Reviewed by: mjg MFC after: 2 weeks (cherry picked from commit `f22068d91b`)		2022-06-29 21:13:51 -04:00
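The forward/backward distinction in the commit message above can be sketched in portable C. This is only an illustration of how a memmove() picks its copy direction, not the amd64 assembly routine in string/: the name sketch_memmove is hypothetical, and the byte loops stand in for whatever forward (possibly REP MOVSB) and backward (REP MOVSQ or hand-rolled loop) paths a real implementation would use.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Illustrative only: sketch_memmove is a hypothetical name, not the
 * libc routine. It shows when a backward (high-to-low) copy is needed.
 */
void *
sketch_memmove(void *dst, const void *src, size_t len)
{
	unsigned char *d = dst;
	const unsigned char *s = src;

	if (d == s || len == 0)
		return (dst);

	/*
	 * Unsigned subtraction wraps when dst is below src, so this one
	 * comparison is true whenever dst does not start inside
	 * [src, src + len). In that case a forward copy is safe.
	 */
	if ((uintptr_t)d - (uintptr_t)s >= len) {
		/* Forward path: low to high addresses. */
		for (size_t i = 0; i < len; i++)
			d[i] = s[i];
	} else {
		/*
		 * Backward path: dst overlaps the tail of src, so copy
		 * from high to low addresses to avoid clobbering bytes
		 * that have not been read yet.
		 */
		for (size_t i = len; i > 0; i--)
			d[i - 1] = s[i - 1];
	}
	return (dst);
}
```

Only the second branch corresponds to the "backward overlapped moves" that the commit stops serving with REP MOVSB; the forward branch is the case where ERMS use is kept.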
..
gen	libc: add _get_tp() private function	2021-04-23 14:14:07 +03:00
stdlib
string	amd64: Stop using REP MOVSB for backward memmove()s.	2022-06-29 21:13:51 -04:00
sys	libc/<arch>/sys/cerror.S: fix typo	2021-04-06 03:47:34 +03:00
_fpmath.h	libc: further adoption of SPDX licensing ID tags.	2017-11-25 17:12:48 +00:00
arith.h
gd_qnan.h
Makefile.inc
static_tls.h	Fix initial exec TLS mode for dynamically loaded shared objects.	2019-03-29 17:52:57 +00:00
Symbol.map	Add usermode helpers for Intel userspace protection keys feature.	2019-02-20 09:56:23 +00:00
SYS.h	General further adoption of SPDX licensing ID tags.	2017-11-20 19:49:47 +00:00