quakeforge

mirror of https://git.code.sf.net/p/quake/quakeforge synced 2024-12-02 09:02:32 +00:00

Author	SHA1	Message	Date
Bill Currie	2a47a61bc3	[util] Add atomic ring buffer and automated tests Just head and tail are atomic, but it seems to work nicely (at least on intel). I actually had more trouble with gcc (due to accidentally testing lock-free with the wrong ring buffer... oops, but yup, gcc will happily optimize your loop to spin really really fast). Also served as a nice test for C11 threading.	2022-09-22 09:35:57 +09:00
Bill Currie	0ebb0717b0	[zone] Add failing test cases The tests fail as they exercise how the cache SHOULD work rather than how it does now. The tests do currently pass for the pending work I've done on the cache system, but while working on it, I remembered why I reworked cache allocation... The essential problem is that sounds are loaded into the cache, which is fine for synchronous output targets, but has proven to be a minefield for asynchronous output targets (JACK, ALSA). The reason for the minefield is the hunk takes priority over the cache, and is free to move cache blocks around, and even dispose of them entirely in order to satisfy memory allocations from either end of the hunk. Doing this in an entirely single-threaded process (as DOS Quake was) is perfectly safe, as the users of the cache just reload the pointer each time, and bail if it's null (meaning the block has been freed), or even cause the data to be reloaded if possible (I'm a little fuzzy on the details for that as I didn't write that code). However, in multi-threaded code, especially real-time (JACK, possibly ALSA), it's a recipe for disaster. The `4cab5b90e6` commit was a (mostly) successful attempt to mitigate the problem by allocating the cache blocks from the high-hunk (thus minimizing any movement caused by low-hunk allocations), it resulted in cache allocates and regular high-hunk allocations somehow getting intertwined: while investigating just how much memory ad_tears needs (somewhere between 192MB and 256MB), I got "trashed sentinel" errors and upon investigation, I found what looks very suspiciously like audio data written across a hunk control block. I've decided that the cache allocation algorithm should be reverted to how it was originally designed by Id (details will remain "modern"), but while working on the tests, I remembered why I had done the changes in the first place (above story). Thus the work on reverting the cache allocation can't go in until I get sound memory management independent of the cache. The tests are going in now so I have a constant reminder :)	2022-06-03 12:52:59 +09:00
Bill Currie	cfaf158ebc	[math] Add some bit-op functions Just 32-bit rounding to next higher power of two, and base 2 logarithm. Most importantly, they are suitable for use in initializers as they are constant in, constant out.	2022-01-23 13:47:14 +09:00
Bill Currie	c069e7754f	[util] Add a listener object I decided cvars and input buttons/axes need listeners so any changes to them can be propagated. This will make using cvars in bindings feasible and I have an idea for automatic imt switching that would benefit from listeners attached to buttons and cvars.	2021-11-25 13:36:02 +09:00
Bill Currie	f76964b86b	[util] Add an priority queue implementation Done via macros (like darray and ringbuffer). Might prove useful for qfvis and maybe dynamic lights.	2021-08-02 13:29:55 +09:00
Bill Currie	e4984aad17	[util] Add functions for binary heaps Sink, swim, build and sort, both "simple" and with a data parameter for the compare function.	2021-08-02 12:44:08 +09:00
Bill Currie	8fdd9c1f5a	[util] Write some tests for utf8 r/w And fix some errors with 5-byte encodings.	2021-07-27 23:29:14 +09:00
Bill Currie	24fd443ef3	[ui] Move txtbuffer and vrect tests to libs/ui Fixes make check	2021-06-13 14:29:46 +09:00
Bill Currie	bcc5686606	[util] Add fuzzy and reentrant bsearch Fuzzy bsearch is useful for finding an entry in a prefix sum array (value is >= ele[0], < ele[1]), and the reentrant version is good when data needs to be passed to the compare function. Adapted from the code used in pr_resolve.	2021-06-01 18:53:53 +09:00
Bill Currie	29e029c792	[util] Add float a simd version of the SEB And its support functions. I can't tell if it's any faster (mtwist_rand is a significant chunk of the benchmark timings, oops), but it's nice to have.	2021-03-27 23:38:10 +09:00
Bill Currie	09a10f80e1	[util] Add basic SIMD implemented vector functions They take advantage of gcc's vector_size attribute and so only cross, dot, qmul, qvmul and qrot (create rotation quaternion from two vectors) are needed at this stage as basic (per-component) math is supported natively by gcc. The provided functions work on horizontal (array-of-structs) data, ie a vec4d_t or vec4f_t represents a single vector, or traditional vector layout. Vertical layout (struct-of-arrays) does not need any special functions as the regular math can be used to operate on four vectors at a time. Functions are provided for loading a vec4 from a vec3 (4th element set to 0) and storing a vec4 into a vec3 (discarding the 4th element). With this, QF will require AVX2 support (needed for vec4d_t). Without support for doubles, SSE is possible, but may not be worthwhile for horizontal data. Fused-multiply-add is NOT used because it alters the results between unoptimized and optimized code, resulting in -mfma really meaning -mfast-math-anyway. I really do not want to have to debug issues that occur only in optimized code.	2020-12-30 18:20:11 +09:00
Bill Currie	f3682638d4	[util] Add a mini expression parser It is capable of parsing single expressions with fairly simple operations. It current supports ints, enums, cvars and (external) data structs. It is also thread-safe (in theory, needs proper testing) and the memory it uses can be mass-freed.	2020-12-21 14:22:42 +09:00
Bill Currie	af814ff9a8	[util] Add a cache-line aligned memory allocator This was inspired by Hoard: A Scalable Memory Allocator for Multithreaded Applications Emery D. Berger, Kathryn S. McKinley, Robert D. Blumofe, Paul R. Wilson, It's not anywhere near the same implementation, but it did take a few basic concepts. The idea is twofold: 1) A pool of memory from which blocks can be allocated and then freed en-mass and is fairly efficient for small (4-16 byte) blocks 2) Tread safety for use with the Vulkan renderer (and any other multi-threaded tasks). However, based on the Hoard paper, small allocations are cache-line aligned. On top of that, larger allocations are page aligned. I suspect it would help qfvis somewhat if I ever get around to tweaking qfvis to use cmem.	2020-12-21 14:14:29 +09:00
Bill Currie	e991c44232	[util] Make a minor improvement to QuatMultVec Switch from using addition to multiplication for doubling the value. Also, fix lib/util/test Makemodule to actually work (too much c&p coding).	2020-12-21 14:13:57 +09:00
Bill Currie	6d5ffa9f8e	[build] Move to non-recursive make There's still some cleanup to do, but everything seems to be working nicely: `make -j` works, `make distcheck` passes. There is probably plenty of bitrot in the package directories (RPM, debian), though. The vc project files have been removed since those versions are way out of date and quakeforge is pretty much dependent on gcc now anyway. Most of the old Makefile.am files are now Makemodule.am. This should allow for new Makefile.am files that allow local building (to be added on an as-needed bases). The current remaining Makefile.am files are for standalone sub-projects.a The installable bins are currently built in the top-level build directory. This may change if the clutter gets to be too much. While this does make a noticeable difference in build times, the main reason for the switch was to take care of the growing dependency issues: now it's possible to build tools for code generation (eg, using qfcc and ruamoko programs for code-gen).	2020-06-25 11:35:37 +09:00

15 commits