quakeforge

mirror of https://git.code.sf.net/p/quake/quakeforge synced 2024-11-30 08:00:51 +00:00

Author	SHA1	Message	Date
Bill Currie	0ebb0717b0	[zone] Add failing test cases The tests fail as they exercise how the cache SHOULD work rather than how it does now. The tests do currently pass for the pending work I've done on the cache system, but while working on it, I remembered why I reworked cache allocation... The essential problem is that sounds are loaded into the cache, which is fine for synchronous output targets, but has proven to be a minefield for asynchronous output targets (JACK, ALSA). The reason for the minefield is the hunk takes priority over the cache, and is free to move cache blocks around, and even dispose of them entirely in order to satisfy memory allocations from either end of the hunk. Doing this in an entirely single-threaded process (as DOS Quake was) is perfectly safe, as the users of the cache just reload the pointer each time, and bail if it's null (meaning the block has been freed), or even cause the data to be reloaded if possible (I'm a little fuzzy on the details for that as I didn't write that code). However, in multi-threaded code, especially real-time (JACK, possibly ALSA), it's a recipe for disaster. The `4cab5b90e6` commit was a (mostly) successful attempt to mitigate the problem by allocating the cache blocks from the high-hunk (thus minimizing any movement caused by low-hunk allocations), it resulted in cache allocates and regular high-hunk allocations somehow getting intertwined: while investigating just how much memory ad_tears needs (somewhere between 192MB and 256MB), I got "trashed sentinel" errors and upon investigation, I found what looks very suspiciously like audio data written across a hunk control block. I've decided that the cache allocation algorithm should be reverted to how it was originally designed by Id (details will remain "modern"), but while working on the tests, I remembered why I had done the changes in the first place (above story). Thus the work on reverting the cache allocation can't go in until I get sound memory management independent of the cache. The tests are going in now so I have a constant reminder :)	2022-06-03 12:52:59 +09:00
Bill Currie	db5b77c838	[cexpr] Require designated initializers for exprtype_t This will make expanding it much safer in the future.	2022-04-24 17:31:17 +09:00
Bill Currie	cfaf158ebc	[math] Add some bit-op functions Just 32-bit rounding to next higher power of two, and base 2 logarithm. Most importantly, they are suitable for use in initializers as they are constant in, constant out.	2022-01-23 13:47:14 +09:00
Bill Currie	80c5e2c3f6	[simd] Remove requirements for AVX2 for vec4d It seems gcc-11 does a pretty good job of emulating the instructions (it no longer requires avx2 for 256-bit wide vectors).	2022-01-06 18:06:56 +09:00
Bill Currie	5fb28d7c38	[math] Clean up vector component operations And add a unary op macro. Having VectorCompOp makes it easy to write macros that work for multiple data widths, which is why it and its users now use (dst, ...) instead of (..., dst) as in the past. I'll sort out the other macros later now that I know the compiler handily gives messages about the switched order (uninitialized vars etc).	2022-01-02 01:15:17 +09:00
Bill Currie	97034d9dde	[simd] Add 2d vector types For int, long, float and double. I've been meaning to add them for a while, and they're part of the new Ruamoko instructions set (which is progressing nicely).	2022-01-02 00:57:55 +09:00
Bill Currie	c11a930436	[simd] Set the homogeneous coord in the csvf test The homogeneous coord was not being initialized and thus was picking up rubbish from the stack. This is why the test would succeed in some circumstances but fail in others.	2021-12-26 12:25:06 +09:00
Bill Currie	854c92d10e	[simd] Indicate when the circumsphere is degenerate CircumSphere_vf sets the sphere radius to -1 when the points are degenerate (co-linear for three points, co-planar for four points).	2021-12-24 06:45:13 +09:00
Bill Currie	d2eb928210	[cexpr] Add a small library with some useful functions For now, just dot product, trig, and min/max/bound, but it works well as a proof of concept. The main goal was actually min. Only the list of symbols is provided, it is the user's responsibility to set up the symbol table and context.	2021-12-24 06:45:12 +09:00
Bill Currie	44102f2639	[cexpr] Support chained contexts for scoping cexpr's symbol tables currently aren't readily extended, and dynamic scoping is usually a good thing anyway. The chain of contexts is walked when a symbol is not found in the current context's symtab, but minor efforts are made to avoid checking the same symtab twice (usually cased by cloning a context but not updating the symtab).	2021-12-24 06:45:12 +09:00
Bill Currie	c8846f8007	[cexpr] Support array indexing The index is currently limited to ints, and is bounds checked (the array type has a size field indicating the number of elements).	2021-12-24 06:45:12 +09:00
Bill Currie	c069e7754f	[util] Add a listener object I decided cvars and input buttons/axes need listeners so any changes to them can be propagated. This will make using cvars in bindings feasible and I have an idea for automatic imt switching that would benefit from listeners attached to buttons and cvars.	2021-11-25 13:36:02 +09:00
Bill Currie	a01cafe972	[util] Minimize set growth At the low level, only unions can cause a set to grow. Of course, things get interesting at the higher level when infinite (inverted) sets are mixed in.	2021-08-11 12:31:03 +09:00
Bill Currie	37a5b475c0	[util] Minimize the string for infinite sets Instead of printing every representable member of an infinite set (ie, up to element 63 in a set that can hold 64 elements), only those elements up to one after the last non-member are listed. For example, {...} - {2 3} -> {0 1 4 ...} This makes reading (and testing!) infinite sets much easier.	2021-08-11 12:31:03 +09:00
Bill Currie	aa72f1dc31	[util] Fix reversed finite-infinite set union ops It looks like I tried to test it, but my tests weren't so good This seems to cover everything for the three main set ops.	2021-08-11 12:31:03 +09:00
Bill Currie	d56d8ac707	[util] Loosen up the epsilon on simd seb tests It seems my eeepc's SSE units don't get quite the same answers as does my i7's (maybe due to lack of hadd?).	2021-08-02 23:15:20 +09:00
Bill Currie	80b17623b1	[util] Fix an out-by-one in pqueue tests Showed up only when the data arrays were packed.	2021-08-02 23:08:14 +09:00
Bill Currie	f76964b86b	[util] Add an priority queue implementation Done via macros (like darray and ringbuffer). Might prove useful for qfvis and maybe dynamic lights.	2021-08-02 13:29:55 +09:00
Bill Currie	4f2113bc05	[util] Enable accidentally disabled seb tests	2021-08-02 12:44:08 +09:00
Bill Currie	e4984aad17	[util] Add functions for binary heaps Sink, swim, build and sort, both "simple" and with a data parameter for the compare function.	2021-08-02 12:44:08 +09:00
Bill Currie	9d819254d4	[util] Make a number of improvements to SEB Attempting to vis ad_tears drags a few lurking bugs out of SmallestEnclosingBall_vf: poor calculation of 2-point affine space, poor handling of duplicate points and dropped support points, poor calculation of the new center (related to duplicate points), and insufficient iterations for large point sets. qfvis (modified for cluster spheres) now loads ad_tears.	2021-07-30 14:57:47 +09:00
Bill Currie	45aa8e6504	[util] Loosen affine test epsilon for SEB Scaling the checks by 1e-6 was a little too tight for very small triangles, but 1e-5 seems to work well. This fixes SEB getting stuck for a ridiculously small (for quake) triangle in ad_tears (probably resulted from some bad math in qfbsp when generating the portal file from the bsp).	2021-07-29 15:03:54 +09:00
Bill Currie	4f51a3b406	[utils] Fix set tests for 32-bit machines	2021-07-29 14:10:18 +09:00
Bill Currie	8fdd9c1f5a	[util] Write some tests for utf8 r/w And fix some errors with 5-byte encodings.	2021-07-27 23:29:14 +09:00
Bill Currie	163d147044	[util] Give set_count a >8x speed boost I knew counting bits individually was slow, but it never really mattered until now. However, I didn't expect such a dramatic boost just by going to mapping bytes to bit counts. 16-bit words would be faster still, but the 64kB lookup table would probably start hurting cache performance, and 32-bit words (4GB table) definitely would ruin the cache. The universe isn't big enough for 64-bits :)	2021-07-27 13:54:22 +09:00
Bill Currie	cf96f340df	[util] Use same distance calc in simd CC test The test started failing when optimizing (not sure why). Using the same code to calculate the squared distance "fixes" the test.	2021-07-26 15:46:12 +09:00
Bill Currie	03921c03c5	[util] Expose set_expand and fix an out-by-one bug Having set_expand exposed is useful for loading data into a set. However, it turns out there was a bug in its size calculation in that when the requested set size was a multiple of SET_BITS (and greater than the current set size), the new set size one be SET_BITS larger than requested. There's now some tests for this :)	2021-07-26 09:54:03 +09:00
Bill Currie	1078bd9efa	[util] Implement Sys_Free for windows And get the tests so they can (sort of) be run.	2021-07-12 18:55:16 +09:00
Bill Currie	0a847f92f1	[util] Use mmap/munmap for cmem internal alloc/free This reduces the overhead needed to manage the memory blocks as the blocks are guaranteed to be page-aligned. Also, the superblock is now alllocated from within one of the memory blocks it manages. While this does slightly reduce the available cachelines within the first block (by one or two depending on 32 vs 64 bit pointers), it removes the need for an extra memory allocation (probably via malloc) for the superblock.	2021-07-12 16:33:47 +09:00
D G Turner	b799d48ccb	[simd] fix build when avx2 is not available, but avx is. This failed with errors such as: from ./include/QF/simd/vec4d.h:32, from libs/util/simd.c:37: ./include/QF/simd/vec4d.h: In function ‘qmuld’: /usr/lib/gcc/x86_64-pc-linux-gnu/10.3.0/include/avx2intrin.h:1049:1: error: inlining failed in call to ‘always_inline’ ‘_mm256_permute4x64_pd’: target specific option mismatch 1049 \| _mm256_permute4x64_pd (__m256d __X, const int __M)	2021-06-23 01:10:42 +01:00
Bill Currie	36df16eefc	[util] Fix incorrect type in test-mat3 Fixes make check for gcc-11	2021-06-13 15:00:57 +09:00
Bill Currie	24fd443ef3	[ui] Move txtbuffer and vrect tests to libs/ui Fixes make check	2021-06-13 14:29:46 +09:00
Bill Currie	713862c511	[util] Fix format issue in set test I guess I hadn't done a "make check" on 64-bit since getting windows builds to work.	2021-06-01 19:11:51 +09:00
Bill Currie	bcc5686606	[util] Add fuzzy and reentrant bsearch Fuzzy bsearch is useful for finding an entry in a prefix sum array (value is >= ele[0], < ele[1]), and the reentrant version is good when data needs to be passed to the compare function. Adapted from the code used in pr_resolve.	2021-06-01 18:53:53 +09:00
Bill Currie	0293167bd2	[util] Get simd tests working for emulated simd A bit of a mess for optimized vs unoptimized, but the tests acknowledge the differences in precision while checking that the code produces the right results allowing for that precision.	2021-06-01 18:53:53 +09:00
Bill Currie	ef9b04ba83	[util] Get tests working with sse2 It seems that i686 code generation is all over the place reguarding sse2 vs fp, with the resulting differences in carried precision. I'm not sure I'm happy with the situation, but at least it's being tested to a certain extent. Not sure if this broke basic (no sse) i686 tests.	2021-06-01 18:53:53 +09:00
Bill Currie	778c07e91f	[util] Get vectors working for non-SSE archs GCC does a fairly nice job of producing code for vector types when the hardware doesn't support SIMD, but it seems to break certain math optimization rules due to excess precision (?). Still, it works well enough for the core engine, but may not be well suited to the tools. However, so far, only qfvis uses vector types (and it's not tested yet), and tools should probably be used on suitable machines anyway (not forces, of course).	2021-06-01 18:53:53 +09:00
Bill Currie	a461c09586	[util] Make cmem consistent on 32 and 64 bit systems I don't know that the cache line size is 64 bytes on 32 bit systems, but it should be ok to assume that 64-byte alignment behaves well on systems with smaller cache lines so long as they are powers of two. This does mean there is some waste on 32-bit systems, but it should be fairly minimal (32 bytes per memblock, which manages page sized regions).	2021-06-01 18:53:52 +09:00
Bill Currie	9dca47c0e1	Make the set tests word size agnostic.	2021-06-01 18:53:52 +09:00
Bill Currie	b6ab832ed4	[simd] Add vabsf and some more tests	2021-03-28 19:49:43 +09:00
Bill Currie	29e029c792	[util] Add float a simd version of the SEB And its support functions. I can't tell if it's any faster (mtwist_rand is a significant chunk of the benchmark timings, oops), but it's nice to have.	2021-03-27 23:38:10 +09:00
Bill Currie	a3c1b2e992	[util] Rename qfplist.[ch] The name is a hold-over from before the current quakeforge tree and the QF include directory.	2021-03-21 16:13:03 +09:00
Bill Currie	4a97bc3ba5	[util] Create simd quaternion to matrix function This seems to be pretty close to as fast as it gets (might be able to do better with some shuffles of the negation constants instead of loading separate constants).	2021-03-04 17:45:10 +09:00
Bill Currie	45c0255643	[util] Add simd 4x4 matrix functions Currently just add, subtract, multiply (m m and m v).	2021-03-03 16:34:16 +09:00
Bill Currie	407ea15e35	[util] Fix some test bitrot	2021-03-02 15:05:19 +09:00
Bill Currie	55445a25f2	[util] Add some tests for the new cexpr work	2021-01-04 00:23:48 +09:00
Bill Currie	015cee7b6f	[util] Add vector-quaternion shortcut functions Care needs to be taken to ensure the right function is used with the right arguments, but with these, the need to use qconj(d\|f) for a one-off inverse rotation is removed.	2021-01-02 10:44:45 +09:00
Bill Currie	7bf90e5f4a	[util] Sort out implementation issues for simd	2021-01-02 09:55:59 +09:00
Bill Currie	1fd02322f4	[util] Clean up some minor issues in cmem I forgot to remove the noinline attribute and do a check with optimization (to catch the "pure" recommendation).	2021-01-01 22:29:31 +09:00
Bill Currie	9090c53519	[util] Add failing sub-line allocator tests I think the sub-line allocator falling over is the final source of qfvis's leaks. It certainly causes a mess of the sub-lines. But having some tests to get working sure beats scratching my head over qfvis :)	2020-12-30 18:20:11 +09:00

1 2 3

110 commits