quakeforge

mirror of https://git.code.sf.net/p/quake/quakeforge synced 2025-02-20 10:43:29 +00:00

Author	SHA1	Message	Date
Bill Currie	45c0255643	[util] Add simd 4x4 matrix functions Currently just add, subtract, multiply (m m and m v).	2021-03-03 16:34:16 +09:00
Bill Currie	9039c6975a	[util] Clean up some missed vsqrt changes	2021-01-05 08:35:53 +09:00
Bill Currie	015cee7b6f	[util] Add vector-quaternion shortcut functions Care needs to be taken to ensure the right function is used with the right arguments, but with these, the need to use qconj(d|f) for a one-off inverse rotation is removed.	2021-01-02 10:44:45 +09:00
Bill Currie	7bf90e5f4a	[util] Sort out implementation issues for simd	2021-01-02 09:55:59 +09:00
Bill Currie	3125009a7c	[util] Add vector and quaternion types to cexpr Although there's no distinction between the two at the C level, I think it's probably best to separate them in a scripting language.	2020-12-30 18:20:11 +09:00
Bill Currie	1ddd57b09e	[util] Add qconj, vtrunc, vceil and vfloor functions I had forgotten these rather critical functions. Both double and float versions are included.	2020-12-30 18:20:11 +09:00
Bill Currie	09a10f80e1	[util] Add basic SIMD implemented vector functions They take advantage of gcc's vector_size attribute, so only cross, dot, qmul, qvmul and qrot (create rotation quaternion from two vectors) are needed at this stage, as basic (per-component) math is supported natively by gcc. The provided functions work on horizontal (array-of-structs) data, i.e., a vec4d_t or vec4f_t represents a single vector (the traditional vector layout). Vertical layout (struct-of-arrays) does not need any special functions, as the regular math can be used to operate on four vectors at a time. Functions are provided for loading a vec4 from a vec3 (4th element set to 0) and storing a vec4 into a vec3 (discarding the 4th element). With this, QF will require AVX2 support (needed for vec4d_t). Without support for doubles, SSE is possible, but may not be worthwhile for horizontal data. Fused-multiply-add is NOT used because it alters the results between unoptimized and optimized code, resulting in -mfma really meaning -mfast-math-anyway. I really do not want to have to debug issues that occur only in optimized code.	2020-12-30 18:20:11 +09:00

7 commits
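
The oldest commit in this list (09a10f80e1) describes building vector math on gcc's vector_size attribute, with per-component math supplied by the compiler and only a few helpers written by hand. The following is a minimal sketch of that idea for the float case, not the code from the repository: the vec4f_t name comes from the commit message, while every sketch_* function name, the soa3f_t type, and the exact signatures are assumptions made for illustration. Element subscripting on vector types is used in place of shuffle builtins to keep the sketch short.

```c
/* 16-byte vector of four floats, using the vector_size attribute. */
typedef float vec4f_t __attribute__ ((vector_size (16)));

/* Load a 3-component vector into a vec4f_t; the 4th element is set to 0.
 * (Hypothetical name and signature.) */
static inline vec4f_t
sketch_loadvec3f (const float v3[3])
{
	vec4f_t v = { v3[0], v3[1], v3[2], 0 };
	return v;
}

/* Store the first three elements of a vec4f_t, discarding the 4th. */
static inline void
sketch_storevec3f (float v3[3], vec4f_t v)
{
	v3[0] = v[0];
	v3[1] = v[1];
	v3[2] = v[2];
}

/* Horizontal (array-of-structs) dot product over all four elements.
 * The sum is broadcast to every lane so the result stays a vector. */
static inline vec4f_t
sketch_dotf (vec4f_t a, vec4f_t b)
{
	vec4f_t p = a * b;		/* per-component multiply, native */
	float s = (p[0] + p[1]) + (p[2] + p[3]);
	vec4f_t d = { s, s, s, s };
	return d;
}

/* Horizontal cross product of the xyz parts; the 4th element of the
 * result is a[3]*b[3] - a[3]*b[3], which is 0 for finite inputs. */
static inline vec4f_t
sketch_crossf (vec4f_t a, vec4f_t b)
{
	vec4f_t a_yzx = { a[1], a[2], a[0], a[3] };
	vec4f_t b_yzx = { b[1], b[2], b[0], b[3] };
	vec4f_t c = a * b_yzx - a_yzx * b;	/* (cross.z, cross.x, cross.y, 0) */
	vec4f_t r = { c[1], c[2], c[0], c[3] };	/* rotate back to x, y, z */
	return r;
}

/* Vertical (struct-of-arrays) layout: each member holds one component
 * of four different vectors, so plain arithmetic computes four dot
 * products at once and no special helper is needed. */
typedef struct {
	vec4f_t x, y, z;
} soa3f_t;

static inline vec4f_t
sketch_dot4 (soa3f_t a, soa3f_t b)
{
	return a.x * b.x + a.y * b.y + a.z * b.z;
}
```

Whether the compiler contracts the multiplies and adds into fused operations depends on the target and floating-point flags. The commit message states that fused-multiply-add is deliberately not used, so a build following that choice would leave FMA disabled.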