quakeforge

mirror of https://git.code.sf.net/p/quake/quakeforge synced 2024-12-12 13:42:41 +00:00

Author	SHA1	Message	Date
Bill Currie	9a93bf8d4a	[qfvis] Make cluster reconstruction O(N) For most (if not all) maps. The heapsort is needed only if the clustered leafs are not contiguous, but most bsp compilers output contiguous leaf clusters, so is just a bit of protection. The difference isn't really noticeable on a fast machine, but no point in doing more work than necessary.	2021-08-08 12:34:18 +09:00
Bill Currie	b320c3352f	[util] Make set_t endian-agnostic Most of the set ops were always endian-agnostic since they were simply operating on multiple bits in parallel, but individual element add/remove/test was very endian-dependent. For the most part, this didn't matter, but it does matter very much when loading external data into a set or writing the data out (eg, for PVS).	2021-08-08 12:34:18 +09:00
Bill Currie	648ae3f877	[qfvis] Clean up the code and output a little Dead code removed, and the job progress lines are now consistent and have a job completion time when done.	2021-08-03 21:52:24 +09:00
Bill Currie	fe998f41b4	[qfvis] Convert leaf vis to cluster vis Now that only 3852 clusters need to be checked for each cluster, fat-pvs construction for ad_tears completes in about 0.7s, most of which seems to be loading, conversion, compression and writing. O(N^3) cuts both ways (hurts like crazy when N increases, does wonders when N decreases, especially by a factor of 25). And then throw in improved cache performance... I suspect having an off-line compiler is still useful, but even if qfvis's implementation never actually gets used, if cluster reconstruction is put in the engine, large maps will be feasible even for quakeworld. Just the reduced memory requirements alone will be a huge benefit (~3GB down to 1.8MB).	2021-08-03 15:48:45 +09:00
Bill Currie	8da019e31c	[qfvis] Reconstruct the leaf clusters in a bsp This is only the first half (vertical) in that the vis bits are still for the leafs rather than the clusters, but ad_tears goes from 500s to 7s for calculating the fat pvs (3852 clusters).	2021-08-03 11:37:24 +09:00
Bill Currie	42dc30ec29	[vulkan] Increase main staging buffer to 32MB 1k 32bpp sky textures need 24MB to load. Though there's always better handling of running out of stating buffer (ie, flushing and trying again).	2021-08-02 23:17:55 +09:00
Bill Currie	421047328a	[qfvis] Catch a missed winding mark stat	2021-08-02 23:17:55 +09:00
Bill Currie	d56d8ac707	[util] Loosen up the epsilon on simd seb tests It seems my eeepc's SSE units don't get quite the same answers as does my i7's (maybe due to lack of hadd?).	2021-08-02 23:15:20 +09:00
Bill Currie	80b17623b1	[util] Fix an out-by-one in pqueue tests Showed up only when the data arrays were packed.	2021-08-02 23:08:14 +09:00
Bill Currie	ec54c54226	[build] Fix some windows bitrot	2021-08-02 14:02:41 +09:00
Bill Currie	d99fb01b65	[build] Fix some distcheck bitrot	2021-08-02 13:47:00 +09:00
Bill Currie	f76964b86b	[util] Add an priority queue implementation Done via macros (like darray and ringbuffer). Might prove useful for qfvis and maybe dynamic lights.	2021-08-02 13:29:55 +09:00
Bill Currie	4f2113bc05	[util] Enable accidentally disabled seb tests	2021-08-02 12:44:08 +09:00
Bill Currie	e4984aad17	[util] Add functions for binary heaps Sink, swim, build and sort, both "simple" and with a data parameter for the compare function.	2021-08-02 12:44:08 +09:00
Bill Currie	f514345d77	[qfbsp] Show correct object counts for bsp29 files Yes, this was the goal of that size_t change, but it made sense anyway.	2021-08-01 22:05:31 +09:00
Bill Currie	674ffa0941	[util] Make bsp_t counts size_t and other bsp data counts unsigned, and clean up the resulting mess.	2021-08-01 21:54:05 +09:00
Bill Currie	40367e5bca	[qfvis] Thread the ambient sounds computation	2021-08-01 18:14:28 +09:00
Bill Currie	e671b3f230	[qfvis] Thread the portal vis compaction The compaction deals with merging all the portal visibility into cluster visibility, expanding out to leafs, and final compression.	2021-08-01 17:06:13 +09:00
Bill Currie	80a89c5e1e	[util] Write the correct bsp format id for bsp2 Oops :P	2021-08-01 14:07:24 +09:00
Bill Currie	523ab007d6	[qfvis] Produce more details base-vis stats And nicely, things add up (after fixing 32-bit overflows :P)	2021-07-30 23:07:12 +09:00
Bill Currie	ca8dcf3fa9	[qfvis] Use cluster sphere culling for base vis While this doesn't give as much of a boost as does basic sphere culling (since it's just culling sphere tests), it took ad_tears' base vis from 1000s to 720s on my machine.	2021-07-30 18:52:47 +09:00
Bill Currie	9d819254d4	[util] Make a number of improvements to SEB Attempting to vis ad_tears drags a few lurking bugs out of SmallestEnclosingBall_vf: poor calculation of 2-point affine space, poor handling of duplicate points and dropped support points, poor calculation of the new center (related to duplicate points), and insufficient iterations for large point sets. qfvis (modified for cluster spheres) now loads ad_tears.	2021-07-30 14:57:47 +09:00
Bill Currie	9461779ba7	[qfvis] Remove the cluster portals limit This removes the last of the arbitrary limits from qfvis. The goal is not so much supporting crazy maps, but more about better data usage (cluster_t is now 24 (or 16) bytes instead of 1048 (or 528). And passages isn't used (yet?)...	2021-07-29 21:03:07 +09:00
Bill Currie	fe98a513bc	[util] Add a function to check hunk pointers Its only real utility is to check that a pointer is not pointing into freed space.	2021-07-29 15:27:48 +09:00
Bill Currie	756214ca8e	[qfvis] Use unsigned for the plane side tests Doesn't make any difference to the number of instructions, but seeing sar instead of shr bothers me when working with bit patterns.	2021-07-29 15:25:37 +09:00
Bill Currie	6d312aaa63	[simd] Check the distance to the affine point As per usual, fp math finds a way to confound any epsilon test. So rather than relying entirely on test_support_points, check the distance from the sphere center to the affine point and break out of the loop if the distance is small enough (< 1% of the current radius). This allows qfvis to load ad_tears without hacks.	2021-07-29 15:15:14 +09:00
Bill Currie	45aa8e6504	[util] Loosen affine test epsilon for SEB Scaling the checks by 1e-6 was a little too tight for very small triangles, but 1e-5 seems to work well. This fixes SEB getting stuck for a ridiculously small (for quake) triangle in ad_tears (probably resulted from some bad math in qfbsp when generating the portal file from the bsp).	2021-07-29 15:03:54 +09:00
Bill Currie	4f51a3b406	[utils] Fix set tests for 32-bit machines	2021-07-29 14:10:18 +09:00
Bill Currie	72a1fef714	[qfvis] Use hunk to manage winding memory It turns out cmem is not so good for many large allocations (probably a bug in handling the blocks), but was really meant for lots of little churning allocations anyway. After an analysis of winding lifetimes, it became clear that the hunk allocator would work very well. The base windings are allocated from a global hunk (currently 1GB, plenty for even ad_tears), and ephemeral windings are allocated from a per-thread hunk of 1MB (seems to be way more than enough: gmsp3v2 uses a maximum of only 56064 bytes, and ad_tears got through 30% before I gave up on it). Any speed difference (for gmsp3v2) seems to be lost in the noise: still completing in 38.4s on my machine.	2021-07-29 11:49:18 +09:00
Bill Currie	8f376a48f8	[util] Add raw versions of hunk alloc and free They do not clear memory and thus are good for situations where speed is more critical.	2021-07-29 11:44:10 +09:00
Bill Currie	ca63c0360a	Do an audit of hunk mark usage I realized that after making the hunk 64-bit clean, I had forgotten to go through and convert all the saved marks to size_t.	2021-07-29 11:43:27 +09:00
Bill Currie	54604d9aa2	[util] Make hunk (optionally) thread-safe For now, the functions check for a null hunk pointer and use the global hunk (initialized via Memory_Init) if necessary. However, Hunk_Init is available (and used by Memory_Init) to create a hunk from any arbitrary memory block. So long as that block is 64-byte aligned, allocations within the hunk will remain 64-byte aligned.	2021-07-29 11:43:27 +09:00
Bill Currie	8fdd9c1f5a	[util] Write some tests for utf8 r/w And fix some errors with 5-byte encodings.	2021-07-27 23:29:14 +09:00
Bill Currie	e39bc83a6a	[qfvis] Optionally use utf8 to encode run lengths Adds 50 bytes to marcher's fat-pvs, but removes about 4.7MB from ad_tear's fat-pvs.	2021-07-27 23:29:14 +09:00
Bill Currie	5b4428420e	[utils] Get utf-8 writing working for up to 11 bits I need to write some automated tests for this, and reading of course, but 1 and two byte outputs look correct. Kind of sad it took sixteen years to get around to attempting to use the code :(	2021-07-27 23:29:02 +09:00
Bill Currie	b9d2882e02	[qfvis] Write out the fat-pvs file The output fat-pvs data is the difference between the base pvs and fat pvs. This currently makes for about 64kB savings for marcher.bsp, and about 233MB savings for ad_tears.bsp (or about 50% (470.7MB->237.1MB)). I expect using utf-8 encoding for the run lengths to make for even bigger savings (the second output fat-pvs leaf of marcher.bsp is all 0s, or 6 bytes in the file, which would reduce to 3 bytes using utf-8).	2021-07-27 20:04:19 +09:00
Bill Currie	9e2c474d38	[model] Ensure the pvs is not inverted Mod_DecompressVis_set (via Mod_LeafPVS_set) can be used to recycle pvs sets, but the set may have been set to everything at some stage, which is implemented by inverting the set (making the set infinite) and having 1-bits remove elements from the set. This is most definitely not wanted for pvs :) Currently undecided what to do about Mod_DecompressVis_mix, thus the fixme. Fixes the flickering lights in any map where the camera is out of the map for a single frame (eg, start.bsp, The Catacombs (hipnotic, hip2m3)).	2021-07-27 17:54:50 +09:00
Bill Currie	1f57f81a20	[qw] Use set_count to count the pvs/phs leafs	2021-07-27 14:01:08 +09:00
Bill Currie	163d147044	[util] Give set_count a >8x speed boost I knew counting bits individually was slow, but it never really mattered until now. However, I didn't expect such a dramatic boost just by going to mapping bytes to bit counts. 16-bit words would be faster still, but the 64kB lookup table would probably start hurting cache performance, and 32-bit words (4GB table) definitely would ruin the cache. The universe isn't big enough for 64-bits :)	2021-07-27 13:54:22 +09:00
Bill Currie	50740c1014	[model] Remove the confusion about numleafs The fact that numleafs did not include leaf 0 actually caused in many places due to never being sure whether to add 1. Hopefully this fixes some of the confusion. (and that comment in sv_init didn't last long :P)	2021-07-27 12:38:08 +09:00
Bill Currie	49c3dacbbc	[util] Rename set_size to set_count After seeing set_size and thinking it redundant (thought it returned the capacity of the set until I checked), I realized set_count would be a much better name (set_count (node->successors) in qfcc does make much more sense).	2021-07-27 11:52:21 +09:00
Bill Currie	05fa0538ab	[qw] Count leafs correctly for PHS This fixes some out-by-one bugs caused by there being an obo bug in the original SV_CalcPHS: space was being allocated for numleafs leafs, but numleafs does not count the world-surrounding solid leaf 0, but SV_CalcPHS generates pvs and phs data (just the "everything" set) for leaf 0, which is later used for broadcasts and the like.	2021-07-27 11:34:17 +09:00
Bill Currie	946867c82e	[qfvis] Start work on an off-line fat pvs compiler Extremely large maps take a very long time to process their PVS sets for PHS or shadows, so having an off-line compiler seems like a good idea. The data isn't written out yet, and the fat pvs code may not be optimal for cache access, but it gets through ad_tears in about 500s (12 threads, compared to 2100s single-threaded in the qw server).	2021-07-26 22:42:03 +09:00
Bill Currie	bd2ceca13a	[qw] Move the pvs and phs sets into single hunks Hunk_Alloc has a rather large per-block overhead. Also, small pvs sets (64 leafs) will fit into just the basic set structure so no need to allocate data for the maps.	2021-07-26 22:36:07 +09:00
Bill Currie	cf96f340df	[util] Use same distance calc in simd CC test The test started failing when optimizing (not sure why). Using the same code to calculate the squared distance "fixes" the test.	2021-07-26 15:46:12 +09:00
Bill Currie	c02fcee58a	[util] Make zone functions 64-bit clean This allows nq and qw clients/servers to use over 2GB of memory if necessary.	2021-07-26 15:43:57 +09:00
Bill Currie	5fc1a36d92	[gamecode] Be more forgiving of unresolved builtins If the progs code never calls the functions, then there is no problem. Thus unresolved builtin references get pointed to the bad function builtin.	2021-07-26 13:35:35 +09:00
Bill Currie	60d23bdc8f	[qfbsp] Remove all arbitrary bsp limits Planes, verts, etc can now all get crazy big.	2021-07-26 13:10:06 +09:00
Bill Currie	2e33503c4c	[vulkan] Fix a missed array index edit Caused by the view model implementation, and missed because gcc needs optimization to cache such errors (and my own error, of course).	2021-07-26 11:43:31 +09:00
Bill Currie	f47e03e606	[model] Remove 64k limit on visible leafs Modern maps can have many more leafs (eg, ad_tears has 98983 leafs). Using set_t makes dynamic leaf counts easy to support and the code much easier to read (though set_is_member and the iterators are a little slower). The main thing to watch out for is the novis set and the set returned by Mod_LeafPVS never shrink, and may have excess elements (ie, indicate that nonexistent leafs are visible).	2021-07-26 11:32:05 +09:00

... 23 24 25 26 27 ...

12690 commits