quakeforge

mirror of https://git.code.sf.net/p/quake/quakeforge synced 2024-11-29 15:41:59 +00:00

Author	SHA1	Message	Date
Bill Currie	2828500f04	[qfvis] Delay freeing of winding memory If anything, this is probably a nano-optimization, depending on how often portals are vis-rejected. I couldn't see any actual difference.	2021-08-08 12:34:18 +09:00
Bill Currie	421047328a	[qfvis] Catch a missed winding mark stat	2021-08-02 23:17:55 +09:00
Bill Currie	9461779ba7	[qfvis] Remove the cluster portals limit This removes the last of the arbitrary limits from qfvis. The goal is not so much supporting crazy maps, but more about better data usage (cluster_t is now 24 (or 16) bytes instead of 1048 (or 528). And passages isn't used (yet?)...	2021-07-29 21:03:07 +09:00
Bill Currie	72a1fef714	[qfvis] Use hunk to manage winding memory It turns out cmem is not so good for many large allocations (probably a bug in handling the blocks), but was really meant for lots of little churning allocations anyway. After an analysis of winding lifetimes, it became clear that the hunk allocator would work very well. The base windings are allocated from a global hunk (currently 1GB, plenty for even ad_tears), and ephemeral windings are allocated from a per-thread hunk of 1MB (seems to be way more than enough: gmsp3v2 uses a maximum of only 56064 bytes, and ad_tears got through 30% before I gave up on it). Any speed difference (for gmsp3v2) seems to be lost in the noise: still completing in 38.4s on my machine.	2021-07-29 11:49:18 +09:00
Bill Currie	ff4cd84891	[qfvis] Use simd vector code While whether it's any faster is debatable (it's slightly slower, but many more portals are being tested due to different rounding in the base vis stage), it's certainly easier to read.	2021-03-28 19:55:47 +09:00
Bill Currie	d072a7b99c	[qfvis] Add stats for memory usage Verbosity levels probably need more tweaking, but -v is at least a little more usable.	2021-03-27 23:04:13 +09:00
Bill Currie	f2b6b23acc	[qfvis] Switch to unsigned for various counts	2021-03-27 20:55:15 +09:00
Bill Currie	72280186bf	[qfvis] Use cmem for memory management While the main bulk of the improvement (36s down from 42s for gmsp3v2.bsp on my i7-6850K) comes from using a high-tide allocator for the windings (which necessitated using a fixed size), it is ever so slightly faster than using malloc as the back-end.	2021-03-27 20:30:35 +09:00
Bill Currie	6d5ffa9f8e	[build] Move to non-recursive make There's still some cleanup to do, but everything seems to be working nicely: `make -j` works, `make distcheck` passes. There is probably plenty of bitrot in the package directories (RPM, debian), though. The vc project files have been removed since those versions are way out of date and quakeforge is pretty much dependent on gcc now anyway. Most of the old Makefile.am files are now Makemodule.am. This should allow for new Makefile.am files that allow local building (to be added on an as-needed bases). The current remaining Makefile.am files are for standalone sub-projects.a The installable bins are currently built in the top-level build directory. This may change if the clutter gets to be too much. While this does make a noticeable difference in build times, the main reason for the switch was to take care of the growing dependency issues: now it's possible to build tools for code generation (eg, using qfcc and ruamoko programs for code-gen).	2020-06-25 11:35:37 +09:00
Bill Currie	125ef1f0ff	Move the whole separator test/creation into a function. This will make the next stage easier. (except that seems to be slower)	2013-03-19 20:39:01 +09:00
Bill Currie	f2452eb3c3	Rewrite the inner-loop of FindSeparators. For the most part, it's just refactoring the code so the plane creation and testing are in separate functions, but there is one important difference: the plane test now checks only the two points on either side of the point used to create the plane. Because the portal winding is guaranteed to be convex and planar, if both points are on the plane, all points are, and if neither point is behind the plane, no points are.a This shaved about 5 seconds off the level 4 run using 4 threads (~198s to ~193s) and about 12s from the single threaded run (~682s to ~670s (hmm, gained some time in recent changes)).	2013-03-19 17:00:00 +09:00
Bill Currie	cb096c601d	Use a per-portal rwlock for portal updates. This should make qfvis scale a little better with cpu count.	2013-03-18 15:03:11 +09:00
Bill Currie	c824e668ed	Rework some of the pthread stuff. Init/uninit is now separate from portal vising. The global lock has a better name and is now a rwlock. Use a separate lock for the stats.	2013-03-18 14:26:52 +09:00
Bill Currie	1c20a49dba	Use the recursive set allocator for mightsee. This completely removes the lock used to protect the set allocation code while keeping the use of the set api clean.	2013-03-18 13:30:50 +09:00
Bill Currie	a28ec8aa82	Revert "Allocate stack blocks and mightsee in one block." This reverts commit `1ea79e8626`. Conflicts: tools/qfvis/include/vis.h tools/qfvis/source/flow.c I've decided to do reentrant versions of the set allocators and I didn't particularly like the invasiveness of allocating sets this way.	2013-03-18 12:47:59 +09:00
Bill Currie	ad247fa12d	Rename some variables and remove some comments. The old variable names were confusing ("target" winding comes from "portal"?), and the comments were from when I really didn't understand concepts like separating planes. While they weren't wrong, they were quite inadequate and I want to write new ones.	2013-03-17 21:52:08 +09:00
Bill Currie	ccc432a7ea	Give the fields of pstack_t clearer names. And some comments.	2013-03-17 19:18:38 +09:00
Bill Currie	1ea79e8626	Allocate stack blocks and mightsee in one block. This bypasses set_new, but completely removes the use of the global lock from within RecursiveClusterFlow. This seems to give a small speedup: 203 seconds threaded.	2013-03-17 16:37:27 +09:00
Bill Currie	1d262f7dea	Clean up FindSeparators a little bit. This was testing an idea I had to remove the plane flips. It seems to have been good for the initial plane orientation, but was a slight slowdown for the pass-portal test. However, this makes the code a little easier to work with for my idea on improving the algorithm itself.	2013-03-17 10:16:47 +09:00
Bill Currie	5dba419233	Cache stack blocks and working mightsee sets. Since the stack structure in the thread data is a linked list, move the stack blocks off the program stack and into malloced memory. More importantly, when the stack block is allocated, the mightsee working set is allocated too, and as neither are freed, this greatly reduces contention for the lock. Also, because the memory is kept, single threaded time for gmsp3v2 dropped from 695s to 670s. Threaded is now about 207s (down from 350).	2013-03-16 22:58:59 +09:00
Bill Currie	2ea143283c	Rewrite mightsee_more to manipulate the sets directly. While using set operators was clearer, it was rather expensive (about 25s for gmsp3v2). qfvis now completes the map in about 695s (single threaded). About 15s faster than tyr for the same conditions (1 thread, level 4).	2013-03-16 21:51:41 +09:00
Bill Currie	195bdcb92f	Rework FindSeparators to make use of the winding direction. This is the second part of the separator search optimization from tyrutils vis. With this, qfvis is getting close to tyrutils vis when running single threaded (qfvis is suffering some nasty thread contention and thus can't get below about 350 seconds with 4 threads). 808s vs 707s.	2013-03-15 22:05:01 +09:00
Bill Currie	46d41ad9ac	Split up separator finding and winding clipping. Interesting, it makes very little (maybe faster) difference to find all the separators for levels 3 and 4. This might be due to the higher levels using most of the planes to fully clip source away. Anyway, it makes the code a little clearer (one function, one task).	2013-03-15 16:00:39 +09:00
Bill Currie	77c858060d	Add a bunch more statistics. Now I know why sphere culling was a loss: 78% of all tested target portals were trimmed by ClipToSeparators (50% eventually clipped away entirely).	2013-03-14 19:43:46 +09:00
Bill Currie	8032d1d4d1	Split out the mightsee intersection/subset tests. Having the code in separate functions makes the flow in the main loop a little easier to follow.	2013-03-14 19:43:46 +09:00
Bill Currie	eec87bd61b	Remove thread from stack_t. It really wasn't gaining anything and made reading the code a little harder.	2013-03-14 19:43:46 +09:00
Bill Currie	5d6df082f2	Move the vis stats vars into thread data. This should make the stats more reliable when running multi-threaded (chains is still random, but it seems there are set access issues).	2013-03-14 12:52:40 +09:00
Bill Currie	3232c22795	Implement mightsee updates from tyrutils. It doesn't seem to make much difference, but I suspect that's my set code not being optimal for the job.	2013-03-09 14:32:51 +09:00
Bill Currie	d1e65257b6	Implement the cached separators idea from tyrutils. I think the reason I didn't think of that when I tried to improve qfvis's performance many years ago is I just simply did not understand ClipToSeparators. However, the difference caching the separators makes is phenomenal. Before the change, single threaded qfvis would get stuck on one particular portal for at least a day (I gave up waiting), but now even a debug build will complete gmsp3v2.bsp in less than 12 minutes (4 threads on my quad-core). And that's at level 2! Getting stuck for a day was at level 0.	2013-03-08 22:20:29 +09:00
Bill Currie	4e87072d93	Fix a bunch of whitespace.	2013-03-07 14:31:00 +09:00
Bill Currie	4ccb4a4667	Make the test and vis set pointers const.	2013-03-07 14:28:15 +09:00
Bill Currie	281c5f185d	Correct the "can see anything new" check. I had the super-set and sub-set parameters backwards, but that's because I had misinterpreted the old code.	2013-03-07 14:24:30 +09:00
Bill Currie	299ff8f575	Use set functions for qfvis. While noticeably slower than the previous expanded set manipulation code, this is much easier to read. I can worry about optimizing the set code when I get qfvis behaving better.	2013-03-07 11:06:55 +09:00
Bill Currie	23a38738fc	Massive whitespace cleanup. Lots of trailing whitespace and otherwise blank lines.	2012-05-22 08:23:22 +09:00
Bill Currie	bc1b483525	Nuke the rcsid stuff. It's pretty useless in git.	2012-04-22 10:56:32 +09:00
Bill Currie	997102fea8	audit the usage of "only" There are still a few iffy places (notably around certain prepositions), but the relevant sentences are now much easier to read.	2010-01-13 06:42:26 +00:00
Bill Currie	3fb03fc2be	hah, should have been using __attribute__((used)) all that time (rather than __attribute__((unused))). fixes the missing console in -x11	2005-08-04 15:27:09 +00:00
Ragnvald Maartmann-Moe IV	527342d80d	Whitespace, codingstyle, pouncing shalraths.	2004-02-08 02:53:58 +00:00
Bill Currie	39c0a0f08d	make cluster recursion a warning rather than an error (see how this goes)	2003-09-24 00:59:53 +00:00
Bill Currie	0a8867fde9	replace all VectorSubtract (vec3_origin, ... and VectorCopy (vec3_origin, ... with VectorNegate (... and VectorZero (..., respectively. Should be a small speedup.	2003-09-03 22:17:04 +00:00
Bill Currie	7073afc0a4	port in OQ's detail, hint and skip brush/texture enhancements	2003-02-04 23:26:26 +00:00
Bill Currie	761a7546dd	re-arrange things so __attribute__ and __builtin_expect are properly autoconfiscated so rcsid will continue to work with gcc 3.3	2003-01-15 15:31:36 +00:00
Bill Currie	71196fedf6	make gcc more anal about prototypes, string constants and function visibility (ie, global functions must have a prototype)	2003-01-06 18:28:13 +00:00
Bill Currie	2a56b8f15f	go back to the `long way' for the cross product as qfvis was no longer getting the same results as vis (that may be acceptable, but not yet:)	2002-09-26 03:58:22 +00:00
Bill Currie	e81a0e2095	qfvis and qflight are still copyright Id	2002-09-25 01:51:58 +00:00
Bill Currie	324172414a	use the vector math macros from mathlib.h to make the math in ClipToSeparators clearer and comment on how --level effects the vis calculations	2002-09-24 21:48:47 +00:00
Bill Currie	5904859cca	split the base vis functions out of flow.c, add -D_REENTRANT to the compile for qfvis and a bunch of whitespace	2002-09-24 02:46:03 +00:00
Bill Currie	ded572b31f	various var cleanups	2002-09-23 22:54:28 +00:00
Bill Currie	ee61eaebbb	don't do threading if only 1 thread is used and add another state to vstatus_t for better portal state checking	2002-09-22 21:54:41 +00:00
Bill Currie	528eec1e12	go back to bailing out on errors	2002-09-22 21:32:36 +00:00

1 2

61 commits