littlefs

mirror of https://github.com/littlefs-project/littlefs.git synced 2025-12-06 07:32:43 +00:00

Author	SHA1	Message	Date
Christopher Haster	d4c772907d	scripts: csv.py: Fixed completely broken float parsing Whoops! A missing splat repetition here meant we only ever accepted floats with a single digit of precision and no e/E exponents. Humorously this went unnoticed because our scripts were only _outputting_ single digit floats, but now that that's fixed, float parsing also needs a fix. Fixed by allowing >1 digit of precision in our CsvFloat regex.	2025-05-15 15:44:30 -05:00
Christopher Haster	d5b28df33a	scripts: Fixed excessive rounding when writing floats to csv/json files This adds __csv__ methods to all Csv* classes to indicate how to write csv/json output, and adopts Python's default float repr. As a plus, this also lets us use "inf" for infinity in csv/json files, avoiding potential unicode issues. Before this we were reusing __str__ for both table rendering and csv/json writing, which rounded to a single decimal digit! This made float output pretty much useless outside of trivial cases. --- Note Python apparently does some of its own rounding (1/10 -> 0.1?), so the result may still not be round-trippable, but this is probably fine for our somewhat hack-infested csv scripts.	2025-05-15 15:44:30 -05:00
Christopher Haster	43c2330edc	scripts: csv.py: Tweaked hidden fields to not imply -b/--by defaults So now the hidden variants of field specifiers can be used to manipulate by fields and field fields without implying a complete field set: $ ./scripts/csv.py lfs.code.csv \ -Bsubsystem=lfsr_file -Dfunction='lfsr_file_' \ -fcode_size Is the same as: $ ./scripts/csv.py lfs.code.csv \ -bfile -bsubsystem=lfsr_file -Dfunction='lfsr_file_' \ -fcode_size Attempting to use -b/--by here would delete/merge the file field, as cvs.py assumes -b/-f specify all of the relevant field type. Note that fields can also be explicitly deleted with -D/--define's new glob support: $ ./scripts/csv.py lfs.code.csv -Dfile='*' -fcode_size --- This solves an annoying problem specific to csv.py, where manipulating by fields and field fields would often force you to specify all relevant -b/-f fields. With how benchmarks are parameterized, this list ends up _looong_. It's a bit of a hack/abuse of the hidden flags, but the alternative would be field globbing, which 1. would be a real pain-in-the-ass to implement, and 2. affect almost all of the scripts. Reusing the hidden flags for this keeps the complexity limited to csv.py.	2025-05-15 15:44:14 -05:00
Christopher Haster	7526b469b9	scripts: Adopted globs in all field matchers (-D/--define, -c/--compare) Globs in CLI attrs (-L'=bs=%(bs)s' for example), have been remarkably useful. It makes sense to extend this to the other flags that match against CSV fields, though this does add complexity to a large number of smaller scripts. - -D/--define can now use globs when filtering: $ ./scripts/code.py lfs.o -Dfunction='lfsr_file_' -D/--define already accepted a comma-separated list of options, so extending this to globs makes sense. Note this differs from test.py/bench.py's -D/--define. Globbing in test.py/bench.py wouldn't really work since -D/--define is generative, not matching. But there's already other differences such as integer parsing, range, etc. It's not worth making these perfectly consistent as they are really two different tools that just happen to look the same. - -c/--compare now matches with globs when finding the compare entry: $ ./scripts/code.py lfs.o -c'lfs_file_sync' This is quite a bit less useful that -D/--define, but makes sense for consistency. Note -c/--compare just chooses the first match. It doesn't really make sense to compare against multiple entries. This raised the question of globs in the field specifiers themselves (-f'bench_' for example), but I'm rejecting this for now as I need to draw the complexity/scope _somewhere_, and I'm worried it's already way over on the too-complex side. So, for now, field names must always be specified explicitly. Globbing field names would add too much complexity. Especially considering how many flags accept field names in these scripts.	2025-05-15 14:28:57 -05:00
Christopher Haster	55ea13b994	scripts: Reverted del to resolve shadowed builtins I don't know how I completely missed that this doesn't actually work! Using del _does_ work in Python's repl, but it makes sense the repl may differ from actual function execution in this case. The problem is Python still thinks the relevant builtin is a local variables after deletion, raising an UnboundLocalError instead of performing a global lookup. In theory this would work if the variable could be made global, but since global/nonlocal statements are lifted, Python complains with "SyntaxError: name 'list' is parameter and global". And that's A-Ok! Intentionally shadowing language builtins already puts this code deep into ugly hacks territory.	2025-05-15 14:10:42 -05:00
Christopher Haster	7c26bfc0a3	scripts: Simplified csv.py's func/uop/bop/top helpers Now that I know my way around the weirdness that is Python's class scope, this just required another function indirection to capture the class-level dicts correctly. I was considering using the __subclasses__ trick, but it seems like that would actually be more complicated here.	2025-04-16 15:23:13 -05:00
Christopher Haster	71930a5c01	scripts: Tweaked openio comment Dang, this touched like every single script.	2025-04-16 15:23:06 -05:00
Christopher Haster	c63ed79c5f	scripts: Prefer .a for single entry namedtuples - CsvInt.x -> CsvInt.a - CsvFloat.x -> CsvFloat.a - Rev.x -> Rev.a This matches CsvFrac.a (paired with CsvFrac.b), and avoids confusion with x/y variables such as Tile.x and Tile.y. The other contender was .v, since these are csv related types, but sticking with .a gets the point across that the name really doesn't have any meaning. There's also some irony that we're forcing namedtuples to have meaningless names, but it is useful to have a quick accessor for the internal value.	2025-04-16 15:23:03 -05:00
Christopher Haster	98b16a9013	scripts: Renamed RInt (and friends) -> CsvInt (and friends) This prefix was extremely arbitrary anyways. The prefix Csv* has slightly more meaning than R*, since these scripts interact with .csv files quite a bit, and it avoids confusion with rbyd-related things such as Rattr, Ralt, etc.	2025-04-16 15:23:02 -05:00
Christopher Haster	26a29bda31	scripts: Tweaked RFrac to return +-∞ when evaluated as a float This affects the table renderers as well as csv.py's ratio expr. This is a bit more correct, handwaving 0/0 (mapping 0/0 -> 100% is useful for cov.py, please don't kill me mathematicians): frac(1,0) => 1/0 (∞%) frac(0,0) => 0/0 (100.0%) frac(0,1) => 0/1 (0.0%)	2025-04-16 15:23:02 -05:00
Christopher Haster	613fa0f27a	scripts: Reverted to -p/--percent not providing a path So now the result scripts always require -d/--diff to diff: - before: ./scripts/csv.py a.csv -pb.csv - after: ./scripts/csv.py a.csv -db.csv -p For a couple reasons: - Easier to toggle - Simpler internally to only have one diff path flag - The previous behavior was a bit unintuitive	2025-04-16 15:23:00 -05:00
Christopher Haster	8e3760c5b8	scripts: Tweaked punescape to expect dict-like attrs This simplifies attrs a bit, and scripts can always override __getitem__ if they want to provide lazy attr generation. The original intention of accepting functions was to make lazy attr generation easier, but while tinkering around with the idea I realized the actual attr mapping/generation would be complicated enough that you'd probably want a full class anyways. All of our scripts are only using dict attrs anyways. And lazy attr generation is probably a premature optimization for the same reason everyone's ok with Python's slices being O(n).	2025-04-16 15:22:45 -05:00
Christopher Haster	270230a833	scripts: Adopted del to resolve shadowed builtins So: all_ = all; del all Instead of: import builtins all_, all = all, builtins.all The del exposes the globally scoped builtin we accidentally shadow. This requires less megic, and no module imports, though tbh I'm surprised it works. It also works in the case where you change a builtin globally, but that's a bit too crazy even for me...	2025-04-16 15:22:08 -05:00
Christopher Haster	313696ecf9	scripts: Fixed openio issue where some scripts didn't import os This only failed if "-" was used as an argument (for stdin/stdout), so the issue was pretty hard to spot. openio is a heavily copy-pasted function, so it makes sense to just add the import os to openio directly. Otherwise this mistake will likely happen again in the future.	2025-03-12 21:18:51 -05:00
Christopher Haster	92ac2a757e	scripts: Adopted json -> is_json tweak, avoiding name conflict This was a humorous name conflict that went unnoticed only because we lazily import json in read_csv.	2025-03-12 21:12:12 -05:00
Christopher Haster	c60301719a	scripts: Adopted dat tweak in other scripts This just makes dat behave similarly to Python's getattr, etc: - dat("bogus") -> raises ValueError - dat("bogus", 1234) -> returns 1234 This replaces try_dat, which is easy to forget about when copy-pasting between scripts. Though all of this wouldn't be necessary if only we could catch exceptions in expressions...	2025-03-12 21:12:12 -05:00
Christopher Haster	0d134a2830	scripts: Re-added -q/--quiet to result scripts I forgot that this is still useful for erroring scripts, such as stack.py when checking for recursion. Technically this is possible with -o/dev/null, but that's both unnecessarily complicated and includes the csv encoding cost for no reason.	2025-03-12 20:02:19 -05:00
Christopher Haster	9e22167a31	scripts: Re-adopted result prefixes Now that I'm looking into some higher-level scripts, being able to merge results without first renaming everything is useful. This gives most scripts an implicit prefix for field fields, but _not_ by fields, allowing easy merging of results from different scripts: $ ./scripts/stack.py lfs.ci -o- function,stack_frame,stack_limit lfs_alloc,288,1328 lfs_alloc_discard,8,8 lfs_alloc_findfree,16,32 ... At least now these have better support in scripts with the addition of the --prefix flag (this was tricky for csv.py), which allows explicit control over field field prefixes: $ ./scripts/stack.py lfs.ci -o- --prefix= function,frame,limit lfs_alloc,288,1328 lfs_alloc_discard,8,8 lfs_alloc_findfree,16,32 ... $ ./scripts/stack.py lfs.ci -o- --prefix=wonky_ function,wonky_frame,wonky_limit lfs_alloc,288,1328 lfs_alloc_discard,8,8 lfs_alloc_findfree,16,32 ...	2025-03-12 19:10:17 -05:00
Christopher Haster	aae03be54b	scripts: Fixed diff result sorting This was a bit broken when r was None. Which is unusual, but happens when rendering added/removed diff results.	2025-03-12 19:10:17 -05:00
Christopher Haster	299e2604c6	scripts: Changed -o/-O to an exclusive operation So: $ ./scripts/code.py lfs.o -o- -q Becomes: $ ./scripts/code.py lfs.o -o- The original intention of -o/-O _not_ being exclusive (aka table is still rendered unless disabled with -q/--quiet), was to allow results to be written to csv files and rendered to tables in a single pass. But this was never useful. Heck, we're not even using this in our Makefile right now because it would make the rule dependencies more complicated than it's worth. Even for long-running result scripts (perf.py, perfbd.py, etc), most of the work is building that csv file, the cost of rendering a table in a second pass is negligible. In every case I've used -o/-O, I've also wanted -q/--quiet, and almost always forget this on the first run. So might as well make the expected behavior the actual behavior. --- As a plus, this let us simplify some of the scripts a bit, by replacing visibility filters with -o/-O dependent by-fields.	2025-03-12 19:10:17 -05:00
Christopher Haster	e71aca65d9	scripts: Adopted default visibility in scripts with complex fields This makes it so scripts with complex fields will still output all fields to output csv/json files, while only showing a user-friendly subset unless -f/--field is explicitly provided. While internal fields are often too much information to show by default, csv/json files are expected to go to other scripts, not humans. So more information is more useful up until you actually hit a performance bottleneck. And if you _do_ somehow manage to hit a performance bottleneck, you can always limit the output with explicit -f/--field flags.	2025-03-12 19:10:17 -05:00
Christopher Haster	051bf66f9a	scripts: Tried to handle -d/--diff results consistently With this, we apply the same result modifiers (exprs/defines/hot/etc) to both the input results and -d/--diff results. So if both start with the same format, diffing/hotifying/etc should work as expected. This is really the only way I can seen -d/--diff results working with result modifiers in a way that makes sense. The downside of this is that you can't save results with some complex operation applied, and then diff while applying the same operation, since most of the newer operations (hotify) are _not_ idempotent. Fortunately the two alternatives are not unreasonable: 1. Save results _without_ the operation applied, since the operation will be applied to both the input and diff results. This is a bit asymmetric, but should work. 2. Apply the operation to the input and then pipe to csv.py for diffing. This used to "just work" when we did _not_ apply operations to output csv/json, but this was really just equivalent to 1.. I think the moral of the story is you can solve any problem with enough chained csv.py calls.	2025-03-12 19:10:17 -05:00
Christopher Haster	2f20f53e90	scripts: csv.py: Reverted define filtering to before expr eval It's just too unintuitive to filter after exprs. Note this is consistent with how exprs/mods are evaluated. Exprs/mods can't reference other exprs/mods because csv.py is only single-pass, so allowing defines to reference exprs/mods is surprising. And the solution to needing these sort of post-expr/mod references is the same for defines: You can always chain multiple csv.py calls. The reason defines were change to evaluate after expr eval was because this seemed inconsistent with other result scripts, but this is not actually the case. Other result scripts simply don't have exprs/mods, so filtering in fold is the same as filtering during collection. Note that even in fold, filtering is done _before_ the actual fold/sum operation. --- Also fixed a recursive-define regression when folding. Counter- intuitively, we _don't_ want to recursively apply define filters. If we do the results will just end up too confusing to be useful.	2025-03-12 19:10:17 -05:00
Christopher Haster	e851c654c5	scripts: Fixed typo hiding zero-sized results in table renderer This should either have checked diff_result==None, or we should be mapping diff_result=None => diff_result_=None. To be safe I've done both. This was a nasty typo and I only noticed because ctx.py stopped printing "cycle detected" for our linked-lists (which are expected to be cyclic).	2025-03-12 19:10:17 -05:00
Christopher Haster	5811b11131	scripts: csv.py: Replaced -l/--label with -I/-B/-F for hidden fields It felt weird that adding hidden fields required changing existing flags unrelated to the field you actually want to affect, and the upper/lower flag thing seems to work well for -s/-S sooo... - Replaced -l/--label with -B/--hidden-by for by fields that can be hidden from the table renderer. - Added -F/--hidden-field as a similar thing for field fields. - Better integrated -i/--enumerate into by fields, now these actually maintain related order. And of course added a matching -I/--hidden-enumerate flag. The only downside is this is eating a lot of flag names.. But one of the nice thing about limiting this complexity to csv.py is it avoids these flag names cluttering up the other result scripts. --- The -F/--hidden-fields flag I'm not so sure about, since field exprs can't really reference each other (single pass). But it does provide symmetry with -B/--hidden-by, and reserves the name in case hidden field fields are more useful in the future. Unfortunately it _is_ annoyingly inconsistent with other hidden fields (-S/--sort, -D/--define, etc) in that it does end up in output csvs... But this script is already feeling way over-engineered as is.	2025-03-12 19:10:17 -05:00
Christopher Haster	1b8733b3de	scripts: csv.py: Fixed issue with exprs not always being typechecked Kind of a complicated corner case, but this shows up if you try to sort by fields as numbers and not as strings. In theory this is possible by creating a hidden sort field with a typed expr: $ ./scripts/csv.py test.csv -bi -bfunction -Si=i But we weren't typechecking sort fields that already exist in the by fields, since these are usually strings. This fix is to make sure all exprs are in the typechecked fields, even if they are already in by fields. There's no real cost to this. --- Note this version does _not_ typecheck i, and sorts by string: $ ./scripts/csv.py test.csv -bi -bfunction -Si This raises the question, should we always sort by string by default? I don't think so. It's easy to miss the difference, and a typecheck error is a lot safer than incorrect sorting. So this will sort by number, with i as a hidden field: $ ./scripts/csv.py test.csv -bfunction -Si If you want to sort by string with a hidden field, this is still possible with -l/--label: $ ./scripts/csv.py test.csv -bi -lfunction -Si	2025-03-12 19:10:17 -05:00
Christopher Haster	7789714560	scripts: Adopted single folding pass, fixing perf[bd].py -r/--hot issue There's an ordering issue with hotifying and folding when we have multiple foldable results with children. This was hard to notice since most of the recursive scripts have unique results, but it _is_ an issue for perf.py/perfbd.py, which rely on result folding to merge samples. The fix is to fold _before_ hotifying. We could fold multiple times to avoid changing the behavior of the result scripts, but instead I've just moved the folding in the table renderer up into the relevant main functions. This means 1. we only fold once, and 2. folding affects outputted csv/json files. I'm a bit on the fence about this behavior change, but it is a bit more consistent with how -r/--hot, -z/--depth, etc, affect both table and csv/json results consistently. Maybe we should move towards the table render always reflecting the csv/json results? Most csv/json usage is with -q/--quiet anyways... --- This does create a new risk in that the table renderer can hide results if they aren't folded first. To hopefully avoid this I've added an assert in the table renderer if it notices results being hidden.	2025-03-12 19:10:17 -05:00
Christopher Haster	861dc3bd6a	scripts: csv.py: Added --help-mods to help explain % modifiers I guess in addition to its other utilities, csv.py is now also turning into a sort of man database for some of the more complicated APIs in the scripts: ./csv.py --help ./csv.py --help-exprs ./csv.py --help-mods It's a bit minimal, but better than nothing. Also dropped the %c modifier because this never actually worked.	2025-03-12 19:10:17 -05:00
Christopher Haster	b2768becaa	scripts: Added -l/--labels to csv.py This gives csv.py access to a hidden feature in our table renderer used by some of the other scripts: fields that affect by-field grouping, but aren't actually printed. For example, this prevents summing same named functions in different files, but only shows the function name in the table render: $ ./scripts/csv.py lfs.code.csv -bfile -bfunction -lfunction function size lfs_alloc 398 lfs_alloc_discard 31 lfs_alloc_findfree 77 ... This is especially useful when enumerating results. For example, this prevents any summing without extra table noise: $ ./scripts/csv.py lfs.code.csv -i -bfunction -fsize -lfunction function size lfs_alloc 398 lfs_alloc_discard 31 lfs_alloc_findfree 77 ... I also tweaked -b/--by field defaults a bit to account to enumerate/label fields a bit better.	2025-03-12 19:10:17 -05:00
Christopher Haster	748815bb46	scripts: Disentangled -r/--hot and -i/--enumerate This removes most of the special behavior around how -r/--hot and -i/--enumerate interact. This does mean -r/--hot risks folding results if -i/--enumerate is not specified, but this is _technically_ a valid operation. For most of the recursive result scripts, I've replaced the "i" field with separate "z" and "i" fields for depth and field number, which I think is a bit more informative/useful. I've also added a default-hidden "off" field to structs.py/ctx.py, since we have that info available. I considered replacing "i" with this, but decided against it since non-zero offsets for union members would risk being confusing/mistake prone.	2025-03-12 19:10:17 -05:00
Christopher Haster	ac30a20d12	scripts: Reworked to support optional json input/output Guh This may have been more work than I expected. The goal was to allowing passing recursive results (callgraph info, structs, etc) between scripts, which is simply not possible with csv files. Unfortunately, this raised a number of questions: What happens if a script receives recursive results? -d/--diff with recursive results? How to prevent folding of ordered results (structs, hot, etc) in piped scripts? etc. And ended up with a significant rewrite of most of the result scripts' internals. Key changes: - Most result scripts now support -O/--output-json in addition to -o/--json, with -O/--output-json including any recursive results in the "children" field. - Most result scripts now support both csv and json as input to relevant flags: -u/--use, -d/--diff, -p/--percent. This is accomplished by looking for a '[' as the first character to decide if an input file is json or csv. Technically this breaks if your json has leading whitespace, but why would you ever keep whitespace around in json? The human-editability of json was already ruined the moment comments were disallowed. - csv.py requires all fields to be explicitly defined, so added -i/--enumerate, -Z/--children, and -N/--notes. At least we can provide some reasonable defaults so you shouldn't usually need to type out the whole field. - Notably, the rendering scripts (plot.py, treemapd3.py, etc) and test/bench scripts do _not_ support json. csv.py can always convert to/from json when needed. - The table renderer now supports diffing recursive results, which is nice for seeing how the hot path changed in stack.py/perf.py/etc. - Moved the -r/--hot logic up into main, so it also affects the outputted results. Note it is impossible for -z/--depth to _not_ affect the outputted results. - We now sort in one pass, which is in theory more efficient. - Renamed -t/--hot -> -r/--hot and -R/--reverse-hot, matching -s/-S. - Fixed an issue with -S/--reverse-sort where only the short form was actually reversed (I misunderstood what argparse passes to Action classes). - csv.py now supports json input/output, which is funny.	2025-03-12 19:09:43 -05:00
Christopher Haster	dcbc195b41	scripts: csv.py: Replaced -b/--by exprs with % modifiers In addition to providing more functionality for creating -b/--by fields, this lets us remove strings from the expr parser. Strings had no well-defined operations and could best be described as an "ugly wart". Maybe we'll reintroduce string exprs in the future, but for now csv.py's -f/--field fields will be limited to numeric values. As an extra plus, no more excessive quoting when injecting new -b/--by fields. --- This also fixed sorting on non-field fields, which was apparently broken. Or at least mostly useless since it was defaulting to string sorting.	2025-03-11 18:48:27 -05:00
Christopher Haster	0adec7f15c	scripts: Replaced __builtins__ with builtins Apparently __builtins__ is a CPython implementation detail, and behaves differently when executed vs imported??? import builtins is the correct way to go about this.	2025-01-28 14:41:45 -06:00
Christopher Haster	62cc4dbb14	scripts: Disabled local import hack on import Moved local import hack behind if __name__ == "__main__" These scripts aren't really intended to be used as python libraries. Still, it's useful to import them for debugging and to get access to their juicy internals.	2025-01-28 14:41:30 -06:00
Christopher Haster	1d8d0785fc	scripts: More flags to control table renderer, -Q/--small-table, etc Instead of trying to be too clever, this just adds a bunch of small flags to control parts of table rendering: - --no-header - Don't show the header. - --small-header - Don't show by field names. - --no-total - Don't show the total. - -Q/--small-table - Equivalent to --small-header + --no-total. Note that -Q/--small-table replaces the previous -Y/--summary + -c/--compare hack, while also allowing a similar table style for non-compare results.	2024-12-18 14:03:35 -06:00
Christopher Haster	a3ac512cc1	scripts: Adopted Parser class in prettyasserts.py This ended up being a pretty in-depth rework of prettyasserts.py to adopt the shared Parser class. But now prettyasserts.py should be both more robust and faster. The tricky parts: - The Parser class eagerly munches whitespace by default. This is usually a good thing, but for prettyasserts.py we need to keep track of the whitespace somehow in order to write it to the output file. The solution here is a little bit hacky. Instead of complicating the Parser class, we implicitly add a regex group for whitespace when compiling our lexer. Unfortunately this does make last-minute patching of the lexer a bit messy (for things like -p/--prefix, etc), thanks to Python's re.Pattern class not being extendable. To work around this, the Lexer class keeps track of the original patterns to allow recompilation. - Since we no longer tokenize in a separate pass, we can't use the None token to match any unmatched tokens. Fortunately this can be worked around with sufficiently ugly regex. See the 'STUFF' rule. It's a good thing Python has negative lookaheads. On the flip side, this means we no longer need to explicitly specify all possible tokens when multiple tokens overlap. - Unlike stack.py/csv.py, prettyasserts.py needs multi-token lookahead. Fortunately this has a pretty straightforward solution with the addition of an optional stack to the Parser class. We can even have a bit of fun with Python's with statements (though I do wish with statements could have else clauses, so we wouldn't need double nesting to catch parser exceptions). --- In addition to adopting the new Parser class, I also made sure to eliminate intermediate string allocation through heavy use of Python's io.StringIO class. This, plus Parser's cheap shallow chomp/slice operations, gives prettyasserts.py a much needed speed boost. (Honestly, the original prettyasserts.py was pretty naive, with the assumption that it wouldn't be the bottleneck during compilation. This turned out to be wrong.) These changes cut total compile time in ~half: real user sys before (time make test-runner -j): 0m56.202s 2m31.853s 0m2.827s after (time make test-runner -j): 0m26.836s 1m51.213s 0m2.338s Keep in mind this includes both prettyasserts.py and gcc -Os (and other Makefile stuff).	2024-12-17 15:34:44 -06:00
Christopher Haster	dad3367e9e	scripts: Adopted Parser in csv.py It's a bit funny, the motivation for a new Parser class came from the success of simple regex + space munching in csv.py, but adopting Parser in csv.py makes sense for a couple reasons: - Consistency and better code sharing with other scripts that need to parse things (stack.py, prettyasserts.py?). - Should be more efficient, since we avoid copying the entire string every time we chomp/slice. Though I don't think this really matters for the size of csv.py's exprs... - No need to write every regex twice! Since Parser remembers the last match.	2024-12-16 19:27:31 -06:00
Christopher Haster	6a6ed0f741	scripts: Dropped cycle detection from table renderer Now that cycle detection is always done at result collection time, we don't need this in the table renderer itself. This had a tendency to cause problems for non-function scripts (ctx.py, structs.py).	2024-12-16 19:26:21 -06:00
Christopher Haster	dd389f23ee	scripts: Switched to sorted sets for result notes God, I wish Python had an OrderedSet. This is a fix for duplicate "cycle detected" notes when using -t/--hot. This mix of merging both _hot_notes and _notes in the HotResult class is tricky when the underlying container is a list. The order is unlikely to be guaranteed anyways, when different results with different notes are folded. And if we ever want more control over the order of notes in result scripts we can always change this back later.	2024-12-16 19:22:14 -06:00
Christopher Haster	3e03c2ee7f	scripts: Adopted better input file handling in result scripts - Error on no/insufficient files. Instead of just returning no results. This is more useful when debugging complicated bash scripts. - Use elf magic to allow any file order in perfbd.py/stack.py. This was already implemented in stack.py, now also adopted in perfbd.py. Elf files always start with the magic string "\x7fELF", so we can use this to figure out the types of input files without needing to rely on argument order. This is just one less thing to worry about when invoking these scripts.	2024-12-16 19:13:22 -06:00
Christopher Haster	ac79c88c6f	scripts: Improved cycle detection notes in scripts - Prevented childrenof memoization from hiding the source of a detected cycle. - Deduplicated multiple cycle detected notes. - Fixed note rendering when last column does not have a notes list. Currently this only happens when entry is None (no results).	2024-12-16 18:01:46 -06:00
Christopher Haster	faf4d09c34	scripts: Added __repr__ to RInt and friends Just a minor quality of life feature to help debugging these scripts.	2024-12-16 18:01:46 -06:00
Christopher Haster	8526cd9cf1	scripts: Prevented i/children/notes result field collisions Without this, naming a column i/children/notes in csv.py could cause things to break. Unlikely for children/notes, but very likely for i, especially when benchmarking. Unfortunately namedtuple makes this tricky. I _want_ to just rename these to _i/_children/_notes and call the problem solved, but namedtuple reserves all underscore-prefixed fields for its own use. As a workaround, the table renderer now looks for _i/_children/_notes at the _class_ level, as an optional name of which namedtuple field to use. This way Result types can stay lightweight namedtuples while including extra table rendering info without risk of conflicts. This also makes the HotResult type a bit more funky, but that's not a big deal.	2024-12-15 16:36:14 -06:00
Christopher Haster	183ede1b83	scripts: Option for result scripts to force children ordering This extends the recursive part of the table renderer to sort children by the optional "i" field, if available. Note this only affects children entries. The top-level entries are strictly ordered by the relevant "by" fields. I just haven't seen a use case for this yet, and not sorting "i" at the top-level reduces that number of things that can go wrong for scripts without children. --- This also rewrites -t/--hot to take advantage of children ordering by injecting a totally-no-hacky HotResult subclass. Now -t/--hot should be strictly ordered by the call depth! Though note entries that share "by" fields are still merged... This also gives us a way to introduce the "cycle detected" note and respect -z/--depth, so overall a big improvement for -t/--hot.	2024-12-15 16:35:52 -06:00
Christopher Haster	e6ed785a27	scripts: Removed padding from tail notes in tables We don't really need padding for the notes on the last column of tables, which is where row-level notes end up. This may seem minor, but not padding here avoids quite a bit of unnecessary line wrapping in small terminals.	2024-12-15 16:35:29 -06:00
Christopher Haster	512cf5ad4b	scripts: Adopted ctx.py-related changes in other result scripts - Adopted higher-level collect data structures: - high-level DwarfEntry/DwarfInfo class - high-level SymInfo class - high-level LineInfo class Note these had to be moved out of function scope due to pickling issues in perf.py/perfbd.py. These were only function-local to minimize scope leak so this fortunately was an easy change. - Adopted better list-default patterns in Result types: def __new__(..., children=None): return Result(..., children if children is not None else []) A classic python footgun. - Adopted notes rendering, though this is only used by ctx.py at the moment. - Reverted to sorting children entries, for now. Unfortunately there's no easy way to sort the result entries in perf.py/perfbd.py before folding. Folding is going to make a mess of more complicated children anyways, so another solution is needed... And some other shared miscellany.	2024-12-15 15:41:11 -06:00
Christopher Haster	b4c79c53d2	scripts: csv.py: Fixed NoneType issues with default sort $ ./scripts/csv.py lfs.code.csv -bfunction -fsize -S ... blablabla ... TypeError: cannot unpack non-iterable NoneType object The issue was argparse's const defaults bypassing the type callback, so the sort field ends up with None when it expects a tuple (well technically a tuple tuple). This is only an issue for csv.py because csv.py's sort fields can contain exprs.	2024-12-15 15:39:04 -06:00
Christopher Haster	e00db216c1	scripts: Consistent table renderer, cycle detection optional The fact that our scripts' table renderer was slightly different for recursive scripts (stack.py, perf.py) and non-recursive scripts (code.py, structs.py) was a ticking time bomb, one innocent edit away from breaking half the scripts. The makes the table renderer consistent across all scripts, allowing for easy copy-pasting when editing at the cost of some unused code in scripts. One hiccup with this though is the difference in cycle detection behavior between scripts: - stack.py: lfsr_bd_sync '-> lfsr_bd_prog '-> lfsr_bd_sync <-- cycle! - structs.py: lfsr_bshrub_t '-> u '-> bsprout '-> u <-- not a cycle! To solve this the table renderer now accepts a simple detect_cycles flag, which can be set per-script.	2024-12-14 12:25:15 -06:00
Christopher Haster	ef3accc07c	scripts: Tweaked -p/--percent to accept the csv file for diffing This makes the -p/--percent flag a bit more consistent with -d/--diff and -c/--compare, both of which change the printing strategy based on additional context.	2024-11-16 18:01:27 -06:00
Christopher Haster	9a2b561a76	scripts: Adopted -c/--compare in make summary-diff This showcases the sort of high-level result printing where -c/--compare is useful: $ make summary-diff code data stack structs BEFORE 57057 0 3056 1476 AFTER 68864 (+20.7%) 0 (+0.0%) 3744 (+22.5%) 1520 (+3.0%) There was one hiccup though: how to hide the name of the first field. It may seem minor, but the missing field name really does help readability when you're staring at a wall of CLI output. It's a bit of a hack, but this can now be controlled with -Y/--summary, which has the sole purpose of disabling the first field name if mixed with -c/--compare. -c/--compare is already a weird case for the summary row anyways...	2024-11-16 18:01:15 -06:00

1 2

72 Commits