forked from Imagelibrary/littlefs
v3-alpha
35 Commits

6d9c077261
Reordered LFSR_TAG_NAMELIMIT/FILELIMIT
Not sure why, but this just seems more intuitive/correct. Maybe because LFSR_TAG_NAME is always the first tag in a file's attr set:

LFSR_TAG_NAMELIMIT 0x0039 v--- ---- --11 1--1
LFSR_TAG_FILELIMIT 0x003a v--- ---- --11 1-1-

Seeing as several parts of the codebase still use the previous order, it seems reasonable to switch back to that.

No code changes.

55ea13b994
scripts: Reverted del to resolve shadowed builtins
I don't know how I completely missed that this doesn't actually work! Using del _does_ work in Python's repl, but it makes sense the repl may differ from actual function execution in this case.

The problem is Python still thinks the relevant builtin is a local variable after deletion, raising an UnboundLocalError instead of performing a global lookup.

In theory this would work if the variable could be made global, but since global/nonlocal statements are lifted, Python complains with "SyntaxError: name 'list' is parameter and global".

And that's A-Ok! Intentionally shadowing language builtins already puts this code deep into ugly hacks territory.
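
For the curious, the failure is easy to reproduce (hypothetical function, not from the scripts):

    def f(list):
        list_ = list
        del list
        # Python decided at compile time that the name is local to f,
        # so this raises UnboundLocalError instead of falling back to
        # the builtin list
        return list(range(3))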

de7564e448
Added phase bits to cksum tags
This carves out two more bits in cksum tags to store the "phase" of the
rbyd block (maybe the name is too fancy, this is just the lowest 2 bits
of the block address):
LFSR_TAG_CKSUM 0x300p v-11 ---- ---- -pqq
                                      ^ ^
                                      | '-- phase bits
                                      '---- perturb bit
The intention here is to catch mrootanchors that are "out-of-phase",
i.e. they've been shifted by a small number of blocks.
This can happen if we find the wrong mrootanchor (after, say, a magic
scan), and risks filesystem corruption:
              formatted
.-----------------'-----------------.
                        mounted
         .-----------------'-----------------.
.--------+--------+--------+--------+ ...
|(erased)| mroot  |
|        | anchor | ...
|        |        |
'--------+--------+--------+--------+ ...
Including the lower 2 bits of the block address in cksum tags avoids
this, for up to a 3 block shift (the maximum number of redund
mrootanchors).
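
In effect, the extra check during fetch amounts to something like this (a sketch, hypothetical names):

    def check_phase(block, cksum_tag):
        # the tag's phase bits must match the lowest 2 bits of the
        # block address, catching mrootanchors shifted by up to 3
        # blocks
        return (cksum_tag & 0x3) == (block & 0x3)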
---
Note that cksum tags really are the only place we could put these bits.
Anywhere else and they would interfere with the canonical cksum, which
would break error correction. By definition these need to be different
per block.
We include these phase bits in every cksum tag (because it's easier),
but these don't really say much about mdirs that are not the
mrootanchor. Non-anchor mdirs can have arbitrary block addresses,
therefore arbitrary phase bits.
You _might_ be able to do something interesting if you sort the rbyd
addresses and use the index as the phase bits, but that would add quite
a bit of code for questionable benefit...
You could argue this adds noise to our cksums, but:
1. 2 bits seems like a really small amount of noise
2. our cksums are just crc32cs
3. the phase bits humorously never change when you rewrite a block
---
As with any feature this adds code, but only a small amount. I think
it's worth the extra protection:
code stack ctx
before: 35792 2368 636
after: 35824 (+0.1%) 2368 (+0.0%) 636 (+0.0%)
Also added test_mount_incompat_out_of_phase to test this.
The dbg scripts _don't_ error (block mismatch seems likely when
debugging), but dbgrbyd.py at least adds phase mismatch notes in
-l/--log mode.

677c078b50
Added LFSR_TAG_BNAME/MNAME, stop btree lookups at first tag
Now that we don't have to worry about name tag conflicts as much, we
can add name tags for things that aren't files.
This adds LFSR_TAG_BNAME for branch names, and LFSR_TAG_MNAME for mtree
names. Note that the upper 4 bits of the subtype match LFSR_TAG_BRANCH
and LFSR_TAG_MDIR respectively:
LFSR_TAG_BNAME 0x0200 v--- --1- ---- ----
LFSR_TAG_MNAME 0x0220 v--- --1- --1- ----
LFSR_TAG_BRANCH 0x030r v--- --11 ---- --rr
LFSR_TAG_MDIR 0x0324 v--- --11 --1- -1rr
The encoding is somewhat arbitrary, but I figured reserving ~31 types
for files is probably going to be plenty for littlefs. POSIX seems to
do just fine with only ~7 all these years, and I think custom attributes
will be more enticing for "niche" file types (symlinks, compressed
files, etc), given the easy backwards compatibility.
---
In addition to the debugging benefits, the new name tags let us stop
btree lookups on the first non-bname/branch tag. Previously we always
had to fetch the first struct tag as well to check if it was a branch.
In theory this saves one rbyd lookup, but in practice it's a bit muddy.
The problem is that there are two ways to use named btrees:
1. As buckets: mtree -> mdir -> mid
2. As a table: ddtree -> ddid
The only named btree we _currently_ have is the mtree. And the mtree
operates in bucket mode, with each mdir acting more-or-less as an
extension to the btree. So we end up needing to do the second tag lookup
anyways, and all we've done is complicate the code.
But we will _eventually_ need the table mode for the ddtree, where we
care if the ddname is an exact match.
And returning the first tag is arguably the more "correct" internal API,
vs arbitrarily the first struct tag.
But then again this change is pretty pricey...
code stack ctx
before: 35732 2440 640
after: 35888 (+0.4%) 2480 (+1.6%) 640 (+0.0%)
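
Roughly, the lookup can now short-circuit like so (a sketch with hypothetical names, not the actual implementation):

    def btree_lookupnext(rbyd, bid):
        while True:
            rid, tag, weight, data = rbyd_lookupnext(rbyd, bid, 0)
            if tag == LFSR_TAG_BRANCH:
                # a followable branch, descend into the child rbyd
                rbyd = rbyd_fetch(data)
                continue
            # any other first tag ends the lookup; previously we
            # needed a second lookup here to fetch the struct tag and
            # check if it was a branch
            return rid, tag, weight, data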
---
It's worth noting the new BNAME/MNAME tags don't _require_ the btree
lookup changes (which is why we can get away with not touching the dbg
scripts). The previous algorithm of always checking for branch tags
still works.
Maybe there's an argument for conditionally using the previous API when
compiling without the ddtree, but that sounds horrendously messy...

d308ec8322
Reworked tag encoding a little bit
Mainly to make room for some future planned stuff:
- Moved the mroot's redund bits from LFSR_TAG_GEOMETRY to
LFSR_TAG_MAGIC:
LFSR_TAG_MAGIC 0x003r v--- ---- --11 --rr
This has the benefit of living in a fixed location (off=0x5), which
may make mounting/debugging easier. It also makes LFSR_TAG_GEOMETRY
less of a special case (LFSR_TAG_MAGIC is already a _very_ special
case).
Unfortunately, this does get in the way of our previous magic=0x3
encoding. To compensate (and to avoid conflicts with LFSR_TAG_NULL),
I've added the 0x3_ prefix. This has the funny side-effect of
rendering redunds 0-3 as ascii 0-3 (0x30-0x33), which is a complete
accident but may actually be useful when debugging.
Currently all config tags fit in the 0x3_ prefix, which is nice for
debugging but not a hard requirement.
- Flipped LFSR_TAG_FILELIMIT/NAMELIMIT:
LFSR_TAG_FILELIMIT 0x0039 v--- ---- --11 1--1
LFSR_TAG_NAMELIMIT 0x003a v--- ---- --11 1-1-
The file limit is a _bit_ more fundamental. It's effectively the
required integer size for the filesystem.
These may also be followed by LFSR_TAG_ATTRLIMIT based on how future
attr revisits go.
- Rearranged struct tags so that LFSR_TAG_BRANCH = 0x300:
LFSR_TAG_BRANCH 0x030r v--- --11 ---- --rr
LFSR_TAG_DATA 0x0304 v--- --11 ---- -1--
LFSR_TAG_BLOCK 0x0308 v--- --11 ---- 1err
LFSR_TAG_DDKEY* 0x0310 v--- --11 ---1 ----
LFSR_TAG_DID 0x0314 v--- --11 ---1 -1--
LFSR_TAG_BSHRUB 0x0318 v--- --11 ---1 1---
LFSR_TAG_BTREE 0x031c v--- --11 ---1 11rr
LFSR_TAG_MROOT 0x032r v--- --11 --1- --rr
LFSR_TAG_MDIR 0x0324 v--- --11 --1- -1rr
LFSR_TAG_MTREE 0x032c v--- --11 --1- 11rr
*Planned
LFSR_TAG_BRANCH is a very special tag when it comes to bshrub/btree
traversal, so I think it deserves the subtype=0 slot.
This also just makes everything fit together better, and makes room
for the future planned ddkey tag.
Code changes minimal:
code stack ctx
before: 35728 2440 640
after: 35732 (+0.0%) 2440 (+0.0%) 640 (+0.0%)

7dd473df82
Tweaked LFSR_TAG_STICKYNOTE encoding 0x205 -> 0x203
Now that LFS_TYPE_STICKYNOTE is a real type users can interact with, it
makes sense to group it with REG/DIR. This also has the side-effect of
making these contiguous.
---
LFSR_TAG_BOOKMARKs, however, are still hidden from the user. This
unfortunately means there will be a bit of a jump if we ever add
LFS_TYPE_SYMLINK in the future, but I'm starting to wonder if that's the
best way to approach symlinks in littlefs...
If instead LFS_TYPE_SYMLINKs were implied via a custom attribute, you
could avoid the headache that comes with adding a new tag encoding, and
allow perfect compatibility with non-symlink drivers. Win win.
This seems like a better approach for _all_ of the theoretical future
types (compressed files, device files, etc), and avoids the risk of
oversaturating the type space.
---
This had a surprising impact on code for just a minor encoding tweak. I
guess the contiguousness pushed the compiler to use tables/ranges for
more things? Or maybe 3 vs 5 is just an easier constant to encode?
code stack ctx
before: 35952 2440 640
after: 35928 (-0.1%) 2440 (+0.0%) 640 (+0.0%)

bd70270e11
scripts: Added -w/--word-bits to bound dbgleb128/dbgle32 parsing
This is limited to dbgle32.py, dbgleb128.py, and dbgtag.py for now.

This more closely matches how littlefs behaves, in that we read a bounded number of bytes before leb128 decoding. This minimizes bugs related to leb128 overflow and avoids reading inherently undecodable data. The previous unbounded behavior is still available with -w0.

Note this gives dbgle32.py much more flexibility in that it can now decode other integer widths. Uh, ignore the name for now. At least it's self-documenting that the default is 32-bits...

---

Also fixed a bug in fromleb128 where size was reported incorrectly on offset + truncated leb128.

0cea8b96fb
scripts: Fixed O(n^2) slicing in Rbyd.fetch
Do you see the O(n^2) behavior in this loop?
    j = 0
    while j < len(data):
        word, d = fromleb(data[j:])
        j += d
The slice, data[j:], creates an O(n) copy every iteration of the loop.
A bit tricky. Or at least I found it tricky to notice. Maybe because
array indexing being cheap is baked into my brain...
Long story short, this repeated slicing resulted in O(n^2) behavior in
Rbyd.fetch and probably some other functions. Even though we don't care
_too_ much about performance in these scripts, having Rbyd.fetch run in
O(n^2) isn't great.
Tweaking all from* functions to take an optional index solves this, at
least on paper.
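
With the tweak, the loop becomes (sketch):

    j = 0
    while j < len(data):
        # pass an index instead of slicing, avoiding the O(n) copy
        word, d = fromleb(data, j)
        j += d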
---
In practice I didn't actually find any measurable performance gain. I
guess array slicing in Python is optimized enough that the constant
factor takes over?
(Maybe it's being helped by us limiting Rbyd.fetch to block_size in most
scripts? I haven't tested NAND block sizes yet...)
Still, it's good to at least know this isn't a bottleneck.

b5c3b97ae1
scripts: Reworked dbgtag.py, added -i/--input, included hex in output
This just gives dbgtag.py a few more bells and whistles that may be
useful:
- Can now parse multiple tags from hex:
$ ./scripts/dbgtag.py -x 71 01 01 01 12 02 02 02
71 01 01 01 altrgt 0x101 w1 -1
12 02 02 02 shrubdir w2 2
Note this _does_ skip attached data, which risks some confusion, but
_not_ skipping attached data would probably end up printing a bunch of
garbage for most use cases:
$ ./scripts/dbgtag.py -x 01 01 01 04 02 02 02 02 03 03 03 03
01 01 01 04 gdelta 0x01 w1 4
03 03 03 03 struct 0x03 w3 3
- Included hex in output. This is helpful for learning about the tag
encoding and also helps identify tags when parsing multiple tags.
I considered also including offsets, which might help with
understanding attached data, but decided it would be too noisy. At
some point you should probably jump to dbgrbyd.py anyways...
- Added -i/--input to read tags from a file. This is roughly the same as
-x/--hex, but allows piping from other scripts:
$ ./scripts/dbgcat.py disk -b4096 0 -n4,8 | ./scripts/dbgtag.py -i-
80 03 00 08 magic 8
Note this reads the entire file in before processing. We'd need to fit
everything into RAM anyways to figure out padding.

270230a833
scripts: Adopted del to resolve shadowed builtins
So:

    all_ = all; del all

Instead of:

    import builtins
    all_, all = all, builtins.all

The del exposes the globally scoped builtin we accidentally shadow. This requires less magic, and no module imports, though tbh I'm surprised it works.

It also works in the case where you change a builtin globally, but that's a bit too crazy even for me...

97b6489883
scripts: Reworked dbglfs.py, adopted Lfs, Config, Gstate, etc
I'm starting to regret these reworks. They've been a big time sink. But at least these should be much easier to extend with the future planned auxiliary trees?

New classes:

- Bptr - A representation of littlefs's data-only block pointers. Extra fun is the lazily checked Bptr.__bool__ method, which should prevent slowing down scripts that don't actually verify checksums.

- Config - The set of littlefs config entries.

- Gstate - The set of littlefs gstate.

  I may have had too much fun with Config and Gstate. Not only do these provide lookup functions for config/gstate, but known config/gstate get lazily parsed classes that can provide easy access to the relevant metadata. These even abuse Python's __subclasses__, so all you need to do to add a new known config/gstate is extend the relevant Config.Config/Gstate.Gstate class. The __subclasses__ API is a weird but powerful one (see the sketch below).

- Lfs - The big one, a high-level abstraction of littlefs itself. Contains subclasses for known files: Lfs.Reg, Lfs.Dir, Lfs.Stickynote, etc, which can be accessed by path, did+name, mid, etc. It even supports iterating over orphaned files, though it's expensive (but incredibly valuable for debugging!).

  Note that all file types can currently have attached bshrubs/btrees. In the existing implementation only reg files should actually end up with bshrubs/btrees, but the whole point of these scripts is to debug things that _shouldn't_ happen.

  I intentionally gave up on providing depth bounds in Lfs. Too complicated for something so high-level.

One noteworthy change is not recursing into directories by default. This hopefully avoids overloading new users and matches the behavior of most other Linux/Unix tools.

This adopts -r/--recurse/--file-depth for controlling how far to recurse down directories, and -z/--depth/--tree-depth for controlling how far to recurse down tree structures (mostly files).

I like this API. It's consistent with -z/--depth in the other dbg scripts, and -r/--recurse is probably intuitive for most Linux/Unix users. To make this work we did need to change -r/--raw -> -x/--raw. But --raw is already a bit of a weird name for what really means "include a hex dump".

Note that -z/--depth/--tree-depth does _not_ imply --files. Right now only files can contain tree structures, but this will change when we get around to adding the auxiliary trees.

This also adds the ability to specify a file path to use as the root directory, though we need the leading slash to disambiguate file paths and mroot addresses.

---

Also tagrepr has been tweaked to include the global/delta names, toggleable with the optional global_ kwarg.

Rattr now has its own lazy parsers for did + name. A more organized codebase would probably have a separate Name type, but it just wasn't worth the hassle.

And the abstraction classes have all been tweaked to require the explicit Rbyd.repr() function for a CLI-friendly representation. Relying on __str__ hurt readability and debugging, especially since Python prefers __str__ over __repr__ when printing things.
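
The __subclasses__ registration pattern, roughly (a sketch with hypothetical names, not the actual script code):

    class Config:
        class Config:
            # base class for known config entries
            tag = None

            def __init__(self, data):
                self.data = data

        def lookup(self, tag, data):
            # scan subclasses for a known config entry; adding a new
            # known config is just a matter of subclassing
            # Config.Config
            for cls in Config.Config.__subclasses__():
                if cls.tag == tag:
                    return cls(data)
            return None

    class Magic(Config.Config):
        tag = 0x0034  # hypothetical tag value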

73127470f9
scripts: Adopted rbydaddr/tagrepr changes across scripts
Just some minor tweaks:

- rbydaddr: Return list instead of tuple, note we rely on the type distinction in Rbyd.fetch now.
- tagrepr: Rename w -> weight.

262ad7c08e
scripts: Simplified dbgtag.py, tweaked -x/--hex decoding
This drops the option to read tags from a disk file. I don't think I've ever used this, and it requires quite a bit of circuitry to implement.

Also dropped -s/--string, because most tags can't be represented as strings?

And tweaked -x/--hex flags to correctly parse spaces in arguments, so now these are equivalent:

- ./scripts/dbgtag.py -x 00 03 00 08
- ./scripts/dbgtag.py -x "00 03 00 08"

68f0534dd0
rbyd: Dropped special altn/alta encoding
altas, and to a lesser extent altns, are just too problematic for our
rbyd-append algorithm.
Main issue is these break our "narrowing" invariant, where each alt only
ever decreases the bounds.
I wanted to use altas to simplify lfsr_rbyd_appendcompaction, but
decided it wasn't worth it. Handling them correctly would require adding
a number of special cases to lfsr_rbyd_appendrat, adding complexity to
an already incredibly complex function.
---
Fortunately, we don't really need altns/altas on-disk, but we _do_ need
a way to mark alts as unreachable internally in order to know when we
can collapse alts when recoloring (at this point bounds information is
lost).
I was originally going to use the alt's sign bit for this, but it turns
out we already have this information thanks to setting jump=0 to assert
that an alt is unreachable. So no explicit flag needed!
This ends up saving a surprising amount of code for what is only a
couple lines of changes:
code stack ctx
before: 38512 2624 640
after: 38440 (-0.2%) 2624 (+0.0%) 640 (+0.0%)

1c5adf71b3
Implemented self-validating global-checksums (gcksums)
This was quite a puzzle.
The problem: How do we detect corrupt mdirs?
Seems like a simple question, but we can't just rely on mdir cksums. Our
mdirs are independently updateable logs, and logs have this annoying
tendency to "rollback" to previously valid states when corrupted.
Rollback issues aren't littlefs-specific, but what _is_ littlefs-
specific is that when one mdir rolls back, it can disagree with other
mdirs, resulting in wildly incorrect filesystem state.
To solve this, or at least protect against disagreeable mdirs, we need
to somehow include the state of all other mdirs in each mdir commit.
---
The first thought: Why not use gstate?
We already have a system for storing distributed state. If we add the
xor of all of our mdir cksums, we can rebuild it during mount and verify
that nothing changed:
 .--------.     .--------.     .--------.     .--------.
.| mdir 0 |    .| mdir 1 |    .| mdir 2 |    .| mdir 3 |
||        |    ||        |    ||        |    ||        |
|| gdelta |    || gdelta |    || gdelta |    || gdelta |
|'-----|--'    |'-----|--'    |'-----|--'    |'-----|--'
'------|-'     '------|-'     '------|-'     '------|-'
'--.------'    '--.------'    '--.------'    '--.------'
cksum  |       cksum  |       cksum  |       cksum  |
   |   |          v   |          v   |          v   |
   '-----------> xor ---------> xor ---------> xor --------> gcksum
       |              v              v              v           =?
       '-----------> xor ---------> xor ---------> xor ----> gcksum
Unfortunately it's not that easy. Consider what this looks like
mathematically (g is our gcksum, c_i is an mdir cksum, d_i is a
gcksumdelta, and +/-/sum is xor):
g = sum(c_i) = sum(d_i)
If we solve for a new gcksumdelta, d_i:
d_i = g' - g
d_i = g + c_i - g
d_i = c_i
The gcksum cancels itself out! We're left with an equation that depends
only on the current mdir, which doesn't help us at all.
Next thought: What if we permute the gcksum with a function t before
distributing it over our gcksumdeltas?
 .--------.     .--------.     .--------.     .--------.
.| mdir 0 |    .| mdir 1 |    .| mdir 2 |    .| mdir 3 |
||        |    ||        |    ||        |    ||        |
|| gdelta |    || gdelta |    || gdelta |    || gdelta |
|'-----|--'    |'-----|--'    |'-----|--'    |'-----|--'
'------|-'     '------|-'     '------|-'     '------|-'
'--.------'    '--.------'    '--.------'    '--.------'
cksum  |       cksum  |       cksum  |       cksum  |
   |   |          v   |          v   |          v   |
   '-----------> xor ---------> xor ---------> xor --------> gcksum
       |              |              |              |      .---t---'
       |              |              |              |      '-> t(gcksum)
       |              v              v              v           =?
       '-----------> xor ---------> xor ---------> xor --> t(gcksum)
In math terms:
t(g) = t(sum(c_i)) = sum(d_i)
In order for this to work, t needs to be non-linear. If t is linear, the
same thing happens:
d_i = t(g') - t(g)
d_i = t(g + c_i) - t(g)
d_i = t(g) + t(c_i) - t(g)
d_i = t(c_i)
This was quite funny/frustrating (funnistrating?) during development,
because it means a lot of seemingly obvious functions don't work!
- t(g) = g - Doesn't work
- t(g) = crc32c(g) - Doesn't work because crc32cs are linear
- t(g) = g^2 in GF(2^n) - g^2 is linear in GF(2^n)!?
Fortunately, powers coprime with 2 finally give us a non-linear function
in GF(2^n), so t(g) = g^3 works:
d_i = g'^3 - g^3
d_i = (g + c_i)^3 - g^3
d_i = (g^2 + gc_i + gc_i + c_i^2)(g + c_i) - g^3
d_i = (g^2 + c_i^2)(g + c_i) - g^3
d_i = g^3 + gc_i^2 + g^2c_i + c_i^3 - g^3
d_i = gc_i^2 + g^2c_i + c_i^3
---
Bleh, now we need to implement finite-field operations? Well, not
entirely!
Note that our algorithm never uses division. This means we don't need a
full finite-field (+, -, *, /), but can get away with a finite-ring (+,
-, *). And conveniently for us, our crc32c polynomial defines a ring
epimorphic to a 31-bit finite-field.
All we need to do is define crc32c multiplication as polynomial
multiplication mod our crc32c polynomial:
crc32cmul(a, b) = pmod(pmul(a, b), P)
And since crc32c is more-or-less just pmod(x, P), this lets us take
advantage of any crc32c hardware/tables that may be available.
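
For reference, the whole thing is only a few lines of Python (a sketch; this ignores the bit-reflection conventions of real crc32c implementations):

    CRC32C_P = 0x11edc6f41  # crc32c polynomial, with the implicit x^32 bit

    def pmul(a, b):
        # carry-less multiplication of GF(2) polynomials
        r = 0
        while b:
            if b & 1:
                r ^= a
            a <<= 1
            b >>= 1
        return r

    def pmod(a, p):
        # remainder of GF(2) polynomial division
        while a.bit_length() >= p.bit_length():
            a ^= p << (a.bit_length() - p.bit_length())
        return a

    def crc32cmul(a, b):
        return pmod(pmul(a, b), CRC32C_P)

    def gcksumdelta(g, c):
        # d = gc^2 + g^2c + c^3, all in our crc32c ring
        return (crc32cmul(g, crc32cmul(c, c))
            ^ crc32cmul(crc32cmul(g, g), c)
            ^ crc32cmul(c, crc32cmul(c, c)))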
---
Bunch of notes:
- Our 2^n-bit crc-ring maps to a 2^n-1-bit finite-field because our crc
polynomial is defined as P(x) = Q(x)(x + 1), where Q(x) is a 2^n-1-bit
irreducible polynomial.
This is a common crc construction as it provides optimal odd-bit/2-bit
error detection, so it shouldn't be too difficult to adapt to other
crc sizes.
- t(g) = g^3 is not the only function that works, but it turns out to be
a pretty good one:
- 3 and 2^(2^n-1)-1 are coprime, which means our function t(g) = g^3
provides a one-to-one mapping in the underlying fields of all crc
rings of size 2^(2^n).
We know 3 and 2^(2^n-1)-1 are coprime because 2^(2^n-1)-1 =
2^(2^n)-1 (a product of Fermat numbers, A023394) - 2^(2^n-1) (a
power-of-2), and 3 divides every such Fermat product >= 3 while
dividing no power-of-2.
- Our delta, when viewed as a polynomial in g: d(g) = gc^2 + g^2c +
c^3, has degree 2, which implies there are at most 2 solutions or
1-bit of information loss in the underlying field.
This is optimal since the original definition already had 2
solutions before we even chose a function:
d(g) = t(g + c) - t(g)
d(g) = t(g + c) - t((g + c) - c)
d(g) = t((g + c) + c) - t(g + c)
d(g) = d(g + c)
Though note the mapping of our crc-ring to the underlying field
already represents 1-bit of information loss.
- If you're using a cryptographic hash or other non-crc, you should
probably just use an equal sized finite-field.
Though note changing from a 2^n-1-bit field to a 2^n-bit field does
change the math a bit, with t(g) = g^7 being a better non-linear
function:
- 7 is the smallest odd-number coprime with 2^n-1, a Fermat number,
which makes t(g) = g^7 a one-to-one mapping.
3 humorously divides all 2^n-1 Fermat numbers.
- Expanding delta with t(g) = g^7 gives us a 6 degree polynomial,
which implies at most 6 solutions or ~3-bits of information loss.
This isn't actually the best you can do, some exhaustive searching
over small fields (<=2^16) suggests t(g) = g^(2^(n-1)-1) _might_ be
optimal, but that's a heck of a lot more multiplications.
- Because our crc32cs preserve parity/are epimorphic to parity bits,
addition (xor) and multiplication (crc32cmul) also preserve parity,
which can be used to show our entire gcksum system preserves parity.
This is quite neat, and means we are guaranteed to detect any odd
number of bit-errors across the entire filesystem.
- Another idea was to use two different addition operations: xor and
overflowing addition (or mod a prime).
This probably would have worked, but lacks the rigor of the above
solution.
- You might think an RS-like construction would help here, where g =
sum(c_ia^i), but this suffers from the same problem:
d_i = g' - g
d_i = g + c_ia^i - g
d_i = c_ia^i
Nothing here depends on anything outside of the current mdir.
- Another question is should we be using an RS-like construction anyways
to include location information in our gcksum?
Maybe in another system, but I don't think it's necessary in littlefs.
While our mdirs are independently updateable, they aren't _entirely_
independent. The location of each mdir is stored in either the mtree
or a parent mdir, so it always gets mixed into the gcksum somewhere.
The only exception being the mrootanchor which is always at the fixed
blocks 0x{0,1}.
- This does _not_ catch "global-rollback" issues, where the most recent
commit in the entire filesystem is corrupted, revealing an older, but
still valid, filesystem state.
But as far as I am aware this is just a fundamental limitation of
powerloss-resilient filesystems, short of doing destructive
operations.
At the very least, exposing the gcksum would allow the user to store
it externally and prevent this issue.
---
Implementation details:
- Our gcksumdelta depends on the rbyd's cksum, so there's a catch-22 if
we include it in the rbyd itself.
We can avoid this by including it in the commit tags (actually the
separate canonical cksum makes this easier than it would have been
earlier), but this does mean LFSR_TAG_GCKSUMDELTA is not an
LFSR_TAG_GDELTA subtype. Unfortunate but not a dealbreaker.
- Reading/writing the gcksumdelta gets a bit annoying with it not being
in the rbyd. For now I've extended the low-level lfsr_rbyd_fetch_/
lfsr_rbyd_appendcksum_ to accept an optional gcksumdelta pointer,
which is a bit awkward, but I don't know of a better solution.
- Unlike the grm, _every_ mdir commit involves the gcksum, which means
we either need to propagate the gcksumdelta up the mroot chain
correctly, or somehow keep track of partially flushed gcksumdeltas.
To make this work I modified the low-level lfsr_mdir_commit__
functions to accept start_rid=-2 to indicate when gcksumdeltas should
be flushed.
It's a bit of a hack, but I think it might make sense to extend this
to all gdeltas eventually.
The gcksum costs both code and RAM, but I think it's well worth it for
removing an entire category of filesystem corruption:
code stack ctx
before: 37796 2608 620
after: 38428 (+1.7%) 2640 (+1.2%) 644 (+3.9%)

b6ab323eb1
Dropped the q-bit (previous-perturb) from cksum tags
Now that we perturb commit cksums with the odd-parity zero, the q-bit no
longer serves a purpose other than extra debug info. But this is a
double-edged sword, because redundant info just means another thing that
can go wrong.
For example, should we assert? If the q-bit doesn't reflect the
previous-perturb state it's a bug, but the only thing that would break
would be the q-bit itself. And if we don't assert what's the point of
keeping the q-bit around?
Dropping the q-bit avoids answering this question and saves a bit of
code:
code stack ctx
before: 37772 2608 620
after: 37768 (-0.0%) 2608 (+0.0%) 620 (+0.0%)

0adec7f15c
scripts: Replaced __builtins__ with builtins
Apparently __builtins__ is a CPython implementation detail, and behaves differently when executed vs imported???

import builtins is the correct way to go about this.

66bf005bb8
Renamed LFSR_TAG_ORPHAN -> LFSR_TAG_STICKYNOTE
I've been unhappy with LFSR_TAG_ORPHAN for a while now. While it's true these represent orphaned files, they also represent zombied files. And as long as a reference to the file exists in-RAM, I find it hard to say these files are truly "orphaned". We're also just using the term "orphan" for too many things.

Really this tag just represents an mid reservation. The term stickynote works well enough for this, and fits in with the other internal tag, LFSR_TAG_BOOKMARK.

62cc4dbb14
scripts: Disabled local import hack on import
Moved local import hack behind if __name__ == "__main__".

These scripts aren't really intended to be used as python libraries. Still, it's useful to import them for debugging and to get access to their juicy internals.
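
Concretely (reconstructed from the pattern described in the csv.py rename below):

    if __name__ == "__main__":
        # prevent local imports, but only when run as a script
        __import__('sys').path.pop(0)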

7cfcc1af1d
scripts: Renamed summary.py -> csv.py
This seems like a more fitting name now that this script has evolved
into more of a general purpose high-level CSV tool.
Unfortunately this does conflict with the standard csv module in Python,
breaking every script that imports csv (which is most of them).
Fortunately, Python is flexible enough to let us remove the current
directory before imports with a bit of an ugly hack:
    # prevent local imports
    __import__('sys').path.pop(0)
These scripts are intended to be standalone anyways, so this is probably
a good pattern to adopt.

e3fdc3dbd7
scripts: Added simple mroot cycle detectors to dbg scripts
These work by keeping a set of all seen mroots as we descend down the mroot chain. Simple, but it works.

The downside of this approach is that the mroot set grows unbounded, but it's unlikely we'll ever have enough mroots in a system for this to really matter.

This fixes scripts like dbgbmap.py getting stuck on intentional mroot cycles created for testing. It's not a problem for a foreground script to get stuck in an infinite loop, since you can just kill it, but a background script getting stuck at 100% CPU is a bit more annoying.
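
The detector itself is tiny (a sketch, hypothetical names):

    seen = set()
    while mroot:
        if tuple(sorted(mroot.blocks)) in seen:
            # intentional or not, stop descending on mroot cycles
            break
        seen.add(tuple(sorted(mroot.blocks)))
        # descend down the mroot chain
        mroot = mroot.lookup_child_mroot()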

007ac97bec
scripts: Adopted double-indent on multiline expressions
This matches the style used in C, which is good for consistency:
    a_really_long_function_name(
            double_indent_after_first_newline(
                single_indent_nested_newlines))
We were already doing this for multiline control-flow statements, simply
because I'm not sure how else you could indent this without making
things really confusing:
    if a_really_long_function_name(
            double_indent_after_first_newline(
                single_indent_nested_newlines)):
        do_the_thing()
This was the only real difference style-wise between the Python code and
C code, so now both should be following roughly the same style (80 cols,
double-indent multiline exprs, prefix multiline binary ops, etc).

4d8bfeae71
attrs: Reduced UATTR/SATTR range down to 7-bits
It would be nice to have a full 8-bit range for both user attrs and system attrs, for both backwards compatibility and maximizing the available attr space, but I think it just doesn't make sense from an API perspective.

Sure we could finagle the user/sys bit into a flags argument, or provide separate lfsr_getuattr/getsattr functions, but asking users to use a 9-bit int for higher-level operations (dynamic attrs, iteration, etc) is a bit much...

So this reduces the two attr ranges down to 7-bits, requiring 8-bits total to store all possible attr types in the current system:

TAG_ATTR  0x0400 v--- -1-a -aaa aaaa
TAG_UATTR 0x04aa v--- -1-- -aaa aaaa
TAG_SATTR 0x05aa v--- -1-1 -aaa aaaa

This really just affects scripts, since we haven't actually implemented attributes yet. Worst case we still have the 9-bit encoding space carved out, so we can always add an additional set of attrs in the future if we start running into attr pressure.

Or, you know, just turn on the subtype leb128 encoding the 8th subtype bit is reserved for. Then you'd only be limited by internal driver details, probably 24-bits per attr range if we make tags 32-bits internally. Though this would probably come with quite a code cost...
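
A sketch of the resulting encoding (hypothetical helper, following the bit layout above):

    def attr_tag(type, sys=False):
        # 7-bit attr type, with the 0x100 bit selecting user vs system
        assert 0 <= type <= 0x7f
        return 0x0400 | (0x0100 if sys else 0x0000) | type

    assert attr_tag(0x2a) == 0x042a            # TAG_UATTR
    assert attr_tag(0x2a, sys=True) == 0x052a  # TAG_SATTR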

c739e18f6f
Renamed LFSR_TAG_NOISE -> LFSR_TAG_NOTE
Sort of like SHT_NOTE in elf files, but with no defined format.

Using LFSR_TAG_NOTE for additional noise/nonces is still encouraged, but it can also be used to add debug info.

ae0e3348fe
Added -l/--list to dbgtag.py
Inspired by errno's/dbgerr.py's -l/--list, this gives a quick and easy list of the current tag encodings, which can be very useful:

$ ./scripts/dbgtag.py -l
LFSR_TAG_NULL    0x0000 v--- ---- ---- ----
LFSR_TAG_CONFIG  0x00tt v--- ---- -ttt tttt
LFSR_TAG_MAGIC   0x0003 v--- ---- ---- --11
LFSR_TAG_VERSION 0x0004 v--- ---- ---- -1--
... snip ...

We already need to keep dbgtag.py in-sync or risk a bad debugging experience, so we might as well let it tell us all the information it currently knows.

Also yay for self-inspecting code, I don't know if it's bad that I'm becoming a fan of parsing information out of comments...

898f916778
Fixed pl hole in perturb logic
Turns out there's a very _very_ small powerloss hole in our current
perturb logic.
We rely on tag valid bits to validate perturb bits, but these
intentionally don't end up in the commit checksum. This means there will
always be a powerloss hole when we write the last valid bit. If we lose
power after writing that bit, suddenly the remaining commit and any
following commits may appear as valid.
Now, this is really unlikely considering we need to lose power exactly
when we write the cksum tag's valid bit, and our nonce helps protect
against this. But a hole is a hole.
The solution here is to include the _current_ perturb bit (q) in the
commit's cksum tag, alongside the _next_ perturb bit (p). This will be
included in the commit's checksum, but _not_ in the canonical checksum,
allowing the commit's checksum to validate the current perturb state
without ruining our erased-state agnostic checksums:
.---+---+---+---.
|v| tag         |
+---+---+---+---+
| commit        |
|               |
+---+---+---+---+
|v|qp| tag      |  <-- q validates the _current_ perturb
+---+---+---+---+      state, p perturbs the _next_
| cksum         |      commit's valid bits
+---+---+---+---+
| padding       |
|               |
+---+---+---+---+
| erased        |
|               |
.               .
.               .
(Ok maybe this diagram needs work...)
This adds another thing that needs to be checked during rbyd fetch, and
note, we _do_ need to explicitly check this, but it solves the problem.
If power is lost after v, q would be invalid, and if power is lost after
q, our cksum would be invalid.
Note this would have also been an issue for the previous cksum + parity
perturb scheme.
Code changes:
code stack
before: 33570 2592
after: 33598 (+0.1%) 2592 (+0.0%)

8a4f6fcf68
Adopted a simpler rbyd perturb scheme
The previous cksum + parity scheme worked, but needing to calculate both
cksum + parity on slightly different sets of metadata felt overly
complicated. After taking a step back, I've realized the problem is that
we're trying to force perturb effects to be implicit via the parity. If we
instead actually implement perturb effects explicitly, things get quite
a bit simpler...
This does add a bit more logic to the read path, but I don't think it's
worse than the mess we needed to parse separate cksum + parity.
Now, the perturb bit has the explicit behavior of inverting all tag
valid bits in the following commit. Which is conveniently the same as
xoring the crc32c with 00000080 before parsing each tag:
.---+---+---+---.
|v| tag         |
+---+---+---+---+
| commit        |
|               |
+---+---+---+---+
|v|p| tag       | --.
+---+---+---+---+   |
| cksum         |   |  p inverts all of the next
+---+---+---+---+   |  commit's valid bits
| padding       |   |
|               |   |
+---+---+---+---+   |
|v| tag <-----------+
+---+---+---+---+   |
| commit        |   |
|               |   |
+---+---+---+---+   |
|v|p| tag <---------'
+---+---+---+---+
| cksum         |
+---+---+---+---+
| padding       |
|               |
+---+---+---+---+
| erased        |
|               |
.               .
.               .
With this scheme, we don't need to calculate a separate parity, because
each valid bit effectively validates the current state of the perturb
bit.
We also don't need extra logic to omit valid bits from the cksum,
because flipping all valid bits effectively makes perturb=0 the
canonical metadata encoding and cksum.
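
Sketching the read path (hypothetical; assumes, as described in the erased-state cksum work below, that each valid bit matches the parity of the running crc32c):

    def parity(x):
        return bin(x).count('1') & 1

    def expected_valid(cksum, perturb):
        # xoring the running crc32c with 0x00000080 flips exactly one
        # bit, and therefore its parity, inverting all following
        # valid bits
        if perturb:
            cksum ^= 0x00000080
        return parity(cksum)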
---
I also considered only inverting the first valid bit, which would have
the additional benefit of allowing entire commits to be crc32ced at
once, but since we don't actually track when we've started a commit
this turned out to be quite a bit more complicated than I thought.
We need some way to validate the first valid bit, otherwise it could be
flipped by a failed prog and we'd never notice. This is fine, we can
store a copy of the previous perturb bit in the next cksum tag, but it
does mean we need to track the perturb bit for the duration of the
commit. So we'd end up needing to track both start-of-commit and the
perturb bit state, which starts getting difficult to fit into our rbyd
struct...
It's easier and simpler to just flip every valid bit. As a plus this
means every valid bit contributes to validating the perturb bit.
---
Also renamed LFSR_TAG_PERTURB -> LFSR_TAG_NOISE just to avoid confusion.
Though not sure if this tag should stick around...
The end result is a nice bit of code/stack savings, which is what we'd
expect with a simpler scheme:
code stack
before: 33746 2600
after: 33570 (-0.5%) 2592 (-0.3%)

11c948678f
Renamed size_limit -> file_limit
This limits the maximum size of a file, which also implies the maximum integer size required to mount.

The exact name is a bit of a toss-up. I originally went with size_limit to avoid confusion around whether file_limit reflected the file size or the number of files, but since this ends up mapping to lfs_off_t and _not_ lfs_size_t, I think size_limit may be a bit of a bad choice.

8a75a68d8b
Made rbyd cksums erased-state agnostic
Long story short, rbyd checksums are now fully reproducible. If you
write the same set of tags to any block, you will end up with the same
checksum.
This is actually a bit tricky with littlefs's constraints.
---
The main problem boils down to erased-state. littlefs has a fairly
flexible model for erased-state, and this brings some challenges. In
littlefs, storage goes through 2 states:
1. Erase - Prepare storage for progging. Reads after an erase may return
arbitrary, but consistent, values.
2. Prog - Program storage with data. Storage must be erased and no progs
attempted. Reads after a prog must return the new data.
Note in this model erased-state may not be all 0xffs, though it likely
will be for flash. This allows littlefs to support a wide range of
other storage devices: SD, RAM, NVRAM, encryption, ECC, etc.
But this model also means erased-state may be different from block to
block, and even different on later erases of the same block.
And if that wasn't enough of a challenge, _erased-state can contain
perfectly valid commits_. Usually you can expect arbitrary valid cksums
to be rare, but thanks to SD, RAM, etc, modeling erase as a noop, valid
cksums in erased-state are actually very common.
So how do we manage erased-state in our rbyds?
First we need some way to detect it, since we can't prog if we're not
erased. This is accomplished by the forward-looking erased-state cksum
(ecksum):
.---+---+---+---. \
| commit        | |
|               | |
|               | |
+---+---+---+---+ +-.
| ecksum -------. | |   <-- ecksum - cksum of erased state
+---+---+---+---+ | / |
| cksum --------|---'   <-- cksum - cksum of commit,
+---+---+---+---+ |         including ecksum
| padding       | |
|               | |
+---+---+---+---+ \ |
| erased        | +-'
|               | /
.               .
.               .
You may have already noticed the start of our problems. The ecksum
contains the erased-state, which is different per-block, and our rbyd
cksum contains the ecksum. We need to include the ecksum so we know if
it's valid, but this means our rbyd cksum changes block to block.
Solving this is simple enough: Stop the rbyd's canonical cksum before
the ecksum, but include the ecksum in the actual cksum we write to disk.
Future commits will need to start from the canonical cksum, so the old
ecksum won't be included in new commits, but this shouldn't be a
problem:
.---+---+---+---. \
| commit        | |
|               | |
|               | |
+---+---+---+---+ |
| ecksum        | .   <-- the old ecksum/cksum/padding are
+---+---+---+---+ .       included in this commit's on-disk
| cksum         | .       cksum, but skipped by the
+---+---+---+---+ .       canonical cksum
| padding       | .
|               | .
+---+---+---+---+ +- rbyd (canonical) cksum
| commit        | |
|               | |
|               | /
+---+---+---+---+
| ecksum        |     <-- on-disk cksum = canonical cksum
+---+---+---+---+         + ecksum
| cksum         |
+---+---+---+---+
| padding       |
|               |
+---+---+---+---+
| erased        |
|               |
.               .
.               .
The second challenge is the pesky possibility of existing valid commits.
We need some way to ensure that erased-state following a commit does not
accidentally contain a valid old commit.
This is where our tags' valid bits come into play: The valid bit of each
tag must match the parity of all preceding tags (equivalent to the
parity of the crc32c), and we can use some perturb bits in the cksum tag
to make sure any tags in our erased-state do _not_ match:
.---+---+---+---.
|v| tag         |   <-- the valid bit v of each tag must
+---+---+---+---+       match the parity of all preceding
| commit        |       tags
|               |
+---+---+---+---+
|v|p| tag       |   <-- perturb bits p in the cksum tag
+---+---+---+---+       force a parity that any tags in
| cksum         |       the erased-state do _not_ match
+---+---+---+---+
| padding       |
|               |
+---+---+---+---+
|v---------------- != parity of all preceding tags
| erased        |
.               .
.               .
New problem! The rbyd cksum contains the valid bits, which contain the
perturb bits, which depend on the erased-state!
And you can't just derive the valid bits from the rbyd's canonical
cksum. This avoids erased-state poisoning, sure, but then nothing in the
new commit depends on the perturb bits! The catch-22 here is that we
need the valid bits to both depend on, and ignore, the erased-state
poisoned perturb bits.
As far as I can tell, the only way around this is to make the rbyd's
canonical cksum not include the parity bits. Which is annoying, masking
out bits is not great for bulk cksum calculation...
But this does solve our problem:
.---+---+---+---. \
|v| tag         | o   <-- same as above, but with the
+---+---+---+---+ |       valid bits v masked out (o) of
| commit        | |       the rbyd's canonical cksum
|               | |
+---+---+---+---+ |
|v|p| tag       | o
+---+---+---+---+ /
| cksum         |
+---+---+---+---+
| padding       |
|               |
+---+---+---+---+
|v---------------- != parity
| erased        |
.               .
.               .
Note that because each commit's cksum derives from the canonical cksum,
the valid bits and commit cksums no longer contain the same data, so our
parity(m) = parity(crc32c(m)) trick no longer works.
However our crc32c still does tell us a bit about each tag's parity, so
with a couple well-placed xors we can at least avoid needing two
parallel calculations:
cksum' = crc32c(cksum, m)
valid' = parity(cksum' xor cksum) xor valid
This also means our commit cksums don't include any information about
the valid bits, since we mask these out before cksum calculation. Which
is a bit concerning, but as far as I can tell not a real problem.
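
Per tag, the bookkeeping looks something like this (a sketch, assuming a crc32c(cksum, data) helper):

    def parity(x):
        return bin(x).count('1') & 1

    def append_tag(cksum, valid, data):
        cksum_ = crc32c(cksum, data)
        # parity(cksum_ ^ cksum) is the parity of data's contribution
        # to the crc, letting us track commit parity without a second
        # parallel calculation
        valid_ = parity(cksum_ ^ cksum) ^ valid
        return cksum_, valid_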
---
An alternative design would be to just keep track of two cksums: A
commit cksum and a canonical cksum.
This would be much simpler, but would also require storing two cksums in
RAM in our lfsr_rbyd_t struct. A bit annoying for our 4-byte crc32cs,
and a bit more than a bit annoying for hypothetical 32-byte sha256s.
It's also not entirely clear how you would update both crc32cs
efficiently. There is a way to xor out the initial state before each
tag, but I think it would still require O(n) cycles of crc32c
calculation...
As it is, the extra bit needed to keep track of commit parity is easy
enough to sneak into some unused sign bits in our lfsr_rbyd_t struct.
---
I've also gone ahead and mixed in the current commit parity into our
cksum's perturb bits, so the commit cksum at least contains _some_
information about the previous parity.
But it's not entirely clear this actually adds anything. Our perturb
bits aren't _required_ to reflect the commit parity, so a very unlucky
power-loss could in theory still make a cksum valid for the wrong
parity.
At least this situation will be caught by later valid bits...
I've also carved out a tag encoding, LFSR_TAG_PERTURB, solely for adding
more perturb bits to commit cksums:
LFSR_TAG_CKSUM 0x3cpp v-11 cccc -ppp pppp
LFSR_TAG_CKSUM 0x30pp v-11 ---- -ppp pppp
LFSR_TAG_PERTURB 0x3100 v-11 ---1 ---- ----
LFSR_TAG_ECKSUM 0x3200 v-11 --1- ---- ----
LFSR_TAG_GCKSUMDELTA+ 0x3300 v-11 --11 ---- ----
+ Planned
This allows for more than 7 perturb bits, and could even mix in the
entire previous commit cksum, if we ever think that is worth the RAM
tradeoff.
LFSR_TAG_PERTURB also has the advantage that it is validated by the
cksum tag's valid bit before being included in the commit cksum, which
indirectly includes the current commit parity. We may eventually want to
use this instead of the cksum tag's perturb bits for this reason, but
right now I'm not sure this tiny bit of extra safety is worth the
minimum 5-byte per commit overhead...
Note if you want perturb bits that are also included in the rbyd's
canonical cksum, you can just use an LFSR_TAG_SHRUBDATA tag. Or any
unreferenced shrub tag really.
---
All of these changes required a decent amount of code, I think mostly
just to keep track of the parity bit. But the isolation of rbyd cksums
from erased-state is necessary for several future-planned features:
code stack
before: 33564 2816
after: 33916 (+1.0%) 2824 (+0.3%)

c4fcc78814
Tweaked file types/name tag encoding to be a bit less quirky
The intention behind the quirky encoding was to leverage bit 1 to
indicate if the underlying file type would be backed by the common file
B-tree data structure. Looking forward, there may be several of these
types, compressed files, contiguous files, etc, that for all intents and
purposes are just normal files interpreted differently.
But trying to leverage too many bits like this is probably going to give
us a sparse, awkward, and confusing tag encoding, so I've reverted to a
hopefully more normal encoding:
LFSR_TAG_NAME 0x02tt v--- --1- -ttt tttt
LFSR_TAG_NAME 0x0200 v--- --1- ---- ----
LFSR_TAG_REG 0x0201 v--- --1- ---- ---1
LFSR_TAG_DIR 0x0202 v--- --1- ---- --1-
LFSR_TAG_SYMLINK* 0x0203 v--- --1- ---- --11
LFSR_TAG_BOOKMARK 0x0204 v--- --1- ---- -1--
LFSR_TAG_ORPHAN 0x0205 v--- --1- ---- -1-1
LFSR_TAG_COMPR* 0x0206 v--- --1- ---- -11-
LFSR_TAG_CONTIG* 0x0207 v--- --1- ---- -111
* Hypothetical
Note the carve-out for the hypothetical symlink tag. Symlinks are
actually incredibly low in the priority list, but they are also
the only current hypothetical file type that would need to be exposed to
users. Grouping these up makes sense.
This will get a bit messy if we ever end up with a 4th user-facing type,
but there isn't another in POSIX at least (ignoring non-fs types:
socket, fifo, character, block, etc).
The gap also helps line things up so reg/orphan are a single bit flip,
and the non-user facing types all share a bit.
This had no impact on code size:
code stack
before: 33564 2816
after: 33564 (+0.0%) 2816 (+0.0%)

6e5d314c20
Tweaked struct tag encoding so b*/m* tags are earlier
These b*/m* struct tags have a common pattern that would be good to
emphasize in the encoding. The later struct tags get a bit more messy as
they leave space for future possible extensions.
New encoding:
LFSR_TAG_STRUCT 0x03tt v--- --11 -ttt ttrr
LFSR_TAG_DATA 0x0300 v--- --11 ---- ----
LFSR_TAG_BLOCK 0x0304 v--- --11 ---- -1rr
LFSR_TAG_BSHRUB 0x0308 v--- --11 ---- 1---
LFSR_TAG_BTREE 0x030c v--- --11 ---- 11rr
LFSR_TAG_MROOT 0x0310 v--- --11 ---1 --rr
LFSR_TAG_MDIR 0x0314 v--- --11 ---1 -1rr
LFSR_TAG_MSHRUB* 0x0318 v--- --11 ---1 1---
LFSR_TAG_MTREE 0x031c v--- --11 ---1 11rr
LFSR_TAG_DID 0x0320 v--- --11 --1- ----
LFSR_TAG_BRANCH 0x032c v--- --11 --1- 11rr
* Hypothetical
Note that all shrubs currently end with 1---, and all btrees, including
the awkward branch tag, end with 11rr.
This had no impact on code size:
code stack
before: 33564 2816
after: 33564 (+0.0%) 2816 (+0.0%)

5fa85583cd
Dropped block-level erased-state checksums for RAM-tracked erased-state
Unfortunately block-level erased-state checksums (becksums) don't really
work as intended.
An invalid becksum _does_ signal that a prog has been attempted, but a
valid becksum does _not_ prove that a prog has _not_ been attempted.
Rbyd ecksums work, but only thanks to a combination of prioritizing
valid commits and the use of perturb bits to force erased-state changes.
It _is_ possible to end up with an ecksum collision, but only if you
1. lose power before completing a commit, and 2. end up with a
non-trivial crc32c collision. If this does happen, at the very least the
resulting commit will likely end up corrupted and thrown away later.
Block-level becksums, at least as originally designed, don't have either
of these protections. To make matters worse, the blocks these becksums
reference contain only raw user data. Write 0xffs into a file and you
will likely end up with a becksum collision!
This is a problem for a couple of reasons:
1. Progging multiple times to erased-state is likely to result in
corrupted data, though this is also likely to get caught with
validating writes.
Worst case, the resulting data looks valid, but with weakened data
retention.
2. Because becksums are stored in the copy-on-write metadata of the
file, attempting to open a file twice for writing (or more advanced
copy-on-write operations in the future) can lead to a situation where
a prog is attempted on _already committed_ data.
This is very bad and breaks copy-on-write guarantees.
---
So clearly becksums are not fit for purpose and should be dropped. What
can we replace them with?
The first option, implemented here, is RAM-tracked erased state. Give
each lfsr_file_t its own eblock/eoff fields to track the last known good
erased-state. And before each prog, clear eblock/eoff so we never
accidentally prog to the same erased-state twice.
It's interesting to note we don't currently clear eblock/eoff in all
file handles; this is ok only because we don't currently share
eblock/eoff across file handles. Each eblock/eoff is exclusive to the
lfsr_file_t and does not appear anywhere else in the system.
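
In pseudo-Python, the idea (hypothetical names, the real code is C):

    def file_prog(file, data):
        # we can only prog known-erased storage
        assert file.eblock is not None
        block, off = file.eblock, file.eoff
        # clear eblock/eoff first, so we can never accidentally prog
        # to the same erased-state twice
        file.eblock, file.eoff = None, None
        bd_prog(block, off, data)
        # assuming the prog succeeds, the rest of the block remains
        # known-erased
        file.eblock, file.eoff = block, off + len(data)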
The main downside of this approach is that, well, the RAM-tracked
erased-state is only tracked in RAM. Block-level erased-state effectively
does not persist across reboots. I've considered adding some sort of
per-file erased-state tracking to the mdir that would need to be cleared
before use, but such a mechanism ends up quite complicated.
At the moment, I think the best second option is to put erased-state
tracking in the future-planned bmap. This would let you opt-in to
on-disk tracking of all erased-state in the system.
One nice thing about RAM-tracked erased-state is that it's not on disk,
so it's not really a compatibility concern and won't get in the way of
additional future erased-state tracking.
---
Benchmarking becksums vs RAM-tracking has been quite interesting. While
in theory becksums can track much more erased-state, it's quite unlikely
anything but the most recent erased-state actually ends up used. The end
result is no real measurable performance loss, and actually a minor
speedup because we don't need to calculate becksums on every block
write.
There are some pathological cases, such as multiple write heads, but
these are out-of-scope right now (note! multiple explicit file handles
currently handle this case beautifully because we don't share
eblock/eoff!)
Becksums were also relatively complicated, and needed extra scaffolding
to pass around/propagate as secondary tags alongside the primary bptr.
So trading these for RAM-tracking also gives us a nice bit of code/stack
savings, albeit at a 2-word RAM cost in lfsr_file_t:
code stack structs
before: 33888 2864 1096
after: 33564 (-1.0%) 2816 (-1.7%) 1104 (+0.7%)
lfsr_file_t before: 104
lfsr_file_t after: 112 (+7.7%)

86a8582445
Tweaked canonical altn to point to itself
By definition, altns should never be followed, so it doesn't really
matter where they point. But it's not like they can point literally
nowhere, so where should they point?
A couple options:
1. jump=jump - Wherever the old alt pointed
   - Easy, literally a noop
   - Unsafe, bugs could reveal outdated parts of the tree
   - Encoding size eh
2. jump=0 - Point to offset=0
   - Easier, +0 code
   - Safer, branching to 0 should assert
   - Worst possible encoding size
3. jump=itself - Point to itself
   - A bit tricky, +4 code
   - Safe, should assert, even without asserts worst case infinite loop
   - Optimal encoding size
An infinite loop isn't the best failure state, but we can catch this
with an assert, which we would need for jump=0 anyways. And this is only
a concern if there are other fs bugs. jump=0 is actually slightly worse
if asserts are disabled, since we'd end up reading the revision count as
garbage.
Adopting jump=itself gives us the optimal 4-byte encoding:
altbn w0 = 40 00 00 00
           '-+-' ^  ^
             '---|--|-- tag = altbn
                 '--|-- weight = 0
                    '-- jump = itself (branch - 0)
This requires tweaking the alt encoder a bit, to avoid relative encoding
jump=0s, but this is pretty cheap:
code stack
jump=jump: 34068 2864
jump=0: 34068 (+0.0%) 2864 (+0.0%)
jump=itself: 34072 (+0.0%) 2864 (+0.0%)
I thought we may need to also tweak the decoder, so later trunk copies
don't accidentally point to the old location, but humorously our pruning
kicks in redundantly to reset altbn's jump=itself on every trunk.
Note lfsr_rbyd_lookupnext was also rearranged a bit to make it easier to
assert on infinite loops and this also added some code. Probably just
due to compiler noise:
code stack
before: 34068 2864
after: 34076 (+0.0%) 2864 (+0.0%)
Also note that we still accept all of the above altbn encoding options.
This only affects encoding and dbg scripts.

faf8c4b641
Tweaked alt-tag encoding to match color/dir naming order
This is mainly to avoid mistakes caused by names/encodings disagreeing:
LFSR_TAG_ALT 0x4kkk v1cd kkkk -kkk kkkk
                    ^ ^^ '------+-----'
                    '-||--------|------- valid bit
                      '|--------|------- color
                       '--------|------- dir
                                '------- key
Notably, the LFSR_TAG_ALT() macro has already caused issues by being
both 1. ambiguous, and 2. not really type-checkable. It's easy to get
the order wrong and have things not break outright, just behave poorly.
It's really not great!
To be honest the exact order is a bit arbitrary, the color->dir naming
appeared by accident because I guess it felt more natural. Maybe because
of English's weird implicit adjective ordering? Maybe because of how
often conditions show up as the last part of the name in other
instruction sets?
At least one plus is that this moves the dir-bit next to the key. This
makes it so all of the condition information is encoded in the lowest
13-bits of the tag, which may lead to minor optimization tricks for
implementing flips and such.
Code changes:
code stack
before: 34080 2864
after: 34068 (-0.0%) 2864 (+0.0%)

8a646d5b8e
Added dbgtag.py for easy tag decoding on the command-line
Example:

$ ./scripts/dbgtag.py 0x3001
cksum 0x01

dbgtag.py inherits most of crc32c.py's decoding options. The most useful probably being -x/--hex:

$ ./scripts/dbgtag.py -x e1 00 01 8a 09
altbgt 0x100 w1 -1162

dbgtag.py also supports reading from a block device if either -b/--block-size or --off are provided. This is mainly for consistency with the other dbg*.py scripts:

$ ./scripts/dbgtag.py disk -b4096 0x2.1e4
bookmark w1 1

This should help when debugging and finding a raw tag/alt in some register. Manually decoding is just an unnecessary road bump when this happens.