This is mainly about aligning things; it was easy for the previous repr
to become a visual mess.
This also represents the config more like how we represent other tags,
since they've changed from a monolithic config block to separate
attributes.
This is a compromise between padding the tag repr correctly and parsing
speed.
If we don't have to traverse an rbyd (for, say, tree printing), we don't
want to since parsing rbyds can get quite slow when things get big
(remember this is a filesystem!). This makes tag padding a bit of a hard
sell.
Previously this was hardcoded to 22 characters, but with the new file
struct printing it quickly became apparent that this would be a
problematic limit:
12288-15711 block w3424 0x1a.0 3424 67 64 79 70 61 69 6e 71 gdypainq
It's interesting to note that this has only become an issue for large
trees, where the weight/size in the tag can be arbitrarily large.
Fortunately we already have the weight of the rbyd after fetch, so we
can use a heuristic similar to the id padding:
tag padding = 21 + nlog10(max(weight,1)+1)
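For illustration, a minimal sketch of this heuristic in C (nlog10 and
tag_padding are illustrative names; the actual logic lives in the Python
debug scripts):

    #include <stdint.h>

    // number of decimal digits in n
    static int nlog10(uint32_t n) {
        int digits = 1;
        while (n >= 10) {
            n /= 10;
            digits += 1;
        }
        return digits;
    }

    // 21 base characters plus room for the largest possible weight
    static int tag_padding(uint32_t weight) {
        return 21 + nlog10(((weight > 1) ? weight : 1) + 1);
    }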
---
Also dropped extra information with the -x/--device flag. It hasn't
really been useful and was implemented inconsistently. Maybe -x/--device
should just be dropped completely...
You can now pass -s/--structs to dbglfs.py to show any file data
structures:
$ ./scripts/dbglfs.py disk -B4096 -f -s -t
littlefs v2.0 0x{0,1}.9cf, rev 3, weight 0.256
{0000,0001}: -1.1 hello reg 128, trunk 0x0.993 128
0000.0993: .-> 0-15 shrubinlined w16 16 6b 75 72 65 65 67 73 63 kureegsc
.-+-> 16-31 shrubinlined w16 16 6b 65 6a 79 68 78 6f 77 kejyhxow
| .-> 32-47 shrubinlined w16 16 65 6f 66 75 76 61 6a 73 eofuvajs
.-+-+-> 48-63 shrubinlined w16 16 6e 74 73 66 67 61 74 6a ntsfgatj
| .-> 64-79 shrubinlined w16 16 70 63 76 79 6c 6e 72 66 pcvylnrf
| .-+-> 80-95 shrubinlined w16 16 70 69 73 64 76 70 6c 6f pisdvplo
| | .-> 96-111 shrubinlined w16 16 74 73 65 69 76 7a 69 6c tseivzil
+-+-+-> 112-127 shrubinlined w16 16 7a 79 70 61 77 72 79 79 zypawryy
This supports the same -b/-t/-i options found in dbgbtree.py, with the
one exception being -z/--struct-depth which is lowercase to avoid
conflict with the -Z/--depth used to indicate the filesystem tree depth.
I think this is a surprisingly reasonable way to show the inner
structure of files without clobbering the user's console with file
contents.
Don't worry, if clobbering is desired, -T/--no-truncate still dumps all
of the file content.
Though it's still up to the user to manually apply the sprout/shrub
overlay. That step is still complex enough to not implement in this
tool yet.
I ended up changing the name of lfsr_mtree_traversal_t -> lfsr_traversal_t,
since this behaves more like a filesystem-wide traversal than an mtree
traversal (it returns several typed objects, not mdirs like the other
mtree functions for one).
As a part of this changeset, lfsr_btraversal_t (was lfsr_btree_traversal_t)
and lfsr_traversal_t no longer return untyped lfsr_data_ts, but instead
return specialized lfsr_{b,t}info_t structs. We weren't even using
lfsr_data_t for its original purpose in lfsr_traversal_t.
Also changed lfsr_traversal_next -> lfsr_traversal_read; you may notice
at this point the changes are intended to make lfsr_traversal_t look
more like lfsr_dir_t for consistency.
---
Internally lfsr_traversal_t now uses a full state machine with its own
enum due to the complexity of traversing the filesystem incrementally.
Because creating diagrams is fun, here's the current full state machine,
though note it will need to be extended for any
parity-trees/free-trees/etc:
mrootanchor
|
v
mrootchain
.-' |
| v
| mtree ---> openedblock
'-. | ^ | ^
v v | v |
mdirblock openedbtree
| ^
v |
mdirbtree
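As a sketch, these states might map to an enum something like this
(names are illustrative; the actual enum may differ):

    // hypothetical states mirroring the diagram above
    enum lfsr_traversal_state {
        LFSR_TSTATE_MROOTANCHOR, // checking the fixed mroot anchor
        LFSR_TSTATE_MROOTCHAIN,  // following the chain of mroots
        LFSR_TSTATE_MTREE,       // traversing the mtree itself
        LFSR_TSTATE_MDIRBLOCK,   // yielding each mdir's blocks
        LFSR_TSTATE_MDIRBTREE,   // traversing btrees referenced by mdirs
        LFSR_TSTATE_OPENEDBLOCK, // yielding blocks of opened files
        LFSR_TSTATE_OPENEDBTREE, // traversing btrees of opened files
    };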
I'm not sure I'm happy with the current implementation, and eventually
it will need to be able to handle in-place repairs to the blocks it
sees, so this whole thing may need a rewrite.
But in the meantime, this passes the new clobber tests in test_alloc, so
it should be enough to prove the file implementation works (which
definitely is not fully tested yet; some bugs had to be fixed for the
new tests in test_alloc to pass).
---
Speaking of test_alloc.
The inherent cyclic dependency between files/dirs/alloc makes it a bit
hard to know what order to test these bits of functionality in.
Originally I was testing alloc first, because it seems you need to be
confident in your block allocator before you can start testing
higher-level data structures.
But I've gone ahead and reversed this order, testing alloc after
files/dirs. This is because of an interesting observation that if alloc
is broken, you can always increase the test device's size to some absurd
number (-DDISK_SIZE=16777216, for example) to kick the can down the
road.
Testing in this order allows alloc to use more high-level APIs and
focus on corner cases where the allocator's behavior requires subtlety
to be correct (e.g. ENOSPC).
Crystallization still needs dedicated testing, though the byte-level fuzz
tests were already causing blocks to crystallize. I noticed this because
of test failures, which are now fixed.
Note the block allocator currently doesn't understand file btrees. To
get the current tests passing requires -DDISK_SIZE=16777216 or greater.
It's probably also worth noting there's a lot that's not implemented
yet! Data checksums and write validation for one. Also ecksums. And we
should probably have some sort of special handling for linear writes
(the most common case) so they don't end up with a bunch of extra
crystallizing writes.
Also the fact that btrees can become DAGs now is an oversight and a bit
concerning. Will that work with a closed allocator? Block parity?
So now instead of needing:
./scripts/test.py ./runners/test_runner test_dtree
You can just do:
./scripts/test.py test_dtree
Or with an explicit path:
./scripts/test.py -R./runners/test_runner test_dtree
This makes it easier to run the script manually. And, while there may be
some hiccups with the implicit relative path, I think in general this will
make the test/bench scripts easier to use.
There was already an implicit runner path, though only if the test suite
was completely omitted. I'm not sure that would ever have actually
been useful...
---
Also increased the permutation field size in --list-*, since I noticed it
was overflowing.
Previously our lower/upper bounds were initialized to -1..weight. This
made a lot of the math unintuitive and confusing, and it's not really
necessary to support -1 rids (-1 rids arise naturally in order-statistic
trees that can have weight=0).
The tweak here is to use lower/upper bounds initialized to 0..weight,
which makes the math behave as expected. -1 rids naturally arise from
rid = upper-1.
- Added shrub tags to tagrepr
- Modified dbgrbyd.py to use last non-shrub trunk by default
- Tweaked dbgrbyd's log mode to find maximum seen weight for id padding
The main improvement is moving the special inlined-file compaction logic
up into lfsr_mdir_compact__. We only need this logic for files stored in
mdirs, and thanks to its recursive nature, we weren't getting any
benefit from handling this at a lower level anyways.
This is a nice logical restructuring that probably saves a bit of code
cost in the end.
Another significant improvement is moving the staging copy of the
inlined tree's state up into the file struct itself. This solves the
problem of needing N copies of temporary inlined state when you have N
open files.
It also provides a central place to stage changes when compacting
inlined trees, which happens across several different places in the mdir
commit logic. Though some may see this as more a hack than a feature.
Also noteworthy, but minor: these changes required an additional
opened-mdir linked-list to know when the mdir is a file and may contain
an inlined tree.
And by working, I mean you can create inlined trees, just don't
compact/split/move/etc anything. But this does outline the path files
take when writing buffers into inlined trees.
"Inlined trees" in littlefs are entire small rbyd trees embedded as
secondary trees in an mdir's main rbyd tree. When fetching, we can
indicate if a given trunk belongs to the main tree or secondary tree by
setting one of the unused mode bits in the trunk's tag, now called the
"deferred" bit. This bit doesn't need to be included in the alt's "key"
field, so there's no issue with it conflicting with the alt's mode bits.
This requires a bit of tweaking in lfsr_rbyd_fetch, since it needs to fall
back to the previous trunk if it discovers the most recent trunk belongs
to an inlined tree. But as a benefit we can leverage the full power of
rbyds in inlined files, including holes, partial updates, etc.
One downside is it looks like these inlined trees may involve more work
in maintaining their state correctly, since they need to be sort of
"brought along" when mdirs are compacted, even if they don't actually
have a reference in the mdir yet. But the sheer amount of flexibility
this gives inlined files may make this overhead worth it.
I had never noticed xxd has no header until comparing its output against
dbgblock.py. Turns out these headers aren't really all that useful, and
even sometimes wrong in dbglfs.py.
Now, instead of storing a single contiguous block of config data, config
is stored as tagged metadata like any other attribute.
This allows more flexibility towards adding/removing config in the
future, without cluttering up the config with deprecated entries (see
ATA's "IDENTIFY DEVICE" response).
Most of the config entries are single leb128 limits on various integer
types, with the exception of the magic string and version (major/minor
pair).
---
Note this also includes some semantic changes to the config:
- Limits are stored as size-1. This avoids issues with integer overflow
at extreme ranges.
This was also adopted for block size (block limit) and block count
(disk limit). This deviation between on-disk config and user-facing
config risks confusion, but allows the potential for the full 2^31 range
for these values.
- The default cksum type, crc32c, has been changed to 0.
Originally this was 2 to allow the type to map to the crc width for
crc8, crc16, crc32c, crc64, etc. But dropping this idea and numbering
checksums as they are implemented simplifies things.
May come back to this.
- Storing these configs as attributes opens up the option of on-disk
defaults when configs are missing.
I'm being a bit conservative with this one, as it's not clear to me if
we should prefer default configs (less code/storage, risk of untested
config parsing) or prefer explicit on-disk configs.
Currently the following have defaults since they seem the most obvious
to me:
- cksum type => defaults to crc32c
- redund type => defaults to parity (TODO, should this default to
no redund?)
- utag_limit => defaults to 0x7f (no special tag decoding)
- uattr_limit => defaults to block_limit (implicit)
- mbits -> mleaf_bits
- mlimit -> mleaf_limit
- mweight -> mleaf_weight
- lfsr_mridmask -> lfsr_midrmask
- lfsr_mbidmask -> lfsr_midbmask
This is a bit tricky to name, since we want to clarify it's not the
mtree limit and not the mdir's actual rbyd weight. But this also risks
confusion around the difference between mdirs/mleaves (mdirs are
mtree's leaves).
This should be stored in the superconfig, and we should use it during
mount instead of rederiving it from the block_size (TODO).
Note that this stores the "mlimit", (1 << mbits)-1, not the mbits
directly. littlefs will probably always be limited to powers-of-two for
this, since mbits is fairly arbitrary, but storing the expanded value
allows for non-powers-of-two _just in case_.
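For illustration, a minimal sketch of this relationship (function names
here are made up):

    #include <stdint.h>

    // the on-disk value is the expanded limit, not mbits itself
    static inline uint32_t mleaf_limit_from_mbits(uint32_t mbits) {
        return ((uint32_t)1 << mbits) - 1;
    }

    // recovering mbits from a power-of-two-derived limit, say during mount
    static inline uint32_t mbits_from_mleaf_limit(uint32_t mleaf_limit) {
        uint32_t mbits = 0;
        while (mleaf_limit >> mbits) {
            mbits += 1;
        }
        return mbits;
    }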
This only matters for developers, not users, but it still helps a lot to
get debug representations right.
Since the exact mid encoding depends on the block_size in an unintuitive
manner, it's tricky to render in a debug-friendly way that is useful
both with and without tools.
Previously, I avoided shifting the bid representation, since this would
be closer to the value in the device, but this hides the actual
structure of the mtree. Now the bid is shifted, showing the underlying
mtree/mdir structure, at the cost of needing to know the number of mbits
to encode the mid back into an integer.
So for example, on a device with 4KiB blocks, or 8 mbits:
mid=1
mid=258
mid=515
Becomes:
mid=0.1
mid=1.2
mid=2.3
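A minimal sketch of this rendering, assuming mbits=8 (names are
illustrative):

    #include <inttypes.h>
    #include <stdint.h>
    #include <stdio.h>

    #define MBITS 8 // 4KiB blocks in this example

    // render a raw mid as bid.rid, exposing the mtree/mdir structure
    static void print_mid(int32_t mid) {
        printf("mid=%"PRId32".%"PRId32"\n",
                mid >> MBITS,               // bid, which mdir in the mtree
                mid & ((1 << MBITS) - 1));  // rid, which entry in the mdir
    }

    // print_mid(258) -> mid=1.2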
This continues to make the mbits a more fundamental part of littlefs,
but that's probably just how that's going to be.
This format for mids is a compromise in readability vs debugability.
For example, if our mbid weight is 256 (4KiB blocks), the 19th entry
in the second mdir would be the raw integer 275. With this mid format,
we would print it as 256.19.
The idea is to make it easy to see it's the 19th entry in the mdir while
still making it relatively easy to see that 256.19 and 275 are
equivalent when debugging.
---
The scripts also took some tweaking due to the mid change. Tried to keep
the names consistent, but I don't think it's worthwhile to change too
much of the scripts while they are working.
There is a bit of redundancy here, as we already know the weights of
btree's inner-branches from their parents. But in theory sharing the
same encoding for both the top level btree reference and inner-branches
should offer more chance for deduplication and hopefully less code.
This also moves some members around in the btree encoding so that the
redund blocks are at the beginning. This _might_ simplify decoding of
the variable-length redund blocks at some point.
Current btree encoding:
.----+----+----+----.
| blocks ... redund leb128s (1-20 bytes)
: :
|----+----+----+----|
| trunk ... 1 leb128 (1-5 bytes)
|----+----+----+----|
| weight ... 1 leb128 (1-5 bytes)
|----+----+----+----|
| cksum | 1 le32 (4 bytes)
'----+----+----+----'
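To make the layout concrete, here's a rough decoding sketch, assuming the
number of blocks is already known from context (helper names are
illustrative):

    #include <stddef.h>
    #include <stdint.h>

    // minimal leb128 decoder, returns the number of bytes consumed
    static size_t fromleb128(const uint8_t *p, uint32_t *v) {
        *v = 0;
        for (size_t i = 0; i < 5; i++) {
            *v |= (uint32_t)(p[i] & 0x7f) << (7*i);
            if (!(p[i] & 0x80)) {
                return i+1;
            }
        }
        return 5;
    }

    // decode the btree struct above, redund blocks at the beginning
    static void decode_btree(const uint8_t *p, uint32_t nblocks,
            uint32_t *blocks, uint32_t *trunk, uint32_t *weight,
            uint32_t *cksum) {
        for (uint32_t i = 0; i < nblocks; i++) {
            p += fromleb128(p, &blocks[i]);
        }
        p += fromleb128(p, trunk);
        p += fromleb128(p, weight);
        // trailing checksum is a fixed-width le32
        *cksum = ((uint32_t)p[0])
                | ((uint32_t)p[1] << 8)
                | ((uint32_t)p[2] << 16)
                | ((uint32_t)p[3] << 24);
    }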
This also partially reverts some tag name changes:
- BNAME -> BRANCH
- DMARK -> BOOKMARK
Test suites already had the ability to provide suite-level code via the
"code" attribute, but this was placed in the suite's generated source
file, making it inaccessible to internal tests.
This change allows suite code to be placed in the same place as internal
tests, via the "in" attribute, though this has some caveats:
1. Suite-level code generally declares helper functions in global scope.
We don't parse this code or anything, so name collisions between
helper functions across different test suites are up to the developer
to resolve.
2. Internal suite-level code has access to internal functions/variables/
etc; this means we can't place a copy in our suite's generated source
and expect it to compile. For this reason, internal suite-level code
is unavailable for non-internal tests in the suite.
This also means you only get to place internal suite-level code in a
single source file. Though this is not really an issue since littlefs
is basically a single file...
Struct tags, in littlefs, generally encode pointers to different on-disk
data structures. At this point, they've gotten a bit complex, with the
btree struct, for example, containing 1. a block address, 2. the trunk
offset, 3. the weight of the trunk, and 4. a checksum.
Also some future plans:
1. Block redundancy will make it so these pointers may have a variable
number of block addresses to contend with.
2. Different checksum types may make the checksum field itself variable
length, at least on larger builds of littlefs.
This may also happen if we support truncated checksums in littlefs
for storage saving reasons.
Having two variable sized fields becomes a bit of a pain. We can use the
encoded tag size to figure out the size of one of these fields, but not
both.
The change here makes it so the tag size now determines the checksum
size, requiring the redundancy amount to go somewhere else. This makes
it so checksums can be variably sized, and the explicit redundancy
amount avoids the need to parse the leb128s fully to know how many
blocks we're expecting.
But where to put the redundancy amount?
This commit carves out 2-bits from the struct tag to store the amount of
redundancy to allow up to 3 blocks of redundancy:
v0000011 0TTTTTrr
^--^---^-^----^-^- valid bit
'---|-|----|-|- 3-bit mode (0x0 for structs)
'-|----|-|- 4-bit suptype (0x3 for structs)
'----|-|- 0 bit (reserved for leb128)
'-|- 5-bit subtype
'- 2-bit redund
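A quick sketch of extracting these fields, with masks following the
diagram above (names are illustrative):

    #include <stdint.h>

    // 2-bit redund count, allows up to 3 blocks of redundancy
    static inline uint8_t tag_redund(uint16_t tag) {
        return tag & 0x3;
    }

    // 5-bit struct subtype
    static inline uint8_t tag_subtype(uint16_t tag) {
        return (tag >> 2) & 0x1f;
    }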
3 blocks may sound extremely limiting, but it's a common limit for
filesystems, 1. because you have to keep in mind each redundant block
adds that much more writing/reading overhead and 2. the fact
that 2^(2^n)-1 is always divisible by 3 makes >3 parity blocks much more
complicated mathematically.
Worst case, if we ever have >3 redundant blocks, we can create new
struct subtypes. Maybe adding extended struct types that prefix the
block addresses with a leb128 encoding the redundancy amount.
---
As a part of this, reorganized the on-disk btree and ecksum encodings to
put the checksum last.
Also split out the btree and inner btree branches as separate struct
types. The btree includes the weight, whereas the weight is implicit in
inner btree branches. This came about after realizing context-specific
prefixes are relatively easy to add thanks to the composability of our
parsers.
This led to some name collisions though:
- BRANCH -> BNAME
- BOOKMARK -> DMARK
This checksum is used to keep track of whether we have erased, and not yet
touched, the unused bytes trailing our current commit in the rbyd.
The working theory is that if any prog attempt is made, it will, most
likely, change the checksum of the contents, allowing littlefs to
determine if trailing erased-state is safe to use, even under powerloss.
littlefs can also perturb future data by a single bit, to force this
checksum to always be invalidated during normal operation.
The original name, "forward erased-state checksums (fcksum)", came from the
idea that the checksum "looks forward" into the next commit.
But after using them for a bit, I think the name is unnecessarily
confusing. It, uh, also looks a lot like a swear word. I think
shortening the name to just "erased-state checksums (ecksum)", even
though the previous name is already in use in a release, is reasonable.
---
It's probably hard to believe but the name change from fcrc -> ecrc
really was unrelated to the crc -> cksum change. But boy is it
convenient for avoiding an awkward name. A lot of these name changes
involved sed scripts, so I didn't notice how awkward fcksum would be to
use until writing this commit message.
The reason for this is to move away from the idea that littlefs is
strictly bound to CRCs and make the code more welcoming to other
checksum types, such as SHA256, etc.
Of course, changing the name doesn't really do anything. littlefs
actually _is_ strictly bound to CRCs in a couple ways that other
filesystems aren't. These would need to have workarounds for other
checksum types:
- We leverage the parity-preserving nature of (some) CRCs to not have
to also calculate the parity of metadata in rbyd commits.
- We leverage the linearity of CRCs to retroactively flip the
perturb bit in the cksum tag without needing to recalculate the
checksum. Though the fact we need to do this is because of how we
use parity above, so this may just not be needed for non-CRC
checksums. (A small demonstration of this linearity follows this list.)
- The plans for global-CRCs (not yet implemented) rely heavily on the
mathematical properties of CRC polynomials. This doesn't mean
global-CRCs can't work with other checksums, you would just need to
find a different type of polynomial.
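As a small standalone demonstration of the linearity property mentioned
above (this is just the math, not littlefs code):

    #include <assert.h>
    #include <stddef.h>
    #include <stdint.h>
    #include <string.h>

    // bitwise crc32c, for demonstration only
    static uint32_t crc32c(const void *buf, size_t size) {
        const uint8_t *p = buf;
        uint32_t crc = 0xffffffff;
        while (size--) {
            crc ^= *p++;
            for (int i = 0; i < 8; i++) {
                crc = (crc >> 1) ^ ((crc & 1) ? 0x82f63b78 : 0);
            }
        }
        return ~crc;
    }

    int main(void) {
        uint8_t msg[8] = "rbyd <3"; // 7 chars + NUL
        uint8_t perturbed[8];
        memcpy(perturbed, msg, sizeof(msg));
        perturbed[3] ^= 0x01;       // flip a single "perturb" bit

        // the same single-bit delta over an otherwise zeroed message
        uint8_t delta[8] = {0};
        delta[3] ^= 0x01;
        uint8_t zeros[8] = {0};

        // linearity: crc(a^b) == crc(a) ^ crc(b) ^ crc(0), so the flipped
        // checksum can be derived without re-reading the message
        assert(crc32c(perturbed, 8)
                == (crc32c(msg, 8) ^ crc32c(delta, 8) ^ crc32c(zeros, 8)));
        return 0;
    }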
Originally it made sense to name the rbyd ids, well, ids, at least in
the internals of the rbyd functions. But this doesn't work well outside
of the rbyd code, where littlefs has to juggle several different id
types with different purposes:
- rid => rbyd-id, 31-bit index into an rbyd
- bid => btree-id, 31-bit index into a btree
- mid => mdir-id, 15-bit+15-bit index into the mtree
- did => directory-id, 31-bit unique identifier for directories
Even though context makes it clear which type of id an id refers to in the
internals, updating the name to rid makes it clearer that these are the
same type of id when looking at code both inside and outside the rbyd
functions.
The previous state machine would happily pick up random names if the
struct had no name of its own. This was picking up typedefs of random
structs and making things really confusing.
Now the rule is that unnamed structs are not printed. Unnamed structs
are usually implementation details so their size is not really useful.
Also made the parsing state machine for objdump outputs more resilient
to these sorts of issues.
Also changed structs.py to also report unions if they have a name.
- test_dtree - Pure directory creation/deletion/move functionality
testing. This ends up testing the core of littlefs file entry
manipulation, since directories is all we need for that.
- test_dseek - Tests more of the corner cases specific to directory
iteration and seeking. This involves an annoying amount of
interactions with concurrent updates to the filesystem that are
complicated to test for.
Also generally renaming the "fstree" concept to "dtree". This only
changes dbglfs.py as far as I'm aware. It's useful to have a name for
this thing and "directory tree" fits a bit better than "filesystem tree"
which could be ambiguous when we also have the "metadata tree" as a
different concept.
The previous system of relying on test name prefixes for ordering was
simple, but organizing tests by dependencies and topologically sorting
during compilation is 1. more flexible and 2. simplifies test names,
which get typed a lot.
Note these are not "hard" dependencies, each test suite should work fine
in isolation. These "after" dependencies just hint an ordering when all
tests are run.
As such, it's worth noting the tests should NOT error if a dependency is
missing. This unfortunately makes it a bit hard to catch typos, but
allows faster compilation of a subset of tests.
---
To make this work, the way tests are linked has changed from using a
custom linker section (fun linker magic!) to a weakly linked array
appended to every source file (also fun linker magic!).
At least with this method test.py has strict control over the test
ordering, and doesn't depend on 1. the order in which the linker merges
sections, and 2. the order tests are passed to test.py. I didn't realize
the previous system was so fragile.
With a bit of color, this is very useful for debugging and finding
incorrect dstart/grm situations.
This was used to find and fix the bugs in the previous commit.
Ugh. I overlooked a weird corner case in rename's behavior that requires
changes to the grm to support.
POSIX's rename, which lfsr_rename is trying to match, supports renaming
files over existing files, effectively removing the previous file during
the rename.
This is supported, even if the files are directories, but with the
additional requirement that the previous directory is empty (matching
the behavior of lfsr_remove).
This creates a weird situation for littlefs. In order to remove
directories in littlefs, we need to atomically remove both the dstart
entry that reserves the directory's did and the directory's entry in its
parent. This is made possible by using the grm to mark one entry as
pending removed while removing the other.
But in order to rename atomically, we need to use the grm to mark the
source of the rename as removed while creating/replacing the destination
of the rename.
So we end up needing two grms simultaneously.
This is extra annoying because the niche case of renaming a directory
over another empty directory is the only case where we need two grms,
but this requirement almost doubles the grm size both in-ram and
reserved in every mdir, from 11 bytes to 21 bytes, and increases the
lfs_t size by 28 bytes.
---
Anyways, this commit extends the grm to support up to two pending removes.
Fortunately the implementation was simple since we already have a type
field that can be extended, and grm operations just needed to be
changed from if statements to for loops.
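A rough sketch of the shape of this change (the struct and names are
illustrative, not the actual grm encoding):

    #include <stdint.h>

    // hypothetical grm state with up to two pending removes
    typedef struct grm {
        uint8_t count;   // 0, 1, or 2 pending removes, from the type field
        int32_t mids[2]; // mids pending removal
    } grm_t;

    // operations that were if statements become loops, e.g. adjusting
    // pending removes after mids shift underneath them
    static void grm_shift(grm_t *grm, int32_t mid) {
        for (uint8_t i = 0; i < grm->count; i++) {
            if (grm->mids[i] > mid) {
                grm->mids[i] -= 1;
            }
        }
    }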
Instead of iterating over a number of seeds in the test itself, the
seeds are now permuted as a part of normal test defines.
This lets each seed take advantage of other test features, mainly the
ability to test powerlosses heuristically.
This is probably how it should have been done in the first place, but
the permutation tests can't do this since the number of permutations
changes as the size of the test input changes. The test define system
can't handle that very well.
The tradeoffs here are:
- We can't do cross-fuzz checks, such as the balance checks in the rbyd
tests, though those really should be moved to benchmarks anyways.
- The large number of cheap fuzz permutations skews the total
permutation count, though I'm not sure this matters.
before: 3083 permutations (-Gnor)
after: 409893 permutations (-Gnor)
To help with this, added TEST_PL, which is set to true when powerloss
testing. This way tests can check for stronger conditions (no EEXIST)
when not powerloss testing.
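A hypothetical test fragment showing the idea (lfsr_mkdir and the
surrounding harness assumed):

    // under powerloss testing, a mkdir may have already taken effect
    // before the powerloss, so EEXIST must be tolerated
    int err = lfsr_mkdir(&lfs, "dir");
    if (TEST_PL) {
        assert(!err || err == LFS_ERR_EXIST);
    } else {
        assert(!err);
    }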
With TEST_PL, there's really no reason every test in t5_dirs shouldn't
be reentrant, and this gives us a huge improvement of test coverage very
cheaply.
---
The increased test coverage caught a bug, which is that gstate wasn't
being consumed properly when mtree uninlining. Humorously, this went
unnoticed because the most common form of mtree uninlining, mdir splitting,
ended up incorrectly consuming the gstate twice, which canceled itself
out since the consume operation is basically just xor.
Also added support for printing dstarts to dbglfs.py, to help debugging.
The grm bugs were mostly issues with:
1. Not maintaining the on-disk grm state in RAM (lfs->grm) correctly,
this needs to be updated correctly after every commit or littlefs
gets confused.
2. lfsr_fs_fixgrm got a bit confused when it was missed while changing
the no-rm encoding from 0 to -2. Added some inline functions to help
avoid this in the future.
3. Leaking information due to mixing fixed sized and variable sized
encodings of the grm delta in places. This is a bit tricky to write
an assert for as we don't parse the full grm when we see a no-rm grm.
This makes it easier to read the output, at a cost of these scripts not
terminating if the underlying call structure contains loops.
Previously these scripts would not terminate, but at least output the
call tree as they visited each function. This was hard to read, and wasn't
really that useful? If you hit a case with infinite recursion, you can
limit the output size explicitly with -Z.
Note this also drops --tree in stack.py. Since we get more readable
output, this flag is less useful. This simplifies the script a bit.
- Changed how names are rendered in dbgbtree.py/dbgmtree.py to be
consistent with non-names. The special rendering isn't really worth it
now that names aren't just ascii/utf8.
- Changed the ordering of raw/device/human rendering of btree entries to
be more consistent with rendering of other entries (don't attempt to
group btree entries).
- Changed dbgmtree.py header to show information about the mtree.
This implementation is in theory correct, but of course, being untested,
who knows?
Though this does come with remounting added to all of the directory
tests. This effectively tests that all of the directory creation tests
we have so far maintain grm=0 after each unmount-mount cycle. Which is
valuable.
This has, in theory, global-removes (grm) being written out as a part
of directory creation, but they aren't used in any form and so may not
be being written correctly.
But it did require quite a bit of problem solving to get to this point
(the interactions between mtree splits and grms are really annoying), so
it's worth a commit.
This now makes it possible to create directories in the new system.
The new system now uses a single global "mtree" to store all metadata
entries in the filesystem. In this system, a directory is simply a range
of metadata entries. This has a number of benefits, but does come with
its own problems:
1. We need to indicate which directory each file belongs to. To do this
the file's name entry has been changed to a tuple of leb128-encoded
directory-id + actual file name:
01 66 69 6c 65 2e 74 78 74 .file.txt
^ '----------+----------'
'------------|------------ leb128 directory-id
'------------ ascii/utf8 name
If we include the directory-id as part of filename comparison, files
should naturally be next to other files in the same directory.
2. We need a way to allocate directory-ids for new directories. This turns
out to be a bit more tricky than I expected.
We can't use any mid/bid/rid inherent to the mtree, because these
change on any file creation/deletion. And since we commit the did
into the tree, that's not acceptable.
Initially I thought you could just find the largest did and increment,
but this gives you no way to reclaim deleted dids. And sure, deleted
dids have no storage consumption, but eventually you will overflow
the did integer. Since this can suddenly happen in a filesystem
that's been in a steady-state for years, that's pretty unacceptable.
One solution is to do a simple linear search over the mtree for an
unused did. But with a runtime of O(n^2 log(n)), this raises
performance concerns.
Sidenote: It's interesting to note that the Linux kernel's allocation
of process-ids, a very similar problem, is surprisingly complex and
relies on a radix-tree of bitmaps (struct idr). This suggests I'm not
missing an obvious solution somewhere.
The solution I settled on here is to instead treat the set of dids as
a sort of hash table:
1. Hash the full directory path into a did.
2. Perform a linear search until we have no collision (see the sketch
after this list).
leb128(truncate28(crc32c("dir")))
.--------'
v
9e cd c8 30 66 69 6c 65 2e 74 78 74 ...0file.txt
'----+----' '----------+----------'
'-----------------|------------ leb128 directory-id
'------------ ascii/utf8 name
Worst case, this can still exhibit O(n^2 log(n)) performance when the
did space is nearly full. However that seems
unlikely to happen in practice, since we don't truncate our hashes,
unlike normal hash tables. An additional 32-bit word for each file
is a small price to pay for a low chance of collisions.
In the current implementation, I do truncate the hash to 28-bits.
Since we encode the hash with leb128, and hashes are statistically
random, this gives us better usage of the leb128 encoding. However
it does limit a 32-bit littlefs to 256 Mi directories.
Maybe this should be a configurable limit in the future.
But that highlights another benefit of this scheme. It's easy to
change in the future without disk changes.
3. We need a way to know if a directory-id is allocated, even if the
directory is empty.
For this we just introduce a new tag: LFSR_TAG_DSTART, which
is an empty file entry that indicates the directory at the given did
in the mtree is allocated.
To create/delete these atomically with the reference in our parent
directory, we can use the GRM system for atomic renames.
Note this isn't implemented yet.
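Roughly, the did allocation in 2 might look something like this (crc32c
and dstart_exists are stand-ins for the real internals):

    #include <stdbool.h>
    #include <stddef.h>
    #include <stdint.h>
    #include <string.h>

    // stand-ins for the real implementations
    extern uint32_t crc32c(const void *buf, size_t size);
    extern bool dstart_exists(uint32_t did);

    static uint32_t alloc_did(const char *path) {
        // 1. hash the full directory path into a did, truncated to
        //    28 bits for better leb128 usage
        uint32_t did = crc32c(path, strlen(path)) & 0x0fffffff;

        // 2. linear probe until we find an unused did
        while (dstart_exists(did)) {
            did = (did + 1) & 0x0fffffff;
        }

        return did;
    }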
This is also the first time we finally get around to testing all of the
dname lookup functions, so this did find a few bugs, mostly around
reporting the root correctly.
Now that tree rebalancing is implemented and needed a null terminator
anyways, I think it's clear that the benefit of the alt-always pointers
as trunk terminator has pretty limited value.
Now a null or other tag is needed for every trunk, which simplifies
checks for end-of-trunk.
Alt-always tags are still emitted for deletes, etc, but there their
behavior is implicit, so no special checks are needed. Alt-always tags
are naturally cleaned up as a part of rbyd pruning.
This isn't actually for performance reasons, but to reduce storage
overhead of the rbyd metadata tree, which was showing signs of being
problematic for small block sizes.
Originally, the plan for compaction was to rely on the self-balancing
rbyd append algorithm and simply append each tag to a new tree.
Unfortunately, since each append requires a rewrite of the trunk
(current search path), this introduces ~n*log(n) alts but only uses ~n alts
for the final tree. This really starts to put pressure on small blocks,
where the exponential-ness of the log doesn't kick in and overhead
limits are already tight.
Measuring lfsr_mdir_commit code size, this shows a ~556 byte cost on
thumb: 16416 -> 16972 (+3.4%). Though there are still some optimizations
on the table, this implementation needs a cleanup pass.
           alt overhead      code cost
rebalance: <= 28*n           16972
append:    <= 24*n*log(n)    16416
Note these all assume worst case alt overhead, but we _need_ to assume
worst case for our rbyd estimations, or else the filesystem can get
stuck in unrecoverable compaction states.
Because of the code cost I'm not sure if rebalancing will stay, be
optional, or replace append-compaction completely yet.
Some implementation notes:
- Most tree balancing algorithms rely on true recursion, I suspect
recursion may be a hard requirement in general, but it's hard to find
bounded-ram algorithms.
This solution gets around the ram requirement by leveraging the fact
that our tags exist in a log to build up each layer in the tree
tail-recursively. It's interesting to note that this is a special
case of having little ram but lots of storage.
- Humorously this shouldn't result in a performance improvement. Rbyd
trees result in a worst case 2*log(n) height, and rebalancing gives us
a perfect worst case log(n) height, but, since we need an additional
alt pointer for each node in our tree, things bump back up to 2*log(n).
- Originally the plan was to terminate each node with an alt-always tag,
but during implementation I realized there was no easy way to get the
key that splits the children without awkward tree lookups. As a
workaround each node is terminated with an altle tag that contains the
key followed by an unreachable null tag. This is redundant information,
but makes the algorithm easier to implement.
Fortunately null tags use the smallest tag encoding, which isn't that
small, but that means this wastes at most 4*n bytes.
- Note this preserves the first-tag-always-ends-up-at-off=0x4 rule, which
is necessary for the littlefs magic to end up in a consistent place.
- I've dropped dropping vestigial names for now, which means vestigial
names can remain in btrees indefinitely. Need to revisit this.
This should have been done as a part of the earlier tag reencoding work,
since having the block at the end was what allowed us to move the
redund-count out of the tag encoding.
New encoding:
[-- 32-bit csum --]
[-- leb128 weight --]
[-- leb128 trunk --]
[-- leb128 block --]
Note that since our tags have an explicit size, we can store a variable
number of blocks. The plan is to use this to eventually store redundant
copies for error correction:
[-- 32-bit csum --]
[-- leb128 weight --]
[-- leb128 trunk --]
[-- leb128 block --] -.
[-- leb128 block --] +- n redundant blocks
[-- leb128 block --] |
... -'
This does have a significant tradeoff: we need to know the checksum size
to access the btree structure. This doesn't seem like a big deal, but
with the possibility of different checksum types may be an annoying
issue.
Note that FCRC was also flipped for consistency.
Wide tags are a happy accident that fell out of the realization that we
can view all subtypes of a given tag suptype as a range in our rbyd.
Combining this with how natural it is to operate on ranges in an rbyd
allows us to perform operations on an entire range of subtypes as though
it were a single tag.
- lookup wide tag => find the smallest tag with this tag's suptype, O(log(n))
- remove wide tag => remove all tags with this tag's suptype, O(log(n))
- append wide tag => remove all tags with this tag's suptype, and then
append our tag, O(log(n))
This is very useful for littlefs, where we've already been using tag
subtypes to hold extra type info, and have had to rely on awkward
alternatives such as deleting existing subtypes before writing our new
subtype.
For example, when committing file metadata (not yet implemented), we can
append a wide struct tag to update the metadata while also clearing out any
lingering struct tags from previous commits, all in one rbyd append
operation.
This uses another mode bit in-device to change the behavior of
lfsr_rbyd_commit, of which we have a couple:
vwgrtttt 0TTTTTTT
^^^^---^--------^- valid bit (currently unused, maybe errors?)
'||---|--------|- wide bit, ignores subtype (in-device)
'|---|--------|- grow bit, don't create new id (in-device)
'---|--------|- rm bit, remove this tag (in-device)
'--------|- 4-bit suptype
'- leb128 subtype
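For reference, the in-device mode bits as masks (hypothetical defines
following the diagram; actual names may differ):

    // in-device mode bits, following the diagram above
    #define LFSR_TAG_WIDE 0x4000 // ignore subtype, whole suptype range
    #define LFSR_TAG_GROW 0x2000 // don't create a new id
    #define LFSR_TAG_RM   0x1000 // remove this tag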
This helps with debugging and can avoid weird issues if a file btree
ever accidentally ends up attached to id -1 (due to fs bug).
Though a separate encoding isn't strictly necessary, maybe this should
be reverted at some point.
This replaces unr with null on disk, though note both the rm bit and unr
are used in-device still; they just don't get written to disk.
This removes the need for the rm bit on disk. Since we no longer need to
figure out what's been removed during fetch, we can save this bit for both
internal and future on-disk use.
Special handling of alta allows us to avoid emitting an unr tag (now null) if
the current trunk is truly unreachable. This is minor now, but important
for a theoretical rbyd rebalance operation (planned), which brings the
rbyd overhead down from ~3x to ~2x.
These changes give us two ways to terminate trunks without a tag:
1. With an alta, if the current trunk is unreachable:
altbgt 0x403 w0 0x7b
altbgt 0x402 w0 0x29
alta w0 0x4
2. With a null, if the current trunk is reachable, either for
code convenience or because emitting an alta is impossible (an empty
rbyd for example):
altbgt 0x403 w0 0x7b
altbgt 0x402 w0 0x29
altbgt 0x401 w0 0x4
null
Yet another tag encoding, but hopefully narrowing in on a good long term
design. This change trades a subtype bit for the ability to extend
subtypes indefinitely via leb128 in the future.
The immediate benefit is ~unlimited custom attributes, though I'm not
sure how to make this configurable yet. Extended custom attributes may
have a significant impact on alt tag sizes, so it may be worth
defaulting to only 8-bit custom attributes still.
Tag encoding:
vmmmtttt 0TTTTTTT 0wwwwwww 0sssssss
^--^---^--------^--------^--------^- valid bit
'---|--------|--------|--------|- 3-bit mode
'--------|--------|--------|- 4-bit suptype
'--------|--------|- leb128 subtype
'--------|- leb128 weight
'- leb128 size/jump
This limits subtypes to 7-bits, but this seems very reasonable at the
moment.
This also seems to limit custom attributes to 7-bits, but we can use two
separate suptypes to bring this back up to 8-bits. I was planning to do
this anyways to have separate "user-attributes" and "system-attributes",
so this actually fits in really well.