Karl Seguin
@openmymind.net.web.brid.gy
Programming blog exploring Zig, Elixir, Go, Testing, Design and Performance

[bridged from https://openmymind.net/ on the web: https://fed.brid.gy/web/openmymind.net ]
## Comparing Strings as Integers with @bitCast
In the last blog post, we looked at different ways to compare strings in Zig. A few posts back, we introduced Zig's `@bitCast`. As a quick recap, `@bitCast` lets us force a specific type onto a value. For example, the following prints 1067282596:

```zig
const std = @import("std");

pub fn main() !void {
    const f: f32 = 1.23;
    const n: u32 = @bitCast(f);
    std.debug.print("{d}\n", .{n});
}
```

What's happening here is that Zig represents the 32-bit float value of `1.23` as `[4]u8{164, 112, 157, 63}`. This is also how Zig represents the 32-bit unsigned integer value of `1067282596`. Data is just bytes; it's the type system - the compiler's knowledge of what data is what type - that controls what and how that data is manipulated.

It might seem like there's something special about bitcasting from a float to an integer; they're both numbers after all. But you can `@bitCast` between any two equivalently sized types. Can you guess what this prints?

```zig
const std = @import("std");

pub fn main() !void {
    const data = [_]u8{3, 0, 0, 0};
    const x: i32 = @bitCast(data);
    std.debug.print("{d}\n", .{x});
}
```

The answer is `3`. Think about the above snippet a bit more. We're taking an array of bytes and telling the compiler to treat it like an integer. If we made `data` equal to `[_]u8{'b', 'l', 'u', 'e'}`, it would still work (and print `1702194274`). We're slowly heading towards being able to compare strings as-if they were integers. If you're wondering why 3 is encoded as `[4]u8{3, 0, 0, 0}` and not `[4]u8{0, 0, 0, 3}`, I talked about binary encoding in my Learning TCP series.

From the last post, we could use multiple `std.mem.eql` or, more simply, `std.meta.stringToEnum` to complete the following method:

```zig
fn parseMethod(value: []const u8) ?Method {
    // ...
}

const Method = enum {
    get,
    put,
    post,
    head,
};
```

We can also use `@bitCast`. Let's take it step-by-step. The first thing we'll need to do is switch on `value.len`. This is necessary because the three-byte "GET" will need to be `@bitCast` to a `u24`, whereas the four-byte "POST" needs to be `@bitCast` to a `u32`:

```zig
fn parseMethod(value: []const u8) ?Method {
    switch (value.len) {
        3 => switch (@as(u24, @bitCast(value[0..3]))) {
            // TODO
            else => {},
        },
        4 => switch (@as(u32, @bitCast(value[0..4]))) {
            // TODO
            else => {},
        },
        else => {},
    }
    return null;
}
```

If you try to run this code, you'll get a compilation error: _cannot @bitCast from '*const [3]u8'_. `@bitCast` works on actual bits, but when we slice our `[]const u8` with a compile-time known range (`[0..3]`), we get a pointer to an array. We can't `@bitCast` a pointer; we can only `@bitCast` actual bits of data. For this to work, we need to dereference the pointer, i.e. use `value[0..3].*`. This will turn our `*const [3]u8` into a `const [3]u8`.

```zig
fn parseMethod(value: []const u8) ?Method {
    switch (value.len) {
        // changed: we now dereference the value (.*)
        3 => switch (@as(u24, @bitCast(value[0..3].*))) {
            // TODO
            else => {},
        },
        // changed: we now dereference the value (.*)
        4 => switch (@as(u32, @bitCast(value[0..4].*))) {
            // TODO
            else => {},
        },
        else => {},
    }
    return null;
}
```

Also, you might have noticed the `@as(u24, ...)` and `@as(u32, ...)`. `@bitCast`, like most of Zig's builtin functions, infers its return type. When we're assigning the result of a `@bitCast` to a variable of a known type, i.e. `const x: i32 = @bitCast(data);`, the return type of `i32` is inferred. In the above `switch`, we aren't assigning the result to a variable. We have to use `@as(u24, ...)` in order for `@bitCast` to know what it should be casting to (i.e. what its return type should be).
The last thing we need to do is fill our switch blocks. Hopefully it's obvious that we can't just do:

```zig
3 => switch (@as(u24, @bitCast(value[0..3].*))) {
    "GET" => return .get,
    "PUT" => return .put,
    else => {},
},
```

But you might be thinking that, while ugly, something like this might work:

```zig
3 => switch (@as(u24, @bitCast(value[0..3].*))) {
    @as(u24, @bitCast("GET".*)) => return .get,
    @as(u24, @bitCast("PUT".*)) => return .put,
    else => {},
},
```

Because `"GET"` and `"PUT"` are string literals, they're null terminated and of type `*const [3:0]u8`. When we dereference them, we get a `const [3:0]u8`. It's close, but it means that the value is 4 bytes (`[4]u8{'G', 'E', 'T', 0}`) and thus cannot be `@bitCast` into a `u24`. This is ugly, but it works:

```zig
fn parseMethod(value: []const u8) ?Method {
    switch (value.len) {
        3 => switch (@as(u24, @bitCast(value[0..3].*))) {
            @as(u24, @bitCast(@as([]const u8, "GET")[0..3].*)) => return .get,
            @as(u24, @bitCast(@as([]const u8, "PUT")[0..3].*)) => return .put,
            else => {},
        },
        4 => switch (@as(u32, @bitCast(value[0..4].*))) {
            @as(u32, @bitCast(@as([]const u8, "HEAD")[0..4].*)) => return .head,
            @as(u32, @bitCast(@as([]const u8, "POST")[0..4].*)) => return .post,
            else => {},
        },
        else => {},
    }
    return null;
}
```

That's a mouthful, so we can add a small function to help:

```zig
fn parseMethod(value: []const u8) ?Method {
    switch (value.len) {
        3 => switch (@as(u24, @bitCast(value[0..3].*))) {
            asUint(u24, "GET") => return .get,
            asUint(u24, "PUT") => return .put,
            else => {},
        },
        4 => switch (@as(u32, @bitCast(value[0..4].*))) {
            asUint(u32, "HEAD") => return .head,
            asUint(u32, "POST") => return .post,
            else => {},
        },
        else => {},
    }
    return null;
}

pub fn asUint(comptime T: type, comptime string: []const u8) T {
    return @bitCast(string[0..string.len].*);
}
```

Like the verbose version, the trick is to cast our null-terminated string literal into a string slice, `[]const u8`. By passing it through the `asUint` function, we get this without needing to add the explicit `@as([]const u8, ...)`.

There is a more advanced version of `asUint` which doesn't take the uint type parameter (`T`). If you think about it, the uint type can be inferred from the string's length:

```zig
pub fn asUint(comptime string: []const u8) @Type(.{ .int = .{
    // bits, not bytes, hence * 8
    .bits = string.len * 8,
    .signedness = .unsigned,
} }) {
    return @bitCast(string[0..string.len].*);
}
```

Which allows us to call it with a single parameter: `asUint("GET")`. This might be your first time seeing such a return type. The `@Type` builtin is the opposite of `@typeInfo`. The latter takes a type and returns information on it in the shape of a `std.builtin.Type` union, whereas `@Type` takes a `std.builtin.Type` and returns an actual usable type. One of these days I'll find the courage to blog about `std.builtin.Type`!

As a final note, some people dislike the look of this sort of return type and would rather encapsulate the logic in its own function. This is the same:

```zig
pub fn asUint(comptime string: []const u8) AsUintReturn(string) {
    return @bitCast(string[0..string.len].*);
}

// Remember that, in Zig, by convention, a function should be
// PascalCase if it returns a type (because types are PascalCase).
fn AsUintReturn(comptime string: []const u8) type {
    return @Type(.{ .int = .{
        // bits, not bytes, hence * 8
        .bits = string.len * 8,
        .signedness = .unsigned,
    } });
}
```
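If you want to convince yourself that `asUint` behaves as described, a test like this (my own sketch, not from the original post) compares it against a plain byte-level `@bitCast`:

```zig
const std = @import("std");

pub fn asUint(comptime string: []const u8) @Type(.{ .int = .{
    .bits = string.len * 8,
    .signedness = .unsigned,
} }) {
    return @bitCast(string[0..string.len].*);
}

test "asUint matches a byte-level @bitCast" {
    // on a little-endian machine, 'G' (71) ends up as the least significant byte
    const expected: u24 = @bitCast([3]u8{ 'G', 'E', 'T' });
    try std.testing.expectEqual(expected, asUint("GET"));
    try std.testing.expectEqual(@as(u32, @bitCast([4]u8{ 'P', 'O', 'S', 'T' })), asUint("POST"));
}
```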
### Conclusion

Of the three approaches, this is the least readable and least approachable. Is it worth it? It depends on your input and the values you're comparing against. In my benchmarks, using `@bitCast` performs roughly the same as `std.meta.stringToEnum`. But there are some cases where `@bitCast` can outperform `std.meta.stringToEnum` by as much as 50%. Perhaps that's the real value of this approach: the performance is less dependent on the input or the values being matched against.
## GetOrPut With String Keys
I've previously blogged about how much I like Zig's `getOrPut` hashmap method. As a brief recap, we can visualize Zig's hashmap as two arrays:

```
keys:        values:
--------     --------
| Paul |     | 1234 |    @mod(hash("Paul"), 5) == 0
--------     --------
|      |     |      |
--------     --------
|      |     |      |
--------     --------
| Goku |     | 9001 |    @mod(hash("Goku"), 5) == 3
--------     --------
|      |     |      |
--------     --------
```

When we call `get("Paul")`, we could think of this simplified implementation:

```zig
fn get(map: *Self, key: K) ?V {
    const index = map.getIndexOf(key) orelse return null;
    return map.values[index];
}
```

And, when we call `getPtr("Paul")`, we'd have this implementation:

```zig
fn getPtr(map: *Self, key: K) ?*V {
    const index = map.getIndexOf(key) orelse return null;
    // notice the added '&'
    // we're taking the address of the array index
    return &map.values[index];
}
```

By taking the address of the value directly from the hashmap's array, we avoid copying the entire value. That can have performance implications (though not for the integer value we're using here). It also allows us to directly manipulate that slot of the array:

```zig
const value = map.getPtr("Paul") orelse return;
value.* = 10;
```

This is a powerful feature, but a dangerous one. If the underlying array changes, as can happen when items are added to the map, `value` would become invalid. So, while `getPtr` is useful, it requires mindfulness: try to minimize the scope of such references. Currently, Zig's HashMap doesn't shrink when items are removed, so, for now, removing items doesn't invalidate any pointers into the hashmap. But expect that to change at some point.

### GetOrPut

`getOrPut` builds on the above concept. It returns a pointer to the value **and** the key, as well as creating the entry in the hashmap if necessary. For example, given that we already have an entry for "Paul", if we call `map.getOrPut("Paul")`, we'd get back a `value_ptr` that points to a slot in the hashmap's `values` array, as well as a `key_ptr` that points to a slot in the hashmap's `keys` array. If the requested key _doesn't_ exist, we get back the same two pointers, and it's our responsibility to set the value.

If I asked you to increment counters inside of a hashmap, without `getOrPut`, you'd end up with two hash lookups:

```go
// Go
count, exists := counters["hits"]
if !exists {
    counters["hits"] = 1
} else {
    counters["hits"] = count + 1
}
```

With `getOrPut`, it's a single hash lookup:

```zig
const gop = try counters.getOrPut("hits");
if (gop.found_existing) {
    gop.value_ptr.* += 1;
} else {
    gop.value_ptr.* = 1;
}
```

### getOrPut With String Keys

It seems trivial, but the most important thing to understand about `getOrPut` is that it will set the key for you if the entry has to be created. In our last example, notice that even when `gop.found_existing == false`, we never set `key_ptr` - `getOrPut` automatically sets it to the key we pass in, i.e. `"hits"`. If we were to put a breakpoint after `getOrPut` returns but before we set the value, we'd see that our two arrays look something like:

```
keys:        values:
--------     --------
|      |     |      |
--------     --------
| hits |     | ???? |
--------     --------
|      |     |      |
--------     --------
```

Where the entry in the `keys` array is set, but the corresponding entry in `values` is left undefined. You'll note that `getOrPut` doesn't take a value. I assume this is because, in some cases, the value might be expensive to derive, so the current API lets us avoid calculating it when `gop.found_existing == true`. (A complete, runnable version of this counter is sketched below.)
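To see `getOrPut` in action end-to-end, here's a minimal runnable sketch of mine (not from the post). It uses string literals as keys, which sidesteps the ownership issue discussed next, since literals live for the life of the program:

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    var counters: std.StringHashMapUnmanaged(u32) = .{};
    defer counters.deinit(allocator);

    for ([_][]const u8{ "hits", "misses", "hits" }) |name| {
        // one hash lookup per iteration, whether or not the entry exists
        const gop = try counters.getOrPut(allocator, name);
        if (gop.found_existing) {
            gop.value_ptr.* += 1;
        } else {
            gop.value_ptr.* = 1;
        }
    }

    std.debug.print("hits: {d}\n", .{counters.get("hits").?}); // hits: 2
}
```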
This is important for keys that need to be owned by the hashmap. Most commonly that means strings, but it applies to any other key which we'll "manage". Taking a step back: if we wanted to track hits in a hashmap and, most likely, wanted the lifetime of the keys to be tied to the hashmap, we'd do something like:

```zig
fn register(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void {
    const owned = try allocator.dupe(u8, name);
    try map.put(owned, 0);
}
```

Creating our "owned" copy of `name` frees the caller from having to maintain `name` beyond the call to `register`. Now, if this key is removed, or the entire map is cleaned up, we need to free the keys. That's why I like the name "owned": it means the hashmap "owns" the key (i.e. is responsible for freeing it):

```zig
var it = map.keyIterator();
while (it.next()) |key_ptr| {
    allocator.free(key_ptr.*);
}
map.deinit(allocator);
```

The interaction between key ownership and `getOrPut` is worth thinking about. If we try to merge this ownership idea with our incrementing counter code, we might try:

```zig
fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void {
    const owned = try allocator.dupe(u8, name);
    const gop = try map.getOrPut(owned);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        gop.value_ptr.* = 1;
    }
}
```

But this code has a potential memory leak - can you spot it? (See Appendix A for a complete runnable example.) When `gop.found_existing == true`, `owned` is never used and never freed. One bad option would be to free `owned` when the entry already exists:

```zig
fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void {
    const owned = try allocator.dupe(u8, name);
    const gop = try map.getOrPut(owned);
    if (gop.found_existing) {
        // This line was added. But this is a bad solution
        allocator.free(owned);
        gop.value_ptr.* += 1;
    } else {
        gop.value_ptr.* = 1;
    }
}
```

It works, but we needlessly `dupe` `name` if the entry already exists. Rather than prematurely duping the key in case the entry doesn't exist, we want to delay our `dupe` until we know it's needed. Here's a better option:

```zig
fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void {
    // we use `name` for the lookup
    const gop = try map.getOrPut(name);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        // this line was added
        gop.key_ptr.* = try allocator.dupe(u8, name);
        gop.value_ptr.* = 1;
    }
}
```

It might seem reckless to pass `name` into `getOrPut`. We need the key to remain valid as long as the map entry exists. Aren't we undermining that requirement? Let's walk through the code. When `hit` is called for a new `name`, `gop.found_existing` will be false. `getOrPut` will insert `name` in our `keys` array. This is bad because we have no guarantee that `name` will be valid for as long as we need it to be. But the problem is immediately remedied when we overwrite `key_ptr.*`. On subsequent calls for an existing `name`, when `gop.found_existing == true`, the `name` is only used as a lookup. It's no different than doing a `getPtr`; `name` only has to be valid for the call to `getOrPut` because `getOrPut` doesn't keep a reference to it when an existing entry is found.

### Conclusion

This post was a long way to say: don't be afraid to write to `key_ptr.*`. Of course you can screw up your map this way. Consider this example:

```zig
fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void {
    // we use `name` for the lookup
    const gop = try map.getOrPut(name);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        // what's this?
gop.key_ptr.* = "HELLO"; gop.value_ptr.* = 1; } } Because the key is used to organize the map - find where items go and where they are, jamming random keys where they don't belong is going to cause issues. But it can also be used correctly and safely, as long as you understand the details. ### Appendix A - Memory Leak This code `should` report a memory leak. const std = @import("std"); const Allocator = std.mem.Allocator; pub fn main() !void { var gpa = std.heap.GeneralPurposeAllocator(.{}){}; const allocator = gpa.allocator(); defer _ = gpa.detectLeaks(); // I'm using the Unmanaged variant because the Managed ones are likely to // be removed (which I think is a mistake). Using Unmanaged makes this // snippet more future-proof. I explain unmanaged here: // https://www.openmymind.net/Zigs-HashMap-Part-1/#Unmanaged var map: std.StringHashMapUnmanaged(u32) = .{}; try hit(allocator, ↦, "teg"); try hit(allocator, ↦, "teg"); var it = map.keyIterator(); while (it.next()) |key_ptr| { allocator.free(key_ptr.*); } map.deinit(allocator); } fn hit(allocator: Allocator, map: *std.StringHashMapUnmanaged(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); const gop = try map.getOrPut(allocator, owned); if (gop.found_existing) { gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } } Leave a comment
## Zig's dot star syntax (value.*)
Maybe I'm the only one, but it always takes my little brain a split second to understand what's happening whenever I see, or have to write, something like `value.* = .{...}`. If we take a step back, a variable is just a convenient name for an address on the stack. When this function executes:

```zig
fn isOver9000(power: i64) bool {
    return power > 9000;
}
```

Say, with a `power` of 593, we could visualize its stack as:

```
power ->  -------------
          |    593    |
          -------------
```

If we changed our function to take a pointer to an integer:

```zig
// i64 changed to *i64
fn isOver9000(power: *i64) bool {
    return power > 9000;
}
```

Our `power` argument would still be a label for a stack address, but instead of directly containing a number, the stack value would itself be an address. That's the _indirection_ of pointers:

```
power ->  -------------
          | 1182145c0 |------------------------
          -------------                        |
          .............   empty space          |
          .............   or other data        |
                                               |
          -------------                        |
          |    593    | <----------------------
          -------------
```

But this code doesn't work: it's trying to compare a `comptime_int` (`9000`) with an `*i64`. We need to make another change to the function:

```zig
fn isOver9000(power: *i64) bool {
    // power changed to power.*
    return power.* > 9000;
}
```

`power.*` is how we dereference a pointer. Dereferencing means to get the value pointed to by a pointer. From our above visualization, you could say that the `.*` follows the arrow to get the value, `593`. This same syntax works for writing as well. The following is valid:

```zig
fn isOver9000(power: *i64) bool {
    power.* = 9001;
    return true;
}
```

Like before, the dereferencing operator (`.*`) "follows" the pointer, but now that it's on the receiving end of an assignment, we write the value into the pointed-at memory.
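Before moving on to structs, here's a tiny, self-contained sketch of both directions of `.*` - reading and writing through a pointer. The `swap` helper is my own illustration, not something from the post:

```zig
const std = @import("std");

fn swap(a: *i64, b: *i64) void {
    const tmp = a.*; // dereference: copy the value `a` points at
    a.* = b.*; // write the value `b` points at into `a`'s target
    b.* = tmp;
}

pub fn main() !void {
    var x: i64 = 1;
    var y: i64 = 2;
    swap(&x, &y);
    std.debug.print("x={d} y={d}\n", .{ x, y }); // x=2 y=1
}
```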
This is all true for more complex types. Let's say we have a `User` struct with an `id` and a `name`:

```zig
const User = struct {
    id: i32,
    name: []const u8,
};

var user = User{ .id = 900, .name = "Teg" };
```

The `user` variable is a label for the location of the start of the user:

```
user ->  -------------
         |    900    |
         -------------
         |     3     |
         -------------
         | 3c9414e99 | -----------------------
         -------------                        |
         .............   empty space          |
         .............   or other data        |
                                              |
         -------------                        |
         |     T     | <----------------------
         -------------
         |     e     |
         -------------
         |     g     |
         -------------
```

A slice in Zig, like our `[]const u8`, is a length (`3`) and a pointer to the values. Now, if we were to take the address of `user`, via `&user`, we introduce a level of indirection. For example, imagine this code:

```zig
const std = @import("std");

const User = struct {
    id: i32,
    name: []const u8,
};

pub fn main() !void {
    var user = User{ .id = 900, .name = "Teg" };
    updateUser(&user);
    std.debug.print("{d}\n", .{user.id});
}

fn updateUser(user: *User) void {
    user.id += 100000;
}
```

The `user` parameter of our `updateUser` function is pointing to the `user` on `main`'s stack:

```
updateUser
user ->  -------------
         | 83abcc30  |------------------------
         -------------                        |
         .............   empty space          |
         .............   or other data        |
                                              |
main     -------------                        |
user ->  |    900    | <----------------------
         -------------
         |     3     |
         -------------
         | 3c9414e99 | -----------------------
         -------------                        |
         .............   empty space          |
         .............   or other data        |
                                              |
         -------------                        |
         |     T     | <----------------------
         -------------
         |     e     |
         -------------
         |     g     |
         -------------
```

Because we're referencing `main`'s `user` (rather than a copy), any changes we make will be reflected in `main`. But we aren't limited to operating on fields of `user`; we can operate on its entire memory. Of course, we can create a copy of just the id field (assignments are always copies; it's just a matter of knowing _what_ we're copying):

```zig
fn updateUser(user: *User) void {
    const id = user.id;
    // ....
}
```

And now the stack for our function looks like:

```
user ->  -------------
         | 83abcc30  |
id ->    -------------
         |    900    |
         -------------
```

But we can also copy the entire user:

```zig
fn updateUser(user: *User) void {
    const copy = user.*;
    // ....
}
```

Which gives us something like:

```
updateUser
user ->  -------------
         | 83abcc30  |----------------------
copy ->  -------------                      |
         |    900    |                      |
         -------------                      |
         |     3     |                      |
         -------------                      |
         | 3c9414e99 | ---------------------|--
         -------------                      |  |
         .............   empty space        |  |
         .............   or other data      |  |
                                            |  |
main     -------------                      |  |
user ->  |    900    | <--------------------   |
         -------------                         |
         |     3     |                         |
         -------------                         |
         | 3c9414e99 | ------------------------|
         -------------                         |
         .............   empty space           |
         .............   or other data         |
                                               |
         -------------                         |
         |     T     | <-----------------------
         -------------
         |     e     |
         -------------
         |     g     |
         -------------
```

Notice that it didn't create a copy of the value 'Teg'. You could call this copying "shallow": it copied the `900`, the `3` (name length) and the `3c9414e99` (address of the name pointer). Just like our simpler example above, we can also assign using the dereferencing operator:

```zig
fn updateUser(user: *User) void {
    // using type inference
    // could be more explicit and do
    // user.* = User{....}
    user.* = .{
        .id = 5,
        .name = "Paul",
    };
}
```

This doesn't copy anything; it writes into the address that we were given, the address of `main`'s `user`:

```
updateUser
user ->  -------------
         | 83abcc30  |------------------------
         -------------                        |
         .............   empty space          |
         .............   or other data        |
                                              |
main     -------------                        |
user ->  |     5     | <----------------------
         -------------
         |     4     |
         -------------
         | 9bf4a990  | -----------------------
         -------------                        |
         .............   empty space          |
         .............   or other data        |
                                              |
         -------------                        |
         |     P     | <----------------------
         -------------
         |     a     |
         -------------
         |     u     |
         -------------
         |     l     |
         -------------
```

If you're still not fully comfortable with this, and if you haven't done so already, you might be interested in the pointers and stack memory parts of my Learning Zig series.
## ArenaAllocator.free and Nested Arenas
What happens when you `free` with an ArenaAllocator? You might be tempted to look at the documentation for `std.mem.Allocator.free`, which says "Free an array allocated with alloc". But this is the one thing we're sure it _won't_ do. In its current implementation, calling `free` usually does nothing: the freed memory isn't made available for subsequent allocations by the arena, and it certainly isn't released back to the operating system. However, under specific conditions, `free` will make the memory re-usable by the arena. The only way to really "free" the memory is to call `deinit`.

The only case when we're guaranteed that the memory will be reusable by the arena is when it was the last allocation made:

```zig
const str1 = try arena.dupe(u8, "Over 9000!!!");
arena.free(str1);
```

Above, whatever memory was allocated to duplicate our string will be available for subsequent allocations made with `arena`. In the following case, the two calls to `arena.free` do nothing:

```zig
const str1 = try arena.dupe(u8, "ab");
const str2 = try arena.dupe(u8, "12");
arena.free(str1);
arena.free(str2);
```

In order to "fix" this code, we'd need to reverse the order of the two frees:

```zig
const str1 = try arena.dupe(u8, "ab");
const str2 = try arena.dupe(u8, "12");
arena.free(str2); // swapped this line with the next
arena.free(str1);
```

Now, when we call `arena.free(str2)`, the memory allocated for `str2` will be available to subsequent allocations. But what happens when we call `arena.free(str1)`? The answer, again, is: _it depends_. It has to do with the internal state of the arena. Simplistically, an `ArenaAllocator` keeps a linked list of memory buffers. Imagine something like:

```
buffer_list.head ->  ------------
                     | next     | -> null
                     |   ----   |
                     |          |
                     |          |
                     |          |
                     |          |
                     |          |
                     ------------
```

Our linked list has a single node along with 5 bytes of available space. After we allocate `str1`, it looks like:

```
buffer_list.head ->  ------------
                     | next     | -> null
                     |   ----   |
            str1 ->  | a        |
                     | b        |
                     |          |
                     |          |
                     |          |
                     ------------
```

Then, when we allocate `str2`, it looks like:

```
buffer_list.head ->  ------------
                     | next     | -> null
                     |   ----   |
            str1 ->  | a        |
                     | b        |
            str2 ->  | 1        |
                     | 2        |
                     |          |
                     ------------
```

When we free `str2`, it goes back to how it was before:

```
buffer_list.head ->  ------------
                     | next     | -> null
                     |   ----   |
            str1 ->  | a        |
                     | b        |
                     |          |
                     |          |
                     |          |
                     ------------
```

Which means that when we `arena.free(str1)`, it **will** make that memory available again. However, if instead of allocating two strings, we allocate three:

```zig
const str1 = try arena.dupe(u8, "ab");
const str2 = try arena.dupe(u8, "12");
const str3 = try arena.dupe(u8, "()");
arena.free(str3);
arena.free(str2);
arena.free(str1);
```

Our first buffer doesn't have enough space for the new string, so a new node is prepended to our linked list:

```
buffer_list.head ->  ------------      ------------
                     | next     | ->   | next     | -> null
                     |   ----   |      |   ----   |
            str3 ->  | (        |      | a        | <- str1
                     | )        |      | b        |
                     |          |      | 1        | <- str2
                     |          |      | 2        |
                     |          |      |          |
                     ------------      ------------
```

When we call `arena.free(str3)`, the memory for that allocation will be made available, but subsequent frees, even if they're in the correct order (i.e. freeing `str2` then `str1`), will be noops. The ArenaAllocator can only act on the head of our linked list, even if the head is empty. In short, when we `free` the last allocation, that memory will _always_ be made available. But subsequent calls to `free` only behave this way if (a) they're also in order and (b) the allocations happen to live within the same internal memory node.
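We can watch the last-allocation case in action. This is my own sketch, not from the post, and whether the arena actually hands back the same address is an implementation detail - treat the printed result as informative rather than guaranteed:

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    defer _ = gpa.deinit();

    var arena = std.heap.ArenaAllocator.init(gpa.allocator());
    defer arena.deinit();
    const allocator = arena.allocator();

    const str1 = try allocator.dupe(u8, "Over 9000!!!");
    // str1 is the arena's most recent allocation, so freeing it should
    // make those exact bytes available to the next same-sized allocation
    allocator.free(str1);

    const str2 = try allocator.dupe(u8, "Over 9000!!!");
    std.debug.print("reused: {}\n", .{str1.ptr == str2.ptr});
}
```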
### Nested Arenas

Zig's allocators are said to be composable. When we create an `ArenaAllocator`, we pass a single parameter: an allocator. That parent allocator (1) can be any other type of allocator. You can, for example, create an `ArenaAllocator` on top of a `FixedBufferAllocator`. You can also create an `ArenaAllocator` on top of another `ArenaAllocator`.

(1) Zig calls this the "child allocator", but that doesn't make any sense to me.

This kind of thing often happens within libraries, where an API takes an `std.mem.Allocator` and the library creates an `ArenaAllocator` on top of it. And what happens when the provided allocator was already an arena? Libraries aside, I mean something like:

```zig
var parent_arena = ArenaAllocator.init(gpa_allocator);
const parent_allocator = parent_arena.allocator();

var inner_arena = ArenaAllocator.init(parent_allocator);
const inner_allocator = inner_arena.allocator();

_ = try inner_allocator.dupe(u8, "Over ");
_ = try inner_allocator.dupe(u8, "9000!");
inner_arena.deinit();
```

It does work, but at best, when `deinit` is called, the memory will only be made available to be re-used by `parent_arena`. Except in simple cases, allocations made by `inner_arena` are likely to span multiple buffers of `parent_arena`, and of course you can still make allocations directly in `parent_arena`, which can generate its own new buffers or simply make the ordering requirement impossible to fulfill. For example, if we make an allocation in `parent_arena` before `inner_arena.deinit();` is called:

```zig
_ = try parent_allocator.dupe(u8, "!!!");
inner_arena.deinit();
```

Then the `deinit` does nothing.

So while nesting ArenaAllocators works, I don't think there's any advantage over using a single arena. And I think in many cases where you have an inner arena, like in a library, it's better if the caller provides a non-arena parent allocator so that all the memory is really freed when the library is done with it. Of course, there's a transparency issue here. Unless the library documents exactly how it's using your provided allocator, or unless you explore the code - and assuming the implementation doesn't change - it's hard to know what you should use.
## Allocator.resize
There are four important methods on Zig's `std.mem.Allocator` interface that Zig developers must be comfortable with:

* `alloc(T, n)` - which creates an array of `n` items of type `T`,
* `free(ptr)` - which frees memory allocated with `alloc` (although, as we've seen with arenas, this is implementation-specific),
* `create(T)` - which creates a single item of type `T`, and
* `destroy(ptr)` - which destroys an item created with `create`

While you might never need to use them, the `Allocator` interface has other methods which, if nothing else, can be useful to be aware of and informative to learn about. In particular, the `resize` method is used to try to resize an existing allocation to a larger (or smaller) size. The main promise of `resize` is that it's guaranteed _not_ to move the pointer. However, to satisfy that guarantee, resize is allowed to fail, in which case nothing changes. We can imagine a simple allocation:

```
// var buf = try allocator.alloc(u8, 5);
// buf[0] = 'h'

            0x102e00000
            -------------------------------
buf.ptr ->  | h |   |   |   |   |
            -------------------------------
```

Now, if we were to call `allocator.resize(buf, 7)`, there are two possible outcomes. The first is that the call returns `false`, indicating that the resize operation failed, and thus nothing changed:

```
            0x102e00000
            -------------------------------
buf.ptr ->  | h |   |   |   |   |
            -------------------------------
```

However, when `resize` succeeds and returns `true`, the allocated space has grown without having relocated (i.e. moved) the pointer:

```
            0x102e00000
            -------------------------------------------
buf.ptr ->  | h |   |   |   |   |   |   |
            -------------------------------------------
```

Under what circumstances `resize` succeeds or fails is a black box. It depends on a lot of factors and is going to be allocator-specific. For example, for me, this code prints `false`, indicating that the resize failed:

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    const buf = try allocator.alloc(usize, 10);
    std.debug.print("{any}\n", .{allocator.resize(buf, 20)});
    allocator.free(buf);
}
```

Because we're using a `GeneralPurposeAllocator` (that name is deprecated in Zig 0.14 in favor of `DebugAllocator`), we could dive into its internals and try to leverage knowledge of its implementation to force a resize to succeed, but a simpler option is to resize our buffer to `0`:

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    const buf = try allocator.alloc(usize, 10);
    // changed 20 -> 0
    std.debug.print("{any}\n", .{allocator.resize(buf, 0)});
    allocator.free(buf);
}
```

Success, the code now prints `true`, indicating that the resize succeeded. However, I also get a **segfault**. Can you guess what we're doing wrong? In our above visualization, we saw how a successful resize does not move our pointer. We can confirm this by looking at the address of `buf.ptr` before and after our resize.
This code still segfaults, but it prints out the information first:

```zig
pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    const buf = try allocator.alloc(usize, 10);
    std.debug.print("address before resize: {*}\n", .{buf.ptr});
    std.debug.print("resize succeeded: {any}\n", .{allocator.resize(buf, 0)});
    std.debug.print("address after resize: {*}\n", .{buf.ptr});
    allocator.free(buf);
}
```

So far, we've only considered the `ptr` of our slice, but, like the criminal justice system, a slice is represented by two separate yet equally important groups: a `ptr` and a `len`. If we change our code to also look at the `len` of `buf`, the issue might become more obvious:

```zig
// change the 1st and 3rd line to also print buf.len:
std.debug.print("address & len before resize: {*} {d}\n", .{buf.ptr, buf.len});
std.debug.print("resize succeeded: {any}\n", .{allocator.resize(buf, 0)});
std.debug.print("address & len after resize: {*} {d}\n", .{buf.ptr, buf.len});
```

This is what I get:

```
address & len before resize: usize@100280000 10
resize succeeded: true
address & len after resize: usize@100280000 10
Segmentation fault at address 0x100280000
```

While it isn't the cleanest output, notice that even after we successfully resize our ptr, the length remains unchanged (i.e. `10`). Herein lies our bug: `resize` updates the underlying memory, but it doesn't update the length of the slice. That's something we need to take care of ourselves. Here's a non-crashing version:

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    var buf = try allocator.alloc(usize, 10);
    if (allocator.resize(buf, 0)) {
        std.debug.print("resize succeeded!\n", .{});
        buf.len = 0;
    } else {
        // we need to handle the case where resize doesn't succeed
    }
    allocator.free(buf);
}
```

What's left out of the above code is handling the case where `resize` fails. That's application-specific. In most cases, where we're likely resizing to a larger size, we'll generally need to fall back to calling `alloc` to create the larger buffer and then, most likely, `@memcpy` to copy data from the existing (now too small) buffer to the newly created larger one.
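To make that last paragraph concrete, here's a sketch of the grow-or-fallback pattern (mine, not from the post): try `resize` first and, if the allocator refuses, `alloc` a bigger buffer, `@memcpy` the old data over, and free the old buffer:

```zig
const std = @import("std");

fn grow(allocator: std.mem.Allocator, buf: []usize, new_len: usize) ![]usize {
    if (allocator.resize(buf, new_len)) {
        // the memory grew in place; we still have to fix the slice's len
        var grown = buf;
        grown.len = new_len;
        return grown;
    }
    // in-place resize failed: allocate, copy, free the original
    const bigger = try allocator.alloc(usize, new_len);
    @memcpy(bigger[0..buf.len], buf);
    allocator.free(buf);
    return bigger;
}

test "grow" {
    const allocator = std.testing.allocator;
    var buf = try allocator.alloc(usize, 10);
    buf = try grow(allocator, buf, 20);
    defer allocator.free(buf);
    try std.testing.expectEqual(@as(usize, 20), buf.len);
}
```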
## Zig's new LinkedList API (it's time to learn @fieldParentPtr)
In a recent, post-Zig 0.14 commit, Zig's `SinglyLinkedList` and `DoublyLinkedList` saw significant changes. The previous version was generic and, with all the methods removed, looked like:

```zig
pub fn SinglyLinkedList(comptime T: type) type {
    return struct {
        first: ?*Node = null,

        pub const Node = struct {
            next: ?*Node = null,
            data: T,
        };
    };
}
```

The new version isn't generic. Rather, you embed the linked list node with your data. This is known as an intrusive linked list and tends to perform better and require fewer allocations. Except in trivial examples, the data that we store in a linked list is typically stored on the heap. Because an intrusive linked list has the linked list node embedded in the data, it doesn't need its own allocation. Before we jump into an example, this is what the new structure looks like, again with all methods removed:

```zig
pub const SinglyLinkedList = struct {
    first: ?*Node = null,

    pub const Node = struct {
        next: ?*Node = null,
    };
};
```

Much simpler, and notice that this has no link or reference to any of our data. Here's a working example that shows how you'd use it:

```zig
const std = @import("std");
const SinglyLinkedList = std.SinglyLinkedList;

pub fn main() !void {
    // GeneralPurposeAllocator is being renamed
    // to DebugAllocator. Let's get used to that name
    var gpa: std.heap.DebugAllocator(.{}) = .init;
    const allocator = gpa.allocator();

    var list: SinglyLinkedList = .{};

    const user1 = try allocator.create(User);
    defer allocator.destroy(user1);
    user1.* = .{
        .id = 1,
        .power = 9000,
        .node = .{},
    };
    list.prepend(&user1.node);

    const user2 = try allocator.create(User);
    defer allocator.destroy(user2);
    user2.* = .{
        .id = 2,
        .power = 9001,
        .node = .{},
    };
    list.prepend(&user2.node);

    var node = list.first;
    while (node) |n| {
        std.debug.print("{any}\n", .{n});
        node = n.next;
    }
}

const User = struct {
    id: i64,
    power: u32,
    node: SinglyLinkedList.Node,
};
```

To run this code, you'll need a nightly release from within the last week. What do you think the output will be? You should see something like:

```
SinglyLinkedList.Node{ .next = SinglyLinkedList.Node{ .next = null } }
SinglyLinkedList.Node{ .next = null }
```

We're only getting the nodes, and, as we can see here and from the above skeleton structure of the new `SinglyLinkedList`, there's nothing about our users. Users have nodes, but there's seemingly nothing that links a node back to its containing user. Or is there? In the past, we've described how the compiler uses type information to figure out how to access fields. For example, when we execute `user1.power`, the compiler knows that:

1. `id` is +0 bytes from the start of the structure,
2. `power` is +8 bytes from the start of the structure (because id is an i64), and
3. `power` is a u32

With this information, the compiler knows how to access `power` from `user1` (i.e. jump forward 8 bytes, read 4 bytes and treat it as a u32). But if you think about it, that logic is simple to reverse. If we know the address of `power`, then the address of `user` has to be `address_of_power - 8`.
We can prove this:

```zig
const std = @import("std");

pub fn main() !void {
    var user = User{
        .id = 1,
        .power = 9000,
    };
    std.debug.print("address of user: {*}\n", .{&user});

    const address_of_power = &user.power;
    std.debug.print("address of power: {*}\n", .{address_of_power});

    const power_offset = 8;
    const also_user: *User = @ptrFromInt(@intFromPtr(address_of_power) - power_offset);
    std.debug.print("address of also_user: {*}\n", .{also_user});
    std.debug.print("also_user: {}\n", .{also_user});
}

const User = struct {
    id: i64,
    power: u32,
};
```

The magic happens here:

```zig
const power_offset = 8;
const also_user: *User = @ptrFromInt(@intFromPtr(address_of_power) - power_offset);
```

We're turning the address of our user's power field, `&user.power`, into an integer, subtracting 8 (8 bytes, 64 bits), and telling the compiler that it should treat that memory as a `*User`. This code will _probably_ work for you, but it isn't safe. Specifically, unless we're using a packed or extern struct, Zig makes no guarantees about the layout of a structure. It could put `power` BEFORE `id`, in which case our `power_offset` should be 0. It could add padding after every field. It can do anything it wants. To make this code safer, we use the `@offsetOf` builtin to get the actual byte-offset of a field with respect to its struct:

```zig
const power_offset = @offsetOf(User, "power");
```

Back to our linked list: given that we have the address of a `node` and we know that it is part of the `User` structure, we _are_ able to get the `User` from a node. Rather than use the above code though, we'll use the _slightly_ friendlier `@fieldParentPtr` builtin. Our `while` loop changes to:

```zig
while (node) |n| {
    const user: *User = @fieldParentPtr("node", n);
    std.debug.print("{any}\n", .{user});
    node = n.next;
}
```

We give `@fieldParentPtr` the name of the field, a pointer to that field, and a return type (which is inferred above by the assignment to a `*User` variable), and it gives us back the instance that contains that field.

Performance aside, I have mixed feelings about the new API. My initial reaction is that I dislike exposing what I consider a complicated builtin like `@fieldParentPtr` for something as trivial as using a linked list. However, while `@fieldParentPtr` seems esoteric, it's quite useful, and developers should be familiar with it because it can help solve problems which are otherwise hard to solve.
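As a closing sanity check, here's a small test of mine (not from the post) showing that `@fieldParentPtr` and the manual `@offsetOf` arithmetic recover the same parent pointer:

```zig
const std = @import("std");

const User = struct {
    id: i64,
    power: u32,
};

test "recover parent from field pointer" {
    var user = User{ .id = 1, .power = 9000 };
    const power_ptr = &user.power;

    // the friendly builtin
    const via_builtin: *User = @fieldParentPtr("power", power_ptr);
    // the manual equivalent: subtract the field's byte offset
    const via_offset: *User = @ptrFromInt(@intFromPtr(power_ptr) - @offsetOf(User, "power"));

    try std.testing.expectEqual(&user, via_builtin);
    try std.testing.expectEqual(&user, via_offset);
}
```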
## Zig's new Writer
As you might have heard, Zig's `Io` namespace is being reworked. Eventually, this will mean the re-introduction of async. As a first step though, the Writer and Reader interfaces and some of the related code have been revamped.

> This post is written based on a mid-July 2025 development release of Zig. It doesn't apply to Zig 0.14.x (or any previous version) and is likely to be outdated as more of the Io namespace is reworked.

Not long ago, I wrote a blog post which tried to explain Zig's Writers. At best, I'd describe the current state as "confusing": two writer interfaces, while often dealing with `anytype`. And while `anytype` is convenient, it lacks developer ergonomics. Furthermore, the current design has significant performance issues for some common cases.

### Drain

The new `Writer` interface is `std.Io.Writer`. At a minimum, implementations have to provide a `drain` function. Its signature looks like:

```zig
fn drain(w: *Writer, data: []const []const u8, splat: usize) Error!usize
```

You might be surprised that this is the method a custom writer needs to implement. Not only does it take an array of strings, but what's that `splat` parameter? Like me, you might have expected a simpler `write` method:

```zig
fn write(w: *Writer, data: []const u8) Error!usize
```

It turns out that `std.Io.Writer` has buffering built-in. For example, if we want a `Writer` for an `std.fs.File`, we need to provide the buffer:

```zig
var buffer: [1024]u8 = undefined;
var writer = my_file.writer(&buffer);
```

Of course, if we don't want buffering, we can always pass an empty buffer:

```zig
var writer = my_file.writer(&.{});
```

This explains why custom writers need to implement a `drain` method, and not something simpler like `write`. The simplest way to implement `drain`, and what a lot of the Zig standard library has been upgraded to while this larger overhaul takes place, is:

```zig
fn drain(io_w: *std.Io.Writer, data: []const []const u8, splat: usize) !usize {
    _ = splat;
    const self: *@This() = @fieldParentPtr("interface", io_w);
    return self.writeAll(data[0]) catch return error.WriteFailed;
}
```

We ignore the `splat` parameter and just write the first value in `data` (`data.len > 0` is guaranteed to be true). This turns `drain` into what a simpler `write` method would look like. Because we return the length of bytes written, `std.Io.Writer` will know that we potentially didn't write all the data and call `drain` again, if necessary, with the rest of the data.

> If you're confused by the call to `@fieldParentPtr`, check out my post on the upcoming linked list changes.

The actual implementation of `drain` for `File` is a non-trivial ~150 lines of code. It has platform-specific code and leverages vectored I/O where possible. There's obviously flexibility to provide a simple implementation or a more optimized one.
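Here's what that minimal pattern can look like end-to-end. This is my own sketch, not from the post or the standard library: a tiny writer that discards bytes but counts them. It assumes the mid-2025 dev API, where `vtable` is a pointer to a `VTable` whose only required function is `drain`; details may have shifted since:

```zig
const std = @import("std");

const CountingWriter = struct {
    count: usize = 0,
    interface: std.Io.Writer = .{
        .buffer = &.{}, // unbuffered, so drain always sees the data directly
        .vtable = &.{ .drain = drain },
    },

    fn drain(io_w: *std.Io.Writer, data: []const []const u8, splat: usize) std.Io.Writer.Error!usize {
        _ = splat;
        // recover our CountingWriter from the embedded interface
        const self: *CountingWriter = @fieldParentPtr("interface", io_w);
        self.count += data[0].len;
        return data[0].len;
    }
};

test "CountingWriter counts" {
    var cw: CountingWriter = .{};
    try cw.interface.writeAll("over ");
    try cw.interface.print("{d}!", .{9000});
    try std.testing.expectEqual(@as(usize, 10), cw.count);
}
```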
### The Interface

Much like the current state, when you do `file.writer(&buffer)`, you don't get an `std.Io.Writer`. Instead, you get a `File.Writer`. To get an actual `std.Io.Writer`, you need to access the `interface` field. This is merely a convention, but expect it to be used throughout the standard, and third-party, libraries. Get ready to see a lot of `&xyz.interface` calls! This simplification of `File` shows the relationship between the three types:

```zig
pub const File = struct {
    pub fn writer(self: *File, buffer: []u8) Writer {
        return .{
            .file = self,
            .interface = std.Io.Writer{
                .buffer = buffer,
                .vtable = .{ .drain = Writer.drain },
            },
        };
    }

    pub const Writer = struct {
        file: *File,
        interface: std.Io.Writer,
        // this has a bunch of other fields

        fn drain(io_w: *std.Io.Writer, data: []const []const u8, splat: usize) !usize {
            const self: *Writer = @fieldParentPtr("interface", io_w);
            // ....
        }
    };
};
```

The instance of `File.Writer` needs to exist somewhere (e.g. on the stack) since that's where the `std.Io.Writer` interface exists. It's possible that `File` could directly have a `writer_interface: std.Io.Writer` field, but that would limit you to one writer per file and would bloat the `File` structure. We can see from the above that, while we call `Writer` an "interface", it's just a normal struct. It has a few fields beyond `buffer` and `vtable.drain`, but these are the only two with non-default values; we have to provide them.

The `Writer` interface implements a lot of typical "writer" behavior, such as `writeAll` and `print` (for formatted writing). It also has a number of methods which only a `Writer` implementation would likely care about. For example, `File.Writer.drain` has to call `consume` so that the writer's internal state can be updated. Having all of these functions listed side-by-side in the documentation confused me at first. Hopefully it's something the documentation generation will one day be able to help disentangle.

### Migrating

The new `Writer` has taken over a number of methods. For example, `std.fmt.formatIntBuf` no longer exists. The replacement is the `printInt` method of `Writer`. But this requires a `Writer` instance rather than the simple `[]u8` previously required. It's easy to miss, but the `Writer.fixed([]u8) Writer` function is what you're looking for. You'll use this for any function that was migrated to `Writer` and used to work on a `buffer: []u8`.

While migrating, you might run into the following error: _no field or member function named 'adaptToNewApi' in '...'_. You can see why this happens by looking at the updated implementation of `std.fmt.format`:

```zig
pub fn format(writer: anytype, comptime fmt: []const u8, args: anytype) !void {
    var adapter = writer.adaptToNewApi();
    return adapter.new_interface.print(fmt, args) catch |err| switch (err) {
        error.WriteFailed => return adapter.err.?,
    };
}
```

Because this functionality was moved to `std.Io.Writer`, any `writer` passed into `format` has to be able to upgrade itself to the new interface. This is done, again only by convention, by having the "old" writer expose an `adaptToNewApi` method which returns a type that exposes a `new_interface: std.Io.Writer` field. This is pretty easy to implement using the basic `drain` implementation, and you can find a handful of examples in the standard library, but it's of little help if you don't control the legacy writer.

### Conclusion

I'm hesitant to provide an opinion on this change. I don't understand language design. However, while I think this is an improvement over the current API, I keep thinking that adding buffering directly to the `Writer` isn't ideal. I believe that most languages deal with buffering via composition. You take a reader/writer and wrap it in a BufferedReader or BufferedWriter. This approach seems both simple to understand and implement while being powerful. It can be applied to things beyond buffering and IO.
Zig seems to struggle with this model. Rather than providing a cohesive and generic approach for such problems, one specific feature (buffering) for one specific API (IO) was baked into the standard library. Maybe I'm too dense to understand, or maybe future changes will address this more holistically.
## I'm too dumb for Zig's new IO interface
You might have heard that Zig 0.15 introduces a new IO interface, with the focus for this release being the new std.Io.Reader and std.Io.Writer types. The old "interfaces" had problems, like this performance issue that I opened. And they relied on a mix of types, which always confused me, and a lot of `anytype` - which is generally great, but a poor foundation to build an interface on.

I've been slowly upgrading my libraries, and I ran into changes to the `tls.Client` used by my smtp library. For the life of me, I just don't understand how it works. Zig has never been known for its documentation, but if we look at the documentation for `tls.Client.init`, we'll find:

```
pub fn init(input: *std.Io.Reader, output: *std.Io.Writer, options: Options) InitError!Client

Initiates a TLS handshake and establishes a TLSv1.2 or TLSv1.3 session.
```

So it takes one of these new Readers and a new Writer, along with some options (sneak peek: they aren't all optional). It doesn't look like you can just give it a `net.Stream`, but `net.Stream` does expose a `reader()` and `writer()` method, so that's probably a good place to start:

```zig
const stream = try std.net.tcpConnectToHost(allocator, "www.openmymind.net", 443);
defer stream.close();

var writer = stream.writer(&.{});
var reader = stream.reader(&.{});

var tls_client = try std.crypto.tls.Client.init(
    reader.interface(),
    &writer.interface,
    .{}, // options TODO
);
```

Note that `stream.writer()` returns a `Stream.Writer` and `stream.reader()` returns a `Stream.Reader` - those aren't the types our `tls.Client` expects. To convert the `Stream.Reader` to an `*std.Io.Reader`, we need to call its `interface()` method. To get a `*std.Io.Writer` from a `Stream.Writer`, we need the address of its `interface` field. This doesn't seem particularly consistent.

Don't forget that the `writer` and `reader` need a stable address. Because I'm trying to get the simplest example working, this isn't an issue - everything will live on the stack of `main`. In a real world example, I think it means that I'll always have to wrap the `tls.Client` in my own heap-allocated type, giving the writer and reader a cozy, stable home.

Speaking of allocations, you might have noticed that `stream.writer` and `stream.reader` take a parameter: the buffer they should use. Buffering is a first class citizen of the new Io interface - who needs composition? The documentation **does** tell me these need to be at least `std.crypto.tls.max_ciphertext_record_len` large, so we need to fix things a bit:

```zig
var write_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
var writer = stream.writer(&write_buf);

var read_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
var reader = stream.reader(&read_buf);
```

Here's where the code stands:

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.DebugAllocator(.{}) = .init;
    const allocator = gpa.allocator();

    const stream = try std.net.tcpConnectToHost(allocator, "www.openmymind.net", 443);
    defer stream.close();

    var write_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
    var writer = stream.writer(&write_buf);

    var read_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
    var reader = stream.reader(&read_buf);

    var tls_client = try std.crypto.tls.Client.init(
        reader.interface(),
        &writer.interface,
        .{},
    );
    defer tls_client.end() catch {};
}
```

But if you try to run it, you'll get a compilation error. It turns out we have to provide 4 options: the ca bundle, a host, a `write_buffer` and a `read_buffer`.
Normally I'd expect the options parameter to be for optional parameters. I don't understand why some parameters (input and output) are passed one way, while `write_buffer` and `read_buffer` are passed another. Let's give it what it wants AND send some data:

```zig
// existing setup...
var bundle = std.crypto.Certificate.Bundle{};
try bundle.rescan(allocator);
defer bundle.deinit(allocator);

var tls_client = try std.crypto.tls.Client.init(
    reader.interface(),
    &writer.interface,
    .{
        .ca = .{ .bundle = bundle },
        .host = .{ .explicit = "www.openmymind.net" },
        .read_buffer = &.{},
        .write_buffer = &.{},
    },
);
defer tls_client.end() catch {};

try tls_client.writer.writeAll("GET / HTTP/1.1\r\n\r\n");
```

Now, if I try to run it, the program just hangs. I don't know what `write_buffer` is, but I know Zig now loves buffers, so let's try to give it something:

```zig
// existing setup...

// I don't know what size this should/has to be??
var write_buf2: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;

var tls_client = try std.crypto.tls.Client.init(
    reader.interface(),
    &writer.interface,
    .{
        .ca = .{ .bundle = bundle },
        .host = .{ .explicit = "www.openmymind.net" },
        .read_buffer = &.{},
        .write_buffer = &write_buf2,
    },
);
defer tls_client.end() catch {};

try tls_client.writer.writeAll("GET / HTTP/1.1\r\n\r\n");
```

Great, now the code doesn't hang; all we need to do is read the response. `tls.Client` exposes a `reader: *std.Io.Reader` field which is "Decrypted stream from the server to the client." That sounds like what we want, but believe it or not, `std.Io.Reader` doesn't have a `read` method. It has a `peek`, a `takeByteSigned`, a `readSliceShort` (which seems close, but it blocks until the provided buffer is full), a `peekArray` and a lot more, but nothing like the `read` I'd expect. The closest I can find, which I think does what I want, is to stream it to a writer:

```zig
var buf: [1024]u8 = undefined;
var w: std.Io.Writer = .fixed(&buf);
const n = try tls_client.reader.stream(&w, .limited(buf.len));
std.debug.print("read: {d} - {s}\n", .{n, buf[0..n]});
```

If we try to run the code now, it crashes. We've apparently failed an assertion regarding the length of a buffer. So it seems like we also _have_ to provide a `read_buffer`.
Here's my current version (it doesn't work, but it doesn't crash!):

```zig
const std = @import("std");

pub fn main() !void {
    var gpa: std.heap.DebugAllocator(.{}) = .init;
    const allocator = gpa.allocator();

    const stream = try std.net.tcpConnectToHost(allocator, "www.openmymind.net", 443);
    defer stream.close();

    var write_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
    var writer = stream.writer(&write_buf);

    var read_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
    var reader = stream.reader(&read_buf);

    var bundle = std.crypto.Certificate.Bundle{};
    try bundle.rescan(allocator);
    defer bundle.deinit(allocator);

    var write_buf2: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
    var read_buf2: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;

    var tls_client = try std.crypto.tls.Client.init(
        reader.interface(),
        &writer.interface,
        .{
            .ca = .{ .bundle = bundle },
            .host = .{ .explicit = "www.openmymind.net" },
            .read_buffer = &read_buf2,
            .write_buffer = &write_buf2,
        },
    );
    defer tls_client.end() catch {};

    try tls_client.writer.writeAll("GET / HTTP/1.1\r\n\r\n");

    var buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined;
    var w: std.Io.Writer = .fixed(&buf);
    const n = try tls_client.reader.stream(&w, .limited(buf.len));
    std.debug.print("read: {d} - {s}\n", .{n, buf[0..n]});
}
```

When I looked through Zig's source code, there's only one place using `tls.Client`. It helped get me to where I am. I couldn't find any tests. I'll admit that during this migration, I've missed some basic things. For example, someone had to help me find `std.fmt.printInt` - the renamed version of `std.fmt.formatIntBuf`. Maybe there's a helper like `tls.Client.init(allocator, stream)` somewhere. And maybe it makes sense that we do `reader.interface()` but `&writer.interface` - I'm reminded of Go's `*http.Request` and `http.ResponseWriter`. And maybe Zig has some consistent rule for what parameters belong in options. And I know nothing about TLS, so maybe it makes complete sense to need 4 buffers. I feel a bit more confident about the weirdness of not having a `read(buf: []u8) !usize` function on `Reader`, but at this point I wouldn't bet on me.
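For what it's worth, the stream-to-a-fixed-Writer workaround above can be wrapped into the `read` I was looking for. This is my own sketch, using only the calls exercised in this post, and it inherits all of the same caveats:

```zig
const std = @import("std");

// approximates a classic `read(buf) !usize` on top of the new interface:
// fill a fixed Writer backed by `buf`, capped at `buf.len` bytes
fn readSome(reader: *std.Io.Reader, buf: []u8) !usize {
    var w: std.Io.Writer = .fixed(buf);
    return reader.stream(&w, .limited(buf.len));
}

// usage, in place of the last three lines of main above:
//   const n = try readSome(tls_client.reader, &buf);
//   std.debug.print("read: {d} - {s}\n", .{n, buf[0..n]});
```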
## Everything is a []u8
If you're coming to Zig from a more hand-holding language, one of the things worth exploring is the relationship between the compiler and memory. I think code is the best way to do that, but briefly put into words: the memory that your program uses is all just bytes; it is only the compile-time information (the type system) that gives meaning to and dictates how that memory is used and interpreted. This is meaningful in Zig and other similar languages because developers are allowed to override how the compiler interprets those bytes.

> This is something I've written about before; longtime readers might find this post repetitive.

Consider this code:

```zig
const std = @import("std");

pub fn main() !void {
    std.debug.print("{d}\n", .{@sizeOf(User)});
}

const User = struct {
    id: u32,
    name: []const u8,
};
```

It _should_ print 24. The point of this post isn't _why_ it prints 24. What's important here is that when we create a `User` - whether it's on the stack or the heap - it is represented by 24 bytes of memory. If you examine those 24 bytes, there's nothing "User" about them. The memory isn't self-describing - that would be inefficient. Rather, it's the compiler itself that maintains metadata about memory. Very naively, we could imagine that the compiler maintains a lookup where the key is the variable name and the value is the memory address (our 24 bytes) + the type (`User`).

The fun, and sometimes useful, thing about this is that we can alter the compiler's metadata. Here's a working but impractical example:

```zig
const std = @import("std");

pub fn main() !void {
    var user = User{.id = 9001, .name = "Goku"};
    const tea: *Tea = @ptrCast(&user);
    std.debug.print("{any}\n", .{tea});
}

const User = struct {
    id: u32,
    name: []const u8,
};

const Tea = struct {
    price: u32,
    type: TeaType,

    const TeaType = enum {
        black,
        white,
        green,
        herbal,
    };
};
```

First we create a `User` - nothing unusual about that. Next we use @ptrCast to tell the compiler to treat the memory referenced by `user` as a `*Tea`. `@ptrCast` works on addresses, which is why we give it the address of (`&`) `user` and get back a pointer (`*`) to `Tea`. Here the return type of `@ptrCast` is inferred by the type it's being assigned to.

You might have some questions, like: what does it print? Is it safe? Is this ever useful? We'll dig more into the safety of this in a bit. But briefly, the main concern is the size of our structures. If `@sizeOf(User)` is 24 bytes, we'll be able to re-interpret that memory as anything which is 24 bytes or less. The `@sizeOf(Tea)` is 8 bytes, so this is safe. I get different results on each run:

```
.{ .price = 39897726, .type = .white }
.{ .price = 75123326, .type = .white }
.{ .price = 6441598, .type = .white }
.{ .price = 77826686, .type = .white }
.{ .price = 4950654, .type = .white }
.{ .price = 69438078, .type = .white }
.{ .price = 78498430, .type = .white }
.{ .price = 79022718, .type = .white }
```

It's possible (but not likely) that you get a consistent result. I find these results surprising. If I had to imagine what the 24 bytes of `user` look like, I'd come up with:

```
41, 35, 0, 0, 0, 0, 0, 0,
 4,  0, 0, 0, 0, 0, 0, 0,
 x,  x, x, x, x, x, x, x
```

Why that? Well, I'd expect the first 8 bytes to be the id, 9001, which has a byte representation of `41, 35, 0, 0, 0, 0, 0, 0`. The next 8 bytes, I think, would be the string length, `4, 0, 0, 0, 0, 0, 0, 0`. The last 8 bytes would be the pointer to the actual string value - an address that I have no way of guessing, so I mark it with `x, x, x, x, x, x, x, x`.
> If you think the `id` should only take 4 bytes, given that it's a u32, good! But Zig will usually align struct fields, so it really will take 8 bytes. That isn't something we'll dive into in this post though. Since `Tea` is only 8 bytes, and since the first 8 bytes of `user` are always the same (only the pointer to the name value changes from instance to instance and from run to run), shouldn't we always get the same `Tea` value? Yes, but only if I'm correct about the contents of those 24 bytes for `user`. Unless we tell it otherwise, Zig makes no guarantees about how it lays out the fields of a struct. The fact that our `tea` keeps changing makes me believe that, for reasons I don't know, Zig decided to put the pointer to our name at the start. The reason you might get different results is that Zig might have organized the user's memory differently based on your platform or version of Zig (or any other factor, but those are the two most realistic reasons). So while this code might never crash, doesn't the lack of guarantee make it useless? No. At least not in three cases. ### Well-Defined In-Memory Layout While Zig usually doesn't make guarantees about how data will be organized, C **does**. In Zig, a structure declared as `extern` follows that specification. We can similarly declare a structure as `packed`, which also has a well-defined memory layout (just not necessarily the same as C's / `extern`'s). In order for a struct to have a well-known memory layout, all of its fields must have a well-known memory layout. They can't, for example, contain slices - which don't have a guaranteed layout. Still, here's a reliable and realistic example: const std = @import("std"); pub fn main() !void { var manager = Manager{.id = 4, .name = "Leto", .name_len = 4, .level = 99}; const user: *User = @ptrCast(&manager); std.debug.print("{d}: {s}\n", .{user.id, user.name[0..user.name_len]}); } const User = extern struct { id: u32, name: [*c]const u8, name_len: usize, }; const Manager = extern struct { id: u32, name: [*c]const u8, name_len: usize, level: u16, }; Part of the guarantee is that the fields are laid out in the order that they're declared. Above, when I guessed at the layout of `user`, I made that assumption - but it's only valid for `extern` structs. We can be sure that the above code will print `4: Leto` because `Manager` has the same fields as `User`, in the same order. We can, and should, make this more explicit: const Manager = extern struct { user: User, level: u16, }; Because the type information is only metadata of the compiler, both declarations of `Manager` are the same - they're the same size and have the same layout. There's no overhead to embedding the `User` into `Manager` this way. This type of memory-reinterpretation can be found in some C code, and is thus seen in Zig code that interacts with such C codebases. ### Leveraging Zig Builtins While we can't assume anything about the memory layout of non-`extern` (or non-`packed`) structs, we can leverage various built-in functions, such as `@sizeOf`, to programmatically figure things out. Probably the most useful is `@offsetOf`, which gives us the offset of a field in bytes.
const std = @import("std"); pub fn main() !void { std.debug.print("name offset: {d}\n", .{@offsetOf(User, "name")}); std.debug.print("id offset: {d}\n", .{@offsetOf(User, "id")}); } const User = struct { id: u32, name: []const u8, }; For me, this prints: name offset: 0 id offset: 16 This helps confirm that Zig did, in fact, put the `name` before the `id`. We saw the result of that when we treated the user's memory as an instance of `Tea`. If we wanted to create a `Tea` based on the address of `user.id` rather than `user`, we could do: const std = @import("std"); pub fn main() !void { var user = User{.id = 9001, .name = "Goku"}; // changed from &user to &user.id const tea: *Tea = @ptrCast(&user.id); std.debug.print("{any}\n", .{tea}); } This will now always output the same result. But how would we take `tea` and get a `user` out of it? Generally speaking, this wouldn't be safe since `@sizeOf(Tea) < @sizeOf(User)` - the memory created to hold an instance of `Tea`, 8 bytes, can't represent the 24 bytes needed for `User`. But for this instance of `Tea`, we know that there are 24 bytes available "around" `tea`. Where exactly those 24 bytes start depends on the relative position of `user.id` within `user` itself. If we don't adjust for that offset, we risk crashing unless the offset happens to be 0. Since we know the offset is 16, not 0, this should crash: const std = @import("std"); pub fn main() !void { var user = User{.id = 9001, .name = "Goku"}; const tea: *Tea = @ptrCast(&user.id); const user2: *User = @ptrCast(@alignCast(tea)); std.debug.print("{any}\n", .{user2}); } This is our `user`'s memory (as 24 contiguous bytes, broken up into its three 8-byte fields): name.ptr => x, x, x, x, x, x, x, x name.len => 4, 0, 0, 0, 0, 0, 0, 0 id => 41, 35, 0, 0, 0, 0, 0, 0 And when we make `tea` from `&user.id`: name.ptr => x, x, x, x, x, x, x, x name.len => 4, 0, 0, 0, 0, 0, 0, 0 tea => id => 41, 35, 0, 0, 0, 0, 0, 0 more memory, but not ours to play with If we try to cast `tea` back into a `*User`, we'll be 16 bytes off and end up reading 16 bytes of memory adjacent to `tea` which isn't ours. To make this work, we need to take the address `tea` points to and subtract `@offsetOf(User, "id")` from it: const std = @import("std"); pub fn main() !void { var user = User{.id = 9001, .name = "Goku"}; const tea: *Tea = @ptrCast(&user.id); const user2: *User = @ptrFromInt(@intFromPtr(tea) - @offsetOf(User, "id")); std.debug.print("{any}\n", .{user2}); } Because we use `@offsetOf`, it no longer matters how the structure is laid out. We're always able to find the starting address of `user` based on the address of `user.id` (which is where `tea` points) because we know `@offsetOf(User, "id")`. ### As Raw Memory The above example is convoluted. There's no relationship between the data of a `User` and of a `Tea`. What does it mean to create a `Tea` out of a user's `id`? Nothing. What if we forget about `user`'s data, the `id` and `name`, and treat those 24 bytes as usable space? const std = @import("std"); pub fn main() !void { var user = User{.id = 9001, .name = "Goku"}; const tea: *Tea = @ptrCast(&user); tea.* = .{.price = 2492, .type = .black}; std.debug.print("{any}\n", .{tea}); } `user` and `tea` still share the same memory. We cannot safely use `user` after writing to `tea.*` - that write might have stored data that cannot safely be interpreted as a `User`. Specifically in this case, the write to `tea` has probably made `name.ptr` point to invalid memory.
But if we're done with `user` and know it won't be used again, we just saved a few bytes of memory by re-using its space. This can go on forever. We can safely re-use the space to create another `User`, as long as we're 100% sure that we're done with `tea`: pub fn main() !void { var user = User{.id = 9001, .name = "Goku"}; const tea: *Tea = @ptrCast(&user); tea.* = .{.price = 2492, .type = .black}; std.debug.print("{any}\n", .{tea}); const user2: *User = @ptrCast(@alignCast(tea)); user2.* = .{.id = 32, .name = "Son Goku"}; std.debug.print("{any}\n", .{user2}); } We can re-use those 24 bytes to represent anything that takes 24 bytes of memory or less. The best practical example of this is `std.heap.MemoryPool(T)`. The `MemoryPool` is an allocator that can create a single type, `T`. That might not sound particularly useful, but using what we've learned so far, it can efficiently re-use the memory of discarded values. We'll build a simplified version to see how it works, starting with a basic API - one without any recycling ability. Further, rather than make it generic, we'll make a `UserPool` specific to `User`: pub const UserPool = struct { allocator: Allocator, pub fn init(allocator: Allocator) UserPool { return .{ .allocator = allocator, }; } pub fn create(self: *UserPool) !*User { return self.allocator.create(User); } pub fn destroy(self: *UserPool, user: *User) void { self.allocator.destroy(user); } }; As-is, this is just a wrapper that limits what the allocator is able to create. Not particularly useful. But what if, instead of destroying a `user`, we made it available to a subsequent `create`? One way to do that would be to hold a `std.SinglyLinkedList`. But for that to work, we'd need to make additional allocations - the linked list nodes have to exist somewhere. But why? The `@sizeOf(User)` is large enough to be used as-is, and whenever a `user` is destroyed, we're being told that memory is free to be used. If an application _did_ use a `user` after destroying it, it would be undefined behavior, just like it is with any other allocator. Let's add a bit of decoration to our `UserPool`: pub const UserPool = struct { allocator: Allocator, free_list: ?*FreeEntry = null, const FreeEntry = struct { next: ?*FreeEntry, }; // rest is unchanged . . . for now. }; We've added a linked list to our `UserPool`. Every `FreeEntry` points to another `*FreeEntry` or `null`, including the initial one referenced by `free_list`. Now we change `destroy`: pub const UserPool = struct { // ... pub fn destroy(self: *UserPool, user: *User) void { const entry: *FreeEntry = @ptrCast(user); entry.* = .{ .next = self.free_list }; self.free_list = entry; } }; We use the ideas we've explored above to create a simple linked list. All that's left is to change `create` to leverage it: pub const UserPool = struct { // ... pub fn create(self: *UserPool) !*User { if (self.free_list) |entry| { self.free_list = entry.next; return @ptrCast(entry); } return self.allocator.create(User); } }; If we have a `FreeEntry`, then we can turn that into a `*User`. We make sure to advance our `free_list` to the next entry, which might be `null`. If there isn't an available `FreeEntry`, we allocate a new one. As a final step, we should add a `deinit` to free the memory held by our `free_list`: pub const UserPool = struct { // ...
pub fn deinit(self: *UserPool) void { var entry = self.free_list; while (entry) |e| { entry = e.next; const user: *User = @ptrCast(e); self.allocator.destroy(user); } self.free_list = null; } }; That final `@ptrCast` from a `*FreeEntry` to a `*User` might seem unnecessary. If we're freeing the memory, why does the type matter? But allocators only know how much memory to free because the compiler tells them - based on the type. Freeing `e`, a `*FreeEntry`, would only work if `@sizeOf(FreeEntry) == @sizeOf(User)` (which it isn't). In addition to being generic, Zig's actual `MemoryPool` is a bit more sophisticated, handling different alignments and even handling the case where `@sizeOf(T) < @sizeOf(FreeEntry)`, but our `UserPool` is pretty close. ### Conclusion By altering the compiler's view of our program, we can do all types of things and get into all types of trouble. While these manipulations can be done safely, they rely on understanding the lack of guarantees Zig makes. If you're programming in Zig, this is the type of thing you should try to get comfortable with. Most of this is fundamental regardless of the programming language; it's just that some languages, like Zig, give you more control. I had initially planned on writing a version of `MemoryPool` which expanded on the standard library's. I wanted to create a pool for multiple types - for example, one that can be used for both `User` and `Tea` instances. The trick, of course, would be to always allocate memory for the largest supported type (`User` in this case). But this post is already long, so I leave it as an exercise for you.
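If you want a head start on that exercise, here's a rough sketch of one way it could look. To be clear, this is my sketch, not the standard library's `MemoryPool`; the `MultiPool` name and layout are invented for illustration:

const std = @import("std");
const Allocator = std.mem.Allocator;

const User = struct { id: u32, name: []const u8 };
const Tea = struct { price: u32, type: enum { black, white, green, herbal } };

// Every slot is big enough - and aligned enough - for the largest
// supported type, so a freed User's memory can back a future Tea.
pub const MultiPool = struct {
    allocator: Allocator,
    free_list: ?*FreeEntry = null,

    const size = @max(@sizeOf(User), @sizeOf(Tea));
    const alignment = @max(@alignOf(User), @alignOf(Tea), @alignOf(FreeEntry));
    const Slot = struct { bytes: [size]u8 align(alignment) };

    const FreeEntry = struct {
        next: ?*FreeEntry,
    };

    // the recycling trick requires a slot to be able to hold a FreeEntry
    comptime {
        std.debug.assert(size >= @sizeOf(FreeEntry));
    }

    pub fn create(self: *MultiPool, comptime T: type) !*T {
        if (self.free_list) |entry| {
            self.free_list = entry.next;
            return @ptrCast(@alignCast(entry));
        }
        const slot = try self.allocator.create(Slot);
        return @ptrCast(slot);
    }

    pub fn destroy(self: *MultiPool, ptr: anytype) void {
        // only valid for pointers created by this pool
        const entry: *FreeEntry = @ptrCast(@alignCast(ptr));
        entry.* = .{ .next = self.free_list };
        self.free_list = entry;
    }

    pub fn deinit(self: *MultiPool) void {
        var entry = self.free_list;
        while (entry) |e| {
            entry = e.next;
            const slot: *Slot = @ptrCast(@alignCast(e));
            self.allocator.destroy(slot);
        }
        self.free_list = null;
    }
};

pub fn main() !void {
    var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init;
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    var pool = MultiPool{ .allocator = allocator };
    defer pool.deinit();

    const user = try pool.create(User);
    user.* = .{ .id = 1, .name = "Leto" };
    pool.destroy(user);

    // re-uses the slot that previously held the user
    const tea = try pool.create(Tea);
    tea.* = .{ .price = 2492, .type = .black };
    pool.destroy(tea);
}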
www.openmymind.net
October 15, 2025 at 6:37 AM
Is Zig's New Writer Unsafe?
If we wanted to write a function that takes one of Zig's new `*std.Io.Reader` and writes it to stdout, we might start with something like: fn output(r: *std.Io.Reader) !void { const stdout = std.fs.File.stdout(); var buffer: [???]u8 = undefined; var writer = stdout.writer(&buffer); _ = try r.stream(&writer.interface, .unlimited); try writer.interface.flush(); } But what should the size of `buffer` be? If this was a one-and-done, maybe we'd leave it empty or put in some seemingly sensible default, like 1K or 4K. If it was a mission-critical piece of code, maybe we'd benchmark it or make it platform dependent. But unless I'm missing something, whatever size we use, this function's behavior is undefined. You see, the issue is that readers can require a specific buffer size on a writer (and writers can require a specific buffer size on a reader). For example, this code, with a small buffer of 64, fails an assertion in debug mode and falls into an endless loop in release mode: const std = @import("std"); pub fn main() !void { var fixed = std.Io.Reader.fixed(&.{ 40, 181, 47, 253, 36, 110, 149, 0, 0, 88, 111, 118, 101, 114, 32, 57, 48, 48, 48, 33, 10, 1, 0, 192, 105, 241, 2, 170, 69, 248, 150 }); var decompressor = std.compress.zstd.Decompress.init(&fixed, &.{}, .{}); try output(&decompressor.reader); } fn output(r: *std.Io.Reader) !void { const stdout = std.fs.File.stdout(); var buffer: [64]u8 = undefined; var writer = stdout.writer(&buffer); _ = try r.stream(&writer.interface, .unlimited); try writer.interface.flush(); } Some might argue that this is a documentation challenge. It's true that the documentation for `zstd.Decompress` mentions what a `Writer`'s buffer must be. **But this is not a documentation problem**. There are legitimate scenarios where the nature of a `Reader` is unknown (or, at least, difficult to figure out). The type of a reader could be conditional, say, based on an HTTP response header. A library developer might take a `Reader` as an input and present their own `Reader` as an output - what buffer requirement should they document? Worse, the failure can be conditional on the input. For example, if we change our source to: var fixed = std.Io.Reader.fixed(&.{ 40, 181, 47, 253, 36, 11, 89, 0, 0, 111, 118, 101, 114, 32, 57, 48, 48, 48, 33, 10, 112, 149, 178, 212, }); everything works, making this misconfiguration particularly hard to catch early. To me this seems almost impossible - like, I must be doing something wrong. And if I am, I'm sorry. But, if I'm not, this is a problem, right?
www.openmymind.net
September 20, 2025 at 2:37 AM
Allocator.resize
There are four important methods on Zig's `std.mem.Allocator` interface that Zig developers must be comfortable with: * `alloc(T, n)` - which creates an array of `n` items of type `T`, * `free(ptr)` - which frees memory allocated with `alloc` (although, this is implementation specific), * `create(T)` - which creates a single item of type `T`, and * `destroy(ptr)` - which destroys an item created with `create` While you might never need to use them, the `Allocator` interface has other methods which, if nothing else, can be useful to be aware of and informative to learn about. In particular, the `resize` method is used to try to resize an existing allocation to a larger (or smaller) size. The main promise of `resize` is that it's guaranteed _not_ to move the pointer. However, to satisfy that guarantee, `resize` is allowed to fail, in which case nothing changes. We can imagine a simple allocation: // var buf = try allocator.alloc(u8, 5); // buf[0] = 'h' 0x102e00000 ------------------------------- buf.ptr -> | h | | | | | -------------------------------- Now, if we were to call `allocator.resize(buf, 7)`, there are two possible outcomes. The first is that the call returns `false`, indicating that the resize operation failed, and thus nothing changed: 0x102e00000 ------------------------------- buf.ptr -> | h | | | | | -------------------------------- However, when `resize` succeeds and returns `true`, the allocated space has grown without having relocated (i.e. moved) the pointer: 0x102e00000 ------------------------------------------- buf.ptr -> | h | | | | | | | -------------------------------------------- Now, under what circumstances `resize` succeeds or fails is a black box. It depends on a lot of factors and is going to be allocator-specific. For example, for me, this code prints `false`, indicating that the resize failed: const std = @import("std"); pub fn main() !void { var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init; const allocator = gpa.allocator(); defer _ = gpa.detectLeaks(); const buf = try allocator.alloc(usize, 10); std.debug.print("{any}\n", .{allocator.resize(buf, 20)}); allocator.free(buf); } Because we're using a `GeneralPurposeAllocator` (that name is deprecated in Zig 0.14 in favor of `DebugAllocator`), we could dive into its internals and try to leverage knowledge of its implementation to force a resize to succeed, but a simpler option is to resize our buffer to `0`: const std = @import("std"); pub fn main() !void { var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init; const allocator = gpa.allocator(); defer _ = gpa.detectLeaks(); const buf = try allocator.alloc(usize, 10); // change 20 -> 0 std.debug.print("{any}\n", .{allocator.resize(buf, 0)}); allocator.free(buf); } Success, the code now prints `true`, indicating that the resize succeeded. However, I also get a **segfault**. Can you guess what we're doing wrong? In our above visualization, we saw how a successful resize does not move our pointer. We can confirm this by looking at the address of `buf.ptr` before and after our resize.
This code still segfaults, but it prints out the information first: const std = @import("std"); pub fn main() !void { var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init; const allocator = gpa.allocator(); defer _ = gpa.detectLeaks(); const buf = try allocator.alloc(usize, 10); std.debug.print("address before resize: {*}\n", .{buf.ptr}); std.debug.print("resize succeeded: {any}\n", .{allocator.resize(buf, 0)}); std.debug.print("address after resize: {*}\n", .{buf.ptr}); allocator.free(buf); } So far, we've only considered the `ptr` of our slice, but, like the criminal justice system, a slice is represented by two separate yet equally important groups: a `ptr` and a `len`. If we change our code to also look at the `len` of `buf`, the issue might become more obvious: // change the 1st and 3rd line to also print buf.len: std.debug.print("address & len before resize: {*} {d}\n", .{buf.ptr, buf.len}); std.debug.print("resize succeeded: {any}\n", .{allocator.resize(buf, 0)}); std.debug.print("address & len after resize: {*} {d}\n", .{buf.ptr, buf.len}); This is what I get: address & len before resize: usize@100280000 10 resize succeeded: true address & len after resize: usize@100280000 10 Segmentation fault at address 0x100280000 While it isn't the cleanest output, notice that even after we successfully resize, the length remains unchanged (i.e. `10`). Herein lies our bug. `resize` updates the underlying memory; it doesn't update the length of the slice. That's something we need to take care of. Here's a non-crashing version: const std = @import("std"); pub fn main() !void { var gpa: std.heap.GeneralPurposeAllocator(.{}) = .init; const allocator = gpa.allocator(); defer _ = gpa.detectLeaks(); var buf = try allocator.alloc(usize, 10); if (allocator.resize(buf, 0)) { std.debug.print("resize succeeded!\n", .{}); buf.len = 0; } else { // we need to handle the case where resize doesn't succeed } allocator.free(buf); } What's left out of the above code is handling the case where `resize` fails. This becomes application-specific. In most cases, where we're likely resizing to a larger size, we'll generally need to fall back to calling `alloc` to create our larger buffer and then, most likely, `@memcpy` to copy data from the existing (now too small) buffer to the newly created larger one.
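What might that fallback look like? Here's a minimal sketch - the `grow` function is mine, not part of the standard library (note that `std.mem.Allocator` also has a `realloc` method which does roughly this for you):

const std = @import("std");
const Allocator = std.mem.Allocator;

// Try to grow buf in place; if resize fails, allocate a larger buffer,
// copy the old data over, and free the original.
fn grow(allocator: Allocator, buf: []usize, new_len: usize) ![]usize {
    std.debug.assert(new_len > buf.len);
    if (allocator.resize(buf, new_len)) {
        var grown = buf;
        grown.len = new_len; // resize doesn't update the slice's len
        return grown;
    }
    const larger = try allocator.alloc(usize, new_len);
    @memcpy(larger[0..buf.len], buf);
    allocator.free(buf);
    return larger;
}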
www.openmymind.net
September 19, 2025 at 12:34 AM
Switching on Strings in Zig
Newcomers to Zig will quickly learn that you can't switch on a string (i.e. `[]const u8`). The following code gives us the unambiguous error message _cannot switch on strings_: switch (color) { "red" => {}, "blue" => {}, "green" => {}, "pink" => {}, else => {}, } I've seen two explanations for why this isn't supported. The first is that there's ambiguity around string identity. Are two strings only considered equal if they point to the same address? Is a null-terminated string the same as its non-null-terminated counterpart? The other reason is that users of `switch` apparently expect certain optimizations which are not possible with strings (although, presumably, these same users would know that such optimizations aren't possible with strings). Instead, in Zig, there are two common methods for comparing strings. ### std.mem.eql The most common way to compare strings is using `std.mem.eql` with `if / else if / else`: if (std.mem.eql(u8, color, "red")) { } else if (std.mem.eql(u8, color, "blue")) { } else if (std.mem.eql(u8, color, "green")) { } else if (std.mem.eql(u8, color, "pink")) { } else { } The implementation of `std.mem.eql` depends on what's being compared. Specifically, it has an optimized code path when comparing strings. Although that's what we're interested in, let's look at the non-optimized version: pub fn eql(comptime T: type, a: []const T, b: []const T) bool { if (a.len != b.len) return false; if (a.len == 0 or a.ptr == b.ptr) return true; for (a, b) |a_elem, b_elem| { if (a_elem != b_elem) return false; } return true; } Whether we're dealing with slices of bytes or some other type, if they're of different lengths, they can't be equal. Once we know that they're the same length, if they point to the same memory, then they must be equal. I'm not a fan of this second check; it might be cheap, but I think it's quite uncommon. Once those initial checks are done, we compare each element (each byte of our string) one at a time. The optimized version, which _is_ used for strings, is much more involved. But it's fundamentally the same as the above, using SIMD to compare multiple bytes at once. The nature of string comparison means that real-world performance is dependent on the values being compared. We know that if we have 100 `if / else if` branches then, in the worst case, we'll need to call `std.mem.eql` 100 times. But comparing strings of different lengths, or strings which differ early, will be significantly faster. For example, consider these three cases: { const str1 = "a" ** 10_000 ++ "1"; const str2 = "a" ** 10_000 ++ "2"; _ = std.mem.eql(u8, str1, str2); } { const str1 = "1" ++ "a" ** 10_000; const str2 = "2" ++ "a" ** 10_000; _ = std.mem.eql(u8, str1, str2); } { const str1 = "a" ** 999_999; const str2 = "a" ** 1_000_000; _ = std.mem.eql(u8, str1, str2); } For me, the first comparison takes ~270ns, whereas the other two take ~20ns - despite the last one involving much larger strings. The second case is faster because the difference is early in the string, allowing the `for` loop to return after only one comparison. The third case is faster because the strings are of different lengths: `false` is returned by the initial `len` check. ### std.meta.stringToEnum `std.meta.stringToEnum` takes an enum type and a string value and returns the corresponding enum value or null.
This code prints "you picked: blue": const std = @import("std"); const Color = enum { red, blue, green, pink, }; pub fn main() !void { const color = std.meta.stringToEnum(Color, "blue") orelse { return error.InvalidChoice; }; switch (color) { .red => std.debug.print("you picked: red\n", .{}), .blue => std.debug.print("you picked: blue\n", .{}), .green => std.debug.print("you picked: green\n", .{}), .pink => std.debug.print("you picked: pink\n", .{}), } } If you don't need the enum type (i.e. `Color`) beyond this check, you can leverage Zig's anonymous types. This is equivalent: const std = @import("std"); pub fn main() !void { const color = std.meta.stringToEnum(enum { red, blue, green, pink, }, "blue") orelse return error.InvalidChoice; switch (color) { .red => std.debug.print("you picked: red\n", .{}), .blue => std.debug.print("you picked: blue\n", .{}), .green => std.debug.print("you picked: green\n", .{}), .pink => std.debug.print("you picked: pink\n", .{}), } } It's **not** obvious how this should perform versus the straightforward `if / else if` approach. Yes, we now have a `switch` statement that the compiler can [hopefully] optimize, but `std.meta.stringToEnum` still has to convert our input, `"blue"`, into an enum. The implementation of `std.meta.stringToEnum` depends on the number of possible values, i.e. the number of enum values. Currently, if there are more than 100 values, it'll fall back to using the same `if / else if` that we explored above. Thus, with more than 100 values, it does the `if / else if` check PLUS the switch. This should improve in the future. However, with 100 or fewer values, `std.meta.stringToEnum` creates a comptime `std.StaticStringMap` which can then be used to look up the value. `std.StaticStringMap` isn't something we've looked at before. It's a specialized map that buckets keys by their length. Its advantage over Zig's other hash maps is that it can be constructed at compile-time. For our `Color` enum, the internal state of a `StaticStringMap` would look something like: // keys are ordered by length keys: ["red", "blue", "pink", "green"]; // values[N] corresponds to keys[N] values: [.red, .blue, .pink, .green]; // What's this though? indexes: [0, 0, 0, 0, 1, 3]; It might not be obvious how `indexes` is used. Let's write our own `get` implementation, simulating the above `StaticStringMap` state: fn get(str: []const u8) ?Color { // Simulate the state of the StaticStringMap which // stringToEnum built at compile-time. const keys = [_][]const u8{"red", "blue", "pink", "green"}; const values = [_]Color{.red, .blue, .pink, .green}; const indexes = [_]usize{0, 0, 0, 0, 1, 3}; if (str.len >= indexes.len) { // our map has no strings of this length return null; } var index = indexes[str.len]; while (index < keys.len) { const key = keys[index]; if (key.len != str.len) { // we've gone into the next bucket, everything after // this is longer and thus can't be a match return null; } if (std.mem.eql(u8, key, str)) { return values[index]; } index += 1; } return null; } Take note that `keys` are ordered by length. As a naive implementation, we could iterate through the keys until we either find a match or find a key with a longer length. Once we find a key with a longer length, we can stop searching, as all remaining candidates won't match - they'll all be too long. `StaticStringMap` goes a step further and records the index within `keys` where entries of a specific length begin. `indexes[3]` tells us where to start looking for keys with a length of 3 (at index 0).
`indexes[5]` tells us where to start looking for keys with a length of 5 (at index 3). Above, we fall back to using `std.mem.eql` for any key which is the same length as our target string. `StaticStringMap` uses its own "optimized" version: pub fn defaultEql(a: []const u8, b: []const u8) bool { if (a.ptr == b.ptr) return true; for (a, b) |a_elem, b_elem| { if (a_elem != b_elem) return false; } return true; } This is the same as the simple `std.mem.eql` implementation, minus the length check. This is done because the `eql` within our `while` loop is only ever called for values with a matching length. On the flip side, `StaticStringMap`'s `eql` doesn't use SIMD, so it would be slower for large strings. `StaticStringMap` is a wrapper around `StaticStringMapWithEql`, which accepts a custom `eql` function, so if you _did_ want to use it for long strings or some other purpose, you have a reasonable amount of flexibility. You even have the option to use `std.static_string_map.eqlAsciiIgnoreCase` for ASCII-aware case-insensitive comparison. ### Conclusion In my own benchmarks, I've seen little difference between the two approaches, though `std.meta.stringToEnum` is generally as fast or faster. It also results in more concise code and is ideal if the resulting enum is useful beyond the comparison. You usually don't have long enum values, so the lack of SIMD optimization isn't a concern. However, if you're considering building your own `StaticStringMap` at compile time with long keys, you should benchmark with a custom `eql` function based on `std.mem.eql` - a sketch of that follows below. We could manually bucket those `if / else if` branches ourselves, similar to what the `StaticStringMap` does. Something like: switch (color.len) { 3 => { if (std.mem.eql(u8, color, "red")) { // ... return; } }, 4 => { if (std.mem.eql(u8, color, "blue")) { // ... return; } if (std.mem.eql(u8, color, "pink")) { // ... return; } }, 5 => { if (std.mem.eql(u8, color, "green")) { // ... return; } }, else => {}, } // not found Ughhh. This highlights the convenience of using `std.meta.stringToEnum` to generate similar code. Also, remember that `std.mem.eql` quickly discards strings of different lengths, which helps explain why both approaches generally perform similarly.
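As a concrete sketch of that custom-`eql` suggestion: `StaticStringMapWithEql` and `initComptime` are the standard library's, while the `simdEql` helper (which just delegates to the SIMD-optimized `std.mem.eql`) is invented for illustration:

const std = @import("std");

const Color = enum { red, blue, green, pink };

// StaticStringMap's default eql skips the length check but has no SIMD;
// for long keys, delegating to std.mem.eql may be the better trade-off.
fn simdEql(a: []const u8, b: []const u8) bool {
    return std.mem.eql(u8, a, b);
}

const color_map = std.StaticStringMapWithEql(Color, simdEql).initComptime(.{
    .{ "red", .red },
    .{ "blue", .blue },
    .{ "green", .green },
    .{ "pink", .pink },
});

pub fn main() !void {
    std.debug.print("{any}\n", .{color_map.get("green")});
}

The same shape works with `std.static_string_map.eqlAsciiIgnoreCase` if you want case-insensitive keys.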
www.openmymind.net
September 19, 2025 at 12:34 AM
GetOrPut With String Keys
I've previously blogged about how much I like Zig's `getOrPut` hashmap method. As a brief recap, we can visualize Zig's hashmap as two arrays: keys: values: -------- -------- | Paul | | 1234 | @mod(hash("Paul"), 5) == 0 -------- -------- | | | | -------- -------- | | | | -------- -------- | Goku | | 9001 | @mod(hash("Goku"), 5) == 3 -------- -------- | | | | -------- -------- When we call `get("Paul")`, we could think of this simplified implementation: fn get(map: *Self, key: K) ?V { const index = map.getIndexOf(key) orelse return null; return map.values[index]; } And, when we call `getPtr("Paul")`, we'd have this implementation: fn getPtr(map: *Self, key: K) ?*V { const index = map.getIndexOf(key) orelse return null; // notice the added '&' // we're taking the address of the array index return &map.values[index]; } By taking the address of the value directly from the hashmap's array, we avoid copying the entire value. That can have performance implications (though not for the integer value we're using here). It also allows us to directly manipulate that slot of the array: const value = map.getPtr("Paul") orelse return; value.* = 10; This is a powerful feature, but a dangerous one. If the underlying array changes, as can happen when items are added to the map, `value` would become invalid. So, while `getPtr` is useful, it requires mindfulness: try to minimize the scope of such references. Currently, Zig's HashMap doesn't shrink when items are removed, so, for now, removing items doesn't invalidate any pointers into the hashmap. But expect that to change at some point. ### GetOrPut `getOrPut` builds on the above concept. It returns a pointer to the value **and** the key, as well as creating the entry in the hashmap if necessary. For example, given that we already have an entry for "Paul", if we call `map.getOrPut("Paul")`, we'd get back a `value_ptr` that points to a slot in the hashmap's `values` array, as well as a `key_ptr` that points to a slot in the hashmap's `keys` array. If the requested key _doesn't_ exist, we get back the same two pointers, and it's our responsibility to set the value. If I asked you to increment counters inside of a hashmap, without `getOrPut`, you'd end up with two hash lookups: // Go count, exists := counters["hits"] if exists == false { counters["hits"] = 1 } else { counters["hits"] = count + 1 } With `getOrPut`, it's a single hash lookup: const gop = try counters.getOrPut("hits"); if (gop.found_existing) { gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } ### getOrPut With String Keys It seems trivial, but the most important thing to understand about `getOrPut` is that it will set the key for you if the entry has to be created. In our last example, notice that even when `gop.found_existing == false`, we never set `key_ptr` - `getOrPut` automatically sets it to the key we pass in, i.e. `"hits"`. If we were to put a breakpoint after `getOrPut` returns but before we set the value, we'd see that our two arrays look something like: keys: values: -------- -------- | | | | -------- -------- | hits | | ???? | -------- -------- | | | | -------- -------- where the entry in the `keys` array is set, but the corresponding entry in `values` is left undefined. You'll note that `getOrPut` doesn't take a value. I assume this is because, in some cases, the value might be expensive to derive, so the current API lets us avoid calculating it when `gop.found_existing == true`. This is important for keys that need to be owned by the hashmap.
Most commonly strings, but this applies to any other key which we'll "manage". Taking a step back: if we wanted to track hits in a hashmap and, most likely, wanted the lifetime of the keys to be tied to the hashmap, we'd do something like: fn register(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); try map.put(owned, 0); } Creating your "owned" copy of `name` frees the caller from having to maintain `name` beyond the call to `register`. Now, if this key is removed, or the entire map cleaned up, we need to free the keys. That's why I like the name "owned"; it means the hash map "owns" the key (i.e. is responsible for freeing it): var it = map.keyIterator(); while (it.next()) |key_ptr| { allocator.free(key_ptr.*); } map.deinit(); The interaction between key ownership and `getOrPut` is worth thinking about. If we try to merge this ownership idea with our incrementing counter code, we might try: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); const gop = try map.getOrPut(owned); if (gop.found_existing) { gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } } But this code has a potential memory leak; can you spot it? (see Appendix A for a complete runnable example) When `gop.found_existing == true`, `owned` is never used and never freed. One bad option would be to free `owned` when the entry already exists: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); const gop = try map.getOrPut(owned); if (gop.found_existing) { // This line was added. But this is a bad solution allocator.free(owned); gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } } It works, but we needlessly `dupe` `name` if the entry already exists. Rather than prematurely duping the key in case the entry doesn't exist, we want to delay our `dupe` until we know it's needed. Here's a better option: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { // we use `name` for the lookup. const gop = try map.getOrPut(name); if (gop.found_existing) { gop.value_ptr.* += 1; } else { // this line was added gop.key_ptr.* = try allocator.dupe(u8, name); gop.value_ptr.* = 1; } } It might seem reckless to pass `name` into `getOrPut`. We need the key to remain valid as long as the map entry exists. Aren't we undermining that requirement? Let's walk through the code. When `hit` is called for a new `name`, `gop.found_existing` will be false. `getOrPut` will insert `name` into our `keys` array. This is bad because we have no guarantee that `name` will be valid for as long as we need it to be. But the problem is immediately remedied when we overwrite `key_ptr.*`. On subsequent calls for an existing `name`, when `gop.found_existing == true`, the `name` is only used as a lookup. It's no different than doing a `getPtr`; `name` only has to be valid for the call to `getOrPut` because `getOrPut` doesn't keep a reference to it when an existing entry is found. ### Conclusion This post was a long way to say: don't be afraid to write to `key_ptr.*`. Of course, you can screw up your map this way. Consider this example: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { // we use `name` for the lookup. const gop = try map.getOrPut(name); if (gop.found_existing) { gop.value_ptr.* += 1; } else { // what's this?
gop.key_ptr.* = "HELLO"; gop.value_ptr.* = 1; } } Because the key is used to organize the map - to find where items go and where they are - jamming random keys where they don't belong is going to cause issues. But it can also be used correctly and safely, as long as you understand the details. ### Appendix A - Memory Leak This code _should_ report a memory leak. const std = @import("std"); const Allocator = std.mem.Allocator; pub fn main() !void { var gpa = std.heap.GeneralPurposeAllocator(.{}){}; const allocator = gpa.allocator(); defer _ = gpa.detectLeaks(); // I'm using the Unmanaged variant because the Managed ones are likely to // be removed (which I think is a mistake). Using Unmanaged makes this // snippet more future-proof. I explain unmanaged here: // https://www.openmymind.net/Zigs-HashMap-Part-1/#Unmanaged var map: std.StringHashMapUnmanaged(u32) = .{}; try hit(allocator, &map, "teg"); try hit(allocator, &map, "teg"); var it = map.keyIterator(); while (it.next()) |key_ptr| { allocator.free(key_ptr.*); } map.deinit(allocator); } fn hit(allocator: Allocator, map: *std.StringHashMapUnmanaged(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); const gop = try map.getOrPut(allocator, owned); if (gop.found_existing) { gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } }
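For completeness, here's the leak-free version of the appendix's `hit`, applying the delayed-`dupe` fix from above (with the Unmanaged variant, `getOrPut` also takes the allocator). Drop it into the program above and the leak detector should stay quiet:

fn hit(allocator: Allocator, map: *std.StringHashMapUnmanaged(u32), name: []const u8) !void {
    // `name` is only used for the lookup; we don't dupe unless we keep it.
    const gop = try map.getOrPut(allocator, name);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        gop.key_ptr.* = try allocator.dupe(u8, name);
        gop.value_ptr.* = 1;
    }
}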
www.openmymind.net
September 19, 2025 at 12:34 AM
Comparing Strings as Integers with @bitCast
In the last blog posts, we looked at different ways to compare strings in Zig. A few posts back, we introduced Zig's `@bitCast`. As a quick recap, `@bitCast` lets us force a specific type onto a value. For example, the following prints 1067282596: const std = @import("std"); pub fn main() !void { const f: f32 = 1.23; const n: u32 = @bitCast(f); std.debug.print("{d}\n", .{n}); } What's happening here is that Zig represents the 32-bit float value of `1.23` as: `[4]u8{164, 112, 157, 63}`. This is also how Zig represents the 32-bit unsigned integer value of `1067282596`. Data is just bytes; it's the type system - the compiler's knowledge of what data is what type - that controls what and how that data is manipulated. It might seem like there's something special about bitcasting from a float to an integer; they're both numbers after all. But you can `@bitCast` from any two equivalently sized types. Can you guess what this prints?: const std = @import("std"); pub fn main() !void { const data = [_]u8{3, 0, 0, 0}; const x: i32 = @bitCast(data); std.debug.print("{d}\n", .{x}); } The answer is `3`. Think about the above snippet a bit more. We're taking an array of bytes and telling the compiler to treat it like an integer. If we made `data` equal to `[_]u8{'b', 'l', 'u', 'e'}`, it would still work (and print `1702194274`). We're slowly heading towards being able to compare strings as-if they were integers. If you're wondering why 3 is encoded as `4]u8{3, 0, 0, 0}` and not `[4]u8{0, 0, 0, 3}`, I talked about binary encoding in my [Learning TCP series. From the last post, we could use multiple `std.mem.eql` or, more simply, `std.meta.stringToEnum` to complete the following method: fn parseMethod(value: []const u8) ?Method { // ... } const Method = enum { get, put, post, head, }; We can also use `@bitCast`. Let's take it step-by-step. The first thing we'll need to do is switch on `value.len`. This is necessary because the three-byte "GET" will need to be `@bitCast` to a `u24`, whereas the four-byte "POST" needs to be `@bitCast` to a `u32`: fn parseMethod(value: []const u8) ?Method { switch (value.len) { 3 => switch (@as(u24, @bitCast(value[0..3]))) { // TODO else => {}, }, 4 => switch (@as(u32, @bitCast(value[0..4]))) { // TODO else => {}, }, else => {}, } return null; } If you try to run this code, you'll get a compilation error: _cannot @bitCast from '*const [3]u8'_. `@bitCast` works on actual bits, but when we slice our `[]const u8` with a compile-time known range (`[0..3]`), we get a pointer to an array. We can't `@bitCast` a pointer, we can only `@bitCast` actual bits of data. For this to work, we need to derefence the pointer, i.e. use: `value[0..3].*`. This will turn our `*const [3]u8` into a `const [3]u8`. fn parseMethod(value: []const u8) ?Method { switch (value.len) { // changed: we now derefernce the value (.*) 3 => switch (@as(u24, @bitCast(value[0..3].*))) { // TODO else => {}, }, // changed: we now dereference the value (.*) 4 => switch (@as(u32, @bitCast(value[0..4].*))) { // TODO else => {}, }, else => {}, } return null; } Also, you might have noticed the `@as(u24, ...)` and `@as(u32, ...)`. `@bitCast`, like most of Zig's builtin functions, infers its return type. When we're assiging the result of a `@bitCast` to a variable of a known type, i.e: `const x: i32 = @bitCast(data);`, the return type of `i32` is inferred. In the above `switch`, we aren't assigning the result to a varible. We have to use `@as(u24, ...)` in order for `@bitCast` to kknow what it should be casting to (i.e. 
what its return type should be). The last thing we need to do is fill our switch blocks. Hopefully it's obvious that we can't just do: 3 => switch (@as(u24, @bitCast(value[0..3].*))) { "GET" => return .get, "PUT" => return .put, else => {}, }, ... But you might be thinking that, while ugly, something like this might work: 3 => switch (@as(u24, @bitCast(value[0..3].*))) { @as(u24, @bitCast("GET".*)) => return .get, @as(u24, @bitCast("PUT".*)) => return .put, else => {}, }, ... Because `"GET"` and `"PUT"` are string literals, they're null terminated and of type `*const [3:0]u8`. When we dereference them, we get a `const [3:0]u8`. It's close, but it means that the value is 4 bytes (`[4]u8{'G', 'E', 'T', 0}`) and thus cannot be `@bitCast` into a `u24`. This is ugly, but it works: fn parseMethod(value: []const u8) ?Method { switch (value.len) { 3 => switch (@as(u24, @bitCast(value[0..3].*))) { @as(u24, @bitCast(@as([]const u8, "GET")[0..3].*)) => return .get, @as(u24, @bitCast(@as([]const u8, "PUT")[0..3].*)) => return .put, else => {}, }, 4 => switch (@as(u32, @bitCast(value[0..4].*))) { @as(u32, @bitCast(@as([]const u8, "HEAD")[0..4].*)) => return .head, @as(u32, @bitCast(@as([]const u8, "POST")[0..4].*)) => return .post, else => {}, }, else => {}, } return null; } That's a mouthful, so we can add a small function to help: fn parseMethod(value: []const u8) ?Method { switch (value.len) { 3 => switch (@as(u24, @bitCast(value[0..3].*))) { asUint(u24, "GET") => return .get, asUint(u24, "PUT") => return .put, else => {}, }, 4 => switch (@as(u32, @bitCast(value[0..4].*))) { asUint(u32, "HEAD") => return .head, asUint(u32, "POST") => return .post, else => {}, }, else => {}, } return null; } pub fn asUint(comptime T: type, comptime string: []const u8) T { return @bitCast(string[0..string.len].*); } Like the verbose version, the trick is to cast our null-terminated string literal into a string slice, `[]const u8`. By passing it through the `asUint` function, we get this without needing to add the explicit `@as([]const u8)`. There is a more advanced version of `asUint` which doesn't take the uint type parameter (`T`). If you think about it, the uint type can be inferred from the string's length: pub fn asUint(comptime string: []const u8) @Type(.{ .int = .{ // bits, not bytes, hence * 8 .bits = string.len * 8, .signedness = .unsigned, }, }) { return @bitCast(string[0..string.len].*); } Which allows us to call it with a single parameter: `asUint("GET")`. This might be your first time seeing such a return type. The `@Type` builtin is the opposite of `@typeInfo`. The latter takes a type and returns information on it in the shape of a `std.builtin.Type` union, whereas `@Type` takes a `std.builtin.Type` and returns an actual usable type. One of these days I'll find the courage to blog about `std.builtin.Type`! As a final note, some people dislike the look of this sort of return type and would rather encapsulate the logic in its own function. This is the same: pub fn asUint(comptime string: []const u8) AsUintReturn(string) { return @bitCast(string[0..string.len].*); } // Remember that, in Zig, by convention, a function should be // PascalCase if it returns a type (because types are PascalCase). fn AsUintReturn(comptime string: []const u8) type { return @Type(.{ .int = .{ // bits, not bytes, hence * 8 .bits = string.len * 8, .signedness = .unsigned, }, }); } ### Conclusion Of the three approaches, this is the least readable and least approachable. Is it worth it?
It depends on your input and the values you're comparing against. In my benchmarks, using `@bitCast` performs roughly the same as `std.meta.stringToEnum`. But there are some cases where `@bitCast` can outperform `std.meta.stringToEnum` by as much as 50%. Perhaps that's the real value of this approach: the performance is less dependent on the input or the values being matched against.
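For reference, here's everything assembled into a single runnable file. This is just a sketch: it uses the single-parameter `asUint` from above, and the test block is mine, not part of the original post.

const std = @import("std");

const Method = enum { get, put, post, head };

fn parseMethod(value: []const u8) ?Method {
    switch (value.len) {
        3 => switch (@as(u24, @bitCast(value[0..3].*))) {
            asUint("GET") => return .get,
            asUint("PUT") => return .put,
            else => {},
        },
        4 => switch (@as(u32, @bitCast(value[0..4].*))) {
            asUint("HEAD") => return .head,
            asUint("POST") => return .post,
            else => {},
        },
        else => {},
    }
    return null;
}

pub fn asUint(comptime string: []const u8) @Type(.{ .int = .{
    // bits, not bytes, hence * 8
    .bits = string.len * 8,
    .signedness = .unsigned,
} }) {
    return @bitCast(string[0..string.len].*);
}

test parseMethod {
    try std.testing.expectEqual(.get, parseMethod("GET"));
    try std.testing.expectEqual(.post, parseMethod("POST"));
    try std.testing.expectEqual(null, parseMethod("DELETE"));
}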
www.openmymind.net
September 19, 2025 at 12:34 AM
Zig's dot star syntax (value.*)
Maybe I'm the only one, but it always takes my little brain a split second to understand what's happening whenever I see, or have to write, something like `value.* = .{...}`. If we take a step back, a variable is just a convenient name for an address on the stack. When this function executes: fn isOver9000(power: i64) bool { return power > 9000; } Say, with a `power` of 593, we could visualize its stack as: power -> ------------- | 593 | ------------- If we changed our function to take a pointer to an integer: // i64 changed to *i64 fn isOver9000(power: *i64) bool { return power > 9000; } Our `power` argument would still be a label for a stack address, but instead of directly containing a number, the stack value would itself be an address. That's the _indirection_ of pointers: power -> ------------- | 1182145c0 |------------------------ ------------- | | ............. empty space | ............. or other data | | ------------- | | 593 | <---------------------- ------------- But this code doesn't work: it's trying to compare a `comptime_int` (`9000`) with an `*i64`. We need to make another change to the function: // i64 changed to *i64 fn isOver9000(power: *i64) bool { // power changed to power.* return power.* > 9000; } `power.*` is how we dereference a pointer. Dereferencing means to get the value pointed to by a pointer. From our above visualization, you could say that the `.*` follows the arrow to get the value, `593`. This same syntax works for writing as well. The following is valid: fn isOver9000(power: *i64) bool { power.* = 9001; return true; } Like before, the dereferencing operator (`.*`), "follows" the pointer, but now that it's on the receiving end of an assignment, we write the value into the pointed-at memory. This is all true for more complex types. Let's say we have a `User` struct with an `id` and a `name`: const User = struct { id: i32, name: []const u8, }; var user = User{ .id = 900, .name = "Teg" }; The `user` variable is a label for the location of the [start of] the user: user -> ------------- | 900 | ------------- | 3 | ------------- | 3c9414e99 | ----------------------- ------------- | | ............. empty space | ............. or other data | | ------------- | | T | <---------------------- ------------- | e | ------------- | g | ------------- A slice in Zig, like our `[]const u8`, is a length (`3`) and a pointer to the values. Now, if we were to take the address of `user`, via `&user`, we introduce a level of indirection. For example, imagine this code: const std = @import("std"); const User = struct { id: i32, name: []const u8, }; pub fn main() !void { var user = User{ .id = 900, .name = "Teg" }; updateUser(&user); std.debug.print("{d}\n", .{user.id}); } fn updateUser(user: *User) void { user.id += 100000; } The `user` parameter of our `updateUser` function is pointing to the `user` on `main`'s stack: updateUser user -> ------------- | 83abcc30 |------------------------ ------------- | | ............. empty space | ............. or other data | | main | user -> ------------- | | 900 | <---------------------- ------------- | 3 | ------------- | 3c9414e99 | ----------------------- ------------- | | ............. empty space | ............. or other data | | ------------- | | T | <---------------------- ------------- | e | ------------- | g | ------------- Because we're referencing `main`'s `user` (rather than a copy), any changes we make will be reflected in `main`. But we aren't limited to operating on fields of `user`; we can operate on its entire memory.
Of course, we can create a copy of the id field (assignments are always copies; it's just a matter of knowing _what_ we're copying): fn updateUser(user: *User) void { const id = user.id; // .... } And now the stack for our function looks like: user -> ------------- | 83abcc30 | id -> ------------- | 900 | ------------- But we can also copy the entire user: fn updateUser(user: *User) void { const copy = user.*; // .... } Which gives us something like: updateUser user -> ------------- | 83abcc30 |--------------------- copy -> ------------- | | 900 | | ------------- | | 3 | | ------------- | | 3c9414e99 | --------------------|-- ------------- | | | | ............. empty space | | ............. or other data | | | | main | | user -> ------------- | | | 900 | <------------------- | ------------- | | 3 | | ------------- | | 3c9414e99 | -----------------------| ------------- | | ............. empty space | ............. or other data | | ------------- | | T | <---------------------- ------------- | e | ------------- | g | ------------- Notice that it didn't create a copy of the value 'Teg'. You could call this copying "shallow": it copied the `900`, the `3` (name length) and the `3c9414e99` (address of the name pointer). Just like our simpler example above, we can also assign using the dereferencing operator: fn updateUser(user: *User) void { // using type inference // could be more explicit and do // user.* = User{....} user.* = .{ .id = 5, .name = "Paul", }; } This doesn't copy anything; it writes into the address that we were given, the address of `main`'s `user`: updateUser user -> ------------- | 83abcc30 |------------------------ ------------- | | ............. empty space | ............. or other data | | main | | user -> ------------- | | 5 | <---------------------- ------------- | 4 | ------------- | 9bf4a990 | ----------------------- ------------- | | ............. empty space | ............. or other data | | ------------- | | P | <---------------------- ------------- | a | ------------- | u | ------------- | l | ------------- If you're still not fully comfortable with this, and if you haven't done so already, you might be interested in the pointers and stack memory parts of my learning zig series.
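Here's a tiny runnable recap of the whole post; the values are arbitrary, and the comments spell out which rule each line demonstrates:

const std = @import("std");

const User = struct {
    id: i32,
    name: []const u8,
};

pub fn main() !void {
    var user = User{ .id = 900, .name = "Teg" };
    const ptr = &user;

    // dereference to read: a shallow copy of id, plus the
    // name's length and pointer (not the bytes 'Teg')
    const copy = ptr.*;

    // dereference to write: overwrites user in place
    ptr.* = .{ .id = 5, .name = "Paul" };

    std.debug.print("{d} {s}\n", .{ copy.id, copy.name }); // 900 Teg
    std.debug.print("{d} {s}\n", .{ user.id, user.name }); // 5 Paul
}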
www.openmymind.net
September 19, 2025 at 12:34 AM
Zig's new LinkedList API (it's time to learn @fieldParentPtr)
In a recent, post-Zig 0.14 commit, Zig's `SinglyLinkedList` and `DoublyLinkedList` saw significant changes. The previous version was generic and, with all the methods removed, looked like: pub fn SinglyLinkedList(comptime T: type) type { return struct { first: ?*Node = null, pub const Node = struct { next: ?*Node = null, data: T, }; }; } The new version isn't generic. Rather, you embed the linked list node with your data. This is known as an intrusive linked list and tends to perform better and require fewer allocations. Except in trivial examples, the data that we store in a linked list is typically stored on the heap. Because an intrusive linked list has the linked list node embedded in the data, it doesn't need its own allocation. Before we jump into an example, this is what the new structure looks like, again, with all methods removed: pub const SinglyLinkedList = struct { first: ?*Node = null, pub const Node = struct { next: ?*Node = null, }; }; Much simpler, and notice that this has no link or reference to any of our data. Here's a working example that shows how you'd use it: const std = @import("std"); const SinglyLinkedList = std.SinglyLinkedList; pub fn main() !void { // GeneralPurposeAllocator is being renamed // to DebugAllocator. Let's get used to that name var gpa: std.heap.DebugAllocator(.{}) = .init; const allocator = gpa.allocator(); var list: SinglyLinkedList = .{}; const user1 = try allocator.create(User); defer allocator.destroy(user1); user1.* = .{ .id = 1, .power = 9000, .node = .{}, }; list.prepend(&user1.node); const user2 = try allocator.create(User); defer allocator.destroy(user2); user2.* = .{ .id = 2, .power = 9001, .node = .{}, }; list.prepend(&user2.node); var node = list.first; while (node) |n| { std.debug.print("{any}\n", .{n}); node = n.next; } } const User = struct { id: i64, power: u32, node: SinglyLinkedList.Node, }; To run this code, you'll need a nightly release from within the last week. What do you think the output will be? You should see something like: SinglyLinkedList.Node{ .next = SinglyLinkedList.Node{ .next = null } } SinglyLinkedList.Node{ .next = null } We're only getting the nodes, and, as we can see here and from the above skeleton structure of the new `SinglyLinkedList`, there's nothing about our users. Users have nodes, but there's seemingly nothing that links a node back to its containing user. Or is there? In the past, we've described how the compiler uses the type information to figure out how to access fields. For example, when we execute `user1.power`, the compiler knows that: 1. `id` is +0 bytes from the start of the structure, 2. `power` is +8 bytes from the start of the structure (because id is an i64), and 3. `power` is a u32 With this information, the compiler knows how to access `power` from `user1` (i.e. jump forward 8 bytes, read 4 bytes and treat it as a u32). But if you think about it, that logic is simple to reverse. If we know the address of `power`, then the address of `user` has to be `address_of_power - 8`.
We can prove this: const std = @import("std"); pub fn main() !void { var user = User{ .id = 1, .power = 9000, }; std.debug.print("address of user: {*}\n", .{&user}); const address_of_power = &user.power; std.debug.print("address of power: {*}\n", .{address_of_power}); const power_offset = 8; const also_user: *User = @ptrFromInt(@intFromPtr(address_of_power) - power_offset); std.debug.print("address of also_user: {*}\n", .{also_user}); std.debug.print("also_user: {}\n", .{also_user}); } const User = struct { id: i64, power: u32, }; The magic happens here: const power_offset = 8; const also_user: *User = @ptrFromInt(@intFromPtr(address_of_power) - power_offset); We're turning the address of our user's power field, `&user.power`, into an integer, subtracting 8 (8 bytes, 64 bits), and telling the compiler that it should treat that memory as a `*User`. This code will _probably_ work for you, but it isn't safe. Specifically, unless we're using a packed or extern struct, Zig makes no guarantees about the layout of a structure. It could put `power` BEFORE `id`, in which case our `power_offset` should be 0. It could add padding after every field. It can do anything it wants. To make this code safer, we use the `@offsetOf` builtin to get the actual byte-offset of a field with respect to its struct: const power_offset = @offsetOf(User, "power"); Back to our linked list, given that we have the address of a `node` and we know that it is part of the `User` structure, we _are_ able to get the `User` from a node. Rather than use the above code though, we'll use the _slightly_ friendlier `@fieldParentPtr` builtin. Our `while` loop changes to: while (node) |n| { const user: *User = @fieldParentPtr("node", n); std.debug.print("{any}\n", .{user}); node = n.next; } We give `@fieldParentPtr` the name of the field and a pointer to that field, as well as a return type (which is inferred above by the assignment to a `*User` variable), and it gives us back the instance that contains that field. Performance aside, I have mixed feelings about the new API. My initial reaction is that I dislike exposing, what I consider, a complicated builtin like `@fieldParentPtr` for something as trivial as using a linked list. However, while `@fieldParentPtr` seems esoteric, it's quite useful and developers should be familiar with it because it can help solve problems which are otherwise difficult to solve.
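If you want to convince yourself of the round-trip without a nightly build, here's a dependency-free sketch using our own node type; the struct and field names are just placeholders:

const std = @import("std");

const Node = struct {
    next: ?*Node = null,
};

const User = struct {
    id: i64,
    power: u32,
    node: Node = .{},
};

pub fn main() !void {
    var user = User{ .id = 1, .power = 9000 };

    // all we have is a pointer to the embedded node...
    const node_ptr = &user.node;

    // ...but @fieldParentPtr recovers the containing User
    const recovered: *User = @fieldParentPtr("node", node_ptr);
    std.debug.assert(recovered == &user);
    std.debug.print("id: {d}, power: {d}\n", .{ recovered.id, recovered.power });
}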
www.openmymind.net
September 19, 2025 at 12:34 AM
ArenaAllocator.free and Nested Arenas
What happens when you `free` with an ArenaAllocator? You might be tempted to look at the documentation for std.mem.Allocator.free which says "Free an array allocated with alloc". But this is the one thing we're sure it _won't_ do. In its current implementation, calling `free` usually does nothing: the freed memory isn't made available for subsequent allocations by the arena, and it certainly isn't released back to the operating system. However, under specific conditions `free` will make the memory re-usable by the arena. The only way to really "free" the memory is to call `deinit`. The only case when we're guaranteed that the memory will be reusable by the arena is when it was the last allocation made: const str1 = try arena.dupe(u8, "Over 9000!!!"); arena.free(str1); Above, whatever memory was allocated to duplicate our string will be available for subsequent allocations made with `arena`. In the following case, the two calls to `arena.free` do nothing: const str1 = try arena.dupe(u8, "ab"); const str2 = try arena.dupe(u8, "12"); arena.free(str1); arena.free(str2); In order to "fix" this code, we'd need to reverse the order of the two frees: const str1 = try arena.dupe(u8, "ab"); const str2 = try arena.dupe(u8, "12"); arena.free(str2); //swapped this line with the next arena.free(str1); Now, when we call `arena.free(str2)`, the memory allocated for `str2` will be available to subsequent allocations. But what happens when we call `arena.free(str1)`? The answer, again, is: _it depends_. It has to do with the internal state of the arena. Simplistically, an `ArenaAllocator` keeps a linked list of memory buffers. Imagine something like: buffer_list.head -> ------------ | next | -> null | ---- | | | | | | | | | | | ------------ Our linked list has a single node along with 5 bytes of available space. After we allocate `str1`, it looks like: buffer_list.head -> ------------ | next | -> null | ---- | str1 -> | a | | b | | | | | | | ------------ Then, when we allocate `str2`, it looks like: buffer_list.head -> ------------ | next | -> null | ---- | str1 -> | a | | b | str2 -> | 1 | | 2 | | | ------------ When we free `str2`, it goes back to how it was before: buffer_list.head -> ------------ | next | -> null | ---- | str1 -> | a | | b | | | | | | | ------------ Which means that when we `arena.free(str1)`, it **will** make that memory available again. However, if instead of allocating two strings, we allocate three: const str1 = try arena.dupe(u8, "ab"); const str2 = try arena.dupe(u8, "12"); const str3 = try arena.dupe(u8, "()"); arena.free(str3); arena.free(str2); arena.free(str1); Our first buffer doesn't have enough space for the new string, so a new node is prepended to our linked list: buffer_list.head -> ------------ ------------ | next | -> | next | -> null | ---- | | ---- | str3 -> | ( | | a | <- str1 | ) | | b | | | | 1 | <- str2 | | | 2 | | | | | ------------ ------------ When we call `arena.free(str3)`, the memory for that allocation will be made available, but subsequent frees, even if they're in the correct order (i.e. freeing `str2` then `str1`) will be noops. The ArenaAllocator only ever acts on the head of our linked list; it can't go back to earlier nodes, even when the head is empty. In short, when we `free` the last allocation, that memory will _always_ be made available. But subsequent `frees` only behave this way if (a) they're also in order and (b) the allocations happen to fall within the same internal memory node. ### Nested Arenas Zig's allocators are said to be composable.
When we create an `ArenaAllocator`, we pass a single parameter: an allocator. That parent allocator (1) can be any other type of allocator. You can, for example, create an `ArenaAllocator` on top of a `FixedBufferAllocator`. You can also create an `ArenaAllocator` on top of another `ArenaAllocator`. (1) Zig calls this the "child allocator", but that doesn't make any sense to me. This kind of thing often happens within libraries, where an API takes an `std.mem.Allocator` and the library creates an `ArenaAllocator`. And what happens when the provided allocator was already an arena? Libraries aside, I mean something like: var parent_arena = ArenaAllocator.init(gpa_allocator); const parent_allocator = parent_arena.allocator(); var inner_arena = ArenaAllocator.init(parent_allocator); const inner_allocator = inner_arena.allocator(); _ = try inner_allocator.dupe(u8, "Over "); _ = try inner_allocator.dupe(u8, "9000!"); inner_arena.deinit(); It does work, but at best, when `deinit` is called, the memory will be made available to be re-used by `parent_arena`. Except in simple cases, allocations made by `inner_arena` are likely to span multiple buffers of `parent_arena`, and of course you can still make allocations directly in `parent_arena` which can generate its own new buffers or simply make the ordering requirement impossible to fulfill. For example, if we make an allocation in `parent_arena` before `inner_arena.deinit();` is called: _ = try parent_allocator.dupe(u8, "!!!"); inner_arena.deinit(); Then the `deinit` does nothing. So while nesting ArenaAllocators works, I don't think there's any advantage over using a single Arena. And, I think in many cases where you have an "inner_arena", like in a library, it's better if the caller provides a non-Arena parent allocator so that all the memory is really freed when the library is done with it. Of course, there's a transparency issue here. Unless the library documents exactly how it's using your provided allocator, or unless you explore the code - and assuming the implementation doesn't change - it's hard to know what you should use.
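To make the "last allocation" behavior concrete, here's a small check. Whether the address actually gets reused is an implementation detail of today's ArenaAllocator, so treat this as a sketch, not a guarantee:

const std = @import("std");

pub fn main() !void {
    var arena = std.heap.ArenaAllocator.init(std.heap.page_allocator);
    defer arena.deinit();
    const allocator = arena.allocator();

    const str1 = try allocator.dupe(u8, "Over 9000!!!");
    allocator.free(str1); // the last allocation: its memory becomes reusable

    // with the current implementation, str2 typically lands exactly
    // where str1 was, showing the space really was reclaimed
    const str2 = try allocator.dupe(u8, "Over 9000!!!");
    std.debug.print("same address: {}\n", .{str1.ptr == str2.ptr});
}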
www.openmymind.net
September 19, 2025 at 12:34 AM
I'm too dumb for Zig's new IO interface
You might have heard that Zig 0.15 introduces a new IO interface, with the focus for this release being the new std.Io.Reader and std.Io.Writer types. The old "interfaces" had problems. Like this performance issue that I opened. And it relied on a mix of types, which always confused me, and a lot of `anytype` - which is generally great, but a poor foundation to build an interface on. I've been slowly upgrading my libraries, and I ran into changes to the `tls.Client` client used by my smtp library. For the life of me, I just don't understand how it works. Zig has never been known for its documentation, but if we look at the documentation for `tls.Client.init`, we'll find: pub fn init(input: *std.Io.Reader, output: *std.Io.Writer, options: Options) InitError!Client Initiates a TLS handshake and establishes a TLSv1.2 or TLSv1.3 session. So it takes one of these new Readers and a new Writer, along with some options (sneak peek: they aren't all optional). It doesn't look like you can just give it a `net.Stream`, but `net.Stream` does expose `reader()` and `writer()` methods, so that's probably a good place to start: const stream = try std.net.tcpConnectToHost(allocator, "www.openmymind.net", 443); defer stream.close(); var writer = stream.writer(&.{}); var reader = stream.reader(&.{}); var tls_client = try std.crypto.tls.Client.init( reader.interface(), &writer.interface, .{}, // options TODO ); Note that `stream.writer()` returns a `Stream.Writer` and `stream.reader()` returns a `Stream.Reader` - those aren't the types our `tls.Client` expects. To convert the `Stream.Reader` to an `*std.Io.Reader`, we need to call its `interface()` method. To get a `*std.Io.Writer` from a `Stream.Writer`, we need the address of its `interface` field. This doesn't seem particularly consistent. Don't forget that the `writer` and `reader` need a stable address. Because I'm trying to get the simplest example working, this isn't an issue - everything will live on the stack of `main`. In a real-world example, I think it means that I'll always have to wrap the `tls.Client` in my own heap-allocated type, giving the writer and reader a cozy, stable home. Speaking of allocations, you might have noticed that `stream.writer` and `stream.reader` take a parameter. It's the buffer they should use. Buffering is a first-class citizen of the new Io interface - who needs composition? The documentation **does** tell me these need to be at least `std.crypto.tls.max_ciphertext_record_len` large, so we need to fix things a bit: var write_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var writer = stream.writer(&write_buf); var read_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var reader = stream.reader(&read_buf); Here's where the code stands: const std = @import("std"); pub fn main() !void { var gpa: std.heap.DebugAllocator(.{}) = .init; const allocator = gpa.allocator(); const stream = try std.net.tcpConnectToHost(allocator, "www.openmymind.net", 443); defer stream.close(); var write_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var writer = stream.writer(&write_buf); var read_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var reader = stream.reader(&read_buf); var tls_client = try std.crypto.tls.Client.init( reader.interface(), &writer.interface, .{ }, ); defer tls_client.end() catch {}; } But if you try to run it, you'll get a compilation error. Turns out we have to provide 4 options: the ca_bundle, a host, a `write_buffer` and a `read_buffer`.
Normally I'd expect the options parameter to be for optional parameters; I don't understand why some parameters (input and output) are passed one way while `write_buffer` and `read_buffer` are passed another. Let's give it what it wants AND send some data: // existing setup... var bundle = std.crypto.Certificate.Bundle{}; try bundle.rescan(allocator); defer bundle.deinit(allocator); var tls_client = try std.crypto.tls.Client.init( reader.interface(), &writer.interface, .{ .ca = .{.bundle = bundle}, .host = .{ .explicit = "www.openmymind.net" } , .read_buffer = &.{}, .write_buffer = &.{}, }, ); defer tls_client.end() catch {}; try tls_client.writer.writeAll("GET / HTTP/1.1\r\n\r\n"); Now, if I try to run it, the program just hangs. I don't know what `write_buffer` is, but I know Zig now loves buffers, so let's try to give it something: // existing setup... // I don't know what size this should/has to be?? var write_buf2: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var tls_client = try std.crypto.tls.Client.init( reader.interface(), &writer.interface, .{ .ca = .{.bundle = bundle}, .host = .{ .explicit = "www.openmymind.net" } , .read_buffer = &.{}, .write_buffer = &write_buf2, }, ); defer tls_client.end() catch {}; try tls_client.writer.writeAll("GET / HTTP/1.1\r\n\r\n"); Great, now the code doesn't hang; all we need to do is read the response. `tls.Client` exposes a `reader: *std.Io.Reader` field which is "Decrypted stream from the server to the client." That sounds like what we want, but believe it or not `std.Io.Reader` doesn't have a `read` method. It has a `peek`, a `takeByteSigned`, a `readSliceShort` (which seems close, but it blocks until the provided buffer is full), a `peekArray` and a lot more, but nothing like the `read` I'd expect. The closest I can find, which I think does what I want, is to stream it to a writer: var buf: [1024]u8 = undefined; var w: std.Io.Writer = .fixed(&buf); const n = try tls_client.reader.stream(&w, .limited(buf.len)); std.debug.print("read: {d} - {s}\n", .{n, buf[0..n]}); If we try to run the code now, it crashes. We've apparently failed an assertion regarding the length of a buffer. So it seems like we also _have_ to provide a `read_buffer`.
Here's my current version (it doesn't work, but it doesn't crash!): const std = @import("std"); pub fn main() !void { var gpa: std.heap.DebugAllocator(.{}) = .init; const allocator = gpa.allocator(); const stream = try std.net.tcpConnectToHost(allocator, "www.openmymind.net", 443); defer stream.close(); var write_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var writer = stream.writer(&write_buf); var read_buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var reader = stream.reader(&read_buf); var bundle = std.crypto.Certificate.Bundle{}; try bundle.rescan(allocator); defer bundle.deinit(allocator); var write_buf2: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var read_buf2: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var tls_client = try std.crypto.tls.Client.init( reader.interface(), &writer.interface, .{ .ca = .{.bundle = bundle}, .host = .{ .explicit = "www.openmymind.net" } , .read_buffer = &read_buf2, .write_buffer = &write_buf2, }, ); defer tls_client.end() catch {}; try tls_client.writer.writeAll("GET / HTTP/1.1\r\n\r\n"); var buf: [std.crypto.tls.max_ciphertext_record_len]u8 = undefined; var w: std.Io.Writer = .fixed(&buf); const n = try tls_client.reader.stream(&w, .limited(buf.len)); std.debug.print("read: {d} - {s}\n", .{n, buf[0..n]}); } When I looked through Zig's source code, there's only one place using `tls.Client`. It helped get me to where I am. I couldn't find any tests. I'll admit that during this migration, I've missed some basic things. For example, someone had to help me find `std.fmt.printInt` - the renamed version of `std.fmt.formatIntBuf`. Maybe there's a helper like: `tls.Client.init(allocator, stream)` somewhere. And maybe it makes sense that we do `reader.interface()` but `&writer.interface` - I'm reminded of Go's `*http.Request` and `http.ResponseWriter`. And maybe Zig has some consistent rule for what parameters belong in options. And I know nothing about TLS, so maybe it makes complete sense to need 4 buffers. I feel a bit more confident about the weirdness of not having a `read(buf: []u8) !usize` function on `Reader`, but at this point I wouldn't bet on me.
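For what it's worth, the `read` I was looking for can be approximated with a tiny wrapper around `stream`. This is a sketch against the API as I currently understand it (the same `fixed` and `limited` calls used above), not a vetted helper:

const std = @import("std");

// a hypothetical read-like helper: reads at most buf.len bytes from
// the reader, returning however many bytes were actually transferred
fn readOnce(reader: *std.Io.Reader, buf: []u8) !usize {
    var w: std.Io.Writer = .fixed(buf);
    return reader.stream(&w, .limited(buf.len));
}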
www.openmymind.net
September 19, 2025 at 12:33 AM
Zig's new Writer
As you might have heard, Zig's `Io` namespace is being reworked. Eventually, this will mean the re-introduction of async. As a first step though, the Writer and Reader interfaces and some of the related code have been revamped. > This post is written based on a mid-July 2025 development release of Zig. It doesn't apply to Zig 0.14.x (or any previous version) and is likely to be outdated as more of the Io namespace is reworked. Not long ago, I wrote a blog post which tried to explain Zig's Writers. At best, I'd describe the current state as "confusing" with two writer interfaces while often dealing with `anytype`. And while `anytype` is convenient, it lacks developer ergonomics. Furthermore, the current design has significant performance issues for some common cases. ### Drain The new `Writer` interface is `std.Io.Writer`. At a minimum, implementations have to provide a `drain` function. Its signature looks like: fn drain(w: *Writer, data: []const []const u8, splat: usize) Error!usize You might be surprised that this is the method a custom writer needs to implement. Not only does it take an array of strings, but what's that `splat` parameter? Like me, you might have expected a simpler `write` method: fn write(w: *Writer, data: []const u8) Error!usize It turns out that `std.Io.Writer` has buffering built-in. For example, if we want a `Writer` for an `std.fs.File`, we need to provide the buffer: var buffer: [1024]u8 = undefined; var writer = my_file.writer(&buffer); Of course, if we don't want buffering, we can always pass an empty buffer: var writer = my_file.writer(&.{}); This explains why custom writers need to implement a `drain` method, and not something simpler like `write`. The simplest way to implement `drain`, and what a lot of the Zig standard library has been upgraded to while this larger overhaul takes place, is: fn drain(io_w: *std.Io.Writer, data: []const []const u8, splat: usize) !usize { _ = splat; const self: *@This() = @fieldParentPtr("interface", io_w); return self.writeAll(data[0]) catch return error.WriteFailed; } We ignore the `splat` parameter, and just write the first value in `data` (`data.len > 0` is guaranteed to be true). This turns `drain` into what a simpler `write` method would look like. Because we return the length of bytes written, `std.Io.Writer` will know that we potentially didn't write all the data and call `drain` again, if necessary, with the rest of the data. > If you're confused by the call to `@fieldParentPtr`, check out my post on the upcoming linked list changes. The actual implementation of `drain` for the `File` is a non-trivial ~150 lines of code. It has platform-specific code and leverages vectored I/O where possible. There's obviously flexibility to provide a simple implementation or a more optimized one. ### The Interface Much like the current state, when you do `file.writer(&buffer)`, you don't get an `std.Io.Writer`. Instead, you get a `File.Writer`. To get an actual `std.Io.Writer`, you need to access the `interface` field. This is merely a convention, but expect it to be used throughout the standard, and third-party, library. Get ready to see a lot of `&xyz.interface` calls!
This simplification of `File` shows the relationship between the three types: pub const File = struct { pub fn writer(self: *File, buffer: []u8) Writer { return .{ .file = self, .interface = std.Io.Writer{ .buffer = buffer, .vtable = .{.drain = Writer.drain}, } }; } pub const Writer = struct { file: *File, interface: std.Io.Writer, // this has a bunch of other fields fn drain(io_w: *std.Io.Writer, data: []const []const u8, splat: usize) !usize { const self: *Writer = @fieldParentPtr("interface", io_w); // .... } } } The instance of `File.Writer` needs to exist somewhere (e.g. on the stack) since that's where the `std.Io.Writer` interface exists. It's possible that `File` could directly have a `writer_interface: std.Io.Writer` field, but that would limit you to one writer per file and would bloat the `File` structure. We can see from the above that, while we call `Writer` an "interface", it's just a normal struct. It has a few fields beyond `buffer` and `vtable.drain`, but these are the only two with non-default values; we have to provide them. The `Writer` interface implements a lot of typical "writer" behavior, such as `writeAll` and `print` (for formatted writing). It also has a number of methods which only a `Writer` implementation would likely care about. For example, `File.Writer.drain` has to call `consume` so that the writer's internal state can be updated. Having all of these functions listed side-by-side in the documentation confused me at first. Hopefully it's something the documentation generation will one day be able to help disentangle. ### Migrating The new `Writer` has taken over a number of methods. For example, `std.fmt.formatIntBuf` no longer exists. The replacement is the `printInt` method of `Writer`. But this requires a `Writer` instance rather than the simple `[]u8` previously required. It's easy to miss, but the `Writer.fixed([]u8) Writer` function is what you're looking for. You'll use this for any function that was migrated to `Writer` and used to work on a `buffer: []u8`. While migrating, you might run into the following error: _no field or member function named 'adaptToNewApi' in '...'_. You can see why this happens by looking at the updated implementation of `std.fmt.format`: pub fn format(writer: anytype, comptime fmt: []const u8, args: anytype) !void { var adapter = writer.adaptToNewApi(); return adapter.new_interface.print(fmt, args) catch |err| switch (err) { error.WriteFailed => return adapter.err.?, }; } Because this functionality was moved to `std.Io.Writer`, any `writer` passed into `format` has to be able to upgrade itself to the new interface. This is done, again only by convention, by having the "old" writer expose an `adaptToNewApi` method which returns a type that exposes a `new_interface: std.Io.Writer` field. This is pretty easy to implement using the basic `drain` implementation, and you can find a handful of examples in the standard library, but it's of little help if you don't control the legacy writer. ### Conclusion I'm hesitant to provide an opinion on this change. I don't understand language design. However, while I think this is an improvement over the current API, I keep thinking that adding buffering directly to the `Writer` isn't ideal. I believe that most languages deal with buffering via composition. You take a reader/writer and wrap it in a BufferedReader or BufferedWriter. This approach seems both simple to understand and implement while being powerful. It can be applied to things beyond buffering and IO.
Zig seems to struggle with this model. Rather than provide a cohesive and generic approach for such problems, one specific feature (buffering) for one specific API (IO) was baked into the standard library. Maybe I'm too dense to understand or maybe future changes will address this more holistically.
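To close the loop on the migration note above, here's what formatting into a stack buffer looks like now. This is a sketch based on a mid-2025 nightly, so the details (in particular `buffered`, which I believe returns the bytes written so far) may shift:

const std = @import("std");

pub fn main() !void {
    var buf: [32]u8 = undefined;

    // Writer.fixed wraps a plain []u8, replacing the old
    // buffer-taking helpers like std.fmt.formatIntBuf
    var w: std.Io.Writer = .fixed(&buf);
    try w.print("over {d}!", .{9000});

    std.debug.print("{s}\n", .{w.buffered()});
}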
www.openmymind.net
September 19, 2025 at 12:33 AM
Switching on Strings in Zig
Newcomers to Zig will quickly learn that you can't switch on a string (i.e. `[]const u8`). The following code gives us the unambiguous error message _cannot switch on strings_ : switch (color) { "red" => {}, "blue" => {}, "green" => {}, "pink" => {}, else => {}, } I've seen two explanations for why this isn't supported. The first is that there's ambiguity around string identity. Are two strings only considered equal if they point to the same address? Is a null-terminated string the same as its non-null-terminated counterpart? The other reason is that users of `switch` apparently expect certain optimizations which are not possible with strings (although, presumably, these same users would know that such optimizations aren't possible with strings). Instead, in Zig, there are two common methods for comparing strings. ### std.mem.eql The most common way to compare strings is using `std.mem.eql` with `if / else if / else`: if (std.mem.eql(u8, color, "red") == true) { } else if (std.mem.eql(u8, color, "blue") == true) { } else if (std.mem.eql(u8, color, "green") == true) { } else if (std.mem.eql(u8, color, "pink") == true) { } else { } The implementation for `std.mem.eql` depends on what's being compared. Specifically, it has an optimized code path when comparing strings. Although that's what we're interested in, let's look at the non-optimized version: pub fn eql(comptime T: type, a: []const T, b: []const T) bool { if (a.len != b.len) return false; if (a.len == 0 or a.ptr == b.ptr) return true; for (a, b) |a_elem, b_elem| { if (a_elem != b_elem) return false; } return true; } Whether we're dealing with slices of bytes or some other type, if they're of different length, they can't be equal. Once we know that they're the same length, if they point to the same memory, then they must be equal. I'm not a fan of this second check; it might be cheap, but I think it's quite uncommon. Once those initial checks are done, we compare each element (each byte of our string) one at a time. The optimized version, which _is_ used for strings, is much more involved. But it's fundamentally the same as the above with SIMD to compare multiple bytes at once. The nature of string comparison means that real-world performance is dependent on the values being compared. We know that if we have 100 `if / else if` branches then, in the worst case, we'll need to call `std.mem.eql` 100 times. But comparing strings of different lengths or strings which differ early will be significantly faster. For example, consider these three cases: { const str1 = "a" ** 10_000 ++ "1"; const str2 = "a" ** 10_000 ++ "2"; _ = std.mem.eql(u8, str1, str2); } { const str1 = "1" ++ "a" ** 10_000; const str2 = "2" ++ "a" ** 10_000; _ = std.mem.eql(u8, str1, str2); } { const str1 = "a" ** 999_999; const str2 = "a" ** 1_000_000; _ = std.mem.eql(u8, str1, str2); } For me, the first comparison takes ~270ns, whereas the other two take ~20ns - despite the last one involving much larger strings. The second case is faster because the difference is early in the string, allowing the `for` loop to return after only one comparison. The third case is faster because the strings are of a different length: `false` is returned by the initial `len` check. ### std.meta.stringToEnum `std.meta.stringToEnum` takes an enum type and a string value and returns the corresponding enum value or null.
This code prints "you picked: blue": const std = @import("std"); const Color = enum { red, blue, green, pink, }; pub fn main() !void { const color = std.meta.stringToEnum(Color, "blue") orelse { return error.InvalidChoice; }; switch (color) { .red => std.debug.print("you picked: red\n", .{}), .blue => std.debug.print("you picked: blue\n", .{}), .green => std.debug.print("you picked: green\n", .{}), .pink => std.debug.print("you picked: pink\n", .{}), } } If you don't need the enum type (i.e. `Color`) beyond this check, you can leverage Zig's anonymous types. This is equivalent: const std = @import("std"); pub fn main() !void { const color = std.meta.stringToEnum(enum { red, blue, green, pink, }, "blue") orelse return error.InvalidChoice; switch (color) { .red => std.debug.print("you picked: red\n", .{}), .blue => std.debug.print("you picked: blue\n", .{}), .green => std.debug.print("you picked: green\n", .{}), .pink => std.debug.print("you picked: pink\n", .{}), } } It's **not** obvious how this should perform versus the straightforward `if / else if` approach. Yes, we now have a `switch` statement that the compiler can [hopefully] optimize, but `std.meta.stringToEnum` still has to convert our input, `"blue"`, into an enum. The implementation of `std.meta.stringToEnum` depends on the number of possible values, i.e. the number of enum values. Currently, if there are more than 100 values, it'll fall back to using the same `if / else if` that we explored above. Thus, with more than 100 values it does the `if / else if` check PLUS the switch. This should improve in the future. However, with 100 or fewer values, `std.meta.stringToEnum` creates a comptime `std.StaticStringMap` which can then be used to look up the value. `std.StaticStringMap` isn't something we've looked at before. It's a specialized map that buckets keys by their length. Its advantage over Zig's other hash maps is that it can be constructed at compile-time. For our `Color` enum, the internal state of a `StaticStringMap` would look something like: // keys are ordered by length keys: ["red", "blue", "pink", "green"]; // values[N] corresponds to keys[N] values: [.red, .blue, .pink, .green]; // What's this though? indexes: [0, 0, 0, 0, 1, 3]; It might not be obvious how `indexes` is used. Let's write our own `get` implementation, simulating the above `StaticStringMap` state: fn get(str: []const u8) ?Color { // Simulate the state of the StaticStringMap which // stringToEnum built at compile-time. const keys = [_][]const u8{"red", "blue", "pink", "green"}; const values = [_]Color{.red, .blue, .pink, .green}; const indexes = [_]usize{0, 0, 0, 0, 1, 3}; if (str.len >= indexes.len) { // our map has no strings of this length return null; } var index = indexes[str.len]; while (index < keys.len) { const key = keys[index]; if (key.len != str.len) { // we've gone into the next bucket, everything after // this is longer and thus can't be a match return null; } if (std.mem.eql(u8, key, str)) { return values[index]; } index += 1; } return null; } Take note that `keys` are ordered by length. As a naive implementation, we could iterate through the keys until we either find a match or find a key with a longer length. Once we find a key with a longer length, we can stop searching, as all remaining candidates won't match - they'll all be too long. `StaticStringMap` goes a step further and records the index within `keys` where entries of a specific length begin. `indexes[3]` tells us where to start looking for keys with a length of 3 (at index 0).
`indexes[5]` tells us where to start looking for keys with a length of 5 (at index 3). Above, we fall back to using `std.mem.eql` for any key which is the same length as our target string. `StaticStringMap` uses its own "optimized" version: pub fn defaultEql(a: []const u8, b: []const u8) bool { if (a.ptr == b.ptr) return true; for (a, b) |a_elem, b_elem| { if (a_elem != b_elem) return false; } return true; } This is the same as the simple `std.mem.eql` implementation, minus the length check. This is done because the `eql` within our `while` loop is only ever called for values with matching length. On the flip side, `StaticStringMap`'s `eql` doesn't use SIMD, so it would be slower for large strings. `StaticStringMap` is a wrapper around `StaticStringMapWithEql`, which accepts a custom `eql` function, so if you _did_ want to use it for long strings or some other purposes, you have a reasonable amount of flexibility. You even have the option to use `std.static_string_map.eqlAsciiIgnoreCase` for ASCII-aware case-insensitive comparison. ### Conclusion In my own benchmarks, in general, I've seen little difference between the two approaches. It does seem like `std.meta.stringToEnum` is generally as fast or faster. It also results in more concise code and is ideal if the resulting enum is useful beyond the comparison. You usually don't have long enum values, so the lack of SIMD-optimization isn't a concern. However, if you're considering building your own `StaticStringMap` at compile time with long keys, you should benchmark with a custom `eql` function based on `std.mem.eql`. We could manually bucket those `if / else if` branches ourselves, similar to what the `StaticStringMap` does. Something like: switch (color.len) { 3 => { if (std.mem.eql(u8, color, "red") == true) { // ... return; } }, 4 => { if (std.mem.eql(u8, color, "blue") == true) { // ... return; } if (std.mem.eql(u8, color, "pink") == true) { // ... return; } }, 5 => { if (std.mem.eql(u8, color, "green") == true) { // ... return; } }, else => {}, } // not found Ughhh. This highlights the convenience of using `std.meta.stringToEnum` to generate similar code. Also, do remember that `std.mem.eql` quickly discards strings of different lengths, which helps to explain why both approaches generally perform similarly.
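And if you ever want the `StaticStringMap` behavior directly, e.g. when the values aren't enum members, the compile-time construction looks like this; a minimal sketch, with the colors picked to mirror the post:

const std = @import("std");

const Color = enum { red, blue, green, pink };

// built entirely at compile time; keys are bucketed by length internally
const color_map = std.StaticStringMap(Color).initComptime(.{
    .{ "red", .red },
    .{ "blue", .blue },
    .{ "green", .green },
    .{ "pink", .pink },
});

pub fn main() !void {
    const color = color_map.get("blue") orelse return error.InvalidChoice;
    std.debug.print("you picked: {s}\n", .{@tagName(color)});
}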
www.openmymind.net
September 7, 2025 at 6:17 PM
GetOrPut With String Keys
I've previously blogged about how much I like Zig's `getOrPut` hashmap method. As a brief recap, we can visualize Zig's hashmap as two arrays: keys: values: -------- -------- | Paul | | 1234 | @mod(hash("Paul"), 5) == 0 -------- -------- | | | | -------- -------- | | | | -------- -------- | Goku | | 9001 | @mod(hash("Goku"), 5) == 3 -------- -------- | | | | -------- -------- When we call `get("Paul")`, we could think of this simplified implementation: fn get(map: *Self, key: K) ?V { const index = map.getIndexOf(key) orelse return null; return map.values[index]; } And, when we call `getPtr("Paul")`, we'd have this implementation: fn getPtr(map: *Self, key: K) ?*V { const index = map.getIndexOf(key) orelse return null; // notice the added '&' // we're taking the address of the array index return &map.values[index]; } By taking the address of the value directly from the hashmap's array, we avoid copying the entire value. That can have performance implications (though not for the integer value we're using here). It also allows us to directly manipulate that slot of the array: const value = map.getPtr("Paul") orelse return; value.* = 10; This is a powerful feature, but a dangerous one. If the underlying array changes, as can happen when items are added to the map, `value` would become invalid. So, while `getPtr` is useful, it requires mindfulness: try to minimize the scope of such references. Currently, Zig's HashMap doesn't shrink when items are removed, so, for now, removing items doesn't invalidate any pointers into the hashmap. But expect that to change at some point. ### GetOrPut `getOrPut` builds on the above concept. It returns a pointer to the value **and** the key, as well as creating the entry in the hashmap if necessary. For example, given that we already have an entry for "Paul", if we call `map.getOrPut("Paul")`, we'd get back a `value_ptr` that points to a slot in the hashmap's `values` array, as well as a `key_ptr` that points to a slot in the hashmap's `keys` array. If the requested key _doesn't_ exist, we get back the same two pointers, and it's our responsibility to set the value. If I asked you to increment counters inside of a hashmap, without `getOrPut`, you'd end up with two hash lookups: // Go count, exists := counters["hits"] if exists == false { counters["hits"] = 1 } else { counters["hits"] = count + 1 } With `getOrPut`, it's a single hash lookup: const gop = try counters.getOrPut("hits"); if (gop.found_existing) { gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } ### getOrPut With String Keys It seems trivial, but the most important thing to understand about `getOrPut` is that it will set the key for you if the entry has to be created. In our last example, notice that even when `gop.found_existing == false`, we never set `key_ptr` - `getOrPut` automatically sets it to the key we pass in, i.e. `"hits"`. If we were to put a breakpoint after `getOrPut` returns but before we set the value, we'd see that our two arrays look something like: keys: values: -------- -------- | | | | -------- -------- | hits | | ???? | -------- -------- | | | | -------- -------- Where the entry in the `keys` array is set, but the corresponding entry in `values` is left undefined. You'll note that `getOrPut` doesn't take a value. I assume this is because, in some cases, the value might be expensive to derive, so the current API lets us avoid calculating it when `gop.found_existing == true`. This is important for keys that need to be owned by the hashmap.
Most commonly strings, but this applies to any other key which we'll "manage". Taking a step back, if we wanted to track hits in a hashmap, and, most likely, we wanted the lifetime of the keys to be tied to the hashmap, you'd do something like: fn register(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); try map.put(owned, 0); } Creating your "owned" copy of `name` frees the caller from having to maintain `name` beyond the call to `register`. Now, if this key is removed, or the entire map cleaned up, we need to free the keys. That's why I like the name "owned": it means the hash map "owns" the key (i.e. is responsible for freeing it): var it = map.keyIterator(); while (it.next()) |key_ptr| { allocator.free(key_ptr.*); } map.deinit(); The interaction between key ownership and `getOrPut` is worth thinking about. If we try to merge this ownership idea with our incrementing counter code, we might try: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); const gop = try map.getOrPut(owned); if (gop.found_existing) { gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } } But this code has a potential memory leak; can you spot it? (see Appendix A for a complete runnable example) When `gop.found_existing == true`, `owned` is never used and never freed. One bad option would be to free `owned` when the entry already exists: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { const owned = try allocator.dupe(u8, name); const gop = try map.getOrPut(owned); if (gop.found_existing) { // This line was added. But this is a bad solution allocator.free(owned); gop.value_ptr.* += 1; } else { gop.value_ptr.* = 1; } } It works, but we needlessly `dupe` `name` if the entry already exists. Rather than prematurely duping the key in case the entry doesn't exist, we want to delay our `dupe` until we know it's needed. Here's a better option: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { // we use `name` for the lookup. const gop = try map.getOrPut(name); if (gop.found_existing) { gop.value_ptr.* += 1; } else { // this line was added gop.key_ptr.* = try allocator.dupe(u8, name); gop.value_ptr.* = 1; } } It might seem reckless to pass `name` into `getOrPut`. We need the key to remain valid as long as the map entry exists. Aren't we undermining that requirement? Let's walk through the code. When `hit` is called for a new `name`, `gop.found_existing` will be false. `getOrPut` will insert `name` in our `keys` array. This is bad because we have no guarantee that `name` will be valid for as long as we need it to be. But the problem is immediately remedied when we overwrite `key_ptr.*`. On subsequent calls for an existing `name`, when `gop.found_existing == true`, the `name` is only used as a lookup. It's no different than doing a `getPtr`; `name` only has to be valid for the call to `getOrPut` because `getOrPut` doesn't keep a reference to it when an existing entry is found. ### Conclusion This post was a long way to say: don't be afraid to write to `key_ptr.*`. Of course you can screw up your map this way. Consider this example: fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void { // we use `name` for the lookup. const gop = try map.getOrPut(name); if (gop.found_existing) { gop.value_ptr.* += 1; } else { // what's this?
```zig
fn hit(allocator: Allocator, map: *std.StringHashMap(u32), name: []const u8) !void {
    // we use `name` for the lookup
    const gop = try map.getOrPut(name);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        // what's this?
        gop.key_ptr.* = "HELLO";
        gop.value_ptr.* = 1;
    }
}
```

Because the key is used to organize the map - to find where items go and where they are - jamming random keys where they don't belong is going to cause issues. But it can also be used correctly and safely, as long as you understand the details.

### Appendix A - Memory Leak

This code _should_ report a memory leak.

```zig
const std = @import("std");
const Allocator = std.mem.Allocator;

pub fn main() !void {
    var gpa = std.heap.GeneralPurposeAllocator(.{}){};
    const allocator = gpa.allocator();
    defer _ = gpa.detectLeaks();

    // I'm using the Unmanaged variant because the Managed ones are likely to
    // be removed (which I think is a mistake). Using Unmanaged makes this
    // snippet more future-proof. I explain unmanaged here:
    // https://www.openmymind.net/Zigs-HashMap-Part-1/#Unmanaged
    var map: std.StringHashMapUnmanaged(u32) = .{};

    try hit(allocator, &map, "teg");
    try hit(allocator, &map, "teg");

    var it = map.keyIterator();
    while (it.next()) |key_ptr| {
        allocator.free(key_ptr.*);
    }
    map.deinit(allocator);
}

fn hit(allocator: Allocator, map: *std.StringHashMapUnmanaged(u32), name: []const u8) !void {
    const owned = try allocator.dupe(u8, name);
    const gop = try map.getOrPut(allocator, owned);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        gop.value_ptr.* = 1;
    }
}
```
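For completeness, here's a sketch of a leak-free `hit`, applying the same delayed-`dupe` fix from above to the Unmanaged API (a drop-in replacement for the `hit` in the program above; note that the Unmanaged variant takes the allocator):

```zig
fn hit(allocator: Allocator, map: *std.StringHashMapUnmanaged(u32), name: []const u8) !void {
    // `name` is only used for the lookup
    const gop = try map.getOrPut(allocator, name);
    if (gop.found_existing) {
        gop.value_ptr.* += 1;
    } else {
        // dupe only once we know the entry is new, taking ownership of the key
        gop.key_ptr.* = try allocator.dupe(u8, name);
        gop.value_ptr.* = 1;
    }
}
```

With this version, `gpa.detectLeaks()` should report nothing.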
Zig's dot star syntax (value.*)
Maybe I'm the only one, but it always takes my little brain a split second to understand what's happening whenever I see, or have to write, something like `value.* = .{...}`. If we take a step back, a variable is just a convenient name for an address on the stack. When this function executes:

```zig
fn isOver9000(power: i64) bool {
    return power > 9000;
}
```

Say, with a `power` of 593, we could visualize its stack as:

```
power ->  -------------
          |    593    |
          -------------
```

If we changed our function to take a pointer to an integer:

```zig
// i64 changed to *i64
fn isOver9000(power: *i64) bool {
    return power > 9000;
}
```

Our `power` argument would still be a label for a stack address, but instead of directly containing a number, the stack value would itself be an address. That's the _indirection_ of pointers:

```
power ->  -------------
          | 1182145c0 |------------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
          -------------                        |
          |    593    | <-----------------------
          -------------
```

But this code doesn't work: it's trying to compare a `comptime_int` (`9000`) with an `*i64`. We need to make another change to the function:

```zig
// i64 changed to *i64
fn isOver9000(power: *i64) bool {
    // power changed to power.*
    return power.* > 9000;
}
```

`power.*` is how we dereference a pointer. Dereferencing means getting the value pointed to by a pointer. From our above visualization, you could say that the `.*` follows the arrow to get the value, `593`. This same syntax works for writing as well. The following is valid:

```zig
fn isOver9000(power: *i64) bool {
    power.* = 9001;
    return true;
}
```

Like before, the dereferencing operator (`.*`) "follows" the pointer, but now that it's on the receiving end of an assignment, we write the value into the pointed-at memory.

This is all true for more complex types. Let's say we have a `User` struct with an `id` and a `name`:

```zig
const User = struct {
    id: i32,
    name: []const u8,
};

var user = User{ .id = 900, .name = "Teg" };
```

The `user` variable is a label for the location of [the start of] the user:

```
user ->   -------------
          |    900    |
          -------------
          |     3     |
          -------------
          | 3c9414e99 | -----------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
          -------------                        |
          |     T     | <-----------------------
          -------------
          |     e     |
          -------------
          |     g     |
          -------------
```

A slice in Zig, like our `[]const u8`, is a length (`3`) and a pointer to the values. Now, if we take the address of `user`, via `&user`, we introduce a level of indirection. For example, imagine this code:

```zig
const std = @import("std");

const User = struct {
    id: i32,
    name: []const u8,
};

pub fn main() !void {
    var user = User{ .id = 900, .name = "Teg" };
    updateUser(&user);
    std.debug.print("{d}\n", .{user.id});
}

fn updateUser(user: *User) void {
    user.id += 100000;
}
```

The `user` parameter of our `updateUser` function is pointing to the `user` on `main`'s stack:

```
updateUser
user ->   -------------
          |  83abcc30 |------------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
main                                           |
user ->   -------------                        |
          |    900    | <-----------------------
          -------------
          |     3     |
          -------------
          | 3c9414e99 | -----------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
          -------------                        |
          |     T     | <-----------------------
          -------------
          |     e     |
          -------------
          |     g     |
          -------------
```

Because we're referencing `main`'s `user` (rather than a copy), any changes we make will be reflected in `main`. But we aren't limited to operating on fields of `user`; we can operate on its entire memory.
Of course, we can create a copy of the `id` field (assignments are always copies; it's just a matter of knowing _what_ we're copying):

```zig
fn updateUser(user: *User) void {
    const id = user.id;
    // ....
}
```

And now the stack for our function looks like:

```
user ->   -------------
          |  83abcc30 |
          -------------
id ->     -------------
          |    900    |
          -------------
```

But we can also copy the entire user:

```zig
fn updateUser(user: *User) void {
    const copy = user.*;
    // ....
}
```

Which gives us something like:

```
updateUser
user ->   -------------
          |  83abcc30 |---------------------
          -------------                     |
copy ->   -------------                     |
          |    900    |                     |
          -------------                     |
          |     3     |                     |
          -------------                     |
          | 3c9414e99 | --------------------|--
          -------------                     |  |
                                            |  |
          .............   empty space       |  |
          .............   or other data     |  |
                                            |  |
main                                        |  |
user ->   -------------                     |  |
          |    900    | <--------------------  |
          -------------                        |
          |     3     |                        |
          -------------                        |
          | 3c9414e99 | ------------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
          -------------                        |
          |     T     | <-----------------------
          -------------
          |     e     |
          -------------
          |     g     |
          -------------
```

Notice that it didn't create a copy of the value "Teg". You could call this copying "shallow": it copied the `900`, the `3` (the name's length) and the `3c9414e99` (the name's pointer). Just like our simpler example above, we can also assign using the dereferencing operator:

```zig
fn updateUser(user: *User) void {
    // using type inference
    // could be more explicit and do
    // user.* = User{....}
    user.* = .{
        .id = 5,
        .name = "Paul",
    };
}
```

This doesn't copy anything; it writes into the address that we were given, the address of `main`'s `user`:

```
updateUser
user ->   -------------
          |  83abcc30 |------------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
main                                           |
user ->   -------------                        |
          |     5     | <-----------------------
          -------------
          |     4     |
          -------------
          |  9bf4a990 | -----------------------
          -------------                        |
                                               |
          .............   empty space          |
          .............   or other data        |
                                               |
          -------------                        |
          |     P     | <-----------------------
          -------------
          |     a     |
          -------------
          |     u     |
          -------------
          |     l     |
          -------------
```

If you're still not fully comfortable with this, and if you haven't done so already, you might be interested in the pointers and stack memory parts of my Learning Zig series.
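Putting the pieces together, here's a small runnable sketch (`replaceUser` is a name made up for this example). It shows that a shallow copy keeps the old values, while the `user.* = .{...}` assignment overwrites the caller's memory:

```zig
const std = @import("std");

const User = struct {
    id: i32,
    name: []const u8,
};

pub fn main() !void {
    var user = User{ .id = 900, .name = "Teg" };

    // a shallow copy: copies the id and the name's length + pointer
    const copy = user;

    replaceUser(&user);

    // the original was overwritten through the pointer...
    std.debug.print("{d} {s}\n", .{ user.id, user.name }); // 5 Paul
    // ...but the copy still holds the old values
    std.debug.print("{d} {s}\n", .{ copy.id, copy.name }); // 900 Teg
}

fn replaceUser(user: *User) void {
    // write an entirely new User into the caller's memory
    user.* = .{ .id = 5, .name = "Paul" };
}
```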