From YouTube: August 2022 OpenZFS Leadership Meeting
Description
Agenda: ZED config; failmode=error; microzap size
full notes: https://docs.google.com/document/d/1w2jv2XVYFmBVvG1EGf-9A5HBVsjAYoLIFZAnWHhV-BM/edit#
B: Yeah, that's Rob and I.
B: We discussed it a bit last month, when John from Seagate was here. He was talking about having this issue where he was getting spurious errors, and sometimes there would be just enough to set off the threshold: the ten errors within fifteen minutes, or whatever the limits hard-coded in the C file are. We were looking at improving the configurability of that, and Rob did a bit of a survey of what other issues people have had around ZED configurability.
B: We saw an old ticket from around 2019 about controlling the behavior of spares, and a bunch of other things that were configurable on Solaris but are not in ZED on Linux. Based on that, what Rob and I think makes the most sense is to use the new vdev properties framework as the way to provide the configuration, somewhere ZED can look up this information, because in particular we had thought about doing command-line parameters.
B: Last month John from Seagate brought up the fact that if they have two different pools, they might not want the same values for each pool, and then the command-line option gets awfully messy as soon as you're trying to do more than one pool or something. So using the existing vdev properties framework does seem to make the most sense there, and it fits with the other part of that work.
B: Mariusz Zaborski did the tracking down on that, and found that the reason they were seeing different behavior with ZFS versus EXT4 or XFS or btrfs on Linux was that ZFS specifically sets the failfast option on the I/Os before it sends them down to the Linux kernel. So it's specifically asking the kernel: never retry this, just report the error back to me,
B: and ZFS will deal with it instead. In this particular case, because the errors are basically transport errors (a problem getting the signal across the fabric they're using), they actually do want the retries, and so we're looking at adding a vdev property for that as well. It would say: on these disks, set the failfast mask so as to ignore transport errors and let the driver retry those.
B: But still, if there are other errors, those will still fail immediately and go back up into ZFS, so that ZED can still kick out drives that are failing, but it won't spuriously kick out drives based on transport errors, where a retry actually does make sense.
B: So there's the number of I/O errors and the amount of time, and the number of checksum errors and the amount of time; that is, for both I/O errors and checksum errors, the number of errors in how many seconds that results in ZED deciding that the disk is bad and should be kicked out.
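A minimal sketch of those four per-vdev thresholds, assuming illustrative property names built from the error kind plus an _n/_t suffix, a hypothetical lookup helper, and the hard-coded ZED defaults (10 events in 10 minutes) as the fallback; none of these names are settled in the meeting:

```c
/* Sketch only: per-vdev diagnosis thresholds with hard-coded fallbacks. */
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

#define	DEFAULT_N	10	/* events */
#define	DEFAULT_T	600	/* seconds */

/* Hypothetical helper: query a vdev property, false if unset. */
static bool
vdev_prop_get_u64(const char *vdev, const char *prop, uint64_t *val)
{
	(void) vdev; (void) prop; (void) val;
	return (false);	/* a real version would read the vdev ZAP */
}

/* kind is "io" or "checksum"; yields N errors in T seconds. */
static void
get_thresholds(const char *vdev, const char *kind, uint64_t *n, uint64_t *t)
{
	char prop[32];

	(void) snprintf(prop, sizeof (prop), "%s_n", kind);
	if (!vdev_prop_get_u64(vdev, prop, n))
		*n = DEFAULT_N;
	(void) snprintf(prop, sizeof (prop), "%s_t", kind);
	if (!vdev_prop_get_u64(vdev, prop, t))
		*t = DEFAULT_T;
}
```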
B: And then, only slightly relatedly, the fifth vdev property we would do is for controlling the mask of the failfast bits that we send down to the Linux block subsystem, so that for certain errors the driver (like the controller driver) will auto-retry instead of not, because currently ZFS is explicitly telling the driver not to. We want to be able to tune that, to say: retry transport errors, but don't retry if the drive itself returned an error.
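For reference, a sketch of what masking those bits could look like on the Linux side, using the three failfast flags the block layer defines; the mask encoding (1 = device, 2 = transport, 4 = driver) and the idea of feeding it from a vdev property are assumptions here, not a settled interface:

```c
/* Sketch: choose which failfast bits ZFS sets on a bio. */
#include <linux/bio.h>
#include <linux/blk_types.h>

/* Assumed encoding: 1 = device, 2 = transport, 4 = driver. */
static unsigned int zfs_vdev_failfast_mask = 1 | 2 | 4;

static void
vdev_bio_set_failfast(struct bio *bio)
{
	unsigned int flags = 0;

	if (zfs_vdev_failfast_mask & 1)
		flags |= REQ_FAILFAST_DEV;	/* drive reported an error */
	if (zfs_vdev_failfast_mask & 2)
		flags |= REQ_FAILFAST_TRANSPORT; /* clearing bit 2 lets the
						  * driver retry transport
						  * errors, as discussed */
	if (zfs_vdev_failfast_mask & 4)
		flags |= REQ_FAILFAST_DRIVER;

	bio->bi_opf |= flags;
}
```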
D: Allan, is there an inheritance facility for vdevs? I mean, do they have a common pool-level one, or would you have to set it individually on every drive?
B: We didn't do inheritance when we designed it; we talked about when it would be useful and when it wouldn't. I think in this case we can probably do something reasonable, where you would only have to set it on each top-level vdev instead of every vdev, or have it do something like: if the setting isn't set either way on the leaf vdev itself, then check the top-level vdev, to avoid having to set it on every disk.
B: But we don't currently have inheritance on vdev properties, because for some of them it didn't make sense to have it. There are some cases where maybe it does, though, so I'm definitely open to people's thoughts on how to deal with inheritance there. I guess we do have the precedent of, I think, canmount and a couple of other normal properties that don't inherit.
D: Yeah, I was just thinking off the top of my head that in the general sense it's more like a pool property, the error-rate type stuff, although you could have a non-homogeneous pool, I suppose.
B: We talked about a pool property originally, but Brian was kind of against that. Vdev properties are a little more out of the way, so they don't clutter things up for the user as much, but it might actually make sense to do a pool property for these instead. It comes down to whatever you guys accept when we upstream it, I guess.
B: The root vdev does, I think, have a zap. The only issue is that I'm not sure how you would address it with the command-line interface we have now, because when you specify multiple names on the command line, it checks whether the second one is a pool or not to decide whether you're doing vdev properties. So it might not actually be possible to access the root one, but we might be able to come up with a way around that; we already made a special keyword like 'all-vdevs', and maybe we could have 'root-vdev' or something to access the root of the pool's vdev properties.
D: I'm surprised it's taking this long. When we had done the original ZED work, we had, I think, a phase three where we were going to do the configuration, and it just never had an owner, I guess.
B: Yeah, we looked at that too, and thought about pulling in libucl or something to do config parsing so we could make a config file for it. But once you start thinking about how you would lay out that config file in a way where you could specify values for different pools or whatever, it quickly became: do we just make this a pool property?
E: Yeah, good morning, Mister Rob. I think illumos just has a configuration file that they pull in, and they only end up setting three values. It'd be quite a bit of work to bring that in for only having to set a few things for the thresholds. But that's just what I noticed as I was looking at some of it.
E: I didn't see any historical references to it.
E: The ones on FreeBSD are quite a bit more lenient as far as the defaults in the fault manager we're using over there; I think it's sixty in some odd seconds or something, as opposed to 10 and 10, that is, 10 errors in 10 minutes, or whatever.
D: The ZFS exports file can sometimes have orphaned entries in it, and on Linux that prevents the NFS server from starting up: it tries to stat everything, and if it can't, it just falls over. So I was looking at ways to prevent that. The first approach would be just to reset that file whenever we do 'zfs share -a', and in our case that actually solves the problem, because before the Linux NFS server starts, it will call the share service to do that.
D: But then, in talking with Allan about FreeBSD and sort of how it's architected, it kind of makes more sense to push it down into ZFS proper rather than layering it on top, and I've started on that. The other issue is that destroy itself (at least on Linux; I don't know about FreeBSD), instead of looking at all the datasets to see if they have the sharenfs property, scopes it to just what the proc filesystem knows is mounted. So if your file system is not mounted, or was unmounted behind ZFS's back using umount, then when you do a destroy and you're tearing things down, it's only looking at proc, and if you don't have a mount, you'll leak, you'll orphan, that entry. So, Allan, what I was thinking as a next phase is: we have the exports file, and there's one on FreeBSD too, in addition to looking at...
D: ...that makes sense, so it should solve at least the one case I was seeing where, if it's not mounted, we orphan the reference.
B: Yeah, because I think the other idea I had for that was: we have this 'mounted' read-only property that tells us whether the dataset is mounted, and we could have one for 'shared'. Then at destroy time, as we're walking the list of datasets and dealing with them, unmounting them and everything, we could make sure to unshare them before the pool goes away.
D: ...other than that, the file is hard-coded inside it. Okay, so we'd have to parse that file. The trick would be: what's the format of that file? I don't know, but to the extent we can parse it, we can generate a list of additional things to check. So I think, yeah, with those two changes...
B: On FreeBSD, any shares you manually configure will be in /etc/exports, the ZFS ones will be in /etc/zfs/exports, and mountd just reads both. I'm less sure what would happen if you had the same one in both, but the /etc/zfs/exports one has a big statement at the top about how it's owned by ZFS and you shouldn't modify it. The problem Don ran into was: if you manually unmount the dataset with umount, rather than 'zfs umount', and then destroy the pool,
B: ZFS didn't notice that it needed to remove that dataset from the exports file, and so if you reset the machine with that pool now destroyed, the file would still say: hey, go share this dataset via NFS; but it doesn't exist anymore. On FreeBSD that will just generate a warning, but on Linux it will actually stop nfsd from starting.
D: No, Linux uses a .d convention, so there's /etc/exports.d/, and in there is a zfs.exports. What ZFS does is maintain that list as you add or remove a share, and then it tells NFS, via exportfs: hey, we changed that list. It's sort of like systemd, where you say: go rebuild everything, because I changed one of my files. So it gets consumed that way, but...
B: Yeah, whereas on FreeBSD the rc.d script for mountd checks whether ZFS is enabled, and if it is, then in addition to pulling in /etc/exports it also pulls in /etc/zfs/exports and parses both. But like I said, the big difference is that when it stats one of them and it doesn't exist, it just emits a warning and skips it, rather than breaking nfsd.
D: I think the umount goes directly to the kernel, so then...
D: Yeah, so, never mind. But I think for Linux we're covered, because anytime you restart the server, you also restart the share service, right?
D: You add a new entry to the file, but the file has an orphaned entry in it; does that prevent the new share from happening? That might be a problem.
D: Yeah, that has no bearing on the service itself. Where it's a problem is that the NFS server uses that file: it runs exportfs to regenerate things, and if that call fails, then it falls over. But if you just call that independently, it doesn't affect the server.
D: Well, what we do is actually copy the file. When you add a new entry, we create a new file, add the entry, and then copy over the old entries. So as we copy them, we could check; we can cull the orphans there. So I could...
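A small sketch of that cull-while-copying idea: while rewriting the exports file to add an entry, skip any old line whose exported path no longer stats. Treating the first whitespace-delimited field as the path is a simplification of the real exports syntax:

```c
/* Sketch: drop orphaned exports while copying old entries forward. */
#include <stdio.h>
#include <sys/stat.h>

static void
copy_live_entries(FILE *in, FILE *out)
{
	char line[4096], path[4096];
	struct stat st;

	while (fgets(line, sizeof (line), in) != NULL) {
		if (sscanf(line, "%4095s", path) == 1 &&
		    path[0] == '/' && stat(path, &st) != 0)
			continue;	/* orphan: mountpoint is gone */
		fputs(line, out);	/* keep comments and live entries */
	}
}
```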
B: The next item we had was an interesting case having to do with failmode. We looked into it a bit, and currently the options are: failmode=wait, which causes any attempt to write to a suspended pool to just wait until the pool is no longer suspended; continue, which is kind of a hybrid, where anything that was already in progress will continue to wait, but anything new that starts after the pool suspended will just return an error, and some reads might succeed if they're in the ARC, or if the remaining disks can handle them, or whatever; and then we have panic, which isn't very fun.
B: But the use case we've run into is some applications that would rather get the error and be able to return it to the original request, or have their load balancer send the request somewhere else, or something like that. So if you do an asynchronous write and then fsync it while the pool is suspended, that fsync will just not return until you
B: repair whatever's wrong with your pool and get it unsuspended, and that leads to the fsync just sitting there forever, which can take down the web service or whatever it is that's making the requests to the pool. So we wondered about adding failmode=error, or failmode=fail, or something like that, where when the pool is suspended we will actually go and return an error for all of those pending fsyncs and so on.
B: Some of the code for this exists in the old pull request for forced export, where we go and unwind all the pending I/Os. Having an option like that, if we finish force export, also raises the question: would it be valuable to have failmode=export, where, if this pool gets suspended and gets to the point where it's stuck, you say:
B: go force-export it for me, so that it doesn't cause issues with my other pools. If it's hung holding the spa_namespace lock or something, where it's going to cause every other ZFS command to hang waiting for that lock, then this one pool going down is going to mean all the other pools in my system no longer work.
B: So we're looking at adding a new failmode. Currently, if the pool gets suspended and you had done an asynchronous write and called fsync, the fsync will just wait forever, until the pool is unsuspended and it can actually complete the transaction you're waiting for. We were looking at adding an option so that if the pool is suspended, any currently pending operations that are stuck would actually return an error and unfreeze the applications.
B: This particular customer's application can definitely do something useful with that. It's like an API request, so it can return to the caller that the put failed, and then the caller will just try again, and the request will get routed to a different machine or something, and it's okay.
B: Yeah; at least, I remember there was a whole kerfuffle about this with Postgres, where I think on Linux, when the fsync returned an error, the data didn't stay dirty like it was supposed to, and so a second call to fsync wouldn't ever ensure that the write succeeded, even though the data that was supposed to still be pending had never been flushed. Whereas on illumos and FreeBSD that wasn't an issue, because the data stayed dirty.
B: So a second fsync would wait for it to be flushed as well. But yeah, I think you're right: that would make sure it follows the contract of what fsync is supposed to do, as per the spec.
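As an illustration of that contract, a minimal userspace pattern; the comment reflects the Linux writeback behavior (notably before kernel 4.13) that the Postgres incident exposed, where retrying fsync on the same descriptor is not a safe recovery strategy:

```c
/* Write-then-fsync with the error treated as "data is not durable". */
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

static int
durable_put(int fd, const void *buf, size_t len)
{
	if (pwrite(fd, buf, len, 0) != (ssize_t)len)
		return (-1);
	if (fsync(fd) != 0) {
		/*
		 * With a failmode=error style pool this would return
		 * promptly (e.g. EIO) instead of blocking while the pool
		 * is suspended. Either way, report failure upward and
		 * re-drive the whole write; don't just call fsync again.
		 */
		(void) fprintf(stderr, "fsync: %s\n", strerror(errno));
		return (-1);
	}
	return (0);
}
```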
B: And then the second one I was just musing about: if we finish the forced-export feature, do we have failmode=export? So if a pool encounters an error that would result in its being suspended, it gets force-exported, so that it doesn't interfere with any other pools running on the system. Because when a pool gets suspended
B: while something was trying to do 'zfs list' or whatever, and it's holding the spa_namespace lock and doing an operation that gets stuck, then every other ZFS command will not succeed until the suspended pool comes back to life or gets force-exported.
B: Yeah, and failmode=error, or fail, or whatever, might mitigate some of that as well, just because we will actually return an error, although
B: I think things like the wait-for-transaction and wait-for-sync functions are void, so they don't really have a way to signal back to the caller that, yes, you waited for this transaction to complete, but it actually failed; there's no return value. Although I think some of that was addressed in the forced-export pull request.
F: Well, yeah, I think in some situations the code just doesn't expect it, or doesn't accept it, in some iterations through some AVL lists or things like that.
B: But yeah, we found quite a few of those when Will Andrews did the forced-export work, and we made a bunch of the more important internal bits of ZFS actually able to return an error instead of just being void. But yeah, we'll have to look over there.
B: If you get the disks back or whatever, and do a 'zpool clear', then maybe you're okay. Otherwise, yeah, without forced export you're basically stuck having to reboot. So if you have three or four pools and one of them goes on walkabout, your whole system can end up not very useful, and especially when it's the least important of the four pools that this happens on, it makes you really upset.
A: Well, while there's some time I'll mention that the deadline for abstracts for talks at the OpenZFS Developer Summit is two weeks from yesterday. I'll also mention that the corruption-healing patch finally was merged, so thanks to everybody that helped with that.
B: That's another question I had, about some of the metadata blocks and indirect blocks. I think in a bunch of cases the maximum size of something like an indirect block is set at 128K. I've not git-blamed it, but I assume that's just from before large blocks were a thing, and in particular I'm wondering about having an option to increase that size. Obviously that's probably an on-disk format change, just because of what it changes, but especially for things like microzaps:
B: if we allowed a microzap to be bigger, that might improve the performance of directories with moderate numbers of files, say more than two thousand but less than ten thousand, because we see a very large increase in latency when writing to a directory once it migrates from being a microzap to a fatzap.
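Some back-of-envelope arithmetic behind the 'more than two thousand' figure, assuming the 64-byte microzap chunk size (MZAP_ENT_LEN) with one chunk consumed by the header; long names force a fatzap regardless of entry count:

```c
/* Roughly how many entries fit in a single-block microzap. */
#include <stdio.h>

int
main(void)
{
	const unsigned int ent = 64;	/* bytes per microzap chunk */

	for (unsigned int bs = 128 << 10; bs <= 512u << 10; bs <<= 1)
		printf("%4u KiB block: %5u entries\n",
		    bs >> 10, bs / ent - 1);
	return (0);
}
```

At today's 128K cap that is about 2047 entries, so doubling or quadrupling the block size would move the painful microzap-to-fatzap migration out to directories several times larger.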
F: But you started speaking about indirect blocks. In our benchmarks we found that the default indirect block size of 128 kilobytes creates huge amplification if we do random block rewrites. That's why on TrueNAS we lowered the default to 32 kilobytes. I think 64 might be a better intermediate, but 32 works pretty well in many cases, if not most, because
F: yeah, you're rewriting one 32-kilobyte block, for example, and then after it you write six layers of 128-kilobyte indirects, and that creates the amplification, but also the compression and decompression of those indirect blocks between the ARC and the DMU cache; there's a lot of CPU there. So just a reduction of the indirect block size saved a lot on benchmarks, at least on synthetic ones.
F: It should hurt some disk scans or sequential operations, but hopefully the prefetcher handles most cases there, so it shouldn't be too bad to have reasonably smaller blocks. As I said, I think maybe 64 could be a better default than 128. I never proposed that upstream, but we may consider it. At the least, I definitely don't think it should be increased; I don't think that works unless somebody has some absolutely huge block sizes and purely sequential operation, and even then it doesn't make much sense.
B: I think in this particular case it's NVMe over Fabrics, and the throughput is so high and the latency so low that writing one bigger block might actually be faster than having more levels of indirection.
B: One thing we were considering was: should the indirect block size follow the record size? So if you have 32K blocks, should each of your indirects also be 32K, for the same reason, to avoid the amplification? But it would also mean that if you use larger blocks, you could have larger indirect blocks.
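The fan-out arithmetic behind that tradeoff, assuming the 128-byte blkptr_t: a 128K indirect block holds 1024 block pointers and a 32K one holds 256, so smaller indirects mean more levels for the same file:

```c
/* Count indirect levels needed for a file of a given size. */
#include <stdint.h>
#include <stdio.h>

static int
levels(uint64_t fsize, uint64_t datablk, uint64_t indblk)
{
	uint64_t span = datablk;	/* bytes covered at level 0 */
	uint64_t fanout = indblk / 128;	/* 128-byte block pointers */
	int lvl = 0;

	while (span < fsize) {
		span *= fanout;
		lvl++;
	}
	return (lvl);
}

int
main(void)
{
	uint64_t fsize = 1ULL << 40;	/* 1 TiB file of 32K records */

	printf("128K indirects: %d levels\n",
	    levels(fsize, 32 << 10, 128 << 10));
	printf(" 32K indirects: %d levels\n",
	    levels(fsize, 32 << 10, 32 << 10));
	return (0);
}
```

For that 1 TiB example it prints three levels with 128K indirects versus four with 32K ones: fewer, fatter blocks against more, cheaper-to-rewrite ones.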
F: Oh yeah, I'm also not sure about that. A significant part of what we saw over here was the compression and decompression from the ARC into the DMU; obviously there's the traffic too, but if the traffic doesn't bother you, it's the CPU that may actually matter.
F: If you have huge holes in a file, then each access to them will create the tree and then destroy it; you recreate and destroy it repeatedly. That may be a separate problem that could be optimized somehow, because right now, I think, as soon as we commit a transaction and we see that a block is all zeros (because there is no indirect block, it's all zeros), we just evict that block, and on the next access...
B: If you modify it, then you're writing a lot more; but do the fewer layers of indirection provide any benefit? We'll have to test that out. It's mostly the zaps that we're looking at, just your files in a directory, but we had wondered about some of these limits, and like you were saying, it might make sense to have some way to control that.
F: A module parameter on Linux, something like that; there are probably even several of those parameters for the different types, yeah.
B: So we'll look at that a bit and report back what we find.
F: Speaking about other blocks: for some time I was thinking of increasing the maximum ZIL log block size from 128K to slightly more, say four kilobytes more, so that we could fit a full 128K block into one log record.
F: I don't know whether it's worth increasing it further, but at least that one specific case could be beneficial for some workloads, since 128K is quite a popular block size and I/O size, and splitting it into chunks is probably not good; splitting one I/O into parts makes them non-atomic.
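The arithmetic here is simply that a log block carries a chain header plus a per-record header, so its payload capacity is a bit under the block size; the header sizes below are placeholders, not the real struct sizes:

```c
/* Why a full 128K record doesn't fit in a 128K log block. */
#include <stdio.h>

int
main(void)
{
	const unsigned int blk = 128 << 10;
	const unsigned int chain_hdr = 96;	/* assumed chain header */
	const unsigned int rec_hdr = 192;	/* assumed record header */
	unsigned int payload = blk - chain_hdr - rec_hdr;

	printf("payload: %u of %u bytes; a 128K record %s\n", payload, blk,
	    payload >= blk ? "fits" : "does not fit");
	return (0);
}
```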
B: The last one I had was looking at the performance of copy_file_range on FreeBSD, and I saw that; thanks for taking a look at it. Does Linux do something like pathconf to return the granularity of sparseness in a file? FreeBSD just always returns SPA_MINBLOCKSIZE or whatever, 512 bytes, and I think Alan Somers's suggestion was fixing that to actually return the record size of the specific vnode being queried.
B: But I just wondered what happens on the Linux side for things like that, where you're doing SEEK_DATA and SEEK_HOLE and want a way to know what alignment those holes might have.
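A small illustration of the interfaces in question: lseek with SEEK_DATA/SEEK_HOLE exists on both Linux and FreeBSD, while the hole-granularity query shown (_PC_MIN_HOLE_SIZE) is a FreeBSD pathconf name with, as far as I know, no Linux equivalent:

```c
/* Probe sparseness and, where available, hole granularity. */
#define _GNU_SOURCE	/* SEEK_DATA/SEEK_HOLE on Linux */
#include <stdio.h>
#include <unistd.h>

static void
probe_holes(int fd)
{
	off_t data = lseek(fd, 0, SEEK_DATA);	/* first data from 0 */
	off_t hole = lseek(fd, 0, SEEK_HOLE);	/* first hole from 0 */

	printf("data at %lld, hole at %lld\n",
	    (long long)data, (long long)hole);

#ifdef _PC_MIN_HOLE_SIZE
	/*
	 * FreeBSD: the smallest hole the filesystem reports. The
	 * suggestion discussed is returning the dataset's recordsize
	 * here instead of a flat SPA_MINBLOCKSIZE (512).
	 */
	printf("hole granularity: %ld\n",
	    (long)fpathconf(fd, _PC_MIN_HOLE_SIZE));
#endif
}
```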
B: Did anyone else have anything before we run out of time?
B: And yeah, the next meeting will be in four weeks, back at the regular time, not the early time.