From YouTube: 2018-Mar-15 :: Ceph Performance Weekly
Description
Weekly collaboration call of all community members working on Ceph performance.
http://ceph.com/performance
For full notes and video recording archive visit:
http://pad.ceph.com/p/performance_weekly
B: Okay, let's see — there's a patch that changes the hashing behavior for several different data types. This is great; it looks like we were using a totally weird hash there. Robbie, thanks.

Okay, yep, so that's good. He managed to track down why a user was seeing very high CPU utilization — turns out it was just hash collisions, which is great. I mean to review and test that; I'll probably get back to it once it's tested. And there's a second one, also more hash collisions with...
B: Yeah, it works in master, and this should fix it in Luminous too, at least. Okay, let's see — there's an improvement on the OSD for the batch listing that merged, and the LMDB experiment: closed out that old pull request. So that's all good.
B: It needs to get retested; I think those are other things that are already fixed — let's just retest it, Mike. This needs QA, so that's probably fine. That local read thing, the small backend optimization, didn't work; I haven't really looked at it yet, but it's kind of a low-priority thing, so I'm not too worried about it.
B: Don't know. All right, okay — yeah, I guess that's mostly it. There's some other stuff here, but all of this stuff is pretty old.
B: I could just update Adam's implementation, or Jesse could if he wants to — I don't know. Jesse seems like he was working on his own one, but I don't know how it's going to be any different. Set up a wrapper mutex that aliases to one — either that or std::mutex — and then just start updating all the uppercase Mutex users to use the std mutex instead. There are going to be a couple that are going to be annoying because they use...
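A minimal sketch of that alias idea, assuming a compile-time switch between std::mutex and a debug-checking wrapper (the names and the wrapper are hypothetical, not the actual Ceph code):

    // Hypothetical sketch: choose the mutex implementation at compile time.
    #include <atomic>
    #include <cassert>
    #include <mutex>
    #include <thread>

    namespace ceph_sketch {
    #ifdef NDEBUG
    using mutex = std::mutex;               // release builds: zero-cost alias
    #else
    class debug_mutex {                     // debug builds: catch self-deadlock
      std::mutex m;
      std::atomic<std::thread::id> owner{};
     public:
      void lock() {
        assert(owner.load() != std::this_thread::get_id());
        m.lock();
        owner.store(std::this_thread::get_id());
      }
      void unlock() {
        owner.store(std::thread::id{});
        m.unlock();
      }
    };
    using mutex = debug_mutex;
    #endif
    }  // namespace ceph_sketch

    // Call sites would then change from the uppercase Mutex to:
    //   ceph_sketch::mutex lock;
    //   std::lock_guard<ceph_sketch::mutex> g(lock);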
C: Basically, glibc offers several kinds of mutexes. Some of them are non-POSIX — of course, they are marked as non-portable — but you can still get access to them: things like the adaptive mutex, and an especially interesting one is the eliding mutex. The eliding mutex is built on top of Intel's Transactional Synchronization Extensions — TSX, the transactional memory extension. The idea is that if you have two threads accessing the same critical section, and their memory operations don't actually conflict, both can proceed without serializing on the lock.
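For reference, a small sketch of how one of those non-portable kinds is requested through the pthread attribute API (PTHREAD_MUTEX_ADAPTIVE_NP is the glibc adaptive type; error handling trimmed):

    #ifndef _GNU_SOURCE
    #define _GNU_SOURCE           // PTHREAD_MUTEX_ADAPTIVE_NP is a GNU extension
    #endif
    #include <pthread.h>

    // Sketch: request glibc's adaptive (spin-then-block) mutex kind.
    int make_adaptive_mutex(pthread_mutex_t *m) {
      pthread_mutexattr_t attr;
      pthread_mutexattr_init(&attr);
      pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ADAPTIVE_NP);
      int r = pthread_mutex_init(m, &attr);
      pthread_mutexattr_destroy(&attr);
      return r;  // 0 on success
    }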
C: About the complexity and its consequences: the TSX extension popped up in Haswell; however, a hardware bug was discovered, and it was disabled in a microcode update. The microcode update brought other problems, so a workaround in glibc was made, and a lot of distros are not enabling HLE for default mutexes still, at least the modern-day ones. The support for elision is available in glibc because it has two levels of control: one is something like opt-in elision per mutex; the second one enables elision by default.
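To make the elision idea concrete, a minimal sketch against the TSX/RTM intrinsics — an illustration only, not the glibc implementation; it assumes a TSX-capable CPU and compiling with -mrtm:

    // Sketch: speculative critical section via Intel RTM intrinsics.
    // If the transaction aborts (e.g., a real memory conflict), fall back
    // to taking the lock for real -- roughly what lock elision does.
    #include <immintrin.h>
    #include <atomic>

    std::atomic<int> fallback_lock{0};

    inline void locked_increment(long& counter) {
      unsigned status = _xbegin();
      if (status == _XBEGIN_STARTED) {
        if (fallback_lock.load() != 0) _xabort(0xff);  // lock held: abort
        ++counter;     // speculative write, committed atomically by _xend
        _xend();
        return;
      }
      // Transaction aborted: take the fallback lock the slow way.
      while (fallback_lock.exchange(1) != 0) { /* spin */ }
      ++counter;
      fallback_lock.store(0);
    }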
C: There's a lot of unnecessary stuff. First of all, it usually takes two cache lines; it makes writes; and, moreover, it makes a lot of conditional branching that is packed very tightly together, which could affect branch prediction — I mean, having those extra conditional jumps in the same fetch, the same memory block that the front end is working on.
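A quick way to check the footprint part of that claim locally (the sizes in the comment are what x86-64 Linux/glibc typically reports; other platforms differ):

    // Print the size and alignment of the common mutex types.
    // On x86-64 Linux/glibc, pthread_mutex_t is typically 40 bytes, so an
    // unaligned instance can straddle two 64-byte cache lines.
    #include <cstdio>
    #include <mutex>
    #include <pthread.h>

    int main() {
      std::printf("pthread_mutex_t: size=%zu align=%zu\n",
                  sizeof(pthread_mutex_t), alignof(pthread_mutex_t));
      std::printf("std::mutex:      size=%zu align=%zu\n",
                  sizeof(std::mutex), alignof(std::mutex));
    }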
C: Well, I think I have a branch. It's turning out that all the options we are passing to a Mutex constructor are solely compile-time. The goal is to avoid as many modifications as possible, but that means it translates into staying with the big-M Mutex in most cases, I'm afraid. Yep, okay.
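A sketch of what compile-time-only options enable — moving them into template parameters so the untaken branches vanish (hypothetical names, not the branch under discussion):

    // Sketch: since the Mutex options (e.g., whether debug tracking is
    // wanted) are known at compile time, they can become template
    // parameters and the unused code paths disappear entirely.
    #include <cstdio>
    #include <mutex>
    #include <string>

    template <bool Debug>
    class basic_mutex {
      std::mutex m;
      std::string name;  // only meaningful in the Debug instantiation
     public:
      explicit basic_mutex(std::string n = {}) : name(std::move(n)) {}
      void lock() {
        if constexpr (Debug) std::printf("locking %s\n", name.c_str());
        m.lock();
      }
      void unlock() {
        m.unlock();
        if constexpr (Debug) std::printf("unlocked %s\n", name.c_str());
      }
    };

    using fast_mutex  = basic_mutex<false>;  // release builds
    using debug_mutex = basic_mutex<true>;   // debugging builds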
E: ...that's taking over the network and serving everything. I think it's less clear exactly how we want to divide up the cores among logical OSDs, and the structures within the OSD process — whether there would actually be multiple logical OSD structures and multiple messengers, or if you want to do one shared stack for everything.
E: So if we did have separate, essentially, OSDs, obviously separated onto different cores, or different subsets of cores, such that we could preserve, like, NUMA locality within one OSD — hopefully — then maybe you have perhaps one core per OSD, so you use it as the messenger core for that OSD, or for some subset if there are more cores. Right, one core for every disk.
B: So I think it's actually the other way around from where we want to... So it seems like, if we have multiple hardware — so I'm assuming we're talking about DPDK here — if you have a separate, I think it's what they call a virtual network function, you know, for each OSD, then you could have them on different cores.
A: It's this kind of question of, like, where ultimately the layer is at which you're transferring data to other cores, right? What happens locally on the core and what happens distributed; what happens at the DPDK level versus what happens at, like, the messenger level. And, I don't know — do we have a clear understanding yet of kind of what...?
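For illustration, a generic sketch of that "hand data to the owning core" layer — plain C++ threads and queues standing in for whatever framework ends up doing this:

    // Sketch: route each incoming item to the core that owns it, so all
    // per-OSD state stays core-local and needs no locking there.
    #include <condition_variable>
    #include <deque>
    #include <mutex>
    #include <vector>

    struct CoreQueue {
      std::mutex m;                 // the cross-core boundary in this sketch
      std::condition_variable cv;
      std::deque<int> items;        // stand-in for messages
      void push(int v) {
        { std::lock_guard<std::mutex> g(m); items.push_back(v); }
        cv.notify_one();
      }
      int pop() {
        std::unique_lock<std::mutex> l(m);
        cv.wait(l, [&] { return !items.empty(); });
        int v = items.front(); items.pop_front();
        return v;
      }
    };

    // Dispatch: whoever receives a packet decides which core owns it.
    inline void dispatch(std::vector<CoreQueue>& cores, int osd_id, int msg) {
      cores[osd_id % cores.size()].push(msg);  // shard by OSD id
    }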
B: Are we going to throw all this AsyncMessenger stuff up in the air, with the Seastar refactor being a version of AsyncMessenger or whatever? Because we probably want both of those things, right? Yeah, and I'm kind of guessing that the Seastar one is going to be a port — it's going to be, like, copy the directory and then change it. I don't know; it's not going to live in the same tree or whatever, right? Maybe — I don't know. Yeah.
B: And the end point is that we have a Seastar messenger that implements messenger v2 only, AsyncMessenger does both messenger v1 and messenger v2, and SimpleMessenger does only messenger v1. I think that's probably a fine end point, because most of the world will be on AsyncMessenger as they make the transition, and then once they do make the transition to messenger v2 for their whole cluster, they can start using the Seastar one.
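Sketched as types, the end state described above (illustrative names, not the actual Ceph classes):

    // Sketch of the proposed end state: which messenger speaks which
    // protocol version.
    enum class Protocol { v1, v2 };

    struct Messenger {
      virtual bool supports(Protocol p) const = 0;
      virtual ~Messenger() = default;
    };

    struct SimpleMessengerSketch : Messenger {    // legacy: v1 only
      bool supports(Protocol p) const override { return p == Protocol::v1; }
    };
    struct AsyncMessengerSketch : Messenger {     // transition: v1 and v2
      bool supports(Protocol) const override { return true; }
    };
    struct SeastarMessengerSketch : Messenger {   // future: v2 only
      bool supports(Protocol p) const override { return p == Protocol::v2; }
    };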
B: I guess that won't actually work, because we still need to support messenger v1 for clients. Yeah, so I'd probably have to do both. I take it back — we'll need both in the async msgr regardless, just so we can get all the new critical features.
B: It's not that significant a rewrite, but I don't know. I think the part that's fuzzy is how we're going to internally structure the code when we have multiple OSDs sharing the same messenger, or at least the same core. Are they going to have different messenger implementations that have some, like, back-end thing that they both share, or are they literally going to point at the same messenger, or what? I'm not really sure exactly how that's going to work.
B: Yeah, there are, like, a bunch of different ways we could go at it. It could be that when you run ceph-osd, like, on the command line, you tell it all the OSDs it's going to be, and it just does it all. Or it could be that you have, like, an OSD runner process that you start, and then you, like, tell it, you know: start up, instantiate both of these OSDs — and it instantiates the OSD, or, like, shuts it down.
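A toy sketch of that second option — a runner process that instantiates and tears down OSD instances on request; everything here is hypothetical:

    // Sketch: one process hosting several OSD instances, created and
    // destroyed at runtime on request (e.g., from a control socket).
    #include <cstdio>
    #include <map>
    #include <memory>

    struct OsdInstance {                  // stand-in for a logical OSD
      int id;
      explicit OsdInstance(int i) : id(i) { std::printf("osd.%d up\n", id); }
      ~OsdInstance()                      { std::printf("osd.%d down\n", id); }
    };

    class OsdRunner {
      std::map<int, std::unique_ptr<OsdInstance>> osds;
     public:
      void start(int id) { osds.emplace(id, std::make_unique<OsdInstance>(id)); }
      void stop(int id)  { osds.erase(id); }
    };

    int main() {
      OsdRunner runner;
      runner.start(0);   // "instantiate osd.0"
      runner.start(3);   // "instantiate osd.3"
      runner.stop(0);    // shut osd.0 down; osd.3 keeps running
    }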
B: So you have OSDs coming and going within the same process, and then ceph-osd might actually just be a thing that communicates with the background process to, like, instantiate the OSDs that you asked about. Yeah, and if you want it more transparent — oh yeah — but then it's like, if you have a systemd unit file for each OSD today, do they sit there and just talk to the shared process? I don't know; there are, like, four ways you could do it.
B: ...instantiate it — and so we could eventually get to the point where you have the process running multiple OSDs and, like, hardware fails and gets replaced, and the other OSDs do stay running. Yep. There are, like, ten other things that have to happen to get there — at least ten other things have to happen to get there. I guess it's probably a good place to aim, but it might be that for the initial thing we just start them all up at once.
B: But yeah, I think the first part I'm worried about is just how those OSDs will have their messenger-facing interfaces constructed when they're eventually sharing. I think mostly the only thing that's sort of, like, per-entity state tied to the messenger is the my_addr stuff.
B: And there are, like, a bunch of other little things too, like having multiple addresses for the same endpoint — so you could have, like, an IPv4 and an IPv6 address, and you would just bind it to two ports, and you could connect to either one of those. That's a less ambitious piece that would still be useful; that might get us part of the way there. And there's one other one.
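A minimal sketch of that less ambitious piece — one logical endpoint listening on both an IPv4 and an IPv6 socket (standard POSIX calls, error handling trimmed):

    // Sketch: bind the same service on IPv4 and IPv6 so clients can
    // connect over either -- two listening sockets, one logical endpoint.
    #include <arpa/inet.h>
    #include <cstdint>
    #include <netinet/in.h>
    #include <sys/socket.h>

    int listen_v4(uint16_t port) {
      int fd = socket(AF_INET, SOCK_STREAM, 0);
      sockaddr_in a{};
      a.sin_family = AF_INET;
      a.sin_addr.s_addr = htonl(INADDR_ANY);
      a.sin_port = htons(port);
      bind(fd, reinterpret_cast<sockaddr*>(&a), sizeof(a));
      listen(fd, 128);
      return fd;
    }

    int listen_v6(uint16_t port) {
      int fd = socket(AF_INET6, SOCK_STREAM, 0);
      int on = 1;
      // Keep this socket v6-only so the separate v4 bind above succeeds.
      setsockopt(fd, IPPROTO_IPV6, IPV6_V6ONLY, &on, sizeof(on));
      sockaddr_in6 a{};
      a.sin6_family = AF_INET6;
      a.sin6_addr = in6addr_any;
      a.sin6_port = htons(port);
      bind(fd, reinterpret_cast<sockaddr*>(&a), sizeof(a));
      listen(fd, 128);
      return fd;
    }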
A: What does, like, the ScyllaDB kind of architecture look like in regards to DPDK and all of this? Do they have anything that they've worked through that didn't work well, or did work well? I'm not sure exactly. That's a good question.
B: Well, stepping back a minute, a useful midpoint is: we could use DPDK to grab the entire NIC and have multiple OSDs in the same process, but they would still be running independent messengers on different ports, right — still using messenger v1 — and that still captures, like, most of our goals, right? The performance ones, just not the messenger v2 ones. So I don't think we're actually blocked on the new messenger.
F: I don't think that's possible, because you just — like, you get interrupts on CPUs. Or maybe they do polling, but, like — so maybe they can poll, like, pre-registered or, like, DMA'd memory that the NIC has been writing to directly, and they just skip over stuff that isn't theirs. But they're still going to have to see it on some level, at least.
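Roughly what that polling model looks like, sketched generically (not the DPDK API): a ring of DMA-written descriptors that each consumer scans, skipping entries addressed to someone else:

    // Sketch: consumers poll a shared RX ring that the "NIC" DMA-writes,
    // and each consumer only claims packets addressed to it.
    #include <array>
    #include <atomic>
    #include <cstdint>

    struct RxDescriptor {
      std::atomic<uint32_t> ready{0};  // set by the NIC when DMA completes
      uint16_t dest_port = 0;          // which consumer this packet is for
      // ... payload pointer, length, etc.
    };

    constexpr size_t kRingSize = 256;
    using RxRing = std::array<RxDescriptor, kRingSize>;

    // Poll loop for one consumer: it still "sees" every ready slot, but
    // only consumes the ones that are its own.
    inline void poll_once(RxRing& ring, uint16_t my_port) {
      for (auto& d : ring) {
        if (d.ready.load(std::memory_order_acquire) == 0) continue; // empty
        if (d.dest_port != my_port) continue;   // not ours: skip over it
        // ... process the packet, then hand the slot back to the NIC:
        d.ready.store(0, std::memory_order_release);
      }
    }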
E: Yes — this could be a library depending on that, whichever is faster, sitting on the NIC hardware, where...
A: We had some evidence, and concern, that indexes and filters might be pushed out of cache, especially in, like, RGW cases where there's a lot of key-value pairs in the database and a lot of key-value data per object. So what this does — there's actually a separate PR for RocksDB that exposes high-priority pool information from the LRU cache. RocksDB does not have this capability for high-priority pools with the clock cache, so that's unfortunate.
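For context, a sketch of the knob family involved — reserving a high-priority slice of the LRU block cache and steering index/filter blocks into it, using RocksDB's public options from roughly that era (check your version's headers):

    // Sketch: give index/filter blocks a high-priority slice of the
    // LRU block cache so data reads are less likely to evict them.
    #include <rocksdb/cache.h>
    #include <rocksdb/options.h>
    #include <rocksdb/table.h>

    rocksdb::Options make_options() {
      // 1 GiB cache; the last argument reserves 50% as the high-pri pool.
      auto cache = rocksdb::NewLRUCache(1UL << 30, /*num_shard_bits=*/4,
                                        /*strict_capacity_limit=*/false,
                                        /*high_pri_pool_ratio=*/0.5);
      rocksdb::BlockBasedTableOptions table;
      table.block_cache = cache;
      table.cache_index_and_filter_blocks = true;
      table.cache_index_and_filter_blocks_with_high_priority = true;

      rocksdb::Options opts;
      opts.table_factory.reset(rocksdb::NewBlockBasedTableFactory(table));
      return opts;
    }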
A: It means that right now this only kind of works well with the LRU cache implementation, but maybe we can improve that. Having said that, the idea here is that we prioritize the indexes and filters in a high-priority pool. First, we try to always give that memory, and when there are drastic changes in the usage — say, all the indexes and filters get flushed out of cache —
A: — then we very slowly shrink that pool, so that if they come back in quickly, we don't have to, like, reallocate it really fast. Right now this happens every five seconds, so it's pretty low-overhead; there's not a whole lot of change. It may be that we can speed that up, or that we want to slow it down, but kind of the goal here is low impact. It's just, you know, very slowly looking at how to rebalance these caches.
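The shape of that slow-shrink loop, sketched with hypothetical numbers and helper names (the real tuning lives in the PR under discussion):

    // Sketch: every few seconds, nudge the high-priority pool toward its
    // observed usage instead of resizing it all at once.
    #include <algorithm>
    #include <cstddef>

    struct CacheStats { size_t high_pri_used; size_t high_pri_reserved; };

    // Shrink the reservation by at most 5% per tick, never below usage,
    // so a burst of evicted index/filter blocks can come right back.
    inline size_t next_reservation(const CacheStats& s) {
      size_t step = s.high_pri_reserved / 20;  // 5% of the current size
      return std::max(s.high_pri_used, s.high_pri_reserved - step);
    }

    // Caller's loop (every 5 seconds in the current patch):
    //   while (true) {
    //     stats = query_cache();                       // hypothetical helper
    //     set_high_pri_reservation(next_reservation(stats));
    //     sleep_for_5_seconds();
    //   }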
A: It may actually be that focusing on giving the low-priority block cache more memory is important in that case. But part of this ties into this kind of weird behavior in RocksDB where, during compaction, if the amount of data in the low-priority pool exceeds the soft cap that you set, then all of the indexes and filters get flushed out of the high-priority pool — and I don't know why that is. I kind of generally asked about this on the RocksDB Facebook dev group and didn't get an answer back, so I'm not sure people even really realize it's happening, since no one had any ability to even look at what was happening in the high-priority pool before. So there may be something there where trying to optimize around this case doesn't make sense, because it's just broken behavior — but we'll find out, I guess. So I guess what it comes down to now is: I'm doing a lot of testing, trying to look at, okay...
A: Ultimately, after the onode cache and the low-priority KV cache, then we have data that can potentially be cached as well for buffered reads in BlueStore. So that's kind of the last in order right now — the priority, at least currently, according to this thing. So I have a bunch of test data, but I haven't really organized it yet; I'm still collecting more stuff. Hopefully next week I should have a nice set of crazy, dense graphs that will show some of these behaviors. But that's basically it.
A: I have not actually tried doing something like setting it really fast and then looking at profiles. Right now, at, like, five seconds, it doesn't appear to be particularly much of an overhead at that resolution, I guess. But I think at some point, once maybe the kind of overall behavior has been worked out, then we should look at: okay, if we start doing this really often, how bad is it? Yeah — the truth is, I'm not totally sure.
A: Josh, do you remember — do people ask about that stuff very often? I mean, are people tweaking the size of those buffers or caches on the client side?
A: One of the things over the years that I've noticed is that people have a habit of just, like, finding some random tunings that somebody's made and then, you know, copy-pasting them in. So they'll have, like, 32 — you know — OSD threads, and things bumped randomly in various crazy ways, and it doesn't really make any sense; they just, like, bumped everything way up. Yeah, that's kind of why I want a...
C: A question related to our mutex abstraction, I mean — and the DPDK... sorry, in the Seastar OSD we have mutexes in many places, including also the shared common base. What do we want to do with that? Maybe it's a good time to try to resolve this problem, as we are going to rework the mutexes anyway.
E: There are, I guess, a couple things there. One is that, eventually, when everything is in the Seastar framework, we won't need mutexes — like, I'd expect that the mutexes there are kind of Seastar-style mutexes, which are not doing any atomic operations, but are basically just serving as booleans for lock/unlock within one core.
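A sketch of that idea: within a single reactor core there is no preemption between tasks, so the "mutex" degenerates to a flag plus a queue of waiting continuations — generic C++, not the actual Seastar API:

    // Sketch: a cooperative, single-core "mutex". No atomics needed,
    // because all tasks on one core run to completion without preemption;
    // the lock is just a boolean plus a wait-queue of continuations.
    #include <deque>
    #include <functional>

    class core_local_mutex {
      bool locked = false;
      std::deque<std::function<void()>> waiters;
     public:
      // Run fn while holding the lock; queue it if the lock is busy.
      void with_lock(std::function<void()> fn) {
        if (locked) { waiters.push_back(std::move(fn)); return; }
        locked = true;
        fn();
        unlock();
      }
     private:
      void unlock() {
        while (!waiters.empty()) {
          auto next = std::move(waiters.front());
          waiters.pop_front();
          next();          // still holding the lock: run the next waiter
        }
        locked = false;
      }
    };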