From YouTube: Ceph Performance Meeting 2022-09-15
Description
Join us weekly for the Ceph Performance meeting: https://ceph.io/en/community/meetups
Ceph website: https://ceph.io
Ceph blog: https://ceph.io/en/news/blog/
Contribute to Ceph: https://ceph.io/en/developers/contribute/
What is Ceph: https://ceph.io/en/discover/
A: So I don't have anything prepared for this morning, but Adam and I have been spending a lot of time this week talking about how shared blobs work in BlueStore, so I imagine we'll probably discuss that quite a bit today.

A: Maybe before we get into that, though, I'll open it up for other people: are there any topics that people would like to bring up today? Sorry.

A: All right! Well then, all of this comes out of the RBD mirroring performance issue that was seen, which we've been talking about quite a bit.
A: The gist of it is that the work I had been doing to see if I could improve the performance of iterating over extents kind of worked. Casey, your idea regarding the move constructor was good; it did work in the end, or at least it somewhat worked, at the very least enough that I was able to run normal benchmarks fairly fast. In fact, for normal BlueStore benchmarking it looked like...

A: Maybe we even got something like a five percent CPU usage reduction with at least similar performance levels, so that's a win, but I ended up with really irritating, strange issues with reference counting in other parts of the code that Adam and I sat down and tried to debug and theorize about, and it turns out that it didn't look like this was really doing a whole lot for the issue that we were hitting with snapshots, so that kind of got put on hold and maybe abandoned; we'll see.
A: But we ended up looking at the ref map inside SharedBlob, and Adam, I'll let you talk about everything that you've discovered. I'll just say I did go through and try to replace that; I did actually successfully go through and replace it with a flat map, got all the iterator validation out, and that runs fine now, seemingly. The downside is that the benefit is pretty small. I've got the numbers in a spreadsheet.

A: You can kind of tell that it's helping a little bit: the yellow line, or the yellow peaks, are the ones using a flat map for the ref map versus the original std::map. That std::map exists for every single shared blob, and we have a lot of them per object in this kind of workload, and that ref map is basically just storing, like, eight bytes of data; it's ridiculous.
A
It's
tiny,
there's
lots
and
lots
of
those
maps,
but
that's
again
starting
to
get
into
the
stuff
that
adam
discovered
so
adam.
Do
you
want
to
talk
about
the
things
you
were
working
on
and
what
you
saw.
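(A minimal sketch of the kind of swap being discussed here, not the actual Ceph patch; the per-shared-blob ref map is an ordered offset-to-refcount container, and for the handful of entries it usually holds, a sorted-vector map such as boost::container::flat_map can stand in for std::map with the same interface. All names below are illustrative.)

    #include <boost/container/flat_map.hpp>
    #include <cstdint>
    #include <map>

    // Per-shared-blob ref map: disk offset -> reference count. A std::map
    // allocates one tree node per entry; for the few entries these maps
    // typically hold, a flat (sorted-vector) map keeps everything in one
    // small contiguous buffer while exposing the same lookup interface.
    using ref_map_old_t = std::map<uint64_t, uint32_t>;                    // current shape
    using ref_map_new_t = boost::container::flat_map<uint64_t, uint32_t>;  // the swap discussed

    inline uint32_t lookup(const ref_map_new_t& m, uint64_t offset) {
      auto it = m.find(offset);
      return it == m.end() ? 0 : it->second;
    }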
B: Yeah, sure. I'm just a bit out of focus with the browser; I tried to find my PR, maybe.

B: It revolves around earlier work that I did about just printing onode metadata internals, because I knew what the structure of the data is, but how it's actually being used was unknown to me. So I made an example, a tool that allows me to print the exact metadata as it encodes objects.
B: We can clearly observe what happens when we modify an object that is shared, meaning we have some, let's say, already shared object. A shared object here means that we already did a snapshot of it, so the blobs that are in the object are already shared. So when we write to a head object that has a snapshot...

B: If you think about it as a one-time operation, it makes perfect sense; that's exactly what you should do. But the test we were tackling was that we had an object that we modified and had already made many snapshots of: we made one snapshot, wrote some data, made another snapshot, and so on. The effect is that we have new blobs that basically cover only the data modified in a specific period between snapshots, just a sequence of those shared blobs.
B: That still is not so bad, but then we delete. When we are still required to keep all the snapshots, then we have to somehow maintain the data, and that might be the proper way to do it. But in the test we delete the snapshots, and that looks like a reasonable real-life case.

B: I will share my results from a simulator. Basically, the effect is that we can get rid of those additional blobs and keep the number of shared blobs at something like five when you have five copies of an object, for a reasonable intensity of random writes.
B: Basically, you might not know, but what we currently have as a shared blob entity in the X column in RocksDB does not track the blob at all. The shared blob is basically only a tracker of how many times a specific region of the disk was used by a shared blob. And in addition to that, each object that's referencing the shared blob has its own entire encoding of the blob.
B
That's
it
and
for
right
cases,
when
you
modify
object,
you
modify
your
local
blob
and
then
you
notify
shared
blob
that
some
references
might
have
been
unused
and,
of
course,
if
ref
counter
goes
down,
the
else
case
is
not
important
at
that
moment,
so
we
are
trying
to
maybe
simplify
the
case
here.
By
attempting
I
don't
know,
maybe
I
mean
various
ideas
I
had
today,
one
that
is
either
stupid
or
ingenious
to
let
allocator
disc,
allocate
or
actively
count
references.
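(To make the tracking role concrete, here is a minimal sketch of an offset-range reference tracker of the kind being described. It is written from scratch for illustration, not taken from the actual BlueStore code, and the names and the 4 KiB allocation unit are assumptions: get() bumps the count for a disk region, put() drops it and reports regions whose count reached zero so a caller could release them to the allocator.)

    #include <cstdint>
    #include <map>
    #include <utility>
    #include <vector>

    // Illustrative shared-region tracker: disk offset -> reference count,
    // kept at allocation-unit granularity for simplicity.
    struct shared_ref_tracker {
      static constexpr uint64_t au = 4096;          // assumed allocation unit
      std::map<uint64_t, uint32_t> refs;            // offset -> refcount

      void get(uint64_t offset, uint64_t length) {  // a clone/snapshot adds a ref
        for (uint64_t o = offset; o < offset + length; o += au)
          ++refs[o];
      }

      // Drop a ref; append regions that became unreferenced (and could be
      // returned to the allocator) to 'released'.
      void put(uint64_t offset, uint64_t length,
               std::vector<std::pair<uint64_t, uint64_t>>& released) {
        for (uint64_t o = offset; o < offset + length; o += au) {
          auto it = refs.find(o);
          if (it != refs.end() && --it->second == 0) {
            refs.erase(it);
            released.emplace_back(o, au);
          }
        }
      }
    };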
B: So that would be the case where an object is no longer referencing part of a shared blob: it will just release the data and be done with it, and the allocator will behave appropriately. So yep, that's it, and now let me find and copy the results of the simulator showing how the number of shared blobs changes.
A: Well, Adam's doing that, Igor.

A: Some of the things we've also talked about are whether or not we could, well, I suppose, if we removed compression from the blob, whether that would allow us to not keep track of the byte granularity anymore, and whether we could do it at the min_alloc size instead. You probably know this code better than anyone, I think; what are your thoughts on all of this?
E: I don't remember exactly which cases rely on it, because it worked, and so things started getting built; a couple of things were built on top of it.

E: What I know of is the fscrypt integration in CephFS, and I think that's been adjusted to be less fussy now, because we discovered that some OSD configurations behave differently than we thought, but in particular the min_alloc size is a user-configurable thing, and it has in the past been, like, 64 kilobytes, and it definitely needs to be finer granularity than that.
B: Those are links to a simple simulator of how many blobs we will use if we allow ourselves to integrate newly written local blobs into some shared blobs that we already own in an object. That, of course, relates to the process of making the extent map dup, so the snapshot moment.
B: The sequence is... the name says how many, like 16 AUs; it's how many allocation units are allowed per blob. And the sequence, you can read it like this: there is a sequence of iterations, and first there is a part that simulates writes, and here we see a wave sign that marks where we write a new element; then there is a snapshot, and when the snapshot happens, all the newly written data... the spaces mean empty holes.
B: So when we do the snapshots here, snapshot A... the later letters say what the shape of our shared blob is and which segments it represents, and we can see in the dump of blobs that each region of the shared blob now has two references, and it goes on and on, and when we try to... we would need to go some way down.

A: Adam, is it worth sharing your screen, or...?
B: Okay, and for other people, that's the first link of the four I shared. The simulator sequence is like this: it goes in 50 iterations. Each iteration first tries to write some random data to the object, basically 16 times the allocation unit in size; that's all that's interesting, and in between there is a...
B: ...there's a write phase and a snapshot phase. You can see that the places where we put some data in the first write are now converted to a shared blob in both objects: object H, which is the head, and object 1, which is snapshot number one, and there is a dump of the content of the shared blobs.
B: When reuse is not in effect... that's the same simulation, but when reuse was not enforced. You can see in lines 18 to 20 that a new blob appeared, so in the case where previously we could reuse some space of the same blob, now we had to create an entirely new blob, and that's how we do it now; that's the current way.
A: Agreed, Adam, I don't think we can guess at it; there are too many moving parts here. I mean, the fact that we'll have more extents in the ref map per shared blob means that the characteristics are going to change dramatically compared to what they are now.
B: We even end up with fewer blobs than with a smaller blob size. That's actually a predictable outcome, since if there is a larger blob size, there is more ability to find some blob that will still be matching. Oh, I forgot to say one limitation: the algorithm only tries to reuse blobs that were already used in the encoding of the current object, so if there are some blobs that are not part of the actually-used set, they are skipped in the attempts to find space in those blobs. Yep, guys.
A: So Adam, would it be a good time to talk about the profiling results that we looked at? Is that useful?

A: Sorry this is taking so long; this is not working very well.

A: Oh okay, I'm going to give up on this for now; it has permission problems as well, apparently. So anyway, I guess you can look at it in the chat window there. Okay, so...
A: There's an improvement, a very slight improvement in places, but there's a lot of other stuff going on in here too: definitely management of intrusive pointer references, there's some encoding and decoding in here, creation of OldExtents. Alt range is taking up some time, opening shared blobs; there's decode, blob decode. The gist of it is there's a lot going on in here; I mean, it's not just one thing, it's just the sheer quantity of extents and shared blobs and other data structures that are involved when we have a lot of fragmentation across the snapshots.
A: There's just a lot going on. So if we need the byte-level granularity, I don't know; I guess if there are ways that we can reduce the quantity of data structures that we're dealing with, that would probably help us. Short of that, it's going to just take a lot of optimization in a lot of different areas, a lot of really small gains that maybe add up to something, I don't know, but that's what it looks like to me now.
B: I think we should simplify the data structures we have in BlueStore now, specifically because some of the cases we use them in don't really make much sense. For example, we correlate the buffer cache with the shared blob; that gives us the ability, if we read one head object, to then read a snapshot object.

B: The idea is also, as Mark already looked at recently, that the blob ref map type is a full map that basically, in most cases, contains only four elements, so again, simplifying that would be a huge benefit, and so on.
A: If I remember right, in SharedBlob the buffer space, was that a pointer to the BufferSpace that contains the map of objects, or the cache?
B: It keeps objects, it keeps intrusive pointers to buffers, yeah, and then there's the relationship to the buffer space cache, which is at the collection level, I guess, so the cache can follow all the data.
B: If we attempt to do that, we could separate shared blobs, or possibly-shared blobs, from blobs that are basically regular blobs, because right now the only way to cache data... I mean, we cache data in a regular blob by creating a view of it as a shared blob, and then we add it to the cache, yeah.
B: The place where we can see that we now assume it's always present is the read cache, which iterates over extents in objects, and for all that are present we just assume that the shared blob does exist, with just the reference, and there are no ifs.
A: I wonder, in this pathological case that we're looking at right now, if we'd really end up with very many fewer blobs. At that point, though, maybe we'd have some for a while, right? We wouldn't need it for any regular blobs, but eventually, like, everything becomes a real shared blob, right?
B: I'm thinking that maybe we should be extra careful not to optimize for some pathological case, like basically the one that we're analyzing now with Paul's testing. Well, not over-simplify for such a case, I mean.
B: I guess I was trying to deduce from the code and from old PRs, thanks Mark, by the way, and maybe some documentation if I found it, what the actual goal of the current architecture of shared blobs was, because I definitely don't want to cut down some functionality that isn't implemented yet but that we should implement one day, and on the other hand, I would love to cut down anything that we will not practically use and is just a leftover burden from a past era.
A: It kind of... during the transition from NewStore to BlueStore, we changed a lot of stuff really fast. I don't really remember what our thought process was.
F: Something like that, I mean, well, maybe.
A: I don't... Adam, I don't know what you'd think; do you think that the behavior with, like, CRCs that you saw would be classified as a bug or as a design detail?
B: I found out recently that when we encode CRCs for blobs that have, like, a 32-kilobyte hole in front and then four kilobytes of actual data, then when we encode the CRCs, we basically dump into metadata 8 entries for that 32 kilobytes and then one entry for the actual data, and that was a very severe metadata consumption spot for that Paul's test, when we suddenly get a lot of shared blobs containing just one allocation-unit modification.
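(A small worked example of the overhead being described, with assumed values: a 4 KiB checksum chunk and a blob spanning a 32 KiB hole plus 4 KiB of real data. The numbers are for illustration only.)

    #include <cstdint>
    #include <iostream>

    int main() {
      constexpr uint64_t csum_chunk = 4 * 1024;     // assumed csum granularity
      constexpr uint64_t hole       = 32 * 1024;    // unwritten space in front of the data
      constexpr uint64_t data       = 4 * 1024;     // actual modified data
      constexpr uint64_t blob_span  = hole + data;  // checksummed length of the blob
      std::cout << "csum entries stored: " << blob_span / csum_chunk            // 9
                << " (only " << data / csum_chunk << " covers real data)\n";    // 1
      return 0;
    }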
A: It seems to me like that's the kind of thing that we should probably try to fix in the current code, but maybe some of these big ideas regarding drastically changing things, maybe that would be better to do more isolated, especially for changing the on-disk format, I guess. I don't...
B: Yes and no; I was thinking about just a software modification, I mean without an on-disk format change, that would basically move the front of the blob to a different position. But I was pretty scared that that's never going to happen in the current code.
B: Well, I don't think so. If I can cut a blob and move the position, then I can also extend a blob by moving its starting position, so that seems to still hold. I mean, the only thing that I agree should be preserved is that I shouldn't chop my blob up; like, let's assume I have three allocation units' worth of modifications in one blob, spread over a bit. Then I should not basically create three different blobs, one per allocation unit, because that would be inefficient.
G: A suggestion: did we consider the option of not using snaps for RBD mirroring? I mean, doing the mirroring by creating a snap every 15 minutes and deleting them is an extremely expensive operation. I mean...
G: So you could do a very simple thing; like, I'm not saying it's the best solution, just something from, like, five minutes of thinking: while writing, you could duplicate the write. Everything that you want to do could be written to some kind of cyclic log buffer, and then in the background you could push that thing.
G: Like, yeah, a journal. So every write you want to do to the primary you do normally, and then you also send the same write to the journal, and if we want to be nice, we could even make a special command for the OSD, so the OSD would split the write into the journal, and then the journal will have a checkpoint, and you have some background process on the OSD itself sending them to the remote. So you don't need the client, the RBD, to do the mirroring, because why should you read it?
G: And so what you have at the moment is that the RBD client is reading from the OSD and then sending it to the other one. So if the OSD knows that it has to do the snap, it could use this journal and then push everything to the remote when it has the time, and it has a checkpoint, and it can have a point in time. I mean, that's just something which, I know, it's like...
G: Probably... I didn't do any smart calculation; there are, like, gazillions of holes in the design, but if we spent two weeks designing it, I'm sure you could find some different solution. Sometimes you do solutions because they were straightforward to design and they worked, but over time you realize it wasn't the best solution; at the time it was good to do it because it gave you a chance to do things quickly, but eventually you go back and say, you know what, let's refactor this thing and do it differently.
G: So you can send the write to the OSD with a bit saying this thing should also be mirrored somewhere else, and then the OSD would write to one place, keep something in a journal, and there's going to be a background process on the OSD pushing things from the journal to the remote, whatever.
G: I didn't hear you, Igor, so I didn't get what you're saying; your microphone...
F: I mean, from your words it sounds like there is some similarity to the PG log stuff.
A: I will say that, in general, we've gotten feedback about snapshots being slow and problematic beyond RBD mirror; there is some general sense... this isn't in isolation. I think people are suffering when they end up with a lot of shared blobs.
G: I don't know, some extra space, some... it's got to be more efficient, but it's going to hit the same roadblock; you're just going to get to the roadblock later. So it's not going to take you two hours to get there, or 12 hours; it's going to be 24 hours. But if you keep running it, and RBD mirror is just running continuously, we've seen that after 12 hours this thing is bad. So let's assume that Adam and Igor are going to make things much, much better.
A: ...cannot do this; we hit a saturation point. Basically, once you hit, like, the same number of shared blobs as extents. So say you've got a four-megabyte object and you do this pattern that we see with random writes: eventually you end up with almost 1024 shared blobs with one extent in them each, essentially. Not quite, it's close, but you know, it's roughly that; that's where we hit saturation, it looks like.
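(That saturation figure follows from simple arithmetic, assuming a 4 MiB RBD object and a 4 KiB allocation unit; the snippet below is just that calculation spelled out.)

    #include <cstdint>
    #include <iostream>

    int main() {
      constexpr uint64_t object_size = 4ull * 1024 * 1024;  // 4 MiB RBD object
      constexpr uint64_t min_alloc   = 4 * 1024;            // assumed 4 KiB allocation unit
      // Worst case after long-running random writes across snapshots:
      // one single-extent shared blob per allocation unit of the object.
      std::cout << object_size / min_alloc << " shared blobs per object\n";  // 1024
      return 0;
    }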
A: I think it's bounded; I don't think that we continue to just use more and more CPU forever. Once we get to the point where we have as many shared blobs as we can possibly have, or close to it, that's when we end up basically, you know, hitting the limit, but it causes a lot of disruption for sure: CPU usage, a lot of work during dup.
A: And, off the top of my head, I had the opposite thought earlier today: I was thinking, well, if we're going to end up at the maximum level anyway, what if we just optimized for the idea that you always have lots and lots of shared blobs?

A: Well, I was even thinking, if you just assume that you have basically a shared blob for every extent, can you make certain things simpler at that point?
G: It's just going to mean that it's like... if you decide that all the cars can never go faster than 10 miles per hour, then you're not going to see so many... you are going to be slow, yeah, but you're not going to be surprised by "oh, today I had bad traffic", because you're going to have bad traffic every day.
A: And yeah, I mean, I'm admitting I'm being kind of ridiculous here, but my thought process was, well, if you knew that this was the case, then could you make the stuff sitting inside the shared blob and the ref map maybe not so heavy anymore, if this was just kind of the way it is? But it's probably, like you said, Gabby, maybe just being a little ridiculous, I don't know.
G: And Adam, do you think the change that you guys are suggesting is going to eliminate this problem, or is it going to just give us more breathing room, but eventually you're going to hit the same problem if we keep doing the same procedure? So RBD mirror is issuing a new snap every 15 minutes and is continuously doing random writes; are you going to eventually get the same problem, or is it just not going to happen, because you have a solution which guarantees that this thing cannot happen?
G: Okay, so you're going to get a 4x improvement, which is very, very significant, but is it enough for our case? Because Paul was showing me a 24x slowdown, so now we're going to be at a 6x slowdown?
G: I mean, it's a huge improvement; I'm not saying don't do it, I'd say definitely do that. But what about the 6x slowdown, is that acceptable? And again, if you got a 4x improvement, then nothing should stop you from doing that, you must do that, but why not a real solution with the journaling, which in Paul's case should probably be only 2x slower?
A
Adam
I
want
I
wanted
to
ask
you
before
we
get
into
the
performance
specifically.
Does
your
change
actually
reduce
the
upper
bound
on
the
number
of
shared
blobs
that
you
would
end
up
with
per
per
oh
node.
B: Theoretically, yes, it does, because in the reuse case there should be at most as many shared blobs per default-sized shared blob region as there are copies of the object, no more, because if you have more, then you should be able to find space in a shared blob that you can fit your new data into.
B: The upper boundary would be the number of shared blobs per object, meaning four megs divided by 64k, times the number of snapshots we have live, but that's the upper bound, not the expected value.
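(Spelling that bound out with the numbers from the discussion, 4 MiB RBD objects and 64 KiB shared-blob regions; the snapshot count is just an example value.)

    #include <cstdint>
    #include <iostream>

    int main() {
      constexpr uint64_t object_size    = 4ull * 1024 * 1024;  // 4 MiB
      constexpr uint64_t region_size    = 64 * 1024;           // 64 KiB shared-blob region
      constexpr uint64_t live_snapshots = 5;                   // example value
      uint64_t regions = object_size / region_size;            // 64
      std::cout << "upper bound: " << regions * live_snapshots
                << " shared blobs per object\n";               // 320 for 5 live snapshots
      return 0;
    }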
A: So I think the upper bound is technically still the same if you had unlimited... like, as you're approaching an infinite number of...
A: So Gabby, getting into what you were saying with performance, I guess this change is valuable inversely to the number of snapshots that you're keeping.
A
So
the
hope
is
that
if
you
only
have
like
one
or
two
snapshots,
you
can
have
significantly
fewer
shared
blogs
than
you
can
now,
but
as
the
number
of
snapshots
increases
and
approaches
infinity,
the
closer
you
get
to
having
the
maximum
theoretical
limit
of
of
shared
blobs
per
object
that
you
you
did
previously
so
right
now,
after
like
12
hours
right,
we
we
hit
close
to
10
24
sharp
blobs
per
object.
It's
not.
A: But it's pretty close; it's around a thousand, whereas with Adam's change you would have a lower upper boundary at a low number of snapshots, but as the number of snapshots increases, you get closer to that, like, 1024-limit upper boundary that we have now.
B: So in our current situation, we had writes in different places and we get more and more shared blobs throughout the span of the entire object, but with the change that is trying to reuse space in an already-used shared blob, once all the variants are the same, you would just have one shared blob that was snapshotted six times with nothing changed in it; then it should be basically the same, it should revert back to the original variant, it will no longer be fragmented.
A: And right now it's like we don't even have a soft limit, right? As we go, we just fragment up to the maximum, whereas with yours, if I understand correctly, you would have a soft limit below that hard limit that you wouldn't go beyond, but it would increase as the number of snapshots increases.
A: It will make the get and put calls inside the ref map more expensive, because we do... well, for the current implementation with a map, actually it'd only be the lower_bound that would be slower, because insert should basically be the same cost.
B: I mean, historically I could imagine how it worked, because, remember, we started from blobs being, like, 512k, and if you think in those sizes, then having one shared blob that encompasses an entire region really makes sense: you trade so many allocation units for one region of the object that says "this part is shared and it's in multiple objects", and we don't want to iterate over so many elements.
B: That was me answering your question of why we do have shared blobs for that usage.
G: Sure, probably, if 1024 shared blobs per object is a reasonable thing to have. If it's reasonable, then definitely go for shared extents; you gain nothing from the blobs. If it's a very crazy synthetic benchmark, then maybe shared blobs make sense; I don't know that they do, but it depends how many shared blobs you have per object: if it's 1024, then it's definitely a bad idea; if it's two, three, four, then yeah, sure, it makes sense.
B: Nice, but we cannot do that currently, since we do not know who the owner of the other references is. You have your object, and you see that you have a lot of shared blobs in your metadata, but you have no idea who else keeps the other references, so if you do anything with your object, you will still have no way to optimize the others.
G
If
I'm
going
to
add
my
memory
table
with
active,
let's
put
it
just
for
the
once
I'm
doing
fermentation,
then
you
can
always
check
yourself
in
this
table.
If
you're,
not
there,
then
do
whatever
you
want.
If
you
are
there,
then
you
need
something
and
before
you
do
anything
else,
you
must
put
yourself
to
kept
to
to
allocate
an
entry
in
the
hash
table
and
then
you
don't
need
persistency
or
anything.
G: I'm saying, if you wish to do something on a shared extent, any modification, you need to grab a lock, and the lock could be held in a global hash table. You query the hash table, and if it's there, then you cannot take it, or maybe you try to grab the lock if the lock is free; if it's not there, you're going to create the entry and have it locked, and then anybody who wants to do anything which requires modification of a shared extent has to go through this.
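(A minimal sketch of the in-memory lock registry Gabi is describing, written for illustration only; the key type, names and locking policy are all assumptions, not an existing Ceph structure.)

    #include <cstdint>
    #include <mutex>
    #include <unordered_set>

    // Global registry of shared extents currently being worked on
    // (e.g. defragmented). Nothing is persisted: an entry exists only
    // while someone holds the "lock" on that extent.
    class shared_extent_locks {
      std::mutex m;
      std::unordered_set<uint64_t> busy;   // key: shared blob / extent id (assumed)
    public:
      // Returns true if the caller now owns the extent, false if someone else does.
      bool try_lock(uint64_t id) {
        std::lock_guard<std::mutex> g(m);
        return busy.insert(id).second;
      }
      void unlock(uint64_t id) {
        std::lock_guard<std::mutex> g(m);
        busy.erase(id);
      }
    };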
G: Now, it's not necessarily a very good idea; it requires that all the active shared extents are held in a memory table, but then this thing is doable.
B: Well, my approach to even trying to be able to do such defragmentation was to somehow assign a shared blob namespace to a single object. I mean, when you create an object, you get a shared blob namespace, and then all the shared blobs that you create for this object and for all of the snapshots of this object will belong to the same shared blob namespace.
B
That
way,
you
will
have
a
limited
set
of
possible
actors
that
do
play
a
role
in
your
shared
blobs,
and
even
if
that
was
like
a
hundred
objects,
you
could
still
know
that
they
shared
the
same
id
namespace
id
with
you
that
that
way
you
could
optimize.
But
that's
that
was
only
me
thinking
not
even
being
close
to
try
to
implement
it.
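(A sketch of how the per-object namespace idea might look at the key level, purely illustrative; the field names and key layout are assumptions, not the current on-disk format.)

    #include <cstdint>
    #include <tuple>

    // Idea: every head object gets a shared-blob namespace id at creation,
    // and every shared blob created for that object or any of its snapshots
    // is keyed under that namespace. All possible co-owners of a shared blob
    // then share one namespace id, which bounds who has to be considered
    // when trying to defragment or drop references.
    struct shared_blob_key {
      uint64_t namespace_id;   // assigned per head object (assumption)
      uint64_t blob_id;        // unique within the namespace
      bool operator<(const shared_blob_key& o) const {
        return std::tie(namespace_id, blob_id) < std::tie(o.namespace_id, o.blob_id);
      }
    };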
A: I have to say, as we talk about this, I really hate how, when we take a snapshot, it's like we have all these references to the thing that is, like, the real object, right? I almost want that snapshot to be an immutable thing, and what's live maybe can reference some of it and maybe doesn't, you know, like if it's changed, but I want to be able to make the decision about...
A
Like
that,
we
keep
up
all
of
these
references
to
like
little
blobs
of
things,
and
we
have
to
have
them
grow
over
time,
because
we
can
keep
making
modifications
to
the
new
thing,
like
the
new
thing
then
makes
it
so
that
now
we
have
like
everything
divvied
up
in
weird
ways,
because
we
have
this
old
snapshot
that
has
all
you
know
different
data,
it's
like
it
feels
like.
A
Instead,
we
should
have
like
this
solid
base
that
then,
when
we
create
like
when
we
modify
an
object
or
change
something
new,
then
you
know
we
maybe
have
some
old
portion
of
the
data
that
we
can
reference,
but
maybe
we
don't
maybe
we
actually
create
a
whole
new
copy
of
it
and
instead,
rather
than
you
know,
trying
to
pick
at
little
bits
of
the
the
the
old
one.
Does
that
does
that
make
sense?
B: You still have to remember what you had, and the new ones need to know what's modified. But you don't want to have a head object being an update of some frozen object; you want a head object to be actually real and just, like, make a backward difference for the other object. But can it be done? Like, you set some omap data for your head object; you might think, okay, I set new omap data for the head object, but I have to note what the previous object had set.
A: Okay, so, Adam, here's the problem I'm thinking of: say I've got a four-megabyte object, and once I get to the point where I've got a hundred shared blobs for this thing, I don't want it to care anymore about referencing the old thing; I just want to make a new copy that, then, is, you know, one big extent.
B: Sacrificing space for performance and readability, okay.
A: I'm confused, Adam: when you have one shared blob and you do a snapshot, what...
B: This joins two things. One is the shared blob tracking that we use to count how many times we use specific disk offsets in a shared blob.
B: That's one thing, and the other is that we use SharedBlob inside the Blob implementation just for data keeping, and I only want to talk about the shared blob that is tracking disk usage, that part. The type is called bluestore ref map t, or something like that, yeah. And my thinking is: I will have only one such shared blob per object; when I allocate, I put it in, I mean, maybe only when I do the first snapshot... let's assume it's not.
B: When I snapshot, I just add one to what I already have in my current head, and when I delete some object... of course the snapshot inherits that shared reference to that tracker, and when the object is deleted, it just removes itself from the tracker, and then it works the same: if the tracker goes to zero for some region, it means the allocation unit is to be released by the transaction that did that.
D: I mean, internally this shared tracker would be pretty similar to the current shared blob, so it should keep a sort of mapping between offset and reference count, and it should...
B: I agree that possibly having... actually, I deliberately started talking about one to make it simpler, but I think one will not work. It should be somehow segmented into various offsets of the object, but it has to be fixed, so all the clones will know what to access when they reference or dereference disk usage.
A: It was like this fixed thing, with fixed-size regions that were compressed, and then you just had, you know, the ability to use it, so like, say, 64k or 128k or whatever, and then when you took a snapshot, the new version of your object could or could not choose to use one of those depending on whether or not it's changed, but it's not like it is now, where it divvies things up ever smaller; it's just a fixed size.
A
No
sorry
just
forget
the
compression
part
you
just
have
fixed
size
regions
of
the
original
data.
That's
like
yours,
your
shared
blog,
but
it's
when
you,
when
you
make
a
snapshot
for
what
you
now
have
as
your
your
current
live
object
or
whatever
you
want
to
call
it.
I
guess
it
can
or
cannot
use
those.
But
it's
not
like
you
divvy
them
up
smaller.
B: Yes, because all the usages I've seen and all the unit tests I've seen always have the same... the destination offset is the same as the source, and that makes perfect sense for all snapshots and also for cloning regular objects. To move the offset you would actually have to do some... I don't even know what you can achieve if you clone an object but move the offset.
A: Sounds good; thank you guys for the discussion. Hopefully we make progress here.