From YouTube: SIG - Performance and scale 2021-10-07
Description
Meeting Notes: https://docs.google.com/document/d/1d_b2o05FfBG37VwlC2Z1ZArnT9-_AEJoQTe7iKaQZ6I/edit#heading=h.hmx9wqksaqdy
A
Okay, it's October 7th! This is SIG Performance and Scale. Everyone, please add yourself as an attendee and add agenda items; feel free to add items as we go through. Okay, the first thing I have on this list is bugs, but we don't need to start with that; I was kind of hoping to start with something else. David, do you want to talk about your proposal at all? Do we have anything that you want to bring up with that?
B
First, for the virtual machine pools: I think we're just wrapping it up, so I would encourage anyone who is interested in this to definitely go and look at the proposal. I guess we can post that in the notes again as well, or I can do that. My goal is to have this merged as soon as possible, and the next step will be that I'll try to go ahead and create a PR that lays the foundation for all of this.
A
Okay, yeah, we talked about this one last time. This was the removal of the VM config API. Let's see, there are a few comments here; looks like some text.
B
Yeah, so Roman and I discussed what you're looking at briefly today, and we'll probably sort this out today or tomorrow; it's pretty minor. His comment here is: when we look at the policy selection of virtual machines, either for scale-in or for update, how do we add some kind of basic optimization? So if we want to select VMs that are shut down or paused first, before we actually touch active virtual machines, how would we do that? I think that would fall either under the random-based policy, or perhaps under something called a default policy, which would be: we take virtual machines at random, following a kind of tiered ordering. So, do we have any VMs that are shut down? All right, take a random selection of one of those for a scale-in. All right, we've exhausted that tier, and so on.
A
Okay, so I guess the assumption is that we'll have some sort of optimization where we use the running status of the VMI to make a selection, maybe before any of these. Or would it be...?
B
That's my take, because I think that's the expectation, and I think this would only apply to a base policy whose selection is kind of random. If you select a base policy that isn't random, so you say you want oldest-first or newest-first, then we don't really have any optimization we can do there, because you've told us exactly what you want. But if we have a policy like random, or we just call it default or something like that, then we have some leeway to do some more user-friendly optimizations that people would actually like, and it wouldn't hinder people who didn't expect it either: if they're choosing random, for example, they're going to get VMs that are potentially active or shut down, so if we give a little bit of help there, maybe that's not terrible. Okay.
A
That would be good to have, I think. And a description in the base policy, if you don't have it already; I think that would make a lot of sense. Okay, sure. And then the only other ones I scrolled by are these two. I haven't gone through the doc, but I think you've made some changes to it that these need to reflect, so I'll have to go through and check this. And then there was this one for some metrics, yeah.
A
Do you want to talk about this one, some ideas that we have around metrics?
B
Yeah, I actually meant to respond to this. I think I even typed it all up, and then I decided that we should talk about it in the meeting, and I totally forgot, yeah. So thanks for bringing that up. I don't have a great sense of exactly what will be needed quite yet; I thought of a few. Let me see what you had... oh yeah, let's go back through these.
A
I was trying to consider a little bit of both: the perspective of what I'd like to see if I were running this in production, and then a little bit of how we can integrate with some of the work we've already done with the performance testing. So the first one is the number of VMs that were started. We have this expectation that the pool is going to be some size, and we want to know what the churn is: how many VMs are being killed and recreated, and how often that happens, because maybe you can put a rate to it. I could kill one anytime I want, just delete it; how often is that happening?
B
I think it makes sense, yeah; I do think that makes sense. I'm curious, though: if it's "restarted", what we're really looking at here is... I don't know if restarts is the right word.
B
Yeah, I see what you're getting at; I'm just trying to think of how to accurately represent that. You're almost wanting to track the number of shutdowns and starts, possibly even independently of each other. Or maybe it is truly restarts, where a restart would be: a virtual machine shuts down and gets started again.
A
Yeah, and churn would be: the virtual machine got completely deleted and eventually replaced, yeah.
A
I think the reason I used "restart" was because the assumption is that I'm deleting a VM and then we're replacing it with essentially the same one; it gets replaced, it even has the same name. I'm almost restarting it: I'm just killing it and letting the thing start a new one in its place. For me it's the same one. It's not really a VM restart from a power standpoint, it's more like I've brought in a new one.
B
Yeah, so maybe "starts" is accurate enough: the number of boots that have occurred, because that's what we really care about. We don't really care about the shutdowns unless they result in another start.
A
Yeah, the number started, and then the rate, because you can get a bunch of things from the number and the rate. This could just be the same kind of metric as the ones we already have; we have a little bucket of them. A rate is really what I'm getting at; it just tells us how often this is happening.
B
Stuff like that, okay. So let's say we get a rate for the starts that occur, and we could also do shutdowns. That makes sense. Maybe speak to the next one: the detached pool.
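As a concrete illustration of the rate idea, purely a sketch since no such metric exists yet: if the pool controller exported per-pool start and shutdown counters, the churn rate being discussed would be a standard Prometheus query. The metric and label names below are placeholders, not agreed names.

```promql
# Hypothetical counters; the real names would be settled in the design doc.
sum by (pool) (rate(kubevirt_vmpool_vm_starts_total[1h]))
sum by (pool) (rate(kubevirt_vmpool_vm_shutdowns_total[1h]))
```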
A
Okay, I'd say it's kind of similar: it measures how often we're doing detachments. There's sort of an assumption that we're doing something with that detached VMI; it could be forensics, it could be that we want to take it under some sort of management. I kind of want to get an idea of how often those events are occurring.
A
You know, how often we need sort of special attention on these VMIs: is this occurring once a day, once a week? So it's another rate from which we could learn a bunch of information.
A
Yeah, so we have the place where we did all the phase transition times. What I was thinking is that maybe we can do a little bit with the labeling here, so we can see how a pool is performing as a unit. We could have a label for the pool or something, so that we know these VMIs are the ones for this pool and this is how they're performing, as opposed to just VMIs generally, or the VMIs for other pools.
B
Yeah, I think that makes a lot of sense. So if you had lots and lots of pools in your cluster, you'd be able to isolate these phase transition times by pool, and that would be really useful, I think, because it would tell you exactly which one is causing the trouble. I wonder how hard that would be. Do you think that would just be a label that we put on, or would we actually have a new... so it would be a new label on the metric.
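For example, assuming the existing histogram is something like kubevirt_vmi_phase_transition_time_seconds and the proposal adds a pool label to it (both the metric name and the label are assumptions here, not confirmed names), isolating a slow pool would look roughly like this:

```promql
# 95th percentile time to reach Running, broken out by the proposed pool label.
histogram_quantile(0.95,
  sum by (pool, le) (
    rate(kubevirt_vmi_phase_transition_time_seconds_bucket{phase="Running"}[10m])
  )
)
```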
A
Yeah, I think so, because my expectation is that someone wouldn't have thousands of pools with, like, one VM in each of them, which would be the equivalent of the thing we didn't really want to have, where we effectively get per-VM labeling. That's sort of the bad case; I mean, it could be possible here, but I wouldn't really expect it. And it gives us the opportunity to know the performance per pool, so as a way to sort the data I think it makes sense. That case where you're doing one-VM pools... I don't know why you'd be doing that anyway.
B
Yeah, that makes sense; I'd be fine with that. So of what you've listed, the two that make sense to me are adding the label to the phase transition times, and then the one we should investigate, perhaps after we've created the feature: how we'd want to track starts and stops and kind of understand churn. I don't think we have a great understanding of that quite yet, so I'd like to maybe not completely flesh that one out, unless you feel pretty confident. I don't feel confident about it yet, but I do agree that it would be useful; I'm just not sure how to represent it.
A
Okay, yeah. So my expectation, the way I'm thinking about this being used, is: I create a pool object, and I'd have some sort of controller that's going to say either "time's up" or "this VMI is finished with its work", or maybe the VMI itself terminates after it's done with its work. So I'm expecting that there are going to be a lot of VMIs that get cleaned up, and that it'll happen often. Having the rate of VMIs that are removed, per pool, gives me a concept of what's going on, because I hadn't really considered that each pool is kind of an identity when I was going off with the perf work; there's an image associated with the pool and so on. So I can tell, okay, how much work is finishing for these types of VMIs and how often it happens. It gives me a sense of what is happening.
A
For example: maybe I need more warm VMs available to be used, because there's just so much churn right now, so many people requesting them and deleting them. There are things like that; I can get a bunch of data from it, so that we can make informed decisions about what's to come next.
B
Oh, that's interesting. So in your use case, and I'm kind of familiar with it: somebody logs into one of these VMs, and then when they're done, does that virtual machine cycle completely?
B
Okay, yeah, it'd be removed and then it would be fresh. So basically the pool is like a bunch of open slots, and it's either a one or a zero, whether a slot is being used or not, and there's nothing else. Okay. I'm not sure this metric would help you there as much as some sort of custom metric specific to your use case, to understand essentially how many of these slots are full per pool; you'd have to associate the users connecting... well.
A
It's not so much about knowing how full it is; I'd say it's more about the rate. To make some sort of informed decision, I'd want to say: okay, there are a lot of VMs being started and a lot of VMs being deleted in this pool, there's a lot of activity from people using this kind of pool, so perhaps I should increase it, or maybe I should decrease it.
B
I get what you're saying, but wouldn't it be more accurate to say: I've got this VM pool, I've got this many idle, open slots, and if the percentage of open slots to used slots falls within a certain threshold, I need to scale up or down? That seems closer to what you want, because you don't want to run out of free slots, but you probably want to keep the margin narrow, because you don't want to unnecessarily consume resources that aren't needed.
A
Yeah, maybe I'm conflating two things, because by "informed decision"... I guess what I'm saying sounds as if a computer would do it, but these metrics are for humans, so it's not like I would use these metrics to have the controller scale up. I guess the idea is really the human side of this, which I think is the part to isolate.
A
That part is that I, as a user or as the administrator, would be able to know that there's a lot of activity here. I think that's really all it is, so forget the informed-decision part; it's really just that I've noticed there's a lot of activity with this kind of pool. I think that's really the only conclusion you draw from this, which I find useful, but I don't know. I understand what you're saying: it could be just for this use case. Maybe people don't do this; maybe they don't delete VMIs after they're done and the pool just stays full, and then that conclusion wouldn't be helpful for them. So yeah, I can see that argument too.
B
Okay, so if you're using it just informationally, you just want to look at a dashboard and you're not using it to make decisions, then I think what we're looking at is that we're trying to measure activity on the VMs. So I think measuring the rate of starts and stops is probably the most accurate way to represent activity, or maybe it's the number of scale-ins and scale-outs.
A
Yeah, I'm not sure. There's an assumption here, yeah, that I would measure activity, based on my use case, by the number of starts and stops, but someone else might not. Okay, so yeah, there is an assumption there. But I don't know; in other cases, in the general case, would you care if you had a bunch of starts and stops? I mean, it could be...
A
It could be useful if something was happening to your pool that you didn't expect. Maybe a lot of VMs were just being deleted for some reason, or maybe they were failing or something; it might be something that you could alert on, because it's not something you expect. So there could be other use cases there, if you don't expect a lot of restarts and you do see them.
B
Yeah, I can get that. I think it makes sense; the rate of starts and the rate of stops, I can get behind that. So here's what I would write in the document.
B
It will look a little strange, because when we first create VMs it's going to take a long time in the scheduling... or, no, because... okay, forget it. All right, so yeah: it makes sense to have the phase transitions, and I think the starts and stops make sense as well, and I can document both of those as desired metrics.
B
I don't know; that's probably going to be one of those follow-up things. So we'll document it in the design, and then once we get the base features and everything, we'll look at adding those, and it might get changed a little bit at actual implementation time. But I think we know enough to be able to document those three.
A
Okay. I don't see any other items on this; for me, overall, it looks good. I'll probably just read it one more time for my review. Okay, all right.
B
I will; let me clean it up. I'm going to focus on cleaning up this document, because I notice there are some comments that are just kind of typo-level things, and I'll document Roman's thing, and I'll document these.
A
This one was weird; I don't know if you've got any idea about this, David. This was something we had noticed: we brought in some of the phase transition metrics, and this actually occurred during the outage, the situation where we found the virt-controller panic issue, and one of the things that came out of it was this: the label on the VMI phase count metric changed, and then, as you can see, the value gets reported; everything's running, it's just that now its value changed.
B
Yeah, so here's what happens: when a virtual machine instance is first created, that phase label is empty, and until it gets reconciled for the first time it's not going to be set. So it's possible that what's happening is that lots of VMIs are created but not reconciled yet, and then they all get values afterwards. So what happened here with yours was...?
A
The virt-controller restarted; I think it was just the virt-controller, and it was the panic issue, and then this appeared: the labels...
B
Okay, it's just that, and it looks like this. I wonder if this is some sort of Prometheus interpretation of it.
B
You
said
vert
handler
restarted,
but
this
would
only
make
sense
in
the
case
of
controller
restarting
if
your
controller
restarts
there's
going
to
be
a
period
of
time
where
the
leader
election
lease
isn't
given
up.
So
we're
going
to
have
no
vert
handler
for
like
30
seconds,
I'm
sorry
for
controller
okay,
we're
talking
about
the
cluster
scope
again
for
control,
we're
not
going
to
have
it
for
controller
for
about
30
seconds
or
so
waiting
for
that
lease
to
expire
and
the
new
leader
to
come
online.
B
So if Prometheus tries to query for information related to virtual machines during this time, it's going to get nothing, so there's going to be a period where it just looks like there was a gap, I guess. And once the new virt-controller comes online, depending on when it gets queried, it may or may not reflect virtual machines existing yet, depending on whether it has caught up on its informers or not. I'm not sure exactly what's going to get reported during that time, but eventually it will all get caught up and the correct values will get reported by the new controller. So during this window, between a virt-controller crashing or being restarted and the new one taking control and syncing correctly, things could get kind of weird. I'm not sure exactly what would be reported and exactly how Prometheus, or even Grafana, would interpret those results.
A
Yeah, I think the thing we need to figure out here is what happens during this period. So I think the test is: we restart a virt-controller, and then we need to look at what's in Prometheus at that point. We need to follow up; we first need to verify that we see this, and then we need to get the data from Prometheus, because that could at least rule this out.
A
If
it's
bringing
a
circle
or
whatever.
That's
that's
the
metric
services
reporting
this
or
if
it's
at
least
something
else,
but
let's
see
so
that's
that's
kind
of
what
I
would
like
to
see,
because
what
you're
saying
is
curious
to
me,
because
I'm
wondering
if
like
if
we
cracked,
open,
grafana
or
cracked
open
prometheus,
we
saw
there
that
the
values
are
just
gone
or
something
because
it's
synced
and
then
it's
like
no
there's
no
values
for
this,
but
we
see
them
they're
there.
We
have
a
number.
A
Yeah, let me put it here. So we'll do this: let's try a test where we restart the virt-controller.
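One way to run that check, assuming the metric in question is the VMI phase count gauge (the name below is a guess, not a confirmed metric name): graph the raw series in the Prometheus UI across the restart window, with no rate() or dashboard aggregation, so a disappearing series, an empty phase label, or a duplicated series shows up directly.

```promql
# Raw per-series values around the virt-controller restart.
kubevirt_vmi_phase_count

# The aggregated view a dashboard would normally show, for comparison.
sum by (phase) (kubevirt_vmi_phase_count)
```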
B
So this is a PR I just put in the chat. I would like to see us begin tracking some of these perf-scale results, so until...
B
I forgot about it too; I was just looking at what I had open. Once this can merge, we'll start to get an artifact that shows us the expected transition times and everything, and the API calls and other stuff that occur during our density tests. Once we feel comfortable with what we see, we should start setting some thresholds and then start ensuring that we meet those thresholds.
A
Yeah, so this one, I'm not sure how we can make progress on it, at least. I think this is one of those areas where we do the profiling and maybe see if we can notice anything, because we see this as well on our hardware. I think we just need to commit to doing the profiling; I think that's really just it. Let me write it in here.
A
What is it, the virt-controller disruption budget?
B
So I guess this came out of the density test. I could investigate this by running the density test myself and just lowering the number of virtual machines to what my environment can actually run; theoretically, I should be able to recreate this, yeah.
A
All
right,
that's
one.
Okay,
let's
see
disruption
budget.
We
have
this.
One
that's
been
around
for
a
little.
While
this
is
the
key
performance.
This
was
kind
of
a
general
one.
We
have
a
bunch
of
these
like
metrics,
like
here's
work,
you
wait
and
see
which
everything
you
see
everything
seems
like
the
same
there
and
then
I
could
use
the
description
budget
and
the
unfinished
work
yeah
I
mean
I
this
one
also
see
a
lot
of
on
rn2.
Oh
yeah,
I
think,
did
I
post
this
yeah?
A
I
did
or
went
to
20
years
ago,
but
I
thought
I
posted
new
ones
unfinished
work.
I
did
not
post
new
ones.
I
did
see
so.
Oh
I
remember
this.
This
was
okay,
so
this
was
marcel.
Did
this
experiment
again
and
we
did
that's
what
we
wanted.
We
wanted
to
check
pps
and
see
if
it
made
a
difference
and
we
didn't
see
much
of
a
change,
but
we
did
see
this.
A
Unfinished work is still high from the handler. Unfortunately, this is exactly one of those things that's hard to quantify versus sort of imagine. I understand that unfinished work is a thread that's running long in the controller, but I don't understand this measurement; it's saying we have a 12-minute-running task in a controller, which just sounds crazy to me. It's possible, yeah, but it's interesting. Maybe this is another thing where we just need to do some profiling to learn some more info.
B
Okay. We've had errors in the past, for example, where we'd make an API call somewhere, it didn't set an appropriate deadline, and it just hangs, so it essentially consumes a thread for a long time. I think we've resolved the ones I'm aware of, but that kind of thing is possible.
A
Yeah, we see Tomas is looking at this.
A
Yeah, I think that's just going to tie into this thing; Tomas is already looking at making a change, I think. Did he post a PR for this? Oh well, you can see... I think he did.
A
Very nice. Okay, this is Tomas's change, David; I don't know if you're aware of it, but this was how he wants to reduce the memory usage.
B
Oh, okay, interesting. Yeah, so I just got back from PTO today, actually; I was gone Thursday right after our call. I left and have literally been gone for a week until this call, practically. So I will look at that; that's interesting to me.
A
I don't know how to assign myself... all right, I'll just keep it to that part. Okay, yeah, so this is, I think, what he's doing next for the profiling.
A
Okay, we're in the performance section, and then we've got modules using more CPUs than requested.
A
I think... do we just set limits on this? Like, is this something we...?
A
All
right,
we
have
marcelo.
A
All
right,
we
need
yeah,
we
need
to
scoop
that
one
okay
and
then
profiling
prior
to
fairness.
A
I actually saw this; we actually ran into this, and it's interesting. The other day we hit an issue with the current 1.21 default settings, which, I guess, group the priority of everything that runs as a service account.
A
It
runs
as
a
service
account
into
one
queue,
and
so
we
actually
ran
into
an
issue
because
cuber
was
creating
or
doing
a
lot
of
work
and
it
conflicted
with
obs
and
they
were
filling
up
the
queue
and
they
were
actually
getting
requests
rejected.
A
So
I
had
to
give
kubert
its
own
priority
queue
so
that
didn't
conflict
with
anything,
and
actually
there
were
the
the
rejections.
The
422s
go
away,
the
429's
go
away,
but
this
was
at
large
scale
under
a
lot
of
stress,
so
it
was
something
that
I
don't
think
many
people
will
really
see
at
the
moment,
but
it's
something
to
that.
We
can
consider.
I
don't
know
I
I
I
know
this
is
something
that
I
need
to
do,
or
at
least
to.
A
Yeah, all right, got it, exactly. This is essentially the issue I talked about before, yep; that's one of the things I wanted to get at. The larger topic is that you start defining the queue length and those other settings, which I don't have enough information on, but at the very minimum, having its own queue, I think, makes a lot of sense.
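For reference, the dedicated queue described here is expressed with the API Priority and Fairness objects. The sketch below only shows the shape of such a configuration; the object names, the service account being matched, and all of the sizing values are placeholders, not the settings that were actually used in that cluster.

```yaml
apiVersion: flowcontrol.apiserver.k8s.io/v1beta1
kind: PriorityLevelConfiguration
metadata:
  name: kubevirt            # placeholder name
spec:
  type: Limited
  limited:
    assuredConcurrencyShares: 10     # sizing is a guess; tune per cluster
    limitResponse:
      type: Queue
      queuing:
        queues: 10
        queueLengthLimit: 50
        handSize: 6
---
apiVersion: flowcontrol.apiserver.k8s.io/v1beta1
kind: FlowSchema
metadata:
  name: kubevirt            # placeholder name
spec:
  priorityLevelConfiguration:
    name: kubevirt
  matchingPrecedence: 1000
  distinguisherMethod:
    type: ByUser
  rules:
  - subjects:
    - kind: ServiceAccount
      serviceAccount:
        name: kubevirt-controller    # placeholder; list the real KubeVirt SAs
        namespace: kubevirt
    resourceRules:
    - apiGroups: ["*"]
      resources: ["*"]
      verbs: ["*"]
      clusterScope: true
      namespaces: ["*"]
```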
A
Okay, yeah, all right; we'll keep it open for now. It's something I'll get back to at some point, I just haven't had a chance yet. And then, oh, we did this one, right; and then there's "profile under high load", which is... okay. Oh hey, Tomas, you're here! Hey, did you want to talk about your change at all?
C
Yeah, I can talk briefly about it. Basically, I added pagination to the cluster profiler, which means that I'm making the list-pods requests in pages, and I return to the user a continuation token together with the profiler results for the pods from that page. That helps us control how much memory virt-api uses to store the cluster profiler results in memory. Additionally, I've added a label selector.
C
So basically you can select the pods, as usual, with a label selector, which we all know from the kubectl command; it's basically the same syntax, and it's parsed the same way. So we have these two mechanisms, pagination and filtering.
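For background, the paging Tomas describes mirrors the standard Kubernetes list pagination: a limit plus a continue token, optionally combined with a label selector. A minimal client-go sketch of that pattern is below; it is illustrative only, not the actual virt-api profiler code, and the namespace and label value are assumptions.

```go
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(config)

	continueToken := ""
	for {
		// Page through matching pods 20 at a time; the selector and namespace
		// are assumptions, not necessarily what the profiler endpoint uses.
		pods, err := client.CoreV1().Pods("kubevirt").List(context.TODO(), metav1.ListOptions{
			LabelSelector: "kubevirt.io=virt-controller",
			Limit:         20,
			Continue:      continueToken,
		})
		if err != nil {
			panic(err)
		}
		for _, pod := range pods.Items {
			fmt.Println(pod.Name) // the profiler gathers results per pod at this point
		}
		continueToken = pods.Continue
		if continueToken == "" {
			break
		}
	}
}
```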
C
Filtering with the label selector also reduces the memory usage, and both work; I've checked this on a large cluster. So, David, if you can have a look at it, I would appreciate it.
B
Sure, yeah, I'm looking at it right now. For your pagination stuff: when does that token expire? I guess if we tried to use another token, or what?
B
As long as the list... oh, interesting, okay. And the other option you had was the label selector, and that would let us choose, say, just all the virt-controllers or all the virt-apis, yeah.
B
In
practice,
what
what
have
you
used?
Have
you
used
both
of
these
or,
if
you
forget,.
C
So
so
I've
checked
both
of
these,
like,
for
instance,
for
weird
weird
api.
I
just
there's
a
cube
view:
dot
io
label
right,
so
I
use
that
it
works.
So
interesting
is,
interestingly,
the
page
size.
So
I
thought
that
like
20
is
like
we
should
handle
it
just
fine,
but
it
turns
out
that,
at
least
for
me
there's
some
problem
with
fetching
large
files
through
the
weird
client.
C
Maybe
that's
because
of
of
the
fact
that
I'm,
like
you,
know,
far
away
from
the
data
center
actually
which
I'm
in
europe
this
data
center
is
in
in
us,
and
I've
noticed
at
some
point
that
cube
ctl
copy
it
phase
as
well
for
me
with
larger
files.
So
maybe
that's
that's
just
because
the
cli
grants
client
is
not
reliable
in
copying
large
files,
I'm
not
sure
yeah,
but
smaller
page
sizes
they
work.
C
B
I'm curious whether, for your use case, the label is enough, because with the label you'd select controllers, and you could probably get just one or two virt-handlers. If you needed to, you'd have to target the exact handler, yeah.
C
Yeah,
so
for
me
it
works.
It's
okay,
it's
enough,
but
you
know
I
was
thinking
about
adding
field
selector
as
well,
which
I
guess
would
cover
like
every
use
case,
probably
right
that
you
can
select
by
the
I
know,
by
the
name
or
whatever,
whatever
you
want
right,
because
that's,
I
guess,
that's
two
filtering
methods
we,
which
cubectl
has
label
selector
and
and
field
selector.
So
I'm
okay
with
adding
that
as
well.
It's
like
should
be.
You
know
it's
not
it's
not
a
big
deal
to
add
this.
C
I don't need it right now, but maybe to have, you know, full coverage of use cases. Why not?
B
Yeah. And do you need the pagination right now? Or, I guess, let me restructure my question to make sure I'm asking it accurately: will the label and field selector be enough for you?
C
Yeah, I think I could get by with the label selector, but I'm thinking: why not support fetching all of the results from all of the components, including the virt-launchers, which we don't do right now, right? So I'm wondering why we wouldn't do it, or at least provide the user a way to do it.
B
My concern with the pagination is that these are ephemeral states, so what you query at one point may not even exist anymore minutes later; pods might change, and things like that. So you just wouldn't be able to get the result; I guess that request would fail. Definitely, so once...
A
So
would
that
mean
because
of
like,
like
going
back
to
the
original
problem,
which
was
the
memory
usage,
but
that
mean
that
if,
if
say
label
selector
was
the
only
option,
it
would
mean
that
it's
basically
a
requirement
to
make
it
usable
use
it
use
to
make
it
usable
at
all
in
a
large
cluster
situation,
is
that
we
would
have
to
have
a
bunch
of
labels
on
there
right.
A
C
Yeah, yeah, because, on the other hand, if we don't have pagination, then we would release a feature like this cluster profiler with, okay, the ability to select a subset of pods with a label selector, but then we would ship a feature which would be internally flawed: if the user chooses not to use the label selector, it crashes the whole virt-api pod. So for me...
C
It's
it's
really
strange
to
to
have
a
feature
which
basically
doesn't
work
when
we,
when
we
ship
it
with
with
default
arguments
right
because
default
for
labor
selector.
For
me,
it's
empty.
Why
would
someone
you
know
tell
you
the
default
is
something
different
and
with
default
default
arguments,
sometimes
you
would
just
for
some
cluster.
It
would
just
fail
with
you
know,
out
of
memory
without
of
memory
error.
So
that's
just
that
architecture
for
me
right.
We
should
not
like
have
it
like
this.
That's
that's
just
you
know.
One
point
of
view.
B
If it helps you all, I think I'm fine with it, because, again, this is not... I don't even know; maybe you all would use this in production. I wouldn't recommend it, definitely would not recommend that, actually, but if it's useful enough for you all, we should enable it.
C
Yeah
I
mean
we
could
always
do
like
default.
Page
size
is,
is
zero,
which
would
mean
like
everything
at
once.
C
Let's
say
right
so
then,
then
we
wouldn't
have
this
behavior,
this
female
behavior
by
default,
but
then
user
can
select
a
page
size
which
is
you
know
something
smaller
and
then,
when
user
explicitly
selects
the
patched
page
size,
then
he
basically
agrees
that
okay,
this
is
you
know
if
ephemeral-
and
you
know
it
can
break
for
some
in
some
circumstances,
break
meaning
the
request
will
not
succeed.
B
Okay, I'll look at it. I think this makes sense, what you have, the more I think about it. I just want to make sure that for the default developer use case, where somebody's running just on their laptop with a couple of nodes and not a lot of virtual machines and everything... well, I guess we don't collect the virt-launcher stuff yet. Anyway, make sure that we get all the information at once for the default case.
C
If you have anything in mind... I just thought that this might be a good number given the size of the profiles, right? Because of the profiles: if one pod's result is, let's say, nine megabytes of memory, then ten is something like a hundred megabytes of additional memory, so this seems to be fine. If you have anything in mind, just comment there, and yeah.
B
So the thing that I have in mind is: I just want the default to be able to handle a two-node cluster with all of the components involved, and I think ten might be exactly how many that is, because we have two instances each of operator, api, controller, and handler. So that's eight; maybe it only ends up being eight, but ten would definitely cover it.
B
Okay, I will review this, and I think I'm on board.
C
Thanks, thanks, yeah. I've got one more question regarding this change, about the virt-launchers: was there any reason why you decided not to include them in your change? Because I would like to profile the launchers and see how it behaves and what's there, so I'm just curious.
B
No, that's fine. I was less interested in the virt-launchers at the time, simply because there's not a lot going on in there, really; it's just the one workload each. I was thinking about this primarily from a CPU usage standpoint, but I could see it for memory and other things; yeah, it makes a lot of sense. So that was probably an oversight on my part. The difficulty is going to be how to get the information out of the virt-launcher.
B
I don't really know exactly how that will be done, because we don't control the network in the virt-launcher; the virtual machine guest controls the network. So somehow virt-handler is going to have to get that information in order to return all the dumped results and everything, and also handle starting and stopping and dumping the launcher results. It'll be a little tricky.
B
From
vert
handler
now
that
would
be
fine,
that's
what
I
would
expect
so
vert
handler
would
have
yeah.
It
would
well,
no
you,
wouldn't
you
have
to
do
that.
So
bert
handler
exposes
a
an
http
endpoint
when
you
have
this
debug
feature
flag,
enabled
and
everything.
This
is
totally
not
safe
for
production,
so
I'll,
throw
that
out
there
again,
because
you
could
anyone
can
hit
this
endpoint
and
get
information
they
would
hit
this.
B
You
would
hit
this
dump
endpoint
and
the
invert
handler
behind
the
scenes
would
just
go
in
and
grant
all
the
vmis
that
exist
on
that
node
and
retrieve
from
their
file
systems
and
return
them.
Now
you
could
do
that,
but
maybe
that
would
be
too
much
information.
Then
we
get
back
into
the
pagination
issue,
because
if
you
have
like
100
virtual
machines
onto
a
node,
then
that's
a
ton
of
data
being
returned.
I
don't
know
it
gets
a
little
tricky.
You
could
pod
exec
and
use
a
copy
sure.
Why
not?
B
That's the best I can say. And it's possible that, if you really want to profile a virt-launcher, the cluster aggregation of all that might be too tedious, and we could add some hooks where you can profile it internally by doing your own pod exec in there and getting the data; so we'd just bake in the ability to turn this profiling stuff on and off in virt-launcher, and then you do it manually.
A
Okay, all right. Well, we're at time. Any last thoughts?