KubeVirt SIG Performance and Scale, 14 Apr 2022

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: SIG - Performance and scale 2022-04-14

Description

Meeting Notes:
https://docs.google.com/document/d/1d_b2o05FfBG37VwlC2Z1ZArnT9-_AEJoQTe7iKaQZ6I/edit#heading=h.tybh

A

Okay, am I still sharing just let's see my screen? Okay, good, all right, all right so, sixth scale. This is uh 4 14.

A

Let me put the link into the chat.

A

It's probably just gonna, be awesome, solo, all right. So um all right. So it's about a few weeks been about.

A

Four four five weeks or so so um uh I haven't uh so probably the only update I have for like a few things I want to do is like I want to go through, because we haven't looked at the periodic job results. I just wanted to sync up on that again, let's see how you know if anything's changed here, I did look at this earlier, just to kind of get an idea.

A

Everything seemed okay, um we're still roughly in the same amount of time for these that we're spending in phases and the number of updates. Our number of uh api calls look roughly the same.

A

Yeah patch, we expected thresholds 215 and.

A

Yeah pretty good.

B

Can you check also the performance cluster yep.

A

I was gonna go to that one. Next: uh okay, pretty good three, thirty thirty nine yeah well under this still looking good there, okay and then, uh let's see performance cluster. Is uh this one right.

B

Yeah, oh you see I I I saw that it was missing the numbers.

B

Okay. What is that.

A

Oh, this is oh one of the uh one of my patches must have emerged. Then.

B

A

B

Because I was looking the dashboard and then was missing some data and then I just wasn't understand, but I didn't check this. The that's great okay,.

A

So it was about a week or two ago I wanted to merged okay. I haven't caught up on this yet, but I wanted to, I was gonna kind of want to go through them now. Okay, so, like then that's what this is. It's just. uh It's missing.

A

Let's see, I think, I'm missing the one. No.

B

All right probably the test, so it's update the the load generator, but it didn't update the the protest from.

B

Oh so I I'm just put here.

B

B

It's it's! If it's fine, so we just need to make sure that we fix that yeah, because it's was missing some data yeah. This.

A

Is the one okay, so this is just the one that needs to be okay, so I need to just come back to this. One looks like.

B

Okay, just this is the one that you're missing.

A

Yeah this is the patch you're missing. I just need to do some re-testing, all right, cool yeah. You probably need to rebase this too. Unfortunately,.

B

I cannot approve things um yeah.

A

Let me get some gtm okay, let me get. I don't have to get david or roman to just take a quick look at it, um but that yeah that's what we need. That's what's missing for okay,.

B

All right: okay, yeah, let's zoom.

A

So holy, so that was the so I think then is: are they all gonna that works? Let me see so what's the density test? Is this the other one yeah same thing? Okay, so it's the same. It's gonna be what's it looking for uh probably.

B

It is the first 200 yeah.

A

Yeah, okay! Well, I'm good that this uh okay, I'm glad that merged, so that- um um and it looks like everything from the syntax perspective- is working as expected. We just assumed that um that one patch, so okay.

B

A

Encouraging so we're halfway there cool, okay, yeah good and then.

B

So this link that I put look at this. This is the performance dashboard of the dashboard of the performance cluster, the one that I told you that I was checking.

A

B

B

Yeah yeah it started to miss the data I thought was yeah. I didn't investigate too much. I was you know doing something else, but I saw this and I was worried about, but now we know what's happened.

A

B

Okay, yeah, the the thing is: do you see that there is like some um big variation in the previous test.

A

B

If young you know, yeah maybe put like maybe.

A

B

Of they are not all the same tests, but some of them are the same test. You know, because we have this test creating 100 and the other test that range from 200 400 600, I think, and but even though we see some, um uh you know big uh some b variations so.

A

90Th percentile 9.44 minutes.

B

Yeah, maybe we should put the median also here you know the dashboard, but anyway the ninth percentile.

B

You see this one, the in the march 20, yes march, 20 was nine minutes the worst case, and then it drops to five minutes.

B

Maybe it's something with the cluster. Isn't it? um You know like uh the the cluster was busy. Something happened to this nodes, but it's a bare metal node. It shouldn't have any difference. So is this.

A

So when I look at this um okay, so we've got.

A

When I look at this, so is this: is this like additive like this is how much time was spent in schedules phase? Yes, it's fairly large yeah, okay, yeah. We saw um we see this as well. I actually have another good illustration of it um here.

A

Transitioning from scheduling to actually it's scheduling phase is what we see well, actually, no new schedule as well, but it's um we. This is one pattern we do see with um like yeah, so you can see like the the dark blues like you can see the large um uh large amount of times they're spent there. The light.

B

Blue is the scheduling which is increasing.

A

Oh sure, yeah, okay, this um well, I I have a different point. I want to make about this, but one of the things that we see internally when we look at our clusters and their performance when this light blue starts to increase, is when we run into problems it's actually. This is when we see kubernetes kubernetes, actually running up into issues with performance, and when we see this light blue, I can't really say about the dark, blue. um I think it's a mix, we actually seven.

A

I mean that's kind of based on what it does, but I haven't done full analysis on the dark blue side, like what the makeup of it is. I've done a ton on the light blue.

A

This is almost exclusively kubernetes, so like on this on yours um light blue yeah, you have the same, looks like you're.

A

What's how does the um hold on? I want to see the uh oh? I can't um does it show uh like what? How are you no.

B

A

So this is, um what about do you don't have pending in here.

B

Yeah, for some reason uh depending doesn't appear to me so so this.

A

Is so this would be if I interpret this correctly, this is the time it takes from pending um or from scheduling to scheduled right. So this is scheduling time or is this time from pending to scheduling.

B

It's from creation to scheduling, from creation to schedule and from creation to running this is the metric that uh you know, david, create.

A

Yeah yeah but you're, using you're, using um creation to so using the creation time stamp to the scheduled timestamp right. Okay, that's interesting, and then you put them together. I see so.

A

Okay, so the difference between these two lines.

A

Is the time in um that we spent in scheduling, though I think because it's the time we train yeah, so this is, I think it's. The reason why you don't see pending is because this is, I think, the light blue is your pending. I think your green is actually scheduling like time spent scheduling, and this is yours.

B

What do you mean because I just get this from the metric, so you know the this. You know vm creation, phase metric, that you know david created in cooper and his, and he has like the calculates, the delta.

B

You know between the creation to when the vm when the phase change. So when it's scheduling it's means the vm.

B

I well. I don't remember now, maybe exit from this, I don't remember yeah, it's good to check.

A

B

A

I would we should double check this, because um I I think I I just kind of a feeling that this I'm wondering if this is actually um this time here is actually it's actually this the same as this light blue.

A

That's what I'm wondering I don't know I kind of want to see the um yeah what you have on the because you said it from time that the creation time stamp to like, because if you do the creation time stamp to do the scheduled transition that would be this could be this. The phase transition time stamp from scheduled right is like the moment we transitioned into schedule right yeah, so all the time beforehand would be 10 would be. Creation pending scheduling would be this yellow.

B

Exactly when it's scheduling yeah, this is the phase transition. Yes,.

A

Yes, but then sched, this blue line, the top of the blue line to the top of the yellow, will be the time you spent in scheduling phase.

B

Yeah, it's not a stack, so actually it is they're just over under each other. Okay, but I think you can see this metric. So if you go to the inspect.

B

B

Okay, you see vm creation time, you can see yeah in the bottom.

A

B

Yeah, if you go down there.

A

B

Yeah, you see it's the convert, vmi phase transition time from creation.

B

I just remove failed and succeed so just to have the creation.

A

A

Okay, I forget when now what the metric that.

B

Yeah, but this is exactly the time stamp so when it's yeah, when the phase change so.

B

Current creation from the object was created when the object first appear the request, you know and then the phase transitions but uh depending does appear for me, never appeared. So I don't know. That's weird.

A

um Yeah, that is weird.

B

Maybe it's because it's too fast so it doesn't doesn't have pending, but.

B

We have the scheduling so depending is before kubernetes schedule the object. So if you see pending it means that your kubernetes, it's it the scheduler, it's suffering. So the problem is, if you see the pending, do you have this scheduling time.

A

B

Scheduling time because there is a metric from the scheduler, that is the scheduling time, it's the kubernetes, metrics. Okay, it you probably your dashboard, if you don't have, might be interested to check, because what I'm saying here is okay, I'm not sure, but I think that the scheduling time it's very small here- then we don't see any pending, but we see the convert. You know components is doing things. You know some slow down here.

A

Yeah, I just find it weird that you don't see pending. I don't know it's odd.

B

A

Be interesting to see what you find in the prometheus data and just yeah. I find that weird I mean because they definitely have the time stamps I mean they've got to right like they should. They should definitely have it. So it's kind of weird, maybe there's we have a bug there or something um it's interesting: okay, well yeah, but you're right anyway. Back to your earlier point, your first point was that right, this is we're. Seeing a big increase right like here were.

A

Is this the, which is the hundred tests in the 200 and then 600? Is it? Is it one? Is this like a 100 over here? No.

B

There were 100 dine wounds between you see. First,.

A

One like zoom in, I should see it. Oh here we go. I see.

B

Oh, this is 200 if you go back a little bit. Sorry um yeah, so you see like there is a stack here that is very small between these two big ones, yeah this one. This is the 100.

A

That's the 100 yeah and you see 49 37 17. Yes,.

B

100 kind of pretty good, isn't it so, but when we see like more, no, you know more vms.

A

Yeah but even for like for seconds it's interesting to see this.

A

How many would be it would be to really up for, because, like so, I see different numbers in terms of like what, like some of the things that we've measured, I'm wondering about the hardware in terms of use, because we see milliseconds on some of these. Like.

A

Yeah, we don't even in the 95th percentile, we see milliseconds.

B

A

We don't even reach seconds, we have it in. We have we take it takes less like for pending and scheduling phases. It's in milliseconds like we don't even reach this, like it's less than a second. It takes and you're, and that's in the 95th percentile you're in the 90 percent on you have almost 20 seconds.

B

Well, this is interesting, but to create 100 vms.

A

uh Not like at once I mean it would be like.

A

Yeah, I guess it's not fair to say I haven't done so. Let me do I'll have to do a comparison on the hardware like because I haven't, I haven't done the exact same test, but roughly like what I'm saying is that when we have um we're doing our creations of um I don't know, we do like maybe a bunch at a time, a handful of time, it's less than 100, but it's it's it's in the middle seconds. It takes for a lot of these phases.

A

That's interesting.

B

Actually, milliseconds yeah, it's very surprising. I never see me in seconds in any test that I did so and I test in another high reverse as well. Maybe the image that you are using, maybe so, if you have, as you know, nvme you know, things can get like faster to load the image.

A

Yeah, um this is interesting. I mean this is so what's this one, this is 600.

B

A

Wow, it's almost 10 minutes. Okay, I want to do a study on this like um more details.

B

Yeah, I remember that you know your colleague presenting the kubrick summit. He has also some machines, you know taking 10 minutes. I saw his. It was like some very hair scenario, but I saw like 10 minutes.

A

B

A

It in some cases yeah- I mean I guess regardless though I mean, is this. What we expect, though, like I mean I guess I mean if we can produce it and you can produce it, I mean that's, that's good, but I mean I wanted to be good to do some analysis. That's like an eight like what is happening that we're sitting like scheduled phase right where what's happening at schedule space. We are transitioning from the vm, the vmi, to the to the vert handler.

A

It's doing some work we're trying to define the domain like eight minutes.

B

The tracing part should be useful, and then we saw that there is this guy that wants to work on the tracing. Maybe we should come up with a plan, maybe give him you know, because if we know someone wants to work on that, it will be very helpful, especially to get more attraction. You know to our community and you already started something with the tracing, but uh you know we we, I think. Maybe we can point him before going to open tracings kind of things.

B

You know to have more tracing points, analyze, the logs and- and uh you know I I don't know we can just tell him- I don't know how advanced it who will make this, but we can yeah.

A

I can think of.

B

Something yeah.

A

Definitely if we can do if we're able to open there's nothing great, I think that's a that's, definitely definitely a big effort, but if he is open to doing that, work it'd be awesome, but if not like yeah, we can do like the the poor hands tracing, which is the the tracing that I did in her controller, um which we could add it to the handler.

A

It would be interesting to see some of the cases where you know where, with these vms, to see if we like actually hit a slow something slow, it needs to be like a lot of research. We need to do in terms of like what are the paths that we need to look out for yeah. So there's still some there's some good work that we need to do there.

A

Okay, all right! Let me go to the next. um This is something I saw that I thought was interesting that I just wanted to mention, so we were seeing this in one of our data centers. Recently, it's a large data center, it's like over 700 nodes and um we would see um periods of low churn and high vmi counts. um One of the vert controllers creates a ton of patch requests like in a crazed out.

B

Did you see what what was oh there's.

A

The numbers out- I don't so I I hadn't figured out, but it um it's patching like crazy, but what's interesting about this pattern, is that you can see that here it's patching this green line um and there are um right, it falls pretty quickly and as it falls, um it actually corresponds with with this.

A

This uh grafana board that like so you can see um so this uh right here this period, um where we have this green line. This is the equivalent of the high patch counts and when it falls um so when this this this green line, the rest client request falls um the others rise over api over handler those increase, and you can see in the phase transition times. There's a change, so this line the scheduling time increases. We have this. um Our signature looks like this.

A

We have um a high amount of requests from bird api controller handler fairly. I mean they did.

B

You check it out, creating just we're creating vms. You receive like more requests. You because, like a atv, each vm has eight patch per vm. Isn't it.

A

B

A

B

A

Periods yeah like during the spirit, so that's that I mean that that could be true, but it's um I just find it weird like so this this area, so I said well, this is it's low turn during this year. That's the other part of this. Is that so this these areas right here, there's like um high scheduling times, represents high return things just take longer in kubernetes. That's just what happens um and we are still creating vms during this time. So that's true, but we're not creating as many vms.

B

Did you check the vm.

A

um Yeah during this time, it's um it remains fairly, steady, like within a few hundred, so it's not like increasing or decreasing, really quickly, it's remaining fairly steady and, for some reason, the the arrest client request is extremely high, and this is what I found when I dug deeper. Is that it's it's just patching and it's only one of the vert controllers, it's not all of them. One of the controllers was just patching away. um No.

B

It's on only one controller works at a time, so other orders are for high availability.

B

A

I thought it was actually active.

B

No kubernetes it's! This is a kubernetes pro, so something that we were also discussed internally in ibm. But you know, kubernetes controllers works like that. It's only single instance by default, and it doesn't, you know to have multiple controllers. It will need to charge. You know the data across different controls. It gets complicated well.

A

That's: okay, that's that's fine, but like what I what it doesn't make any sense to me. Is that like? Why would why would they request? Why would they be? Why would we be patching a ton? Almost it's not really idle time, but at like low um at um low turn at when when you'd expect not a lot of vms and then why would be? Why would we decrease the number of requests.

B

This phase transition. Actually it's telling the performance, it's the latency, not how many vm is being created so because what is what I'm thinking is? Okay, so imagine I don't know just guessing here. Imagine a scenario that it you know you you need to create. One thousand gems and the system you know cannot cannot cope with that, because it's especially because it's busy we can see here that things are very slow, but suddenly you know their requests.

B

The client requests decrees. But here you are still having like a lot of pending. You know, requests in the queue and then, when the system you know becomes a little bit, you know less overloaded. It can now. You know it can process all these requests that are pending. You know and then you see dispersed.

B

I don't know just guessing. You know something with you had some requests pending and now you will need to process it unless it unless, if you are a senior request, because you should only see any new requests and then it gets higher this, I'm just thinking that maybe it's something that it's on the queue you know that's now. It's been processed now.

B

You know if you check.

A

B

You can check some of the work you metrics, you know the the the depth there of the kills things like that. You know how it's behaving.

B

Also, if you can get like uh this um number of great requests, you know the call the rest call.

B

You know, because you have here, you know aggregated divert controller root api, but if you can just get this, for example, create you know just to make again to make sure that maybe you see um or delete so you just check what's happening, because maybe it's deleting that you see a lot of you know this high request here.

B

A

Check what's happening.

B

A

B

The problem is.

A

B

A

Yeah this is, this is the pat. This is patch. These are um like other get. Okay,.

A

It's literally we're just during these periods during this whole for this whole period of time we're creating and deleting during the periods that you can see where there's that little higher scheduling time it's when we're, creating and deleting and we're creating more we're, getting a lot of press, because there's a lot of creates and deletes here, there's very few, so it's kind of like a um there are still creating. What's going on, as we can see, there's lines, there's data being populated.

A

It's just strange that, like it's there's the signature matches that there is like a period when we're at a low turn for some reason, the patched then tax requests shoot through the roof in the vert controller, and everything else is not doing any work or not much at all like it's. It's still doing work just on the whole lot. You know very little, but for some reason this is doing a lot of work and then and then, when we're back doing, you know more work, the api, the word api you can see.

A

It comes back to life. Quite a bit. Bird handler comes back to life quite a bit and then controller dives back down, which is a little bizarre like I would expect for a controller, maybe to go up right instead of come down, so I just find it a little weird that we're like what is it? What are these patch requests that were that are happening? That's what's a little bizarre, so I don't know I I don't know what it is.

A

I'm the reason I'm bringing it up is because if we something to look out for because um something to do a little more research on, because I just find it if we're viewing a ton of patch requests here- maybe we're like um you know- maybe we're just doing- maybe we're updating something too often that maybe we have a code path, that's constantly updating or making patches or whatever changing bmi's doing something, and that isn't that isn't activated or isn't running when we're um when we're creating a lot of vms or something I don't know, it's a little weird.

B

Which version are you running.

A

This is your 35.

B

Yes, there you know there was some, you know. Did you remember the node node update? There was like some works trying to reduce. You know that, but it was not. Patchy was the only gadget anyway. This is interesting. Maybe it's a bug. I don't know it's yeah. It's definitely something interesting.

A

Well, we're gonna we're actually moving to zero, I'm hoping soon we'll get to maybe maybe next few months we'll get to to a newer version, and maybe we'll do this again. I want to do this experiment as well, though, with the steady state job, because um I think that would get a better result like.

A

If I can reproduce this exactly in the steady state job, then it would be um interesting to have you do it in your data center and see if you can get the same thing or um to see if it's something just on land or if this is something that there's some there's a problem somewhere, but anyway I figured I mentioned to keep an eye on, because I just this is counterintuitive. This does not look quite right.

B

What would it's yeah? This is weird because you know sometimes you get there's a lot of patch and then drops yeah when you get load through the other components.

A

Yeah, it's a little weird yeah. Okay! Well I'll leave it here. I can, I don't, have it I'm going to make an issue, I'm going to do I'll. Do a little like, I said, a little bit more investigation. Maybe one try to reproduce with the steady state job and see if um and then create an issue out of, then we can kind of we can go from there. It would be really cool if I could actually reproduce this and if I'm able to do this, so this is a job.

A

Do we have to reduce this in one of our um one of our our jobs, the periodic that would be really cool to see.

B

A

Okay, all right um and then last thing is uh prs. um We still have this open, so I'm um I need to know.

B

Just just one more.

A

Comment about this.

B

Yeah, you know this is just one kind of test, for example, that I I I don't do and we don't have this in the you know in the ci.

B

It's a test that you create vms and you live there. It's what you have in your cluster isn't so you have like. Maybe an old vms know that being created there and it's there forever and and then it's kind of the stability of the cluster. You know, then I don't know if this behavior is related to that. But it's maybe you know that's why we don't see because normally the test that we do it's we create see things and destroy everything.

B

But you know your cluster. Has you know remaining vms that stay there forever? You know- and I don't know if it's not forever, but for a long time and and then you're seeing here some.

B

You know some different behavior that we don't see. Yeah.

A

That would be we. We should be able to do that with like um a little bit of tweaking to study, say test right. I think maybe just that would be a good one to add, like um kind of another offshoot of it where we, because it's really what this is. It's steady state but like kind of like you said where they run a little bit longer like it's like we'll, uh we'll let the vm, maybe let the vms run for a certain amount of time. You know hours instead of just minutes.

A

um Let's see what see what happens, yeah kind of like ability test some. You know some little offshoot or a little leg of steady state. We can do with the burst test as well. I mean same concept. Just kind of you know burst is going to leave them around. You know.

B

Disperse weight, in the words.

A

B

That's what we have here.

A

So, okay, yeah, so I'll do some follow-up and see what I can find- and I tweak this a little more and I'll see if I get a little more data on um on the uh on the patch request, something what's what's being patched, because that's that wasn't really clear to me. I didn't have time to fully dig into it. So um yeah one of this one I find I'll, create an issue. I.

B

Don't know which cluster do you have, but you can probably enable you know, uh increase the log verbosity of the virtual controller area and if it's like, I think it's the verbosity higher than three or five. You can see the requests, I'm sure. So maybe I don't know you know you can check it, how the log it's implemented, but yeah and.

A

B

You can see, like uh you know, more specific details but they've the the api for sure. If the api, you know the kube api, uh the different apis, I'm sorry, the cool um google api.

B

If you enable the log higher than f or five, you can see all the requests so you'll see what's what's kind of patch requests arriving there.

A

Yeah, okay I'll do I can dig around with it. I I haven't seen it like after I was produced. This is after it was. I saw us this one time I haven't really seen it since so I think well I'll see if I can reproduce it and see what I can do.

B

It's not happening that anymore.

A

It's I I'm not to this extent like it's. uh It was happening fairly, consistent, consistently like in uh maybe like a uh almost like a week ago, but um I haven't seen it since, like I haven't seen it in like the last few days.

B

It might be hard too yeah.

A

Yeah, I don't know we'll see if I can find, but um I don't know this one. I figured I took a few pictures because it's a little weird so anyway, okay enough um enough on that one, so the uh okay so prs, these are the three pr's. So we already talked about this one um need uh another plus one uh same with this, I mean I think that we've already talked about this a million times and then I think this one merged right. This was the one that merged.

A

Yeah good, okay.

B

Okay, yeah. We definitely need to get the merge so about the slo documents. I think the someone mentioned you know about this document should be some other repository.

A

Okay, what do you think? Where should I go? I mean it's kind of. What did I put it in like the dock section, I guess.

B

Yeah, it was community, I think also kubernetes has this inside the community.

A

B

So if you go down the last comment.

B

A

B

Yeah convert community, and I also think so I think kubernetes slows is also under convert coming in. You know, good viewer community and then it has a directory of this is cage 6k, something like that and then inside that it has. uh I think it makes sense, because since kubernetes is doing that, but uh I don't know I don't have a strong feeling of that. Maybe we can ask you know david and roman about that.

A

B

Are organizing, they are the guys that are organizing organized the old project, so yeah, um okay,.

A

I can do a follow-up. I didn't already brought my attention to this. I didn't I lost this in my flurry of emails after it's too amazing.

B

A

For a few weeks, so okay I'll check this out and talk to jean and roman, I gotta talk to them anyway for these. For uh for this one, so cool, okay, all right, um I don't think we have anything else. I think for next next time, we'll see if we can um grab um uh what's his name, is it uh um said kim uh from? I only know his irc name?

A

Was it jake kim.

B

You mean the from the this luck channel.

A

B

Okay, yeah: we need to follow up with him before.

A

B

Lose his interest so.

A

Yeah it sounds like the time zone is going to be a problem. So maybe we need to do like a email thread or something just to get like his just understand and then maybe, if he can do some, if he wants to do some design and see kind of greatest thoughts.

B

Yeah, maybe we can point him, you know if we point him some kubernetes code. You know to just show to show how kubernetes is implementing the tracing. You know and uh and ask him, can you prepare, you know some some uh design ideas?

B

How, because actually, I think he was asking how to I don't remember now how to propagate the context. Isn't it.

B

I don't remember none of his questions and we have definitely checked it again and and see if, if, if might be possible to help him with ideas- and I don't know how, how is he willing to care that yeah.

A

He so he he liked the comment that I put, which is that, like we have him post his comments in sixth scale and and then we can follow them on slack so maybe um like maybe I can tag him on here and um and we can just uh start the conversation that way. So it's like why jake, I don't know his email. Do you know my information? Does he have any? No, he doesn't okay yeah.

A

Let me follow up with him I'll, just message him and get his email and we can we'll just start with slack. I guess that might be easier, because I don't actually know how to do this, and then we can link him a document. Maybe he can follow up in here.

B

Yeah he mentioned that he doesn't want to talk personally because of english. You know constraints, but we can yeah.

A

We'll go, we can try. We know it's fine, yeah, exactly yeah, we'll start with slacking just to see what like um you're good at interest on things and for retracing okay cool all right guys. I think that's a laugh thanks for attending.

B

Okay, thank you.

A

All right, thanks talk to you later bye.

B