From YouTube: SIG - Performance and scale 2021-10-28
Description
Meeting Notes: https://docs.google.com/document/d/1d_b2o05FfBG37VwlC2Z1ZArnT9-_AEJoQTe7iKaQZ6I/edit#heading=h.jol87qyjgei
A
All right, welcome to SIG Scale. It's October 28th; the link to the document is in the chat. If you have agenda items, add them, and add yourself as an attendee, please. Okay, we'll start with the first item, the periodic job threshold. Actually, before we start on the first item: last week I canceled. We didn't have too many items to discuss, and we didn't have a lot of attendance, so I figured it could very well be pushed to this week. So this item, the first item here, the periodic job threshold. I don't know if there's been an update on this since we originally talked about it, I think two weeks ago, but I wanted to ask: has there been any change with this?
A
If not, I just want to get some work items down, and kind of what can be done to get this working, and I can take it if not.
B
I don't think there has been, okay, yeah. I'm sorry, I feel like that's partly my responsibility; I have not gotten to it. So the path forward, if you want to write down action items: we have to build that perf audit tool, and we probably want to gather results with it. Once it's actually built, we should start getting results posted to an artifact, which could already exist today.
B
So we'll see the periodic run, hopefully successfully. We'll see... sorry, we won't see thresholds; we'll see profile results, and based on a collection of those, if we see it run for a few days or whatever, we could probably start setting thresholds, which would then alert us when things regress.
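
As a rough illustration of the threshold idea described here, a minimal sketch in Go: compare newly gathered profile results against baselines seeded from a few days of periodic runs. The Result type, metric name, and limit are hypothetical, not the audit tool's actual types.

```go
// Hypothetical threshold check: flag any gathered result whose p99 exceeds
// its baseline. None of these names come from the actual audit tool.
package main

import "fmt"

// Result is one measurement pulled from a periodic run's posted artifacts.
type Result struct {
	Metric string
	P99    float64 // seconds
}

// thresholds would be seeded from a few days of collected periodic results.
var thresholds = map[string]float64{
	"vmi_scheduling_to_scheduled_p99": 5.0, // placeholder baseline
}

// checkRegressions returns a message per result over its threshold; this is
// what would eventually fail the job or alert when things regress.
func checkRegressions(results []Result) []string {
	var failures []string
	for _, r := range results {
		if limit, ok := thresholds[r.Metric]; ok && r.P99 > limit {
			failures = append(failures,
				fmt.Sprintf("%s regressed: %.2fs > %.2fs", r.Metric, r.P99, limit))
		}
	}
	return failures
}

func main() {
	results := []Result{{Metric: "vmi_scheduling_to_scheduled_p99", P99: 7.3}}
	for _, f := range checkRegressions(results) {
		fmt.Println(f)
	}
}
```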
A
Okay. And then, talking about building the perf audit tool: does this just need to be added to the path in the makefile for the build or something, or is there a specific path it needs to go into?
B
I think it just needs to be built. I think it's going to end up in the expected path that the automation's looking for, if it just gets built.
B
I can show you here; I'll grab that PR for you real quick, because it's already been merged, and that'll give you an idea of exactly what code exists to execute the perf audit and why it doesn't work.
A
I thought... oh, okay, I thought it would show all the history. Okay, there we go, all right. So I was talking as if everyone could see the document in the chat; I thought no one could see this. Okay, sorry about that, everybody.
A
All right, I'm fine to look at what to do with this periodic job and then test it. Okay, that gives me enough. All right, perfect. So next week, I'm hoping, if I can solve this in time, that we can get started on getting these thresholds, and we can start making some decisions as to where we stand, and perhaps start looking at different ways we can gate around those thresholds.
A
Definitely, yeah. And then, once we get the... yeah, I'm hoping that as part of this I'm going to learn a lot about this periodic job, like where it runs; I don't know anything about it.
B
Okay, here's the chat, and I mean, I would look through what Marcelo has committed as far as pull requests go. Okay.
A
Directories,
I'll
I'll
look
through
it
I'll
see
what
I
can
figure
out
with
that'll.
Give
me
enough
to
go
on.
Okay,
all
right.
Thanks
david
all,
right
lum.
I
think
we're
good
on
this
topic.
Let's
go
to
the
second
bullet
point,
so
I
this
actually
the
segways
off
what
you're
saying
so
additional
audit
tool
measurements.
So
I
was
some
background
on
this.
I
was
looking
at.
I
was
looking
around.
A
It's
actually
doing
some
some
tracing
work
and
looking
at
an
issue-
and
I
found
a
bunch
of
interesting
things,
different
ways
that
we
can
actually
measure
some
of
the
the
times,
and
these
are
all
things
that
actually,
I
think,
would
fit
just
fine
in
the
auto
tool.
So
this
is
what
I
came
up
with
right
now
they
we
so
we
can.
We
can
see
the
scheduling
to
scheduled
transition
latency.
A
We
can
measure
that
and
we've
got
in
our
metrics,
but
there's
also
some
things
that
we
can
actually
get
off
the
objects
to
that
tesla.
Some
other
things
like
latency
between
when
the
pods
are
ready
and
the
vmi
object
transitions
to
scheduled.
A
We
can
actually
see
on
the
pod
when
the
containers
go
to
ready
it's
in
it's
actually
in
the
conditions
there
it's
in
the
status
and
then
we
can
also
see
when
the
vmi
object
transition
is
scheduled,
so
we
can
actually
start
putting
some
more
some
more
data
points
down,
there's
also
like
latency
between
the
vert
launcher,
pods
being
assigned
to
a
node
to
the
creation
timestamp.
A
This
is
on
the
pod
we
can
see
like
when
the
network's
assigned,
like
the
node
name,
is
actually
filled
in
there's
a
time
stamp.
That's
that's
put
there
and
we
obviously
have
a
trade,
the
creation
times,
amp,
there's
the
launcher,
pods
being
assigned
a
network
if
you
have
so
never
plug
in
when
those
get
laid
down.
That's
also
there
we
could.
If
we,
if
there
are
padded
pvcs,
we
could
actually
look
and
see
the
pvc
that
is
being
allocated.
A
There's all this stuff that has timestamps around who is updating what fields and when, and we can actually examine it to provide some more information here.
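
A minimal sketch of the retroactive measurement being proposed: compute the gap between the virt-launcher pod going Ready and the VMI reaching Scheduled, purely from timestamps already on the objects. The kubevirt.io/client-go import path and the PhaseTransitionTimestamps field reflect one reading of the v1 API of this era and should be verified against the vendored types.

```go
// Sketch: derive one of the latencies discussed above from object status,
// with no new metrics required.
package main

import (
	"fmt"

	k8sv1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	v1 "kubevirt.io/client-go/api/v1"
)

// podReadyTime returns when the pod's Ready condition last became True.
func podReadyTime(pod *k8sv1.Pod) (metav1.Time, bool) {
	for _, c := range pod.Status.Conditions {
		if c.Type == k8sv1.PodReady && c.Status == k8sv1.ConditionTrue {
			return c.LastTransitionTime, true
		}
	}
	return metav1.Time{}, false
}

// vmiPhaseTime returns when the VMI recorded a transition into the phase.
func vmiPhaseTime(vmi *v1.VirtualMachineInstance, phase v1.VirtualMachineInstancePhase) (metav1.Time, bool) {
	for _, t := range vmi.Status.PhaseTransitionTimestamps {
		if t.Phase == phase {
			return t.PhaseTransitionTimestamp, true
		}
	}
	return metav1.Time{}, false
}

// reportPodReadyToScheduled prints one of the latencies proposed above.
func reportPodReadyToScheduled(pod *k8sv1.Pod, vmi *v1.VirtualMachineInstance) {
	ready, ok1 := podReadyTime(pod)
	sched, ok2 := vmiPhaseTime(vmi, v1.Scheduled)
	if ok1 && ok2 {
		fmt.Printf("pod-ready -> vmi-scheduled: %v\n", sched.Sub(ready.Time))
	}
}

func main() {} // wiring to informers or a YAML crawl is left out of the sketch
```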
A
I'm
thinking
it
goes
in
the
audit
tool,
because
what
I'm
thinking
is
that
we
can
actually
find
we
could
take
the
break
down
even
further
to
these
things,
which
I
would
be
really
interested
in
seeing
because
when
I
look
at
the
right
now,
when
I
look
at
scheduling
to
schedule,
I
can
see
the
time
and
and
it'd
actually
be
nice
to
see
like
even
more
like
what
went
into
you
know
the
scheduling
schedule,
because
it's
actually
not
hubert.
B
Yeah, everything you're saying makes sense to me. How are you measuring this latency? Is it exposed today in metrics?
A
This
latency,
the
the
four
I
don't
think
so.
No,
like
you
mean
like
in
like
in
either
in
cuba
or
in
humanities
in
some
way,
is
that.
B
Right
right
do
we
have,
I
guess,
a
way
of
detecting
this
today,
even
if
it's
complex
do
we
have
a
way
of
determining
this
has
occurred,
retroactively.
A
We're
retroactively
like
yeah,
so
it's
on
the
actual
objects
it's
on
the
animals
we
could
so
so
I'm
way
I'm
introducing
this
here
is
if
the
odd
tool
can
go
through
and
look
at
the
ammos,
the
vmi
animals
and
just
kind
of
look
through
a
bunch
of
them
crawl
through
a
bunch
of
them
and
and
dig
up
this
information
after
you
know
it's
after
the
vmi
is
running,
for
example,
but
we
could
do
metrics
on
this
too.
That's
that
that
might
be
possible,
because
these
are
also
events.
B
B
A
Necessarily
watch
like
I
wasn't
sorry,
I
wasn't
thinking
that
necessarily
it
would
watch
like
this
can
be
all
done
retroactively
just
to
clarify
like
this
can
be,
but,
like
you're
saying
it's,
they
aren't
metrics
today.
So
wouldn't
it
would
be
a
little
bit
different
than
what
two
is
currently
doing,
but
mainly.
B
How do you get around the problem that, if we're measuring time for shutting down a VMI or something like that, any time we delete the object, then we lose all that data?
A
Right
but
the
so
the
metrics,
let's
say
we
delete
the
vmis
and
then
we
parse
the
metrics
again.
Aren't
they
gone.
B
Metrics
stick
around
for
the
okay,
so
we
have
they
stick
around
forever.
I
mean
there's
a
there's
a
peer.
I
mean
that's
just
the
database
for
all.
B
Some
days
or
something
they
start
getting
purged
but
they're
going
to
be
around
certainly
longer
than
our
load
test.
A
Yeah
you're
right,
okay,
these
could
be
metrics,
I
mean
dude.
What
let's
I
mean,
I
kind
of.
May
we
go
down
that
path
like
does
do
we
s?
Would
we
ever
like
see
when
I
see
value,
but
we
have
do?
Would
it
make
sense
that
we
measure
this
kind
of
thing
inside
of
hubert,
that
we
I
mean,
I
think
we
can.
The
events
are
there
and
the
objects?
Are
there
they're
being
updated?
We
could
see
them
we're
already
watching
them.
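
If these latencies were measured inside KubeVirt instead, it could look roughly like this sketch with a Prometheus histogram; the metric name and buckets are hypothetical, and KubeVirt's real metrics registration differs in detail.

```go
// Sketch: expose the pod-ready-to-scheduled latency as a histogram metric
// rather than digging it out of YAMLs afterwards.
package metrics

import (
	"time"

	"github.com/prometheus/client_golang/prometheus"
)

var podReadyToScheduled = prometheus.NewHistogram(prometheus.HistogramOpts{
	Name:    "kubevirt_vmi_pod_ready_to_scheduled_seconds", // hypothetical name
	Help:    "Latency between the virt-launcher pod becoming Ready and the VMI entering Scheduled.",
	Buckets: prometheus.ExponentialBuckets(0.1, 2, 10), // 100ms up to ~51s
})

func init() {
	prometheus.MustRegister(podReadyToScheduled)
}

// ObservePodReadyToScheduled would be called from the controller at the
// moment the VMI transitions to Scheduled, using the objects' timestamps.
func ObservePodReadyToScheduled(podReady, vmiScheduled time.Time) {
	podReadyToScheduled.Observe(vmiScheduled.Sub(podReady).Seconds())
}
```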
B
This
is
giving
us
a
more
fine,
granular
understanding
of
what's
happening
between
vmi
transitions.
So
right
now
we
have
scheduling
our
schedule,
scheduling
to
schedule,
and
then
these
latencies
will
give
us
like
more
fidelity
between
those
and
same
thing
with
scheduling
to
running,
and
things
like
that.
I
think
the
question
to
me
is:
do
we
need
that
fidelity
yet
like?
B
If
we
we
find
that
we
need
a
very
specific
understanding
of
certain
latencies,
that's
more
than
the
like
the
fine
buck
or
the
the
kind
of
large
buckets
that
we
have
today,
then
that's
when
I
would
start
looking
at
at
these.
A
That's
kind
of
the
background
here
and
that
and
that's
kind
of
why
I
got
into
tracing,
to
figure
it
out
and
it
and
through
tracing
I
completely
eliminated
hubert's
work
queue.
It
was
not.
It
was
nothing
to
do
with
the
vertica
controller.
It
was
everything
to
do
with
was
actually
specifically
to
do
with
with
pvcs
and,
and
that
was
interesting
is
that
I
found
what
I
actually
found
is
that
keywords
work
you
is
is
executing
pretty
fast.
A
I
mean
it's
it's
almost
instantaneously,
and
but
I
wouldn't
when
I,
when
I
actually
look
at
this,
this
transition,
the
kubert's
work
queue
or
the
work
that
kubert's
doing
is
tiny
in
this
transition,
and
it's
getting
it's
kind
of
given
the.
Maybe
it's
given
the
wrong
impression.
If
you
don't
know
that
it's
giving
the
wrong
impression.
B
I
think
I
would
expect
that
so
scheduling
means
that
we've
posted
the
pod
and
scheduled
means
that
everything
between
scheduling
and
schedule.
My
expectation
is
that's
all
kubernetes,
because
that's
all
just
making
the
pod
run
somewhere
and
as
soon
as
as
we
see
it,
running
we're
just
sitting
at
the
schedule,
but
we
aren't
doing
anything
between
that
time
span.
A
Yeah
I
mean
it,
it
makes
it
makes
sense
to
me
like
it
like
right.
We
have
pods
going
and
pods
are
impending.
They
go
to
like
this.
The
pods
are
coming
up
during
that
time.
That's
the
majority
of
the
work,
but
it
was
helpful
because
I
actually
discovered
an
issue
in
the
process
that
I
found
that,
specifically
with
pvcs,
ended
up
being
the
case
and
there's
also
some
other
things.
I
found
like
the
the
amount
of
time,
even
network
assignment
node
assignments.
A
These
things
seem
to
be.
These
are
helpful
to
know
and
this,
and
that
was
the
other
thing
it's
like
there.
So
I
looked
at
other
metrics.
For
example,
scheduling
the
scheduler
has
a
an
end-to-end
pod
time,
which
was
helpful
and
that
it
could
like
it.
It
gave
roughly
a
gauge
of
of
what
to
expect
in
in
the
times,
but
it
also
didn't
talk
about
like
what
it
was
like.
What
what
is
going
into
this,
like,
specifically
to
me
like
I,
was
interested
in
the
pvc
time,
which
ended
up
being
really
slow.
A
What
ended
up
happening
so
it
basically
what
I
did
is.
I
went
through
the
scheduler
at
the
schedule
of
logs
and
noticed
that
the
the
time
it
took
for
pvc
to
be
to
be
allocated
and
a
node
assigned
was
was
very
long
and
then
found
out.
There
were
a
lot
of
pvcs
a
lot
more
than
expected
that
were
just
sitting
around
and
that
they
weren't
doing
anything
and
the
scheduler
kind
of
looking
in
the
code.
B
So
the
way
I
would
approach
this
is
we
want
what
we're
looking
for,
specifically
with
scheduling
the
scheduled
is
to
understand
how
long
the
kubernetes
part
is
taking
like
what
what's
happening
at
the
kubernetes
scheduling
layer.
So
I
would
investigate
if
there
are
any
metrics
related
to
the
kubernetes
scheduler,
that
we
can
start
introspecting
and
add
that
to
the
perfona
tool
to
give
us
more
understanding.
But
here's
also
the
thing
if
it's
outside
of
cube
vert.
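
A sketch of that suggested first step, assuming a reachable Prometheus endpoint: query the kube-scheduler's end-to-end scheduling latency so the audit tool can account for the Kubernetes part of Scheduling to Scheduled. The address is a placeholder, and scheduler_e2e_scheduling_duration_seconds matches schedulers of this era but has since been renamed, so check against the cluster's version.

```go
// Sketch: pull the scheduler's p99 end-to-end latency out of Prometheus.
package main

import (
	"context"
	"fmt"
	"time"

	"github.com/prometheus/client_golang/api"
	promv1 "github.com/prometheus/client_golang/api/prometheus/v1"
)

func main() {
	client, err := api.NewClient(api.Config{Address: "http://prometheus:9090"})
	if err != nil {
		panic(err)
	}
	// p99 end-to-end pod scheduling latency over the last five minutes.
	query := `histogram_quantile(0.99, sum(rate(scheduler_e2e_scheduling_duration_seconds_bucket[5m])) by (le))`
	ctx, cancel := context.WithTimeout(context.Background(), 10*time.Second)
	defer cancel()
	result, warnings, err := promv1.NewAPI(client).Query(ctx, query, time.Now())
	if err != nil {
		panic(err)
	}
	if len(warnings) > 0 {
		fmt.Println("warnings:", warnings)
	}
	fmt.Println("p99 scheduler e2e latency:", result)
}
```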
A
Yeah, and I think the reason, my reasoning, is that we've talked about some of the thresholds here; these are the kinds of things where we're setting expectations for whatever we expect Kubernetes performance to be, and there's that pod scheduling time metric that I mentioned from the scheduler. I think that's a big factor that was pretty well correlated to this time. Yeah, and then any of these other ones could just be...
A
I don't know, things that we can additionally see value in. I think some of the others, maybe, you know, if PVCs aren't one... I think one of the ones that would at least be interesting is this one here: when the pods go ready to when the VMI object switches to Scheduled.
A
Since that's the last step that we do, we expect that's in our control, and that's one that we expect to be really fast. But there could be... I mean, sometimes it takes a second; I've seen in some cases it takes many seconds.
A
This would also be useful to know, and this is actually, I think, one of the original reasons behind the QPS change: this gap was a little bit larger. I know this one could be another one that we could use. But anyway, I think, like what you said, maybe start with the kube-scheduler metrics as a first step on this; that might be an easy on-ramp.
A
Okay, next topic: tracing. I had mentioned previously doing some tracing, and I'm kind of looking for some ideas and opinions from folks. There's some work in the community around this, and I don't know how far we want to go with it, but there's also a fairly simple library that is really handy for making tracing work.
A
Then we spit it out into the log as an easy first step, and then maybe this could be something that we can look at later. But I don't know what people think; this seems the easiest way to start. I'm not that familiar with tracing, though, or with what Kubernetes is doing. I don't know if anyone else is.
A
Yeah, I think we already include it, and yeah, it's vendored. I didn't have to add it; it was already there. It's just... wow, yeah. So this is... wait.
A
Yeah
the
functions
and
everything
and
the
distractions
are
there,
we
basically
just
we
basically
just
need
to
put
them
in
the
right
places,
which
is
also
another
question.
That's
that
I
want
to
get
some
opinions
on,
because
so
let's
say
that,
let's
say
that
this
makes
sense.
Where
would
where
would
we
want
to
add
tracing
because
I've
got?
A
Yeah, it was already there. So just look for trace, or start trace, or step trace.
B
Sir,
what's
a
tracer,
let's
say
here,
I
was
looking
for
trace
yeah.
A
It's
in
utils
cameras.
He
tells
trace.
A
All right, so what do people think? Where do people want traces? Because I can do a few of these. The work queue seems to make sense to me, in a different way than I was doing it before, because I think the mistake I was making before was that I was measuring time between keys, which was actually measuring the time it took for Kubernetes to do its work; it was measuring Kubernetes's work.
B
What I'm most interested in is the VMI and the VM work queues in virt-controller and virt-handler. So, for example, if I look at virt-handler, that one's really interesting to me: to understand that the VM has been queued, and to understand where we're spending the most time performing work on it, up to the point where we return.
B
So
that
would
tell
me
that,
for
example,
let's
say
we
are
performing
vert
handler
work,
q
and
a
vm,
and
we
get
to
the
point
where
we're
syncing
the
vm
with
vert
launcher,
and
I
can
tell
that
that
function
is
taking
like
almost
a
second
or
something
like
that.
Then
that
would
tell
me
that
something's
happening
and
vert
launcher
is
causing
this.
This
grpc
call
to
block
for
longer
than
I
expected
things
like.
That
would
be
like.
I
have
no
visibility
into
that
today,
but
tracing
would
allow
us
to
to
do
that.
B
A
Okay, yeah. So what I'll do is... yeah, right when it's popped off the queue, I'll do basically what I did here: start the trace and add a bunch of steps in there, and then we can have a threshold. For what, I don't know; I'll just play around with it and see. Maybe a second or something; I don't know what we expect, but I'll throw some amount in.
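
A minimal sketch of that pattern with the vendored k8s.io/utils/trace package: start a trace when the key is popped off the work queue, mark steps through the sync, and log only past a threshold. executeVMI and its helper steps are hypothetical stand-ins for the real virt-handler sync path.

```go
// Sketch: trace one work-queue execution with k8s.io/utils/trace.
package main

import (
	"time"

	utiltrace "k8s.io/utils/trace"
)

func executeVMI(key string) {
	// One trace per work-queue execution, keyed by the object being synced.
	t := utiltrace.New("virt-handler vmi sync", utiltrace.Field{Key: "key", Value: key})
	// Placeholder threshold; tune it once real numbers come in.
	defer t.LogIfLong(time.Second)

	fetchVMI(key)
	t.Step("fetched VMI from informer cache")

	syncWithVirtLauncher(key)
	t.Step("synced with virt-launcher over gRPC")

	updateStatus(key)
	t.Step("updated VMI status")
}

// Hypothetical stages of the sync, standing in for the real calls.
func fetchVMI(key string)             {}
func syncWithVirtLauncher(key string) {}
func updateStatus(key string)         {}

func main() { executeVMI("default/testvmi") }
```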
B
So yeah, did you post your PR? I mean, I saw your code for some of this. Did you post it as a PR, or is it... no?
A
I'm going to redo it, because, like I was saying before, the way I have it now isn't correct: it's actually measuring the time that we're waiting in the queue for events from Kubernetes. That was the mistake I was making; I thought we were doing work or something during that time. That was not the case; we're actually just waiting for our informers to get work.
B
Great, yeah. And just do a really simple PR: maybe just do one trace with some steps in it. Pick one; virt-handler is great, virt-controller is fine, and the VMI or VM controller. And then document how to use it and everything, and that gives you a precedent that we can add more to afterwards.
A
Okay, that's all I had for topics. Do people have any other items we want to talk about?
B
...give us developer insight into how to do the tracing and the performance profiling, like getting pprof back. Those are going to be our two primary tools, I think, for improving the results.
A
Yeah, okay. So I'll take this one, and I'll do the tracing as well. For this one in the middle, I'm going to create an issue... actually, I think I'll add to this existing issue in the test framework; it has some ideas that we can look into. I'll mention the scheduler metric in there, and this is something we can consider as areas we can expand.
A
Okay, cool, all right. If there are no other topics, I think we're done. All right, thank you, everybody. All right, bye! Thank you. Bye.