From YouTube: Kubernetes SIG Node 20220823
Description
SIG Node weekly meeting. Agenda and notes: https://docs.google.com/document/d/1Ne57gvidMEWXR70OxxnRkYquAoMpt56o75oZtg-OeBg/edit#heading=h.adoto8roitwq
A
So if I understand correctly, you just want to update the community about some delay, because it's summer time and a lot of people take vacations. Is that right? So you want to update and change that.
C
Yeah, the next one's mine. I wanted to get some background from various stakeholders. We have some customers that mount /var/lib/containers and /var/lib/kubelet onto different disks, and the ephemeral storage reported by the kubelet apparently reflects the root file system instead of /var/lib/kubelet or /var/lib/containers. I was wondering what the background of the ephemeral storage feature is, and whether it makes sense to report more accurate information on those two mount points.
D
My memory on this, Ryan, is that the kubelet believes the disk that /var/lib/kubelet is mounted to is the root fs, and it would only do tracking there, and then for container writable layers.
D
If the container runtime had reported a different location, the writable layer may not have been accounted for. But the root of it was that the aim was to just not have to support too many complex layouts.
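To make the mismatch concrete, here is a minimal sketch, illustrative only and not kubelet code (the paths and helper names are assumptions), of how a node could be checked for the layout described above, where /var/lib/kubelet or /var/lib/containers sits on a different disk than the root filesystem:

```python
import os

ROOT = "/"
# Directories the discussion concerns; illustrative, not exhaustive.
CANDIDATES = ["/var/lib/kubelet", "/var/lib/containers"]

def same_filesystem(a: str, b: str) -> bool:
    """Two paths share a filesystem iff they report the same device ID."""
    return os.stat(a).st_dev == os.stat(b).st_dev

def capacity_bytes(path: str) -> int:
    """Total size of the filesystem backing `path`."""
    st = os.statvfs(path)
    return st.f_frsize * st.f_blocks

for path in CANDIDATES:
    if os.path.exists(path) and not same_filesystem(ROOT, path):
        # Capacity derived from "/" would not describe this directory,
        # which is the misreporting described above.
        print(f"{path}: separate disk, {capacity_bytes(path)} bytes "
              f"(rootfs reports {capacity_bytes(ROOT)} bytes)")
```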
C
What is correct here? Should the kubelet be extended to support more ephemeral locations, or should we restrict it to only the root partition? Because there are a lot of customers that are mounting these disks differently, and they're not getting the correct reporting from the kubelet on this.
D
I wonder if any other providers on the call might be dealing with users that don't always agree with the existing default project posture, or are getting pushed to support more flexible disk layouts. Even for Red Hat, we say this is our layout, and then we'll run into field folks who might change it and not read our own documentation, which leaves Ryan with these problems, I guess. But usually there's some local reason or local policy that makes the customer think they need to do something different than what the project advocates.
A
I also want to say that that was actually the original assumption. At least at the time we started, the design intent was to simplify the problem; that's why we did it that way. That was our assumption about how the node is configured. But I do hear people saying that maybe, like you say, we want to rethink this one. I just want to say that's the assumption we made back then.
C
Well, maybe if we can take the discussion to the issue, that'd be best. I just wanted to raise it for awareness.
A
Thanks, Ryan, for raising this issue. Actually, now is the time, Alexander: if you have something, can you comment in the issue Ryan created, so it can get some attention? We're looking forward to more feedback on it as well. If the documentation makes that assumption and how the node is configured unclear and causes some confusion, then we fix the docs. If people think that right now we should expand it, and that the assumed Kubernetes layout doesn't meet today's needs, that's a different change.
A
Thanks.
F
Yeah, hi Dawn. An update on In-Place Pod Vertical Scaling: the PR for the KEP got merged, so documentation-wise and record-keeping-wise we should be good for that KEP to be targeted for the v1.26 milestone. I also created a separate PR for the API changes. It has already been reviewed and hasn't changed in a while, except for rebase fixes that keep happening from time to time.
F
If there are no objections, I'm wondering if we can look into merging this early. That would save me the headache of doing rebases, and then I can focus only on the main mothership PR, which currently carries the scheduler and the kubelet implementation. I did get a chance to work on the cgroup v2 implementation last weekend, and the CI passed.
F
I tried it out a few more times, on both cgroup v1 and v2 setups, in GKE-deployed clusters and then a prod cluster with multiple nodes. I think I want to get a couple more rebase runs through the CI before I feel fully confident about it, but it can be reviewed at this point. Bo had initially suggested a few changes there, and I think I took most of his suggestions.
F
Yeah, I think it's too early. I don't have a lot of data at this point, but I feel there might be some bugs on the cgroup v2 implementation side. Do you think there are, or do you feel it's stable?
F
Or whether memory limits are being enforced; that's the question I have.
F
Yeah, no, I've just been experimenting with a little demo setup. This is for KubeCon, and I think our idea was to use eBPF to detect certain workloads, like this remote dev environment in a pod. In those cases you can know ahead of time, in a deterministic way: when you're writing code inside that pod it doesn't take many resources, but the moment you hit a make command, or you want to run a bunch of tests...
F
...you immediately need more resources. So that's the idea: you can detect that event with eBPF, and you can potentially trigger VPA or directly ask for more resources, and if the capacity is there, we can grant it. I tried it out with v1: without this eBPF code enabled, the pod was killed as expected, and with it enabled I didn't see that happen. So I'll do a few more tests; I think the earliest chance I'll get is this coming weekend.
F
Figure out something, yeah. I think I can share my code as well; it's very simple Python code that just attaches an eBPF program to log traces at exec. It's very simple, so it shouldn't be much to set it up and try it out.
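For reference, a minimal sketch of that kind of exec tracing with the BCC toolkit; this is an assumption on my part, not the demo code mentioned above, which was not shared in the meeting:

```python
#!/usr/bin/env python3
# Minimal BCC sketch: print a line whenever any process calls execve.
# A demo like the one described could match commands such as `make`
# and then trigger a VPA recommendation or an in-place resize request.
from bcc import BPF

prog = r"""
TRACEPOINT_PROBE(syscalls, sys_enter_execve) {
    char comm[16];
    bpf_get_current_comm(&comm, sizeof(comm));
    bpf_trace_printk("exec by %s\n", comm);
    return 0;
}
"""

b = BPF(text=prog)
print("Tracing execve... Ctrl-C to stop")
while True:
    try:
        # trace_fields() yields (task, pid, cpu, flags, timestamp, message).
        _, pid, _, _, _, msg = b.trace_fields()
        print(pid, msg.decode())
    except KeyboardInterrupt:
        break
```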
Okay, so yeah, thanks. Please review that. The other question I had as I looked through implementing this: I was wondering whether this should be delegated to the CRI. We delegate the pod sandbox setup and teardown to the CRI; why not this?
F
Yeah, yeah. The thing is, right now we're writing an adapter that determines which cgroup file to update. If you're increasing memory, then in cgroup v2 you're writing to memory.max; if it's v1, you're writing to memory.limit_in_bytes to update that limit. I wonder if the kubelet should be doing this portion of it at all, then.
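As a rough illustration of the adapter being described, a sketch with assumed paths (the real implementation lives in Go inside the kubelet and runtime):

```python
import os

# On a cgroup v2 (unified) host this file exists at the hierarchy root.
CGROUP_V2_MARKER = "/sys/fs/cgroup/cgroup.controllers"

def memory_limit_file(cgroup_dir: str) -> str:
    """Pick the memory-limit file name for the host's cgroup version."""
    if os.path.exists(CGROUP_V2_MARKER):
        return os.path.join(cgroup_dir, "memory.max")             # cgroup v2
    return os.path.join(cgroup_dir, "memory.limit_in_bytes")      # cgroup v1

def set_memory_limit(cgroup_dir: str, limit_bytes: int) -> None:
    """Write the new limit to the version-appropriate cgroup file."""
    with open(memory_limit_file(cgroup_dir), "w") as f:
        f.write(str(limit_bytes))
```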
F
For the pods, okay. And the reason we need to do this separately is that if you're net increasing the container resources, then you need to update the pod cgroup first and then increase the containers'. The order in which you call the CRI for the containers in the pod versus increasing the pod matters in this case. But then having the kubelet do this seems kind of asymmetric, or a little off.
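The ordering constraint just described, as a sketch reusing the hypothetical set_memory_limit helper from the previous snippet: a child cgroup's limit must never exceed its parent's, so a net increase is applied top-down and a net decrease bottom-up.

```python
def resize_pod_memory(pod_cgroup: str, container_limits: dict[str, int],
                      new_pod_limit: int, increasing: bool) -> None:
    """Apply new memory limits in an order that keeps children <= parent."""
    if increasing:
        # Grow the pod (parent) first, then each container (child).
        set_memory_limit(pod_cgroup, new_pod_limit)
        for cgroup_dir, limit in container_limits.items():
            set_memory_limit(cgroup_dir, limit)  # via CRI in the real flow
    else:
        # Shrink each container first, then the pod.
        for cgroup_dir, limit in container_limits.items():
            set_memory_limit(cgroup_dir, limit)
        set_memory_limit(pod_cgroup, new_pod_limit)
```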
D
Yeah, so just historically, Kubernetes had challenges with container runtimes being pod-aware, I guess, and so the kubelet organically became the manager of pod cgroups. It was also the manager of the QoS class hierarchy above the pod cgroups, and it had basically just delegated the container cgroup to the container runtime part.
D
The whole thing would need to go to the CRI then, okay. And a related difficulty that would come with that: when we do out-of-resource handling, the kubelet is the one that's responsible for that right now, and so, that's right, it has to read those cgroup levels itself.
D
So anyway, it's just a complicated topic when you unwind it, and it really just comes down to: should the kubelet do any cgroup management at all? Right now the answer is that the kubelet does all cgroup management except that which is delegated, and it is only delegating the container portion.
D
Not even that. You had some resources that were only accounted for because they were used in volumes, which is what huge pages was doing, and it needed to span the life of any container, and it just gets very complicated. So I appreciate why you're asking the question, and gradual evolutions of it might make sense given where we are now; it's just that, historically, that's why we are where we are.
H
Exactly right. We are putting a new set of services down lower, for sandboxing, inside of containerd, so that we can refactor how we're doing pod support today, and containers on those pods, to the point where we should be able to support microVMs.
H
Additionally, going forward, that's going to require additional integration work with the kubelet. So I think it's best to talk about how we do that at the pod level, with some kind of common sandboxing service support, with quality of service as required.
A
Thanks. And the next one is actually one I put there. Wenjing and I recently went over our CI tests, because I think a year ago we had really terrible CI health and flaky-test issues. So then we started the CI subproject, and thanks to Sergey and Elana we achieved a lot of progress over, I think, more than one year, and also Daniel.
A
I think Daniel is not here today. But recently, because both Sergey and Elana took some time off, Wenjing and I have been spending the time, and there's also a group of engineers, like Wenjing's team, still monitoring those critical test jobs. And we found several things falling apart. So that's why we want to bring this back to the community and get everyone's attention. Wenjing, you have the co-host; maybe you want to talk.
I
Can anyone help me share the screen? I think the major thing I want to call out is that a few testgrid jobs are consistently failing, and maybe it's worth looking at them together to see if they are still relevant, or whether they're already covered by another testgrid job.
I
So we can do some cleanup; and if they do have some coverage, if they're the only one covering a certain feature or area, then maybe we can work together to make them green again, as the next focus of the CI subproject. I want to hear feedback from the group.
A
At least I want to mention that, because when I poked around, there's the one swap job that has been constantly failing for at least the last two or three months, as far as we can tell. So what should we do? Do we have other swap-related tests somewhere? And it seems to be failing particularly on image-related stuff, from what the failures suggest.
A
So we just have to find someone to look into this one. And also, there are comments in the CI spreadsheet where people mentioned that it looks like a test infrastructure problem. So I'm just wondering, since people are working in the CI subproject: many comments just say, oh, this is just missing artifacts. We should bring this to the SIG Testing group, because that looks to me like a test infrastructure problem, rather than the tests themselves needing a fix; it's sending a negative signal because of a test infrastructure problem.
A
Well, Brad, thanks for this update. Do we have like one bug that basically summarizes that those test artifacts are missing, and then assign it to the team? So we can at least signal the community, and at least those of us like Wenjing and myself can chase it, because those failures give us no signal about our quality, right?
A
So that's why at least Wenjing and I can work together with you, and keep chasing, and make sure those problems are really addressed.
A
I think that's one of the top asks from me to look at. Of course, we still have the flaky tests, but for flaky tests at least a reasonable engineer can work them out. A test infrastructure problem is another matter; we need to treat it as the top priority and address it. And another thing is that swap test.
A
Looking at that swap test, some of it is caused by the same reason, the test infrastructure missing artifacts, but some is not. The only pattern apparent to me is that it just always fails. So effectively we don't have a signal for swap right now, right? And of course we have also planned to promote it.
G
Sure, Dawn. I took a quick look, and it looks like it's running into an SSH issue. It may be similar to things we have fixed on CoreOS earlier, so maybe Peter or Herschel can take a quick look, and we can get back next week on what's happening with that job. Sure.
A
Thanks for the update. It just took me some time to find the item about it, but yeah, I can help with it. Thanks, Manu and Ruben, on this one, and also Brian, thanks for the updates. I will put your comments into the agenda, and I propose that next week, if we have some data, we can update the community. I'm gonna do the same. Thanks.