From YouTube: Kubernetes SIG Node CI 20221012
Description
SIG Node CI weekly meeting. Agenda and notes: https://docs.google.com/document/d/1fb-ugvgdSVIkkuJ388_nhp2pBTy_4HEVg5848Xy7n5U/edit#heading=h.2v8vzknys4nk
GMT20221012-171300_Recording_1628x1120
A: Hello everybody, it's the SIG Node CI weekly meeting. It's starting late today because I was late, and we have very low attendance because of that; probably people dropped. So if you're watching the recording, we will be doing bug triage and test triage and looking at the state of our tests.
A: So let's dive right in. Yeah, I started tracking test stability in this spreadsheet, and what I was hoping to see is this kind of picture: green, green, and sometimes it will be non-green, and it will be a single problem.
A: But what happens typically is we have this kind of thing; for example, with CRI-O, when the problem is so big that it's just some tests, like eviction and performance, failing over and over again. But sometimes it's a different set of tests, like you see what happened here, and then it switched again. So maybe I will need to change the format a little bit. And I want to learn from others; I know that the Reliability Working Group is doing something on tracking job statuses.
A
Maybe
I
will
try
to
borrow
some
tuning
from
some,
but
in
any
case,
I
looked
at
tests
and
it
all
looks
exactly
the
same
as
classic
so
no
news
here,
but
I
actually
need
to
start
to
look
into
that
so,
okay,
this
is
it
and
from
triage
perspective,.
A: No, there doesn't seem to be anything new here. Okay, I looked yesterday at night, and there was nothing new for our group. Everything is here, and I also cleaned this up; it's only three right now. It needs some approval from Brunel, maybe. I think there are two from Francesca, and one is a functionality change, so it's a little bit different.
A: Okay, we have issues to go through as well. So if you want, I can just...

A: Hey Peter, you have two.
A
Okay
last
reminder
was
a
few
months
back.
A
On
June,
let's
see
if
it's
still
failing.
B: I have this tab open and it's in my backlog, but it's kind of at the bottom of it.
A: A bit updated, okay. And the next one, about the volume, is also assigned to you. So what was the original issue?
B: Yeah, I can keep it. Yeah, I'll keep it and try to get back to that one.
A: Yeah, and I'm looking through these not to poke at anything; I just want to see if there is something assigned that we don't plan to work on. Let's unassign everything we do not plan to work on.
B: Yeah, I think it'd be good if I work on this; I just have to find the time. Okay.
A
D
E
B
I,
don't
remember
exactly
the
state
of
the
test
now,
but
it
does
kind
of
unify
how
we
set
up
all
of
the
tests
and
I
suspect
it'll
help
the
swap.
A: Yeah, if you or somebody on the call needs some more tasks, let me know; we have plenty here.
A: Okay, going back to the agenda, nothing new. Brian, I see you joined the call. Do you want to talk a little bit about this performance test that you've been trying to get working?
D: Yeah, I'll just give you a quick update. The most important update is that I finally found a system where I can properly test the multi-arch building of images, and I tested the PR. I put up a PR this morning to fix it, and then I got this new system and tested the PR. It's still not good enough, so I'm going to keep slugging away at it, but at least I have a good system now where I can iterate on this thing.
D: I learned a lot about that, yeah. Okay, there's an amusing thing for you: you know the previous PR I put out got reviewed, and none of us seemed to know the procedure, because I failed to update the version number of the intended image. Apparently I was supposed to bump it myself in a file, so that'll be in the... that's in the PR that's currently blocked, and then the system...
A: Thank you. Okay, if nothing else for tests, I will... oh, by the way, I'll just quickly mention the test tags. I think I gave up on this last time; last time it wasn't accepted, but now it's fully accepted, so it'll be implemented. And there is a PR out... we switched to Ginkgo v2, and Ginkgo v2 supports tags that are not a text hack but actual tags that are associated with the test.
A
So
there
is
a
PR
out
by
somebody
to
switch
to
the
stacks,
and
this
PR
introduces
all
this
new
labels.
So
once
this
is
intends
we
can
start
switching
test
to
actual
like
test
tags
rather
than
like
text
description
with
like
thingy
like
music,
such.
A
Okay,
and
with
that,
we
will
go
to
bug
triage.
A
A
I
think
this
one
is
bitter
and
Ryan,
since
you
want
to
call
do
you
want
to
discuss
this
one,
it
was
reopened.
A: I already have it open, so I can comment on it. Okay, so I think I just wanted to reiterate that it feels like there is some problem with cgroup v2: some folder is being removed that shouldn't have been removed yet, and somebody complained that they have too many of these messages. So the...
A: The fix was to just change the verbosity level to four, preventing this message from being output. My worry with that is that, since we don't know the root cause, and we know that this message was written so many times, over and over again, it feels to me that if a customer has a node with pods being created and deleted very often, then they'll accumulate so many...
A
This
cleanup
workers
that
it
will
be
it
will
lead
to
some
boom
problem
and
like
equivalent
boom
killed,
and
my
problem
is
that
is
we
hit
the
error
message,
so
we
don't
even
know
why?
It's
not.
Why
like
what's
happening,
so
there
is
no
indication
that
it's
about
to
get
bad,
but
it
will
go
bad
after
some
time.
E: Well, it'll stop once the volume gets cleaned up, and so because...
E: ...that's a different problem versus the original report, because the original report came in from Daniel, and I don't think he mentioned that there was a memory issue.
E: Because I know how the worker works: it's streaming, it's writing to the log file, and so the patch that bumps the log line up to V6 won't print that message anymore. So we won't get those extraneous logs. The logging itself shouldn't be leaking memory.
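As a rough illustration of the verbosity mechanism being discussed (the exact kubelet call site and message text are not reproduced here; the log line below is a stand-in), a klog call gated at a higher verbosity level stays in the code but is only emitted when the process runs with a high enough `-v`:

```go
package main

import (
	"flag"

	"k8s.io/klog/v2"
)

func main() {
	// klog registers the -v flag; default verbosity is 0.
	klog.InitFlags(nil)
	flag.Parse()
	defer klog.Flush()

	// Always printed.
	klog.InfoS("pod worker syncing", "pod", "default/example")

	// Only printed when the process is started with -v=6 or higher.
	// Bumping a noisy message to V(6) hides it from default logs
	// without removing the code path that emits it.
	klog.V(6).InfoS("noisy cleanup message (stand-in for the actual kubelet log line)", "pod", "default/example")
}
```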
A: But we'll accumulate the workers that used to do this logging, right?
A: ...but we will accumulate workers that used to do this logging, so they will keep executing this piece of code, and I just don't know if... if you can confirm that this will stop at some point. If there is no stop to executing this line of code, and there is no way to recover from that, then the kubelet will accumulate something internally in its state.
E: Here the logging, the log message, is on a non-error path, and so I didn't see anything in this bug report saying that the kubelet is not behaving correctly.
A: Yes, exactly, and I'm trying to understand: if there is no exit from that, then I think... yes, I think I'm conflating this with a report where somebody else was complaining about a similar problem on cgroup v2, but they were saying that their node is running pods, like creating and removing pods all the time, so they get there really quickly.
B: And is this happening for the pods or, like, the container processes, or is it happening to the kubelet?
B: Right, so we'd have to look at the code path. I would more expect the kubelet spinning on this to overuse CPU and not memory; I wouldn't expect any memory to be accumulating for it unless, like, the runtime has a stack that's growing indefinitely and that is taking up a bunch of memory. But I could see it, like, this...
D: It sounds to me like Sergey is saying that for each one of these churning pods there's going to be a new worker that's going to start up, and it's going to continually try to delete something it can't delete, and we just won't see it anymore. That would be a source of CPU churn, but also memory for every worker.
A: ...which is also not very good. I mean, just wasting it on something that will never clean up, with no indication that it's happening, is also bad.
B
Yeah
I
do
agree
that
it's
it's
bad
I,
just
you
know
it's
not
the
the
connection
is
necessarily
clear,
though
it
is
I
think
it
would
be
fairly
easy
to
reproduce.
If,
if
we
like,
you
know,
run
the
cubelet
create
100
Bots
delete
the
path
in
each
of
these
then
delete
the
pods
and
then
do
that.
You
know
a
couple
of
times.
You
should
see
a
linear
memory
growth.
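A minimal sketch of that kind of pod-churn reproduction using client-go, under assumptions: a cluster reachable via the default kubeconfig, invented pod names, namespace, and counts, the "delete the path" step omitted, and kubelet memory sampled separately (e.g. from node metrics) between rounds:

```go
package main

import (
	"context"
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Load ~/.kube/config (assumes the cluster under test is the current context).
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)
	ctx := context.Background()

	const rounds, podsPerRound = 3, 100
	for r := 0; r < rounds; r++ {
		// Create a batch of short-lived pods...
		for i := 0; i < podsPerRound; i++ {
			pod := &corev1.Pod{
				ObjectMeta: metav1.ObjectMeta{Name: fmt.Sprintf("churn-%d-%d", r, i)},
				Spec: corev1.PodSpec{
					RestartPolicy: corev1.RestartPolicyNever,
					Containers: []corev1.Container{{
						Name:  "pause",
						Image: "registry.k8s.io/pause:3.9",
					}},
				},
			}
			if _, err := client.CoreV1().Pods("default").Create(ctx, pod, metav1.CreateOptions{}); err != nil {
				panic(err)
			}
		}
		// ...then delete them again so the kubelet's pod workers churn.
		for i := 0; i < podsPerRound; i++ {
			name := fmt.Sprintf("churn-%d-%d", r, i)
			if err := client.CoreV1().Pods("default").Delete(ctx, name, metav1.DeleteOptions{}); err != nil {
				panic(err)
			}
		}
		// Between rounds, sample the kubelet's RSS; roughly linear growth
		// across rounds would support the leak theory described above.
	}
}
```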
A
Yeah
that
may
be
a
good
course
of
action
and
with
regards
to
just
CPU
churning,
since
we
don't
have
any
like
do,
we
have
any
way
out
of
this
state
since
it's
already
been
deleted
like
is
there
any
way
to
recover?
If
not,
then
maybe
we
can.
We
can
just
detect
that
it's
not
recoverable
or
something.
B
I'm
pretty
sure
we
dropped
a
lot.
Did
we
drop
the
log
level,
because
this
is
an
expected
state
where
the
Pod
worker
is
racing
with
the
volume
manager
and
the
volume
manager
tore
down
the
volume
because
or
the
Pod
is
in
the
process
of
being
tore
down.
But
the
Pod
worker
is
like
still
looping
on
it.
Correct.
A: Okay, I think we can ask the topic starter to see if it's the same ID here over and over again, because if it is the same ID, then it's never going away, right?
A: Yeah, I'm just afraid that we don't have any idea internally... like, I think we have a lot of people reporting the problem, but I haven't seen any of them reproduce it locally. I can ask David Porter as well.
A: And I'm not sure about the priority, as it sounds super critical, but we can change it later if needed.
A: So, allocated devices with this resource... probably this isn't new.
A: It sounds like a bug, actually; we need to keep them consistent, but I think it's very low priority.
D: Sounds like it clears itself up after a moment anyway.
A: And I don't want to mark it as "help wanted", because I don't think it will be a trivial change.
A: Yeah, what I'm trying to understand first is: is it a bug, or is it a documentation problem?
C
Yes,
yes,
that's
very
controversial,
so
current
implementation
of
popular
CRI
implementation-
just
add
supplemental
groups
defined
in
pod
to
the
group
information
defined
in
the
container
image.
C
So
if
cluster
administrator
secure
the
supplemental
groups
field
in
PSP
or
other
policy
engines
that
can
be
bypassed
very
easily
and
in
using
host
path
volumes
in
the
Pod,
so,
as
you
know,
host
path
volume
is
predicted
by
Legacy
uid
GID
information.
C
So
bypassing
group
information
leads
security
breaches.
So
we
recognize
that
this
is
very
security
issues.
However,
security
committee
answered
works
as
intended,
and
also
we
can
read
that
definition
of
supplemental
groups-
field
yeah.
The
description
can
be
read:
kubernetes,
just
as
supplemental
groups
gids
to
the
Divine.
The
group
information
defined
in
the
container
image.
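To make the field under discussion concrete, here is a hedged sketch (pod name, image, path, and GIDs are invented) of a pod that sets `securityContext.supplementalGroups` alongside a hostPath volume; the point is that the runtime merges these GIDs with whatever group membership the container image already defines, which is the behavior being debated:

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/yaml"
)

func main() {
	hostPathType := corev1.HostPathDirectory

	pod := corev1.Pod{
		TypeMeta:   metav1.TypeMeta{APIVersion: "v1", Kind: "Pod"},
		ObjectMeta: metav1.ObjectMeta{Name: "supplemental-groups-demo"},
		Spec: corev1.PodSpec{
			SecurityContext: &corev1.PodSecurityContext{
				// GIDs listed here are added to the groups already granted by
				// the container image (the image user's /etc/group membership).
				SupplementalGroups: []int64{60000},
			},
			Containers: []corev1.Container{{
				Name:  "app",
				Image: "registry.k8s.io/e2e-test-images/busybox:1.29",
				VolumeMounts: []corev1.VolumeMount{{
					Name:      "host-data",
					MountPath: "/data",
				}},
			}},
			Volumes: []corev1.Volume{{
				Name: "host-data",
				VolumeSource: corev1.VolumeSource{
					// hostPath contents are protected only by the files' existing
					// UID/GID ownership, which is why the merged group set matters.
					HostPath: &corev1.HostPathVolumeSource{
						Path: "/var/lib/demo",
						Type: &hostPathType,
					},
				},
			}},
		},
	}

	out, err := yaml.Marshal(pod)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```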
C: If we recognize this behavior as a bug, we should fix it in many CRI implementations. However, if we recognize that this behavior works as intended, I propose to...
A: I mean, we can generally do this one first, because we need to explain what's happening today. I don't think that, even if you implement a new field, it will be backported into previous versions.
A
So
I
would
say
it's
like
we'll
start
with,
probably
with
cap
process,
if
you're
familiar
with
cab
processes
is
a
process
to
produce
new
features
in
kubernetes,
and
it
will
take
some
time
so
I
think
if
you
want
to
take
this
one
or
like
just
improve
documentation
and
explain,
what's
happening,
it
will
be
great
and
then,
if
we
need
to
do
that,
then
it
will
be.
It
wouldn't
be
a
bug
fix.
A
It
will
be
a
new
feature,
especially
when
60
security
already
said
that
it's
expected
Behavior
yeah
argue
with
them
so
yeah.
If
you
want
to
take
it
on
you
to
improve
documentation
and
write
a
blog
post,
it
will
be
super
great
I
think
it
will
help.
C
Yeah,
okay,
so
let
me
open
the
pro
request
to
improve
the
API
definition
first
and
then
and
then
I
will
plan
to
write
cap
or
something
foreign.
A: Okay, we're out of bugs and we're out of time. If there are any last-minute notes, please let me know; otherwise, bye, everybody.