From YouTube: Kubernetes SIG Node 20210915
Description
Meeting Agenda: https://docs.google.com/document/d/1j3vrG6BgE0hUDs2e-1ZUegKN4W4Adb1B6oJ6j-4kyPU
B: First item, from archon: he cannot be here. Apparently there is some holiday in Israel, yeah.
C: Dynamic kubelet config has some interesting side effects right now. I've been analyzing a lot of our test failures lately, and it seems like a bunch of other flakes are caused by dynamic kubelet config restarts being flaky. Yes.
C: I have spent like 30 hours of my life in the last two weeks debugging those kinds of things, so I think pulling them out of serial right now is going to be a waste of our time.
A: I think what we can probably do is, within test-infra, have a default kubelet config that we write to disk on the node. We can define that in test-infra, and then, if we need to change it in a test, we can just overwrite that file and tell the kubelet to restart, or whatever; of course, you know, the test process runs elsewhere. So that's effectively what dynamic kubelet config does anyway, just jankily.
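A minimal sketch of that approach, assuming a default KubeletConfiguration file that test-infra writes to the node before the suite starts; the file path and field values below are illustrative assumptions, not the actual test-infra defaults:

```yaml
# /var/lib/kubelet/config.yaml: hypothetical default written by test-infra.
# A test that needs different settings overwrites this file and restarts
# the kubelet, instead of going through dynamic kubelet config.
apiVersion: kubelet.config.k8s.io/v1beta1
kind: KubeletConfiguration
cgroupDriver: systemd
failSwapOn: true
serializeImagePulls: false
```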
A: It is true that there's a bunch of stuff you can't actually set when you invoke the tests, because if a thing is not a command-line flag, you have to actively go and change the code, you have to go and add a dynamic-config whatever, to change how the kubelet in the test harness runs. So that sucks, and that's something we should fix, because if something gets added to the kubelet config and there's no corresponding command-line flag added, which is pretty common, then you can't set it. So that should just be fixed overall, and I think there's a giant refactor we need to do this release, and we should get it done.
B: Yeah, I also plan to look at that, so maybe, if I have time before you come back, I will share my progress.
A: Do you want me to file an issue for that and cc you and Danielle? Because, having dug into this a ton in the last release, I think I probably have all the context for what we need to get rid of.
A: Okay, yeah. I have now actioned myself on this one. I think we shouldn't just pull the tests out, because we still need to test those things, and there's a lot of stuff in the test suite that relies on this. We just need to refactor it; we need a better way. We're definitely limited.
A: I think, right now, from having not invested in this in quite a long time: because everything in the test suite assumes that we're configuring things via extra command-line flags passed to the kubelet, there's no way, without literally changing the test code, to set kubelet config stuff, and that's not great.
B: So once we do that, and we have... I'm curious, Daniel, whether you looked at which other tests were failing. Was this test in serial, or were they regular? I mean: did they execute in parallel with other tests? Because once...
C: They already do restarts as part of dynamic kubelet config; a bunch of stuff already restarts. A lot of the flakes in other tests that happen because of dynamic kubelet config partially also happen because, after the restart, stuff can't always connect to sockets and so on. But by moving away from dynamic kubelet config, where a bunch of stuff happens out of band, we can start having useful logs for that within tests, to actually start finding some of those problems.
C: That has a couple of things that will probably help with that one. Once the eviction PR that I've had open for a bit lands, we should be able to get rid of the eviction job and move that into serial. Plus, there are a couple of very wasteful ways we use resources right now: despite the fact that only one test uses GPU devices, we run every test on a node with GPU devices. So we might actually benefit from splitting some stuff out a little bit, mostly to be less wasteful.
A: So I recently looked at this issue. Yeah, Aaron suggested that we just use the normal GCE pool, not a separate node pool.
B: Yeah, when I talked to Ben, he said the most we can do is combine tests together, more and more.
C: So much better than multiplayer Outlook. Multiplayer Word, though, like Office 365 multiplayer, is never fun.
A: Yeah, sometimes that gets a little flaky. So I guess, going back to our next issue here: migrating prow jobs to community infra. Now, do you want to tell us about this? Because I am not on the up and up for this one.
E: So we have... we need to migrate all the prow jobs for all the SIGs. So, basically, I'm going SIG to SIG to start the migration process.
E: Yeah, oh, so the question about this: we basically have a project in the Google org with a specific GCE image, but we don't know if it's still...
C: As far as I can tell, we haven't actually used the node e2e images project in at least two years. I might have missed something in test-infra, but everything uses stuff from upstream projects directly.
C: I mostly lost like three hours trying to go list stuff in that project before realizing I didn't need it in the first place.
A: Danielle, what email should I use when I add you to the Google Doc?
G: Sure. So, last time, I was making some changes to the... I think it was related to the node feature tags. We discovered this job, which apparently is never run, and then I did some digging; in the last comment you'll find that this is just a duplicate job. We already have one that runs as a pre-submit.
A: I haven't looked at this one, so I don't know, but if they run the same test... oh, you say one is a PR job and the other is a periodic job. That's normal. So, in...
A: To be able to run something on a PR, you have to have a separate job, because that's how prow works.
A: And I guess the one thing we want to check, if we do get rid of both, is: I think that's the only thing that pulls in stuff tagged NodeAlphaFeature. I don't know how many of those exist, but those should probably be tagged Feature, not NodeAlphaFeature. And, yeah, it looks like there's a bunch of stuff running in there.
A: I think I'll comment on this issue and discuss, because I agree: I don't think this should be a separate job. I think we should just use the same alpha job that we use for the rest of the project, because there is already one, and I think it runs on every PR by default, like it's a required test, whereas this one is not required; this is a separate pre-submit that clearly runs. And I guess the other...
A: I'm curious why the pre-submit failed the way it did, because it looks like it was an infra problem.
G: Yeah, yeah, it kept failing, and instead of just trying to find out the problem, we just decided to figure out whether or not it really is important, and whether we should keep it or not.
A: If, in theory, the periodic is identical to the pre-submit: I think it's generally good practice for us to have a pre-submit for every periodic, because we are finding that, you know, we want to, for example, run, say, the kube serial test; that periodic was not defined, and it can be really annoying if you think "I want to see if this test passes" and you can't actually run it on the PR; you have to merge it and wait for the periodic to run. So, in this case...
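For reference, a sketch of how that pairing looks in prow's config: a PR-triggered presubmit and a scheduled periodic are separate stanzas even when they run the same test content. The job names, image, and interval below are illustrative, not the real job definitions:

```yaml
# Hypothetical presubmit/periodic pair running the same test content.
presubmits:
  kubernetes/kubernetes:
  - name: pull-kubernetes-node-serial   # runs on PRs, on demand
    always_run: false
    optional: true
    spec:
      containers:
      - image: gcr.io/k8s-staging-test-infra/kubekins-e2e:latest
        command: ["runner.sh"]

periodics:
- name: ci-kubernetes-node-serial       # same tests, on a schedule
  interval: 4h
  spec:
    containers:
    - image: gcr.io/k8s-staging-test-infra/kubekins-e2e:latest
      command: ["runner.sh"]
```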
A: Because I think this is true, I'll write a comment on this one and update it, saying as much. I hadn't seen this issue, so...
A: So it looks like we're almost at the end of our agenda, other than bug triage. Is there anything else that anybody wants to discuss that we didn't add to the agenda?
A: So Sergey and I are the sub-project leads of node CI, so you could ping either of us. You can probably also just post something in #sig-node, because I'm sure it's not just us who want to know about it. Hopefully so.
A: Sergey, do we want to go check and see if there are any action items from previous meetings that we forgot to follow up on? I see one comment. Oh, I see, yeah.
B: Still, yeah, I still need to, like, go to the architecture thingy and...
C: And also the eviction test PR.
B: Yeah, I'm not sure whether Imran finished the PR; I haven't seen updates there.
A: I think we should triage-accept this one. You can also assign me.
B: I think we did something similar for the last release, when we stopped calling probes on certain events, right?
A: Maybe... let's not. Well, I think Clayton assigned it to himself, so we can just triage-accept this one.
A: They didn't include a lot of stats about their system. I mean, that to me sounds like something where the kernel is preempting.
B: Yeah, I thanked Francesca, but you will ask.
A: I don't think that's a bug; I think that's just the reality of whatever version mismatch they're using.
A: All of the... I didn't pull this one onto the board. Everything that is... oh, maybe I did. Anything that's a linked repo, so I think testing, the website, etc.: anything that files a bug against node goes to our board, so that way we don't forget about them.
A: We should triage it. I think we do want to fix this. That's broken in the docs, or at least it's wrong on...
A: No, that's the CNCF Slack, slack.k8s.io, I think.
J: So, essentially, if you run a lot of pods on a single node with lots of subPaths and two EFS volumes, the node will eventually crash. I noticed two things. One was, we'll get this error message: "unable to attach or mount volumes."
J: The second was, we saw a lot of dangling mount points. So, if you scroll up, I think there's a wc -l, even more at the top; I don't remember where, but yeah, this part shows a lot of dangling mounts. And at the very end, if you look at it, what I found out was: right now, my theory is that, basically, because a lot of pods are being scheduled with subPaths at the same time, what is happening on Linux is that /proc/mounts is getting modified constantly, and because of that we are getting inconsistent reads for these mounts, and that essentially takes down the entire node, you know.
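A reduced sketch of the kind of workload being described: each subPath mount becomes its own bind mount on the node, so many such pods starting at once keep /proc/mounts churning. The names and paths here are made up for illustration:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: subpath-churn          # hypothetical reproducer
spec:
  containers:
  - name: app
    image: busybox
    command: ["sleep", "3600"]
    volumeMounts:
    - name: shared
      mountPath: /data/a
      subPath: a               # each subPath is a separate bind mount
    - name: shared
      mountPath: /data/b
      subPath: b
  volumes:
  - name: shared
    persistentVolumeClaim:
      claimName: efs-claim     # assumed EFS-backed PVC
```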
A: So that's not a SIG Node thing; that's a SIG Storage thing! SIG Storage owns all of this code. So, yeah: node on here and storage on here, which is probably right. We probably shouldn't triage this; we need SIG Storage to look at it. I don't know if you want to add them on GitHub or something, Sergey.
B: Every five minutes, so just all at once, and stop.
A: Why would the daemon set pod still be in the running state? The node's gone. And, while it is shutting down, I imagine that it doesn't get pulled from the endpoints returned by that service until the node is, like, gone gone. I would not be shocked if there's some intermediate state where that sticks around.
A: When you terminate a node, or even if you drain a node, daemon set pods don't get removed, because they're daemon sets; they never go away from that node. So, like, yes, even when you terminate the node, the thing is going to stay running.
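That behavior follows from how DaemonSet pods tolerate taints: the DaemonSet controller automatically adds tolerations like the ones below to every DaemonSet pod, so taint-based eviction never removes them from a cordoned, drained, or shutting-down node. A sketch of the standard auto-added set:

```yaml
tolerations:
- key: node.kubernetes.io/unschedulable   # set by kubectl cordon/drain
  operator: Exists
  effect: NoSchedule
- key: node.kubernetes.io/not-ready
  operator: Exists
  effect: NoExecute
- key: node.kubernetes.io/unreachable
  operator: Exists
  effect: NoExecute
```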
A: Our graceful shutdowns and... oh, I'm sorry, I'm simultaneously writing a comment. Graceful node shutdown, is that alpha or beta? It's beta, right? Oh, it's beta.
B: It's beta, but critical priority is alpha; so the extra feature on top of it is alpha, but...
A: So when you shut down a node, that will happen... because they're apparently using 1.21. I don't know what version that went beta; let me double-check on the website.
B: 1.22 is beta? Okay.
A: Oh, according to the blog post, it says that graceful node shutdown went beta in 1.21, so...
A: Somewhere, something is looking at all of these pods that are part of the daemon set, saying: here are their IP addresses. But the node goes away at some point, because the node gets shut down at some point, and they should get reconciled. But there's, like...
B: One... the issue says that the pod remains in the running state after the node is gone. Yeah.
A: That's normal. Okay, although I am surprised, like...
C: It's on, but they need to configure it, yeah. So...
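On the "need to configure it" point: with the GracefulNodeShutdown feature gate on by default in beta, the grace periods still default to zero, so graceful node shutdown stays inactive until they are set in the kubelet configuration. A sketch with illustrative values:

```yaml
apiVersion: kubelet.config.k8s.io/v1beta1
kind: KubeletConfiguration
shutdownGracePeriod: 30s              # total time pods get on node shutdown
shutdownGracePeriodCriticalPods: 10s  # tail reserved for critical pods
```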
A: This is expected behavior, so I'm just gonna close this bug.
A: I think we should mark this triage/not-reproducible, just because it's so old, and then, if somebody can reproduce it, we'll deal with it and take that label off. But I think, as is, we should close it, because this isn't a supported version.
A: That seems probably like a bug. It would not be SIG Cluster Lifecycle; it would be us, and maybe Apps, but probably just us.
A: But yeah, I think this is just node, so I think we should remove the cluster-lifecycle label, and I don't know who wants to look at this.
A: Okay, well, they definitely need to add that... oh, there's a server version here. No.
A: And they're using Rancher on AWS, so I suspect... So, when Clayton did the pod lifecycle refactor, we found a few different places in the kubelet where init containers were just ignored; they weren't factored in properly in terms of the lifecycle. This may be fixed in 1.22, but you can assign me, or you can cc me, and I'll put a note on this one. I don't think I'll handle the bug, but...