From YouTube: Kubernetes kops office hours 20200410
Description
Recording of the kops office hours meeting held on 20200410
A: Hello everybody, today is Friday, April 10th, 2020. This is kops office hours. I am your moderator/facilitator, Justin Santa Barbara; I work at Google. A reminder: this meeting is being recorded and will be put on the internet, so please be mindful of our code of conduct, which boils down to: be a good person. I have pasted a link to the agenda in the Zoom chat. Please feel free to add your name and any items you'd like to cover to that agenda.
A: Permissions for that, right. And this is actually probably a Kubernetes org-level permission on GitHub, so it probably isn't something that, for example, I could do. We should probably talk to either ContribEx or test-infra. I think we do have a use case for continuing to use Travis, which is: it is the only way that I know of that we can test on macOS (formerly known as OS X, or whatever it's called these days), and so I think we want to keep that going.
A: Yeah, the thing we're trying to check with Travis is: can the kops CLI be built on macOS with effectively go build, like, without a bunch of options? And that's what I think we can only get from Travis, but maybe we can make do with other things. I suggest we keep an eye on this one. If it happens again, can you ping me on the PR? You probably did and I just missed it.
A: I apologize, but if you ping me on the PR... I think we're getting closer on the PRs in terms of catching up, and so I should be able to be a little more responsive, and I can see what I can figure out as well. And we can maybe loop in ContribEx, who would be the people that would approve the change to a GitHub app. And hopefully this is just temporary, with everything being sort of overloaded right now in terms of all things computers and networks. So.
A: Alright, well, those are some good links, thank you; we can have a look and see what we can figure out. It sounds like the apps thing is something that might fix it, but I'm not getting a sense of overwhelming confidence either. You know, let's see. Okay, let's keep going. Mazzy89... who is that? Is Mazzy89 here?
A: The big one is around an "enabled" field, but I'm gonna send a follow-on PR unless someone beats me to it. It's just a bool versus pointer-to-bool problem — or a potential problem — that I would be more comfortable if we didn't have to face. But I think that's one of the last... that was one of the last blockers for 1.18; I have a sort of burn-down list for 1.18 alpha. Okay, so I assume Lee Fela dealt with that. Hakman: arm64 support for worker nodes.
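For anyone following along, here is a minimal sketch of the bool-versus-pointer-to-bool issue being discussed, with hypothetical field names rather than the actual kops API:

```go
package main

import "fmt"

// With a plain bool, an omitted field is indistinguishable from an
// explicit "enabled: false": both decode to the zero value, so
// defaulting logic can't help but clobber user intent.
type SpecPlain struct {
	Enabled bool `json:"enabled,omitempty"`
}

// With *bool, nil means "not set", which lets defaulting logic tell
// "omitted" apart from "explicitly disabled".
type SpecPointer struct {
	Enabled *bool `json:"enabled,omitempty"`
}

// effectiveEnabled applies a default only when the field was omitted.
func effectiveEnabled(s SpecPointer, defaultValue bool) bool {
	if s.Enabled == nil {
		return defaultValue
	}
	return *s.Enabled
}

func main() {
	explicitFalse := false
	fmt.Println(effectiveEnabled(SpecPointer{}, true))                        // true: unset, defaulted on
	fmt.Println(effectiveEnabled(SpecPointer{Enabled: &explicitFalse}, true)) // false: explicitly off
}
```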
A: Alright, yeah, must be a change of... such a change of roles. The primary value I can think of for the legacy etcd provider is that it more closely matches the way we would want to structure a future etcd manager which doesn't have everything baked in. But we can easily get that back from the code, and that is the least of the problems around it. So I would... I'm inclined — I'm always like this — to do things a little more gradually.
A: I think I do agree with you, John. The argument for tying things to Kubernetes versions rather than kops versions — I'll post it on the PR, but the argument is that we don't want to have to support older kops versions. We don't want people to say, "well, I'm gonna use kops 1.14 because it was the last version that had feature X."
E: It proceeds on rolling updates until you pass cluster validation some user-specified number of times, and it also removes the ten-second wait between those successful validations. But I mean, I would rather, you know, investigate and fix the underlying issue. As I see it so far, the underlying issue is that nodes mark that they are ready when they are, in fact, not ready. We have a workaround, which is cluster validation, but cluster validation is marking the cluster as validated when the cluster is not ready.
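As a rough sketch of the "pass validation some user-specified number of times" behavior being described — hypothetical names, not the actual kops rolling-update code:

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// validateCluster is a stand-in for kops cluster validation: it would
// check that nodes and system pods are ready and return an error if not.
func validateCluster() error {
	return nil
}

// waitForConsecutiveValidations returns once validation has succeeded
// `required` times in a row; any single failure resets the streak.
func waitForConsecutiveValidations(required int, interval, timeout time.Duration) error {
	deadline := time.Now().Add(timeout)
	successes := 0
	for time.Now().Before(deadline) {
		if err := validateCluster(); err != nil {
			successes = 0 // flapping readiness starts the count over
			fmt.Printf("cluster not yet valid, will retry: %v\n", err)
		} else if successes++; successes >= required {
			return nil
		}
		time.Sleep(interval)
	}
	return errors.New("timed out waiting for cluster to validate")
}

func main() {
	// e.g. require 3 consecutive passes, checking every 10s, for up to 15m
	if err := waitForConsecutiveValidations(3, 10*time.Second, 15*time.Minute); err != nil {
		fmt.Println(err)
	}
}
```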
B: Yes, but that was different — that was validate cluster, a different command, which we made fail if it didn't happen like that. But in this case, with all due respect, people don't use rolling update for testing; they use it for production. So I don't really want to see it fail so that I can report the bug. I want it to work: to roll the update, you know, wait until it's ready and then move over, and so on.
A: Can we make sure that that feedback is coming... I think Cochrane was saying, like, can we get that type of feedback in our tests? And I think that's a good way. I think it would be nice to surface... I don't know if there's a way we can surface the error — like saying "please report this" — without hurting someone's cluster.
E: It only fails when you go to a new instance group, and that is because I fixed the bug where we currently do a check before we start rolling an instance group, and that particular check doesn't retry. If that check fails, the whole thing stops. And the bug was that it was previously ignoring validation failures — so it would validate, ignore the failures, and then proceed. Okay. So now...
E: But then we would still have the problem of, you know, if it's flapping and returning success incorrectly, we're going to be rolling to the next thing before the cluster is ready. The two failures we've seen: the first one was that the controller manager was going pending, and so I added a check to make sure that every master had a controller-manager pod. Now it's the API server. So I think there's a problem where a node can validate successfully before the API server even knows that its static pods exist.
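For illustration, the shape of the "every master has a controller-manager pod" check being described might look like this with client-go — a simplified sketch, not the actual kops validation code:

```go
package validation

import (
	"context"
	"strings"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// mastersMissingControllerManager lists masters that have no
// kube-controller-manager pod visible in the API server yet. Static
// pods only show up here once the kubelet has mirrored them, which is
// why a node can report Ready before this check passes.
func mastersMissingControllerManager(ctx context.Context, client kubernetes.Interface) ([]string, error) {
	nodes, err := client.CoreV1().Nodes().List(ctx, metav1.ListOptions{
		LabelSelector: "node-role.kubernetes.io/master",
	})
	if err != nil {
		return nil, err
	}

	pods, err := client.CoreV1().Pods("kube-system").List(ctx, metav1.ListOptions{})
	if err != nil {
		return nil, err
	}

	// Index which nodes are running a controller-manager pod.
	hasCM := map[string]bool{}
	for _, pod := range pods.Items {
		if strings.HasPrefix(pod.Name, "kube-controller-manager") {
			hasCM[pod.Spec.NodeName] = true
		}
	}

	var missing []string
	for _, node := range nodes.Items {
		if !hasCM[node.Name] {
			missing = append(missing, node.Name)
		}
	}
	return missing, nil
}
```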
G: So we're hitting that case of the initial validation failing because, right before we start the rolling update on our masters, we're scaling down the cluster autoscaler, which is running in kube-system. And because that pod happens to be going pending — like, it tries to do the validation, sees that it's pending, and fails. And so we put a retry in for that.
A: I think that was a great point about, like, let's make sure our e2e does not paper over it, no matter what. And then what I'm wondering is: can we surface it without... can we do the right thing and still surface it? Can we do the best thing we can for the user and still surface it in a way that we get the report — like some form of message — while still doing the right thing? I don't know. I mean, yeah.
A: Yeah, I mean, Kubernetes has a number of known issues like this. The other one that really bothers me is that network readiness is not always accurately reported, so a node can be considered ready before — for example, if you're using the AWS route mapping, there's no guarantee that the route mapping has been set up, because it's a separate controller. Yeah, well...
A: It'll say it's ready. So yes, this is one of those ones that has descended into, like, upstream finger-pointing and not a lot of progress, which is disappointing. And I guess that is why we have the validation, because ideally we would just be able to look at node status. I think there is slow progress being made — I don't know that it's great — but yeah, we need to figure it out.
A: Yeah... kube-proxy, yeah. But yeah, ideally one day we'll get kube-proxy to a daemonset, which has its own challenges because of, like, skew and architecture, but yeah.
A: No, my phone runs Kubernetes, yeah. Yes, this one was pretty deep. So yes, as the action item, I put this on the list as a 1.18 blocker: to figure out what we want the behavior to be here. I don't know... yeah, it sounds like we should address that, but I don't know that there's much more we can figure out here. That's fair.
A: I think that's reasonable. I think we need to figure out a good balance between not papering over the cracks, giving the right user experience, getting the data, and being a more accurate validate — right, actually, like, checking some of this stuff — and also being mindful of what's coming from upstream in terms of node readiness, which may negate some of the need for this. But that's not happening anytime soon, I expect. We've spent a long time on this one.
B: So, while cleaning up the removed Docker versions — the ones we no longer support — I noticed that there is quite a mess in the older versions, so my proposal was to remove some of them, like the duplicates. We have duplicates — as I wrote there, there's 18.06.1, .2, and .3. People who want to use 18.06 should use the latest available; I don't see any reason to have all of them.
A: Because I guess my concern is: suppose someone is happily using 18.06.1 and has, for whatever reason, tested that 18.06.3 breaks their workloads, and then we come along and say "you can't do that". I mean, I know they shouldn't be doing any of this, but that's what I'm sort of trying to balance.
H: There was a regression for kube-router in 1.16, and I believe we fixed the test coverage here as well, so we can start detecting these. But there's a cherry-pick now to backport this to 1.16, and unless we cut a patch here, it's gonna require anybody using kube-router to manually intervene for an upgrade to succeed.
A: I mean, I think that's great. And so there is an open cherry-pick that we need to approve — sorry, the cherry-pick has been approved; we just need to do the release to actually, like, ship it. "That's correct." Perfect. I think we should definitely do that — sorry, and I think we should do that.
A: Any objections? Alright. Any other things people want in 1.16? I guess... okay, so I'll do that. I think we're actually very close to cutting 1.18 — the next 1.18 alpha, or the first one, I thought, but the next one. And I just didn't see the point of doing it fast, like, you know, two minutes before the end of the meeting, so I will do that, I guess. Absolutely. More on that topic: I have also been catching up on YouTube...
A: ...uploads. I am able to do, I think, five a day, because I am sticking to using the API — because I refuse to click around in a GUI — and what I've written only allows me to upload five a day. But I will get there, at five a day. So I think we're... I think we got to the end of January, so we are two days behind.
G: The first validation check during the rolling update does just a single try right now. I did an update to make it use the same wait function, so the validation does multiple checks. The question is — I guess two questions. One: you know, is that the right approach, is that what we want to do, which is what we were discussing. And two: if we do merge this... the static test failed before because it was the only thing still using the validateCluster function, it looks like. So.
E: Yeah, well, the logging is a little different between the two, and you might... because the one that retries is a little bit noisy, and you don't necessarily want to go all out logging on the first validation. So I think you might want a slight difference between those two modes. The downside of retrying on that one is, you know, papering over the problems.
A: I just wanna say, like, thank you to everyone that merges PRs. It can be... I make mistakes, people make mistakes, and it's always a balance between, like, merging and trying to strike the right balance. If we never made a mistake, we'd be going too slowly. So I think, you know, it's great. So thank you to everyone that merges PRs and gives comments when we do make mistakes.
A: So, as background: this is a PR I put up which updated our Kubernetes version to 1.18 — sorry, the version of our Kubernetes libraries, in particular apimachinery and friends; we thankfully no longer depend on k/k. It updates them to 1.18. The gotcha, which everyone will notice very shortly, is that 1.18 client-go adds a context — it basically changes the signature of all the calls. It adds a context.
A: It also requires options as the final-ish parameter on all methods, whereas previously that was only on maybe update and list — and now there is a CreateOptions, there is a PatchOptions, all these things. So there is a ton of context threading that happens, and I don't think it's a bad thing; it's just icky. Now it's done. It was tedious to do, but it's fine.
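For illustration, the mechanical change this implies at every call site looks roughly like this — a generic client-go example, not a specific kops call:

```go
package upgrade

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

func examples(client kubernetes.Interface, pod *corev1.Pod) error {
	ctx := context.TODO()

	// Before client-go 0.18 (Kubernetes 1.18):
	//   client.CoreV1().Pods("kube-system").Get("foo", metav1.GetOptions{})
	//   client.CoreV1().Pods("kube-system").Create(pod)
	//
	// After 0.18: every call takes a context, and every verb takes an
	// options struct (CreateOptions, UpdateOptions, PatchOptions, ...).
	if _, err := client.CoreV1().Pods("kube-system").Get(ctx, "foo", metav1.GetOptions{}); err != nil {
		return err
	}
	if _, err := client.CoreV1().Pods("kube-system").Create(ctx, pod, metav1.CreateOptions{}); err != nil {
		return err
	}
	return nil
}
```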
A: And then I think Hakman raised a good point, which is: at the time I started — before I threaded through the context — it was 1.18.0, and in the interim they released 1.18.1. So yes, I will probably update to 1.18.1. In general we don't follow it patch by patch unless we actually see something we need, but in this case there's no point going with 1.18.0 when 1.18.1 is right there; it's just a rebase to do. So I will.
A: Okay, any other topics before we go through the release plan? Okay, I'm gonna move "create the cluster API" beneath the other two. It remains on my agenda, but I feel like it's becoming embarrassing at this point, so I'm going to deprioritize it from the list so I can feel better about myself. So, as discussed, we will do a 1.16 release, including — I guess that's 8864, thank you, whoever put that in — and I presume there'll be some other deltas.
A: I mean, I've started tagging things for 1.19, but it's more like "not a 1.18 blocker". Like, I actually started on the reflection thing, but I put the work-in-progress reflection setter in 1.19, and I therefore also tagged "set instance group" for 1.19. And, like, all my work-in-progress went into 1.19, as in: these are not gonna make 1.18, so let's just get them off my screen.
A: Actually, when we do that, we are below 50 PRs, which is something I've never seen — if we exclude work-in-progress and anything already triaged to 1.19. So that's really good; thank you to everyone that drove that. Are there any other things that people... are there any things that people want, as, you know, a last chance to get in?
A
You
I
hoped
you
have
time
I
think
thank
you.
Everyone
actually
has
done
a
wonderful
job
of
clearing
the
most
of
the
backlog
of
PRS.
That
has
been
super
helpful
to
everyone.
That's
been
doing
that
we
have
three
minutes
remaining
in
our
scheduled
time.
I
don't
know
if
there
are
any
final
issues
or
final
topics.
If
you
want
to
discuss.