Kubernetes Cloud Provider Special Interest Group, 31 Aug 2022

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: SIG Cloud Provider 2022-08-31

Description

Agenda: https://docs.google.com/document/d/1OZE-ub-v6B8y-GuaWejL-vU_f9jsjBbrim4LtTfxssw/

[dmoiseev] Introduce well-known tag for exclude subnets within a auto-discovery procedure for ELB backed services
https://github.com/kubernetes/cloud-provider-aws/issues/442

[jyotimahapatra] As a cluster operator I don’t have a mechanism to shift leadership away from an impacted AZ. https://github.com/kubernetes/kubernetes/issues/111899

[bridgetkromhout] https://github.com/kubernetes/kubernetes/pull/108095 - looking for more feedback/replies so we can get this resolved

A

All right go ahead.

B

uh Okay welcome everyone today is August 31st 2022, and this is the Sig cloud provider community meeting. uh We follow the kubernetes Sig Community guidelines, which essentially means uh raise your hand. If you want to talk- and uh you know please tweet out each other as you would expect to be treated or explicitly- please be kind to each other.

B

um I will share my screen to show the agenda here and then we'll go through things. Okay,.

B

All righty, um so, let's see normally we go through uh triage at the beginning of these meetings, um but as we are running in kind of a limited capacity today, uh I think we'll we'll come back to triage at the end. If we have time uh we do have a couple agenda items today, so I guess we'll go through the sub project updates to begin with, and it doesn't look like there are any recorded currently are there any? Would anyone like to make a sub-project update uh that isn't already on the on the agenda.

B

All right, I am not seeing any hands go up, so we will move through to the agenda items then all right. uh The first agenda item is from Dennis Moise who's, a colleague of mine, um and was curious to talk about uh this topic here so Dennis. Why don't you uh take it away.

C

uh Yep, so here you could see the link under GitHub issue, which I created a couple of weeks ago already, and that's about design discussion of possible like labels to exclude uh load balancers from Auto Discovery Logic on AWS. So the issue which we have faced within Red Hat was basically about that new availability zones which jws was introduced, namely wavelength zones that far Edge zones and so on, and these zones have a limited capacity, not not capacity but limited capabilities in terms of which load balancers.

C

We could use so and we faced quite a lot of issues because Cloud controller manager tries to attach uh machines to load balancers which are not intended to like use in these zones. So here I just want to I. Don't know ask about what people thinking about that design, which I propose how I can proceed with that? Further will I need to prepare some enhancement documents yeah. These kind of things so basically want to have some feedback on on the thingy. So.

D

Are we looking for like excluding the entire Zone.

A

D

That's what it's apparent from your discussion, for example, like we don't want to include the local zone or the wavelength Zone.

C

uh Yeah, more or less, uh more or less and.

A

C

So uh that's about uh labeling subnets, so not not entire Zone, but the the subnets which belongs to some zones, yeah, wavelet or farage, or something like that, because we cannot attach it to there's a load balancers which we, which we have the elastic condolences. Basically.

D

uh And is it mixed with a classic zones like the other regular zones like.

E

B

D

Cluster in both the zones- and we want to exclude the local and wavelength or the.

C

D

Is entirely in the local uh and wavelength Zone and not the regular zones so.

C

That's for mixed clusters, specifically. So, for example, we have an existing clusters and we want to add, like sublins and virtual machines within like wavelength Zone, and if we have that kubernetes IO slash cluster cluster ID tag on this entities like subnets and the virtual machines, so the CCM would try to attach this to existing load balancers so which breaks uh Cloud, control and manager. Effectivoice.

C

D

I see this yes uh and uh the load. Balancer controller would also run to the same issue right, not only the CCM, so the both of them use the same strategy here so I.

B

Think we need to.

D

Think through it further uh to see how we can uh solve this use case, so workaround is to like use subnet tag where you want to specify the subnets that you want to attach the load balancer to, but I guess that's not feasible in this case, like specifying subnet for each of the load balancers here.

A

D

Yeah, so let's uh take it further, you can assign to me uh myself and Nick and then I will look into it further. If you have any proposal, feel free to bring bring it out and uh we can work on it. Yeah.

C

So basically I try to describe proposal which feels reasonable to me like right now, probably if we could introduce a special attacks for subnets uh to exclude uh these subnets from Auto Discovery. So that's that would work.

D

But then all the subnets in that particular Zone, used to have this tag right.

C

um No I here I mean that auto Discovery logic, uh which we have within Cloud controller manager, which looks at uh which looks at tags so it'll be taxed on on that, so it would be attached if it marked as like kubernetes dot IO cluster with cluster ID. It would be attached so if we could specify explicitly that we do not want to attach these concrete subnets there's a lot balancer. That would solve at least.

D

And you you want like to tag every individual Subnet in that region, not only like a certain subnet.

C

Not every individual, only certain so which belongs to that wavelet zones, for example,.

D

Okay, got it yeah sure assign it to us, and then we can. If you have any changes, any uh PR feel free to bring it about. We can discuss further on a PR as well. Okay,.

C

Awesome thanks a lot.

B

All right, awesome uh Kishore. What are what are yours and Nick's um GitHub IDs here, just so I don't mess this up.

D

Mine is uh Nick.

D

Let me look up.

A

B

I can remember what his uh his icon looks like, but I don't know I.

D

Just I'm just loading my GitHub here to look it up. Sorry about that yeah.

B

Yeah no worries no worries.

D

It's nck Turner, not an nck, yes, that one.

B

Awesome and I guess should I mark this as triage accepted, or do you guys want to.

D

Put that so this is in the cloud provider AWS like a separate one. uh Do we still go ahead with that.

B

A

Just I'll let.

B

You guys mark it up, then sure all right awesome. Thank you all right, Dennis did you. uh Did you get everything you needed there.

C

Okay, so I guess my next steps is to put up a PR and proceed with that discussion. There.

B

Yeah and I sync up with Nick and uh and Kishore in uh in the issue and whatnot, then yeah.

D

B

We can think.

D

Of offline as well, if you are interested I, will also look through this issue and uh see.

B

D

What approach we can take here? Yeah, okay,.

B

All right awesome. Thank you very much um now. Tell me if I say this right got uh I think you've got the next one yeah.

F

Hi guys uh this is my first time in this meeting so hi my name is uh and uh I work with Nick and Kishore, and all the other AWS guys there, um like cluster operator, manage a lot of clusters, look at issues that arise from them and with Cloud providers. uh One of the given things is uh in AC outage could happen so I wanted to uh I have thoughts about that when these things happen most times. These are Byzantine problems.

F

We can't handle them well and one of the things that happened last time was the CCM controller was able to establish the lilies because hcd connectivity was not broken, but the leader which held the lease was not able to connect to the Internet. So, even though there was a leader for CCM, it was not very useful and CCM makes so many calls and there are heuristics to say how how to kill CCM process.

F

If I see these patterns of failure, because, let's say SPS like IAM, fails or elb calls failed or just DNS fails: Anything Could Happen. My proposal uh here was that we make something that a cluster operator can hit a crd or something of that sort, and the leader election mechanism looks at that and weighs away kind of. But apart from that, the proposal I wanted to just hear out because other platforms are here.

F

How would it how do they think about this problem of zonal outage and uh partition notes where leagues is able to get established, but the thing cannot really work and it's really not for CCM. Certainly, but CCM is where it's a compass component and when I went to API Machinery, they said that you could Implement a live. Z and Lively could fail, that's great, but for CCM I don't know how to implement a good live Z that could fail on like zonal outages, so yeah, that's the context. I can answer more questions.

F

If the explanation lacks clarity.

B

All right, so does anyone have thoughts about that about how we could how we could capture this for these zonal outages.

F

Have has anyone seen these problems while elaborating CCM or their their controllers? Yeah.

B

I I have not seen this problem, I, don't know if others have.

E

Well, just to clarify: do you see this as something that would happen via manual intervention or via like as described here, or do you anticipate there being like endpoints or health checks, or something that would make it possible to automate this I? Don't know yeah.

F

I'm, meaning that as a cloud operator, I know that a outage is going on so instead of having automated checks, look at Byzantine problems and know whether or not I know that something is happening. I could create a crd and the lease leader election controller can look at a CID.

F

The current crd will be like exclude zone or something that's like tag, something like that and, of course the controllers need to know which zone they're operating on and if a controller has a lease and is in the bad Zone, it could say well I release the lease for the next one.

G

But right now you have the possibility to kind of Mark that you don't want the controller on specific PM. You can kind of just move the Manifest from the cubelet yeah.

F

So during that time those times are, we cannot reach those instances either and as an operator, let's say: I have thousand clusters: I have to go to thousand clusters VMS and more the Manifest out.

G

Okay, but in searching with some crd, you would need to go through the Thousand clusters and apply the same crd over 1000 Masters, so where's the gain here.

F

So the the trouble is uh the bad instances are so Network partition partitioned that I cannot go into it to do anything, but I know that uh you have to do SSH or some other mechanism to get into the instance to do some action, but I cannot do that, but I know that hcd leaves connectivity is good, so I'm, leaning on the fact that connectivity to xcd is established, so I can apply that from anywhere through a public endpoint of the cluster.

F

uh We have like two or three hcd, sorry, two or three uh Master instances. I can apply the crd reliably through any master and because the bad Master can still come to HD, it will know well I have to relinquish. uh This is only relevant when HDD connectivity is present. If hcd compute is not present, it's a mood point because it's easily broken things are good. This is only when a master could still talk to hcd but can not connect to anyone else.

F

Yeah, because master and HTT are in the same VPC based on the setup. uh They are not partitioned.

A

F

So I guess uh none of us has seen any of these problems.

B

I mean it's sounding, like you're, sounding like a very edge problem, although maybe I'm confused about something to what you were just saying.

B

If there's a zonal outage, and you have a you- have a control plane, node that has that CD on it and obviously it can communicate with itself, but like would it lose quorum to the other FCD members then like, if you had a crd that was generated in one place, how how could you assure that it got propagated to the rest of the cluster if, like especially if you're at your control plane set out? You know ha across zones or something like that and you had a Zone allowed.

G

We already know that the leader release is hold by the last remaining instance, so it kind of assumes that the LCD connectivity is still there, because if there would be no LCD connectivity, the last leader would lost the uh the the leader on the instant that has that also it would be released and the leader would jump to some other instance. So it kind of assumes that we have this LCD connectivity, yeah.

F

So it's not actually is on the bat. Node actually has three different VM setups uh master has two different VMS setup ha and they're on the same VPC, so the partition has not affected them, but getting into the node working any function out of the node to connect to internet is not working, let's say um so: yeah I I faced probably five or six such customer issues where these things happen over like. If you operate like some thousands of clusters, this is an edge case and even 1.1 percent would be probably in hundreds.

F

um So that's where I I see this problem and um I'm just trying to get like socialize. This idea that do you see it's a a valid problem um uh and have anyone seen this? So that's the intent uh I'm not looking for like answers right away, though.

G

uh And what do you think about the idea of kind of removing this object so and CCM or, and the contract requires on the lasers uh just could be erected.

F

Yeah that that doesn't work yeah, it's probably we can call it like lease dealing, but the default um lease. uh uh The parameters is every two seconds. It tries to take a lease for next 15 seconds so and it's not reliable to say that yeah I could steal the lease for a bit, but the bad note could still take the lease over again. It's not a reliable way.

F

So the lease is active for 15 seconds, so even if a steal it um it's like a surgery, it's not augmented! Well so yeah.

G

Yeah I I know actually I wrote this part of the code with the five seconds in this two seconds and kind of know how to extract. If the we could have a speed brain situation actually.

B

So, back back to your previous question, GLT um you, you might actually be like the world expert in this uh in this topic. If no one else is hitting it or perhaps um it might be worth reaching out on the mailing list, uh also to see if others have experienced this just to cast a wider net yeah.

F

Okay, so in in the short term, the way I'm thinking is I could still Implement a live CJ which I control and I'll I can look at this object. So, instead of leader election object, looking at it and doing its stuff, I can still, in the short term, look at it from the live Z, but I have to implement like 10 lives. These CCM search controller, KCM scheduler, and the list goes on and it's not extensible to any component that that's out there right, so um I could still go.

F

Do it but I'm looking for some generations so that I implemented.

E

Sorry, could you repeat that last sentence, you just broke up a little bit.

B

Are you still with us Jyoti.

E

Oh, maybe we lost him.

B

Yeah, it seems.

A

Like possibly a connection issue.

B

A

B

I see you have your hand up, um do you have a question for Jyoti or just in general,.

A

Yeah I had a question for Josie about the police's um like yeah I, probably should wait till he comes back, but the my understanding is. Each of these components is using the default Cube lease algorithm, and so they need access to a working, API server to be able to renew their lease. If there's a failure in their networking, then they won't be able to talk to the API, and so they won't be able to renew the lease at which point they should give up and normally panic.

A

Is that not what's happening or have I misunderstood? Something here.

G

Well, it's a little different scenario. Just assumes that networking works, but some specific apis doesn't work so, for example, API to create or to check the status of the VMS so actually allowed to No Control wouldn't work, but the networking and the kubernetes API would work.

F

Could you repeat, please.

A

I just had a question about the leader election stuff, but I missed the Nuance that it was the AWS API that was down and not just some like complete zonal failure.

B

And just just to back up a little a little bit, Jyoti um I think before you dropped out. You were talking about the current workaround and, if I understand that clearly you're creating liveliness probes for all these things, yeah um and I think we missed. We missed the end of your statement there. So maybe, if you could just kind of finish your thought, yeah.

F

So I'm going to implement, live, Z checks for many components: uh CCM cert controller uh scheduler, uh many other things um that I am interested in as an operator. What I I think is. It will be good as an operator if I hit a well-defined label or pattern create a crd and the lease object of all leadership. Election based components could benefit from that. uh As a cloud operator. I know that here's the time for the next one hour this zone is bad.

F

So any anybody who is a team who is using the cluster I'm, not in direct contact with them, I just created a cluster for them. They could know that there is crd. They don't have to do that themselves. The leader election controller knows about that and turns it away. Saying I should not be the leader I'm in the bad Zone, and somebody else should take over and relinquishes the controls and never takes it. uh There could be TTL safety that doesn't happen for more than 15 minutes.

F

One hour we can do all of those things but yeah it's extensible to more controls than just me as a cloud provider doing something in isolation.

B

Okay, so that that sounds very actionable to me and that I'm guessing that might be in the discussion on this issue here. But you know the notion of being able to say: okay I'm, going to apply this label that describes there'll, be a zonal outage and so like this component should release its leadership and another one should take it. That is, that am I hearing that correct yeah.

F

Okay kind of that um and this uh third controller, which is not an AWS vendor thing, it could take benefit from that and move away to a different component right um controllers in as your gcp. Anybody writing a controller could use this mechanism to move away. Leadership and zones are a thing in all: Cloud providers.

B

Yeah- and that makes sense to me, like I'm, not I'm, not really sure what the next step here would be at. um Are you looking for kind of more discussion, yeah.

F

I'm, looking for like um people who record brainstorm with people in SRE, Yorks in your organizations, I could uh take contact. I could reach out to you. Next I mean people on this call to see who I could talk to uh people who are sres might be dealing with customers. They might be seeing one of issues like that and I'm.

F

Looking for contacts to to see how how Edge case is this if it's like, because this is not really a edge case- I mean so nuanced that we don't have to take it I'm, all okay with that, um but I do want to like talk to more people, so I could, after this call look at identity list uh reach out to people to see. They could put me in touch with someone who could be interested in talking about.

B

Yeah, so it sounds like right now. This is kind of like information gathering and you're trying to reach out and meet other people. um So unfortunately, you know this meeting, although it's quite large today we have a lot of people. um Usually this meeting is kind of small um I would definitely recommend reaching out on the kubernetes developer list to see if others have run into this um and yeah just start to expand your net that you're looking for people to get in touch with, because you're I mean you're, probably right.

B

There are probably others who have encountered this, it's just a matter of how do we get in touch with them right? How do you find them? Basically, yeah.

F

Okay, I don't want to do it all the time, so yeah, that's my time. Thank you. Thank you very much. All of you, cool.

B

Thank you, Georgie. That was a great topic um all right. uh Next, we've got Bridget uh looks like looking for some feedback here. So why don't you.

E

Take it out yeah this one might be sort of short, um because Nick had actually had a chance to look at this. Put some comments on then we had kind of the slow turn around um and it's kind of a meta question of.

E

Do we have yeah if you scroll down to the bottom, you'll see that you know a colleague of mine wrote back to Nick and then it's been a couple of weeks and like this is when we were trying to get in, and you know it's okay, that it didn't make it in um this time, but I'm sort of wondering, uh of course, when and if Nick has a chance. He can look at it again.

E

Perhaps if anyone else who understands this space wants to take a look, and also perhaps we need to try to get more people as reviewers just so that the burden doesn't all fall on. You know: Nick Etc,.

A

E

Yeah I'm not sure what you think of that, but that's kind of where my thinking is so I guess for the start. The first question is: does anyone else have any insights into route controller and IP changes that they want to weigh in with um and yeah and the larger question I guess.

B

Yeah I mean I I personally, do not have a ton of expertise here. I mean I, understand it works, but I don't have like experience about. You know what you're talking about here does.

A

B

Else in the call have a have a comment about this or I would like to I'll just go back up to the top here, so we can see what the original issue was.

E

Yeah, it's trying to be a bug, fix.

E

Which gets a little complicated because I guess with some providers the IPS can change.

B

Okay yeah, so this is about like the route controller needs to update its node IP if the node reboots or something where it changes um yeah. That sounds tough, so like what the Behavior now is that it does not update the node IP, it just kind of stays, the way it was or whatever yeah.

E

The bug is that uh if the node IP changes routes won't be updated, it'll just kind of sit there being like what.

B

So it sounds like at some level too and I I, don't know the route controller that well, but it sounds like there needs to be some awareness of when the node you know comes back with a different IP address um that.

E

Sounds relatively.

B

Complicated yeah.

E

Or even well, I think that's why there was some some nuanced discussion about it, but yeah I just wanted to kind of surface this one. You don't have to spend Infinity time on it, but especially if people with specific contexts aren't on the call yeah.

B

E

I kind of just wanted to see what see if anyone had thoughts about it and then also maybe uh bring up the topic of I hate to have all of this be on a couple of people's plates. If we can get more people who want to review this kind of PR I'm, not sure exactly what the plan is for moving forward on that. But maybe we need to try to make that happen. Yeah.

B

I definitely agree that this is an area where we probably could use more reviewers, because, even even if it all goes down to Nick and Walter, that's probably that's not enough people for us to scale. This I think the in.

E

My that's not fair to them right.

B

Right totally and in my impression, like uh some of the difficulty here, is just finding people who are working with the CCMS at a low enough level that they're like comfortable reviewing these things, um I'm, not sure how we expand that net. That's that's kind of a bigger question for me.

E

Well, it sounds like Jyoti understands this stuff, pretty well so I'm, just like hey, let's recruit people on this call are the ones who are interested in this stuff right.

B

F

Like good uh I'm, not a reviewer I could take a look help but I'm not sure with this one, but this I.

E

Mean this that asks you to solve something. This is so much like. Yes, exactly what what uh Jay is saying in the uh chat. Please go ahead and unmute and mention your thoughts there, because I feel like this is where we need to give people a chance to start becoming reviewers, so it doesn't get stuck on just a couple. People yeah when Nick and Walter are here.

B

um Yeah because I think oh go ahead. Sorry sorry.

A

Just I was saying we could have a shadow program or something like that. Like.

F

E

Yeah I think that's a great idea, because I think we need to.

E

um We need to make sure- and this is something I've seen in other sigs too, is like making sure that people who want to start um being contributors have that chance to up level and become you know, uh forces to reckon with in this community and especially if you have deep understanding of one specific area, and you want to start applying that to Mars like I'm just kind of looking at the folks who are very informed. Coming to this call and thinking you could be the reviewers.

B

Yeah plus one to that idea, I think you know everyone who is here representing uh you know some cloud or some large provider or something like that. You know: ask your colleagues ask your friends internally, who are also working on these projects. um You know this is a great opportunity for uh junior developers also to get involved.

B

So perhaps you know some people who are looking to get more involved in open source and they work with you on related topics internally, um certainly if you could direct them this way, we would be happy to reach out- and you know, as Jai is kind of suggesting here- maybe a shadow program or maybe something where we pair up and work on some bugs together, like you know, or just reviews together, I shouldn't say bugs um you know that would be really nice uh so yeah.

B

This is kind of a call to action, for everyone is here. If you know someone, let them know we're. Looking for people.

E

Awesome and then yeah. If anyone wants to take a look at that long-standing, but hopefully almost resolved, bug that I have a colleague reporting there be thrilled to have even your feedback of like hey the stuff that you are trying to fix here. In my you know, especially if you have fresh eyes and you look at it and you think I don't know if this does fix it because of this.

E

That would be super helpful because of course, we want to get it fixed if we want to get it fixed right, or at least as right as we can get it.

B

And I'll just mention this because uh Jai and please correct me if I'm saying your name wrongly, uh is it Jai or yai or uh Jay's? Fine, Jay, okay, I.

G

B

Saying it totally wrong, uh so Jay is saying in chat here that the Sig release team has a really good Shadow program, uh so that might be another way another place for us to look at um and maybe get some advice. Thank you. Jay.

B

Okay, any other uh comments on this topic or things that we would like to bring up in general.

B

All right, I am not seeing any hands, go up um just.

D

A quick question for Denise: uh if you are you on kubernetes Slack,.

C

D

uh What's your ID Gmail.

C

Is safe, I guess the same as on this meeting.

D

Okay, uh I was just like trying to follow up the ID that you use on GitHub. So, okay, let me uh look up using this ID.

D

A

B

Right cool um anything else, I think we're gonna skip triage today, just um because I don't necessarily feel comfortable triaging. These things I'd prefer if Nick or Walter were here.

B

Unless someone has a strong desire to to look at them, I guess we could look and see. What's there.

E

There were a couple I think one was a gcp.

E

Oh this is, this has been around for a while hasn't it.

B

Yeah, it looks like it.

E

What are our recent?

E

Oh I, see somebody reopened. It interesting.

A

E

Coop system, okay,.

B

Yeah so I'm I mean I'm, not sure here we should probably wait yeah, let's see what the other one is here.

E

Oh, that's the one we were just talking about. Oh oops,.

B

Well, I think we talked about this one. Oh.

E

Yeah, that's familiar, did we not um does it still have needs triage.

E

B

um We were asking about once they were kept for this and there is a cap. um Okay, so I guess once it's reviewed like does that mean they should be accepted? Then.

E

Yeah I'm just kind of looking going. Is there a reason that it's still listed as needs triage.

B

Yeah all right, maybe we should just accept this since there's a cap, that's open for it. Let's see what their review process on that is happening.

B

Okay I mean it looks like it looks like that's in motion, so.

E

A

B

I'm just going to make sure yeah I.

E

Think it's the hyphen.

B

Hyphen triage accepted.

E

I think I'm going by memory, though I didn't look so.

B

E

Guess, scroll up and.

B

Let me look at one of the other ones that needed triage, because I think it I think it had the command in there somewhere or if anyone knows feel free to shout it out.

E

Yeah, there's a label that gets applied. You can see the the in the list of commands. Sorry, if you click on that uh I understand the commands that are listed here. If you search in there for the word triage you'll get the uh the correct.

B

E

uh No, that's a remove, keep looking.

B

That's the only thing coming up, I know: I, just I just saw it in one of these too, though, like it was said, it needed to be triaged and.

A

Yeah an example of what needs to be done very last comment.

E

Right there, oh.

B

Yeah: okay, cool, oh.

E

It doesn't have a hyphen, okay cool. You were.

B

Right needs, triage needs, science,.

E

Yeah, that's that's where it gets very exciting. So.

F

You're right, don't no.

E

Hyphen, just there you go.

B

Okay, well Care.

A

B

If I did wrong, someone can go remarket. The other way.

B

um All right, okay, so I mean I, guess that's it we'll leave the other one um for when Nick and Walter are around okay, um I guess anything else! I'm gonna stop sharing here anything else or should we uh should we take back some time here.

B

Not seeing any hands go up, so thank you, everybody and I. Guess we'll see you next time.