From YouTube: Kubernetes SIG Scheduling - 2018-02-15
A
B: Steering: Brendan Burns has submitted his proposal and there's been minor feedback. Nothing major, but the intent is that everything in the incubator will eventually have to have a new home, and there will be two or three different types of repositories. The first is the main kubernetes org repositories, and hopefully that will slim down as well, because right now there's a bunch of repos in the main kubernetes org that shouldn't be there.
B: That's still gonna work itself out over time. The last type of repository is what folks are calling associated repositories, which basically allows people to be covered by the CNCF umbrella. So if a company wants to sponsor a project but still wants to house it in neither main org; so, like, if I were, you know, at Heptio, some project that I worked on might possibly be an associated project in the future, right, where we would have the CLA bot there.
B: So that means if you signed the CLA at any moment in time while working on kubernetes, you could contribute to anything that's in the associated umbrella. I think most of our work within this SIG would fall underneath the kubernetes-sigs org, and it just needs to follow the naming convention that's spelled out in Brendan's doc, so it'd be scheduling-whatever, right. Okay, so.
B: It's not merged yet; there's still feedback occurring, but most of the feedback is pretty minor and it's not blocking. We haven't had anyone say that they are absolutely opposed to this. We've been talking about it for weeks internally and we've exposed it for one week to the entire community, so there haven't been any major detractors that we have seen. Okay.
B: Actually, I'm pretty certain that folks would allow us to move and create repos there now; I don't think there's anything preventing it, because the org has been created and I have admin rights to it. I would just want to make sure that I get buy-in before I create any repos, because what we probably should be doing is sponsoring one or two folks and getting feedback on what the mechanism feels like, whether it works for everybody, and whether there are any problems, before we kind of unleash it on the whole community, right.
A
B
C
B
C
B
B
C
C
C: Just another thing I wanted to kind of run by you. So Firmament has two different components: one is Poseidon, the other one is Firmament itself, which is C++ based, and Poseidon is kind of a glue between Kubernetes and Firmament, which is what we wrote. So is that an issue? I guess we will have to make the Bazel thing work with the C++ build process.
B: Most of the build apparatus is containerized builds. So as long as you have a two-stage container build, a build container and a deployment container, which is what pretty much 90% of the kubernetes stuff does, you know, you can build the binaries locally, but the actual build apparatus for testing is all containerized builds.
C
C
B
C
C: So the reason is I would like to add those guys as well; I think, you know, one is already a member, but I'm not really sure about Marty, you know. Yes, yep, okay, okay, good, yeah. By the way, you know Liz is here at UC Berkeley now? He moved here, so he's a postdoc, so we work very closely with them.
C
B
C
A
B
A: All right. Actually, I've gone through one of the papers on the Firmament scheduler as well, but I was wondering if you guys could give a presentation in one of our SIG meetings, basically for everybody who is not familiar with it and to refresh our own memories as well. It might be useful if you could give, I don't know, a 20-30 minute presentation in one of our SIG meetings.
A
B: Great, what do you think, Tim? I think it'd be good. Long term (I know the path that you guys have followed), if we eventually get to the point where we have a full scheduling framework, you know, in my heart of hearts I would ideally love to see the Firmament scheduler be the mainline scheduler. That'd be a long, long term cycle, though; I don't see that happening anytime soon.
C
E: One of the things that I'd definitely be interested in hearing or seeing in a presentation is how the resource tracking fits with Firmament; I guess that would be the Poseidon layer, and how it gets the stats, so we have Heapster. Is that more just, you know, sort of how the Kubernetes resources are mapped to what Firmament uses?
C: I think so. Actually, he was there on the call; he's the one who wrote that code, actually. So yeah, exactly, there's that. Then for Firmament, I don't know if you guys have read the paper, but there's a concept of a task descriptor and a resource descriptor, so your question would be how you map those to the corresponding entities in Kubernetes. Exactly, yeah, definitely.
C: Exactly. So we have a very high level design, and we're starting out with the soft constraints, simple soft constraints, and then the hard constraints, and then we're implementing the exclusion (XOR) rules as well, so that, you know, if I don't want to co-deploy, say I have a set of five replicas, I can make sure they all go to separate nodes.
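As a rough sketch of that hard constraint against the stock Kubernetes API (plain pod anti-affinity rather than anything Poseidon/Firmament-specific; the Deployment name, labels, and image are placeholders): required anti-affinity on the hostname topology key is what keeps the five replicas on separate nodes.

    package main

    import (
        "fmt"

        appsv1 "k8s.io/api/apps/v1"
        corev1 "k8s.io/api/core/v1"
        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    )

    func main() {
        labels := map[string]string{"app": "demo"}
        replicas := int32(5)

        // Hard constraint: no pod carrying the "app: demo" label may share a
        // node (topology key = hostname) with another such pod.
        spread := &corev1.Affinity{
            PodAntiAffinity: &corev1.PodAntiAffinity{
                RequiredDuringSchedulingIgnoredDuringExecution: []corev1.PodAffinityTerm{{
                    LabelSelector: &metav1.LabelSelector{MatchLabels: labels},
                    TopologyKey:   "kubernetes.io/hostname",
                }},
            },
        }

        deploy := appsv1.Deployment{
            ObjectMeta: metav1.ObjectMeta{Name: "demo"},
            Spec: appsv1.DeploymentSpec{
                Replicas: &replicas,
                Selector: &metav1.LabelSelector{MatchLabels: labels},
                Template: corev1.PodTemplateSpec{
                    ObjectMeta: metav1.ObjectMeta{Labels: labels},
                    Spec: corev1.PodSpec{
                        Affinity:   spread,
                        Containers: []corev1.Container{{Name: "app", Image: "nginx"}},
                    },
                },
            },
        }
        fmt.Println(deploy.Name, *deploy.Spec.Replicas)
    }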
C
C
C
D: I also have a similar question, and not only that: what are the gaps compared to how the default scheduler behaves in Kubernetes, and what are the similarities? Also, are there other things that are not in the default scheduler but are provided by this Firmament scheduler? Definitely.
C: We'll send out all the names to you folks, and then Tim will create a repo, I guess. The other thing: the readme file, the structure of the incubation, pretty much stays the same? The readme file which we sent out, is that the right way to do it? Is it pretty much the same as the way incubation was done earlier? Because...
B
B
C: I was talking more about our readme file; you know, you go into our repo, for example, and you see what exactly this is, what the key advantages are, you know, like a readme file. I don't know, not to sell it again, but that's what I was thinking of; actually, including that when you go to the org now would be great.
A: All right, so for the next item on the agenda I want to give an update on priority and preemption. This is an effort that multiple people are working on, and there are different pieces of this work which were already there. There has been some improvement in the performance of priority and preemption, particularly in the area of improving the performance of the new scheduling queue. There has been some work on adding priority for critical system components such as kube-proxy and kube-dns. Yeah, go ahead.
B
A
A: Good point. I mentioned it in the rollout doc, and when I shared it I asked people who are more familiar with kops or kubeadm and so on to comment on this, because I'm not a person familiar with all of the details of these tools. What we have done is that we have updated the YAML files that specify, you know, the base configuration of these components, like kube-proxy, kube-dns, etc.
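As a rough sketch of how a critical add-on ends up with a priority (this uses the now-GA scheduling.k8s.io/v1 Go types rather than the alpha API that existed at the time, and the class name, value, and image are illustrative only): a PriorityClass is created once, the component's pod spec references it by name, and the scheduler may preempt lower-priority pods to place it.

    package main

    import (
        "fmt"

        corev1 "k8s.io/api/core/v1"
        schedulingv1 "k8s.io/api/scheduling/v1"
        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    )

    func main() {
        // A class for cluster add-ons; a higher Value means higher priority.
        critical := schedulingv1.PriorityClass{
            ObjectMeta:  metav1.ObjectMeta{Name: "addon-critical"},
            Value:       1000000,
            Description: "for add-ons such as kube-proxy and kube-dns",
        }

        // The add-on pod opts in by name; admission resolves the name to
        // critical.Value, and the scheduler may preempt lower-priority pods
        // to make room for it.
        dns := corev1.Pod{
            ObjectMeta: metav1.ObjectMeta{Name: "kube-dns", Namespace: "kube-system"},
            Spec: corev1.PodSpec{
                PriorityClassName: critical.Name,
                Containers:        []corev1.Container{{Name: "kubedns", Image: "example/kube-dns"}},
            },
        }
        fmt.Println(critical.Name, dns.Spec.PriorityClassName)
    }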
B: I'm gonna put you as the assignee in the notes to add a link to the PR for the kube-up stuff, because that needs to be socialized more broadly. What will likely happen is people will miss the boat for a period of time and then they will be added on later and have to figure out where it was; but I think if we were to, I can more broadly PSA in sig-cluster-lifecycle that here is the PR that you guys should reference.
B
A
A: So, yes, there has been some of this effort. There has been further work on improving or updating the documents with the new behavior, explaining, for example, that the rescheduler is going to be retired, and some of the new changes in the DaemonSet controller; Klaus is working on the DaemonSet controller side, his PRs are coming and some of them are already merged. The other ones are going to be merged, hopefully all before the code freeze. The only...
A: Luckily, we have found one internal customer at Google who is willing to try this, and we are going to enable the feature in their clusters very soon. If you guys know other people who have larger clusters: we don't want to go with super large clusters, but we want to make this test kind of meaningful by going to clusters which have several tens of nodes; not very large clusters, medium ones.
A
B: It's always really hard, though, to get actual customer feedback, because, as history has taught me, no one in the actual community deploys bits until it's like dot one or dot two. So when 1.10 releases, no one's going to actually take it until like 1.10.1.
A: ...and have confidence that it works. One of the bigger things for us is to ensure that this feature is not gonna break any existing customer if they don't use the feature. Basically, that's one of the bigger things, because, you know, if you try it and it doesn't work well, it's easy to go back by simply not setting priority for your pods; but breaking existing customers who don't use the feature is definitely a big problem, and we don't want to face that.
D
A
D
A: The problem is that if we go to beta in 1.10, the feature will be enabled by default. So everybody, including those who don't want to use this feature, will get it, and that's why I'm saying we must ensure that it doesn't break existing customers, because even those customers who won't use the feature are going to have it enabled in their cluster. That's why we are trying to ensure it is working fine and is not going to break anything, yeah.
B: So here's a question, a broader question that applies to this. A lot of the CNI providers are going to need to have priority and preemption in place, because, as Tina said, they deploy pods right now. Have you guys worked with, or have you communicated with, any of the CNI providers, Weave, Calico?
A
B
A: That's actually good feedback. No, we haven't. While we don't expect anything to basically happen if they don't use the scheduling piece of the DaemonSet changes, we cannot really be sure without communicating with those guys or without testing it in more realistic scenarios.
A
G
C
A: The rescheduler has been used to ensure that our critical pods are scheduled when there are not enough resources in the cluster, or there are no nodes in the cluster that can run those critical pods. With the introduction of priority and preemption, our critical pods will have the highest priority in the cluster, and our main scheduler will take care of scheduling them if the cluster is out of resources, so we thought we don't need to have the rescheduler anymore.
A
C
C
B
G
A: Okay, so yeah, the next item is a bug that we have faced recently, happening a lot more for certain GKE as well as open source Kubernetes customers. We've seen that the scheduler state becomes stale, or basically the scheduler cache has some stale information. This happens particularly in two scenarios, one for nodes and one for pods; we've talked briefly about those before. So we are working on trying to basically chase the problem down to see where it's coming from.
A: Our recent investigations show that it's probably something outside of the scheduler; maybe some events are not sent to the scheduler by either etcd or the API server. Folks here at Google are looking into this issue to find out, but one of the symptoms that we have seen so far is that, for example, the scheduler thinks that a node is full because pods are running on the node, but the pods are actually deleted; they are just not deleted from the scheduler cache, so the scheduler believes that those pods are still running.
A: As a result, it refuses to schedule new pods on those nodes, and what happens in this case is that the autoscaler does not add new nodes to the cluster, because it believes that these pending pods are schedulable on the existing nodes. So there is a disagreement between the autoscaler and the scheduler: the autoscaler is waiting for the scheduler to schedule those pods, and the scheduler never does that, because it believes that those nodes are occupied.
A: So as a result, we see that a lot of these pods remain pending for a long time, and of course this is undesirable behavior. The other scenario is that we keep trying to schedule pods on nodes that don't exist; for this problem we have been able to add a workaround in the scheduler: the scheduler deletes such nodes from its cache if it finds, when it tries to bind pods to them, that they don't exist anymore. So the first part of the problem is the major part.
A
D: Also, obviously I don't know the root cause, but I think around two weeks ago there were some guys who were having, I think, similar issues, and they were discussing it on the Slack channel. I think in the end what those guys did is they increased the QPS settings in their cluster; when they had a lower QPS setting at that time they had that issue, but I think when they increased it, the issue...
D: ...the issue was solved. So yeah, it seems like those guys hit the same problem, though I may not be the right person to say; it seems like they had the same problem and, as I said, when they increased that, the problem went away. I was working with them as well, doing whatever I could, but in the end they told me they did that and I think the issue was actually resolved.
A
A
A
B
A
A
B
D
A
D
G: So the first one is related to the balanced resource allocation priority: as of now we are using only CPU and memory. In our environment, what we have noticed is that there are chances that nodes exhaust their PVC limits, basically the number of volumes that can be mounted on a node, while still having enough CPU and memory, or it can happen the other way around. As of now those limits are hard-coded, to something like 39 in the case of AWS and 16 in the case of GCE, etc. So the question is, should we...
A
G
A: So basically, given the number of attached volumes and the number of volumes requested by PVCs, we could try to balance the number of volumes among the nodes; we can have a priority function that tries to balance the number of volumes. Yeah, so, sure, I don't see any problem with having this as another priority function.
D: Then one more thing. I think in one of our previous meetings when we discussed this (I think Brian Grant was also in that meeting), we discussed that we would like to have some sort of generic function, so that in the future, if we have to do the same thing for GPUs, or maybe some other resource, we can add them as well. Okay.
A: Right, that's a good point. For PVs in particular I think it's gonna be a little harder, but we may be able to do that. This is slightly different, because we need to basically look at the number of PVs already attached, and we also need to look at the number of PVs requested by the PVCs, and then combine those two, which is slightly different from the other models where you have, for example, a certain amount of CPU available on a node.
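A hypothetical sketch of what such a priority function could look like (not an existing scheduler function; the signature, the 0-10 score range, and the example numbers are assumptions): it combines the volumes already attached to a node with the volumes the pod's PVCs would add, and favors nodes that keep more free attach slots under the per-cloud limit.

    package main

    import "fmt"

    // volumeSpreadScore returns 0..10; a higher score means the node keeps
    // more free volume-attach slots after placing the pod.
    func volumeSpreadScore(attachedVolumes, podRequestedVolumes, nodeVolumeLimit int) int {
        used := attachedVolumes + podRequestedVolumes
        if used >= nodeVolumeLimit {
            return 0 // the node would hit (or exceed) its attach limit
        }
        return (nodeVolumeLimit - used) * 10 / nodeVolumeLimit
    }

    func main() {
        // e.g. a node with 30 EBS volumes attached and a pod asking for 2 more,
        // against the hard-coded AWS limit of 39 mentioned above.
        fmt.Println(volumeSpreadScore(30, 2, 39)) // prints 1
    }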
A
A
G
B
B: Updates, I think that's got to be an extender. (Ravi, you're really loud.) But if it's part of the main node updates, if something basic like load average is passed through, I'm totally okay with that in the main scheduler. But if you wanted to add any Heapster-based information, you know, there's much more you could apply than the load average.
B
A
D
D: But I think another concern, which I think we had before, is that we also need to check whether the metrics being reported are instant metrics or are based on some period of time, because, as far as I remember, the concern we had was that if they are not based on some time period, then it might require some changes on the metrics side.
D
B: The idea, almost, if you wanted it to be smart about this type of thing, is that you could do automatic updating on insertion, versus putting it into the scheduler, right. So if you had some type of history, you could have your own separate component that says, you know, if you have a history of the actual usage data that you're tracking over time, you can...
B
You
can
augment
the
initial
incoming
inbound
requests
with
something
that
looks
approximate
to
reality
and
that
type
of
augmenting
on
inbound
is
probably
better
than
building
into
the
mainline
scheduler,
because
it
could
be
a
totally
separate
system,
because,
if
you
put
stuff
into
the
mainline
scheduler,
it
will
only
be
as
good
as
the
data
that
you
are
feeding
into
it
and
that's
subject
to
change
over
time.
So
I
think
what
you
really
want
is
finding
a
good
fit
right
and
most
of
the
time
people
are
super
bad
at
sizing.
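An illustrative sketch of that augment-on-inbound idea (not an existing component; the helper names, the history source, and the example numbers are assumptions): a separate admission-time step rewrites a pod's resource requests from observed usage history before the scheduler ever sees the pod.

    package main

    import (
        "fmt"

        corev1 "k8s.io/api/core/v1"
        "k8s.io/apimachinery/pkg/api/resource"
    )

    // usageHistory maps container name to observed (e.g. p95) usage pulled
    // from a metrics store; where that data comes from is outside the sketch.
    type usageHistory map[string]corev1.ResourceList

    // rightSizeRequests overwrites a container's requests with historical usage
    // when history exists; containers without history keep what the user asked for.
    func rightSizeRequests(pod *corev1.Pod, hist usageHistory) {
        for i := range pod.Spec.Containers {
            c := &pod.Spec.Containers[i]
            observed, ok := hist[c.Name]
            if !ok {
                continue
            }
            if c.Resources.Requests == nil {
                c.Resources.Requests = corev1.ResourceList{}
            }
            for name, qty := range observed {
                c.Resources.Requests[name] = qty
            }
        }
    }

    func main() {
        hist := usageHistory{"app": corev1.ResourceList{
            corev1.ResourceCPU:    resource.MustParse("150m"),
            corev1.ResourceMemory: resource.MustParse("200Mi"),
        }}
        pod := &corev1.Pod{Spec: corev1.PodSpec{Containers: []corev1.Container{{Name: "app"}}}}
        rightSizeRequests(pod, hist)
        fmt.Println(pod.Spec.Containers[0].Resources.Requests)
    }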
B
A
A: Right, all right. So I think whatever path we go forward with, we should probably build an extender first, maybe, and then maybe bring it back into the scheduler someday if we feel it is reliable enough, I would say, or we could completely feature-gate it. It looks like Tim is not on board with putting it in the scheduler, right? Okay.
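A minimal sketch of the extender-first option: a standalone HTTP service the scheduler calls out to for prioritization, here scoring nodes inversely to a load average. The JSON shapes are simplified stand-ins for the extender's ExtenderArgs / HostPriorityList wire format, the metrics lookup is fake, and the real scheduler would be pointed at this endpoint through the extenders section of its policy configuration.

    package main

    import (
        "encoding/json"
        "net/http"
    )

    // Simplified stand-ins for the extender request/response payloads.
    type extenderArgs struct {
        Pod       map[string]interface{} `json:"pod"`
        NodeNames []string               `json:"nodenames"`
    }

    type hostPriority struct {
        Host  string `json:"host"`
        Score int    `json:"score"`
    }

    // fakeLoadAverage stands in for a real metrics lookup (Heapster at the time).
    func fakeLoadAverage(node string) float64 { return 0.5 }

    // prioritize scores each candidate node from 0 to 10, inversely to its load.
    func prioritize(w http.ResponseWriter, r *http.Request) {
        var args extenderArgs
        if err := json.NewDecoder(r.Body).Decode(&args); err != nil {
            http.Error(w, err.Error(), http.StatusBadRequest)
            return
        }
        scores := make([]hostPriority, 0, len(args.NodeNames))
        for _, n := range args.NodeNames {
            scores = append(scores, hostPriority{Host: n, Score: int((1.0 - fakeLoadAverage(n)) * 10)})
        }
        json.NewEncoder(w).Encode(scores)
    }

    func main() {
        http.HandleFunc("/prioritize", prioritize)
        http.ListenAndServe(":8888", nil)
    }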
G
D: I have one minor thing to ask. For the scheduler, for the SIG group, we have two kinds of lists, I think, the reviewers and the approvers list, right? So I was wondering: actually I'm not part of any of those lists, and even he is not part of any of those lists. Obviously, if we could be added to those lists, we would definitely like to be more responsive as well. I mean, only if you are really sure about that.
D
A
A
A
I: Basically, my question was about trying to understand the behavior when nodes go unresponsive. I mean, we are on an old Kubernetes release and we hadn't turned on this feature, but last night we turned it on. We probably don't understand it fully; I didn't read the documentation enough, but from what I have read it's not well documented, so I was wondering.
I
A
A: My knowledge in this area is not very strong, but I believe it's not part of the scheduler, for sure; the scheduler is not what removes pods from those unresponsive nodes. But of course, if the pods are removed, probably by the node controller, and are added back, then they come back to the scheduling queue and the scheduler will reschedule them on existing nodes which are available. That part I know; but the first part, which is who removes those pods from the nodes, I don't know for sure, but I believe it's...
D
I
D: I think, first of all, if the pod has been rescheduled, the pod is definitely not going to go back. If those containers are still running, I think they either need to be garbage collected or, I don't know, I'm not sure exactly, but yeah, I mean, the pod is definitely not going to go back to that node.
D
B: The kubelet will evict it; the kubelet will take it off the node, because the bound pod, that is, the location of the bound pod, which is a location inside of etcd that the kubelet is watching, will no longer be hard-bound to that kubelet. So as soon as it comes back online and talks to the API server, the kubelet will remove it. I see.