From YouTube: Kubernetes SIG Apps 20220613
A
Hello, good morning, or potentially good afternoon or evening depending on where you are in the world, everyone. This is the July, sorry, June 13th meeting of SIG Apps. I'm going to be hosting; I'm Kenneth Owens, and Maciej will be co-hosting. I think he might have to drop out a little bit early, and I think Janet couldn't make it this time.
A
Starting off: no real announcements, but we have some discussion points.
C
Can you hear me now? Yes, there's a reason for it: when I mute myself, I have to switch devices back and forth. Well, whatever. So, Ryan put this into the agenda last time. Philip looked into the test itself over the past couple of days, and I've already approved it, although Philip pointed out that, first of all, we can't verify the serial testing, even manually, before submission, and there appears to be a problem with this particular test in that it is actually failing.
C
Philip tested that on his own local setup, so the test is approved. Philip is also working on ensuring that we can actually trigger the serial tests in a presubmit, because currently that is not possible. Assuming that gets solved and the test passes, it can then be merged. So this should be handled; if I remember correctly, that's one of the last pieces for conformance.
B
Okay, thank you. And yeah, let's go ahead; I want to share.
E
I'm here to discuss this KEP for StatefulSet slices. I posted a link to the enhancements repo in the chat.
E
The motivation behind this project is a long-term idea of migrating a StatefulSet across clusters. We've discovered some use cases where certain sets of users may want to move StatefulSets across a cluster boundary, maybe due to scalability limits, or tenant or application isolation, say, if you need to move out of a shared cluster to an isolated cluster, or when encountering features that are end of life in a particular cluster and are only available on newly created clusters.
E
So the goal here is to be able to move a StatefulSet, while it's still hosting an application, piece by piece from a cluster A to a cluster B. A lot of existing solutions out there allow a backup of a StatefulSet or a stateful application to be created, the underlying storage to be snapshotted, and then rehydrated in a new cluster. Unfortunately, this requires scheduled maintenance and downtime for the application.
E
Next slide, please. In order to do this, there are several building blocks that are required. Moving an application over can be considered a process of, say, scaling down a StatefulSet in cluster A and scaling it up in cluster B, moving pods, that is, replicas of the application, over.
E
In order to do this, though, you need some networking configuration so that an application in cluster A can talk to the logical application in cluster B. You need some storage orchestration for the underlying disks to be moved, or just snapshotted if we're moving across zones, for example. And then the final piece is the actual replicas of the application being moved. So take a three-replica database.
E
That's what we're talking about here: moving over those specific instances, so maybe moving secondaries over, performing a failover, and then bringing up a primary in cluster B. This KEP is focusing just on that third problem. I know there's a lot to be solved here for networking and storage, but it focuses on creating a building block that enables this third problem to be solved. So, next slide.
E
The core problem here, which can't be solved today with StatefulSets running in a single cluster, is that there's kind of a split-brain control problem: if you have two clusters, they have separate control planes, and there's no way to really have a logical grouping across the clusters. So in order to run a logical app with worker nodes in both clusters, you need a way to have a global view of the resources across the clusters.
E
The approach here is to split up the StatefulSet, or split up the logical application, so that distinct responsibilities for slices of the app live in different clusters. In the schematic below, there's a StatefulSet controller running in each cluster, cluster A and cluster B on the left and right sides, and pieces of the app running in both clusters.
E
So we could imagine a three-replica StatefulSet with pods zero and one in cluster A and pod two in cluster B, and being able to move pod by pod across the clusters. You can imagine that with more replicas you could do this faster or slower, but the idea is having a mechanism to orchestrate pod movement while maintaining application availability.
E
The proposed change in the KEP is to be able to slice a StatefulSet. Currently, only the spec.replicas field controls the number of replicas in a StatefulSet. The proposed change here is to allow the user to control both the start ordinal and the end ordinal. The end ordinal control effectively already exists via the replicas field; the proposal adds a new field, a replica start ordinal, which would allow both a start and an end ordinal to be defined.
E
You could do this by using the replica start ordinal field: spinning up a new StatefulSet in cluster B with replicas set to one and a replica start ordinal of two, and then scaling down replicas in cluster A, so that you have the same number of application replicas across the global view of both clusters, but you're able to split up the StatefulSet using those fields. So I'm bringing this problem to SIG Apps today to ask for some discussion, feedback, and insight into this idea.
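A rough sketch of the split just described, for readers following along. The replicaStartOrdinal field shown here is hypothetical; the exact field name and shape are precisely what this KEP proposes and were not settled at this meeting, and the resource names and image are placeholders.

    # Cluster B: take over ordinal 2 of a logically three-replica StatefulSet.
    apiVersion: apps/v1
    kind: StatefulSet
    metadata:
      name: db
    spec:
      serviceName: db
      replicas: 1               # one pod in this cluster
      replicaStartOrdinal: 2    # hypothetical field from the discussion; the pod would be db-2
      selector:
        matchLabels:
          app: db
      template:
        metadata:
          labels:
            app: db
        spec:
          containers:
          - name: db
            image: example.com/db:1.0   # placeholder image
    # Cluster A: the existing StatefulSet is then scaled down so the global
    # replica count stays at three, for example:
    #   kubectl --context cluster-a scale statefulset db --replicas=2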
E
I'm presenting this as a core problem rather than as the definitive solution we want to take to solve it. I just want to get some feedback on the feasibility of this idea: whether people have major concerns, or can think of alternative ways to solve this problem without needing to add this particular API. So I'm going to stop there and open the floor for discussion.
A
So I guess the thing here is that it would help if you could give a more specific example of what you're trying to do in terms of application availability under these constraints. I'd have to imagine that the client application for this lives outside of either cluster, right, because you're tearing one down? Or does it? I don't understand what the networking would look like.
E
Yeah, that makes sense. So let's take a scenario of, say, a three-replica database that we're moving over, and for ease of example, let's say it's primary/secondary, so one primary and two secondaries. The orchestration of the storage would be an external component; that's still something we're working on prototyping and thinking about, but it would require orchestration by some external system.
E
That's unrelated to the specific StatefulSet KEP we're discussing, but one example could be: say you initiate this process from a StatefulSet and you determine which volumes are actually consumed by it, maybe via an application selector that the user specifies. There'd potentially be some agent migrating these references, recreating the PersistentVolumes and PersistentVolumeClaims in the new cluster so that they point to the same underlying storage.
E
So you effectively have copies of the PersistentVolume and PersistentVolumeClaim in cluster B, the one you're migrating to, and they really reference the same storage. Once the application is torn down in cluster A, it can consume the same storage reference in cluster B.
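A minimal sketch of that "same underlying storage" idea using ordinary static provisioning. The CSI driver, volume handle, and object names below are placeholders, and the agent that would actually create these objects is the external component the speaker says is still being prototyped.

    # Cluster B: pre-created PV pointing at the disk already used in cluster A.
    apiVersion: v1
    kind: PersistentVolume
    metadata:
      name: data-db-2
    spec:
      capacity:
        storage: 100Gi
      accessModes: ["ReadWriteOnce"]
      persistentVolumeReclaimPolicy: Retain
      csi:
        driver: pd.csi.storage.gke.io                  # placeholder CSI driver
        volumeHandle: projects/x/zones/y/disks/db-2    # same disk as in cluster A
    ---
    # Matching claim, named as the StatefulSet's volumeClaimTemplate expects.
    apiVersion: v1
    kind: PersistentVolumeClaim
    metadata:
      name: data-db-2
      namespace: default
    spec:
      accessModes: ["ReadWriteOnce"]
      storageClassName: ""        # avoid dynamic provisioning; bind statically
      volumeName: data-db-2       # bind to the PV above
      resources:
        requests:
          storage: 100Gi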
A
So you have to detach it from one, I guess. I think the thing that I struggle with is that, without the motivating example of the external orchestrator, it's unclear what the best thing for a StatefulSet to do is in order to help, and without some details about the system that would actually migrate the PersistentVolumeClaims in a consistent way, it's hard.
A
It's hard to reason about the correct behavior of StatefulSet in this context, because you're talking about probably several orchestrators that would be required for this to work effectively, leaving networking out of it, which I don't fully understand. If you're looking at replicating across geographic regions or geopolitical boundaries, instead of just clusters in the same data center, there might be entirely different constraints there.
E
I guess the one thing, for networking, is that we're considering using multi-cluster Services. I know that KEP was introduced in 2020 and has gained some traction, so being able to use that for networking across clusters would let it be set up with the application before migration is initiated. But yeah, I agree, I think the major unknown here is really the storage orchestration; that's something that hasn't really been proposed or brought to the community yet.
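For reference, the multi-cluster Services API mentioned here is driven by exporting a Service from each cluster. A minimal sketch, assuming the StatefulSet's headless Service is named db in the default namespace (placeholder names), using the API group and version from the multi-cluster Services KEP at the time:

    # Created alongside the headless Service in each participating cluster, so
    # pods in cluster A can resolve peers in cluster B (and vice versa) via the
    # clusterset DNS name, e.g. db.default.svc.clusterset.local.
    apiVersion: multicluster.x-k8s.io/v1alpha1
    kind: ServiceExport
    metadata:
      name: db          # must match the Service name
      namespace: default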
A
On the plus side, I do think the implementation would be really straightforward, because all you're saying is: give me an extra field that determines the starting number of the ordinals for enumeration. Then, instead of my replicas starting at zero, they would start at two, and if I scale up to three, for instance, as on your second screen here, it would be two, three, and four, right? That's basically what your ask is.
A
Yeah, so the KEP has that benefit; it does seem fairly straightforward. The downside, from my perspective, is that the motivation is a little bit weak without understanding what the external orchestrator it would actually be interacting with is. The typical thing we would say is: you could always do this work out of tree. You can always fork StatefulSet.
A
You can always operate with your own third-party controller, using custom resource definitions, and just have a blast; there's nothing stopping you from doing that today. If you want to put it as a feature in core Kubernetes, inside a v1 API, it would be helpful, in terms of facilitating that, to have some strong motivation in terms of the systems that would actually benefit from making the modification, shepherding it in, and then maintaining it over time.
E
I guess, to reiterate what I'm hearing: this depends on a larger story, and that vision needs to be described in more detail to be able to motivate this addition.
A
I mean, that's my two cents, but my two cents is not exhaustive of the entire Kubernetes community, so I think it's worth offering it. Again, I'm just giving you my feedback: looking at it, what I see is the strength of it and the weakness of it, which is the feedback you asked for. The feedback that comes to my mind is that it's really hard for me to wrap my head around.
A
I get what you're trying to say, but making a change to a v1 API that we're going to support pretty much forever, because once it goes GA that's it, while not having any client systems that would actually be able to leverage it to provide value to the end user, is a hard thing to motivate. That's kind of the way I see it. There may be other people; if you offer the KEP, there may be a bunch.
A
Maybe there are pre-existing systems, from VMware and a lot of the other storage providers, that already attempt to migrate storage across clusters, so maybe this integrates well. And maybe the thing you can do, just by offering the KEP to the community, is gather more motivation, with other people saying, yeah, this would really help, you know.
A
That's a starting point that would maybe motivate it. As it is, if you were saying: hey, I've got this system, and I think there are some other systems in the community that would benefit from this feature, and it would allow us to more reasonably lower the disruption of doing cluster upgrades, because a lot of Kubernetes providers effectively encourage blue-green, or red-black, as how you should maintain your clusters. They don't say in-place upgrade is the happy path.
A
They'll tell you: roll out a new cluster, migrate your workloads to it, and then turn the old one down. And I totally get that, so trying to do things in that space to facilitate that, I think is worth doing. But it's just hard for me to look at this and say: okay, we're going to commit this, take this KEP, commit this patch, carry this patch, and I don't have a user for it, right?
F
Got it, okay. Excuse me, could I just drill down a bit on your earlier comment about doing this via a custom controller and CRDs? The context we're coming from is that we could envision how the PVs and PVCs could be managed with a custom controller, but the problem is there isn't anything you can do with a StatefulSet: the StatefulSet controller, as it exists today, is going to try to create a certain set of replicas, and there isn't any sort of flexibility or way to extend or deal with that.
F
So the only way we could see doing this without changing StatefulSet is basically forking the StatefulSet controller. Anybody who wants to use this, instead of using a StatefulSet for their workload, would have to use our special StatefulSet version instead, and you kind of end up with a complete fork. So part of the idea of this KEP is: how can we make a minimal change to StatefulSet to allow these extended usages?
F
While still keeping as much as possible in external controllers, so we don't have to boil the ocean and solve all the problems in one KEP. Does that make sense?
F
So we're actually in the process of doing a proof of concept of this. I think the plan we currently have is actually to use a forked StatefulSet controller, considering that we want to do our proof of concept in the next few months; that's kind of the option. And in parallel we are proposing this KEP, which is our current plan.
F
Because we're still in the proof-of-concept phase, I feel like there's still quite a bit of scope for changing how we do this, which is the motivation for starting the discussion on this KEP. But yeah, in the next couple of months we plan to have a POC of the volume controller out, which I think will maybe help clarify this use case and could support a more concrete discussion.
F
That is the ultimate plan, yes, but because we don't have to change anything in core to do it, at least as we currently see it, we can do it purely as an out-of-tree controller. So we haven't started the upstream process yet; we kind of need to get an implementation and actually test things before we start making wild speculations, you know.
A
Yeah, so again, my gut reaction is that the touch is light: you're asking to change the beginning of the ordinal numbers. It doesn't seem like there's a whole lot that would break for current customers; I can think, pretty trivially, of how to do it in a way that's backward compatible, so releasing it is fine. The main thing is that once it's out there, it's out there forever.
A
If you had three or four different use cases, where you could say these are the things it supports, then that would be a strong motivation. You don't have three or four, though; you've got one. It's a good use case, but if you're already saying, look, I've got to fork it in order to make this work for my POC, and then I'm going to do the volume controllers and release them.
A
Why not run with that? Then you can keep the KEP open, get more community feedback, and try to build momentum around what you're trying to do here. From there, when you have something that's a little bit more concrete we can look at, with some real evidence and signal to assess that this is the right thing, we can look at bringing it in-tree, yeah.
F
Yeah, absolutely, yeah. And for sure, part of the reason why we're putting this out here is that we had a few other ideas of how to do this; just as an example:
F
Instead of just having a start ordinal, one could do something where you have an arbitrary subset of replicas that the StatefulSet controller is responsible for. That would give you the capability of doing something more like a cross-cluster node cordon-and-drain, where an arbitrary subset of the replicas of the StatefulSet could be migrated, which would allow you to do a cross-cluster migration in a way that's exactly analogous to a node cordon and drain. That seems sort of over-general to us, and a bit of a higher-touch thing, but yeah.
F
This is something where we certainly wanted to start the conversation, in case anyone in the community had more experience or intuition as to what the right way to do this would be.
E
And to Matt's point, we have encountered some forked implementations of StatefulSet that do offer this type of API, mostly just for testing; TiDB, for example, has a forked StatefulSet that does this. So it seems like there is some use case, at least for testing purposes, that may make this particular API useful, yeah.
F
In general, though, I think your comments are well taken. It is kind of hard to reason about this until you have a more concrete view of what the actual end goal is. So, one...
A
It wouldn't be too bad; you're never really going to be in that case. But would you expect orphaning behavior, or would you expect deletion behavior? What happens if you set this field when it hasn't been set previously? If that's not in there, you might want to think about adding what the desired behavior is, because it could get a little bit hairy. But I don't think the corner cases are particularly nasty, yeah.
E
There's some discussion about what the defaults are and what setting this replica start ordinal would do on an existing StatefulSet. I agree there are definitely some edge cases around rolling back to a StatefulSet controller that does not support this field.
E
It can lead to something unexpected if the user isn't aware of that behavior. Effectively, if you had a replica start ordinal that was quite high, say three, it would try to delete pods that are greater than a certain ordinal and then recreate the ones from zero to two.
A
The fiddlier bit is actually with the PVCs and the storage creation, the storage provisioning, because, especially with a stateful application, your expectation is that you're going to get very particular volumes associated with your StatefulSet. You're going to have data on them, but with default settings you're not going to lose that data; it would still be there, and you should probably be able to retrieve it by scaling the StatefulSet. So that part isn't too bad, but associating the exact volume with the exact pod that has the identity you want is the tricky part.
A
It is going to be fiddly if you roll back. If you created a StatefulSet that says start at three, give me three replicas, so you have three, four, five, six, and then you go back to an old cluster, you have three replicas and it doesn't say start at three, so you have zero, one, two, three. Those are going to be brand new volumes with no data, so whatever the application does on initial provisioning is what you get there.
A
You should be able to reclaim your data by scaling it up to, say, six, if that's possible; but in any case you would at least be able to break glass, get your PVCs back, and get your volumes out. The default behavior isn't going to be data loss, which is the kind of thing that tends to leave people inconsolable and is generally unacceptable, so you wouldn't get that behavior.
A
So that's why I'm saying this doesn't seem super risky; I'm not looking at it like it's crazy. I'm just looking at it as: it would be helpful to have a strong motivation for it. That's the feedback I have, but I'll open it up; does anyone else have anything to add, or...
A
Hey guys, if you don't mind, I'd like to move on to the next item in the interest of time, so we have a chance to talk about retriable and non-retriable pod failures for Jobs, which basically changes the semantics of Job retries. I think Aldo, are you sponsoring this one?
D
Yes. Can you present the README for me, please, so we don't run into the same bug again?
D
So, in the meantime, I can introduce the KEP. As you might know, the Job API currently has the backoff limit to control how many retries the pods get before declaring a failure. However, this is not very configurable, in two senses.
D
Say a user knows that a particular exit code means a pod failure should be considered not recoverable; there is no way to immediately fail the entire job. And, on the other hand, there is no way to distinguish when a pod failure is due to infrastructure errors versus the user's application. Infrastructure errors would be things like the node goes down, or the node is preempted, or kube-scheduler had to preempt the pod, things like that.
D
You have to click "load diff"; it'll do it. Okay, here we go. Yes, that is very small. Can you go into the user stories, please?
D
The user stories, yes. Okay, so that's the context. We are proposing this API that precisely allows you to single out certain failures, by exit code or by the failure reason, and to take one of two decisions, either to completely terminate the job, for example... can you stop there? For example, here we have the rule down there in the YAML: terminate when the exit code is not between 40 and 50.
D
This is more useful for infrastructure providers, because if somebody is providing a job system for researchers, the administrators don't know what exit codes the applications might have. But what we do know is that kubelet and other controllers insert a certain reason into the pod status to signify, or explain, why the pod was terminated. So here we're adding this other rule for, for example, the case of preemption or node-pressure eviction.
D
We can ignore such a failure, not counting it against the backoff limit, so that the user gets more retries. That's the heart of the proposal.
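A rough sketch of the two rules just described. The shape shown here (action, onExitCodes, onPodConditions) follows the pod failure policy this proposal eventually added to the Job API; the draft discussed in this meeting matched pod status reasons directly and used an exit-code range, so the details below are illustrative rather than the exact API under review, and the names and image are placeholders.

    apiVersion: batch/v1
    kind: Job
    metadata:
      name: example-job
    spec:
      backoffLimit: 6
      podFailurePolicy:
        rules:
        # First rule discussed: fail the whole Job immediately when the main
        # container exits with a code outside the expected range.
        - action: FailJob
          onExitCodes:
            containerName: main
            operator: NotIn
            values: [40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50]   # codes treated as retriable in the spoken example
        # Second rule discussed: terminations caused by the infrastructure
        # (preemption, node-pressure eviction) do not count against
        # backoffLimit, so the user keeps their retries.
        - action: Ignore
          onPodConditions:
          - type: DisruptionTarget
      template:
        spec:
          restartPolicy: Never
          containers:
          - name: main
            image: example.com/batch:1.0   # placeholder image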
D
I want to highlight a few risks. One of the risks is precisely these status reasons, which are not currently fully documented in Kubernetes. We want to do a survey of all the reasons we are currently introducing, document them on the website, and also move all the constants to the core v1 package, so they are more discoverable and also subject to API review.
D
The other risk is that, currently, the garbage collector removes pods, so there could be scenarios where we lose the status before we can take a decision. But, as you might remember, we already have an ongoing feature for tracking job failures using finalizers, so once that's completed, this will no longer be an issue.
D
Additionally, there are certain components in Kubernetes that currently delete pods and don't explain why. One of them is kube-scheduler: when it preempts a pod, it just deletes the pod and doesn't say why. So we also want to survey all these usages of the pod delete API and add a reason for them; we would be adding an option to the delete API so we can include a reason, which can then be added to the pod status.
D
That's pretty much the entire proposal, so I'm hoping to get some feedback.
D
Luckily, I just learned a few minutes ago that the enhancements freeze has been pushed out one week, which is great, so hopefully we get to address all the risks, you know, so that our risk mitigations are enough for the reviewers. But yes, I'll open it up for questions now.
D
No, it would work. Ultimately this is only about the Job failing, and CronJob has its own rules for handling job failures. This is just about the Job, but yes, it can be used from a CronJob.
C
I started thinking now, as I was listening to you explaining it, whether we should split the reasons into a separate KEP, because that's kind of one part of the proposal, and the other one is just the Job-related changes, and then make one depend on the other. But that's probably something else. My main question, which I already left in the KEP as well: is there any particular reason why you went with a slightly complicated API approach towards defining the matching?
C
That is, how you match which exit codes and reasons and so forth, rather than just reusing something similar to what we have with label selectors? That would probably give a little bit more flexibility for future growth, rather than defining these fields one by one.
D
Yes, I'm going to answer the second question first. The label selector, as it is, is really designed for maps: you have a key and a value, so it doesn't directly translate. But yes, we can tweak the API to look more similar.
D
Ultimately, I think this is more of an implementation detail that we can fix, but yes, I was working with Michał, who was writing the KEP, to try to find the closest API to existing label selectors. Back to your first question: if I'm hearing correctly, you are suggesting we split this into two KEPs, one just for the delete options and adding reasons, and one for controlling the job failures and updates.
D
I guess that would just need an extra review. I mean, we're already adding SIG API Machinery to the KEP as a participating SIG, and the reasons change by itself is not that big and is hard to justify on its own, so I think having it as a single KEP provides the bigger picture. But if you disagree, we can split it.
C
No, that's just something that popped into my head when I was reading through it. My primary motivation for a slightly simpler API was that you'd basically have something similar to the label selectors approach, where you specify what you should be looking at, whether that's an exit code, the reason field, or something else, and then you specify an operator and a bunch of values to work with.
C
That's why the label-selector-like approach seemed a little bit simpler on the API surface, and it would also be much easier to extend in the future if you decide to add additional operators or additional fields on either side of that condition. It's probably something we can discuss in the KEP itself, but yes.
C
That's how I was thinking about it when I was looking at what we could implement with a slightly simpler API.
D
Yeah, I guess it's simpler if we're looking at extending it, but it's a little bit more work for the user if you just want to provide a min and a max, because now you have to define an array of comparisons instead of a single field. Okay, but I'll go back to the whiteboard, let's say.
C
Yeah, that could probably be beneficial; at that point it's maybe worth pulling in someone from the API reviewers on that bit as well.
D
Yes, I added someone from API Machinery, Evan, I think; I forget his last name now. But I'll try to follow up with other API reviewers if he doesn't respond. Do you think the approach of documenting all the reasons seems reasonable?
C
Definitely, because I'm pretty sure that as soon as we expose this kind of API, a lot of people will start relying on it. I'm positive that the fact that you'll be implementing and using this in the Job controller means it will then be used further by other consumers and other controllers that work directly with pods in a similar fashion.
C
When it comes to the KEP itself, I'm fully supportive of the approach, especially since, if I remember correctly, I was one of the authors of the original issues a couple of years ago, when we started initially doing the Job controller work. More than a couple, I think, yeah.
D
Yes, and just to clarify, I think the feedback we got from our internal users is that just using the exit code wouldn't be enough.
C
Having both the reason and the exit code, and maybe potentially something else in the future, is one of the reasons I'm thinking about being able to easily expand the API surface. Rather than having more and more fields added, going with the label-selector-like mechanism (I don't love the comparison, but it's probably the closest) would allow us a little bit more flexibility when it comes to expanding that surface.
D
Okay, yep, that makes sense; we'll fix that.
C
I just need to find the time to put the right people in the right place, and we will start putting together a group of interested folks. We have a list of all the folks, and we will slowly get this rolling.
B
In the issue mentioned today you can see that there are three to five interested folks already, and if we get this onto the mailing list, then maybe we can find more folks to implement and support it.
C
Yeah, we'll definitely make sure to put something together. Like I said, I'm super interested in having additional reviewers in both SIG Apps and SIG CLI, so yeah, it will happen; it just requires some time to get it kicked off.