From YouTube: Kubernetes SIG Node 20220913
Description
SIG Node weekly meeting. Agenda and notes: https://docs.google.com/document/d/1Ne57gvidMEWXR70OxxnRkYquAoMpt56o75oZtg-OeBg/edit#heading=h.adoto8roitwq
GMT20220913-170500_Recording_3840x2160
D
Yeah, sure, let me make...
F
Okay, perfect, yeah. Thank you very much for joining today. I will present our proposal for how we could have something like pluggable resource management inside Kubernetes, and I'm looking forward to your feedback.
F
In terms of the agenda for today's talk, I will start with a short summary of the current state of the technology we have inside the kubelet, covering mostly CPU management, memory management and topology management and how it's done today; a very short, one-slide overview of that. Then we will go through our proposal: how this can be extended, or basically reworked, as a pluggable resource management mechanism. We will have an overview of the possible architecture and of the steps we are planning.
F
The final two points of the agenda are about how we want to proceed. If the feedback on the approach is positive, we would like to open a KEP and basically start a PR process for it. And to show you that what we have in mind is actually a feasible idea, I prepared a small demo, which we will cover at the end if we have enough time. Feel free to interrupt me.
F
Basically, as you know, those are the CPU manager, memory manager, topology manager and device manager, and this already shows one of the problems we have: four pieces managing, sometimes, the same hardware. Part of them manage different hardware, like the device manager, but one of the issues we see is that three of them are not extensible, not really extensible; they are hard-wired, integrated inside the kubelet, and if vendors want to put in new logic, new CPU logic, if there are changes coming, this is very hard.
F
So keeping up to date with hardware becomes more difficult over time, and these three managers are somewhat limited. They do not expose the hardware up to the level we would actually like for future architectures, and this has also led, in the community, to a lot of custom-made solutions from users who saw those limitations of the CPU manager, memory manager and the others.
F
So a lot of solutions exist out there. Another drawback is that if you change the configuration of these components, the managers, you usually need to restart the kubelet. A lot of the configuration is done by the administrator, and the level of configuration the user can do is quite limited and always has to go through this mechanism. The other drawback:
F
What we see is that today's topology manager already has limits on today's hardware, or the hardware coming this year. Basically, the topology manager runs into limitations because of more complex chiplet architectures, which you can currently hardly handle with the topology manager, so you most probably lose performance. And the main problem for us is actually having those four pieces: if you want to push, let's say, new code into them, this will blow up the kubelet and make it bigger.
F
It will make it more complex, right. So it's really hard for us to extend the CPU manager and memory manager while they are integrated inside. That's why we are looking to propose a pluggable architecture where vendors can propose, or basically write, their own plugins for CPU management and the other kinds of resource management.
F
Just as a short overview of the architecture of the current kubelet: we looked inside the code at the main building blocks of the current architecture. You have a container manager which takes care of the lifetime events, and basically inside the kubelet you have the four different managers, and the lifetime events are called for all four of them. Between the managers you also have separate connections; usually they go mostly from the topology manager calling the other managers. And the other interesting piece:
F
What we see is that the device manager is the only one that is pluggable, so it has some sort of registration mechanism and device plugins. As we know, those are DaemonSets; when created, they call the registration endpoint of the device manager and can basically be instantiated at runtime.
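Since the proposal keeps pointing back to this mechanism, here is a minimal, illustrative sketch (not part of the talk) of how an existing device plugin registers with the kubelet over the v1beta1 device plugin API; the endpoint and resource name below are placeholders.

```go
// Illustrative only: how today's device plugins announce themselves to the
// kubelet over its registration socket. Error handling is kept minimal.
package main

import (
	"context"
	"log"
	"time"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"
	pluginapi "k8s.io/kubelet/pkg/apis/deviceplugin/v1beta1"
)

func main() {
	ctx, cancel := context.WithTimeout(context.Background(), 10*time.Second)
	defer cancel()

	// Dial the kubelet's device-plugin registration socket.
	conn, err := grpc.DialContext(ctx, "unix://"+pluginapi.KubeletSocket,
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatalf("dial kubelet: %v", err)
	}
	defer conn.Close()

	// Announce the plugin's own socket and the resource it advertises.
	client := pluginapi.NewRegistrationClient(conn)
	_, err = client.Register(ctx, &pluginapi.RegisterRequest{
		Version:      pluginapi.Version,
		Endpoint:     "example-device.sock",        // plugin socket under the device-plugin dir
		ResourceName: "example.com/example-device", // hypothetical resource name
	})
	if err != nil {
		log.Fatalf("register with kubelet: %v", err)
	}
	log.Println("registered; kubelet will now call ListAndWatch/Allocate on the plugin")
}
```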
F
The size has some meaning here: we denote with the size the complexity of those managers. If you look inside the code of the topology manager and CPU manager, the number of lines of code is growing and they are becoming more and more complex. The only one which is a little bit smaller is the memory manager, but all these components are rather big, which we want to simplify long term.
F
The
other
kind
of
important
message
of
the
current
or
important
aspect
of
the
current
architecture.
What
we
currently
see
yeah
one
one
drawback:
if
you,
if
we
look
into
the
topology
manager,
you
have
the
bit
life
cycle,
basically
for
for
containers
and
the
issue
what
we
saw
when
looking
inside
the
code,
it
actually
has
a
lot
of
side
effects.
F
This
kind
of
calls
inside
the
topology
manager
there
in
the
mid
life
cycles,
they're
they're,
a
lot
of
allocations
happening
already
on
devices
and
other
managers
which,
which
makes
it
hard
to
understand,
makes
it
hard
to
predict
from
from
what
what
happens
from
in
terms
of
later
extensions,
if
you
extended,
a
lot
of
things
can
break
almost
probably
so.
This
is
one
of
the
design
things
with
which
we
currently
also
found
out.
F
It
makes
it
more
difficult
to
extend
so
this
is
what
we
currently
have
and
let's
move
to
our
proposal.
F
So basically, we would like those resource managers to live outside of the kubelet and to be pluggable. If you have a CPU resource manager, the idea is for it to be outside of the kubelet and, similar to device plugins, it connects to a certain registration service and becomes visible to the kubelet.
F
Then
the
other
aspect.
What
we
want
to
improve.
We
would
like
to
give
a
little
bit
more
freedom
to
users
to
express
more
allocation
preferences,
basically
for
for
resources.
F
What
you
currently
do
a
lot
through
configurations
in
in
cubelet
through
the
policies
we
want
to
expose
this
to
the
user
through
through
the
through
basically
other
mechanisms,
and
last
but
not
least,
so
we
want
to
enable
a
plugable
interface
so
that
other
parties,
other
vendors,
can
Implement
vendor
specific
logic
outside
of
cubelet,
so
that
yeah
people
can
have
Alternatives
and
can
can
basically
Implement
code,
which
is
dedicated
to
the
hardware
right
and
the
idea
of
how
to
manage
and
name
that
in
the
future.
F
What
we
would
like
to
propose
is
to
have
signaled
sponsored
set
of
resource
plugins,
similar
to
what
you
have
with
six
scheduling.
Basically,
we
maintain,
together
with
the
community.
F
We
would
like
also
to
maintain
the
existing
experience
of
kubernetes.
This
means,
if
you
think
about
the
classical
CPU
management
and
topology
management,
as
you
have
it
today
in
kubernetes,
this
should
remain
remain
accessible
through
a
plugin
which
is
completely
end-to-end.
F
Right, so those were our objectives and goals, and to support them we are proposing to start a KEP for this kind of pluggable resource management approach.
F
We would like to keep the existing technology, the existing implementation logic, inside Kubernetes. So basically we propose to guard the new logic, to introduce gatekeepers for it, which when enabled will disable the standard CPU manager and topology manager; basically they will be mutually exclusive, and this will be controlled through a gatekeeper. Furthermore, end-to-end test compatibility will be our target. We already started working on that in the prototype and we showed that it is feasible.
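As a rough illustration of the mutual exclusion being described (the gate name and wiring below are hypothetical, not part of the proposal text), the guard in the kubelet's container manager could look something like this:

```go
// Hypothetical sketch of the "gatekeeper" idea: when the pluggable resource
// management gate is on, the built-in CPU/memory/topology managers are not
// started, so the two code paths stay mutually exclusive. The gate name and
// the constructors are placeholders, not real kubelet code.
package containermanager

type Manager interface {
	Start() error
}

// FeatureGates is a stand-in for the kubelet's feature-gate lookup.
type FeatureGates interface {
	Enabled(name string) bool
}

const PluggableResourceManagement = "PluggableResourceManagement" // hypothetical gate

func resourceManagers(gates FeatureGates) []Manager {
	if gates.Enabled(PluggableResourceManagement) {
		// New path: a single resource manager that talks to registered plugins.
		return []Manager{newPluginAwareResourceManager()}
	}
	// Legacy path: the existing in-tree managers keep running unchanged.
	return []Manager{newCPUManager(), newMemoryManager(), newTopologyManager()}
}

// The constructors below are placeholders so the sketch is self-contained.
type noopManager struct{}

func (noopManager) Start() error { return nil }

func newPluginAwareResourceManager() Manager { return noopManager{} }
func newCPUManager() Manager                 { return noopManager{} }
func newMemoryManager() Manager              { return noopManager{} }
func newTopologyManager() Manager            { return noopManager{} }
```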
F
Performance impact is also important for us. This flexibility comes with a price; we want to know what it is, and we would like to put the cards openly on the table for the community to see what the price of having a pluggable architecture is. And last but not least, we want to keep supporting the device manager capabilities as they exist today; we don't want to change anything there.
F
In
terms
of
architecture,
what
we
propose
looks
somewhat
similar
to
what
you
have
with
Device
managers.
So
it's
a
the
net.
G
F
Extension
basically
to
the
resource
management.
Basically,
we
have
a
resource
management
plugins
which
are
demon
sets
and
the
demon
sets
when
instantiated
they
register
to
a
resource
manager
inside
cubelets.
F
The
resource
manager
lives
in
the
it's
basically
controlled
by
the
container
manager,
and
the
container
manager
also
communicates
to
it
about
lifetime
events
and
policy
choices
and
stuff,
like
that.
So
first
step
is
registration
through
the
similar
to
device
managers.
F
What
you
have
today,
basically
through
a
socket
and
as
an
answer
of
the
registration,
we
currently
basically
return
the
cubelet
configuration
which
allows
us
basically
in
the
plugin,
to
know
already
all
the
details,
what
you
have
about
reserved
CPUs
policies
and
stuff
like
that,
then
after
this
registration
cycle
finishes,
we
start
processing
the
lifetime
events
coming
from
from
container
manager,
they
are
usually
well
defined.
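A rough, purely illustrative sketch of what such a plugin registration exchange could look like; the service, message names and the idea of returning the kubelet configuration in the reply are assumptions drawn from the description above, not an agreed API:

```go
// Hypothetical sketch of the proposed resource-plugin registration flow:
// the plugin dials a kubelet socket, registers, and gets the relevant
// kubelet configuration (reserved CPUs, policies, ...) back in the reply.
// Every name here is a placeholder; no such API exists upstream today.
package resourceplugin

import "context"

// RegisterRequest announces a resource plugin and the socket it serves on.
type RegisterRequest struct {
	Version      string // protocol version the plugin speaks
	Endpoint     string // plugin socket, e.g. "cpu-plugin.sock"
	ResourceKind string // e.g. "cpu", "memory"
}

// RegisterResponse carries the kubelet settings the plugin needs to honour.
type RegisterResponse struct {
	ReservedCPUs  string            // e.g. "0-1", mirroring the kubelet config
	PolicyOptions map[string]string // policy knobs the admin configured
}

// Registration is the service the kubelet would expose on its socket.
type Registration interface {
	Register(ctx context.Context, req *RegisterRequest) (*RegisterResponse, error)
}
```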
F
You have basically some sort of admission cycle, done with the Allocate and AddContainer functions; RemoveContainer when you are deleting containers; and some of the managers, like the CPU manager, have an internal reconcile loop, which we want to cover through a special reconcile event. For those events, after a certain allocation happens, we return, if the allocation was successful, the CPU set or the allocation of the particular resource.
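To make that event set concrete, here is a hypothetical sketch of the plugin-side contract implied by the description (Allocate and AddContainer at admission and start, RemoveContainer on deletion, plus a reconcile event); the type and method names are assumptions, not a defined API:

```go
// Hypothetical resource-plugin lifecycle interface, mirroring the events
// described in the talk. The plugin only computes the desired assignment;
// it never touches cgroups itself (see the next step in the flow).
package resourceplugin

import "context"

// Assignment is what a plugin hands back after a successful allocation,
// e.g. the cpuset a container should be pinned to.
type Assignment struct {
	CPUSet string // e.g. "2-5,8"
}

// ContainerRef identifies the pod/container an event refers to.
type ContainerRef struct {
	PodUID        string
	ContainerName string
}

// Plugin is the lifecycle contract the kubelet-side resource manager would call.
type Plugin interface {
	// Allocate runs at pod admission and may reject the pod.
	Allocate(ctx context.Context, c ContainerRef, cpuRequestMilli int64) (*Assignment, error)
	// AddContainer is called once the container is about to start.
	AddContainer(ctx context.Context, c ContainerRef) (*Assignment, error)
	// RemoveContainer releases whatever Allocate reserved.
	RemoveContainer(ctx context.Context, c ContainerRef) error
	// Reconcile lets the plugin re-assert desired state periodically,
	// replacing the CPU manager's internal reconcile loop.
	Reconcile(ctx context.Context) (map[ContainerRef]Assignment, error)
}
```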
F
This
is
usually
a
set
of
yeah,
the
the
structural
set
of
assigned
CPUs.
No,
the
idea
is
that
actually,
our
plugins
do
not
do
any
allocations.
They
return
the
desired
allocation
Set
to
the
cubelet
and
cubelet
then
basically
calls
the
runtime
service.
The
runtime
service
provides
an
interface
to
allocate
all
the
desired
resources
and
we
use
it
inside
kubernet.
So
the
this
this
makes
it
nice
in
terms
of
plug-in
perspective.
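The corresponding kubelet-side step, sketched here with a locally defined runtime interface because the exact CRI call signature varies by version, would take the plugin's desired cpuset and forward it to the container runtime, where the actual cgroup update happens:

```go
// Hypothetical kubelet-side glue: the plugin's desired cpuset is applied by
// asking the container runtime to update the container's resources (in the
// real kubelet this goes through the CRI UpdateContainerResources call).
// The runtime interface below is a local stand-in, not the real CRI client.
package resourcemanager

import (
	"context"
	"fmt"
)

type containerRuntime interface {
	// UpdateCPUSet asks the runtime to pin containerID to the given cpuset.
	UpdateCPUSet(ctx context.Context, containerID, cpuset string) error
}

// applyAssignment forwards a plugin's answer to the runtime and surfaces
// failures back to the caller, so they can be reported to the plugin / pod.
func applyAssignment(ctx context.Context, rt containerRuntime, containerID, cpuset string) error {
	if cpuset == "" {
		return nil // plugin left the container unpinned ("none"-style policy)
	}
	if err := rt.UpdateCPUSet(ctx, containerID, cpuset); err != nil {
		return fmt.Errorf("updating cpuset for container %s: %w", containerID, err)
	}
	return nil
}
```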
F
We
we
don't
need
any
special
privileges
here
and,
of
course,
there
are
errors
which
we
want
to
forward
back
to
cubelet
if,
if
the
location
is
not
not
possible,
so
all
these
things
we
want
to
capture.
F
This
is
the
main
mechanism.
What
we
are
thinking
about,
how
to
cover
basically,
this
kind
of
applicable
resource
management
in
terms
of
implementation
of
possible
plugins,
we
started
with
basically
something
like
which
covers
the
existing
functionality.
What
you
have
today,
we
have
a
plugin
which
had
combines
the
the
current
CPU
manager
memory
manager
and
to
biology
manager
together,
they're
one-to-one
the
same
code.
What
you
have
in
the
current
kubernetes
and
they
are
not
duplicated,
so
you
don't
have
a
code
duplication.
F
It's
just
instantiation
of
those
managers
inside
the
plugin,
so
actually
code
is
not
being
moved.
Code
is
not
being
duplicated
in
that
sense,
as
we
know
the
Poetry
manager
in
in
this
space,
we
we
want
to
structure
the
approach
in
several
phases.
In
the
first
phase,
we
basically
will
need
some
sort
of
integration,
because
topology
manager
basically
relies
on
hints
from
devices
and
other
topology
resources
and
yeah.
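A hedged sketch of what that default plugin's composition could look like; the manager types are represented by local interfaces here rather than the real kubelet packages, since the point is only that the existing managers are instantiated inside the plugin rather than copied:

```go
// Hypothetical "default" resource plugin that simply wraps the existing
// CPU, memory and topology manager logic, so the in-tree behaviour stays
// available through the pluggable path without duplicating any code.
// The interfaces and constructors are placeholders for the real packages.
package defaultplugin

type cpuManager interface{ Allocate(pod, container string) error }
type memoryManager interface{ Allocate(pod, container string) error }
type topologyManager interface{ Admit(pod string) error }

type DefaultPlugin struct {
	cpu  cpuManager
	mem  memoryManager
	topo topologyManager
}

// New wires the three existing managers together behind the plugin API.
// In a real implementation these would be the kubelet's own constructors,
// configured from the kubelet configuration returned at registration time.
func New(cpu cpuManager, mem memoryManager, topo topologyManager) *DefaultPlugin {
	return &DefaultPlugin{cpu: cpu, mem: mem, topo: topo}
}

// Allocate runs the same admission sequence the kubelet runs today:
// topology admission first, then CPU and memory assignment.
func (p *DefaultPlugin) Allocate(pod, container string) error {
	if err := p.topo.Admit(pod); err != nil {
		return err
	}
	if err := p.cpu.Allocate(pod, container); err != nil {
		return err
	}
	return p.mem.Allocate(pod, container)
}
```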
F
So
we
we
will
have
some
sort
of
integration
logic
inside
the
plugin
inside
this
kind
of
default,
plugin,
the
other
important
building
block.
What
I
want
to
point
out
is
the
device
manager.
F
So
we
want
basically
a
single
point
of
contact
manager
to
introduce
a
single
point
of
contact
measure
inside
cubelet
for
this
kind
of
pluggable
resources
and
to
maintain
compatibility
with
Device
Manager
and
basically
to
to
not
to
change
the
code
we
just
want
to
instantiate
device
manager
inside
and
all
the
the
logic
basically
will
remain
the
same
as
you
can.
It's
in
device
manager,
no
code
changes
are
required
here.
F
It
will
just
be
basically
the
the
calls
which
the
different
managers
here
on
top
have
regarding
device
manager
will
be
handled
by
resource
manager.
It
will
adapt
the
calls
to
the
device
manager,
foreign.
F
Slides
the
architecture
resembles
a
lot
device
manager
as
you
get
it
originally,
so
we
think
that
it's
possible
actually
to
have
one
manager
for
everything,
one
resource
manager
which
can
also
handle
devices.
F
So
this
would
be
our
phase
two
suggestion.
If
we
can
come
up
with
the
approach
where
we
can
handle
all
all
resources
through
a
single
point
of
contact
single
manager
inside
inside
cubelet,
we
call
it
resource
manager.
It
supports
basically
the
same
kind
of
events,
what
we
discussed
so
far,
and
it
is
Backward
Compatible
with
any
device
plugins.
What
you
have
currently
and
you
can
then
add
new
resource
plugins,
and
you
have
also
backward
compatibility
with
with
with
the
standard,
CPU
manager,
memory
manager,
topology
manager,
logic.
I
So I'm the original implementer of the topology manager, and I did all of the refactoring for the CPU manager and device manager, and I reviewed all of the code for the memory manager, for how it's currently architected and put together. So, like I said, I'm not sure how many questions... My biggest kind of worry:
I
So
we've
tried
to
do
an
exercise
similar
to
what
you're
doing
now
at
some
point
in
the
past,
I
wouldn't
say
it
failed,
but
it's
always
a
big
undertaking,
because
you
know
there's
so
many
things
that
are
in
place.
We've
been
doing
things
the
way
they
are.
You
know
I'm,
not
necessarily
happy
with
the
current
architecture
that
we
have
today.
I
There's
obviously
many
many
flaws
with
it,
but
you
know
I
slowly
over
time,
at
least
from
my
own
personal
opinion
came
to
the
to
the
kind
of
realization
that
you
know
to
me.
I
It
doesn't
feel
like
plugins
for
any
of
this
stuff
really
actually
belong
in
kubernetes
in
the
kublet
anywhere
at
all
and
actually
belong
down
at
the
container
runtime
level
kind
of
how
the
cni
stuff
has
now
moved
down
to
be
completely
runtime
plug-in,
rather
than
something
that's
that
runs
in
a
pod
inside
kubernetes,
and
we
can
talk
about
the
details
of
some
of
that
later
and
I'm,
not
necessarily
saying
that
we
don't
want
to
go
this
direction,
because
this
definitely
will
clean
things
up.
Quite.
I
If we decide this is the right thing to do and there are people to actually put in the effort to work on it, fine, but I do think we need to sit down and have a conversation about whether we even want to continue maintaining these components the way they are. The one thing that jumped out at me when you were talking about making sure we have backwards compatibility with the existing CPU manager, memory manager and topology manager:
I
That carries a bunch of baggage along with it that we don't necessarily want to have to continue to maintain. I'd rather have a set of feature gates that could turn those off and then use this new architecture, with its better way of doing things, instead, rather than try to maintain backwards compatibility with the old stuff. But yeah.
F
Correct
so
we
are
on
the
same
page,
so
basically
here
or
our
goal
would
be
also.
This
new
architecture
should
not
carry
this
baggage.
What
you
have
with
the
old
architecture,
but
the
plug-in
mechanism
allows
you
to
have
a
single
plugin
which
which
can
emulate
more
or
less
the
architecture.
So
if
people
want
the
old
architecture,
they
can
instantiate
a
single
plugin
which
does
it
and
It's
Made
Simple
for
them,
but
yeah.
F
Our
idea
here
is
keep
the
interface
or
Define
a
new
interface,
which
is
not
driven
by
the
alt
architecture,
but
it's
actually
driven
by
yeah
a
meaningful
management
of
this
kind
of
plugable
resources,
and
then
we
we
want
to
emulate
more
or
less
that
the
current
state,
so
that
users
still
can
use
what
they
are
used
to
it.
So.
I
Obviously, because not everyone is going to be able to move off of this style of resource management to DRA, and maybe they don't even want to move off of the old style onto DRA going forward, we're always going to have to have some way for these two things to coexist. So if this is the way to clean up the old-style resource management, then I'm all for it, if that's what the community decides we want to do.
C
They are solving a real problem, and I noted that you mentioned the NRI piece as well.
C
By moving this complexity outside, you're no longer having to turn everything off for runtime-level fixes; you can now just choose not to use the plugin and say, oh, we'll handle it at the runtime level.
C
To
ignore
this
entire
processing
piece
and
if
a
customer,
for
instance,
has
10
nodes
of
which
they
want
to
do
a
special
resource
plugin
they
can
so
when
you're
talking
about
research
like
it's
CERN
with
Ricardo
over
there,
then
you
can,
they
can
say,
oh
well,
we
want
to
do
some
new
examples
to
see
if
we
can
get
speed
ups
for
certain
types
of
resource
managers
and
do
there.
This
also
gets
some
of
the
community.
C
You
know
the
vendors,
because,
where
I
didn't
tell
right
we're
not
going
to
hide
that
and
but
it
lets
Avengers
start
releasing
specialty
resource
plugins
for
CPU
Etc
without
complicating
the
community,
so
the
community
no
longer
has
to
deal
with
it.
So
it
lets.
You
know
the
maintainers
sleep
at
night
and
focus
on
keeping
things
simple
and
running
and
lets
the
vendors
deal
with
the
the
complexity.
A
So
that's
actually
not
true
my
co
for
all
those
things.
It
is
a
favor
of
those
vendor
which
is
like,
for
example,
storage.
Vendor
here
is
the
different
type
of
the
resource
computer
resource
or
whatever
resource
offer
winter,
not
really
necessary.
Fever
of
the
kubernetes
offer
right,
so
they
also
have
different
type
of
the
platform
vendor.
So
there
are
so
many
of
the
different
plugin.
So
then
there's
the
compatibility
integration,
all
those
kind
of
things.
A
So
my
my
actually
the
presentation
here
I,
because
we
try
to
have
the
separate
of
the
resource
management
from
day
one
actually,
but
because
that's
over
complicated.
So
we
want
to
make
that
simple.
So
that's
why
we
have
next
building
but
from
day
one
we
think
about.
Maybe
we
should
separate.
We
even
talk
to
Docker
packages
say
that
switches,
a
name
is
Kevin
earlier
said
we
from
day
one
we
basically
start
to
say:
oh,
can
we
push
down
that
one
to
The,
Container
level
but
the?
A
But
that
time
we
really
believe
push
it
down
to
down
level
will
be
helpful,
but
after
over
seven
years
on
this
kubernetes
actually
sometimes
I
feel
like
what
you
just
say
actually
the
different
level
of
the
problem.
But
the
one
comment
earlier
try
to
say:
oh
this
is
simplified,
two
things.
The
motivation
I
disagree
here
clearly
disagree
when
it
is
restart
kubernetes
most
time
it
is
the
event
winter
offer
or
maybe
like
the
platform
offer
or
whatever
kubernetes
offer.
A
The
uid
will
endorse
what
kind
of
the
plugin
what
kind
of
the
resource
they
are
going
to
support
offer
to
their
customer.
So
anyway,
they
are
going
to
promising
those
nodes
province
in
those
resource
and
the
promising
those
clusters,
so
restart
kubernetes
is
really
cheap
things.
It's
super
cheap.
It's
not
like
the
restart
note
right.
So
when
you
pack
into
some
new
resources,
sometimes
you
to
craft
the
kernel,
reboot
not
even
mentioned
kubernetes
restart.
So
that's
not
the
problem
for
customer
actually
for
user.
A
In a lot of cases that's hidden; there are also hidden things, like Kubernetes will restart the node when a certain error is detected, then we restart the node, or we could restart containerd, or maybe restart the kubelet, so they have different complexity for the customer, but that's hidden from the customer, and restarting the kubelet is the cheapest one. So the analysis that says, oh, this just solves that and makes Kubernetes easy: actually it's not, I think, about that.
A
So
when
you
have
the
more
things
you
know
you
have
to
combine,
if
you
look
at
everyone,
look
at
the
cncf
that
huge
gland
and
then
people
keep
asking
me
my
top
question.
Actually
I
received
from
the
kubernetes
user
today
is
down.
Can
you
tell
me:
what's
the
opinionated
offer
from
the
kubernetes
I
can't
because
we
have
so
many
plugins
we
have
so
many
we
are
have
that
one
way
is
the
good.
Is
the
ecosystem
for
the
other
way,
users
don't
know
what
they
should
be
choose.
F
Oh,
in
any
case,
you
will
not
have
some
a
double
number
of
plugins.
So,
as
you
as
you
see
today,
for
example,
you
have
one
one
GPU
plugin,
you
have
one
I,
don't
know
adapter
network
adapter
plugin,
so
they
are
very
clear
what
what
kind
of
plugins
you
will
have
there
are
not
thousands
of
them
and
in
terms
of
performance,
okay,
you,
you
have
a
little
bit
better
performance
if
you
live
inside
cubelet
and
all
stuff
calling
the
managers,
but
the
actual
performance
is
the
workload
performance.
C
That is maybe partially slanted by my experience with Kubernetes, because initially I was working at an AI startup equivalent and we had a hell of a time whenever we restarted the kubelet, because we have complex networking, and every time you restarted the kubelet it didn't play well with the network. So actually restarting the kubelet meant I had to restart the node, which meant draining my node, and I...
C
Don't
think
maybe
that
environment
was
unique
but
I
don't
think
it's
necessarily
unique
when
you're
talking
about
complex
things,
so
I
think
if
you're
talking
just
regular
couplet
with
you
know
a
very
simple
infrastructure,
maybe
that's
true,
but
for
us
deploying
a
Daemon
set
was
easy.
If
we
were
going
to
have
to
change
anything
on
the
Kublai,
it
stopped
us.
So
we
didn't.
Does
that
make
sense.
A
So
once
you
have
this
plugable,
so
you
are
going
to
have
the
so
many
different
demon
side
to
different
implementation,
but
I
agree
with
the
atnas
earlier
say
anyway.
Today,
even
today,
we
have
the
different
of
the
type
of
the
plugin
for
this
device
plugin
right,
so
we
already
have
that.
My
point
is
to
see.
A
This
is
why
we
are
simplify,
actually
is
not
I
just
want
to
make
them
more
clear,
because
once
you
have
that
API
there
will
be
count
on
the
vendor
right
to
implement
as
well
so
end
up
which
one
for
customer
to
using
that's
the
cut.
Kubernetes
Community
always
try
to
answer
that
question,
especially
on
the
story.
Network
notice
that
there
will
be
have
tons
of
those
questions
you
need
to
answer
so
customer
access,
the
user
kubernetes
user
today
is
actually
most
problem
is
too
complicate,
because
so
many
choices.
C
One thing we can do is pull from the SIG Scheduling group and how they're doing their plugins, because they also have many plugins, so they may have a different or similar experience; there's a curse of choice.
B
Might
be
related
question
to
this
kind
of?
How
do
we
do
some
role
description
so
and
how
do
we
do
portability
of
ports
like
let's
say
in
the
connect
session
or
register
section,
and
you
pass
Coupe
config
to
plugin
manager,
I
assume
it's
for
current
topology
manager
to
work,
as
is
so
then
like.
If.
B
Topology
manager,
what
is
the
ideal
plan
is
Kublai
will
be
configured
with
configuration
specific
for
resource
manager
and
then
how
admin
will
synchronize
like
resource
manager
set
and
like
Google
configuration.
That
will
be
a
little
bit
complicated
task.
So
maybe
what
Kevin's
suggesting
to
move
this
entire
logic
to
continue?
F
Being able to choose policies, basically: we spoke with some different customers, like telco companies and so on, and they expressed the need to change policies, for example between static and, I don't know, different topology policies. They want to express it in a spec or through an annotation or something; they don't want to express it through a configuration. For some admins it might make sense to configure the cluster to have a certain policy, and then users can deploy on that.
F
Also, if you think about GPUs, those are definitely devices, and you handle that in some way; basically they know about the topology and so on. But there is a certain group of workloads where users would like to deal with their topology constraints themselves and not hand that to the administrator, yeah.
I
In the initial proposal for the topology manager, we had extensions to the pod spec to allow users to control this stuff, and it was decided at the time that no one wanted to change the pod spec for these.
I
You
know
for
these
additional
fields
that
were
very
specific
to
topology
management,
and
then
the
proposal
came
on
the
table
where
we
could
just
use
annotations,
and
then
you
know
the
argument
always
against.
That
is
that
we
don't
want
the
you
know.
Kubernetes
code
base
itself
to
be
inspecting
these
opaque
types
that
can
be
embedded
in
annotations
that
don't
have
any
meaning
to
kubernetes
itself.
I
F
Right,
but
this
is
one
of
the
problem
what
kubernetes
has
today,
if
you
think
about
the
HPC
and
AI
users,
they
are
drifting.
B
Today
we
have
at
least
some
level
of
portability,
and
we
have
at
least
some
level
of
understanding
like
you
configure.
B
This
is
what
you
get
and
that
there
is
no
like
back
door
with
the
plugin
model.
I
think.
Maybe
we
need
to
and
I
I,
don't
I
don't
have
anything
against
it
like
I
know
that
there
are
many
problems,
but
maybe
we
can
come
up
with
idea
like
how
to
like
split
people
on
roles
and
decide,
which
shows
will
do
what
and
then
have
put
at
least
some
limitations
on
what
the
resource
plugins
will
do.
Otherwise,
we'll
get
into
what
Kevin
describes
annotations
yeah.
I
This is actually one of the things DRA tries to help with, because with the claim parameters they're defined by the driver and they're checked as part of a CRD, rather than being, you know, opaque annotations. But it's a completely different mechanism using something like DRA compared with this simple device and CPU and memory management that we have in this architecture.
I
But
it's
you
know
it's
orthogonal,
but
it's
yeah
go
ahead.
Would.
H
Yeah, Alan, Brunell, Dawn, I think this is detail now that requires a working group or subgroup within SIG Node to cover, not just this, but also the things mentioned around NRI, so that we can balance the resource requirements coming down from the pod specs and the controller managers, right down through the kubelet, and then see what we can allocate to the plugins in certain circumstances.
H
Obviously
we
need
to
be
able
to
manage
resources
across
pods
CPUs.
You
know
other
resources,
gpus,
we
get
a
lot
of
requests
for
this
it
we
want
to
be
able
to
run
small
pods
or
long
running
pods
with
small.
You
know
quickly,
running
Services,
fast
services
in
a
container
and-
and
that's
it's
a
primary
case
even
in
kubernetes
today,
right,
it's
not
just
all
about
long
running.
You
know
pods
that
are
scaled
across
a
cluster
right.
A
I
have
to
do
the
time
check
here
so
so
can
we
convert?
Can
you
share
the
the
slide
back
to
the
meeting
agenda
and
also
can
we
cover
this
to
the
cap
and
continue
discussing
there
and
Analysis
Kevin
and
we
need
we
have
the
several
proposal,
especially
for
the
dynamic
resource
allocator.
So
can
we
converging?
Can
we
see
what
is
clearly
defined?
What's
the
scope,
because
at
this
moment
there's
some
overlap
here?
A
Can
we
can
we
agree
on
what
kind
of
things
and
who
is
going
to
address
what
one
thing
otherwise
we'll
be
I
can
say
that
immediately
we
have
to
slow
down
both
right
because
until
we
figure
out
because
we
don't
want
two
things
over
liable
to
address
the
same
problem,
and
so
can
we
address
that
first
and
and
then
say
the
both
different
type
address
the
what
kind
of
problem.
What's
the
goal?
I
also
agree
with
the
currency,
the
needs.
A
If
we're
doing
this
kind
of
things,
we
may
not
want
to
carry
off
the
old
baggage
here
and
but
I
think
there's
a
certain
feature
parity.
Maybe
we
need
to
consider
you're,
not
unnecessary.
You
have
to
choose
executive
feature
parody,
but
do
we
need
to
consider
say
okay
well,
for
customer
ask,
for
it
hasn't
used
to
ask
for
this
kind
of
things
how
we
are
going
to
equivalent
something
I
think
that's.
F
Yeah, other than that, I think that's true. I just have the demo; if we have some minutes left for the demo, I can do it, but only if you don't have other agenda topics.
D
I
think
don
what
we
can
do
is
like
we
can
probably
move
the
planning
to
the
top
of
the
list
next
week
and
meanwhile,
what
Reuben
and
I
are
asking
everyone
to
do
is
take
a
look
at
the
document
and
then
just
tell
us
that
you
have
the
time
to
work
on
that
list.
I
know
that
the
top
seven
that
that
are
carried
over
from
125
like
people
are
actively
working
on
it,
but
that's
the
second
table.
A
So how about this: we ask the other topic owners one by one, and then hopefully we can save some minutes before time is up. The first one, to me, didn't go through, because I know that one of those actually has nobody here; so, the Windows CRI pod sandbox fields one.
A
Oh, if you are okay with moving it to next week, that would be wonderful, if you want; otherwise, thinking about it now, we can go over them one by one and hopefully we have some time left. If we don't, we can see what we do with the demo. Yeah, I definitely want to see the demo too, but we can do that through some other channel, yeah.
K
Yeah
so
for
the
windows
here,
I
sandbox,
Fields,
I,
think
this
is
there's
still
some
active
discussions
going
on
in
that
in
that
pull
request.
I
think
that
my
question
isn't
James
isn't
here
today,
but
it
seemed
like
a
couple
of
weeks
ago
we
were
settling
on
having
the
same
sets
of
fields
for
the
CRI
stats
and
then
last
week
there
was
a
demo
of
a
whole
bunch
of
new
Linux
specific
stats,
getting
added
which
made
me
kind
of
reconsider
and
say.
K
Maybe
we
should
keep
them
as
like
Windows,
specific
and
Linux
specific
stats
and
I
think
we
just
need
to
make
a
decision
on
that
so
that
we
can
move
forward
with
getting
the
CRI
API
updated
so
that
we
can
vendor
those
changes
into
the
container
runtimes.
D
David,
you
guys
are
looking
at
these
stats.
Do
you
have
any
thoughts.
L
Go ahead; sorry, I was just going to say, yeah, so just a little context: the presentation last week was by Daniel, and we're trying to move forward with the KEP there. But one of the big challenges on the Linux side of the implementation for that KEP is:
L
We
have
a
lot
of
other
metrics
today
that
people
use
VSC
advisor,
and
so
one
of
the
ideas
we're
coming
up
with
in
that
cap
was
that
we're
going
to
add
basically
the
missing
fields
that
were
served
by
sea
advisor
in
the
CRI,
and
so
that
way
the
container
runtime
could
serve
them
and
then
the
kublic
could
basically
expose
them
as
Prometheus
metrics
kind
of
for
backwards
compatibility
for
C
advisors.
So
that's
kind
of
the
context
of
why
we're
wanting
to
add
those
the
rest
of
those
Linux
stats.
J
So,
for
for
the
for
the
stat
summary
metrics
I
could
see
it
possibly
making
sense
to
share
but
share
the
structures,
but
if
the
handling
of
those
stats
are
going
to
be,
windows
are
like
specific
or-
and
you
know
thinking
about
like
it
probably
makes
sense
that
the
windows
have
some
specific
C
advisor
like
stats
that
are
reported
through
Prometheus.
J
So,
given
the
change
in
scope
of
the
that
we're
thinking
of
passing
these
metrics
up
through
the
CRI,
rather
than
having
the
runtime
report,
the
C
advisor
metrics
directly
I
do
think
that
it
makes
sense
to
have
separate
separate
objects.
We
just
have
to
kind
of
figure
out
what
the
oh
like,
if
we're
going
to
have
any
overlap
between
those
objects
or
if
we're
going
to
have
them,
be
totally
distinct
and
have
the
handling
be
totally
distinct.
Between
platforms.
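A purely hypothetical sketch of the "separate objects" option being weighed here; none of these type or field names come from the CRI, they only illustrate keeping a small shared core while letting each platform carry its own stats:

```go
// Hypothetical shape for platform-split CRI container stats: a small shared
// core plus at most one populated platform-specific block. These structs are
// illustrative only and do not correspond to the actual CRI messages.
package statsdraft

type SharedContainerStats struct {
	ContainerID string
	TimestampNs int64
	CPUUsageNs  uint64 // usage both platforms can report
	MemoryBytes uint64
}

type LinuxContainerStats struct {
	// Fields historically served by cAdvisor that Windows has no equivalent for.
	MajorPageFaults uint64
	RSSBytes        uint64
}

type WindowsContainerStats struct {
	// Windows-only counters with no Linux counterpart.
	MemoryCommitBytes      uint64
	PrivateWorkingSetBytes uint64
}

// ContainerStats carries the shared core plus at most one platform block,
// so the kubelet-side handling can branch cleanly per platform.
type ContainerStats struct {
	Shared  SharedContainerStats
	Linux   *LinuxContainerStats   // nil on Windows
	Windows *WindowsContainerStats // nil on Linux
}
```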
D
I think it's Marcus and folks from Intel. Marcus, are you on the call?
E
But
it's
it's
not
my
my
proposal
so
I'm
pretty
sure
what
it
is
so
some
some
of
my
colleagues.
So
it's
free
it's
from
Alexey,
but
it's
fine
to
move
it
for
the
next
week.
Alexis
is
unfortunately
one
was
sick
leave,
so
we
can't
participate
today.
A
Oh,
maybe
then
we
move
to
next
week
is
that
okay.
A
Next
one
is
the
host
and
the
no
the
network
support
for
Windows
pod
I.
Think
the
let's
came.
K
From
yeah
this
is
me
again:
I
think
this
one
should
be
pretty
quick.
K
This
is
more
of
a
heads
up
and
I
just
wanted
to
see
if
node
had
any
concerns
with
this,
but
today
you
can
set
host
Network
to
true
for
pods
and
if
it
runs
on
Windows
it
just
just
doesn't
do
anything
which
is
a
bit
confusing
to
users,
and
so
I
was
looking
at
either
fixing
the
validation
or
doing
the
actual
implementation
to
get
it
so
that
we
can
join
Windows
containers
to
the
hosts,
Network
namespace
and
it
looks
like
that's
all
possible
in
Windows,
so
I
was
I,
authored
it
to
kind
of
propose
that
the
cubelet
changes
are
going
to
be
quite
minimal.
K
We would probably update the CRI API, because all of the namespacing options are under the Linux pod sandbox config field, and I think we would want to have some options that are either more generic or on the Windows pod sandbox config field. Then the rest of the kubelet updates are just filling in those fields if the pod spec says to use host network mode. So I was wondering if anybody had any concerns with moving forward with that.
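For illustration only, the kind of CRI-side addition being floated could look roughly like the sketch below; the Windows-side field is entirely hypothetical (today the namespace options live only under the Linux sandbox config), and any real change would go through the CRI protobuf, not Go structs:

```go
// Hypothetical sketch of the CRI addition being discussed: giving the Windows
// pod sandbox config a way to request the host's network namespace, mirroring
// what the Linux sandbox config can already express. None of these fields
// exist in the CRI today.
package cridraft

// NamespaceMode mirrors the idea of pod-scoped vs node-scoped namespaces.
type NamespaceMode int32

const (
	NamespacePod  NamespaceMode = 0 // sandbox gets its own network namespace
	NamespaceNode NamespaceMode = 1 // sandbox joins the host network
)

// WindowsSandboxNetworkConfig is the hypothetical new knob.
type WindowsSandboxNetworkConfig struct {
	Network NamespaceMode
}

// WindowsPodSandboxConfig sketches where such a field might hang.
type WindowsPodSandboxConfig struct {
	NetworkConfig *WindowsSandboxNetworkConfig
}

// hostNetworkRequested shows the kubelet-side fill-in step: if the pod spec
// sets hostNetwork: true, the sandbox config asks for the node namespace.
func hostNetworkRequested(podHostNetwork bool) *WindowsPodSandboxConfig {
	mode := NamespacePod
	if podHostNetwork {
		mode = NamespaceNode
	}
	return &WindowsPodSandboxConfig{
		NetworkConfig: &WindowsSandboxNetworkConfig{Network: mode},
	}
}
```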
E
I'm sorry, I was on mute. So, we have like two minutes left. I had a short update on the QoS resources proposal that has been updated lately; I'm trying to get it included in 1.26, but I think we can handle it next week if we are running out of time now, because I have a few slides on what has been happening there.
A
So
Atlas-
and
we
only
have
the
one
minute
sorry,
but
I
can't
stay
longer
anyway.
I
asked,
can
you
stay
longer
and
also
another
possibility?
Is
this
we
record
the
demo
and
put
the
link
record
of
the
link
in
here
and
the
next
week
we
make
another
announcements,
so
people
make
sure
people
didn't
miss
those
your
demo.
A
F
Yeah, let me share; I hope you can still see my screen. In terms of the demo, what you will see is that I have a plugin where I reference two kinds of containers: one container has the CPU manager set to none, so basically one plugin will run the none CPU manager, and the other plugin will run the static CPU manager with one reserved CPU. The change is just changing the container; it's...
F
It
looks
very
familiar
most
probably
to
people
who
wrote
device
plugins
very,
very
similar
to
that,
and
then
I
have
an
example
bot
which,
which
basically
does
a
lot
in
20
cores.
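As a hypothetical illustration of what "the change is just changing the container" could amount to, the plugin binary might simply select which CPU policy to instantiate from a flag baked into each container image; this is not the actual demo code:

```go
// Hypothetical sketch of the demo plugin's start-up: the only difference
// between the two containers is which CPU policy the plugin instantiates
// ("none" vs "static" with a reserved CPU). Purely illustrative.
package main

import (
	"flag"
	"log"
)

type cpuPolicy interface {
	Name() string
}

type nonePolicy struct{}

func (nonePolicy) Name() string { return "none" }

type staticPolicy struct{ reservedCPUs string }

func (staticPolicy) Name() string { return "static" }

func main() {
	policyName := flag.String("cpu-policy", "none", "CPU manager policy to run: none or static")
	reserved := flag.String("reserved-cpus", "0", "cpuset reserved for system use (static policy only)")
	flag.Parse()

	var policy cpuPolicy
	switch *policyName {
	case "static":
		policy = staticPolicy{reservedCPUs: *reserved}
	default:
		policy = nonePolicy{}
	}
	log.Printf("resource plugin starting with CPU policy %q", policy.Name())
	// ... register with the kubelet and start serving lifecycle events ...
}
```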
F
So
what
I
will
do?
First
is
just
since
38
the
non-cpo
manager.
F
This
is
just
applying
the
the
deployment
and
yeah.
If
everything
worked,
fine,
you
will
see
a
plugin
appearing
and
basically
inside
sorry,.
F
The
logs,
you
will
see
that
we
have
a
CPU
manager
with
non-policy
and
then,
if
I
start
the
my
workload,
which
was
this
slot
on
20
cores.
F
Right, you will see something like that in terms of the picture: with the none policy, the cores the load spins on can move around; basically the Linux scheduler decides which core to take. Then I delete the workload, I delete the plugin, and I make a small modification: I use the static CPU manager, just by changing the container choice.
F
If
we
look
at
the
Lots,
basically
you
will
see
that
becomes
more
yeah
it.
This
time
it's
a
static,
CPU
manager
and
yeah
I
can
now
try
again
the
workload.
You
will
see
a
little
bit
different
Behavior
so
that
the
course
will
not
move
around.
They
will
be
actually
compact
placed,
but
it
takes
some
time
until
the
workload
Comes
live
foreign.
F
From
the
original
CPU
manager,
it
just
lives
in
a
plugin,
so
nothing
changed
what
what
we
are
running
but
yeah.
This
is
the
the
static
CPU
manager
Behavior.
What
you
could
expect
so,
just
just
to
demonstrate
that
you
can
change
policy
without
need
to
restart
anything.
You
just
instantly
the
new
plugin
and
it's
done.
A
Thank
you
so,
just
like
what
earlier
discuss
what's
the
next
step
right
so
there
we
can
have
the
work
group
and
the
discussing
this
one,
and
especially
on
the
what
we
should
converge
in
what
we
should
separate
from
the
dynamic
the
resource,
allocator
and
then
another
one.
It
is
then
covered
this
into
a
cap.
Then
we
move
forward.
Let's
see,
what's
yeah.
I
And
so
I'm
happy
to
take
part
in
the
networking
group
and
I
know
that
Dynamic
resource
stuff,
pretty
well
and
I,
also
know
the
existing
components
that
they're,
trying
to
you
know
abstract
out
here
really
well
so
yeah
I
think
that
that
sounds
like
a
good
plan.
That's
strength,
Converge
on
what
how
these
two
things
differ
and
what
the
goals
from
each
of
them
are
separately
and
we
can
move
forward
from
there.