From YouTube: Kubernetes Resource Management WG 20171101
Meeting Agenda:
https://docs.google.com/document/d/1j3vrG6BgE0hUDs2e-1ZUegKN4W4Adb1B6oJ6j-4kyPU
A
All right, so I just kicked off the recording. This is, I guess, the November first meeting of the Resource Management group. We've got two topics on the agenda, but I'm not sure if we actually have their representatives on the call. So if someone who is not recorded here wants to talk about the initial FPGA support proposal that was linked, please speak up; if not, we'll move on to the device plugin architecture document. Is someone on the call able to speak about FPGAs?
F
Can you hear me now?

A
Yes, we can hear you. Okay!

F
Thank you. There was some confusion about the exact time: I joined a few minutes earlier and was told the host had not joined yet and that it actually started one hour later because of daylight saving time. So it was a bit of a mix-up, sorry about that.
F
Thank you. Okay. First of all, thanks a lot to the co-authors for helping me write this document. I think many of you have already looked at it, so we can either walk through it section by section, or, if you've got specific questions, we can look at those. I'm fine either way.
F
Good. So the overall background is that FPGAs are a different breed of device compared to other devices. There are additional factors, like how you program the device and how you handle images and programming. In addition, when you do the programming there could be multiple accelerators in the same device: you could have an FPGA with two separate partial reconfiguration regions, and each region may contain a different accelerator.
F
Also, the image needs to match the particular region type it is meant for. So when you synthesize an image, say for an IPSec accelerator, you synthesize it for a particular region; you cannot apply it to some other region on the same FPGA. The software has to take care of all these things, and when you pick an image to do the programming, you need to make sure it matches the partial reconfiguration region type. There may not be anything in the hardware that prevents bad programming: some devices may have that, and some may not.
F
For example, doing an mmap manually of the PCI registers into the address space requires privileges, so there should be some way for the container to express that, or for a plugin to say that this kind of operation requires it. So, based on all the discussion we had in the document over the past couple of days, I think the overall tenor of the feedback is that we should not make any API changes just yet; we should try to stick to the APIs already proposed and see what limitations arise.
B
Yeah, so that's a good point: let's just keep it very simple. I think we can make some safe assumptions, that people aren't going to reprogram every day or every hour, and so have a really simple control logic, or give people a complete solution that they can use even if it has some limitations, and then start prioritizing which limitations need to be addressed right away, and then bring it back to the community.
F
To answer your questions: yes, the main things we want to address are statically pre-programmed devices as well as the container programming model. But I should also emphasize that the current APIs do not allow for handling local memory in any form. So if you've got two FPGA implementations which differ in the amount of local memory, there is no way to express that or to request a certain amount. So even the local memory has to stay outside the API for now.
B
Devices are not really shared, and if you have to share them, then we need better APIs, essentially. For example, we share the main memory and we share the CPU, and that requires extra logic built into Kubernetes to make it happen. We don't have a good, extensible way of applying such logic to other resources.
B
So, to begin with, if we can just statically define what the memory sizes would be for the different accelerators that you burn into an FPGA, then that might be good enough to start with. Maybe you express that as part of your resource name, for example. We just find the easiest possible way to have users consume this resource, and then we can come back and find out whether we even have to improve it and, if so, what the right user experience would be.
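As a rough sketch of that idea, a pod could request an accelerator flavor whose fixed local-memory size is baked into the extended resource name; the resource name and image below are purely hypothetical, not something settled in this meeting:

```yaml
# Hypothetical: the device plugin advertises one extended resource per
# accelerator flavor, with the static memory size encoded in the name.
apiVersion: v1
kind: Pod
metadata:
  name: ipsec-offload
spec:
  containers:
  - name: app
    image: example.com/ipsec-app
    resources:
      limits:
        example.com/fpga-ipsec-4gb: 1
```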
D
So I think you're imagining in the document that you can start with pre-programmed devices that have some statically allocated local memory, with that model, and you do also mention supporting dynamic local memory allocation. Is this information expected to be node-local knowledge, or hidden? Is that right?
F
Initially it can be completely hidden by the plug-in, but what I'm trying to convey is that in general it may need to be exposed as a resource. So the container may come and say: I want 4GB of local memory for this IPSec accelerator. But not all devices may actually have 4GB of local memory, so you need to actually pick a device, at the scheduler level or the kubelet level, which actually contains that much of the resource.
F
So to begin with, we can make a simplifying assumption, like: all devices contain enough resources for any use case. That would be a very restrictive assumption, but we can start with it. In general, though, we'll want to expose some more resources, and the containers would be able to ask for them, right?
B
So you could model homogeneous nodes more systematically: you could consider adding labels. This was part of the original device plug-in design itself, that plugins can expose node labels. So we can assume that, on a given node, accelerators foo and bar that you have programmed onto an FPGA would have so much memory, right? We can use node labels to do some sort of cheap scheduling for now. Okay.
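A minimal sketch of that cheap-scheduling idea; the label key, the resource name, and the image are hypothetical:

```yaml
# Hypothetical: the plugin labels each node with the accelerator it has
# programmed, and pods steer themselves to a matching node.
apiVersion: v1
kind: Pod
metadata:
  name: uses-accelerator-foo
spec:
  nodeSelector:
    example.com/fpga-accelerator: foo   # label exposed by the plugin
  containers:
  - name: app
    image: example.com/app
    resources:
      limits:
        example.com/fpga: 1
```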
B
Overall, your previous question was: what information do we need in order to consider adding more features? Personally, I would prefer having real customer feedback, because we are really creative and can think of awesome use cases, but at the end of the day the customer might not be that sophisticated. So I would like to see overall end-to-end workflows. Network acceleration was one use case; another use case the text in your doc mentions is predictions, like ML predictions.
H
It's a bit of a catch-22, because with the current API we are basically very restricted to the container programming model. So what you would not allow customers to do with the current API is to go and really work on containers that could program FPGAs, which is a use case that is very important as well.
H
I agree with that. I definitely agree that we want to restrict ourselves, but the thing is, by not acknowledging that users need to understand how much memory they can request for a specific IP, or what security context their container is going to need for programming the FPGA, we kind of remove one very important use case and put it outside of the users' reach.
B
Yeah, I just threw out the thought: what if these nodes are dedicated, in the sense that you're not sharing them with other workloads? At that point you can safely hand over maintenance to the end user, who is then free to reprogram the machine and use it however they want. And this is actually a very common pattern: we have specialized hardware that's used by a small subset of users. Then you just...
B
In fact, I'm going one step further and saying, for the initial version, maybe for the next six months: what if you just have users completely own nodes that have FPGAs, and you give them a solution, some tooling or whatever, that simplifies burning FPGAs for them, plus the equivalent simplified APIs? But then those nodes are not being shared, at which point your security restrictions are reduced and you can use them that way.
B
You can't modify the pod spec once it's admitted by the API server, except for the object meta, and the assumption is that all the security policies that an administrator has put in place are being evaluated as part of pod admission. So it's really not the Kubernetes model to have some extension, running later in the lifecycle of a pod, go and elevate privileges or change the pod spec. That's not what the extensions were actually designed for.
F
Just to go back to one of the previous points: if you look at today's APIs, the actual code, the scheduler extender takes a pod spec pointer, so if it were to modify it, that would actually get reflected back in the main scheduler. So without any changes to the scheduler extender APIs, it looks like we can actually modify the pod spec. I understand that's not what it's meant for, but without any further change it seems to work, right? So...
B
So what we're suggesting is, as Derek was saying, maybe a focused model where you have an extender, but the extender is also acting as a webhook. What it does is: whenever it sees a pod that requests a resource that your extender is supporting, it injects the right capabilities, or any other security privileges that the pod needs. Then the pod goes through the regular admission process, and an operator can enforce whatever security policies they want to. And so there's like...
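A minimal sketch of what such an injection could leave behind in a pod spec; the resource name and the specific capability are assumptions about what FPGA programming might need, not anything agreed here:

```yaml
# Hypothetical result: a webhook sees the FPGA resource request and
# injects the privileges the programming step would need.
spec:
  containers:
  - name: app
    resources:
      limits:
        example.com/fpga: 1
    securityContext:        # injected at admission by the webhook
      capabilities:
        add: ["SYS_RAWIO"]
```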
B
The logic would be any logic that can be abstracted and centralized in the kubelet. I mean, our assumption is that anything you can do at deallocation you can sort of do during the allocation phase. If that is not true, and you have a really concrete use case for a deallocate call, then please raise it.
D
Currently, in the kubelet, we don't have a good way to know when the deallocation actually happens. We do it lazily, during the next allocation: we just look at the active pods and reclaim any resources that are not used by an active pod. So the caveat in the kubelet is that we don't really have a good way to know when the deallocation would happen.
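A minimal sketch of that lazy-reclaim scheme; all names here are illustrative, not actual kubelet code:

```python
# Hypothetical sketch: device allocations are only garbage-collected at
# the next allocation, by comparing recorded allocations against the
# set of currently active pods.

def reclaim_unused(allocations, active_pod_uids):
    """Drop allocations whose pod is no longer active; return freed device IDs."""
    freed = []
    for pod_uid in list(allocations):
        if pod_uid not in active_pod_uids:
            freed.extend(allocations.pop(pod_uid))
    return freed

def allocate(allocations, free_devices, active_pod_uids, pod_uid, count):
    """Allocate `count` devices to `pod_uid`, reclaiming stale entries first."""
    free_devices.extend(reclaim_unused(allocations, active_pod_uids))
    if len(free_devices) < count:
        raise RuntimeError("not enough devices")
    granted = [free_devices.pop() for _ in range(count)]
    allocations[pod_uid] = granted
    return granted
```

The point of the sketch is that a stale allocation is only discovered when the next allocate call runs, which is why there is no precise moment at which deallocation is known to happen.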
I
Great. And just on the deallocation part for GPUs: it's not that we didn't have a clear use case. We do want to wipe the memory; we do want to be able to do a few things on deallocation. We just moved that work to the allocation part because it was more reliable. It might also be interesting to shut down the GPUs, but that's not usually what our customers do.
I
So I do want to come back on two quick things that I heard but couldn't touch on. The first one is: I remember someone mentioning, I don't remember who, the scheduler extender. I've been through that part during the initial implementation of the device plugin for GPUs, and there are a lot of not really straightforward issues that you're going to encounter. So, for example, if you have to inject things into the pod spec in the scheduler extender, you end up having to delete the pod and recreate it, and the scheduler extender is not the best place; you probably want to do it at the admission control level. And the other one, I remember, is: I heard you mention that you wanted to inject the security context, and I didn't understand that. I think you were the one pushing on this.
B
It might logically make sense, but practically speaking, as a cluster administrator I don't want my pods to have elevated privileges. I would want to know that either it's a dedicated node, at which point I'm not really applying any sort of security policies and I don't really care what the security policies are on that node, or it should be a centralized policy. Policies are enforced currently at the cluster level, not at the node level, and so I would prefer not injecting additional security privileges.
I
Right, and device plugins are supposed to be, or at least we expect device plugins to be, deployed by an administrator, a cluster administrator. And I'm not exactly sure if a security context is needed, but if it is needed for you to program your device, the point, as you said earlier, is that it would make sense for the cluster admin to say: for example, this group of users wants to be able to program this FPGA, so I understand that they need more privileges for their containers. That would make sense, right?
B
To begin with, that might be, that could be, a working assumption. Or, like we said a few times, you can have that hook or initializer or whatever you want to use, to do this as part of admission rather than after admission. So you would do it before the pod spec is accepted by the API server, but...
B
I mean, pushing things into the extender, or doing things in the device plugin, is sort of pushing us towards an imperative design, which is exactly what Kubernetes is avoiding. So I would say: just start with the thing that's possible today, and then we can explore further, if necessary, in the future. But...
B
Let's start with something very, very simple, and then we can think of extending it. Adding anything that mutates the pod spec at any point after admission is the sort of thing that pushes Kubernetes towards an imperative model, and I think it will face a lot of resistance from people. So I just feel like it's not worth the conversation at this point, unless we have exhausted other options and we have a concrete proposal for why we have to change the model.
I
Can you all see the design document? Yes? Yes. So I hope by now everyone has taken a look at it. I wanted to come back on its grander goals. In my mind, I actually compiled the list of the different shortcomings and fixed bugs, and as I was completing this list, the main goal of this design document that I tried to address is the number of bugs and shortcomings that we've had: the code coverage, the people who tested it. I mean, the impact hasn't extended to performance, but coverage is pretty much limited.
I
That's a signal that says the current architecture that we have is just not good enough, and if we're going to continue adding features, then this architecture is just going to continue to surface more bugs. That's my feeling: looking at all the bugs that we've had because of the features, the number of faults we've had is pretty much astonishing, and so that's...
D
It's kind of like an integration problem. For example, with the device plugin and those additional resources that the kubelet manages, it's kind of like the integration with the resource name API. And, for example, the end-to-end test we know is flaky; I don't think it's too flaky right now, I think it was flaky mostly at the beginning, when it had some issues.
B
I think that's where there's some disagreement or not: it's beyond just the kubelet-side implementation. There's a real device plugin implementation that we are using for end-to-end tests, and then we are relying on GCE infrastructure, and we are relying on our driver installation mechanisms, and so on. So there are a lot more variables than just the device plugin architecture, and it might be a little bit premature to say that the device plugin architecture is the cause of all the bugs. I think that goes too far.
I
So this is also a second point: I was also thinking that if we're going to continue adding more and more features, or at least if we want to continue adding more features, then it would be good to have a reliable test infrastructure that is not flaky, that does not take 60 seconds, and with which we can actually pretty quickly say... I mean, I've looked at the tests and I've written a lot of the tests, and what I feel from the reviews I've seen and from the code reviews is that writing...
B
Real-world scenarios, and having really good unit test coverage, are very, very helpful; everyone is excited about that and, frankly, cares about it. But I think we should also think in terms of how we can best spend our energy: if we spent all our energy on just unit tests, would that be enough, or would we have to spend our energy at the cluster level?
I
But the point is mostly, I guess what I was trying to get at, the point was mostly that if we're able to at least get a few tests here, or at least get the integration tests in place, that eases the unit test effort. And if we're going to go down the road of saying that our main point for this milestone is stability, then integration tests and unit tests should be the objective. That's right.
D
So maybe we can move to the detailed architecture change you proposed. I think what we should ponder is whether improving unit test performance really requires the so-called big architecture change. But maybe we can look at your proposal. I see you made some good observations, and the current architecture has some limitations, so maybe we can simplify it by looking at how we might simplify the locking logic or the current code organization.

B
I just want to make sure that everyone else on the call, possibly including me, is following; I'm a little doubtful whether we're all aware of the nitty-gritty details of the internal architecture. So would it make sense for this discussion to happen between the folks who are very familiar with the architecture, rather than in this setting?
E
Just wanted to plug: Harry Zhang (resouer on GitHub), Balaji, and I are putting together a topic proposal for the KubeCon contributor summit to discuss intra-node topology enhancements, stuff like making the CPU manager and the device plugin manager coherent in terms of NUMA affinity for decisions. So just a heads up that we'll be submitting that as a topic; hopefully it gets accepted, and if so, I'm looking forward to discussing it with everybody.
B
My gut feeling is that we would not really have enough time to actually go deep there. I think the meeting that we had in May, where we actually had a couple of days to discuss this with the working group, is probably a better setting for it. But in any case, we can give it a shot and see if we can make some progress.
A
Well, I say that for two reasons. One, I won't be there, and so I don't want that to be the decision-making forum. And two, when the invites went out for this, it was intended to be more of a place for people to discuss things, not decide things. So as long as we understand that we're not coming out of these sessions with decisions, then people can discuss what they want to discuss, but I'd prefer that we have a more dedicated topic or discussion for it. Okay.