From YouTube: Kubernetes WG IoT Edge 20220615
Description
June 15 meeting of the Kubernetes IoT Edge Working Group. Container Orchestrated Device Working Group on the Container Device Interface and dynamic resource allocation KEP - Alexander Kanevskiy (Intel), Patrick Ohly (Intel). K0s approach to air-gapped edge environments - Jussi Nummelin (Mirantis)
A
Hi, welcome to the June 15th meeting of the Kubernetes IoT Edge Working Group. On today's agenda we've got two presentations. The first one is on the Container Device Interface, in conjunction with the Container Orchestrated Device Working Group. The second presentation is going to be on k0s and its approach to air-gapped edge environments. So with that said, let's move right on. The first talk will be given by Patrick Ohly from Intel; go ahead, I think you've got co-host permission, yeah.
B
Thanks. Perhaps as a bit more background on why we are here: Sasha and Kate and I met at the recent KubeCon, and we figured out that something we've been working on with the Container Device Interface and dynamic resource allocation may also be relevant for the Akri project, so we've been asked to present here and share what we've been doing with everyone who is interested in working on Akri.
B
The work split is basically that Sasha has been driving the underlying Container Device Interface work, which has been progressing; several container runtimes now support it. What it does is essentially give a container orchestrator like Kubernetes the ability to inject additional devices and environment variables into a container in a fairly generic format. I don't have too many details about that because it's been moving along fine. The contentious part of the work right now, the one that really needs attention, is the dynamic resource allocation and scheduling part that I've been working on.
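For reference, a minimal sketch of the kind of CDI spec file being described: the generic format a container runtime consumes to inject a device node and environment variables into a container. The vendor, device name, and paths are hypothetical; check the CDI specification for the exact schema of your CDI version:

cdiVersion: "0.4.0"
kind: "vendor.example/camera"
devices:
  - name: cam0
    containerEdits:
      deviceNodes:
        - path: /dev/video0        # device node injected into the container
      env:
        - CAMERA_PROFILE=default   # extra environment variable injected alongside it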
B
That's where we take CDI at the lower level and try to think about what a Kubernetes API might look like that actually takes advantage of these new capabilities in the runtimes. It's clearly related to the current device plugin API and its limitations, because, as you know from Akri, it does have its limitations: it's supposed to be used for node-local resources, and you have workarounds for that in Akri.
B
We have a resource driver that handles all of the resource-specific details, so Kubernetes doesn't need to know about things like whether a resource is available on multiple nodes or just a single one. That's all handled by the resource driver.
B
So our approach, from a design perspective, is based on communication through the API server by updating the resource claim status. That's the shared scratch space, basically, where the different components store their state and communicate what the current state of a resource claim is: whether it's allocated, whether it needs to be allocated for a certain pod or for a certain node. All of that is handled through the resource claim status. And the resource claim is also a separate object.
B
It can be referenced by multiple pods, either at the same time or one after the other. So if you have a complex device that's hard to set up, it might be beneficial to create a resource claim once for that device and then amortize the cost of setting it up by running multiple pods against it, as needed, later on.
B
You have a provisioner, which is a resource driver that handles this resource class, and then the resource claim has the parameters that the user is allowed to choose, which might be the amount of RAM it needs from a GPU, or, for an FPGA, which kind of program is supposed to be pre-programmed into it. And then the pod spec needs to be extended, because we really can't use the existing resource fields to handle resource claims; there's nothing there we could reference. We could use an annotation, but that's not a proper Kubernetes API.
B
So we are proposing to have something new in the pod spec and have this property supported by core Kubernetes. That's the key difference compared to something done out of tree with annotations: we are really aiming for something that will be supported by your future Kubernetes out of the box, and ideally will also become a requirement for a conformant cluster.
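To make the relationships between these objects concrete, here is a rough sketch of a resource class owned by a driver, a claim with user-chosen parameters, and a pod referencing the claim. The API group, field names, and driver name are illustrative of the design under review in the KEP, not a final API:

apiVersion: resource.k8s.io/v1alpha1   # illustrative group/version
kind: ResourceClass
metadata:
  name: example-fpga
driverName: fpga.example.com           # the resource driver that handles this class
---
apiVersion: resource.k8s.io/v1alpha1
kind: ResourceClaim
metadata:
  name: my-fpga
spec:
  resourceClassName: example-fpga
  parametersRef:                       # user-chosen parameters live in a separate object
    apiGroup: fpga.example.com
    kind: FpgaClaimParameters
    name: my-fpga-params
---
apiVersion: v1
kind: Pod
metadata:
  name: fpga-user
spec:
  resourceClaims:                      # the new pod spec field being proposed
    - name: fpga
      source:
        resourceClaimName: my-fpga
  containers:
    - name: app
      image: registry.example/app
      resources:
        claims:
          - name: fpga                 # container opts in to the claim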
A
Okay, I'm just kind of curious. I like the model of using what's gone before on the persistent volume claims with the CSI storage interface; I actually worked on that spec years ago, holding the formational meeting, so I'm quite familiar with it. But I'm wondering if you've gone so far as to pick up the other things related to it, like namespaces, so that these things can have some semblance of governance, and maybe even that there are techniques in there that align certain CSI drivers with the ability to embed secrets, so that if these devices you're talking to need authentication, you might be able to handle it in that process too.
B
Secrets are not part of the official API, but they don't need to be. In my opinion, these can be additional parameters provided as part of the driver installation, in some custom parameters, some custom CRD, or just really the parameters that the deployment of the resource driver itself takes, as opposed to per-pod or per-claim parameters.
B
Well, yeah, I can see some cases where these parameters might be something that the user needs to specify, because it's their credentials; that can be handled by the custom parameters that the resource driver supports. We fully support custom parameters with CRDs. That's the latest iteration in the KEP, where the resource driver creates a CRD that describes which parameters it accepts. That needs to be in a separate object, because that's how CRDs work, and then the resource claim doesn't actually embed the parameters.
B
It references this external object, and that object can then carry user-supplied credentials or secrets to actually get access to a resource. That's outside of the KEP, because we don't make any assumptions about what the parameters mean; they just get published and referenced in Kubernetes. We expect resource drivers to be Kubernetes-specific, so it's not as generic as CSI is. We don't have plans to support other container orchestrators, simply to keep it simple; this is Kubernetes-specific. So what does a resource driver do?
B
It basically watches for these parameter objects and can take whatever it needs to know from there. We don't need to pass it through some generic component like we do in CSI with the sidecars, so it is a bit simpler in that regard. As you mentioned, you worked on that spec; I also decided to merge the equivalents of persistent volume claim and persistent volume into one object, the resource claim. That solves quite a few race conditions that still exist nowadays for volumes, where a volume might leak.
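A sketch of the CRD-based parameter object described above: a vendor-defined custom resource that a claim's parametersRef points at, which only the resource driver interprets and which can carry driver-specific settings such as credentials. The group, kind, and fields here are hypothetical:

apiVersion: fpga.example.com/v1alpha1   # vendor-defined CRD group, hypothetical
kind: FpgaClaimParameters
metadata:
  name: my-fpga-params
  namespace: default
spec:
  bitstream: video-decode-v2            # which program to pre-program into the FPGA
  memoryMiB: 2048
  credentialsSecretRef:                 # user-supplied credentials, meaningful only to the driver
    name: fpga-access-token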
A
One of the things that has historically been ugly with those volumes is even something like a Kubernetes cluster node crashing while it owns some resource, and getting that cleaned up; if we can maybe learn from history and try to...
B
It implies that there will be more API calls to update that state, which is kind of one of the objections that I'm getting from the reviewers, that this puts a higher load on the API server. But at the same time it is also more secure, because we can rely on etcd storing an atomic update and not losing that state, and it also protects against concurrent modifications of the same resource claim.
B
So yeah, that's what we are currently discussing in the dynamic resource allocation KEP; it's under review, and I think that's on my status slide. Let me perhaps show how we envision this whole thing being implemented. We have four Kubernetes changes that we need to make as part of the KEP implementation. There will be a new controller in the kube-controller-manager that handles embedded resource claims. That's modeled, again,
B
after what I did with generic ephemeral inline volumes. It's the exact same concept: the pod spec describes everything that it needs from a resource, but then the controller creates the actual resource claim object, and the rest of the components almost never see or need to do anything with the embedded information. They still work on the normal, separate resource claim, and that's also where we store the updates and the state.
B
Then the scheduler needs a new plugin that knows that a pod which references a resource claim can't be scheduled unless the resource has been allocated and reserved for the pod. These are two separate things: you can have resource claims that are persistent, so they are allocated, but they are not yet assigned to the pod that currently wants access to them, and that pod might not be able to get exclusive access because some other pod is also trying to get it. So the scheduler needs to know about all of that.
B
We do support delayed allocation, which is the same as "wait for first consumer" for volumes, and in that case kube-scheduler basically takes on the coordination between the different resources that might be needed for a pod, different volumes, and so on. It basically goes through the usual
B
"where can I run the pod" checks, and at the end it decides that there might be some suitable nodes, and it will try to get the resource claim allocated for those nodes. But this might not go well, because it had outdated information, so there is a feedback loop, described in more detail in the KEP, where it tries again elsewhere, until eventually everything is ready for the pod, and only then does the pod get scheduled.
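The delayed allocation mode mentioned here is the claim-level analogue of a StorageClass's WaitForFirstConsumer binding mode; a sketch with illustrative field names from the draft design:

apiVersion: resource.k8s.io/v1alpha1
kind: ResourceClaim
metadata:
  name: deferred-fpga
spec:
  resourceClassName: example-fpga
  allocationMode: WaitForFirstConsumer   # allocate only once a pod using the claim is being scheduled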
B
I think in the KEP I was discussing with Kate how Akri handles that scenario, and we may come back to that later, because I think there are some challenges with doing the same thing through the device plugin interface: kube-scheduler has an incomplete view of what the extended resources really mean if they are shared between nodes. But let's not discuss that now; let me just finish here and describe the resource driver, the out-of-tree component. So the kubelet, when it sees a pod,
B
needs to know what to inject and where to get the CDI description. CDI is based on JSON files that describe the operations the container runtime needs to do. Those files get created by the resource driver on the node when a pod is scheduled onto that node, and the kubelet needs to make a prepare call once on the node before it can actually start the containers. That's what happens locally on the node, between the kubelet and the resource driver's node component.
B
The resource driver controller runs as part of the control plane, either inside the Kubernetes cluster or potentially outside of it as a standalone process. That's the part that talks with the scheduler through the API server to mark a resource claim as allocated. Internally, the resource driver controller and the resource driver's kubelet plugin still need to communicate with each other somehow, or track which resources are available where, but that's an implementation detail of the resource driver, so the KEP doesn't prescribe a certain way of doing it.
B
There is one section where I outline my ideas for how it could be done with CRDs, and it sounded like Akri is already doing something like that. So your path towards using dynamic resource allocation might actually be fairly easy, but of course that would need to be validated, to see whether it makes sense, and this whole KEP needs to be accepted first, and so on. At least it seems conceptually similar. That leads me to the current status.
B
We had tried to get this KEP accepted for 1.24, but there were too many open questions, and, as always, those questions only came up when people actually started reviewing KEPs shortly before the deadline, so we never fully clarified all of them. Right now Tim Hockin is our main API reviewer; he had some questions about the API design, so we need to finalize that. Scheduler performance is a major concern, because obviously this whole KEP is more flexible.
B
It's intended to be more flexible, but with flexibility come additional costs, like calling out to the resource drivers one way or another to check where resources are available. That's harder to do and slower; it's expected to be slower than the current extended resource model for simple resources. But that's simply the price you have to pay for this extra flexibility, and I just don't see a way to make the current model both flexible and fast. So something has to give here.
B
The question keeps coming up whether that is the right compromise for our users, and we just don't know yet; that's part of the challenge that we have. We are just designing this. We have a prototype, but we don't have users and we don't have real resource drivers, so any question about whether it is good enough basically has to wait until we have this as alpha, and potentially multiple resource drivers and real workloads testing this implementation, before we can decide. So I'm kind of trying to deflect these questions.
B
But it's one of the things that comes up. And the last point is the autoscaler integration: SIG Scheduling asks for that, and SIG Autoscaling people are also looking at the KEP. The underlying problem is that we are proposing something that will work with the in-tree kube-scheduler, but will not automatically work for the cluster autoscaler, which is an out-of-tree project built on the same source as the scheduler.
B
But it then does a few things differently, and SIG Scheduling is wary of adding and allowing features in core Kubernetes that then can't be supported by the autoscaler. I'm questioning that attitude a little bit, but it's hard to argue with your reviewers, and perhaps I really need to find a compromise here.
B
The problem is that an out-of-tree component is basically now blocking in-tree enhancements, whether or not those additional out-of-tree features are relevant for users. And talking with Kate, I heard that Akri users, for example, are not using autoscaling. So an Akri based on dynamic resource allocation would work just fine without cluster autoscaler support, but we still need to come up with some solution. So the current plan for that is alpha in 1.25.
B
The prototype doesn't have the latest API changes from the KEP revisions that we are currently doing for 1.25, but it's good enough, and it has instructions for building from that branch, bringing up a local cluster, and having this whole thing working. There's also an example driver in that prototype. And with that, I think I almost made it, with enough time left for five minutes of questions.
C
I just had a quick question, kind of going back to the pod spec side of things: you mentioned how maybe you'll extend things by having more parameterization via CRDs.
C
So a user needs to first look at a resource claim to see, hey, this device is that camera with this frame rate, and then goes and creates their pod spec requesting that specific resource claim, because it knows it wants that device for those properties, yeah?
B
Yeah. We have a slightly simpler case with ephemeral resources, the ones that are meant to be used only by the pod: then the pod spec's resource claims list contains a resource claim template that basically embeds almost the same information as a separate resource claim object, except that it doesn't get created by the user himself.
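A rough sketch of that ephemeral case, modeled on generic ephemeral inline volumes: the pod spec carries a claim template and Kubernetes creates and deletes the actual resource claim together with the pod. Field names are illustrative of the draft design, not a final API:

apiVersion: v1
kind: Pod
metadata:
  name: camera-reader
spec:
  resourceClaims:
    - name: camera
      template:                        # inline template; the controller creates the real claim
        spec:
          resourceClassName: example-camera
          parametersRef:
            apiGroup: camera.example.com
            kind: CameraClaimParameters
            name: high-framerate
  containers:
    - name: reader
      image: registry.example/reader
      resources:
        claims:
          - name: camera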
C
And so every single device will have its own resource claim? Or can you have a generic resource claim for a set of devices?
C
Okay, that makes sense. But do you have to have a one-to-one relationship between resource claims and resource drivers? I guess I'm asking these questions from the perspective of Akri: it would make sense to run Akri as one driver and be able to create all these different types of resource claims for different types of devices, or not create the resource claims itself, but give users the potential to create resource claims for various different types of devices, rather than having multiple drivers.
C
With the device plugin interface you make separate connections with the kubelet; you have to create that connection for each individual device, which is fine, but one limitation that could have been improved is making it so that you could have one driver make a connection for multiple devices.
A
So you mentioned this term "ephemeral devices". Could you expand on what an ephemeral device would be versus one that isn't?
B
It's not the device itself; the device of course exists permanently. It's the allocation that is tied to the life cycle of a pod. That's what I call ephemeral. It basically means that a pod starts, a resource gets allocated for the pod, and when the pod gets deleted, the resource claim also gets deleted and deallocated. That creation and deletion is handled by Kubernetes, compared to the other case where the life cycle of a resource claim is managed by the user. That's the key difference.
D
So, to compare and draw an analogy: right now the device plugin interface is always ephemeral devices. As soon as the device is requested, the allocate call comes to the device plugin, and when the pod is done, when it's decided that this device is not needed any longer, you would expect a deallocate call to the device plugin API; well, there really is only the allocate call. So the inline claim template in the pod spec will get a similar life cycle.
D
That works when you have simple devices, but if you have more complex devices, especially long-lived ones, even a pipeline of devices, like in our example: we have complex FPGA cards which have multiple components inside, like a radio encoder/decoder on the FPGA, then a special crypto block and a network card. All of this pipeline is something you would acquire at once and then use for a long period of time, so for that it's better to have different life cycles: you first do the allocation, make sure it's done, and only afterwards you start the pods which will be consuming this pipeline's capabilities.
E
All right. So, as you'll see, I'm from Mirantis, and I guess this whole topic started because I think you, Kate, met Edward from our side at KubeCon, and then the topic of k0s popped up, and, well, here we are.
E
I'll give a quick intro to k0s. Of course, I believe we're all quite versed in the basics of what Kubernetes is, so I'm definitely going to try to focus on what makes k0s unique, the differentiating features, and how they fit into this edge and IoT space in general, at least how we see it.
E
One of the things that is kind of unique in k0s, well, I don't know about unique, but not so obvious at least, is the fact that it's actually packaged as a single binary. We do it quite differently from how k3s does it, for example. k3s actually compiles everything in, so you have a single binary that runs the k3s process: the kube-apiserver, containerd, the kubelet and everything.
E
We actually package the k0s binary more like a self-extracting binary, so we embed the other binaries into the single Golang binary. It was actually as easy as just concatenating the files onto the end of the file, and then of course we have to calculate the offsets and everything, but essentially it's like a self-extracting binary.
E
What that actually means is that we can move to new upstream components super fast. Say there's a containerd CVE published: we basically just bump up the version number in our build pipeline, and that's about it.
E
We don't have to worry about the typical Golang module dependency hell, excuse my French, but I think we've all seen and felt that; small version bumps in Golang modules are not as easy as they should be. So doing this kind of real embedding of different binaries gets us through the different component updates a lot quicker. And what it also means is that we're not depending on operating system packages at all.
E
So if you want to run Kubernetes with k0s, you just install the k0s binary, and that contains everything you need. You don't have to worry about figuring out, okay, from which repo will I pull my containerd packages, and from which repo will I pull my Kubernetes packages, and so on.
E
Yeah, so we don't really have to worry about the operating system packages, and therefore we can decouple the operating system and the Kubernetes component life cycles from each other. You can freely upgrade your OS and everything still works, and you don't have to worry that if you update your OS packages, containerd will magically upgrade, or some other component magically upgrades and everything breaks down, and whatnot.
E
We of course include all the needed bits and pieces to run a kind of full-blown cluster, but you as a user are free to swap out some components if you want. So we bundle containerd as the runtime; there are options to run kube-router or Calico as the networking; and of course we bundle etcd as the main storage for the API server, but we also bundle kine and SQLite as a data store for the control plane.
E
If you run, say, single-node masters or controllers, you might want to save some resources and run with kine and SQLite only; we all know that etcd is almost as hungry for resources as the API server itself. And as I said, you can run with basically any CRI runtime: if you want to use Docker, just install Docker and run with it. We just don't embed all of them; of course, that would be a humongous binary.
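As an illustration of the swappable pieces mentioned here, a k0s cluster configuration along these lines selects the bundled CNI and switches the control-plane datastore from etcd to kine with SQLite. The exact keys and the SQLite data-source string should be checked against the k0s docs for your version:

apiVersion: k0s.k0sproject.io/v1beta1
kind: ClusterConfig
metadata:
  name: k0s
spec:
  network:
    provider: kuberouter              # or calico
  storage:
    type: kine                        # kine + SQLite instead of etcd on a small controller
    kine:
      dataSource: sqlite:///var/lib/k0s/db/state.db   # example path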
E
One of the kind of unique things in k0s is a feature that we call control plane isolation, and it's actually quite a bit more than the usual isolation using taints and tolerations. You can actually place your controller and worker nodes really separately from the network point of view.
E
Let's see, if I have some time I might be able to demo this quickly, but basically you can have different network segmentation between the controller nodes and worker nodes, or a different set of firewalls and NATs and whatnot in between.
E
What it actually means is that, by default in k0s, the controller nodes don't run any workloads. When you spin up a k0s controller process, it'll create the API server, etcd and all the typical control plane components, but it actually runs those as what I usually call naked processes. There's no containerd running, there's no kubelet running, not even static manifests or anything on the controller nodes; they are just plain Linux processes.
E
What it also means is that there's no CNI stretching between the control plane and the worker plane, because we're not running the kubelet or anything on the controllers, so we don't have to stretch the CNI network.
E
Why not just use the usual labels and whatnot to mark the control plane nodes? What we've faced in the past, with different other distros and solutions that we've been building, is that RBAC doesn't actually cover the question of whether a user is actually allowed to deploy stuff on controller nodes.
E
I mean, there's no RBAC to filter on things like node selectors or tolerations or whatnot, and that's actually a pretty big hole, in my opinion, in the RBAC system. And there's also the fact that the CNI needs to work from node to node, etc., so it kind of assumes that the network is flat-ish, at least. Of course you can do a lot of fancy things with BGP and whatnot, but then again you really have to do that work yourself in the network configuration.
E
And the API-server-to-kubelet connection is something that has always kind of bugged me a bit: the API server always requires a direct connection to the kubelets. So when you do the usual kubectl logs and whatnot, the API server is usually calling the kubelet directly, and all of this basically boils down to quite limited deployment architectures.
E
So how do we do this in k0s? Of course we're kind of building on top of giants and big shoulders, so we're utilizing a lot of upstream stuff. On the controller node there's a component called konnectivity-server, and konnectivity is actually
E
the API server network proxy, the SIG subproject, the assets that they are building. So basically the server opens a socket, and then on the worker node we have an agent that connects to that socket.
E
So now, when my kube-apiserver wants to call the kubelet, say on node A, it'll actually connect to the konnectivity server via a local Unix socket, and then the konnectivity server pipes the connection over this reverse tunnel to the agent, and the agent forwards the connection to the kubelet, to get the logs, or to exec yourself into a pod, or whatever you want to do. So this really allows, not a disconnection, but kind of a network-level separation.
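For reference, the local-socket hookup described here is configured on the API server through an egress selector; a sketch of that configuration (the socket path is an example, check the k0s defaults) looks roughly like this:

apiVersion: apiserver.k8s.io/v1beta1
kind: EgressSelectorConfiguration
egressSelections:
  - name: cluster                     # node/kubelet-bound traffic goes through the proxy
    connection:
      proxyProtocol: GRPC
      transport:
        uds:
          udsName: /run/k0s/konnectivity-server/konnectivity-server.sock   # example path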
C
Just a quick question here: in this setup, say you do lose the connection between the controller and the worker node. Does the connectivity server have, like, cached information there that can be immediately retrieved, or what's the workflow if it can't get to the agent?
E
No, it doesn't cache anything. If the tunnel is broken, then it's broken; it doesn't cache anything or do anything fancy in that regard.
E
Basically, how it would manifest itself is: if you do, like, kubectl logs or something, you'll just see a connection error at the kubectl level.
E
One of the examples that we've seen in our user base is a case where the control plane is kind of centralized to a regional site or an on-premise cloud, whether that's on Amazon Outposts or OpenStack or whatever; it doesn't really matter.
E
I think in this specific use case it was actually Outposts, if I remember correctly. But anyway, the worker plane is then split across the different sites, and a site in this case is actually either a factory or a part of a factory.
E
It could be the paint-job part of the factory, the assembly part of the factory, or the testing part of the factory, and the workers were actually super small industrial logic controllers running one tiny ARMv7 CPU with, like, half a gig of RAM available for the whole set of OS and Kubernetes components and everything.
E
The use case is basically that they wanted super fine-grained control over how they deploy applications, and the workloads are actually collecting data from different kinds of robotic systems and the machinery that's actually doing the assembly jobs and whatnot.
E
This has been an interesting, let's say "interesting" in big quotation marks, exercise for us, and it has actually been a use case that has driven some of the choices we've made.
E
For example, at one point we actually changed the default CNI to kube-router instead of Calico, based on this specific use case, because kube-router is much more lightweight than Calico, and back then, that was like a year and a half ago, Calico didn't even support 32-bit ARM, so we didn't really have a choice.
E
Let me see if I can quickly pull out one of these; I have some of these boards. They're about Raspberry Pi size, a bit bigger than a Raspberry Pi, but they are industrial-grade controllers that are actually running Kubernetes. I don't have them plugged in right now, but yeah, that's been pretty cool.
F
So, I'll be honest, I am part of a company that actually makes its own hardware and industrial-grade controllers and everything. One question: the k0s workers over here that you show in the figure, are they actually scheduled from the controllers themselves, or are they, let's say, hard-coded onto each controller per se, the way you mentioned it over here?
F
If you know what I mean, right: for example, the assembly site has a CAN bus controller, and the CAN bus is then available from your operating system on, let's say, /dev/can0 or whatever. So how does the controller itself figure out whether the assembly-site worker node actually has that file descriptor, some form of udev discovery protocol, or something like that?
E
In this case, at least as far as I've been following, and of course I'm not an expert on these industrial devices and I/O buses or anything, as far as I know the scheduling logic regarding the devices and their capabilities is kind of a human action. They know that, okay, this node is actually connected to that robotic system, so we can deploy this sort of data-collection application on that node.
F
Yeah, I think that pretty much answers my question. Let me be honest: the industrial edge side is quite conservative when it comes to understanding what actually happens on the floor itself. For example, chances are you buy a controller, you first have to sit down and program it according to your needs, and then you figure out a way to extract that data and probably send it somewhere.
C
Yeah, I will say you might be able to enable this right now with Akri, and then you can look and see whether the dynamic resource allocation KEP takes that even farther and gets more specific to your case, but I'm happy to talk about that offline afterwards.
E
I mean, the minimum system requirements are basically quite modest, in the sense that we don't embed a lot of stuff on top of plain upstream Kubernetes.
E
When you set up a k0s cluster, you don't get many bells and whistles: you get upstream, conformant Kubernetes, and that's about it. We don't bundle and force people to use extra add-ons and other large bits and pieces, and as I said, that's especially true for the worker nodes.
E
Air-gapped setups are supported out of the box, and I mean, we don't do any big magic for air-gapped setups, but I think the way we've built k0s and how we package it is actually quite simple. If you have the typical air-gapped case, where your cluster runs in a bunker, then you don't have to worry about installing different OS packages and whatnot.
E
What k0s does for the image tarball is that when it boots up, it finds the tarball in a specific location, a kind of predefined folder on disk, and then it automatically imports the images into containerd. And you can of course have multiple tarball files, for your own application images or whatever you need to run on the nodes.
E
What we have found out is that the kubelet image garbage collector can cause some gray hair, unfortunately, or if I had hair here, it would be gray already. The main problem is that there's no configuration to say: okay, whenever you run garbage collection on images, don't touch these ones, because these are system images, so don't go and remove them.
E
I was like, what, that's weird. And containerd actually even allows you to remove the pause image, even though you have, like, ten of those pause containers running. I was like, okay, now I've probably seen almost everything. But yeah, I'm planning to try to find some time to see if I can rewrite KEP 1717, which is about defining some sort of configuration for the garbage collection, okay, please don't touch these images, but unfortunately I haven't yet found the time. Hopefully in the future I'll find the time. That's about it.
E
So it's mostly done by the core handful of people here at Mirantis, and we have seen some external contributions, but of course, as with basically any project, we definitely want to see more contributions.
C
I know you had a demo that you wanted to share, and it seems like we might not have time for that. But if you do record one and want to share it out, we can still put it in the agenda notes for folks to look at later, or you could even drop it in the working group Slack.
C
Part of this is because some of the projects we talk about here aren't specific to Kubernetes, and also a lot of Kubernetes working groups are focused on expanding the Kubernetes API, like some of the work that the CDI and Container Orchestrated Devices working groups are doing, so we're expanding to a bigger scope. The update on that is that our transition to fall under the Runtime TAG in the CNCF has been merged.
C
So as of now we have all the same links, meeting times, Slack, etc., but we're going to start gradually transitioning that over to the CNCF: having a different Zoom link, having a different agenda hosted by the CNCF. I think we're going to move our Slack, but that one's a little hard, because I think we do have a good Slack history, so maybe there is something we can do around that, like staying in the Kubernetes Slack.
A
Okay, thanks Kate, and we're out of time. So thanks everybody for attending; we'll get the recording of this up soon. If the speakers can drop a link to your deck, that would be nice: either paste the link to the deck right in the agenda notes doc, or if you have trouble accessing that, drop it in the Slack channel and I'll update the agenda notes doc later. Thanks, bye everybody, and we'll see you at the next meeting.