From YouTube: Summit 2022: Volume populator support
A: I'm pleased to introduce Michael. He's going to present "The Road to Volume Populators: Creating Persistent KubeVirt Disks, Past, Present, and Future." Thank you, Michael. It's all yours.
B: Thank you, Marcelo. Hey everyone, thanks for joining me. Those of you on US East, I hope this will be a good presentation to eat lunch to. Okay, so: "The Road to Volume Populators: Creating Persistent KubeVirt Disks, Past, Present, and Future."
B: This presentation is basically about the fact that KubeVirt CDI will be supporting volume populators for initializing your KubeVirt disks. How does this compare to the existing DataVolume API, and what's the future for both of these things, volume populators and data volumes? Hopefully, we'd also like to get some ideas from you in the community. So, let me go back a bit to the beginning of the KubeVirt project.
B: We're basically running virtual machines in Kubernetes, and virtual machines are typically stateful workloads. If you stop a VM and start it back up, you probably want the same data that was there when you stopped it. Stateful workloads in Kubernetes mean persistent volume claims, but persistent volume claims are initially empty, and hypervisors need something: they need disk image files, or they need devices that look like disks, that basically contain operating system files you can boot from.
B: One thing we considered was init containers. For those who don't know: in KubeVirt, VMs run in pods. The main container is basically running QEMU, with your virtual machine communicating with KVM and the host, and so on. Pods can have init containers, and init containers run before the main container.
B: Theoretically, we could do whatever we wanted to do in these init containers. We could populate the volumes at that point and have everything ready for QEMU. That was explored, but it wasn't a great match.
B: It's a little awkward: when you configure your virtual machine, we have a VirtualMachine CRD and a VirtualMachineInstance CRD, but init containers are things on pods. What if you have a bunch of images? It's a little awkward. The other thing that made them tough to deal with was the fact that we want this population phase, this PVC initialization, to happen exactly once: the first time the VM is started, or the first time the volume is used.
B: Enter data volumes. We needed something; again, this was a while ago, and we figured that creating our own custom resource would be the best way to go, at least at that point in time. So we created this DataVolume abstraction. It has essentially two jobs.
B: The first is to create the PVC, and the second is to populate that PVC from some source. But simultaneously, we always knew that this is a problem a lot of people are going to have in Kubernetes. So at that time, we went to the SIG Storage upstream Kubernetes community and began a discussion about a generic solution for this, and actually, someone from the KubeVirt CDI team was the first to propose a KEP for volume populators.
B: So data volumes were the thing we could do right away, yet we still wanted to participate in the community to have something upstream that fixed the more generic problem. Okay, so I mentioned the DataVolume has two jobs. The source section of the DataVolume spec specifies what we want on that PVC; in this case, we want to download a file that contains a CirrOS image.
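As a sketch, a DataVolume of that shape (an HTTP source pulling a CirrOS image) might look like the following; the name, URL, and size are illustrative assumptions, not values from the talk:

```yaml
# Hypothetical DataVolume: CDI downloads the image at spec.source.http.url
# into the PVC it creates on your behalf.
apiVersion: cdi.kubevirt.io/v1beta1
kind: DataVolume
metadata:
  name: cirros-dv
spec:
  source:
    http:
      url: "https://download.cirros-cloud.net/0.5.2/cirros-0.5.2-x86_64-disk.img"
  storage:
    resources:
      requests:
        storage: 1Gi
```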
B: The PVC section specifies what the PVC will look like; the DataVolume controller will create a PVC on your behalf. So the PVC section, spec.pvc, is the precise configuration. We also have spec.storage, which will do some defaulting for you via storage profiles. Storage profiles are a KubeVirt CDI thing: there is basically a storage profile for each storage class, and it will choose, say, the best access mode for you if you leave it out of your DataVolume spec. So there's some defaulting done there, and it's kind of nice; the fewer things you need to know, the better. All right, and then there's the DataVolume status.
B: I said the DataVolume has essentially two jobs, and the status is where you can track both of them. The phase is, for example, Succeeded when it's done; there are different phases based on the different source types. Progress: some source types support an incremental progress meter.
B: Okay, so we're going to go through what happens when you create a data volume and a virtual machine instance. You've posted these two manifests to a cluster: what is going on behind the scenes? What is the synchronization? At this point, you've just posted a DataVolume manifest and a VirtualMachineInstance manifest, and they're both pending. Next, the DataVolume controller sees this data volume, and it will create a persistent volume claim per whatever you specified in the data volume.
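For context, a VirtualMachineInstance that consumes such a data volume as its boot disk could be sketched roughly like this; the names are illustrative, and the referenced DataVolume `cirros-dv` is an assumption:

```yaml
# Hypothetical VirtualMachineInstance booting from a disk backed by a DataVolume.
apiVersion: kubevirt.io/v1
kind: VirtualMachineInstance
metadata:
  name: cirros-vmi
spec:
  domain:
    devices:
      disks:
        - name: rootdisk
          disk:
            bus: virtio
    resources:
      requests:
        memory: 128Mi
  volumes:
    - name: rootdisk
      dataVolume:
        name: cirros-dv   # the DataVolume created alongside this VMI
```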
B: So the DataVolume controller sees: okay, I've created this PVC, I've created this importer pod, and it's running, so my phase is ImportInProgress. To a user watching this, they can see: oh, the import is in progress; it's doing its work. The virt controller doesn't really care about ImportInProgress; the virtual machine instance is still pending. It's waiting for the data volume to be complete before it does anything, before it can start running this virtual machine instance.
B: Once the import succeeds, the virt controller creates the virt-launcher pod that runs QEMU. That pod will mount the PVC, which now has some freshly configured data, and the virtual machine instance is scheduling.
B: And finally, the virtual machine instance is running, the pod is running, and this is kind of the final state. One important thing to note is that the data volume doesn't have much of a purpose at this point; it's done its job. However, you can't delete it, because it owns the PVC. Maybe that's fine, maybe it's a bummer for some people, but that's how it is.
B: So that is kind of the flow of populating with data volumes; I think most people maybe know this. Okay, so enter volume populators. While we were happily using data volumes, the community was slowly making progress on volume populators, and we're finally there: volume populators will be beta in Kubernetes 1.24.
B: Previous versions can use the AnyVolumeDataSource feature gate. Populators will work best with CSI PVCs, because if you're using a CSI PVC, you're probably using the CSI sidecars provided by SIG Storage, and they know how to deal with this dataSourceRef, which is basically to do nothing and let the populator handle it.
B: Other provisioners can handle it as well, but you're more likely to have luck with CSI volumes. And there is a pretty cool library, lib-volume-populator, that makes it relatively easy to create controllers for your own populator, and I'll be showing a demo of a populator built with that library.
B: Okay, so what I'm going to show here is basically, in a populator world, what we did with data volumes before; this is the equivalent of the previous flow with the data volume. With the data volume, it was one-stop shopping: you had one resource that had your PVC definition and your source. With populators, it's a little different: we create a PVC directly that has a dataSourceRef pointing to a populator resource.
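Such a PVC might look like the following sketch; the populator apiGroup, kind, and names are taken loosely from the demo and should be treated as illustrative:

```yaml
# Hypothetical PVC whose contents will be provided by a volume populator
# via dataSourceRef (beta in Kubernetes 1.24 behind AnyVolumeDataSource).
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: pvc1
spec:
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 1Gi
  dataSourceRef:
    apiGroup: populator.cdi.kubevirt.io   # illustrative populator API group
    kind: CirrosImport                    # illustrative populator kind
    name: cirros-import
```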
B: You know, an apiGroup of populator.cdi.kubevirt.io, a kind of CirrosImport, a name of cirros-import; that's basically it. It may look a little weird to you: doesn't a PVC already have a dataSource field? And you are right. The dataSource field is currently used for creating a PVC from a volume snapshot or from another PVC, but there is some odd behavior that has been around for a while.
B: Essentially, the CSI provisioner would ignore a dataSource that was not a VolumeSnapshot or a PersistentVolumeClaim and would just provision an empty volume. Apparently, some people in the world were taking advantage of that, and rather than make those people very unhappy, the decision was made to add this dataSourceRef field instead and deprecate dataSource. So eventually, everything will go through dataSourceRef.
B: Okay, and this is the volume populator custom resource, which you could think of as the source section of the data volume you saw earlier: we're going to grab a file from a URL. There will be different populator custom resources for import, clone, and upload.
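An import-style populator custom resource of that shape might be sketched like this; the group, version, kind, and URL are assumptions based on the demo, not a published API:

```yaml
# Hypothetical import populator CR: it plays the role of the DataVolume's
# source section and is referenced from a PVC's dataSourceRef.
apiVersion: populator.cdi.kubevirt.io/v1alpha1
kind: CirrosImport
metadata:
  name: cirros-import
spec:
  url: "https://download.cirros-cloud.net/0.5.2/cirros-0.5.2-x86_64-disk.img"
```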
B: So this is the difference in the flow. In this case, we're creating a PVC and a virtual machine instance that references that PVC; initially, everything is pending. Okay, so we've got a couple of concurrent things going on here.
B: In namespace ns1, the virt controller is going to start the virt-launcher pod right away; as far as it's concerned, there is no data volume it needs to wait for.
B: Meanwhile, there is a populator controller running in namespace ns2 that is watching for PVCs being created, and it sees this one. It will create a PVC called pvc1-prime, which looks just like pvc1 except that it does not have the dataSourceRef, so it should bind right away, and we see that there is a PV created up here.
B: Okay, so, right. We're going to apply this manifest here. In here, we've got a couple of things. First is the populator custom resource that we saw, referencing a URL. Here is the PVC that is referencing that populator, and down here is a virtual machine instance that is referencing that PVC.
B: This populator was created with that library; let's see if it works. These other consoles over here will watch the PVCs.
B: Okay, we see the PVC is pending, and we see the VM is scheduling. What we'd like to see next is the PVC getting bound. Okay, now we see the PVC is bound and the VM is scheduling.
B: The populator was quite easy to put together. All right, so now you may be thinking: wow, okay, we've got data volumes and we've got populators; that's two ways to do the same thing, which is typically not a great place to be in software engineering. So this is where I want the community to get together and maybe help. Oh, let me stop sharing and go back to my slides.
B: Comparing the two: the PVC state is basically a boolean, either it's bound or it's not. There are no conditions; there's not much there. So with populators, supporting something like a percentage progress is something we've got to think about. Similarly, data volumes have these conditions, which are nice. But there are some cons with data volumes: that whole synchronization step is a bit of a bummer, and I didn't even get into the handling of WaitForFirstConsumer binding, which is pretty much a presentation on its own; it's very complicated.
B: And data volumes are tricky to back up and restore. Right now, we're talking to backup integrators, and a lot of the work is around handling data volumes the right way so they restore correctly.
B: We wrote a custom Velero plug-in for this, for example, and it's basically about managing data volumes. So, volume populator pros: they're a community standard. We finally got here; it's been a long road, but it's nice that we're here. You can create your own populator using that library pretty easily; let's say, for example, you have disk images on BitTorrent that you want to download.
B: Or you can use it right away, like I just did. And I think the shared configuration is kind of a nice thing: rather than having a URL in 100 different resources, you have a URL in one place and reference the resource that has the URL. The cons: minimal status, and, as I said earlier, not all provisioners will work with populators just yet. I think that will change, but, for example, I don't think the local storage provisioner works right now.
B: So, what's coming up: the KubeVirt CDI team plans to release populators for each of the existing data volume sources.
B: We hope you build your own populators; we'd love to hear about KubeVirt populators being distributed in the wild. That would be great. And the DataVolume controller will be updated to use populators internally.
B: The timing on that may be a little weird, because not all of our supported provisioners will necessarily work with populators just yet, but I'm sure that will change eventually. So this is where we kind of want to discuss with the community: we've got these two, populators and data volumes. Where do we want to go?
B: Is there still value in data volumes? Should we tweak them a little bit?
B: Maybe that owner reference thing: maybe you should be able to delete a data volume once it's been populated, stuff like that. Maybe we should have some new resource as a data volume replacement, or maybe we'll just get rid of them. But that may be difficult, because, for one reason, data volume specs are part of VM specs in the dataVolumeTemplates section, so it would be hard to get rid of them. But this is what we want to talk about.
B: Where do we want to go as a community with data volumes and populators? I think populators solve a lot of the technical challenges, like WaitForFirstConsumer, and I think they're easier to back up.
A: Thank you, Michael, for the very interesting talk. I was very surprised by that; it was very interesting, especially from a performance standpoint, that you create the virt-launcher pod up front and let all the containers be initialized. I showed in my previous presentation that that can take a lot of time, and if you can do it in parallel, it's awesome. So, very interesting. Let's see, we have some questions here.
A: Daniel asked: what other protocols are supported for volume populators besides HTTP?
B: For official KubeVirt, data volumes support HTTP as well as registry imports, so you can import a disk image that is in a container registry. But there's really no limit: you can do whatever you can do in a pod and write to a PVC. I think we're going to support everything we do now, which is basically importing from a registry, uploading from your desktop, and copying other PVCs.
A: Okay, so Jennifer also asked a question: if data volumes went away, what would a clone workflow look like? Create a PVC with a volume populator source, which creates a volume snapshot, and then a new PVC references that snapshot?
B: Yeah, so this gets back to an older discussion. At the same time that we started talking about volume populators with the community, we started talking about namespace transfer. Volume populators progressed; it took three years, but we got here. Unfortunately, the whole namespace transfer thing is still under discussion, and there's no consensus yet. So there will be a clone populator, and I think it will work pretty much exactly like the data volume clone.
B: Like the existing data volume clone, it will do all the snapshot stuff under the covers, so that will be there. Some of the authentication stuff will change a little bit: right now, if you're cloning between namespaces, we do weird auth checks, but we're going to have an alternative way of doing that.
B: There will be a clone populator, but if you want to use the raw Kubernetes primitives, that's fine too.
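The Kubernetes-primitives route mentioned here could be sketched as a new PVC restored from a VolumeSnapshot of the source PVC; the names are illustrative:

```yaml
# Hypothetical clone via standard primitives: a new PVC whose dataSourceRef
# points at an existing VolumeSnapshot taken from the source PVC.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: cloned-pvc
spec:
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 1Gi
  dataSourceRef:
    apiGroup: snapshot.storage.k8s.io
    kind: VolumeSnapshot
    name: source-pvc-snap   # illustrative snapshot name
```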