Kubernetes Data Protection Working Group, 23 Mar 2022

From YouTube: Kubernetes Data Protection WG Bi-Weekly Meeting 2022-03-23

Description

Kubernetes Data Protection WG Bi-Weekly Meeting - 23 March 2022

Meeting Notes/Agenda: -

Find out more about the WG here: https://github.com/kubernetes/community/tree/master/wg-data-protection

Moderator: Xing Yang (VMware)

A

Hello, everyone. Today is March 23rd, 2022. This is the Kubernetes Data Protection Working Group meeting.

A

So today Phuong is going to give an update on the CBT work, and then we'll take a quick look at the annual report, which I think is ready, and then there are a few open issues. Okay, Phuong.

B

I can share my screen, but there's not much except for the document that we've already been looking at. You can see my screen now? Yeah. So most of the CBT discussion has been either on Slack or in private meetings with a few engineers, and we captured all of the meeting minutes in the document that you can see.

B

We posted it on Slack as well. The document linked here is the meeting minutes. We have had two meetings so far, and this one is from the later meeting, on March 18th.

B

What I want to summarize is this: in the previous meeting with the Kubernetes community, I presented the CBT service, and many people suggested that we should take a second look at the CRD approach, using a custom resource instead of a service.

B

So after discussing with the engineers involved, we said: let's give it a try and see how it goes. Let's see if we can find a way to overcome the limitation of the CRD approach. The limitation we are trying to overcome is the size of the CR. CRs on the Kubernetes API server would eventually hog the resources there, because when we create a CR it is stored in etcd.

B

If that is the case, then it will eventually run out of space or hog the resources there. So we discussed a few ideas to overcome that, and one of the ways we talked about is the aggregation API. We are still exploring and have not reached any conclusion yet. We are still researching, and a few engineers are making proposals.

B

I think one engineer, I think it is Sean, is proposing a workflow here, but again it's still... no, not this one. Which one is it?

B

I think I put a link there somewhere. Let me see if I have a link. Anyway, I will post a link. I think Sean did have a link somewhere.

B

I think it's this one. Sean is proposing a workflow, and again this is just his proposal. We haven't talked much about it because he was not there in our last meeting. We can either talk about it now or continue in our next meeting.

B

Another idea that we proposed is this.

B

We think that we should build a prototype and rough out all the ideas to see whether it actually works this way or not. The proposal that I put here is to try a CRD approach. It is similar to any custom resource: we create a CRD with a spec.

B

For the sake of this prototype, we just reuse the API that we have in our document: we take the request and put it in the spec, and the response goes into the status, and we add a little bit.

B

A little bit here and there, like state and error, to see how it goes. If we create this one, then we should create a controller to listen for that event. When this CR object is created on the API server, the controller will pick it up and call the storage API. We're going to pick one storage back end, maybe PowerStore from EMC, or AWS EBS, or VMware.

B

Whoever implements this will pick one. When we receive that CR object, if it has just been created, we call the differential snapshot API of the specific storage that we have at hand. Then we translate the result into the format that we want here and write the response, and whoever created this CR object will grab the response.

C

At a very high level, is this controller a sidecar?

B

Eventually it will be a sidecar of the CSI driver, but for now it's just a prototype.
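
The CR-plus-controller idea described above can be sketched roughly as follows. This is only an illustration under assumptions: the type names, field names, and state strings are hypothetical (the prototype's actual CRD schema is not shown in the meeting), and the storage call is a stand-in function rather than any vendor's API.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class ChangedBlock:
    offset: int  # byte offset into the volume
    size: int    # length in bytes


@dataclass
class DiffSpec:
    """Hypothetical spec: mirrors the request of the CBT API."""
    snapshot_base: str
    snapshot_target: str
    volume_id: str
    start_offset: int = 0


@dataclass
class DiffStatus:
    """Hypothetical status: mirrors the response, plus state and error."""
    state: str = "Pending"
    error: Optional[str] = None
    changed_blocks: List[ChangedBlock] = field(default_factory=list)


@dataclass
class DiffCR:
    spec: DiffSpec
    status: DiffStatus = field(default_factory=DiffStatus)


StorageDiff = Callable[[str, str, str, int], List[Tuple[int, int]]]


def reconcile(cr: DiffCR, storage_diff: StorageDiff) -> DiffCR:
    """Handle a newly created CR: call the storage diff API, fill in status."""
    if cr.status.state != "Pending":
        return cr  # already handled, nothing to do
    try:
        raw = storage_diff(cr.spec.volume_id, cr.spec.snapshot_base,
                           cr.spec.snapshot_target, cr.spec.start_offset)
        cr.status.changed_blocks = [ChangedBlock(o, s) for o, s in raw]
        cr.status.state = "Ready"
    except Exception as exc:  # surface back-end failures in status.error
        cr.status.state = "Failed"
        cr.status.error = str(exc)
    return cr


def fake_storage_diff(volume_id, base, target, start_offset):
    """Stand-in for a vendor differential snapshot call (assumption)."""
    return [(0, 4096), (8192, 4096)]
```

A client would create the object with only the spec filled in and then wait for `status.state` to leave `Pending` before reading `status.changed_blocks`.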

A

Yeah, I just want.

B

Okay, and then we are also proposing to create a backup controller to exercise this whole workflow. It will create the CR, wait for the controller to update the status, and back up the data according to the result. That's the backup controller at a high level. I put the details here, but again we are still in discussion.

B

So I just wanted to give an update on what we have so far. I want to point out that in this prototype we only focus on one scenario: a file system PVC with a block volume in the back end.

B

That way we can employ CBT, changed block tracking, the differential snapshot for block. Without a block volume in the back end, for example a file system PVC with NFS in the back end, we cannot do anything. So we only focus on the file system PVC with the block back end. That is my update on the CBT effort. Again, we communicate with each other on Slack, on the wg-data-protection channel. If anyone is interested in the project or wants to contribute, just jump in and discuss there. I will try to schedule a meeting every week, usually on Friday.

B

But again, it might be hard for everyone to participate. For example, there's one engineer who really wants to participate, but he's in India and the time is rough, and so on. It will be hard to accommodate everyone. So that's it for me.

A

Okay, can you go back to your document?

B

Oh, the document, yeah, the meeting notes, right?

A

Right, so the backup controller, that's the one. You went a little quickly. This is basically doing the end-to-end backup.

B

Yeah, it's trying

A

to do that backup.

B

With this CBT service.

A

But is this also the one that is going to create the VolumeSnapshot objects? Which controller is creating the volume snapshot?

B

Yes, it will create the VolumeSnapshot objects. We were also discussing whether we should do the data mover here too, but Dave mentioned that we should use some kind of open source data mover. I haven't looked into that, to be honest.

A

So the flow would be... okay.

B

Do you want me to talk about the flow here? I can.

B

Okay, let's talk about this in a little more detail. Basically, there is a file system PVC, and the controller will create a snapshot of that PVC.

B

Once we have that snapshot, because it is a block snapshot in the back end, we try to create a block-mode PVC out of that snapshot.

B

When we have the block PVC, if we are doing a full backup, we then copy all the blocks off that raw block device.

A

So in your PoC, are you going to use some data mover?

B

Yes, in this prototype we're going to use some data mover.

B

Or we have to implement one. It should be simple: it should just copy blocks into a backup store. Then, with CBT, at this point we create a CR object with this format here. If we already have a previous snapshot, we specify the base snapshot here, and the current snapshot, the volume ID, and so on, and the start offset will be 0, and so on.

B

Then we create that object, and at that point this controller, which is listening for this object, will go ahead and call the storage API to get the differential snapshot and then post the result in the status here.

B

At that point, the backup controller will see the list of all the blocks that have changed. Then, instead of backing up all the blocks, it will only back up the blocks that have changed. And after that,

B

Of course, it will delete the block PVC because it doesn't use it anymore, then it will delete the VolumeSnapshot object, and it will also delete the CBT object here. Delete it all, and we are done with the backup.
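
The difference between the full and the incremental pass described above can be sketched with in-memory buffers standing in for the raw block device and the backup store. The function names are hypothetical, and a real data mover would stream from a device file rather than hold the whole volume in memory.

```python
from typing import Iterable, Tuple


def full_backup(device: bytes) -> bytearray:
    """First backup: copy every block of the raw device."""
    return bytearray(device)


def incremental_backup(backup: bytearray, device: bytes,
                       changed: Iterable[Tuple[int, int]]) -> int:
    """Later backups: copy only the (offset, size) ranges reported as changed.

    Returns the number of bytes copied.
    """
    copied = 0
    for offset, size in changed:
        backup[offset:offset + size] = device[offset:offset + size]
        copied += size
    return copied
```

With a changed block list from the CR status, the second function touches only the listed ranges, which is the saving the workflow is after.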

B

I have done a small experiment to convert from a file system PVC to a block PVC using the snapshot. What I did is take this file system PVC.

B

I create a volume snapshot from it. The VolumeSnapshot is bound to a VolumeSnapshotContent, and in that VolumeSnapshotContent you have the handle of the snapshot in the back end.

B

At that point I can grab that handle and create a new snapshot from it, and from that I create the new PVC.

B

The data source of the PVC is this new snapshot, the new block snapshot, and at that point I have a block PVC.

B

When I have a block PVC, I can attach it to any data mover as a raw block device. So that is the data path. I have done some experiments, but I did not get all the way yet, so a lot more work still needs to be done.
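
The objects involved in that experiment can be sketched as plain manifests built in Python. The field names follow the `snapshot.storage.k8s.io/v1` and core `v1` APIs; the object names, driver, storage class, and size are placeholders. Whether a given CSI driver allows restoring a file system snapshot as a block-mode volume is driver dependent, so this is a sketch of the shape, not a guaranteed recipe.

```python
def static_snapshot_content(name, driver, snapshot_handle, snap_name, namespace):
    """Pre-provisioned VolumeSnapshotContent wrapping an existing back-end handle."""
    return {
        "apiVersion": "snapshot.storage.k8s.io/v1",
        "kind": "VolumeSnapshotContent",
        "metadata": {"name": name},
        "spec": {
            "deletionPolicy": "Retain",
            "driver": driver,
            "source": {"snapshotHandle": snapshot_handle},
            "volumeSnapshotRef": {"name": snap_name, "namespace": namespace},
        },
    }


def pre_provisioned_snapshot(name, namespace, content_name):
    """VolumeSnapshot bound to the pre-provisioned content above."""
    return {
        "apiVersion": "snapshot.storage.k8s.io/v1",
        "kind": "VolumeSnapshot",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {"source": {"volumeSnapshotContentName": content_name}},
    }


def block_pvc_from_snapshot(name, namespace, snapshot_name, storage_class, size):
    """PVC in Block mode whose data source is the new snapshot."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "storageClassName": storage_class,
            "volumeMode": "Block",
            "accessModes": ["ReadWriteOnce"],
            "dataSource": {
                "apiGroup": "snapshot.storage.k8s.io",
                "kind": "VolumeSnapshot",
                "name": snapshot_name,
            },
            "resources": {"requests": {"storage": size}},
        },
    }
```

The handle passed to `static_snapshot_content` would be the value read from `status.snapshotHandle` of the original VolumeSnapshotContent.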

D

Phuong, just a quick question: is this backup controller essential to prove out the CBT prototype?

D

Or is it expanding the scope of the prototype a little bit?

B

It only illustrates that it can be done. It's not essential, because the essential pieces are this controller and the CRD here. Those are the main pieces. This one is just an effort to tie together a workflow that illustrates that this CBT can be done.

D

For what it's worth, if you want to show it, like a diagram, actually...

D

Okay, I would like to put together a diagram of some sort. I think.

B

Or something like that, yeah.

B

I think it would be nice. Maybe I will work on that diagram. I think we do have a diagram already. I just did not put it in this document.

A

Is that in the white paper?

B

Yeah, it should be in the white paper yeah.

D

In the white paper.

B

We posted a diagram of this workflow there, but the diagram we have in the white paper does not specify this use of a CRD. It's just calling the CBT service in general.

B

Yeah got it thanks.

A

Okay, I have that page up. I can show that.

A

This is the one.

B

Yep, that's it. And you can see there's a block called differential snapshot service, right?

A

So this is the controller that you were talking about?

B

Yeah, right now it is just a block there on the diagram. We need to expand that block into a little more detail, but again we are still in research mode, so we don't have anything concrete on that one yet.

B

Okay, that's it for me.

A

Thank you.

E

Thanks for driving this, Phuong. There's a lot of work here. Appreciate it.

B

Yeah, it's a contribution from many people.

A

So for the aggregated service, we're saying we haven't really gotten through that one yet, right?

B

We need more people who know the aggregation API well. Personally, I do not. I heard that some of the engineers who will participate in our next meeting know about that. But let's see.

E

Yeah, this is Dave over at Kasten. Kasten has actually been using aggregated APIs a fair amount, so we wanted to contribute on that part, and we'll get with Sean because he had the original proposal as well.

A

Yeah, that would be great, just to see how that would help address this concern over the size of the changed block list.

B

From what I understand, it will not save the object on the Kubernetes API server. It is saved on another server. How we manipulate that is the detail that I do not know.

A

So basically, in that case, would the CRD that we have stay the same, or does the CRD itself need to be redesigned?

E

I have no idea yet. We haven't fleshed it out fully, but the concept we were having was to take the changed block list out of the status and instead have something more like a cookie that points to a set of changed block resources. Those changed block resources would be provided by the aggregated API server and pretty much generated on demand by calling CSI, if that makes any sense. We haven't actually figured out exactly how to do it yet.

E

But the general concept is that pagination is then handled by the Kubernetes pagination API for iterating over lists of objects. The API then looks less imperative, and we should be able to take all of that out of etcd while it is still accessible via the regular Kubernetes APIs.

E

So it doesn't add a new, different API to talk to. It runs through the aggregated API server instead.

A

So basically you'll probably have a new Kubernetes API resource that represents the changed blocks, but instead of putting that into status, it's a separate API object. Something like that?

E

Yes, and then there's an aggregated API server that serves those.

D

Yeah, and to add to that, from the request perspective we're aiming for no drastic changes between the different approaches. Going back to Phuong's example, parameters like the snapshot base and snapshot target will still be required.

D

It's just a matter of the extras. You might recall the pagination parameters like offset and max size. For those, we're hoping to utilize the Kubernetes list request mechanism. Otherwise, for the differential parameters, we're aiming not to disrupt those too much from a request perspective.
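
The idea of serving changed blocks on demand through list-style pagination can be sketched as below. The `limit` and `continue` names mirror the Kubernetes list mechanism, but everything else is an assumption: the token here is a plain index (real continue tokens are opaque), and `fetch` is a hypothetical stand-in for a call into the CSI driver.

```python
from typing import Callable, Dict, Iterator, List, Optional, Tuple

Block = Tuple[int, int]  # (offset, size)
Fetch = Callable[[int, int], List[Block]]  # (start_index, max_count) -> blocks


def list_changed_blocks(fetch: Fetch, limit: int,
                        continue_token: Optional[str] = None) -> Dict:
    """Serve one page of changed blocks; nothing is persisted between calls."""
    start = int(continue_token) if continue_token else 0
    items = fetch(start, limit + 1)  # one extra item tells us if more remain
    has_more = len(items) > limit
    next_token = str(start + limit) if has_more else ""
    return {"items": items[:limit], "metadata": {"continue": next_token}}


def iterate_all(fetch: Fetch, limit: int) -> Iterator[Block]:
    """Client side: keep following the continue token until it is empty."""
    token = None
    while True:
        page = list_changed_blocks(fetch, limit, token)
        yield from page["items"]
        token = page["metadata"]["continue"]
        if not token:
            return
```

Because each page is computed when it is requested, the full changed block list never has to be held in a single stored object.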

A

Okay sounds good.

F

Hey, thank you. I had a question there. Is the concern with using the normal API server that the change list will be too big to store in the cluster?

A

So far, I think you have some data, right? Why don't you show the data?

B

I can. It's not really big, but you can see it here.

B

I did some calculations. With 1.5 megabytes, which is the size limit of the CR object, we can potentially hold about 98,000 changed blocks, just the metadata.

B

Changed blocks, right. This is not cast in stone, because the changed block itself may contain a context, which may be many bytes. Let me show you the API that we have. It might have a context field, and we don't know exactly how many bytes that one has. Where is it?

B

Right here, this context. We don't know how many bytes that context is. It is vendor specific. But if we just take the size of the offset, the size, the zero-out bool, and this field, then each CR can hold approximately 98,000 metadata blocks, which I think is very substantial for the difference between two volume snapshots.

B

If we can capture those 98,000, and if there are a lot more than that, then it might be better just to back up the whole volume. So that's some data that I have here, and I've done some calculation too: if the block size is very small, like 512 bytes, then it's not much.

B

It's just about 48 megabytes, which is not really impressive. But if the block is big, like a 2-megabyte block, then this metadata can describe 192 gigabytes. On top of that, Dave also mentioned ways we can combine blocks. For example, blocks that lie next to each other can be combined into one entry.
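
The figures quoted above can be reproduced with a short calculation. One assumption is needed: a metadata entry of 16 bytes per changed block, which is inferred here because it yields the quoted numbers and is not a size stated in the meeting. The vendor-specific context field is ignored.

```python
LIMIT_BYTES = int(1.5 * 1024 * 1024)  # 1.5 MiB object size limit
ENTRY_BYTES = 16                      # assumed metadata size per changed block


def max_entries(limit_bytes: int = LIMIT_BYTES,
                entry_bytes: int = ENTRY_BYTES) -> int:
    """How many changed-block entries fit in one object."""
    return limit_bytes // entry_bytes


def describable_bytes(block_size: int) -> int:
    """How much changed data one full object can describe."""
    return max_entries() * block_size


print(max_entries())                                # 98304
print(describable_bytes(512) // 1024**2)            # 48 (MiB)
print(describable_bytes(2 * 1024**2) // 1024**3)    # 192 (GiB)
```

Any per-entry context bytes would reduce the entry count proportionally, which is why the 98,000 figure is an upper bound rather than a guarantee.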

B

We change the size: we can simply state the total size of the blocks that sit next to each other, so we don't have to map one-to-one between a changed block entry and a physical block. One changed block entry can cover multiple adjacent physical blocks on the volume. Because we have this size field here, we can express it in terms of the size of multiple physical blocks.

B

This is a variable size, though.
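
Combining neighbouring blocks into one variable-size entry can be sketched as follows. The function name is hypothetical, entries are `(offset, size)` pairs in bytes, and only exactly adjacent ranges are merged (overlapping ranges are not handled).

```python
from typing import Iterable, List, Tuple


def coalesce(blocks: Iterable[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Merge (offset, size) entries that sit directly next to each other."""
    merged: List[Tuple[int, int]] = []
    for offset, size in sorted(blocks):
        if merged and merged[-1][0] + merged[-1][1] == offset:
            # Previous entry ends exactly where this one starts: extend it.
            merged[-1] = (merged[-1][0], merged[-1][1] + size)
        else:
            merged.append((offset, size))
    return merged


print(coalesce([(0, 512), (512, 512), (2048, 512), (1024, 512)]))
# [(0, 1536), (2048, 512)]
```

Four physical blocks become two entries here, so the same object size limit can describe more changed data when changes are clustered.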

D

Yeah, I think the fact that the responses are unbounded in size is definitely worrisome, and hence we're putting a lot of thought into how to mitigate that. I wish there were a cookie-cutter way to say: implement this and all will be fixed. It feels like there have to be multiple approaches to it. I guess

D

the fundamental goal, the first goal, is that if we can avoid storing it in etcd, that will be great. Hence all these talks around pagination and the aggregation API.

D

And if we really have to store it, then I think someone made a suggestion somewhere in the Slack channel about some sort of automatic garbage collection mechanism. So the way I look at it, there will be multiple approaches, and several things need to be implemented to mitigate this.

B

I agree, because we already have a timeout here for each request, and for each response we can put a timeout here too, and after a certain time we can clean up the object.

B

This is one way. Like he just mentioned, there are multiple approaches to avoid having too many objects on the API server.
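
The timeout-based cleanup mentioned above could look roughly like this. It is a sketch under assumptions: the function name is hypothetical, each object is represented only by its name and the time its response was posted, and the deletion itself is left to the caller.

```python
from typing import Dict, List, Optional


def expired(objects: Dict[str, Optional[float]], now: float,
            ttl_seconds: float) -> List[str]:
    """Return names of objects whose response is older than the TTL.

    `objects` maps object name -> time the response was posted, or None
    if the object has no response yet (those are never collected).
    """
    return [
        name
        for name, completed_at in objects.items()
        if completed_at is not None and now - completed_at >= ttl_seconds
    ]
```

A controller could run this periodically and delete the returned objects, so that stale change lists do not accumulate on the API server.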

D

Yeah, until we have some sort of working prototype, it's really hard to tell everyone exactly how we're going to solve it.

F

Yeah. One of the reasons I brought that question up is that even if we use an aggregated API server, that API server needs an etcd to back whatever we are posting there, right? So I was wondering whether we are just moving the problem from putting it in the cluster etcd to putting it in a different etcd that is served by the aggregated API server.

D

Yeah, that's definitely a good point. I guess the reason for an aggregated, custom API server is that we have control on that front. If we look at the metrics server, or at some of the service mesh solutions out there, they have aggregated API servers that don't necessarily store things in etcd as they handle and manipulate the resource objects.

D

So, going back to what Dave was saying earlier, it might become more of a balance between a declarative and an imperative invocation method. But yes, I definitely agree that there are examples out there that don't store everything in etcd. Still, etcd will be a core thing in there, even with a custom API server. This is what the SDK is going to ask you when you write the Go code: it's going to ask you which etcd you are going to point to.

E

Well, hold on. For the aggregated API server backing the changed block list, it doesn't need to store a copy of it. It can go to the CSI driver and just ask for things on demand, and that is, I think, what we were thinking of doing. It wouldn't read it all, store it someplace, and then serve it. Whenever it gets requests for certain changed blocks, it goes and asks the CSI driver for them using the API that's defined.

D

Yeah, everyone agrees on that. I think at this point it's just at the code level.

D

We can talk about it when we get there, but the API server library is going to ask for references to etcd. Whether you use it or not is a different thing, but it's part of the setup: bootstrapping the API server is going to ask you for an etcd path.

E

Yeah, so etcd isn't really exposed to the user through the Kubernetes API server, I don't think. We have an API server API that happens to be backed by etcd, but etcd is not really required.

D

Right. I feel like we're talking about different levels of things here, so let's park it for now. I think we can all agree that the goal here is to not store anything.

B

Yeah we can talk about that in a separate meeting.

A

Also, it's good to write that in your KEP: if we decide to go with this aggregated API server, what the reasons are. Whatever you have on Slack, the calculation you did, I think those are all helpful.

B

When we decide. Right now we're still exploring.

A

Even the alternatives and things like that, all of those.

B

I agree, I agree.

A

Thank you, okay. So, let's see.

A

Okay, I just want to show this one quickly. This one is ready, if you want to take a look and provide feedback, and then we will submit a PR to get it merged. It basically talks about what we did last year: the white paper, the KEPs, and the work that is not in a KEP yet, like CBT.

A

Basically, I went through our agenda and captured the things that we have been doing in the working group.

A

And yeah, you can take a look and add comments.

A

Okay, now there's a question here from Anja: what is the status of the ContainerNotifier KEP? For that one, Xiangqian and I need to talk about it and see how we address those comments. We thought we had addressed them, but there are reviewers who still think there are concerns that are not addressed. So we need to go back and think about how to address those comments. They are not straightforward.

A

That's why we did not get to that immediately, but we do need to get back to it. Do you have more comments?

G

No, thanks. I just wanted to know what's going on there.

A

I also want to ask you something. I know that we want that because we want to be able to quiesce the application. This is from our point of view. Is there any other use case? Of course, that is a better solution: it's more Kubernetes native and more secure.

A

Without that, currently you could do it using hooks. I just want to understand: is there any other requirement?

G

No, from my side there's no additional requirement. I've been following this since a while back and wanted to know where this is. I think we had another spec earlier with a hook action framework, but it has been a while, around two and a half years, I believe, and we don't have anything going there.

A

We have had ups and downs, as I just said. The notification is part of SIG Node, so it's pretty hard to get it through SIG Node, and it's also actually a pretty big one. So we need to address their concerns.

A

We just need to think about how to address those, because we thought they were addressed, but maybe they are not. There are still some things that we need to sort out. There's also an API review question that I'm still trying to figure out how to address. So that's why, but we will get back to that.

G

Okay, sure thanks. Thank you, yeah, thanks for that, okay.

A

The next one. Okay, you have a question about the next release of the external-snapshotter. That change got merged, and then I think there were a couple of bugs resulting from that.

D

Yes.

A

Those are fixed now. I'm checking with the person who discovered the additional bug introduced by that original fix, just to see if things are running fine on their end.

A

Because I don't want to release something that fixes one thing but breaks something else. Normally we do a release after every Kubernetes release, so a few weeks after the 1.24 release we will be doing an external-snapshotter release. What's the GA date for that? I think it's the 19th of April, right? So probably

D

like the beginning

A

of May, we will definitely have a new release. That's for sure. I was just wondering whether we need to do a patch release on the older branches. I was hesitant to do that because we actually just did one in January.

A

Normally, we probably want to do that maybe every quarter or so.

G

I understand, but this is kind of a major bug in some sense, I'd say, so it would be better to get this out as soon as we can.

A

Right, but it's been there for quite a long time, and it looks like it only happens in one particular scenario: when you submit a create snapshot request with an invalid volume.

G

It gets triggered in most of the failure scenarios. We have been facing this, and I think it did not get caught until now because not a lot of people were using it.

G

I mean, that's what I thought.

A

Okay, we could consider cutting that earlier. I just want to be a little bit cautious.

G

Yeah, I I don't really understand, especially.

A

Have you been testing the code on the master branch with this fix?

G

It's not that easy to set that up.

A

That's why I'm a little hesitant, and that's why I'm asking the person who actually discovered the bug in that bug fix to see if they're fine. If they're fine with it, then maybe we can. I was thinking we could cut a patch release on the 5.0 branch and possibly on the 4.x branches. I think this is a problem that has been there since the very beginning. Which release are you using currently?

G

I need to check. I think it's probably the one just cut. I'm not sure which.

A

It would make sense to cut a release for all the stable branches, but for 4.x, I think we just recently cut a patch release. But I'll see. Maybe it still makes sense: if we want to release this on 5.0, maybe it also makes sense to release it on 4.x. Okay, I'll see.

G

Thank you, thanks for that.

A

Thank you for bringing this up. All right, that's all I have here. Does anyone have any other issues to discuss?

A

I think we already talked about the removal last time. The decision is to do the removal, and we have already notified SIG Release, so that will be included in the deprecation and removal blog.

A

All right, so if you don't have any other topics, then that's it for today. Thank you, everyone.

D

Thanks everybody.