From YouTube: Kubernetes SIG Storage 20200813
Description
Kubernetes Storage Special-Interest-Group (SIG) Meeting - 13 August 2020
Meeting Notes/Agenda: https://docs.google.com/document/d/1-8KEG8AjAgKznS9NFm3qWqkGyCHmvU6HVl0sk5hwoAE/edit#heading=h.m2cjoevcxwkk
Find out more about the Storage SIG here: https://github.com/kubernetes/community/tree/master/sig-storage
Moderator: Saad Ali (Google)
A: All right, today is August 13, 2020. This is the meeting of the Kubernetes Storage Special Interest Group. As a reminder, this meeting is public, recorded, and posted on YouTube. On the agenda today we have planning for the 1.19 release.
A: In this meeting we want to get an end-of-quarter, end-of-release status update for all the projects that we've been working on for this release. Then, in the next meeting on August 27, we will do a 1.20 planning session, so please come prepared; we'll do the planning for that at that point. As a reminder, today is just an end-of-quarter, end-of-release review; we want to get an end-of-quarter status.
A: Okay, so the first item that we have is CSI online/offline resizing and volume expansion. Hemant, are you on the line, by any chance?
A: Okay, we will skip over that. Hemant tends to join later in the meeting, and we can come back and get a status update from him when he joins.
A: There we go. Hey, Hemant, we were on the first item here. I wanted to get an end-of-release status from you on the CSI online/offline resizing and volume expansion fix issues, and then you can talk about recovering from resize failures as well.
B: Yeah, so the only pending item that we have is in the external resize controller, and I think a similar fix will be needed elsewhere. When we expand a PVC, the resize controller reads from the informer to check the PV status, and sometimes the informer still has stale data, so it results in flakes.
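
For context, a minimal Go sketch of the pattern being described, with illustrative names only (this is not the actual external-resizer code): the informer cache can lag behind the API server, so a controller that needs the latest PV state falls back to a live read when the cached copy looks out of date.

    package resizer

    import (
    	"context"

    	v1 "k8s.io/api/core/v1"
    	"k8s.io/apimachinery/pkg/api/resource"
    	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    	"k8s.io/client-go/kubernetes"
    	corelisters "k8s.io/client-go/listers/core/v1"
    )

    // getFreshPV reads the PV from the informer cache, but falls back to a
    // live GET when the cached copy has not yet observed the expected size,
    // avoiding the stale-read flake described above.
    func getFreshPV(ctx context.Context, lister corelisters.PersistentVolumeLister,
    	client kubernetes.Interface, name string, want resource.Quantity) (*v1.PersistentVolume, error) {
    	pv, err := lister.Get(name) // served from the local cache; may be stale
    	if err == nil {
    		capacity := pv.Spec.Capacity[v1.ResourceStorage]
    		if capacity.Cmp(want) >= 0 {
    			return pv, nil
    		}
    	}
    	// Cache lag: confirm against the API server directly.
    	return client.CoreV1().PersistentVolumes().Get(ctx, name, metav1.GetOptions{})
    }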
B: Basically, it doesn't affect end users, but it results in a flake that we are trying to fix in the resize controller, and I should be able to fix it by today or tomorrow. The other changes are in, so that covers the fixes. Then, for recovering from resize failures, I have pinged Tim Hockin and Jordan multiple times about whether they have had a chance to either have a call or think about whether the allocated-resources field belongs, and what we need to do.
B: Obviously, the code and everything is already ready and done. Maybe once the 1.19 release is done they will have more bandwidth to review. So that's where it is.
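
For reference, a rough sketch of the field under discussion, based on the recovery-from-expansion-failure KEP as it stood at the time; this is an assumption drawn from the proposal, not merged code.

    package core

    import v1 "k8s.io/api/core/v1"

    // Proposed addition (illustrative): record how much storage the control
    // plane has actually allocated, so a user can lower a pending request
    // after a failed expansion instead of being stuck.
    type PersistentVolumeClaimStatus struct {
    	// ... existing status fields ...

    	// AllocatedResources may exceed Spec.Resources.Requests while an
    	// expansion is in flight or after a failed expansion is rolled back.
    	AllocatedResources v1.ResourceList `json:"allocatedResources,omitempty"`
    }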
A: Okay, sounds good. Would it be okay to mark the first one as completed?
A: Okay, so then I'll keep it as started for now, and that'll be a signal for us to carry this over to the 1.20 sheet once we create it.
A: Perfect, sounds good to me. Thanks, Hemant. I think we've got both of these items covered. The next item is supporting containerized CSI node plugins on Windows with CSI Proxy. Deep, Jing, or anybody working on Windows on the call? Maybe KK?
D: Yeah, Deep had cut an RC of CSI Proxy for beta, so I think they're getting in good shape.
C: Yeah, so there are two main things that we want to get in before we cut the new release. One is to move the APIs and client to a separate Go module; that one is actually looking very close, and I think the PR is almost ready to go. The second one is to add the validation webhook. Andy and Sean have been working on that: there is a KEP that has been reviewed, and a PR has also been submitted, which is still adding boilerplate for the webhook.
A: Okay, and are we targeting the 1.19 release or the 1.20 release?
C: 1.19. We'd like to get this one in, the webhook, because we need to do this over a couple of releases, making sure that we can GA in 1.20. So we'd like to get this one in.
A: And the API PR?
C: Yeah, that one is almost ready; I just need to take another look. I think it's already updated and the comments are addressed, so it just needs one more review and it's ready to go. So mainly we're waiting for the validation webhook work; there is some other bug fixing in progress, but the validation webhook is the big one that we need to get in.
A: Got it. And I think Michelle mentioned to me that there were some concerns that came up around validation. Do you want to talk about that at all?
C: Yeah, there are some concerns about what happens if you make the validation more strict: there are some invalid API objects that cannot get removed after that, basically meaning that the user's setup will always have those. For that, we don't really have a perfect way to completely get rid of them, because we don't want to automatically delete them.
C: That could result in data loss, so we want the user to be the one responsible for removing those invalid objects. Michelle, are there any other concerns from you and Jordan?
D: No, I don't think we have any major concerns. I think we've identified a rollback procedure that would work if users get stuck in this situation, so I think that is sufficient.
E: One thing I want to add here is how we're going to have users know when it's okay to move on, that is, to check that there are no invalid objects left in the cluster.
C: Okay, so that would not be something that we are checking automatically? It would be the user's responsibility?
D: Yeah, so I think developing a script or a tool that's based off of the validation logic that we have would be something we could potentially provide to make it easier, because otherwise everyone has to write their own tool that does all the same things.
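
As a hedged sketch of what such a tool might look like, assuming the v1beta1 snapshot API and the "exactly one source" rule the webhook enforces (names and the check itself are illustrative, not the shipped tool): list all VolumeSnapshots with a dynamic client and flag the ones that would fail the stricter validation.

    package main

    import (
    	"context"
    	"fmt"

    	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
    	"k8s.io/apimachinery/pkg/runtime/schema"
    	"k8s.io/client-go/dynamic"
    	"k8s.io/client-go/tools/clientcmd"
    )

    func main() {
    	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
    	if err != nil {
    		panic(err)
    	}
    	dyn := dynamic.NewForConfigOrDie(cfg)
    	gvr := schema.GroupVersionResource{
    		Group: "snapshot.storage.k8s.io", Version: "v1beta1", Resource: "volumesnapshots",
    	}
    	list, err := dyn.Resource(gvr).Namespace(metav1.NamespaceAll).List(context.TODO(), metav1.ListOptions{})
    	if err != nil {
    		panic(err)
    	}
    	for _, s := range list.Items {
    		pvc, _, _ := unstructured.NestedString(s.Object, "spec", "source", "persistentVolumeClaimName")
    		content, _, _ := unstructured.NestedString(s.Object, "spec", "source", "volumeSnapshotContentName")
    		// The stricter rule: exactly one source must be set.
    		if (pvc == "") == (content == "") {
    			fmt.Printf("invalid VolumeSnapshot %s/%s\n", s.GetNamespace(), s.GetName())
    		}
    	}
    }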
B: Yeah, I think one of the problems that we have seen is that cluster upgrades can be blocked by objects that are stuck in a terminating state, or states like that. We have some end-to-end tests where we run the snapshot tests, and afterwards we have these volume snapshot objects that are stuck, and now that cluster cannot be upgraded because it has objects stuck in terminating state.
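
A minimal sketch of detecting that situation, reusable with the same illustrative dynamic-client listing as the sketch above: an object is stuck terminating when deletion has been requested but finalizers are still holding it.

    import "k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"

    // stuckTerminating reports whether deletion was requested but finalizers
    // still hold the object; this is the state that blocked the upgrade in
    // the e2e tests described above.
    func stuckTerminating(obj *unstructured.Unstructured) bool {
    	return obj.GetDeletionTimestamp() != nil && len(obj.GetFinalizers()) > 0
    }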
A: As I understand it, the webhook will allow existing invalid objects to continue to exist for some period of time, and eventually the validation will be made more strict. What this means is that it gives users a chance to delete those invalid objects, and the discussion here is: how do you detect that you have invalid objects and remove them? The suggestion is to create some sort of tool to be able to detect that. Any other comments on this topic?
D: Anyone want to take that? So, one action item we have from the discussion with the API Machinery folks is that we're going to open a feature request to API Machinery so that the CRD back end can be more lenient in allowing a deletion flow for invalid objects. If they can solve that at the CRD layer or the API server layer, then that reduces the need for us to build all these out-of-band tools and things like that.
D: If the CRD schema had had all these validation features built into it when we initially released the APIs, then we would be in good shape, because with the CRD schema we wouldn't even need a validation webhook. But since we've released an API without using that schema, adding it to a new schema will be a breaking change.
A: All right, so the takeaway is: if you're using snapshots beta, be aware of these changes in the new validation webhook and make sure your objects are valid. All right, moving on to the next item: non-recursive volume ownership (fsGroup) going to alpha. The last status update here was that we're going to remain in alpha for 1.20.
A: Is that still the case, Hemant? Any changes here?
A: Sounds good, so we'll carry that over to 1.20. Next up is SELinux recursive permission handling. Jan, are you on the line?
A: The next item is file permission handling for Windows, and I imagine this has the same status, so I'm going to get that moved to 1.20.
D: Yeah, I think we have a couple of PRs in flight, but they're not in 1.19, so we will be targeting 1.20 for those.
A: All right, we'll get that carried over. The next item is storage capacity tracking; that was completed.
D: On the external provisioner, yeah, Patrick's still working on the external provisioner changes. Jan, do you think it's close?
A: It was done, so, okay, we'll just move on from that one. Spreading across failure domains design: we said we will move this to 1.20. Xing, any updates?
C: Yeah, so I need to schedule a meeting; I'll schedule one for next week.
A: Sounds like we'll move that as well. And then CSI out-of-tree for the NFS driver was completed.
D: Yeah, it's still on my plate. I just need to push the button. I will try to get that done in the next couple of weeks.
A: Cool, sounds good to me. And do we want to move them to 1.20, or do you want to keep them in 1.19?
D: I mean, it's all out of tree. Okay, I'm just going to push the deprecate-or-archive button on the thing soon.
A: Okay, I'll say preliminarily we're moving it to 1.20, meaning the 1.20 spreadsheet, and then we can remove it if needed. And this is a call-out to anybody on the call: if you are using the fibre channel driver, or you care to use it, or you care about the CSI fibre channel driver and you don't want it to be deprecated, then you should jump in; this might be a good place to volunteer and help get it into good shape.
A: The next set of items is for the kubernetes-incubator organization, which is being deprecated. One of the repos for the Kubernetes Storage SIG underneath that organization is external-storage, and since we want to deprecate it, we want to make sure any items that were under that repo have a new home if they are still important.
A: The most important item there was the existing local static external provisioner, and that was pulled out already. There were a number of other external provisioners there that were highlighted as important and are in the process of being moved out.
A: GlusterFS is one of the ones currently being moved out; that's in progress, and we're going to move it into 1.20. NFS: Karen, are you on the line?
H: Yeah, so the NFS server one is moved; I think I'm getting a new PR merged there.
A: And then, similarly…
H: Right, there are some CLA-related issues that need to be resolved, but I think we should be able to, yeah.
C: Because those are actually already moved, right? I think, Karen…
H: There was only one thing: there were a lot of enhancements that came in after the initial move. I went and tagged all those issues, and we asked to take them into the new repo.
A: The next item is volume snapshot namespace transfer. This was a design; we didn't make too much progress on it, so we're going to go ahead and move it to 1.20.
A: CSI volume health. Go ahead, Xing.
C: So yeah, this one is looking good. There are just one or two more PRs that we want to get in, but those are kind of small PRs; they should be in there soon, and then we are ready to cut a release.
A: Next up is the object storage API, also known as COSI. Jeff or Sid, do you want to give an update on this?
A: Okay, I can provide an update here. We had a KEP review meeting last week, and there's going to be another KEP review meeting right after this meeting, on the same channel, at 10 a.m. Pacific time. They're working towards getting the KEP approved, and they're hoping to get that approval after today's meeting, depending on how it goes. The KEP review and process meetings are every Thursday.
A: The existing API for CSI ephemeral volumes: I wanted to do bug fixes here. The last status update was waiting for a generic solution, no updates, move to 1.20. Is that still the case?
A: Cool, thank you, Christian. The next item is vSphere CSI migration. I believe we have a meeting on Friday to discuss this. Anything else to add here?
C: I think that's it. We are just waiting for that meeting. There are a few things we are still evaluating, but we'd like to see what the community decision is on this deprecation.
B: One more comment, not about vSphere but in general about the plugins that we are migrating: we log "this plugin is deprecated, it will not be supported", and it gets logged each time a FindPlugin or FindAttachablePlugin call is made. It's quite spammy, at least in some cases. Is that necessary?
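
One common way to tame that kind of spam (a sketch with illustrative names, not the actual kubelet code) is to emit the deprecation warning once per plugin rather than on every lookup:

    package volume

    import (
    	"sync"

    	"k8s.io/klog/v2"
    )

    // deprecationWarned tracks plugins we have already warned about, so the
    // warning is emitted once per plugin instead of once per FindPlugin call.
    var deprecationWarned sync.Map

    func warnDeprecatedOnce(pluginName string) {
    	if _, seen := deprecationWarned.LoadOrStore(pluginName, struct{}{}); !seen {
    		klog.Warningf("in-tree volume plugin %q is deprecated and will not be supported in a future release", pluginName)
    	}
    }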
A: Azure CSI migration: Andy from Microsoft was working on this. The last status update we had was that Azure Disk was moved to beta, Azure File remained in alpha, and Azure File has dependencies that need to be resolved before GA. Any updates on that from anyone?
A: Second is with SIG Apps: we have an issue where PVCs created by a StatefulSet are not removed, and this issue wants to clean that up. The last status update here was that a KEP was created. Any other updates here?
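
For flavor, a sketch of the kind of API the KEP is after; the field and value names here are illustrative assumptions, since the KEP was still under review at the time:

    // Illustrative only, not a merged API: a retention policy on the
    // StatefulSet spec controlling PVCs created from volumeClaimTemplates.
    type PersistentVolumeClaimRetentionPolicy struct {
    	// WhenDeleted: what happens to the PVCs when the StatefulSet itself
    	// is deleted ("Retain", today's behavior, or "Delete").
    	WhenDeleted string
    	// WhenScaled: what happens to a PVC when its pod is removed by a
    	// scale-down.
    	WhenScaled string
    }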
A: So we'll get that moved to 1.20. Do you think it'll be ready to implement in the 1.20 time frame?
A: All right, looks like good progress on that. The next item is volume expansion for StatefulSets. The last status here was that a KEP was raised and comments were being addressed. Anything else on this?
B: Yeah, I think KK is not on the call today, but okay. I had a call with him, I think last week, where we discussed the KEP, and it is still missing some important details about the flow, actually.
B: So I thought he was going to update the KEP with those details, as we discussed on the call.
A: Okay, so it might be that 1.20 is going to be a design instead of an alpha.
B: It could become implementable, but it needs reviews from us, and from Jordan or Tim as well.
A: So we'll keep an eye on that and try to get it moving. The next item is ExecutionHook. I think we have some good news around that. Xing, do you want to talk about that?
C: Yeah, so we had another meeting with SIG Node, and Jordan and Tim were also there. It was actually a pretty good meeting.
C: We reached consensus at the end. There were a few decisions we made there to address the concerns from the SIG Node side: we don't want to do retries in the kubelet, and only the external controller will be the one to retry if a hook fails the first time; we'll have a pod-level status instead of each container having its own status; and we also need to add a section about what the impact on the kubelet is if we add this feature. Actually, after the meeting Derek added a comment, because he couldn't join; he had a conflict. He was asking if we can do a POC outside of the kubelet. Tim was saying that it's okay to do it outside, but he does not want that to conflate with the final design, and I'm also not quite sure what that means.
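
A rough sketch of what those decisions could look like in an API; this is purely hypothetical, since the ExecutionHook design was still in flux, and every name below is an assumption:

    // Illustrative shape only; not the released or final design.
    type ExecutionHookSpec struct {
    	// PodName and ContainerNames select where the hook command runs.
    	PodName        string
    	ContainerNames []string
    	// ActionName references a HookAction object holding the command,
    	// e.g. a quiesce/unquiesce pair for application-consistent snapshots.
    	ActionName string
    }

    type ExecutionHookStatus struct {
    	// One pod-level status, per the SIG Node feedback, instead of a
    	// per-container status map. Retries live in the external controller.
    	Succeeded *bool
    	Error     string
    }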
A: Yeah, nice work driving that; this was pretty challenging. For folks who are not familiar with the situation: we have a chunk of work that we want to push for the snapshot controller.
A: Initially, we thought this would be a standalone controller that would execute these quiesce and unquiesce hooks, but the feedback we received was that we should make this generic, because it's a common problem across Kubernetes: being able to signal an arbitrary container or an arbitrary pod for various other reasons. So, try to make a generic API. And there was also pushback, for security reasons, against being able to exec into an arbitrary pod or container, and for keeping that logic centralized in the kubelet.
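
For reference, this is the kind of exec the security discussion is about: with client-go, an external controller can run a command in a chosen container through the API server's exec subresource. A minimal sketch follows; the pod, container, and freeze command are illustrative.

    package main

    import (
    	"os"

    	corev1 "k8s.io/api/core/v1"
    	"k8s.io/client-go/kubernetes"
    	"k8s.io/client-go/kubernetes/scheme"
    	"k8s.io/client-go/tools/clientcmd"
    	"k8s.io/client-go/tools/remotecommand"
    )

    func main() {
    	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
    	if err != nil {
    		panic(err)
    	}
    	client := kubernetes.NewForConfigOrDie(cfg)

    	// Build an exec request against the pod's "exec" subresource.
    	req := client.CoreV1().RESTClient().Post().
    		Resource("pods").Namespace("default").Name("my-app-0").
    		SubResource("exec").
    		VersionedParams(&corev1.PodExecOptions{
    			Container: "app",
    			Command:   []string{"sh", "-c", "fsfreeze --freeze /data"}, // illustrative quiesce
    			Stdout:    true,
    			Stderr:    true,
    		}, scheme.ParameterCodec)

    	exec, err := remotecommand.NewSPDYExecutor(cfg, "POST", req.URL())
    	if err != nil {
    		panic(err)
    	}
    	if err := exec.Stream(remotecommand.StreamOptions{Stdout: os.Stdout, Stderr: os.Stderr}); err != nil {
    		panic(err)
    	}
    }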
A: So Xing went to SIG Node with a proposal, and they pushed back and said, you know, this should actually be an external controller, not part of the kubelet; we want to keep the kubelet as small as possible, and is there really a need for a generic hook? So Xing was kind of caught in the middle, and it required multiple meetings to try and resolve that. It looks like finally there's a light at the end of the tunnel, and consensus has hopefully been reached.
A: Thanks, Xing. The next item is the kubernetes utils mount library; we're going to go ahead and move that to 1.20. I think work was done but not completed. Michelle, is that correct?
A: Okay, so we'll get that moved over to 1.20. All right, thank you everyone for the updates; that's all I had in terms of getting a 1.19 update. At the next meeting, on August 27, we're going to do 1.20 planning. I'm going to copy over the spreadsheet to 1.20, and we'll move over the items that we didn't complete.
A: We can go over assignments and any additional items that you want to add. Please come prepared to the meeting to add those; you can add them as comments ahead of time if you'd like, and that would be okay as well. I'll send out a note before the next meeting about that.
A: Okay, well, if there's nothing else to discuss, we'll end a little bit early today and give folks time back in their day, and we'll reconvene in two weeks. Thank you for your time.