From YouTube: OCI Weekly Discussion - 2021-03-10
Description
OCI weekly developer's call recording from 10 Mar 2021. Notes/agenda here: https://hackmd.io/El8Dd2xrTlCaCG59ns5cwg#March-10-2021
A: On that note, John... we do have a number of items, so let's try to be fair, because we try to leave room for the presentations. Discussions tend to run longer, and if we don't have a longer discussion, then what's the point of having a discussion? So, base image: what we actually can do is the base image annotation proposal, because that one, I think, will be short, and we can time-box it at maybe seven minutes.
A: All I see is -20... holy crap, Mike's dead, maybe. One sec, okay, hold on. First of all, here's our HackMD, so everybody please sign in, okay? I was trying to capture more. So with that, why don't we get to content encoding? Well, John can't do that one either. John, if you can't talk, we're just going to punt all your stuff to next week.
B: Sure, so yeah, I mean, I was the one who originally made the issue, and this is my first time petitioning or coming to the OCI for spec stuff, so yeah, I'm a little new to this, I guess. But we basically have a...
B: ...what specific formats would entail in terms of CPU overhead, and how much benefit that would have in terms of container start times, or container push times for builders. And what we found was that different users get significantly different benefits from different compression algorithms. For example, in the data center, where networking is super fast, having no compression is ideal, and the OCI image spec added the ability to have layers without compression in them.
B: So you can just upload a tarball layer as opposed to a tar+gzip layer. And for uploaders, for example, or, excuse me, for home users, doing a very high level of zstd compression is actually beneficial, even though it's very CPU intensive. It turns out, now that everyone's working from home, you know, people have like 10-meg internet.
B
You
can
spend
all
the
cpu
time
in
the
data
center
and
it
pays
off
so
based
on,
go
ahead,
sure,
pros
and
cons,
so
yeah
you're
never
going
to
make
everybody
happy
right
in
the
current
distribution,
spec
and
image
format
don't
allow
for
different
users
to
get
different
re-encodings
of
the
same
layer
or
different
compressed
versions
of
the
same
layer
unless
you
d
duplicate
that
in
the
image
itself
and
that
just
becomes
wasteful
in
terms
of
storage
server
side
and
it
doesn't
allow
for
for
upgrades
in
flight.
B: So the proposal primarily talks about adopting the HTTP content-encoding spec, which allows the server to serve up different encodings of the same file format, or same file, excuse me, to different users, depending on, you know, how they decide to set their Accept headers.
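The negotiation described here is plain HTTP content negotiation applied to blob pulls. A minimal sketch of the client side, assuming a hypothetical registry host and a placeholder digest (neither is from the proposal text):

```go
package main

import (
	"fmt"
	"net/http"
)

func main() {
	// Placeholder registry URL and digest, for illustration only.
	url := "https://registry.example.com/v2/myapp/blobs/sha256:0000000000000000000000000000000000000000000000000000000000000000"

	req, err := http.NewRequest("GET", url, nil)
	if err != nil {
		panic(err)
	}
	// A data-center puller asks for the raw tarball; a home user on a
	// slow link might send "gzip" or "zstd" here instead.
	req.Header.Set("Accept-Encoding", "identity")

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// The layer digest always refers to the identity (stored) bytes; the
	// Content-Encoding header only describes the transfer representation
	// the server chose from what the client said it accepts.
	fmt.Println("Content-Encoding:", resp.Header.Get("Content-Encoding"))
}
```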
B: There's a small part of the issue that describes a mechanism to determine which content encodings are available for uploaders, because HTTP doesn't allow the client to interrogate the server in a trivial way. So that's really the only extension there, but the big benefit is on download.
B: ...how that would work with the content encoding on the download or the upload side.

C: ...
B: Like, in the data center, we would configure all of our Docker daemons to pull and say, you know, accept identity, or Accept-Encoding: identity. So: don't ask for a compressed version of the data. And it would serve the tarballs as-is, or, you know, 302 to S3 with the tarballs as-is. For home users who are doing, like, a docker pull...
B
It
would
be
able
to
do
on-the-fly
compression
for
images
that
have
not
been
fetched
before
and
then
the
first
time
you
fetch
a
layer,
it
would
be
able
to
re-upload
it
to
the
s3
store,
with
after
doing
compression
with
that
level
and
subsequent
polls
would
be
302
to
that
new
poll.
That's
like
the
very
specific
of
how
we're
planning
on
doing
this,
and
we
do
this
with
our
like
java,
tarballs
or
java
jars,
for
example.
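A minimal sketch of the pull flow just described, assuming a hypothetical BlobStore interface (all three methods are placeholders, not a real API); gzip stands in for whatever encoding was negotiated, and auth and error handling are omitted:

```go
package registry

import (
	"compress/gzip"
	"io"
	"net/http"
	"strings"
)

// BlobStore stands in for the backing store (e.g. S3).
type BlobStore interface {
	Get(digest string) io.ReadCloser            // identity (uncompressed) bytes
	CompressedURL(digest string) (string, bool) // pre-compressed copy, if one exists
	CompressInBackground(digest string)         // store a compressed copy for next time
}

func serveBlob(store BlobStore, w http.ResponseWriter, r *http.Request, digest string) {
	// Data-center pullers send Accept-Encoding: identity and get the
	// tarball as-is (a real deployment might 302 to S3 here as well).
	if !strings.Contains(r.Header.Get("Accept-Encoding"), "gzip") {
		blob := store.Get(digest)
		defer blob.Close()
		io.Copy(w, blob)
		return
	}

	// Later pulls: a compressed copy already exists, so redirect to it.
	if u, ok := store.CompressedURL(digest); ok {
		http.Redirect(w, r, u, http.StatusFound) // 302
		return
	}

	// First pull of this layer: compress on the fly for this client and
	// kick off a background job so subsequent pulls hit the redirect.
	store.CompressInBackground(digest)

	blob := store.Get(digest)
	defer blob.Close()
	w.Header().Set("Content-Encoding", "gzip")
	gz := gzip.NewWriter(w)
	defer gz.Close()
	io.Copy(gz, blob)
}
```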
B: They would have to reference uncompressed tar layers in order for this to work. Okay, yeah, that makes more sense.
E: That was going to be my, like, concern also: yeah, it breaks the content addressability if you're somehow expecting the registry to re-encode the layers. But if it's just, yeah, if it's just on the content transfer, I guess that makes sense.
B: Yeah, unfortunately, there's not really a way yet to tell people that they should upload their manifests without compression, but that's something that... like, at least we own some of the build infrastructure.
B: So we can start to migrate people to doing that, and then add a rejection policy when we have enough people moved over. But right now, if they upload a tarball, it's super slow, because their upload time is just, like, abysmal: they're uploading 10x the amount of data, because it's uncompressed, because there's no way to say, you know, "upload this in a compressed fashion". And then in the data center, like, they want it uncompressed when we're pulling it...
D: ...
B: The one other thing that's not in the proposal as of today, because the standard is not standardized upstream yet, is zstd custom dictionary support. So this was a case where we found doing offline dictionary generation can get us almost... like, we already have, like, a 90-something percent compression factor, and it can get us from, like, 93-94 to, like, 97-98 percent compression factor. So it's a pretty significant win you can get from doing offline compression.
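To make the dictionary point concrete, here is a sketch of zstd compression with a pre-trained dictionary. The github.com/klauspost/compress/zstd package and the offline `zstd --train` step are our illustration choices, not tooling named in the call, and the file names are placeholders:

```go
package main

import (
	"bytes"
	"os"

	"github.com/klauspost/compress/zstd"
)

// compressWithDict compresses a layer using a shared dictionary that
// was generated offline over representative sample layers.
func compressWithDict(layer, dict []byte) ([]byte, error) {
	var buf bytes.Buffer
	enc, err := zstd.NewWriter(&buf,
		zstd.WithEncoderLevel(zstd.SpeedBestCompression),
		zstd.WithEncoderDict(dict), // dictionary trained offline
	)
	if err != nil {
		return nil, err
	}
	if _, err := enc.Write(layer); err != nil {
		return nil, err
	}
	if err := enc.Close(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	// Dictionary produced offline, e.g.: zstd --train samples/* -o layers.dict
	dict, _ := os.ReadFile("layers.dict")
	layer, _ := os.ReadFile("layer.tar")
	out, err := compressWithDict(layer, dict)
	if err != nil {
		panic(err)
	}
	os.WriteFile("layer.tar.zst", out, 0o644)
}
```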
A: I think we all recognize the, like... I was watching, I had container day things today, and there were three different sessions on compression formats and Nydus and different approaches and so forth. So I think we're all seeing that. And to Mike's point, like, there's always trade-offs: how do you support both, whether we're all working from home and how long that lasts, or what does a product like Docker Hub do, which, for the most part, never has somebody close to it?
A
So,
and
then
how
much
does
the
author
need
to
play
a
role
in
that
difference
like
this
is
yeah
we're
experimenting
this
with
a
teleport.
We
see,
you
know
we
try
to
make
it
completely
transparent
to
the
user,
the
user
uploads
it
we
expand
it.
There's
teleport
nodes
that
know
how
to
say
they
negotiate
say:
hey,
I'm
teleport
enabled.
Are
you
in
the
same
region?
Do
you
is
that
expanded?
Yes,
yes,
then
bang
it's
done
and
it's
much
faster.
I
think
the
problem
we're
facing
is:
where
does
that
happen?
A: The other thing I'll just add is that one of the reasons I've been pushing back and concerned about this is that we keep on assuming that registries, which are, generally speaking, dumb storage devices, right: there's a "put this in" and there's a "give us this", and we promised that what comes back out is the same thing that went in. Between the TUF stuff looking for upstream updates and timestamps, and some of these conversations where we're doing conversion on the fly...
A: Not only is it going to be compute-intensive, potentially, but it's also a matter of: should we be changing content on the fly? So I just wanted to raise those two concerns for us to think about: how do we incorporate these?
B: So, on your first point about digests: you do the digest after doing the encoding/decoding step... or, I guess, before the encoding step and after the decoding step. So the content encoding and the, like, content-addressable nature of the store have nothing to do with each other. The content encoding is just, like, an artifact of the fact that the registry protocol is over HTTP, and storage on disk will all be the same. And then the other thing is, like, it's totally...
B
Opt-In,
there's
no
demand
that
the
user
has
to,
or
the
registry
has
to
do,
on
the
fly
mutation.
You
can
still
allow
users
to
upload
with
manifest
that
have
tar
balls
in
them
that
are
compressed
with
gzip.
I
would
say
that,
like
it
would
become
it.
It
is
becoming
very
exorbitant
to
do
z,
state
plus
g,
zip,
plus
uncompressed,
and
storing
that
at
rest
and
that's
becoming
more
expensive
compared
to
the
compute,
at
least
from
the
economics
in
aws.
A
Have
you
done
the
cost
in
the
compute?
Like
that's,
that's
part
of
the
storage
is
not
free
right.
We
don't
necessarily
charge
as
much
as
we
should,
and
it's
not
just
the
size
of
the
storage.
It's
also
there's
just
a
bunch
of
overhead
related
to
how
many
list
apis
can
support
how
many
objects,
the
deletion
management,
then
the
the
in.
F: ...
A: I... yeah, I don't know if you got another mic or something; it's hard to... if you have another option... so, this...
E: ...

G: ...
B: It allows for them to do this. It does not require them to do this.
G: Right, right. And then, I mean, there's also the opportunity of doing lazy pulls. I think we've got a stargz little thing that we could show off, right, in containerd.
G: I'm not sure how that would affect this; we'd have to take a look at that. You don't want to have these compressed packets sitting in a cache on the server for a very long time period, right?
B
So,
like
I,
as
far
as
the
star,
gz
format
would
not
be
significantly
affected
by
this
unless
the
server
is
trying
to
do
on-the-fly
compression
because
of
the
way
that,
like
some
of
the
compression
algorithms,
the
streaming
compression
algorithms
require
that
you
see
through
the
entire
file
before
you
do
compression
you're.
B
Yeah,
like
the
the
economics
going
back
to
steve's
point
on
upload,
like
we
storage,
is
cheap
for
a
day
and
then
overnight
compute
gets
really
cheap.
You
run
compression
overnight
at
like
z,
sid
13,
and
you
compress
overnight
and
the
next
day
your
like
your
compute,
was
basically
free
and
your
storage
cost
has
now
gone
down
and
you've
cut
it
by
fifty
percent
and
yeah.
That's
basically
the.
B
Yeah
and
also
that
we
have
no
way
of
upgrading
right
now
like
if,
if
we
go
from
z
stage,
six
as
the
default,
which
I
think
is
the
what's
in
the
image,
spec
there's
no
way
for
the
server
to
be
like.
I
want
to
use
zested
extreme,
which
can
have
significant
benefits
compared
to
the
whatever's
in
the
image
spec.
G: ...
B: It requires that you use a specific level, because the image spec does the content digest after the compression.
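A small demonstration of that point: because the descriptor digest is computed over the compressed bytes, the same tar compressed at two gzip levels yields two different digests, so a registry cannot silently re-encode a layer at another level without breaking references. (The sample data is a placeholder.)

```go
package main

import (
	"bytes"
	"compress/gzip"
	"crypto/sha256"
	"fmt"
)

// gzipDigest compresses data at the given gzip level and returns the
// sha256 of the *compressed* bytes, as the image spec does for layers.
func gzipDigest(data []byte, level int) string {
	var buf bytes.Buffer
	gz, _ := gzip.NewWriterLevel(&buf, level)
	gz.Write(data)
	gz.Close()
	return fmt.Sprintf("sha256:%x", sha256.Sum256(buf.Bytes()))
}

func main() {
	layer := bytes.Repeat([]byte("example layer content "), 1024)
	// Same input bytes, different compression levels => different digests.
	fmt.Println("level 1:", gzipDigest(layer, gzip.BestSpeed))
	fmt.Println("level 9:", gzipDigest(layer, gzip.BestCompression))
}
```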
A: ...
B: Yeah, so, I mean... there's two parts of it. I can turn it into more formal language and break up the two parts, if that's helpful, but yeah. Or I can wait for feedback on the issue before doing that.
B
I
believe
that
things
go
very
pear-shaped
if
you
start
to
mix
and
match
gzip
levels
within
a
given
registry,
in
your
your
cost
kind
of
explode
as
well.
G: ...

A: ...

H: Yeah, only Justin and vbatts commented, that I can see.
A: All right, why don't we give this some bake time, just because there's a few people that commented; it's certainly a meaty one, and go from there. And then, because this encoding one is very much a similar conversation...
C
I
think
simpler
for
us
to
resolve
like
that's
it's
just
it's
just
like
it's
just
adding
functionality.
That's
in
hdb
today,
just
formalizing
it,
the
duplicate
ones,
someone
wrapped
my
head
around
with
what
that
one's
asking.
A: ...

C: I would say formalize 235. Did we even discuss 236? Because I had that one open as well, but we haven't talked about 236 at all. Yeah, so yeah: I think formalizing 235, like you mentioned, makes sense. I'd like to see what, like, the formalized version of that would be, so we can run through where the client edge cases might be. Did I break up the...
B: ...

C: Gotcha. So, with... without that today, the clients would have to assume identity, because if they start sending compressed content up... I... yeah.
B
Yeah
or
our
experience
with
playing
with
this
with
registries,
is
that
they
just
will
take
the
content,
encoding
and
store
the
like
double
encoded
version
or
the
the
encoded
version,
and
then
when
they
serve
it,
they
don't
didn't
like
record
that
header,
so
things
go
really
weird
right
now,
so
you
need
some
way
to
tell
like
have
the
registry
say
that
I
support
this
feature.
B
Yeah
we
played
with
them,
I
think
so.
We,
the
the
standard
distribution
we
used
and
I
feel
like
the
other
one
we
used,
but
both
basically
just
took
the
blob
and
stored
it
and
ignored
the
content
and
coding
header
completely.
C: I...
B: So, where we are: we're decompressing the blob on upload and storing it in S3 decompressed, until the, like, cleanup process, the janitor, comes along, and then the janitor turns it into a zstd version. That's how we're going about it. And then our registry can see if this zstd version exists and will serve that up instead.
C
Yeah,
I'm
just
trying
to
think
of
how
a
registry
would
like
just
any
generic
registry
would
implement
something
like
this,
because
they
don't
necessarily
know
the
content,
the
content
type,
that's
being
uploaded.
They
only
know
the
digest
in
the
end.
A: ...

C: If you have a somewhat eventually consistent back end, right... for the most part, like, the manifests are uploaded... I guess the manifest is usually uploaded afterwards anyway, in the flows. So you don't even know: you just start getting bytes coming up. You'd have to look for compression headers.
B
The
I
mean
the
generic
way
to
implement.
This
is
say
that
you
only
accept
the
identity
in
coding
and
only
store
identity,
blobs.
B
So
that
that's
so
what
we
want
to
do,
at
least
for
our
use
case,
is
that
people
who
are
building
and
pushing
on
their
laptops
will
compress
with
zstead
people
who
are
building
in
the
data
center
will
upload
identity,
blobs
and
then
any
generic
registry
that
wants
to
implement
this
would
say.
I
only
accept
the
identity,
encoding
and
store
the
identity,
encoded
stuff
in
its
blob
store.
B
So
if
the
content
encoding
flag
is
missing
or
not
present,
it
is
implicitly
identity,
encoding.
C
I
guess
the
promise,
so
the
identity
coding
just
means
that
it's
completely
opaque
to
the
registry,
but
the
content
that
comes
up
could
still
be
compressed,
so
you
could
still
end
up
double
compressing
stuff
unless
you
actually
look
at
the
content
right,
yeah
but
like
so
yeah.
That's
that's
why?
I
think,
like
you,
you
really,
if
you're
an
environment
where
you're
controlling
the
build
side
and
the
registry,
it
makes
sense.
But
if
you're,
just
like
a
generic
registry
service
like
it
seems
hard
to
to
handle
that.
B: ...

D: ...

B: Right. Or, I mean, if either the client or the server had some level of intelligence, you could get benefit out of...
B: ...

D: Right, but that digest is on whatever compressed bits you upload. So either the client needs to know in advance that the registry is going to handle the compression, and thus provide it the identity digest, or the uncompressed digest, and let the registry handle the compression; or it needs to know that it should compress the blob and provide a digest that is of the compressed blob.
B: ...

C: Yeah, obviously you can't touch the content if it's part of the digest. I think that's kind of what it's getting at, though: that unless the builders are building with uncompressed data, you don't get any benefit from it. And you wouldn't necessarily automatically do uncompressed unless you knew you were using a registry that was taking advantage of this. So yeah, it makes sense to me. I would like to see, if you want to formalize that, what it would actually look like. It doesn't do any harm.
C: ...

A: I think there's an interesting point around: what do you do about gzip? And, wait, did I read that right?
B
I
mean
I
I
I
can
add
this
to
the
language
but
like
if
a
client
is
uploading,
data
entered
as
a
smarter
client
and
it
sees
that
the
server
side,
content
or
accepted
encoding
is
identity,
gzip
bz2,
let's
say,
but
it
only
wants
to
upload
a
targey
z
blob,
because
it's
builder
built
only
a
tarjay
zz
blob
and
the
manifest
that
it
has
is
only
a
charge,
easy
blob.
It
would
then
have
to
upload
a
targey
z
format
with
the
identity,
encoding.
A
I'm
just
trying
to
wrap
my
head
around
the
sequence
of
events
is:
when
does
it
get
known
right
because
the
blobs
get
uploaded,
they
get
uploaded
through
the
rest
api.
So
it's
not
like.
It's
go
straight
to
storage,
so
there's
a
chance
to
to
look
at
it,
but
the
manifest
is
a
separate
asynchronous
put
which
has
to
be
after
the
blob
is
set,
so
it
can
validate
it.
Where
is
the
understanding
and
correlation
of
the
two.
B: ...

A: ...

G: Gzip wouldn't be the only solution. The intention was that you could extend the image spec to, you know, have other formats as well, but this makes a lot more sense: to just use identity, you know, basic tar, when you're doing a push. Pushes aren't the primary case for using a registry, right? It's the pulls. So this solution, I think, makes more sense than the other prior discussions that we've had, right, where we could just push an identity and then have that be...
C
Good
intentions,
but
I
think
we
realized
pretty
early
on
that
that
wasn't
the
best
idea
to
just
ease
up
everything
and
it
it
was.
We
actually
thought
of
pulling
that
back
a
few
times
and
the
clients,
but
the
reason
we
didn't
is
because
yeah
the
havoc
it
would
reach
on
havoc
for
registries
which
aren't
handling
compression
at
all,
just
bloat
storage,
so
yeah,
it's
kind
of
a
difficult
problem,
but
the
client
I
mean
there's
nothing
stopping
builders
today
from
doing
uncompressed.
B: The really common use case is that someone updates the Ubuntu base image, and then everybody in the company builds on top of that new Ubuntu base image, and there's a docker push. And when they do the docker push, they end up having to pay for upload time on those lower layers that the registry already has.
B
We
don't
have
any
way
for
one
trusted
clients
to
get
a
deduplication
upload
unless
they
do
unless
they're
able
to
figure
out
which
other
repository
has
that
store
in
it
and
a
lot
of
clients.
Don't
keep
track
of,
like
repository
name
to
blob.
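For context, the mechanism that exists today is the distribution spec's cross-repository blob mount, whose `from` parameter is exactly what forces clients to remember which repository already holds the blob. A sketch (host, repository names, and digest are placeholders):

```go
package main

import (
	"fmt"
	"net/http"
	"net/url"
)

// mountBlob issues the existing cross-repository blob mount request.
// 201 Created means the blob was mounted with no data transfer;
// 202 Accepted means the mount failed and a normal upload session began.
func mountBlob(registry, targetRepo, sourceRepo, digest string) (bool, error) {
	u := fmt.Sprintf("https://%s/v2/%s/blobs/uploads/?mount=%s&from=%s",
		registry, targetRepo, url.QueryEscape(digest), url.QueryEscape(sourceRepo))
	resp, err := http.Post(u, "", nil)
	if err != nil {
		return false, err
	}
	defer resp.Body.Close()
	return resp.StatusCode == http.StatusCreated, nil
}

func main() {
	ok, err := mountBlob("registry.example.com", "team-b/app", "team-a/base",
		"sha256:0000000000000000000000000000000000000000000000000000000000000000")
	fmt.Println("mounted:", ok, "err:", err)
}
```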
B
So
this
basically
allows
the
client
to
say
I
am
trying
to
upload
file
with
blob
descriptor
sha,
256
foo,
and
the
registry
then,
can
do
a
proof
of
data
possession
challenge
against
the
client
and
allow
the
client
to
prove
that
it
has.
This
data
to
securely
do
deduplicate
at
upload
time.
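One plausible shape for such a challenge, as an illustration only (the call does not specify the proposal's exact scheme): the registry returns a fresh random nonce, and the client proves possession by hashing nonce plus blob, which the registry can verify against its own copy.

```go
package main

import (
	"crypto/rand"
	"crypto/sha256"
	"fmt"
)

// newChallenge is the registry side: a fresh random nonce per upload attempt.
func newChallenge() []byte {
	nonce := make([]byte, 32)
	if _, err := rand.Read(nonce); err != nil {
		panic(err)
	}
	return nonce
}

// prove hashes nonce || blob. Because the nonce is fresh, the proof
// cannot be replayed or derived from the public sha256 digest alone;
// the verifier recomputes it from the bytes it already stores.
func prove(nonce, blob []byte) [sha256.Size]byte {
	h := sha256.New()
	h.Write(nonce)
	h.Write(blob)
	var out [sha256.Size]byte
	copy(out[:], h.Sum(nil))
	return out
}

func main() {
	blob := []byte("layer bytes held by both client and registry")
	nonce := newChallenge()           // registry -> client
	clientProof := prove(nonce, blob) // client -> registry
	fmt.Println("verified:", clientProof == prove(nonce, blob))
}
```

Note that a construction like this forces the verifier to re-read the whole blob per challenge, which is the "ad hoc generation of proof points" trade-off raised later in the call against schemes that store precomputed proofs or use tree hashes.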
A: So I love the idea, because... I've always hated that when I push an image that the registry should surely know of, I have to wait for the upload just for it to say: yep, I already got it. But it had to upload it first, and then it just tosses it on the server. Like, for me, the user, the expensive part was already paid.
A
The
thing
that
I
just
always
get
nervous
about
is
the
disclosure
challenge,
so
I
think
we
just
have
to
somehow
specify
that
and
I'm
just
going
to
look
at
mike,
because
I
can
see
him
standing
there
if
mike
and
I
have
repos
next
to
each
other
on
the
same
registry
and
that
registry,
you
know,
has
decided
that
it
doesn't
allow
sharing
of
layers
across
two
security
boundaries,
then
that
it
shouldn't
it
shouldn't
basically
acknowledge
that
that
layer
exists,
because
that
basically
tells
me
that
that
content
already
is
in
the
registry,
and
I
can
somehow
circumvent
it
now.
A
If
mike
and
I
have
the
same
permission,
boundaries
and
I
happen
to
have
probably
push
in
addition
to
pull
rights,
then
it
makes
perfect
sense
for
it
to
be
smarter
and
not
upload
and
to
be,
you
know,
just
say,
don't
even
bother
uploading.
I
already
have
it.
B
Yeah
so
there's
this
is
addressed
in
the
proposal
where
the
registry
can
do
one
of
two
things:
it
can
either
completely
ignore
issuing
challenges
and
never
issue
a
challenge
allowing
the
user
to
deduplicate,
or
it
can
always
issue
a
false
challenge.
The
issue
with
with
this
is
that
at
push
time,
if
the
user
tries
to
fulfill
that
challenge,
you
are
open
to
a
bunch
of
cryptography,
related
timing
attacks
and
that's
called
out
explicitly
that
this
does
not
try
to
be
resistant
to
this
proof
of
data.
B
Possession
protocol
does
not
try
to
be
resistant
to
timing
attacks
and
that
trust
boundary
evaluation
would
have
to
be
done
at
evaluation
of
the
proof
of
data
possession.
B: On the other hand, registries like Docker Hub: if a Docker Hub node, or the Docker Hub store, has a public image, it can issue challenges against images that refer to blobs in the public, but you never want to refer to someone else's blob in the private. Now, that's complicated logic. So, like, you know, for Docker Hub, it would probably never issue challenges, but for corporate registries, or other registries... like, you know, if you have an Amazon ECR, you might want to be able to do this.
B: ...

A: So I think it's just: if you address the security-boundary access issue and allow that negotiation to be done. Because we've had this debate before here, where some registries consider it perfectly fine to share images across org boundaries, because there's more beneficial savings in storage than the concern for, you know, hacking somebody else's layer or somebody else's image. So I think there's just a trade-off. So, as long as the registry and the client can negotiate that, and let the registry decide what the boundary is for them.
A
That
point:
that's
why
I
kind
of
use
mike
and
I
as
two
repos
like
it-
should
allow
cross
repo
mounting
not
just
in
the
same
repo,
but
it
must
somewhere.
There
must
be
a
determination,
it
says,
but
I
actually
do
have
the
rights
to
mike's
repo,
for
instance.
So
it's
allowed
to
to
do
that.
I
think
we're
going
to
see
this
more
and
more.
It's
not
just
working
from
home,
we're
pushing
more
and
more
for
ephemeral,
client
build
environments
because
it's
the
only
way
to
have
a
safe.
A
You
know
build,
so
does
that
mean
that
that
every
one
of
those
clients
isn't
going
to
know
about
the
images
that
it
already
pushed
or
the
layers
that
already
pushed
so
that's
kind.
G: ...

B: I mean, the thing that you don't want, like... even if you say that your challenge protocol is resistant enough to attacks that you're okay with, like, full sharing... so, like, in a corporate environment, we're basically fine with disclosing that we have stored a specific blob, but we don't want, like, HR to be able to...
B
You
know
fetch
it's
blogs,
or
vice
versa,
if
they're
uploading
across
different
parts
of
the
repository
different
parts
of
the
the
registry,
so
we
need
some
way
for
them
to
prove
that
they
actually
have
initial
access
to
this
blob.
B
Somehow,
our
initial
possession
of
this
blob
and
that's
what
the
proof
of
data
possession
protocol
is
about-
and
I
think
justin
was
the
one
that
brought
up
a
concern
around
the
proof
of
data
possession
protocol,
and
you
know
whether
we
would
have
a
preference
for
a
proof-
data
possession
protocol
that
requires
a
ad
hoc
generation
of
proof
points
or
just
allows
people
to
store
an
unfinished
digest.
B: So you need to have a Merkle-tree construct for a hash: something like SHA-3, or BLAKE3, or BLAKE2, or similar.
A: ...

B: Yeah, I mean, both proof-of-data-possession protocols are in there. It turns out that, like, cryptography language is very complicated, and it turns out we have one cryptographer at work, so I wouldn't want to bring a data-possession protocol to them that turns out to not be sufficient for most people's security needs.
B
It
doesn't
satisfy
the
trade-off.
So
I
think
what
I
would
like
is
a
an
under
like
to
understand
what
the
community
thinks
in
terms
of
minimal,
viable
security,
and
then
you
know
talk
with
folks
internally
to
kind
of
get
some
formal
language
and
get
a
formal
definition
of
this.
The
spec.
A: ...

B: We don't really have that problem... there's two things. One: our clients don't necessarily know where they got those layers from, because those layers were given out via... I think we're using BuildKit and build, or something like that, and it doesn't keep track of this information. Or people are pulling from Docker Hub and then pushing to our internal registry.
B
So
so
that's
like,
like
let's
say,
mike's
registry
is
on
docker
hub
and
everyone's
building
on
top
of
him,
his
image.
When
people
build
and
push
to
internal
registries,
the
internal
registry
can't
dedupe.
Now,
if
you
relax
the
guarantees
to
say
that,
like
any
repository
can
share
blobs
with
any
repository,
you
can
totally
do
what
you're
saying
we
don't
want
to
completely
throw
security
to
the
wolves.
B
We
want
a
little
bit
of
security
and
being
like
you
know,
you
have
to
prove
that
you
had
mike's
blob
at
one
point
and
you
can't
just
probe
the
registry
for
random
hashes
to
see
if
it
has
that
in
there
or
not
the
specific
risk.
Kind
of.
Is
that,
like,
let's
say
that
there's
a
known
vulnerable
layer,
these
security
people
publishes?
B: ...

A: Yep. So I pull an image from Docker Hub, it has layers A, B, and C, and Mike pulls an image that has layers A, D, and E. The first one of us that pushes to that registry will get layer A in there. And then, let's say Mike goes first and then I push: does layer A get tossed because it got pushed to the registry and it knows it's deduped, or could the client tell the registry: by the way, I have layer... I have digest A, do you know of it?
B: ...

A: I don't have access to yours; you're in a third repo in the same registry. The way we actually do it in Azure is per-registry sharing, and we're discussing doing more granular. But if Mike and I have two different registries in Azure, another ACR, we'll never share layers between each other, right?
B: But in the case that I'm talking about, we have lots of different teams that cannot pull each other's images, right? But we want to be able to dedupe across those, and do this without allowing people to steal other people's layers arbitrarily.
D: ...

A: I guess I'm being... I didn't really think about it. "If I can't pull it, then I shouldn't dedupe it", I guess, was the thought, so I was tying the permission boundary closer. And as long as it's defined, the registry can define what... like, what I guess I'm trying to say is: I know registries try to do different optimizations, and we should allow that.
A: ...

B: In the proposal, that's also talked about: no proof of data possession, and totally relying on the trust boundaries of the registry. That's a doable thing as well, and I think, if that's where we want to start, I am fine writing up the language for that. And then, if we wanted to get into more complicated use cases, like soft multi-tenancy, then there's extensions, and I want to make sure that we leave a specification that's open to such extensions.
G: ...

A: But that's what I'm kind of getting at: to know that the layer digest exists in the registry... if I could have pulled it, does it matter that I found out that it has it? Like, what's the... and I'm just asking; I'm really trying to think: what is the problem if I could pull that digest, versus when you can't?
C: ...

A: ...

B: And the problem kind of comes in any time you need to do an authz evaluation: you're typically scanning through a big list, and you do that in two steps. The first thing you do is determine that this blob exists, and then you look at everyone who can reference it, and you go through everyone who has access to those references and validate this. That becomes, like, a big timing vulnerability, because, at least with something like S3, it's super slow: like, you have seconds between those steps.
B
So
if
you
know
it
takes
zero
seconds,
you
can
say
this
registry
doesn't
have
this
blob
if
it
takes
one.
Second,
you
can
say
this
registry
has
this
blob,
but
at
two
seconds,
if
it
rejects
you,
then
you
know
that
this
registry
has
this
blob
and
you
just
don't,
have
access
to
it
and
in
the
reason
why
you
don't
want
this
beyond
like.
B: ...

C: I guess, here, though: when does a registry actually return this proof of data possession, or this request for it, in the first place?
B
So
when
the
user
initiates
the
upload,
they
would
say
I
am
uploading
blob,
descriptor,
foo
and
the
registry
at
that
point
can
issue
a
challenge.
But
why
would
a
registry
issue
the
challenge
like
it?
Can
you
you
have
to
you?
Have
you're
gonna?
You
have
three
choices,
you
can
say
I
will
never
issue
challenges.
B
I
will
issue
challenges
on
potentials
of
d-dupes
or
I
will
always
issue
challenges
and
just
invalidate
them
from
from
a
security
perspective,
and
this
is
where
I
need
to
go
talk
to
our
like
crypto
people
and
see,
if,
like
those
are
all
viable
options
before
I
like
write
language
up
around
that,
but
I
think
that
if
you
just
say
like,
if
you
just
do
the
check
of
existence
and
don't
provide
a
challenge,
if
it
doesn't
exist,
you
might
be
able
to
get
around
the
timing
problem.
C
Well,
I
guess
yeah.
The
point
is
usually
the
way
we
do.
These
fallbacks
is
yeah.
If
the
registry
is
basically
giving
up
the
information
that
it
has
that
blob
already
by
issuing
the
challenge,
and
if
it
doesn't,
then
it
really
changes
the
protocol
like
if
it
always
issues
the
challenge,
whether
or
not
it
has
the
blob
or
not,
then
that
kind
of
messes
up
the
protocol
from
the
client
perspective,
because
now
they're
always
trying
to
prove
they
have
something
that
the
registry
will
make
them
upload.
The
data
anyways.
B
Yes,
I
guess
you
have
to
do
the
authy
check.
I
guess
let
me
think
about
this
a
little
bit
more
then
and
see
if
there's
a
more
secure
way
to
do
this.
Otherwise
we
we
have
to
yeah
yeah.
A
I
mean
we're
already
tossing
dupes
today
what
I
just
I
just,
never
really
poked
to
figure
out
when
that
happens
like
I
know
it
gets
uploaded
and
then
eventually
we
say
yep
we've
got
it
ready
and
we
toss
it.
What
I
don't
know
if
that's
done
asynchronously
later
on
or
when
that
operation
happens.
So
this
whole
auth
check
is
happening.
It's
just.
I
haven't
really
poked
deep
enough
to
know
where
that
check
happens,
to
know
like
you're
bringing
up
something.
C
Know
from
the
client's
perspective
they
don't,
they
shouldn't
know
like
yeah
there's,
probably
you
could
probably
poke
into
any
registry
and
there's
gonna
be
some
sort
of
timing
attack.
To
tell
like
oh,
I
uploaded
something
that
already
existed
but
based
on
the
response,
but
there
shouldn't
be
anything
in
the
protocol
that
leaks
that
information.
A: I've got a hard stop, so until next... there was one agenda item left; we'll just move it to next week. The template's at the bottom, and we'll pick up next week.