From YouTube: Kubernetes SIG API Machinery 20180606
Description
For more information on this public meeting see this page: https://github.com/kubernetes/community/tree/master/sig-api-machinery
A
A
B
In the garbage collector we're working on the resync mechanism that can cause a deadlock in the controller and in the tests. The test runs through the whole test period that involves a controller failure; it has hit a deadlock, and the fix was force-merged two days ago. Okay, Jordan, I'll copy you on the issue. Thanks so much, I'll see how it goes. Okay, cool.
C
So we're seeing two kinds of categories of test flakes. One is failures at the beginning of the test: you have API-machinery-owned tests that are just in the setup phase, like create a ReplicaSet and wait for it to create a bunch of pods, and then we were gonna go do garbage collection tests on it, but the test is actually failing in the setup step. I've spent a fair amount of time trying to figure out what's going on here and haven't had much success.
C
It doesn't — there's nothing obviously related to garbage collection or API machinery at all, really. And similarly for the aggregated test: it's running a deployment and waiting for it to complete before getting into the body of the test, and we're seeing fairly big numbers of flakes in the setup. Stefan, I don't know if that's something that we should reach out to SIG Apps on, or if it's systemic, like watches are slow and it's a scalability thing. I'm not sure.
C
The next two bullet points are ones that actually appear to be things we need to drive, and so I've linked to the triage boards; I've dug into both of those a fair amount. We tried to add — I know Jenny added some more logging output to the first, and I added more logging output to the second — and we're still not having a lot of success in driving these to ground. But those are ones we'll stay on. Okay.
A
A
D
Yes, I see someone actually answered the question yesterday. I was surprised that Slack didn't notify me. But anyway, the other half of my question is really about etcd, and I don't know if anybody knows the answers to these questions, but it looks like my questions are now for etcd, so I posted about it on etcd-dev and put it in a Slack channel. My question — I don't know, I'll just try raising it here and see.
D
If anybody thinks that this is a useful forum for discussing it — yeah, it seems like something that users would basically want to know, and I didn't find it; maybe I'm just missing something. But this is, you know, a planning exercise, alright? So I'm planning for a workload. Suppose I can estimate a stream of writes, where each write is characterized by a revision number, a key, and a value, and presuming that all the — so my question is, I'm planning for database size.
D
A
D
D
E
Yeah, in principle it's the size of the objects multiplied by the combined write effects over your compaction interval. So in the Kubernetes case you'd have to — it would be — unless you have a dominant object. So in one use case, where we have Node objects that are large and those were written at a pretty high rate as heartbeats, those dominated the space, and we can basically calculate it based on that.
D
E
So for that, if you have, say, a two-kilobyte Node object and you have 1,000 nodes, then that's roughly, you know, two megabytes there — a couple megabytes. And then if you write them once every 10 seconds and your compaction interval is five minutes, you just multiply them out. Okay, well.
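As a back-of-the-envelope check of that rule, here is a small sketch (all figures are the assumed numbers from this example, not measured etcd behavior):

```python
# Rough etcd storage estimate: live objects plus the revision history
# retained just before a compaction (each update is a full copy-on-write record).
def etcd_size_estimate(object_bytes, object_count, write_interval_s, history_s):
    live = object_bytes * object_count
    revisions_per_object = history_s // write_interval_s
    return live + live * revisions_per_object

# 2 KiB Node objects, 1000 nodes, written every 10 s; with a 5-minute
# compaction interval, up to ~10 minutes (600 s) of history accumulates.
size = etcd_size_estimate(2048, 1000, 10, 600)
print(round(size / 2**20))  # prints 119 -- about 120 MiB just before compaction
```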
D
Let me make sure I understand what the multiplier is. Right, so Kubernetes — every five minutes, in the default configuration, Kubernetes is going to do a compaction to the revision that was current five minutes ago. So, just before the compaction, you have ten minutes of history. That's correct! So if I'm — yes — let's just take, say, a nominal cluster size, say 5,000 nodes, heartbeating once every ten seconds, so that's 500 updates a second.
D
D
E
Well, that's — the database file does not automatically size itself down, so the way — yes, so that's probably what you're mentioning. Yeah, that's an effect, right: if you have a lot of activity, it'll grow your database size, and then when it shrinks back down, what you'll have is a mostly empty database with mostly free pages, but it won't actually size the file down unless you defragment. Now,
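A toy model of that file-size behavior (illustrative only — this is not bbolt code, and the page counts are made up):

```python
# The backing file acts as a high-water mark: freed pages go onto a free
# list for reuse, and the file only shrinks when you defragment.
class ToyBoltFile:
    def __init__(self):
        self.file_pages = 0   # size of the file on disk, in pages
        self.used_pages = 0   # pages actually holding data

    def write(self, pages):
        self.used_pages += pages
        # reuse free pages first; grow the file only if none are left
        self.file_pages = max(self.file_pages, self.used_pages)

    def delete(self, pages):
        self.used_pages -= pages  # pages become free; file size is unchanged

    def defrag(self):
        self.file_pages = self.used_pages  # rewrite the data compactly

db = ToyBoltFile()
db.write(1000)        # burst of activity grows the file
db.delete(900)        # compaction frees most of it
print(db.file_pages)  # prints 1000 -- mostly empty, but still full-size
db.defrag()
print(db.file_pages)  # prints 100
```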
D
D
D
D
E
D
You don't know for sure; that might — so let me just repeat my example, and it's just part of the question. If I just, say, create a cluster — okay, I create a cluster, creating nodes at 10 a second. Forget the heartbeats; this is just, say, creates — just talk about creates per minute. Okay, I'm just creating nodes, 10 a second; they don't heartbeat. Okay, so just before compaction, there are 6,000 nodes that had been created in the last 10 minutes, and there's another 800, say, by the end of the day.
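The create-only arithmetic in this example works out as follows (rates are the assumed figures from the question):

```python
# Nodes created at a steady rate and never updated: just before a
# compaction, the history window still holds every recent create.
create_rate_per_s = 10
history_window_s = 10 * 60   # ~10 minutes retained just before compaction

retained_creates = create_rate_per_s * history_window_s
print(retained_creates)      # prints 6000 -- matching the example
```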
B
E
A
A
D
I'm not sure; I don't have clear evidence on that, because I'm doing a mixed workload. My query started by asking about just pure creates; I think I've got an answer and it's plausible. So now let's move on to updates. So suppose that I'm — let me make an example here. Well, let's try asking: so I think what — so if there is an update since the last compaction — say, for a given object, there's been one update since the last compaction.
D
E
And they're all full copies — it's copy-on-write, so they're full records. Okay, now let's do delete: how much space does a delete take up? A delete is basically just a tombstone. How big is that? I don't know exactly, but it's a couple bytes — okay, very small — plus a few [inaudible].
C
E
D
Actually, one follow-up to my question: it seems like the question I was trying to ask is something that should be documented. Doesn't it seem reasonable for this to be in the etcd documentation?
D
E
A
E
A
Okay, let's see — did you say you're going to raise it with them? You know, that would be great.
A
G
G
A
Personally, I don't have any objection to this, and in fact I think that, to continue to scale, the sig needs to delegate more to some projects and, like, treat those as first-class citizens. So if you think that makes sense, I don't have any procedural objections or other — okay — sorts of objections.
G
Right, good. We are creating, like — the name of a repo is — does anyone from the sig feel that they want to review, like, the naming conventions maybe we're establishing for these repos first, or the repos themselves? — Yes. What name are you thinking? — We're thinking about "platform" as a suffix, so starting with "controller platform".
G
A
A
G
A
H
I'm Patrick, what's up. We're trying to push through dynamic auditing, a way to dynamically configure the advanced auditing features. This would be primarily owned by sig-auth, but we are looking for participation from you all here — and that's been one of the asks from sig-arch as well, that you all are involved — so I'm just trying to bring awareness to the topic and see if we could possibly get some reviewers on the documents, and approvers potentially from the sig as well. — All right, you're looking for reviewers and —
A
H
B
I
A
That makes sense. I think the project overall — so there's two things going on here. There is just getting APIs reviewed in general; I think the project is super bottlenecked there. I think, honestly, Jordan has been doing far more of those than is fair, so yeah, that's not helping for the project. The other thing is, if there is a webhook, it is adding a net-new webhook. No?
C
A
C
And also, I think, adding the possibility of — like, not multiplexing, but I believe today you can only configure a single audit webhook, and so this would be, presumably — maybe that's part of kind of getting back to the use cases — letting you create more than one, with different verbosity and focus and filtering. And, yeah.
F
A
C
A
B
B
A
H
A
B
A
C
C
B
A
A
C
I added some logging to try to dump information about what was going on inside the pods. But if someone who has access to the Stackdriver stuff, where the container logs get scraped, could take a look at it, that might be more helpful. I know Walter had done that a few times, but I think that might be Google-internal still. Yes.
A
B
C
Then the thing hitting the service endpoint was just getting a 404 back, inexplicably. I added a fair amount of logging around that, and we're still seeing the failures, and the logging didn't turn up anything that seemed useful. Jenny saw some denial errors, but those are consistent on good runs and bad ones, so they're not it.
C
A
I have seen there's an issue floating around with somebody complaining that if the metrics API server isn't healthy — well, the condition that I'm worried about is: if the metrics API server isn't healthy, then the aggregator can't aggregate the OpenAPI, and that somehow prevents it from serving requests for API services that are healthy. At least that seems to be the claim in that issue. I'm not sure if that's really the case, but if it is, we should fix that. That's not a good failure mode.
C
I think it was during registration: at one point there was a synchronous path that, like, on the first hit could block aggregation. So existing things would keep serving fine, but if you registered a new APIService while the OpenAPI aggregator was trying to do the first hit on one that was failing — wow — other things would get blocked from registering as well.