From YouTube: Kubernetes SIG Node 20200225
Description
Meeting Agenda: https://docs.google.com/document/d/1j3vrG6BgE0hUDs2e-1ZUegKN4W4Adb1B6oJ6j-4kyPU
A
And I think Giuseppe can give an overview, but we are already running quite a lot of tests, something like 140 node conformance tests, and next the target is running all of the node conformance suite; we can check next week how that goes, sure. So this is just a status update. You want to ask: what's the blocker? Oh yeah, yeah. Basically, we want to get the KEP [approved] to make sure that we can get phase 1.
A
So we discussed this some time last year and I wanted to bring it back up. To give a brief overview: today, when a pod has a termination grace period, it's only ever passed when we are calling StopContainer. The problem we have today is that when we are rebooting a node, there's no guarantee which process will get killed first; we are relying on systemd to kill our pods. Systemd has a setting where you can specify the termination grace period, the stop timeout on the scopes, and if we set it there, then systemd will actually do a graceful termination: it will send a SIGTERM, wait for that time, and then send a SIGKILL. But we can only set that if we pass it down as part of RunPodSandbox, so we can set it for all the containers in that pod.
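
For context on the systemd setting being described: the stop timeout on a unit, including the transient scope a pod runs in, is TimeoutStopSec, exposed over D-Bus as TimeoutStopUSec. Below is a minimal Go sketch of setting it through the go-systemd bindings; the scope name is invented, and this illustrates the mechanism rather than the actual kubelet change under discussion.

```go
package main

import (
	"fmt"

	systemddbus "github.com/coreos/go-systemd/v22/dbus"
	godbus "github.com/godbus/dbus/v5"
)

// setScopeStopTimeout asks systemd to wait `seconds` between SIGTERM and
// SIGKILL when it stops the given scope (for example on node shutdown).
func setScopeStopTimeout(scope string, seconds uint64) error {
	conn, err := systemddbus.New() // connect to the systemd manager on the system bus
	if err != nil {
		return err
	}
	defer conn.Close()
	// TimeoutStopUSec is the D-Bus name of TimeoutStopSec, in microseconds.
	return conn.SetUnitProperties(scope, true, systemddbus.Property{
		Name:  "TimeoutStopUSec",
		Value: godbus.MakeVariant(seconds * 1000000),
	})
}

func main() {
	// "kubepods-pod123.scope" is a hypothetical scope name for illustration.
	if err := setScopeStopTimeout("kubepods-pod123.scope", 30); err != nil {
		fmt.Println(err)
	}
}
```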
B
Yeah, I think I ran into that problem for the case where that is set to [nil], and what we can do about it, but I don't recall any other problem. I don't think... I think that's doable, right. So I saw your PR; I just saw it come up.
C
The reason I brought this back is that we sort of keep talking about it at this meeting, then end on a "well, let's take this offline" kind of note, and then sort of get nothing back. So I think we're still in this place where I'm not sure we've even agreed that it is a bug. You know, when I brought it up, there was general agreement that it's a bug, and there were two different proposals on how to fix it, and I was hoping for some kind of steer or blessing about one proposal or the other. But we've kind of gone down this route and come full circle, so I want to get to at least some resolution of: is this a bug? Are we going to fix it? And if so, at least, what is the path forward for getting a fix?
B
We want to change the hook to become asynchronous, and that actually changes the semantics, because many people run those hooks, and that could be a potential [breaking] change to the container state. If you think about containers today, you have to wait anyway; you have to wait for the hook to come back. This...
B
One moment. It is because the container status, actually the initial container status, is part of the pod status. When we designed the lifecycle management we had one principle: only once every single container within the pod has run at least once do we define that the pod is in the Running state. A container might die later, or other kinds of things can happen, but that was the idea. So it's part of the pod lifecycle management.
C
The problem is that while the hooks are running we're not getting the pod IP on the pod status. So one proposal was: as soon as you set up the sandbox and get the pod IP, send a pod status update that contains the pod IP, with the container statuses unchanged, since the containers haven't been started yet. The other proposal was to allow the hooks to run asynchronously.
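
To make the first proposal concrete, the interim update would look something like the status below: the IP is filled in from the sandbox, but every container is still Waiting. This is a hand-written illustration of the shape, not code from either PR; all values are invented.

```go
package sketch

import corev1 "k8s.io/api/core/v1"

// interimStatus illustrates an early status update: the sandbox is up and the
// IP is known, but no container has started because hooks are still running.
func interimStatus() corev1.PodStatus {
	return corev1.PodStatus{
		Phase: corev1.PodPending,
		PodIP: "10.244.1.17", // learned from RunPodSandbox
		ContainerStatuses: []corev1.ContainerStatus{{
			Name: "app",
			State: corev1.ContainerState{
				Waiting: &corev1.ContainerStateWaiting{Reason: "ContainerCreating"},
			},
		}},
	}
}
```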
B
So let me try to explain why people think this situation is not a bug. If you think about it, even before [the containers run] we could update the pod status; but other people think the pod itself is the smallest unit of management created in Kubernetes, and this was one of the design principles initially. I'm just stating what I remember from when this was discussed before. Okay, I'm not saying which side I'm on; I just tried to summarize what I recall.
B
So people were worried about sending the pod status before the first full sync of the pod: sending a partial status in the middle could end up with just the pod IP, and then we might decide to kill and restart the pod. So they tried to avoid such a situation. They also worried about anyone using that mid-sync pod status update that only carries the pod IP but not the full picture of the pod status. So this is one of the initial, foundational design principles in Kubernetes, especially in pod lifecycle management: they want every single container to run at least once, and based on that they determine the pod status. So for your proposal, no matter whether it is the async hook
B
or [the early update], where, in the middle, after the sandbox is created, I have the pod IP and send the pod IP, but the container statuses are all missing, just Pending or whatever: they think that's not the original pod lifecycle management. So this is basically, as far as I can remember, the argument there. Okay.
B
Then the real question is, since we just keep coming back to this: can we have a good way? Because it's just status, right; from my perspective it is just status, and maybe it could later change, and I think there's a way to say how we're going to redo those kinds of things. But then the concern for the engineers here is the complexity. It's not just, like, okay, send
B
another status; it is the complexity of changing the whole sync loop inside the kubelet. So the last unresolved question, especially for the async hook, is to look at the potential: can we send the pod IP earlier, before we have the complete pod status? I think there is also some imposed complexity there that I didn't really [dig into].
C
We're just looking for an excuse not to accept a PR, rather than saying, you know, we have this budget of QPS, and making a concrete statement about it: if the QPS stays under X, or if the QPS doesn't increase by more than Y percent. If we can make those kinds of statements, then I think it's worthwhile to maybe go down into this sort of "well, let's measure it" kind of modality. But I don't want us to just sort of say "well..."
B
People have different problem thresholds. The problem is, I think, that in the community many people don't see this problem; so far it has only affected Calico, and that's why. So it is not, like, a common problem everyone is trying to solve, but they do share a common concern: many people think a single change like this could potentially cause an actual problem for the community, a step [back] here for reliability.
B
So that's what I tried to propose. Initially I actually agreed that maybe we should fix this one, but I got huge pushback from the community. So if I were trying to state it here: it's not "here's the data, I don't think this causes the problem"; I could prove whether this actually helps or not in the Calico case, and also that it's not really, it's not that disruptive. So this...
G
It seems like we've been talking on a more nebulous level rather than talking about this actual code and what it does. I mean, yes, it looks like we're going to, you know, add a status update, but I mean David is right in that, you know, it looks like, let's see: the GCE 100-node performance job has been run on this, the Kubemark one, and the GCE big one has been run on this, and both of them passed. I mean, in terms of passing, I'm not sure what that means.
B
I think if we know how much increase, which they've asked about on the proposal, then we can make the next step. At least, my only concern is the concept; I think we need to measure it first. But others do have other concerns, like people thinking about the added complexity, and I personally don't buy that. Well, so we could address those problems in that review PR. So that's kind of a separate issue.
C
So the one that Seth linked to is the one that sends the extra pod IP. So if that is the way that we, as a SIG community, would prefer to solve this problem, then that's the PR to review. There was another one that he opened and then closed, and I'm not sure why he closed it.
C
[I don't have a] strong preference for one solution over the other, and if there's a problem with this particular PR, right, the one that, you know, Ted sent, and we have a problem with that one, then, you know, me and my organization can consider: okay, we'll do our own PR that does this better, if there are issues with that. But before I go down that route, I would like to have a decision made about what is even the right way to approach this solution. I didn't
C
No, in this case Calico is not a CNI plugin. Calico is handling network policy, but it's not a CNI plugin, so we're watching the API server to find out about pods and what their IPs are, so we can enforce network policy. And if the hook wants to talk on the network and just waits until the network is available, it'll wait forever, and it will block Calico from finding out the IP that it needs to set up networking. So there's this deadlock.
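
A self-contained illustration of the deadlock being described, with an invented pod: the postStart hook polls the network, the kubelet won't publish the status carrying the IP until the hook returns, and the policy controller can't open the network until it sees the IP. This uses the core/v1 types of that era, where the hook type was still named Handler.

```go
package sketch

import (
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// deadlockPod is a hypothetical pod whose postStart hook blocks on the network.
func deadlockPod() *corev1.Pod {
	return &corev1.Pod{
		ObjectMeta: metav1.ObjectMeta{Name: "hook-deadlock-demo"},
		Spec: corev1.PodSpec{
			Containers: []corev1.Container{{
				Name:  "app",
				Image: "busybox",
				Lifecycle: &corev1.Lifecycle{
					PostStart: &corev1.Handler{
						Exec: &corev1.ExecAction{
							// Loops forever if network policy never admits the pod,
							// which in turn blocks the status update carrying the IP.
							Command: []string{"sh", "-c",
								"until wget -q -O /dev/null http://example.com; do sleep 1; done"},
						},
					},
				},
			}},
		},
	}
}
```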
D
So it's a simple review-approval request for two PRs. One is part of a series of [PRs] and is actually covered by the [release] exception, so we still have a chance to get it into 1.18, and the other one is a bug fix. Both of them have been in review for quite a long time. The first one was actually accepted and has already passed API review; only the last review step is actually needed for it to be merged, and Jordan actually promised it would be merged
D
if SIG Node reviewed it one last time. It was reviewed a couple of times by Derek, and I updated it a couple of times, but from my point of view it's in good shape, and if anybody from SIG Node can [give it a look and] accept it, that would be great, because we are kind of approaching the final deadline for 1.18 and I'm starting to be a bit concerned about the future of this PR. And the second one is actually, I
F
So I'm not the author of the proposal, of the KEP, but I just want to raise the attention of the SIG maintainers, if they can take a look at it, because again, the KEP has already passed a lot of iterations. Someone already did some review on it, but it would be nice to hear from the maintainers whether it's acceptable, whether it's near the finish line, what stage it's at.
B
Okay, the next one. I think the next one is, I think, the PR that is next. The tests have always kind of passed, so I believe [unclear], and somehow we're not [unclear]. We don't have enough of the [approvers] even with [unclear], but from the SIG perspective we both approve those tests. Done.
K
Yeah, that's me, hey. So, yeah, I'm working with Victor and with Artyom, and yes, I discovered, reviewing the prow jobs, that these tests are not run; these tests are not running. And I'm having some trouble actually testing in my environment, so I'm just asking: what do you suggest? Just to remove the whole thing? I think not. Or whom should I ping to get help with having the tests run in prow?
M
So I have a PR that's ready for the API changes, but I am running behind on the implementation as well. I got pulled off because we had a couple of people from the company stuck in China and then I had to go and fill in for some of the work, and I'm not really confident that we'll make the March 5th code freeze with high quality. But the API review, I think, we can start looking into, and then at least have it ready. I'm not sending a PR to kubernetes
M
yet, because I want the implementation to be in pretty good shape before I send the PRs together. But I'm going to ping Tim and Jordan about this pull request, number one, which is in my repo at this point. If David, Derek, you guys want to take a look at this and have any comments, please let me know, but I believe the last changes I committed just a little while ago should address the comments that they had.
M
Tim's feedback was mostly about using things like dropDisabledFields and a couple of fixes in the code, and he had some questions about the comments to clarify, and I addressed those. Jordan had a couple of fairly important changes: if we have a kubectl or a client-go that is of a lower version, we need to support that, and when I used SetDefaults, the defaults in defaults.go that set the default values were getting in the way. So what I did was remove
M
the changes from defaults.go, and then I'm using the admission controller, the plugin we have added, to set the defaults. And I was able to figure out how to create an older, down-level kubectl and then test with that, and it seems to do the right thing so far. So I've created the code with the latest changes and then added a lot of unit tests as well, so I have a fair amount of confidence in this change.
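
For readers unfamiliar with the convention being referenced, dropDisabledFields is the pattern Kubernetes API strategies use to clear a feature-gated field on incoming objects when the gate is off, unless the existing object already uses it. The sketch below shows only the shape; the types, field, and feature check are all invented stand-ins for the real API packages.

```go
package sketch

// Container and PodSpec are invented local stand-ins for the real API types.
type Container struct {
	NewField *string // hypothetical feature-gated field
}

type PodSpec struct {
	Containers []Container
}

// featureEnabled stands in for utilfeature.DefaultFeatureGate.Enabled(...).
func featureEnabled() bool { return false }

// dropDisabledFields clears the gated field when the feature is off, unless
// the old object already used it, so updates from older clients cannot
// silently wipe data and disabled features cannot be newly set.
func dropDisabledFields(spec, oldSpec *PodSpec) {
	if featureEnabled() || newFieldInUse(oldSpec) {
		return
	}
	for i := range spec.Containers {
		spec.Containers[i].NewField = nil
	}
}

func newFieldInUse(spec *PodSpec) bool {
	if spec == nil {
		return false
	}
	for i := range spec.Containers {
		if spec.Containers[i].NewField != nil {
			return true
		}
	}
	return false
}
```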
J
Yes, yeah, yeah, thank you. This one, you know, what we're doing is: with Topology Manager we're going, or targeting to go, from alpha to beta in 1.18, and so we have this PR, and the only thing this PR does is change the feature gate from default false to true. It's a very simple change, and as a result, what happens is the scaling test fails; the scaling test fails.
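
For context, the "very simple change" described is the usual one-line feature-gate promotion in the gate registry; schematically (the surrounding registry entries are elided, and the exact file layout is an assumption):

```go
package features

import "k8s.io/component-base/featuregate"

const TopologyManager featuregate.Feature = "TopologyManager"

// Promoting the gate to beta flips its default, which is why a one-line PR
// is enough to change behavior on every node (and perturb the scaling test).
var defaultKubernetesFeatureGates = map[featuregate.Feature]featuregate.FeatureSpec{
	// Before (alpha): TopologyManager: {Default: false, PreRelease: featuregate.Alpha},
	TopologyManager: {Default: true, PreRelease: featuregate.Beta},
}
```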
J
What would be great is if it would be possible to look at this job while it's running and get on the job and interact with it, or if somebody can look at this thing and just say, hey, why is it failing? I have not been able to get to the root cause. I suspect it may be just a slight increase in memory usage, but really, by turning the feature on, the only thing we're doing is storing the container UUID, so it can only be a teeny tiny amount of memory we're adding, I think.
J
I have it sort of set up, but I'm not quite able to reproduce it yet, and the code freeze is next Thursday, so I was hoping I could have it figured out by now, but I don't. So if anyone can help out or provide some pointers, or look at this thing and debug... and David, you should kick it on there and debug; I saw you shaking your head. No, it's alright.
E
So I think my intern actually presented something like this more than a year ago, but I'm hoping, in sort of the 1.19-ish timeframe, to get my tracing work in. The basic idea is that we can use OpenTelemetry to collect distributed traces of what Kubernetes controllers are doing. I've been to a number of SIGs, but now I'm going to talk to SIG Node and sort of present what I've got and what changes are required.
E
Hopefully
it
should
be
kind
of
hot,
so
my
goal
from
this
is
just
if
you
have
feedback
feel
free
to
ping
it
to
me,
your
email
me
or
or
leave
it
for
me
somehow
and
so
I
can
collect
all
that
make
sure
it's
all
addressed,
but
first
I'll
do
a
little
bit
of
a
demo.
So
the
basic
concept
is
that
everyone
can
see
this
right
yeah.
E
So
if
I
create
something
and
pass
in
this
trace
parameter,
then
we
have
so.
This
will
start
with
a
config
map
and
I'm
using
Zipkin
here,
I
have
the
collector
configured
Pacific
in,
and
so
you
can
see
that
we
can
trace
requests
that
are
going
to
the
API
server
down
through
to
the
LCD
transaction
and
so
for
a
quick
primer.
E
You,
the
open,
telemetry
libraries,
have
a
an
HTTP
server
wrapper
so
that
I
get
these
traces
or
these
fans
here
for
free
based
on
the
HTTP
request,
sent
to
the
API
server,
and
then
they
also
have
a
ERP
C
dial
option
that
you
can
use
for
the
client
for
@cv,
and
that
gives
you
this
cool,
sed
transaction
tree.
So
really,
without
almost
any
code
changes
we
can
get
traces
from
API
server
and
that's
all
cool,
but
wouldn't
it
be
better
if
we
could
know
what
say
the
node
was
doing
when
creating
a
pod.
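
The two instrumentation points mentioned are an HTTP middleware for inbound server spans and a gRPC client interceptor for outbound calls. A minimal sketch using today's OpenTelemetry-Go contrib packages follows; the demo predates their stable names, so treat the exact imports as assumptions rather than what the speaker used.

```go
package main

import (
	"net/http"

	"go.opentelemetry.io/contrib/instrumentation/google.golang.org/grpc/otelgrpc"
	"go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp"
	"google.golang.org/grpc"
)

// serveWithSpans wraps a handler so every inbound request gets a span for free.
func serveWithSpans(mux *http.ServeMux) error {
	return http.ListenAndServe(":8080", otelhttp.NewHandler(mux, "apiserver"))
}

// dialWithSpans adds the gRPC dial option so outbound client calls
// (for example to etcd or the CRI) get child spans.
func dialWithSpans(target string) (*grpc.ClientConn, error) {
	return grpc.Dial(target,
		grpc.WithInsecure(),
		grpc.WithUnaryInterceptor(otelgrpc.UnaryClientInterceptor()),
	)
}

func main() {
	if _, err := dialWithSpans("127.0.0.1:2379"); err != nil {
		panic(err)
	}
	panic(serveWithSpans(http.NewServeMux()))
}
```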
E
So
if
I
instead
create
a
really
simple
pod,
let's
see,
I
can
now
get
this
view.
That
includes
not
just
the
API
server
request
to
create
a
pod,
but
then
also
things
like
the
schedulers
work
that
it
does
schedule
the
pod
and
then
all
the
way
down
to
the
cube.
Let's
work
to
sink
the
pod
and
even
the
container
runtime,
even
traces
from
the
container
runtimes
G
RPC
calls
for
cyclic
humans.
E
gRPC calls to the container runtime. And, as you all know, we actually have not just one gRPC client in the kubelet; we actually have many. So we can do this with device plugins; we can do this with any of the other kubelet plugins we have. This is a tool, I think, that will help SIG Node greatly in being able to diagnose sort of "where's my pod stuck" sorts of problems, since we can pretty easily get interesting telemetry.
E
Tracing is obviously more useful when you have lots of different components, because the whole promise of distributed tracing is that I can have two different binaries that both export telemetry that can then be joined back together, to give you something more useful than, say, logs or metrics.
E
Cool
and
the
last
demo
I
have
is
deployment,
and
so
the
the
way
that
the
the
previous
two
demos
work
is
that
adding
this
trace
argument
put
an
annotation
on
the
pod
object
or
the
config
macked
config
map
object
and
whenever
a
component
acts
on
that
object.
For
example,
when
the
cubelet
goes
to
sink
the
pod,
it
reads
the
annotation
off
the
object
and
uses
the
trace
context
stored
in
that
annotation.
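
The propagation mechanism described, a trace context stored in an object annotation, might look like the sketch below. The annotation key is invented, and the real implementation in the proposal may differ.

```go
package sketch

import (
	"context"

	"go.opentelemetry.io/otel"
	"go.opentelemetry.io/otel/propagation"
	"go.opentelemetry.io/otel/trace"
)

// spanFromPodAnnotations rebuilds the trace context a client stored on the
// object and starts a child span for the component's work (e.g. syncPod).
func spanFromPodAnnotations(ctx context.Context, annotations map[string]string) (context.Context, trace.Span) {
	carrier := propagation.MapCarrier{
		"traceparent": annotations["example.com/traceparent"], // hypothetical key
	}
	ctx = propagation.TraceContext{}.Extract(ctx, carrier)
	return otel.Tracer("kubelet").Start(ctx, "syncPod")
}
```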
E
So I just created the deployment, and if I pop over to Zipkin I can now see this trace here, and it's sort of like the pod trace, except now there are five of them, I think, because that's the number of pods in this deployment. But you can see that even with the very large number of spans that we would get from a system like Kubernetes,
E
It
actually
still
provides
a
very
useful
way
of
viewing
what's
happening
over
time
and,
like
I
said
you
can
see
the
the
initial
creation
of
the
deployment
you
can
see
the
creation
of
the
replica
set
for
the
deployment,
and
then
you
can
even
see
the
replica
set
controller
creating
the
pod.
So
all
of
those
layers
are
there.
E
That's
how
you
get
this
this
tree,
that
we
have
cool
so
there's
one
more
thing:
I
want
to
show,
which
is
that
open
telemetry
is
pretty
cool
because
you
can
send
to
not
just
Zipkin,
but
you
can
also
send
to
Jaeger,
and
you
can
also
send
to
stack
driver
as
well.
So
if
everything
worked,
I
should
have
all
the
same.
E
In
stock
driver
as
well
so
I've
set
this
up
to
send
to
both
stock
driver
and
to
zip
Caen,
and
so
like
this
isn't
a
vendor
specific
thing
because
we're
using
open
telemetry,
we
can
send
our
traces
wherever
the
heck
we
want,
including
yeah,
including
a
bunch
of
them,
and
so
I
think
this
is
pretty
fun
and
as
someone
who's
spent
plenty
of
time,
debugging
node
issues.
I
think
this
would
be
very
useful.
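
Fanning out to several backends is just a matter of registering more span processors on one tracer provider. Below is a sketch with Zipkin plus a stdout exporter standing in for any second backend such as Stackdriver or Jaeger; the imports reflect current OpenTelemetry-Go packages, an assumption relative to the 2020 demo.

```go
package sketch

import (
	"go.opentelemetry.io/otel/exporters/stdout/stdouttrace"
	"go.opentelemetry.io/otel/exporters/zipkin"
	sdktrace "go.opentelemetry.io/otel/sdk/trace"
)

// newProvider builds one TracerProvider that ships every span to two backends.
func newProvider() (*sdktrace.TracerProvider, error) {
	zip, err := zipkin.New("http://localhost:9411/api/v2/spans")
	if err != nil {
		return nil, err
	}
	std, err := stdouttrace.New()
	if err != nil {
		return nil, err
	}
	return sdktrace.NewTracerProvider(
		sdktrace.WithBatcher(zip), // each registered processor sees every span
		sdktrace.WithBatcher(std),
	), nil
}
```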
E
So
the
only
thing
let's
see
I've
got
2%
battery,
so
the
only
thing
left
the
three
changes
that
are
actually
made
to
the
cubelet
here.
That
I
think
deserve
a
little
bit
more
scrutiny
from
this
group.
Are
we
have
to
use
a
G
RPC
dial
option
when
we're
making
connections
to
all
of
our
clients,
so
device
plug-in
container
on
time?
Any
of
the
other
plugins
right?
E
The
cubelet
has
to
be
configured
on
startup
where
to
send
the
traces,
and
so
if
we
do
use
the
open
tracing
agent,
then
you
would
send
to
a
local
payment
set
and
the
qubit
has
some
startup
stuff
that
it
does
and
then
the
only
other
sort
of
big
change
that
some
people
find
annoying
is
that
this
is.
This
does
mean
we
are
going
to
start
doing
context
propagation.