Istio Extensions and Telemetry Working Group, 17 Jun 2020

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: Policies and Telemetry WG Meeting - 2020-06-17

Description

No description was provided for this meeting.
If this is YOUR meeting, an easy way to fix this is to add a description to your video, wherever mtngs.io found it (probably YouTube).

A

That's true, J yeah.

A

Okay, so I put a bunch of stuff on the agenda, but I don't want to dominate the discussion. So I wanted to start by just opening the floor to see if there are anything that we think we should discuss that we aren't discussing or any items that came up or concerns. We have about the 1-7 release that you should try and talk about before we dive into some of the other.

A

Is there anything that anyone wants to.

A

A

B

A

So then, do we wanna that maybe we should do this and it was like if, in order do you want to provide some updates on the extensions API, so I'm worried? If it, the RFC conversation, will take a while I, don't wanna? Yes, okay, yeah.

C

So the TLDR is that for four forward references, the the sidecar EPA itself needs some clarifications because of the because the override semantics are somewhat poorly defined because they were never needed to be more precisely defined. So we in the inverted API we have clearer, override semantics right, you can do canary and you can specify configurations for specific filters.

C

If that is move to the side, car, then site guard needs to have those semantics. So that's that that's kind of one gateway attachment point actually works quite well. It is straightforward and then virtual service is the only other attachment point which makes sense in a in a limited sense of the. The reason is so the part that it absolutely makes sense is that many times people have a need to attach behavior to certain paths right.

C

So if it is some protected path, then apply some Otzi filter, otherwise to an apply some Otzi filter or apply a filter or not apply a filter. So that is a common use case, especially at adding dress. So we so we do need to support that.

C

However, we cannot allow you to specify a full filter manifest there, because that can lead to API, which is impossible to implement, so you can specify so that will allow you to specify things like one filter order for one path and another filter order for another path, and that's you cannot implement that in in Envoy.

C

So so I will so I have I've done kind of a rough worked through of these three resources, so I will send out something full review at like by the end end of the week, and the issue, I think, is that it's not clear to me if our difference is the better API after all, so we really need to.

C

We really need to see if we want to do the forward reference or the inverted reference. The proposal that that already exists.

D

Just is because of their Indian racial doctors, only side covered mentioned and I mean psycho itself is.

C

Right so so sidecar is so sidecar is a primary resource, so the backward and forward references are like are considered with respect to other resources right.

C

So yes, so site I mean all our resources are kind of backwards in that sense, because the sidecar resource is going to select the workload that it applies to, and in this case the forward reference is going to be from the sidecar to the extension to the extension manifest so I will I will send so I have a I've kind of have a rough thing. I was I, will send, send it out, maybe tomorrow and I. Think the the other thing that I'm proposing is that we have.

C

We have a meeting of people that are kind of intimately interested in this as a as a working as a working session and we'll make sure that we we have it at a time that's convenient to to everyone, so I know Daniel and of course, neeraj and and a few other people are interested in and then from our side, Miko art and will will be there. So I think I think that we should book an hour.

C

So I'm proposing that we do that next week so next week, when we don't have this, we don't have our normal meeting. We will have a specific meeting for dpi itself. That sounds.

E

Great Amanda I know the biggest hurdle this API is is sometimes getting the networking working group convincing the members there maintainer is there and also getting the nod from TOC.

E

Should we try to solicit some feedback now or do you wanna do that after so.

C

I think I mean we can we can solicit some some feedback now, but I put the part that I want to finish and kind of have an internal agreement on is the two styles of api's and what they are bringing to the table and what they are leaving out.

C

And and actually on a on the related matter, there is a requirement section at the top of at the top of the doc, and we haven't had an internal agreement that we are definitely committed to those requirements. So those are those are the two specific things that that we need an agreement on.

E

Makes sense so Mandar whenever you set up the meeting just add in the agenda that we want outcomes of those two things: okay,.

C

E

Otherwise, we normally get derailed. Okay,.

C

E

A

Okay, yeah so look for is here updates and then plant that meeting yep.

A

Two weeks ago, dis obtained a nosy tracer work. We had someone I, think come in on the slack Channel and say they were working on a prototype for switching to open senses are open, so mushy for tracing did that happen doesn't even know. Mm-Hmm I.

C

I did I did not say, I did not see it's exe OPR and in fact he he had. He had basically implemented the whole PR in his own branch. So I don't think there was a whole lot of work to be done to submit the PR, so I will actually just follow up now. Okay,.

A

Yeah it'd be nice if the work was already done, if we could really grab it. Yep.

A

I'm gonna lean on you again and are here: do we want to provide I know you were working on finalizing the monitoring and updating of for wasn't. The extensions and distributions did.

D

A

To share out any of those details: well, yes,.

C

So let me let me share that clean.

C

Up, actually, uh how about you, you go to the next one, while I look at the docs and okay.

A

I know nice: let's do yours, then, first because I think yeah, but she lectured Patterson, maybe.

E

Short just click on it and you will be able to figure out where it is. Oh.

A

Yeah I was supposed to look at this this morning. I just didn't get to it before the meeting yeah.

E

I just wanted to ask if this can be cherry bugged, or are we considering mixer in the earlier releases frozen I just don't know the current state I.

A

Don't think there's a problem with cherry picking, yeah.

C

I think I think I think I think we can. We can cherry pick. Just fine I mean it. So it's it's clearly an important thing that vanilla found and um yeah yeah. We thank.

E

You so it's just.

C

One minute.

E

E

So I'll be cherry picking two like 1 3 and 1/4.

C

1/3 1/4, that's that's! That's.

E

C

E

F

E

Concerned about 1/6 and 1/5 so.

F

E

We had talked about. You know that mixer is like an abandoned project. That's.

D

E

I was just making sure we all agree. Ok makes sense.

A

Do we even have a since we, the review to release managers, still work on the 1/4 branch at all I? Don't yes,.

B

One for Toledo, oh okay,.

A

F

It was just this last one was.

A

F

So wait one for 0l yeah.

G

Now it is 1/4 is 1/4 is dead. Sorry, they switched that hang up and unmute button on my on my meat, so I accidentally hung up.

A

Okay yeah, so we're not! Yes, there's no point in cherry picking you back so when I gotta cut your releases there yeah.

A

Okay, did you get time to bring it up? My daughter, yep.

C

Just share my screen and I mean stop presenting.

A

C

You can help.

C

Okay, so so this so this, this is this particular dock is about like very specific, very specific problem, which is what are the different failure modes when you're distributing was some extensions and more crucially. What does the user do in response to that right? So this? So this is somewhat agnostic to what API you used to actually configure it, because it could be on what filter API.

C

It could see the new API that that we're designing or it could be, the telemetry specific API, whichever whichever it is, but so being able to categorize all the modes of failure and then giving the user specific feedback about what is going wrong and we're just enough so that the user can go back and then look at the logs and and kind of do other things right, so that so it has. It has a very specific focused goal, and then we actually have kind of catalog or different modes of failure.

C

There are multiple ways that we're distributing the distributing bytes and actually now there is a another mode which at at some point, quat will will present. But and so there are inline bytes where the XTS level itself is pushing the whites. Somehow, then there is the URL fetch method, and then there is the out-of-band file based delivery, which is like distributed file system or a demon fetcher, or something like that, and then the kinds of errors that you can have are I'mme. Of course none permanently terror, so you could not.

C

You could not read it at all, eventually consistent read, which means you could not read it now. But if you just wait a little bit for propagation delay, it's it's going to be fine, and now the error will be gone. The module.

D

C

No I mean Tata timeout is, is the only otherwise it's the whole thing problem, but no timeout is is the is the only way so I guess you only know it correct. So you only know it after the after timer saying, okay, you start with temporary error and then after timeout you upgrade it to permanent, saying, okay, I doesn't look like this is going to succeed.

C

Invalid module means that the basel model is invalid or compiled against just the wrong api version, or something like that and then the last one is kind of the more standard error of just the configuration was invalid right. It did not make sense.

C

Now the interesting thing to note is that with inline bytes, if you send the bytes in line, then the first two modes of failure- I, don't know op because they exist XD. A server itself is like giving you the bytes and if it doesn't so yeah so basically the first two modes of failure are knobs, because just by construction, the bikes are there. It could still send you invalid bytes and the configuration could still be wrong. So those are still possible.

C

There so the so I'll just kind of not cover this part right now. The and I'll just go to URL fetch. So your fetch, of course, has the other two moles of three all right: you, you could timeout yeah, so you could timeout and the error becomes permanent and file based delivery is is somewhat different.

C

It will knock immediately and end synchronously. If something is something is not not present, but.

C

It specifically does not support eventual consistency, so that's kind of another part of this proposal, but I don't actually well I'm, having second thoughts about that. So let's not go there d. So the main upshot of this is providing a counter which is dimensioned by error, type and filter, config name.

C

Which is dimension by error, type and filter config mean where error types are these right: permanent eventual in valid and invalid configuration and filter config names are the actual configuration names that that the user has used in the config.

C

So now, just by monitoring this metric right. So this this will be a Prometheus MIT metric. Let's say now just by monitoring this metric, the user can actually know so where the error happened. So again, what kind of error it is and which filter they need to go? Look at. It won't give them the error message and all that, but that's not the intent you, so you should be able to set alarms on this thing right and then with Prometheus.

C

It already tells you which proxy at this error, so, for example, if one of the proxies is older version for some reason and that's. Why that's why it said API like that's why it said in the valid module, then you would clearly know that you will see a counter that says: error type invalid mod you going up and it will point to a particular particular proxy.

C

A

Do you think it's worth adding the build version, as part of the accountant is as I mentioned there is that if you think those are closely tied, should we just include that information along the place.

C

Build version of the proxy itself yeah that probably makes sense do we do we have this information anyway, because that, because we scrape the job and the job has some information.

A

In this, this do build metric itself from, but I don't think that the job will just tell you which high right, okay, the instance IP right. So it might. If you want to correlate the version you might want to.

E

Do you want the proxy version or the API version all.

A

Right yeah: do you investors yeah, you probably want both unless they're tied exactly to each other. So.

C

They they are tied because a particular build of one. What office to proxy is compiled with a certain version of API now yeah, so that so the question is which one of those two things is more relevant from an exact.

D

C

I guess I guess a bi probably is more relevant there. So it's debatable yeah.

A

I mean I worry like so, if you tell me, I have the wrong API. How do I know which version to go get to get the right API? Are we gonna have that easily links somewhere? Yes,.

C

We yeah we've, all people have habit easily link but I think I. Think your question is still valid. Like dig deep, you just want to present the the most useful information right there or does it you have to go through several levels of indirection I mean.

E

If you add more dimensions, you can have more information and they don't have to go multiple places, but I feel like either. You add both or the most relevant thing feels like if you're not going to add both envoy and a bi version. The most relevant thing is the API version. The.

B

Api version is pretty static; it will not even run if it's not doesn't match. So, not not. You can do it.

C

Well, but what quad, if so, so, the the scenario here is that I download a module that is available in to a beer virgins and I just downloaded the wrong one and now I try to install it, and a few of my approxi said: oh I this. This is not the right version. I, don't support this yeah.

B

I understand is just you can do that ahead of time right like this, it's all visible in the module and and epoxy and has to match exactly so. Why would even try to deploy something doesn't match.

C

So so, okay, I I, get your point, but we are talking about modes of failure here right. No, no one tries to induce a failure kind of by white design, so this so this is.

B

C

B

C

Case where I just didn't know and I had tried to install it anyway and how do I know what went wrong? I think.

B

I'm just trying to say that we shouldn't focus on errors of preventable, and this is a clearly preventable there. Yeah.

E

He said, I think what is saying is we can do something beforehand where this ever will never happen.

C

E

C

Well, okay, the I think that's a that's fair, but you know you know their system right. So yes, so, for example, you we config validation, can be done ahead of time and all these things can be done out of time.

F

So I spent all weekend trying to work on something like this and I had a lot of trouble with declaring functions as being external and then wouldn't read them and I tried to see if I could come up with some config validation. I couldn't I I, what's gonna be key, is some kind of way that a user can tell what the problem is. I I didn't realize that this was going to be presented today.

F

Hey I talked to Mitch Connors about distribution status because his distribution status stuff isn't working properly for failures with Envoy filter, webassembly stuff, but I'm wondering if, if a CLI tool could pull from Prometheus and tell users about these problems, oh.

C

So, okay, so thanks ed, yes, absolutely and and again you know, you know in a layered system. The the focus of this part is simply to expose those primitives so that CLI tools or UI or I, don't know Callie or something else can expose it in a meaningful way.

C

So I think config validation for.

C

For extensions is a separate topic that we we do need to discuss right. So basically config validation a priori, but this is, it is only gonna cover. What happens if that thing fails and you actually go all the way in and now you realize that there is a problem.

F

So right so I I was using Waze me and woz me sets up mounts and if you screw it up, the mounts are incorrect. The push arrives it's rejected because the file name is invalid, correct and what I discovered was that Mitch Conners distribution status was claiming that all of the nodes had the distribution of the filter, but it actually didn't have been rejected. So he's off fixing that I'm.

F

What I'm wondering, though, was say user had that they get this alert because of this metrics. How can we lead them to discover what the problem is right? In the case of Waze me, everything was correct, but maybe the demon set was screwed up and didn't read the cache properly.

C

Right so, okay, so I think I think that we we need to look at that mode of failure, but I think that in that case we would have seen the so okay. So so you envoys sent back an ACK, correct and.

F

The part that pod became stale right was.

C

F

One bug is Mitch's thing, but I want to I want to go further and help the user to see why it went stale which I had it with me. Thea sigh right seized on the Lord, my car yeah.

C

So I think I think that the the way the wait way to look at it, but most definitely is that these are all exceptional situations right and that's. Why metric as the first line of defense on that you can set alerts and then the alert will tell you that there is something going on with this proxy or these proxies and that's already specific enough for you to go and look at the logs right, because this kind of metric is telling you that a particular proxy trying to load a particular module and something bad happened.

C

In fact, after this is added, even in the file-based mounts, we will record it as an error, so that so that that that's kind of the other part here, because there are multiple ways in which we deliver and some go into warming- some don't go into warming. This method of.

C

Exposing this information in metrics is consistent across different ways that that configures is deliver. Config encoders, deliver yeah.

F

I think this is great I, just I want to work with you on maybe a command line to okay. Now these metrics okay.

C

Well, okay, perfect I, think I think that that that that would be awesome, yeah, let's, let's absolutely do that and I am not sure how exactly this interacts with status, but you and which have thought more about it than I have so.

E

Leave status I think it is another debate going on, but let him talk about.

F

I'm gonna, let you continue the presentation I'll just make comments later on on the document. Okay,.

C

Well, thank you actually I. Think that's I! Think that that's basically it that, like that that counter, so the proposal is that that we implement that counter and which is automatically scraped by Prometheus, so we'll make sure that it's put in the right namespace so that it is picked up by Prometheus like normal and then the rest of the story is we expose it so CLI tool and or angriff on the dashboard yeah.

E

Okay, quick question for humans are so other than configuration error for a particular was or a bi incompatibility issues. Is there a third car that I can't think of, for example, VM related issues that happen at runtime, so.

C

So any number of things can happen at runtime, mm-hmm right, but root. So, okay, this is not dealing with with runtime errors. I.

D

Said this is the distribution right.

C

So yes, so in so many like many issues can happen at at runtime and those will show up already in other places, but but but I but I, but I agree that there is. There is kind of more runtime status, sorry of status, runtime metrics as well yeah as they pertain to the extension. So we can actually have a similarly dimension thing. I can flick name and that tracks tracks.

E

Errors- how about a distribution time, VM incompatibility issues- and this is basically me going on a limb here, so is it possible that a particular Rossum module can only run a particular type of VM in.

C

No, that should not be possible simply because the the Oise of VM actually abstracts out whatever else is, underneath it so yeah, so so that that that should shouldn't be possible that there are. There are certain other kinds of failures and we're not very close to getting them, but like once, we have the whole capabilities-based thing, whether you can read from the file system or not right. So right now there is no ABI to read from file system, but there could be and.

D

C

May have local policies that say you cannot read from file system and now there is it. There is an issue, but but I think that even in those cases right again, these are exceptional circumstances. So, in those cases it's still important to have an alert that points user in the approximate direction and then they can then they can pinpoint using logs, ultimately yeah.

E

That makes sense I.

C

Got a drop thanks mocha. This looks great all right, I'm done! Thank you. Oh.

A

C

A

If we have Part C API version, should we monitor that separately? Who are you talking about preventible? Should we try and have a metric that exposes a bi version as well? Just so you can see the status of your cluster in terms of compatibility more easily. That's.

C

Probably good quad, what what do you think I mean it's? It is very firmly hardwired to the proxy version, but yeah I, don't.

B

Think much what has been given to that is only one API version right now and it's just I think we need to improve the platform itself. So we have a major minor version placed in the API yeah. Just no one has.

A

Really worked on.

B

The inversion add a bi-level, quite yet. Okay.

F

Can I ask about the the runtime field, because I I noticed the the the angry filters we ship use, the run boy run time envoy, wasup run time does null and the ones made from solo, I/o or envoy Watson that run time. That v8 is that I know that's not the API version, but the runtime version itself. There might be language features.

F

Do we want to attempt to validate or report about that? That's.

C

Okay, that that's that that's a good question so kind of ultimately going forward.

C

The talk, v8 is going to be the main, and the default I mean. Maybe the Volvo meant something like that, which will come but v8 is, is the proper wasum, runtime and stack means like that will be the main thing no sandbox is used now, but we will reduce the use of it as v8 and then that other pipeline improves, so we could potentially add the VM type as a dimension.

C

If that's, if that's your, if that's your question, I just don't know where exactly we will use it and how does it help us in this sort of a bunion, but maybe does that? What do others think I.

B

Hopes of migration.

C

It does help with migration de that that's true so, but does it help as another dimension here or I guess it does right.

B

It helps during migration and have migrated it so I think yeah feasible time. Okay, okay,.

C

That makes sense.

C

Okay, great so I will stop presenting out, because Doug has another one to cover yeah.

A

Okay, yes, the next thing on the agenda: unless there's something else, was this RSC for telemetry API, that I threw together and mainly I, was just trying to hurdle us towards our one-seven goals, which was having to find 23 API. So I was hoping to.

B

Get a lot of feedback.

A

And I think we have got a lot of feedback now so now I sort of have to make some tough deciding decisions. So if you take a look at a dog, I tried to capture all of the parts in various ways that we can configure telemetry in the system today, and some of the requests that came in and for issue is filed by community users, and so this is my attempt at distilling out the set of requirements.

A

It's been outdated based on some feedback, and so maybe we should I just wanted to go through them and see if people agree. If this is the right set or if they think there's more, that should be here or less so. The first thing I want to have one way to control telemetry for is, do I think having to learn a different way to do tracing it differently. Do logging at point do metrics it's just not the generally good user experience and it sort of leads to sort of these fragmented implementations.

A

So that's why I put that first requirement there that goes hand-in-hand it the second one, which is simplifying user experience right. You shouldn't have to change things in three places to get the home assure you want, you should have a way that you can do that in one place or through one action.

A

The next one I was thinking the workload level, but Peter and others pointed out that that, in the future you might want to do it at the listener level, or maybe even for external services. There's a provide overrides for customization I. Don't know to do others feel that way as well. Is that.

C

It it's a so for for telemetry I mean these are part of kind of complete description of where's like if something goes so. Yes, I, don't know how common it would be to attach different telemetry to different ports, but but yeah I think it's possible yeah.

B

I wasn't sure either healthcare I mean it's very common problem. Ok,.

A

B

Set so you have health checks, objects, yes,.

A

Yeah, that's true.

C

Okay, yeah, that's a it's a good point.

A

C

The I think we should have health checks as a separate heading anyway, because I think yes, this will solve health check if I'll check some on a different port.

C

But if health checks are not on a different port, we need to solve health checks, polluting telemetry.

B

C

B

Different ways people do have checks. Oh right.

C

Yeah, okay, I think that's that's fair, but I just thought! Yeah I just want to make sure that then we do track just health checks as general as well.

A

The other thing I put in here just I, think I saw this and some of the other dogs as well, which is being able to have our you know our back from the controls in configuration and I know. Quite you had called out inflation over the operators.

A

Do we do we believe it's different than the network administrator, and if so, should we separate those personas? So I don't know if others have thoughts on that or wanna talk about that.

C

So so I think that again we don't need to support it fully. But, yes, just like you know, if someone really wants, if particular user really wants to kind of go to that level of detail and that level of granularity, they should be able to what yeah I we.

B

Yeah, we don't.

D

B

Main concern is that network operator in history is basically God level, so.

F

B

Anything to network and API, who is the good one to make another one god so and that's not always the thing that you want for people to customize a tracing span. It was this escalation of privileges, basically yeah.

C

Okay, can you elaborate on on that example, a little bit so someone changes, trace paths right.

B

So if some app developer wants to change their spam tables, you shouldn't have to modify cluster level. Networking api's, oh okay,.

C

Yes, yeah yeah, absolutely okay, yes, that that makes sense. Now.

C

Now that said, right, I think the the trade off do you have to do is that if it leads to a complicated API and this sort of requirement is rare, then you can always ask whoever has access to the network logs to make that one change. Yeah.

B

I get your point, but I think the goal here is to make an end, live and user API, so there's always a possibility to build another level on top of this right to enforce our back, but that that's just building more and more layers of in between you and the system.

B

So if you think of something like you know, get get ops, yes,.

D

B

That but then your API is get notice. If there's really not no point making another level of indirection.

C

I think that it's a it's a matter of where we draw the line right. Do we like how far do we go to cater to really sophisticated scenarios and what does the baseline system support? Yeah.

B

It's my discussion, I'm just wondering.

C

B

If we want to solve new problems- and you know, make new capabilities, we have to do that. Otherwise, we're just building another level of interaction and that's not really helpful.

C

Okay, yeah, that's fair, okay,.

A

A

So what are you exist in use? Cases I think, is that's pretty basic. The last one I added in response to the original.

B

A

Which is just used proxy, config and I think that this invalidates the proxy config, because the feedback I get that was getting was that we want to have it such that you don't have to restart the pods and want this to be dynamic and and go out, and so I want to make sure that everyone is on. That thinks that that is a worthy design goal on the farm. So so.

C

Prompt so proxy config requires early start because it is used in at the injection stage right.

C

C

Okay, but so that so that that's just from an implementation side, we can change the implementation, and maybe that won't be an issue, but I guess the more important thing is trying to get away from large-scale and instant changes that affect the entire mesh.

C

So in that sense it's actually a feature that, if you change something in proxy config, you do need to go through a very deliberate step of deploying those changes.

B

G

B

Applies to extension, API driver I mean it's a general problem correction. It.

C

Is a it is a general problem, app absolutely.

C

Yeah, that's true, so you might.

B

Wanna add that the modes of failure is a cascade. You know we have to stop delivering what some extensions that look like they're correct, even though the crash systems.

B

So we need to make sure it's possible to deliver configs for extensions. They are awesome so that it doesn't cause pull much crash.

G

B

Think but extension apply is going to be what's used as implementation of this totemic api right.

C

B

Needs to support features we want meaning. It should be possible to all out config gradually. Extension. Api is right.

C

So it's it's actually interesting in in in some ways. Yes, it's clear that the extensions the extension API is an implementation of this, but we should not let the design or the current design of the extension API influence, what we perceive as the requirements for the telemetry yeah, so so I think that being able to push so Canarian right. That's that's kind of the bottom line you should be able to. You should be able to canary changes, which also happens to be a major requirement for the extension API, but I guess it also applies here.

A

Yeah I think, if you're, making sort of namespace or group in work work group. Why changes we don't really have a story for how to do this, with single work well or in a selection? Doing this.

B

That's a question: how to have an instructor, but it just means in you. You know find enough granularity for selector selectors, so you don't deploy nice white all the time. No.

D

C

And the interesting thing is that the way it is set up right now with proxy conflicting static at boot time inadvertently gives you that capability, because it means that your canary essentially is restarting some parts and that's your canary, but but on water filter. Api, though the work, the thing that we use today to deliver telemetry does not have that feature so to speak. It's like instant and everywhere.

A

Okay, okay, okay, so I I'm, not sure where to go with the design. Yet sighs doesn't process. All of this, it sounds like so.

C

So so Doug took the part that once once we are once we have done requirements and and and I think I think we do. Maybe other people who comment more, but the part that I would want to ensure happens is that either coexistence or.

C

So if this is implemented on top of the extension API, then coexistence is solved just by layering. If this is implemented directly in the control plane, then we need to make sure that it coexists nicely with the extensions API. It's simply because the ways these functions are actually delivered is still going to be extensions. In many cases.

A

Yes, so yeah I was hoping that the extension API would sort of get to a stable and approves station before we try and resolve this, so we could even decide if this was necessary on top of it or how it would fit. So maybe maybe that's the thing, maybe we think on this. While we finish the extensions API so that we can look at how it would layer or start to design the way right now- and maybe that's my next dentist is to go back and show how this could layer over the extensions again. Oh.

C

C

So well, but for mint from an actual API perspective. This is what we want. The users use to configure telemetry right, I think that's that's kind of the question so for standard telemetry, there's this API and for anything else, that's more bespoke. There's the extensions API I think that that's what we need to. We need to figure out.

A

A

Well, yeah, so I guess the view of the extensions API to me view that as a something the user should interact with, or do we view that as an improvement, an envoy filter for wish things could then compile down to essentially is there? Is that meant to be exposed to the user directly?

A

That's what I don't have a good feel for yet yeah.

B

Probably because there are there telemetry providers, their own extensions, they need to be able to use, has to be exposed to the user. So.

C

I guess Doug: your question is: is it the only thing that's exposed to the user and do we do we actually create a more first-class elementary PR on top of it and exports that so I think we need to think about that I.

C

Think Daniel last I mentioned that there is clearly some boilerplate that you need to do to use extension api's with specifically awesome, right and I think even that time, what we discussed was: yes, we can always add in there or some defaulting behavior, or something like that on top of on top of that API.

C

So at least the way it looks right now is that the extensions extensions API will be exposed to the user, so it so it has to be a human consumable.

C

But if there are better ways to expose like some functionality, then things can be layered on top of it. Yeah.

A

I guess what I was thinking was I'd like to go through whatever mostly final version, the extensions API, okay and.

C

A

See if I can flesh out three use cases right completely be tracing for a couple things and see what that looks like I mean use that as motivation, maybe to add another layer or to say we're fine with this layer for now, okay,.

C

Yeah, oh that's about that script.

A

Okay, well thanks everyone for the feedback and please, if you have more Adam to the dock, that'd be helpful. It's still a working document. It's not a final proposal by any stretch of the imagination, I just wanted to get the conversation started, which we managed to do and that's all I really had. Is there something else we should talk about with the last five minutes of our time? So anything else, it's honest questions on or is curious about or wants to discuss.

C

All right, okay! Well, thanks everyone yep! Thank you. Thank you.

C

C