From YouTube: Apache TVM Community Meeting, March 2, 2022
A: Okay, so welcome everyone to the March 2nd TVM community meeting. I'm Andrew Reusch, one of the PMC members for, excuse me, TVM. We'll start the meeting with introductions, for anyone who hasn't attended one of the TVM, or maybe the microTVM, community meetings before. If anyone wants to introduce themselves to the community, say hi, say where they're from and what they're interested in with TVM, go for it. I do see a lot of familiar faces here, so I think almost everybody on the call is someone I've seen before. That said, we'll just move on here to the agenda.
A: Right now on the agenda we have the UMA accelerator interface, which is being brought up by Michael Klaiber and some colleagues at the University of Munich as well as at Bosch. And then, if time permits, I'd also like to bring up the remove-codeowners RFC, which affects code review for everyone. As far as announcements and news go, I don't think I have anything super specific, although I do want to congratulate Leandro for being promoted to the PMC. Great work, we're happy to have you and excited for your future contributions.

Thank you very much, yeah.
A: So with that, I'd like to move on to the UMA accelerator interface, and I'd like to turn it over. This is the Discuss thread here that started the discussion, and we have our image reference here. I don't know if Mike or one of the folks behind this wants to give a little overview of the proposal; I can also add context as well, if that's helpful.

Hi, can you hear me?
C: I hope I didn't screw up the Zoom link or anything like that. Yeah, so give us a few more minutes.

A: No problem. Anyone else have trouble? Should I...
D: Well, I think one of the things that can happen is, if you have a Zoom account and you're logged into it, when you bring up the URL you get presented with a screen saying you should sign in as the moderator, and then there's a link underneath that, in a fairly small font, which is where you're supposed to click to actually go into the meeting. So it's really easy to miss that.
A: Okay, well, great job on my part in setting this all up. Hello, hello. Hey Mike, how's it going? Sorry about that, I must have... oh, I see, interesting, okay. Well, hopefully that new link solves the problem for everyone; I'm not sure what happened with the old one.
A: So we had actually just started along our agenda a little bit — apologies for the difficulty in joining — but we had gotten to the point where we were going to start bringing up the accelerator interface. I wasn't sure if you wanted to give a brief overview or some background on the proposal. I'm also happy to add context here as well; I just wanted to see if you wanted to discuss the rationale and the goal and go from there.
F: Absolutely. Can I somehow share my screen?

A: Sure, let me unshare mine. Sorry, where's...
F: Starting from zero: basically, a group of people, who are mentioned here, mostly from Germany — Paul and Christoph from Tübingen, myself and Ingo from Bosch, and Philipp, Rafael, Daniel, and Johannes from Munich — all had the same goal, or problem, however we want to put it: we wanted to integrate an accelerator into TVM. There are a couple of ways this has been done in the past — the Ethos-U is a really good example of it, and VTA, of course, is really cool — and we thought: how can we somehow share synergies? How can we save some of the workload? Because we are quite a number of people here, we thought, why don't we put our efforts together and try to define something that we can use in common? That's how we came up with a first version, or the idea, of UMA, which is a Universal Modular Accelerator interface.
F: So the idea basically is: how can I bring an accelerator easily into TVM? The goals here are, as I said, easy integration — in our case, first, of external hardware accelerators. What do we mean by external accelerators? Everything that is not directly in the processor pipeline.
F: And what does this include? It includes a couple of APIs to hook into TVM's pass and codegen infrastructures, because we saw that in the previous accelerator integrations this was done differently from one accelerator to the next. And the motivation behind this is, I mean, we see...
F: In our previous discussions — you mentioned it in the post as well — with Andrew and Mark, together with the other people mentioned before, we already had a high-bandwidth discussion on how this could be done, based on the first posts, and we came up with, let's say, an intermediate idea of how this could be done, and this is what we also want to discuss with the community today.
F: So a short learning period should go together with such an easy and stable API. The second layer then would be the plumbing layer — maybe Andrew, you can help me here a little bit. From my understanding, this was a Collage-like API that is still to be integrated into TVM, plus other TVM APIs. This is therefore a way more powerful API, connected to the core compiler; the target audience is really experienced users, and it is C++ and Python. Maybe...
A: Yeah, that's kind of what we're talking about, but maybe I can just add a little bit more detail here. There are a couple of different pieces of this proposal: some of them involve code generation, some involve graph partitioning, and some involve mid-level compiler passes, and the proposal essentially is to unify — oh, you can keep your screen share if you want, Mike, sorry about that.
A: The proposal essentially was to unify all of these different pieces, which are the common touch points that you have when you're trying to integrate an accelerator behind some porcelain. So the idea is to look at what the common lower-level APIs or interfaces are that people exploit at the compiler level, and then make the porcelain wrap those, essentially. And when it comes to the topic of graph partitioning, we were discussing trying to make sure that at least the way things are recorded on the IRModule here is similar to what we're doing today.
A: In fact, one of the questions we had raised at the beginning of the proposal was: are you guys proposing to introduce yet another partitioning flow? And the answer is no.
A: We just want to wrap the existing partitioner in an API, so that as efforts like Collage get further along, to the point that they're RFC'd to the community, the migration path will be the same for accelerators integrated using UMA as it is for most of the accelerators in the codebase today that use the existing partitioning flow. So I would say Collage-like, but really, for all practical intents and purposes, today it's more like using the same partitioning flow that's currently used in the compiler. So hopefully that clarifies things a little bit.
F: Yep, totally makes sense. So basically — this is what Andrew just mentioned — this is the top-level chart of where UMA would hook into the TVM pipeline. Everything that is yellow or orange here is where it will hook in. The UMA partitioner is basically just using the existing partitioner and wrapping it in a nice porcelain API — this is what Andrew just mentioned — and the UMA pipeline, which I will show on the next slide, is basically the steps to go through.
F: To the UMA pipeline: the pipeline has a couple of — I mean, this is a proposal for how it could be done. The idea is going from the composite patterns through a UMA lowering step, which is basically an implementation of the Relay-to-TIR pass that has already been proposed in RFC 10 — so it's one particular implementation of it — and it has the components so that you can register your own primitives here. You can also use TOPI primitives, so it is similar to what the Ethos-U RFC has done.
F: It's just, I would say, a generalized and formalized way of how it could be done for a bigger number of accelerators. Going on here, the next step is to execute the TVM lowering steps, resulting in S-TIR (schedulable TIR), and here it would be possible to have a hook for executing UMA schedules — accelerator-specific schedules — and accelerator-specific passes that can be registered specifically for the user's accelerator.
F: All of this then results in NS-TIR (non-schedulable TIR), which also includes TIR extern calls and can call into a default library of an accelerator. And last, the final part of the proposal would be a codegen part.
F: Here it could be possible to have a specialization of the C codegen. In particular, we think of a Python interface into the C codegen, so that from the user level small modifications to visitors and to the C code can be made. In our discussion with Andrew and Mark, the codegen part was already a bit controversial — whether we should keep it in here. So I think this is what we should have up for discussion.
A: Yeah, definitely. I wanted to talk a little bit about the hooks that we should make available — essentially what would hook into the orange boxes — and solicit feedback and thoughts from the community. I mean, there are lots of options in TVM as far as how we can add an accelerator, but presuming we add kind of a higher-level accelerator porcelain here...
A: What are the kinds of passes you'd like to be able to run? What are the kinds of schedules you'd like to be able to define? And the codegen part — actually, I think there's maybe some question about how we should implement that, but I think it might just be a straightforward wrapping of TIR-to-runtime.
A: Just so long as the first two things are resolved in a non-controversial way. So that's something I wanted to put up for discussion — but do you want to finish your presentation here, and then we can open it up?
F: I just have two snippets as an example, and maybe, if Paul is on the call, maybe he wants to take over here.
G: Yes, I'm here, yeah. I can just give a quick overview. Based on the feedback we already received in the discussion, we updated the code a little bit, and the top-level API basically wraps it all into the UMA backend. UMABackend is our parent class here, which we need to inherit from, and then we can just use, for example, the UltraTrail backend, which is the accelerator we are using.
G: Yes, as a proof of concept, to try these things out at the moment. And what we can see here is that we have these different levels of registration of different things. The first one would be Relay function registrations, which is basically registering patterns for the pattern matching, and registering passes which are then executed in a Relay-to-Relay pass.
G: Maybe I want to run these passes before the partitioning, or after the partitioning, for example — those are the things for the Relay-to-Relay part, the pattern registration and passes. Then come the Relay-to-TIR registrations, where we have operator strategies which can be registered: for example, if someone has a custom tensor expression at the moment and wants to integrate it into the flow, this can be done at this level. And then there are the schedules and TIR passes, similar to what was shown in the picture before: we can have custom TIR schedules which are applied, and we can register multiple of them, and then there are TIR passes, again with different stages in the lowering process where they are hooked in. And in the end — this is all currently highly under development.
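The registration levels just described can be sketched as a small registry. This is a hypothetical mock, not the actual UMA API: it only illustrates the three levels (Relay-to-Relay patterns and passes, Relay-to-TIR operator strategies, and TIR schedules/passes with a lowering stage); all class and method names here are illustrative.

```python
class AccelBackend:
    """Toy stand-in for a UMA-style backend, collecting registrations per level."""

    def __init__(self, name):
        self.name = name
        self.patterns = []        # (pattern_name, pattern) for pattern matching
        self.relay_passes = []    # (phase, fn): run before or after partitioning
        self.strategies = []      # operator strategies (e.g. custom tensor expressions)
        self.schedules = []       # schedules applied to schedulable TIR (S-TIR)
        self.tir_passes = []      # (stage, fn): hooked into a lowering stage

    def register_pattern(self, name, pattern):
        self.patterns.append((name, pattern))

    def register_relay_pass(self, phase, fn):
        # Passes can be requested before or after the graph partitioning.
        assert phase in ("pre_partition", "post_partition")
        self.relay_passes.append((phase, fn))

    def register_operator_strategy(self, fn):
        self.strategies.append(fn)

    def register_schedule(self, fn):
        self.schedules.append(fn)

    def register_tir_pass(self, stage, fn):
        self.tir_passes.append((stage, fn))


# Example: one pattern, one post-partitioning Relay pass, one TIR pass.
ut = AccelBackend("ultra_trail")
ut.register_pattern("conv1d_relu", object())
ut.register_relay_pass("post_partition", lambda mod: mod)
ut.register_tir_pass(0, lambda mod: mod)
```

The point of the structure is that each level maps onto one of the orange boxes in the presented pipeline chart.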
G: This is currently the idea we are coming from, and this is basically all you have to do. Then this is a little snippet of how it can be used. The idea is, basically, you have your front end...
G: You import a module into TVM, and then you have one or multiple backends, and you can register them using this code in lines three and four, and use the partitioning — which currently calls the standard TVM partitioning. Then it's just a relay.build: everything was registered by the backend, or by UMA in this case, and it hooks into the relay.build using the normal TVM API to finally generate, in this case, C runtime code.
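The end-to-end flow just described (import a module, register one or more backends, partition with the standard partitioner, then build) can be sketched schematically. This is a toy mock with hypothetical names, not the actual UMA or TVM API; the stand-in `partition` and `build` functions only mimic the roles of the standard partitioner and relay.build.

```python
class UltraTrailBackend:
    """Toy stand-in for a registered UMA backend with its offload patterns."""
    target_name = "ultra_trail"
    patterns = {"conv1d_relu"}


def partition(module, backends):
    # Stand-in for the standard TVM partitioner: mark each operator that
    # matches a backend pattern as offloaded to that backend, else keep
    # it on the host target.
    return {
        op: next((b.target_name for b in backends if op in b.patterns), "host")
        for op in module
    }


def build(partitioned):
    # Stand-in for relay.build: pretend to emit C runtime code per target.
    return {op: f"// code for {op} on {target}" for op, target in partitioned.items()}


module = ["conv1d_relu", "dense"]          # toy "imported" model
backends = [UltraTrailBackend()]           # backend registration
partitioned = partition(module, backends)  # standard partitioning, wrapped
artifacts = build(partitioned)             # relay.build equivalent
```

The operator matching the backend's pattern lands on the accelerator target, while everything else stays on the host, mirroring the four-line usage snippet shown on the slide.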
A: Okay, it seems like my Zoom client has frozen, but I can still use the spacebar to talk to you guys. I can't see what anyone's doing, but I can at least talk. Great — thanks for the great overview.
A: What I wanted to do — and I don't mean to take away the mic from you guys if you have more to say — was just discuss a little bit the different scheduling hooks that you wanted to be able to provide. I think there's a question of whether or not we provide scheduling hooks prior to — or, sorry...
A
Sorry,
I
want
to
talk
about
the
different
compiler
passcodes
that
you
want
to
provide.
So
I
think
there's
you
guys
have
already
motivated
having
kind
of
relay
passes
here.
You've
motivated
having
tr
passes,
and
then
I
think,
if
I
recall
correctly
from
the
call
there
was
maybe
some
desire
to
have
some
like
communication
between
the
two
different
passes.
A: I didn't know if you wanted to elaborate on that, and then I kind of wanted to open it up and see what the community thought: whether there are other things that might be missing here, or whether this seems sufficient as a v1 that we should try to implement, or whether this seems like a good idea or a bad idea — just to generally gather everyone's thoughts.
F: Okay, maybe first on the motivation you mentioned between the schedules. Basically, what we see is that a primitive implemented as a custom primitive always — or at least often — has a relation to accelerator-specific schedules or accelerator-specific passes, and also a relation to the codegen. These are the things we see as tightly related; they can interact with each other in this pipeline, and that's why we think these three hooks are needed.
A: Yeah, sorry — I have to remember to hold the spacebar now. I think that makes sense to me. I wanted to gather some feedback from everyone else: I don't know if the Arm folks have any thoughts, having blazed this trail before, or if there are things that seem like they might be missing or misplaced here, or if anyone else has anything they want to contribute.
E: Yeah, I have a few questions. I think overall the work looks good; everything is going in the right direction by structuring the registration of the passes to support an accelerator.
E: So the first question I had would be: would the v1 of this support a tensor expression (TE) that is obtained from TOPI, or is the plan to skip that?
G: Yeah, maybe I can add to this. From our side, we had a big interest in going directly to TIR, but we realized — we are basically using TOPI and lowering it at the moment through the standard lowering, similar to what is also done in the Ethos-U, down to schedulable TIR. At this point we are also supporting the injection of a custom tensor expression — a custom TE — into the flow, as long as this is still part of the main lowering in TVM.
A: Yeah, so I guess one question then is: are you guys invoking this TE compiler from your flow, or is the idea that you run a pass and kind of submit the TE into the IRModule, and then let the standard flow invoke it from there, I guess?
E: Yes, yeah — to follow up on that: in the proposal there is the explicit differentiation between S-TIR and NS-TIR. Would they appear as different hooks in this design?
E: So I guess my question is: will this particular pass — the accelerator-specific lowering — contribute NS-TIR back to the core compiler, or is the ambition to contribute the S-TIR back to the compiler as well, for some reason — say, metascheduling?
G: Yeah, it was from your question in the Discuss thread as well. I think it's a bit misleading how it's drawn here in our graph at the moment: S-TIR will be worked on by schedules, and this of course returns S-TIR as well, and the passes which are currently in the box behind the schedules are passes that are injected into the TVM lowering through the pass context — so these passes do not return S-TIR.
G
They
are
part
of
the
basically
part
of
the
lowering
from
st
to
nsd.
We
are
calling
at
the
moment,
tbm
lower
and
are
injecting
like
the
passes
that
are
registered
on
this
level
into
this
tvm
lower.
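The mechanism just described — accelerator TIR passes are not run as a separate step returning S-TIR, but are injected at registered stages inside the standard lowering (tvm.lower), via something like the pass context — can be sketched with a toy pipeline. All names here are illustrative, not the actual TVM API.

```python
def lower(ir, injected_passes):
    """Toy lowering pipeline with three built-in stages. Passes registered
    for a stage run at that point inside the normal lowering, rather than
    as a standalone S-TIR -> S-TIR step."""
    for stage in range(3):
        ir = ir + [f"builtin_stage_{stage}"]      # built-in lowering work
        for reg_stage, fn in injected_passes:
            if reg_stage == stage:                 # injected accelerator pass
                ir = fn(ir)
    return ir


# Register an accelerator pass at stage 1 (e.g. a buffer-layout rewrite).
injected = [(1, lambda ir: ir + ["accel_layout_pass"])]
result = lower([], injected)
```

The injected pass lands between the built-in stages it was registered around, which is why, as described above, it does not itself return S-TIR — it is part of the S-TIR-to-NS-TIR lowering.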
E: I think that's not likely to happen, then. I have another question, but I'll wait, to give some others a chance.
A: Yeah, I mean, go for it. I think it's useful just to clarify the last point. I just wanted to ask — it seems like...
A
Currently
it's
focused
around
just
working
on
standard
tier
now
and
perhaps
as
the
tensor
ir
kind
of
matures.
Sorry,
if
I
got
that
last
explanation
wrong,
but
as
the
sensory
arm
matures,
then
perhaps
we
could
think
about
injecting
that
instead
of
te
or
something
like
that
further
up
the
pipe,
but
does
is
that
kind
of
does
that
kind
of
match
what
you
guys
were
saying
and
and
is
that
what
manuka
was
asking
for.
E: Yeah, I think in a sense, yes. So I guess I was looking at — sorry — there's your pipeline; we currently leverage TE and TIR derivatives. I was trying to see whether we could — not committing to anything, but exploring the possibility of using this interface. That's the reason for the question, but it seems like the first step in this lowering flow is to get rid of TE and get to S-TIR as soon as possible.
A
I
see
so
it
would
be
more
helpful
for
like
ethos
to
if
we
were
going
to
explore
essentially
using
that
would
do
have
a
way
essentially
to
emit
te
and
and
then
operate
on
that
as
a
pass.
There
right.
A
So
I
think
that
could
maybe
be
added
as
just
a
separate
pass
registration
is
that
does
that
seem
right.
G: I mean, TE is currently supported — these are the custom primitives shown here at the lowering step, so you can inject your own custom TE.
G
But
yes,
and
as
long
as
this
is
a
valid
step
in
tvm
and
it's
not
replaced
by
by
the
tir
or
the
the
relax
ambitions
here,
this
will
be
part
of
yuma
as
well.
E: Yeah, okay. So the last question — hopefully the last — that I had to ask is about something we are already dealing with for the relay-to-TIR hooks in RFC 10: figuring out a lowering order, especially when it goes from Relay to TIR, which might not necessarily match the partitioning order.
E
So
the
reasoning
for
this
comes
from
the
fact,
for
example,
in
datasheet
world
we
would
like
to
see
memory
available
after
cpu
functions
have
been
compiled
before
we
compile
it
as
u
so
it
kind
of
we
are
kind
of
exploring
the
possibility
of.
Can
we
lower
the
c
functions
first
before
those
two
functions
are
lowered,
any
any
thoughts
on
that,
whether
this
interface
could
help
with
that.
G: At the moment, once everything is partitioned, we are calling relay.build, so there is no ordering anymore there beyond what's done in the build.
E
Why
do
you
need
this
monopod?
Why
would
you
need
this
so
says?
As
far
as
I
understand,
related
tf
hooks
takes
the
full
ir
module,
which
not
only
includes
accelerator
functions.
It
also
includes
the
default
c
target
functions
in
this
case.
So
when
we're
doing
the
mutation,
the
ordering
kind
of
matters,
because
the
latter
one
has
more
information
than
what
goes
in
first.
A
Okay,
yeah,
could
you
sorry
could
you
clarify
a
little
bit
more
there
you're
saying
like
when
you're
mutating
the
when
you're,
when
you're
creating
this
ethos
use
schedules
you
want
to
what's
the
reason
you
guys
need
to
know
the
exact
memory
requirements
from
the
rest
of
the
graph.
A: I see, okay, makes sense. So one of the things that we encouraged with this proposal, to start with, was using target hooks as a way to register — sorry, using the target registry, which I think was proposed in the relay-to-TIR hook as a way to register the various hooks here. So maybe we could explore whether there's a way to encode some kind of priorities at the target level as well. It's maybe a little bit tricky, because it might make composition a little bit trickier, but on the other hand, if it's a flow designed for a specific hardware accelerator that would be used in a particular SoC...
A
With
this
you
know,
kind
of
the
back
ends
are
kind
of
all
pretty
well
established
for
that
soc
it
might
mean
that,
like
the
priority
levels
amongst
the
the
back
ends,
that
would
be
enabled
at
any
given
time
wouldn't
be
likely
to
conflict
or
or
wouldn't
have
that
problem.
Would
that
maybe
be
a
way
of
resolving
this.
E: Yeah, I guess we can take that to the forum a bit. It's not that simple, because this order has a degree of freedom: for the optimal lowering flow, the order needs to be interpreted differently at different hooks. That's the complexity.
A
Maybe
we
can
follow
up
with
that
part
on
the
forum
with
some
more
detailed
kind
of
overview
and
all
that.
A
Okay,
are
there
any
other
questions
about
this,
or
does
anyone
want
to
propose
any
other?
I
was
just
curious
if
there's
any
other
folks
in
the
community
that
might
have
explored
accelerator
integration
before
and
kind
of
wanted
to
ask
if
there
was
any
missing
features
or
or
how
can
we
fit
this
in
with
our
efforts
or
anything
like
that,
give
a
second
here
if
anyone
wants
to
bring
anything
up.
H: Can I ask a question? Sure. First of all, thank you for the exciting proposal — this is very interesting.
H
Personally,
I'm
not
very
familiar
with
the
accelerator
passes
so
like
this
can
be
like
a
very,
very
basic
question,
but
like
does
every
isolator
go
through
the
same
sequence
of
passes
or
is
there
any
like
you
know,
specific
special
passage
can?
Can
there
be
a
special
path
for
certain
accelerators.
A: Yeah, I can take this — and folks who have also worked on this, feel free to chime in here as well. In general, we don't have a specific set of registered passes, and this proposal isn't necessarily trying to box everyone into using one particular flow.
A
But
what
is
challenging
with
tvm
is
that
there
is
a
lot
of
choice
in
in
the
way
that
you
might
want
to
go
about
implementing
well,
almost
any
sort
of
processing,
pipeline
or
compilation
pipeline
on
top
of
a
relay
graph,
and
there
are
some
common
touch
points
that
are
actually
helpful.
If
you
kind
of
can
can
reuse,
they're,
actually
kind
of
the
the
common
touch
points
that
are
mentioned
as
the
the
sort
of
the
plumbing.
I
guess
of
the
apis
here.
A
One
of
them
is
the
the
part
that
divides
or
partitions
the
graph
between
an
offloaded
portion
and
sort
of
a
portion.
That's
compiled
using
kind
of
a
target
host
flow
or
essentially
compiled
for
a
cpu.
A
The
goal
with
this
proposal
is
basically
to
collect
all
of
those
common
touch
points
together
into
a
single
api
so
that
at
least
it's
you
know,
it's
not
going.
It's
not
attempting
to
sort
of
box
people
in
here,
but
the
idea
is
that
it
at
least
kind
of
provides
a
path
that
that's
kind
of
like
a
way
to
get
started
and
and
then,
as
you
need
to
deviate.
Obviously
you
can
then
dive
deeper
beyond
the
porcelain
into
the
plumbing.
A
If
you
need
to
maybe
I'm
missing
some
other
parts
that
others
want
to
chime
in
here
as
well,.
F: I think that was a pretty good summary of the intention here. I wouldn't say that there are one or two really special schedules or primitives for accelerators; rather — I'm coming from the hardware side — as a hardware engineer, you think: how would I want the software to use my accelerator? And that's how you build your passes.
A: Right, yeah, that makes sense. Cool. Okay, any other questions from the community, or things anyone wants to raise here?
I: Yeah, good question — yes, hi everyone, Federico here, also from Germany. I am fairly new to TVM, so this also might be a simple question, but I have been working on integrating an accelerator of my own, and I noticed that VTA uses these kinds of memory regions to declare scopes for buffers that are not in the main memory — that are inside the accelerator's internal memory. Let's say block RAMs or scratchpads and so on. And I have been wondering if you have thought about implementing these kinds of APIs to register these memory regions in UMA.
F: Yep, thanks for the question. So — let me try to share again — this would be exactly the hooks where the schedules and passes can be brought in. For the VTA you have pretty specific schedules for how layout and internal scopes are handled, and I think where you could bring this in would be by writing schedules and passes, together with custom primitives for your accelerator if you need them.
E: Yeah, so adding on to what Kyle said, I think that's right. When you have the freedom to add your own scheduling passes, I think the storage scope is the way we kind of use that: a "local" storage scope usually indicates that the target primitive function has this internal memory. Then USMP, the memory planner, plays a role if the accelerator has a shared-memory model, where certain memories can be accessible by both the CPU and the accelerator — in which case you can declare the scope to still be global. USMP has the feature of specifying which targets can access which pools, so in the end there would be a workspace pool generated that you're supposed to place in the memory accessible by both the CPU and the accelerator, if that's something the accelerator wants.
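The USMP idea just described — declaring memory pools together with the set of targets (CPU, accelerator) that can access each, so the planner places workspace buffers only in pools accessible to every target that needs them — can be sketched with a toy model. Names are hypothetical, not the actual USMP API.

```python
class MemoryPool:
    """Toy model of a USMP-style memory pool with per-target accessibility."""

    def __init__(self, name, size_bytes, accessible_by):
        self.name = name
        self.size_bytes = size_bytes
        self.accessible_by = set(accessible_by)


def pick_pool(pools, required_targets):
    """Return the first pool accessible by all required targets, or None."""
    required = set(required_targets)
    for pool in pools:
        if required <= pool.accessible_by:
            return pool
    return None


pools = [
    # Internal scratchpad: only the accelerator can see it ("local" scope).
    MemoryPool("accel_sram", 64 * 1024, {"accelerator"}),
    # Shared DRAM: visible to both CPU and accelerator ("global" scope).
    MemoryPool("shared_dram", 1024 * 1024, {"cpu", "accelerator"}),
]

# A workspace buffer both the CPU and accelerator touch must land in shared DRAM.
shared = pick_pool(pools, {"cpu", "accelerator"})
```

A buffer needed only by the accelerator can fall into the internal scratchpad, while anything touched by both sides is constrained to the shared pool, which is the placement behavior described above.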
A: Yeah — if not, I think we probably have just eight minutes left before the top of the hour, so I won't dive into the codeowners thing with only eight minutes left. Perhaps we can wrap up then, if that's all the discussion to be had. Going once...
J: Going twice. I will just say, about the code owners, that the RFC is on the tvm-rfcs repo, if anyone wants to look at it in the meantime.
A: Yeah, maybe we could give a brief, one-minute plug for it or something like that. I had a couple of tabs I was going to show, but I can no longer use Zoom on my computer, so I will not do that — somehow I seem to still be able to talk to you.
A
So
one
of
the
problems
we
have
right
now
with
reviewing
prs
is
that
we
adopted
this
code
owner's
mechanism
from
github
last
year
and
that
sort
of
automatically
requests
reviews
from
from
folks,
depending
on
the
sort
of
code
paths
that
are
touched
in
the
tdm
repo.
Unfortunately,
what
that
does,
though,
is
because
of
the
way
tbm
is
organized.
It
means
that
the
same
set
of
core
committers
get
blasted
for
every
single
pr.
A
That's
not
every
single
pr,
but
many
of
the
pr's
that
are
raised
even
quite
simple
pr's
and
it
becomes
very
difficult
to
figure
out
who's
who's,
owning.
What
and,
and
who
is
sort
of
should
be
following
up
on
on
which
pr,
as
well
as
kind
of
sort
through
the
the
github
review
spam
that's
coming
into
to
your
inbox.
And
so
we
want
to
get
away
from
this
to
make
it
more
possible
for
for
folks
to
to
review
prs
and
so.
A
I
know
sorry,
I
think
it
times
me
out
after
a
little
while
anyway
there's
a
proposal
up
that
basically
explains
sort
of
motivates
a
new
way
to
sort
of
assign
reviewers
via
mentioning
them
in
the
pr,
by
using
cc's
and
kind
of
a
way
to
ping
those
pr's
if
they
become
stale
like
if
there's
no
traffic
for
a
week,
for
example,
so
I'd
encourage
everyone
in
the
community
to
take
a
look
at
that
it
can
might
influence
kind
of
how
we
all
review
code.
A
But
I
think
overall,
a
lot
of
the
core
committers
are
pretty
positive
on
this
proposal,
so
I
think
it's
likely
we'll
accept
it
and
if
there's
any
discussion,
it'd
be
great
to
have
that
discussion
on
the
rfc
or
the
discuss
forum.
So
how
about
we'll
we'll
give
a
plug
for
that
and,
as
always,
we're
recording
everything
or
taking
notes
and
everything
and
we'll
post
notes
on
the
discuss
forum
there
so
great
yeah.
A
If
that's
all,
I
think
we
might
call
this
meeting
a
wrap
and
again
we
kind
of
have
this
standing
time
slot
weekly.
We
may
not
meet
every
single
week,
but
if
there's
more
that
wants
to,
if
there's
anything
you
want
to
discuss,
please
post
it
up
in
the
agenda
doc.