From YouTube: OMR Compiler Architecture 20190221
Description
A description of a new benefit-driven inliner developed as a research project at the University of Alberta, and a discussion around contributing it to Eclipse OMR.
A: Okay, so welcome everyone. Today we have a sort of special topic: Andrew Craik and Erick are going to talk about some inlining technology that was under development as a research project with the University of Alberta, and the intention is to contribute that to OMR if possible. The purpose of today's meeting is for Andrew and Erick to take us through the technology and for us to understand the best way of integrating it within OMR. So I'll turn it over to Andrew. We've got some slides, yeah.
B: And they should be on the share for anybody who's looking at the WebEx. Thank you for the introduction, Daryl. I'm going to do pretty much most of the talking today. This work is a joint research project between IBM and the University of Alberta, and the heavy lifting on the implementation, as well as a lot of the innovations in how we do some of the inlining, is Erick's work.
If you look at what OMR has at the moment in the way of inlining, there is one inliner, and it is called the trivial inliner. It's a very basic inliner that works by inlining a limited number of small methods. It has a sort of budget heuristic, and it takes only small things to inline. It has no real concept of what the best thing to inline is; it just picks small things and inlines them. It's the minimum viable concept of an inliner.
The inliner in OpenJ9, by contrast, is very Java-centric, and its operation is full of heuristics. It does a form of guided eager inlining with some backtracking. It can miss opportunities, because it tends to go in a depth-first manner and has a budget: once the budget is used up, if it hasn't searched a particular path, that path won't even be considered.
B
It
has
a
single
metric
that
it
uses
to
judge
the
worthiness
of
in
lining
something,
and
that
means
that
it
can
conflate
a
small
method
with
a
low
benefit
and
a
large
method
with
a
large
benefit,
because
the
division
will
kind
of
result
in
the
same
answer
and
you
don't
have
a
good
way
of
choosing
between
them
and
the
code
is
relatively
convoluted
just
because
of
all
of
the
development.
That's
happened
over
time
and
it's
a
bit
hard
to
reason
about
and
control
the
inlining
in
some
circumstances
in
there.
So obviously this is not ideal for the OMR project. When we sat down to start this project, our question to ourselves was: how can we do better? If the one in OpenJ9 isn't one that we could generalize into OMR nicely, and the one in OMR isn't that smart yet, how can we build a better one? If you start from first principles, inlining provides a number of benefits to the optimizer. It reduces function call overheads:
B
When
you're
executing
the
program
we
call
less
stuff
and
it
provides
improved
opportunities
for
optimization
by
amalgamating
code
units
together,
you
can
discover
facts
that
you
cannot
discover
looking
at
them
in
isolation
and
therefore
you
can
do
better
things
to
the
code
to
make
it
go
faster.
Now,
inlining
can
also
have
negative
effects
right
if
the
method
gets
too
large,
it
can
be
hard
for
it
to
be
easily
compiled.
B
Now,
if
you,
if
you
look
at
the
current
state
of
the
art
in
in
liners,
not
just
in
Colmar
and
open
j9
but
sort
of
across
academia
and
across
industry,
most
of
these
pretty
much
all
of
these
in
liners
are
guided
using
sort
of
a
single
metric.
So
you
have
a
budget
of
some
kind
and
you
choose
candidates
to
inline
until
you
fill
your
budget
right,
so
a
standard
knapsack
packing
problem.
In setting out to look at building a new inliner, which was the goal of the research collaboration that we had, we wanted to separate the notions of cost and benefit. The cost is the amount of space, or the number of instructions, or whatever, that you're willing to grow the method by (your budget), and the benefit is how much better we think the program will be by having inlined that method.
So benefit is a measure not only of saving the function call overheads, but also of the opportunities for optimization that inlining can unlock. Benefit also necessarily needs to include a notion of relative execution frequency. If cost is just the size of things, having execution frequency factored into the benefit makes sense, because if you have two things with equal optimization opportunities, the one that's more frequently executed is the one that you want to inline. And we wanted to make the inliner guidance in the new implementation much more scientific.
If you go and look at the multi-target inliner, some of its decisions can at times appear rather magical; it requires some careful analysis of the code to understand why it chose to do what it did, and changing that decision can be tricky. So, the basis for this research project, background work we had done at IBM before we started this collaboration, was that we developed an algorithm to solve the knapsack packing with dependencies problem.
B
You
have
a
backpack
of
a
given
size,
you
have
objects
of
various
sizes
or
weights,
and
you
wish
to
fill
the
backpack
as
full
as
you
can.
That
is
a
solvable
problem.
It
has
well
known
algorithms
in
the
literature
now,
if
you
add
dependencies
between
those,
so
you
can
only
include
a
if
you've
included
B,
for
example,
there
are,
there
are
very
few
algorithms
that
can
can
solve
that
problem.
What we developed was an algorithm to solve that problem. Now, this algorithm hasn't been formally proven to be optimal in all cases yet, but in practice it does produce optimal solutions: we set it quite a lot of different problems to solve, including ones modeled off of inlining, and it was able to solve those problems and produce the optimal result. The algorithm is based on dynamic programming, and it uses two layers of backtracking to allow the optimization, during the search, to find the best inlining solution.
What I'm going to do is run very briefly through how this algorithm works, just so you get a flavor of how this knapsack packing problem is solved, because it's intimately tied to the representations we've chosen for the new inliner and to how the new inliner we built operates. The fundamental currency of this algorithm, the fundamental data structure behind it, is something called the inlining dependency tree, or IDT.
The IDT is derived from a call graph, but the thing you need to note is that each node is call-site-specific. In this example here, A calls B and C, so you have an A connected to a B and an A connected to a C; B and C call D and E, so you connect those too. If A were to call B twice, there would be two B nodes as children of A, where B1 would represent one call site and B2 the other.
We take this inlining dependency tree and we annotate the nodes with costs and benefits. The notation shown on the tree in the slide has the cost on the left-hand side of the slash and the benefit on the right, so node E has a cost of 1 and a benefit of 7. Therefore, if you have enough budget to inline E, you're probably going to want to inline it, because it's the most beneficial thing. There was a question in the room.
In the description of the algorithm as we wrote it, when you're building the IDT you give the algorithm the budget that you're going to work with, so for recursive calls the node is repeated as a child of itself out to the maximum depth that your budget can accommodate. It effectively does loop unrolling by representation.
So the costs have no transitive notion: the costs and benefits are isolated to the node itself. And for a given budget, we wish to pick the most optimal subset that we can to inline.
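To make that concrete, here is a minimal sketch of an IDT node, written in Python in the spirit of the prototype mentioned later in the talk; the class and field names are illustrative, not the actual implementation.

```python
# Illustrative IDT node: costs and benefits are per-node, not transitive,
# and inlining a node requires that its parent be inlined first.
class IDTNode:
    def __init__(self, name, cost, benefit, parent=None):
        self.name = name          # method at one specific call site
        self.cost = cost          # size of this method body alone
        self.benefit = benefit    # estimated benefit of inlining it alone
        self.parent = parent      # dependency: parent must be inlined too
        self.children = []        # one child per call site in this method

    def call(self, name, cost, benefit):
        child = IDTNode(name, cost, benefit, parent=self)
        self.children.append(child)
        return child

# The tree from the slide: A calls B and C, B calls D, C calls E.
# Node E's cost 1 / benefit 7 comes from the slide; the rest are made up.
A = IDTNode("A", cost=1, benefit=1)
B = A.call("B", cost=1, benefit=2)
D = B.call("D", cost=1, benefit=4)
C = A.call("C", cost=1, benefit=3)
E = C.call("E", cost=1, benefit=7)
```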
The search is a dynamic programming algorithm. It works with a table, and it considers the nodes in the IDT in a post-order traversal, taking siblings at the same level from lowest benefit to highest benefit.
We're going to run this algorithm for a budget of five, so we've created a table with five columns: column one is the solution for a budget of one, column two is the solution for a budget of two, and so on. We then begin considering nodes in the order that I described. First we consider node A. Node A has a cost of one, and for all columns it is the only node that we've considered so far, and we have enough budget.
B
Therefore,
we
will
align
it:
okay,
considering
node
B,
okay,
so
node
B.
When
we
look
at
column
one
for
node
B,
we
can't
inline
node
B
on
its
own,
because
B
depends
upon
a
so
when
we
go
to
try
and
put
B
in,
we
have
to
subtract
B's
cost
from
the
available
budget
and
then
look
at
the
previous
solution.
If
we
subtract
these
cost
from
the
budget,
the
remaining
budget
is
zero.
There's
no
way
that
we
could
fit
it
in
there
for
the
solution
in
column.
One
is
a
in
column.
In column two we have B: we subtract the cost of B, which is one, and we look at the solution in the previous row at column one, which is our remaining budget, and which was A. Can we combine A and B? Is the solution valid, that is, are all of the dependencies of B included in that set so that we can graft B onto it? The answer is yes; therefore the best solution so far at budget two is AB, and that holds for the remainder of the columns.
So it's looking at the previous row, which is what these arrows indicate. For node D, a similar operation happens. In column two you still have AB, because there is no way to fit D in: if you consider D and look back at column one, taking one off, you have A; well, you can't have AD, because D's dependency B is missing, so then I have to consider D together with its predecessor B. So now I want to inline BD.
The cost of that is two, and that uses up all of the budget, so there's no way to possibly inline BD: the best solution at two is still AB. At three, when we look back at column two in the row before, AB is there, and we can graft D onto that without violating the dependencies, so its solution is ABD. Likewise for C. Now, the more interesting case is when we consider node E: look at the last row, row E, and look at column three.
We begin by considering E, which has a cost of 1, and we look back at the row before, at AB. But we can't graft E on there: the dependency C is not included in that set, so it's an invalid solution. So we include E's parent: now we're wanting to consider inlining CE. We consider CE and compare that with (sorry, that arrow should be pointing at row C, not row D)
row A at column 1, and you end up with ACE as the solution. And this shows that we've actually backtracked. The interesting thing is, if you look down column 3: when we had only considered up to C, we had decided we wanted ABD, which is down one side of the tree. The backtracking has undone both of those decisions and now selected ACE as the path to inline.
The other case has to do with whether there's an overlap. If you had, say, AC as a solution and you wanted to graft CE onto it, you'd find out there's an overlap; that overlap indicates sub-optimality, and then you continue backtracking up that column. So there are these other ways of grafting the solutions.
D: But because there's only one edge ever coming in...
B: Yes, it's kind of implicit where it comes from: you can derive it just by having the set of nodes. And because it's a tree, you know that to inline anything you have to inline the root, and then the rest is paths from the root to the various nodes, which may pass through things in the subset; but none of the paths in the solution will pass through a node that's not in the solution.
That's because of the dependency part of the algorithm. Okay, so that's the basics of how this algorithm works. This was an input to the research project: it was formulated sort of in isolation, with the idea that it would be useful for inlining, but we hadn't actually built an inliner that used it. We had an implementation in Python that we used to prototype it, and in that prototype I had cost numbers and benefit numbers.
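To give a flavor of what such a prototype looks like, here is a simplified Python sketch of the dependency-constrained packing over the IDT sketched earlier. It is a reconstruction from the description above (a table indexed by budget, nodes considered in order, and a candidate that grows upward through its unsatisfied dependencies), not the actual algorithm, which handles overlaps and backtracking more carefully.

```python
def pack_idt(root, budget):
    """Sketch: best set of IDT nodes to inline within `budget`.

    table[b] = (benefit, frozenset of nodes) is the best valid solution
    found so far at budget b; valid means every node's parent is present.
    """
    def walk(n):
        yield n
        for c in n.children:
            yield from walk(c)

    table = {b: (0, frozenset()) for b in range(budget + 1)}
    for node in walk(root):
        for b in range(budget, 0, -1):
            chain, n = [], node
            while n is not None:
                chain.append(n)
                rest = b - sum(x.cost for x in chain)
                if rest < 0:
                    break
                prev_benefit, prev_set = table[rest]
                if n.parent is None or n.parent in prev_set:
                    # Dependencies satisfied: graft the chain on.
                    gain = sum(x.benefit for x in chain
                               if x not in prev_set)
                    if prev_benefit + gain > table[b][0]:
                        table[b] = (prev_benefit + gain,
                                    prev_set | frozenset(chain))
                    break
                n = n.parent  # missing dependency: backtrack up the tree

    return table[budget]

# With the example tree above and a budget of 3, the solution flips from
# ABD to ACE once E's large benefit is considered, as in the walkthrough:
benefit, chosen = pack_idt(A, 3)
print(benefit, sorted(x.name for x in chosen))   # 11 ['A', 'C', 'E']
```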
Cost numbers are a relatively straightforward thing to have some intuition about how to derive: they'll be a measure of the number of instructions in some way. It could be the number of instructions, say Java bytecodes, in the thing you're planning to inline, if you're doing inlining for Java; or it could be the number of nodes in the tree representation of the method you're planning to inline. It's just the cost of inlining,
how much this thing is worth in terms of code size. The thing that's a lot more magical, and was left as sort of an unknown in the design of this packing algorithm, is how you derive the benefit. What is the benefit number? How do you cook that up? That's really where Erick's research picked up: trying to figure out how to build a benefit number, how to compute the benefit.
B
Well,
there
are
really
two
components
that
we
want
to
consider
in
the
benefit
as
I
sort
of
stated
earlier.
We
want
to
consider
the
frequency
that
is,
if
two
things
have
the
same
cost
two
in
line.
We
wish
to
inline
the
one
that
is
run
more
frequently,
because
we
will
save
more
in
terms
of
the
call
overhead
and
the
other
one
is
optimization
opportunity
if
two
things
have
the
same
hotness
in
it
same
execution,
frequency
and
same
size.
B
We
would
like
to
inline
the
one
that
is
going
to
provide
more
opportunity
for
the
optimizer
to
optimize
it,
because
you
will
end
up
with
a
program
that
runs
faster
right
so
in
the
formulation
that
eric
has
the
frequency
of
a
call
within
a
method.
So
we
have
to
derive
a
frequency
ratio
right,
so
the
ratio
is
derived
as
the
ratio
of
the
inche
method,
entry
frequency
to
the
frequency
of
the
call
site,
and
in
java
we're
doing
this
just
with
the
standard
profiling
infrastructure
that
we
would
normally
use
that
block
frequency.
B
So
the
API
for
doing
that.
Our
API
is
that
already
exists
in
Omar.
That
ratio
can
be
calculated.
It's
calculated
for
a
single
call
right,
so
it's
a
single
node,
and
so
you
can
get
a
multiplicative
factor
if
you
follow
a
path
right.
So
you
know
if
going
from
A
to
B
is
ten
times
hotter,
but
then
from
there
you
go
to
something:
that's
cooler,
you!
You
know
you
get
a
fractional
multiplication
and
the
whole
thing
kind
of
works
out
as
you
do
it
transitively
across
the
tree.
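As a toy illustration of that transitive scaling (a sketch only; OMR's real block-frequency APIs are not shown here), assume each IDT node carries the ratio of its call site's frequency to its containing method's entry frequency:

```python
def relative_frequency(node):
    # Multiply the per-edge ratios along the path back to the root.
    # node.local_ratio (an assumed field) is call-site frequency divided
    # by the containing method's entry frequency, from block profiling.
    factor = 1.0
    while node.parent is not None:
        factor *= node.local_ratio
        node = node.parent
    return factor

# e.g. a call 10x hotter than its caller's entry, reached through a call
# running at 0.5x, contributes a combined scaling factor of 5.0.
```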
Essentially, we want to model which optimizations may be unlocked by the inlining of a method: where there's information from the caller that will allow the optimizer to do something good in the callee that we would not have been able to do in isolation. So the idea that Erick adopted and ran with was to run an abstract interpreter over the program representation, computing symbolic values, or constraints on values, so that we can model what we know about values in the program. In his current implementation, this is an abstract interpreter
B
Over
the
java
bytecodes,
we
chose
the
java
bytecodes
because,
in
the
context
of
open
j9
use
of
OMR,
the
generation
of
trees
is
expensive.
It
consumes
a
considerable
amount
of
memory
and
compile
time
and
so
writing
an
abstract
interpreter
to
traverse
the
trees.
Well,
we
would
necessarily
have
to
generate
the
trees
in
the
first
place,
which
was
prohibitively
expensive.
Now
you
could
do
it
over
trees,
but
for
reasons
of
efficiency
and
comparability
to
the
existing
inliner,
we
did
it
over
the
byte
codes.
This abstract interpreter is run starting at the root method, and you start constructing constraints on the values. When you get to a call, you begin interpreting the callee: you take the callee in isolation and interpret it, and what you're trying to do in that interpretation is find the opportunities for optimization. These are done sort of as pattern matches in terms of the operations that are being seen; I've got some examples of the ones that are currently looked for.
When we see one of those opportunities, we want to record the dependency of that opportunity on the parameters of the function. Say you're going to do branch folding: one side of the branch is a constant, and the other side is an expression that is dependent upon an input parameter.
We can derive a constraint that says: if the parameter is between value A and value B, or is equal to value X, or whatever, then this branch can be folded one way; and if it's constrained in another way, we can fold the branch the other way. What we do is record those in a table, and we store that as a summary. So what you end up with is a summary of potential optimization transformations, and constraints, in terms of the parameters, that will allow those opportunities to potentially be realized.
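A minimal sketch of what recording such an entry could look like, continuing the Python toy model (the names are hypothetical, and constraints are shrunk to integer ranges for brevity; the real implementation reuses value-propagation constraints, discussed below):

```python
# One potential-transformation entry in a method summary. A parameter
# absent from param_ranges is unconstrained ("free").
class OpportunityEntry:
    def __init__(self, kind, benefit, bytecode_index, param_ranges):
        self.kind = kind                      # e.g. "branch-folding"
        self.benefit = benefit                # unitless benefit estimate
        self.bytecode_index = bytecode_index  # where the opportunity is
        self.param_ranges = param_ranges      # param position -> (lo, hi)

# "If parameter 0 lies in [1, 10], the branch at bytecode 42 folds":
entry = OpportunityEntry("branch-folding", benefit=3, bytecode_index=42,
                         param_ranges={0: (1, 10)})
summary = [entry]   # a method summary is a list of such entries
```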
D: So interpreting and creating constraints: that does incur some overhead as well.
B: It does.
D: But I'm saying it has the same feel as walking over bytecodes: you have to walk over the bytecodes either way, and you generate nodes in one case and constraints in the other. So is that really that much less expensive? I understand you have data, so I'll wait for it.
B: If the abstract interpreter were written in terms of the trees, we would have to generate them, so we chose to skip that step and do it directly, for reasons of efficiency. But you could build one to work on OMR's tree representation directly; we just didn't do that because of the constraints in OpenJ9, where we were working.
D: Yes, that makes sense. So the other question I have is around the API.
B: So the API for calling into the abstract interpreter is defined now in terms of how you would pattern-recognize the opportunity that you are looking for. Currently those patterns are formulated in terms of the bytecodes, because the pattern matching is implemented in the abstract interpreter. So if you were to write a different interpreter, you would have to recognize the patterns that would allow you to perform an optimization; but the optimizations being recognized are optimizations that exist in OMR, so they are not Java-specific optimizations.
And so once we produce this summary table, we go back to looking at the call site. The call site has constraints on the actual arguments that are going to be at that call site for that particular call. You can intersect that set of constraints with the constraints for each potential optimization, and where the answer is that it unifies and matches, that is, the constraints in the table are satisfied, you can take those optimizations: those are the ones that have the potential for occurring. We store a benefit metric with each of those potential optimizations, which I'll cover; you can add those together, multiply them by the scaling factor, and you get the benefit of inlining that particular callee.
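Continuing the toy model, the call-site filtering and benefit computation might look like this (again a sketch under the same assumptions, including the earlier frequency factor):

```python
def satisfies(required, actual):
    # Both are (lo, hi) ranges: the argument satisfies the requirement
    # when every value it can take falls inside the required range.
    return required[0] <= actual[0] and actual[1] <= required[1]

def inlining_benefit(summary, arg_ranges, frequency_factor):
    # Keep only the opportunities whose parameter constraints are met by
    # what we know about the arguments here, then scale by frequency.
    total = 0
    for e in summary:
        if all(p in arg_ranges and satisfies(req, arg_ranges[p])
               for p, req in e.param_ranges.items()):
            total += e.benefit
    return total * frequency_factor

# The branch-folding entry above fires when argument 0 is known to be 7:
print(inlining_benefit(summary, {0: (7, 7)}, frequency_factor=2.0))  # 6.0
```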
If the answer is yes, it satisfies the constraint, and it does that for all of the parameter constraints for that optimization (because we have one for each of the parameters, we intersect each parameter); and if the answer is yes for all of them, then that optimization can be unlocked by the information in the caller if you inline that callee.
The interpreter handles four kinds of values. Erick has taken most of the constraints in value propagation and employed them, so the ones being used, which is what I have on this next slide, are: integers, in terms of integer ranges or specific values; strings, in terms of constant strings (there's a representation for that in value propagation, and we use that); objects, in terms of constraints on the class type and on null-ness; and arrays, with constraints on the array size as well as on the type of the elements.
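In the same toy spirit, those four constraint families might be shaped like this (hypothetical Python stand-ins for OMR's value-propagation constraint objects):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class IntConstraint:          # integer range, or a specific value (lo == hi)
    lo: int
    hi: int

@dataclass
class StringConstraint:       # a known constant string
    constant: str

@dataclass
class ObjectConstraint:       # class type and null-ness
    klass: Optional[str]      # None: type unknown
    is_null: Optional[bool]   # None: null-ness unknown

@dataclass
class ArrayConstraint:        # array size and element type
    length: Optional[IntConstraint]
    element: Optional[ObjectConstraint]
```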
B: That method summary is stored, and it is independent of the context of the call: you've interpreted the method in isolation and produced symbolic constraints in terms of symbolic parameters, and that tells you there's a set of potential transformations such that, if the constraint for a given one is satisfied, the optimization can potentially happen. It's not a guarantee, but it means the information should be there.
When you encounter the call, you build the summary the first time if you don't have it. Then, when you come back to the call site, you have a set of constraints for the arguments at the call site, and you ask: does the set of constraints I have at the call site satisfy the constraints I needed for a particular optimization? That allows you to filter the set of potential optimizations down to a set that can actually happen,
B
Based
on
the
information
that
you
have
available
right
and
if
I
need
to
point
out
that
it's
not
you
can
have
false
positives
and
false
negatives
with
this,
and
that
will
just
mean
that
you
will
make
inlining
decisions
that
are
not
optimal
because
you
approximated
in
some
way
or
had
a
lack
of
information.
It's
not
a
correctness,
concern
right.
The precision of this abstract interpretation, whether, say, in a loop you compute a fixed point, or you do a single iteration, or multiple iterations, is a function of how much time you're willing to invest in the abstract interpretation, and you can get a more precise or a less precise answer. The less precise you are, the more likely you are to make a suboptimal choice; but if it's good enough most of the time, you will get a good answer.
The method summary table is computed once for a method. At the moment it's computed in each compilation, but in theory it could be shared across compilations if you wanted to: unless the method is redefined, the summary will remain the same, because it's produced by running an abstract interpreter, a state machine over the bytecodes, and the answer is independent of any context in which that method is being called.
C: So if you have an opportunity where the specialization spreads across more than one caller, deeper than one call: this parameter is passed as the parameter to a called method, but it's used in a callee of that callee?
B: Yes, you will catch that, as long as at some point in the algorithm you can find a way to inline the first one; and then, ideally, when you're evaluating the second one, the benefit will be high, because there will be this synergy.
So on the screen I have one of these sample method summaries, with just some examples of branch folding. It records a benefit metric for the branch folding (at the moment, in the Java bytecode abstract interpreter, that's the number of bytecodes that will be eliminated if that branch fold happens), and it records the location of the opportunity and the constraints in terms of the parameters. A blank means that argument is free: we don't need it to satisfy any particular value.
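A hypothetical rendering of such a summary, with made-up numbers, just to fix the shape in mind:

```
opportunity     benefit  location      param 0          param 1
branch-folding  4        bytecode 12   int in [0, 0]    (free)
branch-folding  9        bytecode 57   (free)           non-null String
```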
So we implemented this with the Java bytecodes in OpenJ9. At the moment the optimizations modeled are branch folding, null check elimination, checkcast elimination, folding of constant-length strings, and some opportunities for partial evaluation, where you would be able to do a compile-time evaluation of part of an expression because you have some constants that you can fold away. We've been able to run a large amount of stuff with this; it's managed to run DayTrader and things like that.
The metrics: the total CPU time consumed by the compilation threads, as reported by the verbose log (the vlog); compile memory, the total memory consumed during compilation, also from the vlog; generated code size, as in the number of bytes of instructions generated, also from the vlog (just a subtraction of the addresses specified as the range for the method); and the runtime, which is the time to execute the final iteration after the warmup period, i.e. a representation of the steady-state throughput.
There were three configurations in this evaluation. There was a baseline, which is the current OpenJ9 heuristic inliner, the multi-target inliner. There's what's labeled frequency, which is this new inliner algorithm with all the benefits set to one: basically just inlining based on frequency. And then there's analysis, which is the new inliner using both the frequency scaling and the abstract interpretation, so we're adding the cost of the abstract interpretation in the hope that it will do something good.
Okay, so run time. All of these bars are normalized to the baseline, so all the baseline bars (the dark blue, leftmost bars) are 1. The middle, orange bars are the frequency inliner, and the gray bars are the analysis inliner, where we added the abstract interpretation. Lower is better:
B
So,
if
you're,
if
you're
running
for
longer
you'll
have
a
higher
score
right,
so
fought
is
not
doing
as
well,
because
it's
running
slower
right,
zahlen,
right
we're
so
generally
it's
roughly
on
par,
there's
Lu
index
and
fought
where
it's
somewhere,
but
between
10
and
20%.
Worse.
In
terms
of
run
time,
sorry.
D
B
B
Okay, so that's the runtime: in general it's on par with the current heuristic inliner. There are two outliers, fop and luindex, where it's not managed to do as well. Note that we were modeling a very limited set of optimizations, so optimizations that may have been important for those benchmarks may not have been modeled: luindex, for example, is a very loop-intensive benchmark, and beyond the frequency there was nothing modeled that specifically targets the things luindex would need. Now, compilation time again:
this is normalized to one, and lower is better. For example, on lusearch the new inliner consumed about 70% of the compile time of the current inliner-based solution; on sunflow it was getting up towards 2x the compile time, and I'll comment on that afterwards.
D: Does this run at warm?
B: You give it its budget, and it's going to stick to that budget; it's not going to increase it or decrease it depending on what it sees. So it can be a bit of a moving target that you're trying to compare to. Erick spent quite a lot of time trying to get the sizes as close as possible in terms of budget, but it's certainly possible that there are variations happening that weren't nailed down.
We studied the top-most methods, but there's still variance that could occur. I just wanted to comment on a few things from that analysis. In general, the new inliner is more expensive than the current OpenJ9 inliner in terms of compile time and memory. There are a couple of things I want to call out. First, the current inliner does not do a full exploration of the state space: it is an eager inliner.
B
The
compile
time
is
generally
comparable.
We
had
one
case
where
it
was
sort
of
2x,
but
a
lot
of
those
bars
were
very,
very
close
and
in
some
cases,
lower
right
so
in
general,
is
doing
fairly
well
on
that
the
memory
was
in
within
20
percent
of
the
baseline
and
again
considering
that
it's
doing
a
full
state
space
exploration.
Some
growth
is
to
almost
be
expected.
B
The
abstract
interpretation
is
relatively
cheap
right.
The
difference
between
the
orange
bars
and
the
gray
bars
in
terms
of
the
memory
and
the
compile
time
right,
there's
not
a
significant
difference
right,
you
don't
it
doesn't
cost
you
that
much
to
run
the
interpreter.
Now,
obviously,
as
you
add
more
patterns
or
more
complex
patterns,
the
cost
will
go
up,
but
the
majority
of
the
costs
at
the
moment
is
doing
the
state
space
exploration
with
the
gang
algorithm.
B
Now
the
inline
new
in
liner
can
produce
the
same
performance
with
less
code,
so
we
saw
that
with
like
lieu
index.
It
produced
something
like
about
25%
less
code,
but
it
ran
at
the
same
throughput
and
the
runtime
performance
is
generally
pretty
good.
Considering
the
number
of
optimizations
modeled
and
the
lack
of
any
Java
specific
heuristics
right,
the
the
baseline
has
several
decades
worth
of
knowledge
of
Java
and
what
things
are
good
to
do.
15
liner
has
none
of
that.
B
Now,
there's,
certainly
a
lot
of
room
for
this
to
continue
being
expanded
upon
the
research
collaboration
between
IBM
and
the
University
of
Alberta
is
continuing
and
Kareem
Ali,
who
is
Eric's
supervisor
in
the
principal
research
investigator
at
the
University
of
Alberta
on
this
project
and
I
have
discussed
that
there
will
probably
be
another
master's
student
starting
later
this
year
to
continue
work
on
on
this
inliner.
A
lot
of
the
information
propagation
that
I
was
describing
is
downward
information
propagation,
so
I
know
something
in
the
collar
I
wish
to
use
it
in
the
Kali.
B
It's
not
something
that
we
currently
have
and
I
don't
expect
that
it
would
be
something
that
the
University
of
Alberta
would
produce
at
this
time,
because
they're
interested
in
the
questions
around
how
to
model
optimization
cheaply
how
to
guide
the
inliner
in
a
better
fashion.
The
bad
abstract
interpreter
is
more
engineering
less
interests,
so
we
would
like
to
contribute
this
work
to
omar,
so
the
proposal
for
how
to
contribute
this
would
be
that
the
core
of
the
inliner
would
be
contributed
to
omr,
and
that
would
be
the
knapsack
packing
algorithm
implementation.
B
That
would
be
the
all
of
the
code
for
basically
doing
the
unifications
of
the
method.
Summaries.
The
interfaces
for
the
method,
summaries
all
that
stuff
and
to
contribute
an
abstract
but
unimplemented
api.
For
the
abstract
interpreter,
so
being
it
for
so
that
the
inliner
has
something
to
call
to
be
able
to
do
the
abstract
interpretation.
But
there
would
be
no
implementation
there
in
omar.
B
B
Erick, I believe, has written it with an abstract API in mind: there's a well-defined set of connections between the abstract interpreter and the inlining machinery that's actually driving this, so that API exists. The API for doing the recognition of the patterns, I don't believe, currently exists; that's kind of too tied up in the abstract interpreter, so that would be something to pull out later on. But the actual "here's the set of hook points that you need to call into the abstract interpreter to do this thing" exists.
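As a rough sketch of what an abstract-but-unimplemented interpreter API could look like (hypothetical names, not the actual hook points in Erick's code):

```python
from abc import ABC, abstractmethod

class AbstractInterpreterAPI(ABC):
    """Hypothetical shape of the hook the core inliner would call.

    OMR would ship only this interface; a downstream project (OpenJ9 over
    Java bytecodes, or an interpreter over OMR trees) implements it."""

    @abstractmethod
    def summarize(self, method):
        """Abstractly interpret `method` in isolation and return its
        method summary (potential optimizations + parameter constraints)."""

    @abstractmethod
    def argument_constraints(self, call_site):
        """Return what is known about the actual arguments at `call_site`,
        for intersection against a callee's summary."""
```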
B: Let's put it this way: the current trivial inliner does the absolute minimum as far as inlining is concerned. There is no large-scale inliner in OMR; all it has is the trivial inliner, so what you can get from the trivial inliner is quite limited. This would be a significant advance on that. For OpenJ9 the equation is a little bit different, because they have the multi-target inliner that's heavily tuned for their language. Yeah.
B: So if you look at the optimization strategies in OMR at the moment, they have one inlining entry. If you look at the optimization strategies in OpenJ9, there are two versions of inlining (actually three, but we'll leave the third one out): there's one that uses the multi-target inliner, and that's the one that you find in warm, hot, and scorching, and there's one that appears in the cold strategy, I think, in OMR.
D: The list of optimizations that are currently modeled is this list here; these five transformations are what it looks for, and in terms of partial evaluation, that's a subset of all the things you could do, I guess. Is there a plan to make that more exhaustive?
B: Yeah. Well, as I was saying, for the follow-on master's project at the University of Alberta we want to try modeling something much more sophisticated.
B
The
current
hope
is
to
model
escape
analysis
within
the
current
context,
this
kind
of
Java
specific,
but
would
provide
the
model
for
how
to
deal
with
sort
of
heap
based
things,
which
is
basically
the
upward
propagation
and
the
heap
are
the
two
things
that
I
sort
of
view
is
not
being
represented
in
this,
and
the
extension
of
the
work
is
to
figure
out
from
an
academic
perspective
how
to
make
that
work.
A: So if a particular project using OMR wants to... they have their own set of language- or environment-specific optimizations, so can they actually add their own? These work on the generic optimizations, but what is the mechanism by which the language- or environment-specific optimizations can participate in this?
B
Well,
all
you
need
is
an
entry
in
the
method
summary
table
right,
so
it's
just
an
entry
in
the
method
summary
table,
so
there's
just
a
kind
and
a
benefit
number,
and
the
only
thing
that
the
that
the
inlining
algorithm
actually
really
cares
about
is
the
benefit
number,
which
is
what
uses
to
drive
its
choices
and
the
constraints.
So
if
you
have
language,
specific
optimizations
and
a
language
specific,
abstract
interpreter,
that's
looking
for
those
opportunities
right,
you
would
be
able
to
just
inject
the
opportunity
into
the
method
summary
table
and
away.
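In the toy model from earlier, that injection is just appending an entry; the kind is opaque to the core inliner (names hypothetical):

```python
# A language-specific interpreter that spots an opportunity only its
# language understands participates by appending a summary entry; the
# packing algorithm reads only the benefit number and the constraints.
summary.append(OpportunityEntry(
    kind="my-language-devirtualization",   # opaque to the core inliner
    benefit=12,
    bytecode_index=7,
    param_ranges={0: (0, 0)},              # e.g. a known type tag of 0
))
```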
And the frequency inliner that's being used is basically doing that: it runs the abstract interpreter, so that the compile times are equivalent for evaluation, but it then forgets the answer. It does all the math, computes the answer, then says "it was one" and proceeds on that basis.
B: It is. We had various points where there were bugs in the implementation, where the frequencies weren't being calculated correctly, and you ended up in a world of hurt: you know, 50 or 60 percent down in terms of performance. There were implementations of this that were in that neighborhood of awfulness, when there were mistakes in how the math was being done and the answers weren't great.
B: Flow-insensitive, yes, I believe so. And one of the ideas being discussed is that we could, in theory, run the flow-insensitive version to prune off the least interesting parts of the state space, and if we wanted to try to do better on things that we thought were important, we could then redo it flow-sensitively.
You could skip simulating it, or you could ignore the entries from the opt table, depending on how much compile time it takes to model it and how much compile time you have, or whether you just don't want to factor the benefit in because it's not something you're going to run. You're always free to ignore an entry in the summary table, to say "I'm not going to do that" or "I don't believe you."
D: At the other end of the spectrum, the only other thing that I think comes close to this is the register pressure simulator, which runs very, very late. It does try to simulate a very complex process, but, to be honest, in some sense this is harder.
C: Having a facility so that you can verify, when you make a change to the inliner, what the impact of that change has been on the things that we run normally would be nice, even if it's something that a human has to go and review. Something that runs only the benefit analysis could get run as part of a pull request: when you recognize someone's changed the inliner, you could request that that test be run.
B: Oh yeah.
G: To build on Mark's question: one of the other challenges we have, in terms of serviceability of the JVM in this context, is reproducing non-deterministic bugs, and it seems that frequency and the inlining decisions are a very large factor in determining whether you're going to reproduce something or not.
C: It generally goes back to that enabling question, or the labeling point you were making: not just in the sense of what gets presented to the compiler, but in the sense of what the inliner decides to do. Forcing it to do a consistent thing would, again, reduce those conditions quite a bit.
H: Sorry, modeling branch folding is actually quite easy. In Java code, the arguments are placed in the local variable array at the beginning of the method, so we keep track of the variable array and the argument values, and whether the argument values have been overwritten in the variable array. That's just keeping track of the arguments, and that has to be done basically always. What happens, then, when we are abstractly interpreting a branch, or any statement:
well, then we can ask: is the value being tested an argument? If the answer is yes, then we can say: okay, perfect, we know that, depending on the argument, the branch will fold one way or another. And then, from the caller, we can inspect the argument and say: the value of the actual argument we are passing is going to be this one; does it pass the test in the method summary?
B: Branch folding is recognized at the point in the interpreter where you're interpreting the branch: all you're looking at is the arguments of the things being compared, and whether we can express that as a test in terms of the argument. If the answer is yes, we record the possible choices; if not, we move on. All of these optimizations are hyperlocal in that sense; they're very constrained in the space in which they happen, so picking them up is quite simple.
A: I just wanted to get a little more of a concrete sense of that. What I was going to ask was: can you talk a bit more about the costs, how you derive those, and how we differentiate them across different architectures?
B: The cost is Java bytecodes,
B
Is
Java
byte
codes
in
the
minimum
number
of
java
bytecodes?
It's
the
same
currency
that
the
multi-target
in
liner
in
open
j9
uses.
So
we,
its
budget,
is
in
terms
of
nice
design.
Its
notion
of
size
is
based
on
byte
codes
and
we've
set
the
budget
in
terms
of
byte
codes
as
the
number
of
nodes
that
you're
going
to
allow
in
the
method
for
purposes
and
compilation
and.
C: Kind of related to that: it's weighing cost versus benefit, so is it assumed that the benefit is expressed in something comparable to a number of bytecodes? Does it assume the abstract interpreter is translating the benefit into a consistent thing that trades off against the cost? Who decides how much of a bytecode of cost is worth how much benefit; is that there?
B: There is no notion of that, because what you're saying in running the packing algorithm is: you have a budget in terms of bytecodes, and you have a benefit number that is an abstract number.
C: Does the solver actually compare costs and benefits, or does it look at them independently?
B: It looks at them independently. It's trying to get the largest total benefit for the given budget of bytecodes. The really nice property of this is that there isn't a relationship between the benefit and the cost.
B
At
the
moment,
some
of
the
benefit
numbers
are
derived
in
terms
of
number
of
byte
codes
eliminated,
but
that
that's
not
something
that
any
other
part
of
the
algorithm
knows
anything
about
have
to
know
it.
The
goal
is
just
a
benefit
number.
That
number
is
unitless,
it's
just
I
figure
is
better.
Okay,.
B: So, at the moment, if you think a transformation is going to save a large amount of compute time, you should be giving it a bigger benefit. The benefit estimation for branch folding is currently based solely on the number of bytecodes, but that's just what's implemented in that abstract interpreter: you could add more for calls, or more for whatever, say object allocation,
and it would bias the inlining to pick those bodies where you're going to eliminate branches that eliminate object allocations and costs. And while we don't have the data right now, the final part of Erick's thesis is actually doing a statistical analysis to show that the things we've incentivized are being incentivized: when we say we want to do branch folding and we want to fold more branches, that's what the inliner is actually doing.
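In the toy model, that biasing would just be a per-kind weight applied to the raw estimates (illustrative numbers only):

```python
# Hypothetical per-kind weights: eliminating a call or an object
# allocation is deemed worth more than shaving bytecodes off a branch.
KIND_WEIGHT = {
    "branch-folding": 1,
    "call-elimination": 10,
    "allocation-elimination": 25,
}

def weighted_benefit(entry):
    # Scale the raw estimate (currently: bytecodes eliminated) by how
    # much run-time compute the transformation is expected to save.
    return entry.benefit * KIND_WEIGHT.get(entry.kind, 1)
```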
B: All right. So there are two talks' worth of PowerPoint slides, as well as Erick's master's thesis, which is being written, describes all of the algorithm in excruciating detail, and will have all the evaluation; and there are plans for an academic paper. When it comes down to the code that's been chosen to be contributed, I'll defer to Erick on the quality of that; we can set the contribution bar on that as appropriate.
H: So I was thinking about starting as soon as possible, in the sense of just deciding where the structure, the new inlining classes, should sit, and after contributing that, the contribution of the abstract interpreter, as you mentioned. It still needs to be polished in terms of code, but it's just a matter of polishing it a little bit and pushing it, and that can be done. I don't know; maybe we submit a pull request and continually iterate over it? So I think, in terms of...
B: I'm not going to claim that it solves the world's problems. However, my personal opinion is that it is a significant improvement over the current state of affairs, and it has significant scope for being enhanced in ways that are understandable, maintainable, and extensible, unlike the current inliners that are candidates for inclusion in OMR beyond the trivial inliner,
which are limited, and quite expensive because they do tree-gen. That's the reason we didn't do tree generation for this abstract interpreter: that cost. Now, for OMR it may make a lot of sense to build the interpreter over the nodes, because being able to run an interpreter over the nodes, even with the cost of generating them, means that all languages would get one by default; and if you want a better one, then you can create your language-specific one.