Rust Programming Language Rust Verification Workshop 2021, 11 May 2021

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: Leveraging Compiler Intermediate Representation for Multi- and Cross-Language Verification

Description

Zvonimir Rakamaric

A

uh Thanks for the introductions, rajiv, um let me share my screen.

A

A

A

Okay, so uh hi everyone, um it's been a really fun workshop. So far, uh so thanks for organizing, I really enjoyed the talks, uh and I'm glad to be here and thanks for inviting me to give this talk.

A

uh So what they'll talk about in the next half an hour is um our experience, uh we're doing multi and cross language verification using smack, which is our software verifier and I'll in particular, focus on rus?

A

If you have any questions, please either unmute yourself or ask them in the chat one of my students, who is a collaborator on this project. uh Mark baronovsky, um he's here as well and so he'll be keeping an eye on chad as well, um and there were two other students who attribute this to this work: uh jay, garzella and shah, boheh and uh and this project is supported by nsf, amazon and vmware.

A

Okay, so before we get to you know, multi-language verification and and rust, and so on. um I want to give a little bit of background information on our smac verifier.

A

So this is a tool flow of smack, so smack is based around lvm on on the front-end and boogie intermediate verification language on the backend side- and this is the tool flow that we've been using for year, not years now, where we're mainly focusing on c, which is processed by klingon lvm, intel mir, where we use a bunch of kind of analysis, optimizations that are built into lvm, to simplify it and then the core of smack is this lvm to bpl model module that translates lmr into boogie code?

A

I'm not sure how many of you are familiar with boogie. It's a pretty popular intermediate verification language, supported by a number of verification tools and the one that we mainly use is called chorale and coral is basically a bounded model, checking engine with some advanced techniques to make it scale better.

A

But we can use a bunch of others as well.

A

Okay, so um are we so? The main advantage, I would say of the smack verifier, is that the couple's source language details from verifier implementations uh so because we will leverage lumir and boogie intermediate verification language, and that enables, um as we'll see also in this talk fast verifier input, language and translation prototyping.

A

So, for example, adding a new backend verifies trivial. um If you want to write the new verification algorithm and you support bookie, you can trivially add it into the smac tool flow um and also you can experiment with different translations different info languages, modeling and so on.

A

Then again, this really facilitates uh fast, empirical revolution and reducibility.

A

If you develop a verifier for boogie, you plug it into the smack tool flow and you have instant access to maybe 10 000 benchmark or something like that on which you can try. uh Your verifier, compare it with the existing verifiers and so on. So the tool is free and open source. um It has a very permissible license. It is it's easy to use in the industry we host on github.

A

It's not super big. So if you want to hack into smac, it should be too hard to wrap your hand around the code.

A

So we've used smac on fairly large and complex software projects, and we can do that because we can plug it into existing build systems and we worked on that quite a lot.

A

So you know we plotted on something like open, ssh and clearly to be able to do that. You have to understand their build system. um Smek is fairly automatic, scalable, precise and soundy. um We try to kind of list all the assumptions we make in terms of soundness in general. False bugs are uh quite rare, so it's remodeling is really precise um and then whether we miss bugs or not, that depends on the select backhand verifier.

A

Usually we use coral, which is bounded mullet checking, and so the typical reason why we might miss bugs is if we don't roll loops enough and then it generates error trace for debugging, and we have some experimental support for deductive verification. We pre post conditions and it's something we want to work more on.

A

Okay, so that's the overview of smack. Let me just quickly walk you through how this translation works into boogie. um So the first thing we do is we translate to lmir using clang. We don't do anything special there, it's just off the shelf uh clan compiler um and here is a small example c program. This is the yellow mirror code that you get on the right and then the meat of smack is this translation into boogie.

A

um I want to spend a little bit more time here. I'm not going to go into details. um You know original c code is on the right. It goes to lmir and you get the boogie code on on the on the right and, if you've never seen boogie, um you know it's a procedural language um and it's meant for verification, so the type system is kind of similar to what you see in smt solvers.

A

uh So there are no things like pointers and so on. um Usually you model memory using areas, and you can see this area dollar m here- that we use to model memory um and another important thing uh that I want to stress out is that you know the original c code involves malloc in boogie. We translate that to the invocation of this procedure called malloc, uh but we have to provide some kind of model for this. um You know we don't translate the actual allocators into boogie directly, but rather we model memory allocation.

A

To make it suitable for verification- and this will come up again, this issue of modeling- let's say the standard library and so on I'll talk about it more okay, um any questions so far.

B

I was wondering when you do the translation. What optimizations do you use from llvm.

A

That's a great question, honestly, I don't know of the top of my head. I would have to look at our source code. uh We kind of played with this a fair amount of uh we spent a fair amount of time, picking and choosing which ones work well. For us.

A

The the main one we leverage a lot is alias analysis and I'm not going to talk about how we do it in this talk, um but basically we leverage it to split this gigantic memory, map dollar m into many smaller memory maps um and that really helps with uh performance.

A

And then, apart from that, uh you know we do stuff like constant propagation. um What else do we do we do this? uh What is a mem to reg pass right so that we get rid of lots of memory? Accesses uh memory? Access is really kind of slow down things, so you want to optimize them away as much as you can.

A

What else do we do? We do some kind of rewriting where we try to get rid of structure accesses and we try to rewrite them into memory. Accesses.

A

Sometimes we do loop and rolling using llvm just kind of ahead of time for certain kinds of loops, so that sometimes helps um I'm forgetting a lot of them. So I will have to look at the source code, but that's a good question.

B

A

Do you handle this.

B

Go ahead, I think: how do you handle any concerns that so these optimizations themselves might hide undefined behavior, yeah, okay,.

A

C

This is a really.

A

Good question- and it's uh opens up a whole kind of other cancer kind of worms that I wasn't planning to talk about. um So in the presence of undefined behaviors I mean smack kind of depo defines depends on what clang and lvm do, and so they can get mass by optimizations.

A

In fact, they can get massed, even if you don't enable any optimizations, because clang will sometimes generate interesting, lvm ir depending on whether you have undefined behaviors or not. So we don't provide any guarantees that undefined behaviours are not going to get messed. um Typically, they don't, uh but we certainly see kind of corner cases where we just don't see them anymore in smack and you might miss bugs because of that you can also get false bugs because of that, and so on. So.

A

Yeah, but this.

B

Is a great question.

A

B

A

Okay- so I didn't want to talk more about this, but I I have this dream that uh all these kind of frontends, like clang and rossi and so on, will one day have a target called verify or verification uh where the kind of things they do will be verification, targeted and, for example, they're not going to mask undefined behaviors, um and there is a bunch of other things that these front ends kind of. Do that makes verification harder.

A

um So it would be awesome to to have a target where you know they don't do it because it makes verification harder. But that's that's kind of a dream of mine.

A

We'll see if it ever happens. Okay, so let me go ahead um yeah, so we published this work. It was kind of an experience paper, we're trying to add support for verification of more languages to smack.

A

So this is the basic tool flow and we added uh seven kind of additional languages and we played a little bit with verification of these languages and we tried to learn what the challenges are and so on and rust was among these among these languages.

A

So how do we go about doing this? We developed a suit of micro benchmarks, so these were just like tiny programs that tested for various language features and we implemented.

A

You know one such program in every one of these seven languages and then we ran smack of that on all of them and we tried to extend smack kind of with with basic things, to support all these languages and here's the table. That summarizes the results, um and you know xs marked features that smack that was that weren't easy to support and smack in a particular language.

A

And if you look at kind of x's versus checks marks, you can kind of see that you know c c, plus plus rust, fortran d uh work fairly well, almost kind of out of the box. I'll talk more about this, while languages such as objective c, swift and kotlin are much harder to support, and this is typically because of kind of dynamic features in languages and big run times and so on um that are present in objective three swift and kotlin.

A

uh So the the more similar the language is to see the kind of easier it is supported in smack. um So fortran is an example of a language that it worked almost out of the box. We didn't have to change smack at all and we could verify fairly complex programs in fortran without any problems. um Just because it's the way it's compiled into lmir is very similar to c, okay and now I'll focus on what we did with rust and how we went about supporting rust.

A

Okay, so coming back to the this tool flow smack um to support rust, we had to add a couple of different things. So, first of all, um on the front-end side, we had to add something are called ras models, so these are mainly models for standard library functions of rust.

A

We have the same thing for c um and we had to do the same thing for rust as well. uh I'll get the get back to that aspect um again in in the in a couple of slides. I find it to be maybe the most complicated aspect of supporting rust and smack.

A

Then we have some common models, such as, for example, modeling of memory allocation. So these we just reused from what was written for c, so we didn't have to do any work there. So memory allocation in rust. We basically use the same model as as what we have for c and then in this lvm2 bpm model uh module.

A

We had to extend it to support some additional lvmir instructions and features that we did not need for c and I'll talk more about these in the upcoming slides. So you know, smake doesn't support the whole of the whole of lomir. um Lmr is really big, and so we support the subset and the subsidy support was mainly driven by what clang generates to support rust. We had to add a few more things.

A

um So let me show also a simple example: um kind of how the syntax looks like if you want to apply smac on rust, um we've seen similar syntax uh in other tools that were present in this workshop.

A

So you know the main thing that we need is to be able to introduce non-deterministic values um and in the main function. You can see the syntax for that uh mark. With the student of mind um I kind of like the syntax he came up with.

A

So basically we can compile this into an executable as well uh and in that case, instead of non-domestic value. This value five is going to be plugged into this test case, um and then um you can check assertion assertions very similar to the way you would write assertions in russ. So daniel is asking. Why do we need to make a cert instead of just a cert?

A

um It's funny, because we played with a couple of different syntaxes and it's really easy to support, assert as well, but then raz doesn't have assume, and so we had like smack assume and assert um and we were kind of debating what to do to make it consistent. And then we had a smack in front of a cert, um but we can support assert as well easily. uh It was just kind of a syntactic choice.

A

uh It's not a big deal.

A

um And then you know we, if you run smack on this uh x, is going to be unconstrained, normalistic value- and it's going to discharge this assertion- and this is very easy example.

A

Okay, so a little bit of these extensions, we have to do so to support this raj genetic element element mir construct. We mainly had to support, structure operations and check the integer arithmetic, and then we also model some of the rust libraries. That's a major challenge. Talk more about this.

A

What do I mean by structures operations so lvmar supports structures, so a function can return a structure. You can take a structure as input and so on. um The c compiler clank rarely generates coded user structures. uh Rust generates it all the time, um and so we get a much better support for structures in smack. We model them as android's functions.

A

um I'm not going to go into details, how we do that, but that's kind of one extension that was needed and then rust uses this checked, integer, integer, arithmetic, uh all the time um that wasn't too hard to to support. um Typically, we perform operations in double bit width and then we check for what we needed.

A

uh You can turn off overflow checking and then basically, don't pay any performance penalty um so that wasn't too hard. um Most of the extensions are mainly kind of engineering work that we have to think about a little bit and then modelling of us libraries.

A

um So for things like management management and domestic values, we invoke existing smax models that are written in c through foreign function interface, and then we have modules for some popular raw standard libraries such as vector and box classes, but many more needed- and this is kind of a big showstopper- that we've been working on a lot in the past couple of months.

A

So we applied smack on some small, let's scale real world programs uh just to see how far we can push it. uh So we picked this utils library, which is basically implement reimbursation of new core utils, which is well known kind of test bench from the clip project and we verified a couple of simple utilities. There.

A

Great question from ralph: I was actually planning to talk about this more so give me a couple of minutes and I'll get back to your question about whether we can just verify implementation versus modeling of vector.

A

uh So in interest of time. Let me skip these results. uh We have.

D

A

Work a lot on performance and scalability. uh You know this factor utility. It's like a hundred lines of code. If you want to verify kind of a notion of functional correctness, it takes like 15 minutes, which is way too long and they're kind of good reasons. For that that I can talk about more okay, so one advantage of using lmir as opposed to.

A

Something like uh mir is that uh we can easily do cross language verification, so we have examples with that mix, rust, unsafe, rust and c, and we can just handle them using. You know off the shelf smack once we did this edition additions, um and so so, I think, that's that's kind of a nice side effect of using yellow my r. um I have one such example here. You know this fibonacci implementation in c.

A

I won't go into details, but you know we can invoke it from rust. uh We can pass those initial values for it. We have also a simple implementation of fibonacci and rust. We can check that return. The same result- uh and this is all really easy to do with smack you compile all these things. You link them together and it just works.

A

Okay, so some final thoughts, um some trade-offs of I want to kind of discuss a little bit some trade-offs of such a design of smack, um mainly the fact that we go through love ir and not some higher level representation.

A

So some advantages. So we avoid the mess of dealing with target lanes directly, for example, something like rust closures, just work, they're compiled some kind of function, pointers in lmar that we support in smack and you write russ projects with closures and you run smack on it and things just work, um then um adding back and verifies and solvers is easy. I already mentioned this and we have access to all sorts of llvms analysis, optimizations and then.

A

Finally, in the context of ras this cross-language verification where you combined like rust and c and maybe unsafe rise, basically comes for free. It just works again. There are disadvantages as well.

A

um So that's why I said these are trade-offs, so we lose a bunch of source level information in particular in the case of rust, like type information, you know, non-aliasing, information, original structures and so on, and because of that, we definitely before pay performance penalty in in certain cases, and coming back to this question that ralph asked about um why don't we just use the implementation of the vector?

A

um That is, they understand the library um there are again kind of tradeoffs here uh such such implementations can often be really big and really optimized and they're, often not really kind of nice for the purpose of education, and so again sometimes we can use such implementation and things just work, uh but often we see a huge degradation in terms of scalability and performance, and so in that case, uh if an expert user such as mystery mark goes in- and you know, reads the documentation of the library such as vector and implements it in a simpler way that is more suitable for vacation.

A

We see huge improvements in terms of performance, um so it's kind of a trade-off. You know, are you going to spend manual effort and and get uh more performance or you just use all the shampoo meditation?

A

And then you know in terms of vector. You know you have a fine line, five line. We saw cases where we have a five line program that uses a vector and it takes. You know 15 minutes to verify, because this implementation of the vector drags in like lots and lots of additional code that smack has to go through. um So.

C

So it's kind of a.

A

C

Are there like abstraction or summary mechanisms you could use like verify the actual vic once and then like store that and and then only use the specification like you would do when like doing this in a program like logic, for example,.

A

Yeah, that's a great point. Yes, so so we could do that um and that's something that we would like to do. More of um people even publish these papers. You know automatically generating specifications, summaries and stuff like that. So that's something we could try to explore as well, but so yeah you could write a simple summary. You could check whether the implementation matches the summary and then you could just use this summary in smac.

A

So that's something you could do as well.

A

um Okay, so mark is taking care of a bunch of questions in chat, so I'm not going to jump there. Let me talk, uh maybe a minute about ongoing work. There is lots of it and we are really interested in collaborating with people here. um I think they're kind of interesting projects that we could kind of come together on, um so we want to modern more of the standard libraries.

A

um This problem kind of starts comes over and over again comes up over and over again, as we are moving to more and more kind of target applications uh we're working on integrating smart smack with russ verification tools. um This project was presented here as well. uh This should be really straightforward, because we already support all the features that are needed, uh currently we're kind of focused on verification of unsafe rust.

A

um I think this is when, where smack can excel, we'll start with checking memory safety, but we're interested in other properties as well.

A

We're looking at the checking concurrent trust as well uh smeg does support verification of current programs in some of the backends. So it shouldn't be hard too hard to add that um we're kind of at the point where you're looking for good target applications.

A

uh We are collaborating with my colleague, anthon bertstrep, from uc irvine on some rust os verification, but we are always kind of keeping our eyes open for for good kind of medium-sized benchmark, let's say uh to drive the work and then um there is this benchmark suite that we have been working on um it's open source. It's on github.

A

uh We really want to expand it more, we'll kind of look at all of your projects, all of the github repos and we'll try to get your regressions and benchmarks from there and integrate it into our ras benchmark suite.

A

I had some thoughts that maybe at some point we should try to organize the competition uh as part of sv comp.

A

So I know dirk bear well and he will run the competition for you as long as you provide him him with benchmarks, and so uh the organization we could get almost for free um and I have kind of love hate relationships with these competitions, but uh they can be kind of useful to drive the area forward and so I'll create. Maybe a topic on the chat about this.

A

If other people are interested, maybe we can do something about it. Okay, I'll stop! Here uh I think I have maybe another minute or two for questions um any other.

D

Questions uh yeah. We have a couple of minutes. um Ralph asks. um Can you say something more about how precisely you model llvm semantics.

A

uh Yeah, I see the question about pointers. um I can open the link later and we can kind of discuss it.

A

I mean I think in okay, so we have knobs in smack that you can kind of turn in terms of how precisely we model things and how sound we are. um So in terms of modeling of memory and pointers, you can crank the knob.

A

So I'm not sure exactly what you mean, but, for example, we have an option where we mod the pointers at kind of bite level accesses at byte level, and so, if you turn something you know into narrative bytes and then you read the second byte you'll get the right value out of it and stuff like that. uh What.

C

I mean is that pointers aren't just integers right. Pointers have extra prominent information, which is required to precisely model things like the details of get element, pointer, inbounds and and horrible details like that.

A

That's the kind of.

C

Stuff, I'm talking about.

A

So I mean get element pointer. It just turns into point arithmetic.

C

Yeah, but it has roots right, like you can't, uh if, if you leave the bounds of the um in particular, get element pointed without inbounds uh can is allowed you're allowed to create pointers that point outside the bounds of the allocation. But if you dereference those points, it's still ub because it still is like attached. It remembers the original object the pointer comes from, and you can't use it to access other pointers, but yeah.

A

C

The top the like summary here is pointer provenance, basically, is what we call this.

A

So so we we do some of that uh I mean it's not so we have an option to do like memory, safety checking and then in that case we track a bunch of additional information about objects and pointers, and so on that allows us to check things like that. I'm sure that we are not modeling the semantics completely and totally precisely, but we do a little bit of that and I'll take a look at the link you sent and then maybe I can comment more.

C

A

D

Okay, well, thanks for the talk uh xonomir.