From YouTube: Securing Critical Projects WG (November 19, 2020)
B: Sweet, okay. So we have a couple of presentations today, and the first one is from Josh, who's from the ISRG. I can't remember the exact relationship between Let's Encrypt and the ISRG, but okay, cool: he's going to be talking to us today about some of the work he's doing. Then we also have another guest, Jordan, who's going to go over some of the work he's been doing on a tool that looks for malicious packages on PyPI, so it should be a good hour. So Josh, do you want to take it away? I can stop sharing and you can take over the screen.
C: All right, hi everyone. I'm Josh from the Internet Security Research Group. We are probably best known for running Let's Encrypt, but we do some other things as well, and we have a memory safety initiative that we have started on over the past year. It's not a full-blown project for us yet, but we are getting there and learning a lot, so I'm glad to get a chance to speak with you about it today. Let's go to the next slide. I think a lot of people on this call are probably going to be very familiar with the problem.
C: Memory safety is a big problem for internet software infrastructure. When I say that a lack of memory safety is a problem, I'm talking about C and C++ code, which is the cause of new vulnerabilities every day. It is sort of a plague on the safety of the internet. So let's go to the next slide.
C: And when I talk about this internet software infrastructure being unsafe, we're talking about it from top to bottom. Web browsers are at the very top of the stack; that's how a lot of people experience the internet, and a browser is millions and millions of lines of C and C++ code. We know that code is not safe. We know we cannot make it safe. The vulnerabilities are non-stop, and you go down from there until you get all the way down to the kernel.
C: Google is saying about 90% for Android vulnerabilities, and in a general look at zero-days found being exploited in the wild, about 80% were memory safety vulnerabilities.
C: And while I think we all know about these things, I'm not sure that we quite grok yet how big the consequences of our choice of programming languages are. Let's go to the next slide. We know how to solve this. It's not an unsolved problem in the sense that we don't know what to do.
C: It is a lot of work. When I talked to people about this, starting a couple of years ago, a really common reaction was: this sounds like a lot of work, that seems almost unreasonable, we're not going to go rewrite all the code that the world depends on.
C
You
know,
c
and
c,
plus
plus,
are
here
to
stay
they're
not
going
anywhere.
I
don't
think
that's
true
at
all
yeah.
It
is
a
lot
of
work,
but
we
have
a
lot
of
smart
people
in
this
world
and
a
lot
of
resources
and
if
we
decide
we
want
to
solve
this
problem,
we
can
solve
it.
So
it
is
a
lot
of
work,
but
it's
not
that
much
work.
It's
doable
and
I
think,
with
the
right
partners.
C: So we have this memory safety initiative at ISRG. We have two goals: move the internet software infrastructure to memory safe code, and, second, change the way that people think about memory safety. I'm going to go through what we plan to do about both of these goals. Let's go to the next slide. We're a small organization, about 16 full-time people today. We are not going to go out and rewrite all the C and C++ code that is critical to the infrastructure ourselves; that's not going to happen. So we view our role as coming up with a strategy, facilitating and coordinating the work that needs to be done, raising the money that we need to raise to do this work, and then communicating with the public about what we're doing. Our engineers might do a little bit of work here and there, but for the most part we'll be enabling community and maintainers to do the work. Next slide.
C: Another one is: can we do this in a modular way? Can we replace particular components of a piece of software instead of doing a wholesale rewrite? Because wholesale rewrites are even more difficult to pull off. And the last one is maintainer and project cooperation: the more a maintainer is willing to participate in this effort and cooperate, the easier it is to do, and we'll get into that a little more. Let's go to the next slide.
C: If you can get the maintainer on board and fix the actual project, users can be protected through just a normal software update, which everyone should be running. So it's very important to have maintainers on board whenever you can. If you can fund those maintainers, it helps you to create buy-in and alleviates resource concerns.
C: So we really try to talk to maintainers and see if we can get them to participate in this. It's not going to work for every project; some maintainers just aren't going to do it, for whatever reason, and then we might have to come up with another strategy. But whenever possible you want to work with them.
C: So it's hard to walk into a project and say: we want you to ultimately move this project away from C and, let's say for example, towards Rust. Maintainers may not know Rust. They don't know if Rust is the right choice; it could be some other language, I'm just using Rust as an example here. But they may not...
C: For example: curl maybe had the same set of concerns as you, and here's how we dealt with those concerns, and here's how everything played out and how we were able to deliver a safer product. So we're starting with some projects where we have maintainer buy-in and a good path to success, and we're going to use those stories to help other maintainers make the decision to go down that road. We're going to build up a corpus of success stories, and hopefully that will convince more maintainers to help out. Next slide.
C: So we're focusing on three things at the moment. We may do some more things shortly, but the first one is curl. We've publicly announced the project to make curl's HTTP and TLS code memory safe. This was our first project because it really embodies all the things that we've talked about here in our approach.
C: We've got a modular approach going on here: we're not rewriting curl from scratch, but we're replacing OpenSSL with Rustls, and we are replacing the networking libraries with a safe networking library called Hyper. The work is actually getting pretty close to done now, so in a couple more months we'll have something ready to go, and that can ship in a relatively short amount of time. You don't have to spend years rewriting curl.
C: So this is a perfect example of what I'd like to accomplish and the kinds of projects we want to work on first, and once we deliver these success stories we can move on to projects that are a little more difficult.
C: I'm not quite ready to talk about the details of that yet, but we'll be making some announcements soon. The last example is the Linux kernel, and honestly, when I started working on this, we made a list of all the projects that might be something we could help with, and the kernel was at the bottom of my initial list.
C: Those things need to be built. They're important because we're not going to rewrite all this code, certainly not in a short amount of time, and these things really do help. They're just not what we are working on. So that's just not this project; we're glad that those things are happening, but it's not us.
C
They
do
introduce
a
bunch
of
overhead,
so
projects
have
to
run
them,
make
sure
they're
running
you
got
to
keep
running
them
and
they
don't
ultimately
solve
the
problem
programs
projects
that
you
fuzz
and
apply
static
analysis.
They
still
have
more
safety
vulnerabilities,
so
they're
important
mitigations,
but
they
do
not
solve
the
problem
and
they
introduce
some
overhead.
C: We've learned this lesson a million times, but it's still pretty standard practice to stick millions of lines of C and C++ on the edge of your network. That has to change. We can't continue doing this; we're just going to get beaten over and over and over again, and real people pay the price.
C: Think back to that slide: it's financial losses, it's hospitals getting shut down, it's massive privacy breaches. We can't keep doing this. We need to get to the point where sticking Apache or nginx, or something else that's written in C, on the edge of your network is seen as irresponsible.
C
That
is
the
end.
I
believe
I've
come
in
just
under
the
20
minutes,
so
I
want
to
thank
paul
care
and
alex
gainer
for
the
work
that
they
have
done
in
helping
us
with
this
problem
and
also
daniel
sundberg
who's,
the
maintainer
of
curl,
who
has
been
a
great
initial
partner
in
this
project,.
F: Yeah, if I can jump in, David Wheeler here. I agree with you that the modular approach in general is better. One challenge with the modular approach is that now you have the problem of conversions: dealing with the differences in data structures and approaches between different languages.
C: It really depends on your language choice. I don't think that the answer to every memory safety problem is Rust, but Rust has really made it possible to solve this problem; I don't think this project would have made sense prior to having a mature Rust. And one of the most important things about Rust is that it works really well with C and C++ code: the foreign function interface is fantastic.
C: So in the modular approach, one of the nice things is that a lot of projects can benefit from the same module. For example, we're investing pretty heavily in the Rustls TLS library to replace OpenSSL, so we're building a C API, which is going to live in the Rustls repository, that any C program can use.
B: So I'm curious. I know it's a modular approach, and it's something we've talked about in this working group before too: every project is different, everything needs a different thing. Do you have ideas for how you might scale your work? Is it funding? Is it people doing these negotiations with maintainers? Have you thought about that?
C: Yeah, I think we need to find ways to focus on the most important things. The most important things for us right now are to build up some success stories so that we can refute the concerns that people have about this stuff. So, for example, we need to get curl done, and when curl is done and it's working well...
C
You
know
that
gets
us
the
ability
to
talk
to
more
people.
We
need
to
focus
on
high
value
projects.
So
if
you
want
to
reach
a
lot
of
people
and
scale,
I
mean
depends
on
what
you
mean
by
scale.
But
if
the
goal
here
is
to
deliver
memory,
safe
software
updates
to
a
lot
of
people,
then
you
just
got
to
pick
the
right
project.
So
you
know
focusing
on
something
like
russell's
and
saying
you
know:
russell's
is
our
choice
for
a
memory
safe,
tls
library.
C: We can do all the work there, and we don't have to convert eight different OpenSSL-style libraries to memory-safe code; that's not going to happen. Beyond that, if we're talking about scale in terms of how many people we can get working on this in parallel, part of the answer is just having the funding to do the projects that we need, but I think we also just need to be a little patient.
C
If
you
rush
out
there
and
try
to
you,
know,
fund
everything
and
get
a
thousand
engineers
working
on
this
tomorrow,
it's
going
to
be
a
bit
of
a
mess
right.
We
need
to
be
careful
and
build
up
our
case,
so
we
want
to
get
the
funding
that
we
want,
for
you
know
the
next
set
of
projects
that
make
sense.
We
want
to
have
that
available,
but
I
don't
think
we
want
to
scale
too
fast.
B: That makes sense. I'd be curious how you're defining which projects are important; I know you said based on usage. If we could collaborate on some of that together, what you're using for heuristics to come up with your list could be interesting.
C: So, let's say you've got several billion instances of the Linux kernel; that's its weight. Then there's a certain amount of weight to nginx, based on how heavily that's used: how many instances of nginx are out there in the world. So you want to look at how common something is, and obviously we're only looking at things that are written in C and C++.
C: I know that in a lot of the vulnerability analysis that your working groups do, you're talking about a lot of JavaScript projects and dependencies and things like that, and certainly there are security concerns there. But luckily memory safety is not one of them, so we're really only looking at commonly used C and C++ applications.
G: Okay. I was trying to look it up and get my question answered, and I'm failing so far. So anyway, yeah.
C: We haven't actually published it yet; this is pretty recent. But Google is funding the work with curl and some of the work on Apache.
C: Right, we're a non-profit. We don't have the funds to fund all the work that we would like to do ourselves. So our role here is to figure out the projects and figure out the best return on investment, where the best place to put in resources is, and then we'll go raise that money and make sure that it gets to the right people to get the work done.
G: Yeah, no, I love it; come talk to us. This is sort of a continually depressing topic, because we just launched some fuzzing efforts and found some code where it's like: yeah, memory safety. I was looking over the shoulder of the researcher, staring at the code, and I'm going: wait, they're not checking any of the input parameters here, and they're using them to index into arrays. And the researcher said: yeah, this is crap code. So it's like, awesome. And it's code that was written by an Intel person, and that's maintained by an Intel person today, as part of their day job, right? And that's what's distressing.
G
If
we
hadn't
launched
this
fuzzer
effort
for
a
separate,
you
know,
project
they'd
be
like
well,
we
would
never
have
found
this
or-
and
I
think
these
bugs
have
been
around
at
least
until
since
2016
and
maybe
since
2011,
so
that
doesn't
make
me
that
makes
me
incredibly
depressed,
because
if
we
hadn't
done
this
fuzzing,
we
would
never
have
found
it
until
well.
Maybe
it's
being
exploited
every
day
who
knows
by.
G: ...criminals or something like that. So yeah, and it started getting us really interested in: gee, I wonder what other parts of the code, not necessarily stuff that we've done, but other parts of the code, might be similarly crappy.
G: And nobody would have, in their day job, said: oh, Intel, you should rewrite this in Rust, right? It's like, it's good.
C: Well, it's an important point about fuzzers, which is that part of their value is in helping you make C and C++ code a little safer, but part of their value is in pointing out to you how bad this code actually is. And it's not about Intel engineers or Google engineers: all human engineers writing C and C++ code are not good enough at it. It's not a particular engineer; humans do not write good enough code to keep your memory safe. The answer here is to have the compiler do that work.
G: ...about this, but I probably better not, you know. Okay, I have a lot of intellectual curiosity relative to this, and the Rust conversation is not a new one in my mind. But anyway, that's great, thanks. Thanks for your presentation.
C: This is a place where the compiler can do the work without you adding a step, and we don't have to add this layer of fuzzing and static analysis for memory safety on top of it. So I think another benefit of all this is removing complexity from the general best practices for software development.
F: Yeah, Josh, if I can push back a little bit: you can have memory safety and you still need fuzzers, and you still need static analyzers. You just don't need them to detect memory safety properties; there are other properties.
F: You want memory safety, but that statement I just said doesn't mean that switching to memory safety is a bad idea.
F: It depends on what it's doing, but it's still better. How's this: my point still stands that you still need fuzzers and you still need static analysis, but you're also right that those two do not guarantee the things that using a memory safe language does.
B: Yeah, and the link to those slides is in the meeting notes, if anyone wants to go back and reference them. Jordan, I don't know if you have slides or you just want to talk; either is totally fine. All right, do you want me to share that?
E: No, that's just a link to the blog post. What I was hoping to do is briefly give an overview of the research that I did, and then talk about why I think the OpenSSF is an awesome organization to take it to the next step.
E: Talk about what that looks like, what that means, and then try to figure out what those next steps look like, and we can go from there. So, hi everybody. My name is Jordan. I am here just representing myself today, because I recently moved from working at Duo Security, part of Cisco, and have joined Stripe as a security engineer on their team. I did this research in the middle of that week-long gap.
E: So I still haven't figured out the process of coming on board under Stripe's umbrella, but I'm working on that and already talking to people about this effort, so more to come there. Today I'm just here as Jordan. So, the research that I did: I wanted to see if I could use dynamic analysis to try and find potentially malicious packages on PyPI.
E: We've all heard about the incidents in the past, not just on PyPI but on any package manager: there may be a typosquatting package, or reused credentials give an actor access to a popular package, they try to put malicious content in it, and that could have pretty significant effects. I always try to steer away from spreading FUD and hyping things up, but it could be significant.
E: If I ever feel calm, I'll just go to libraries.io/experiments and look at some of the metrics they post about just how often packages with, like, one maintainer, that haven't been touched in years, get downloaded, and my anxiety goes right back up. My hope was to figure out: can we use dynamic analysis to find malicious activity with a high signal-to-noise ratio?
E: Now, what does that look like? Because some package managers, PyPI included, already have some malware checking capabilities. For example, there was just some effort on PyPI to do more of the static analysis work, which is great; that takes a huge chunk out of the equation. The goal here is just a raising-the-bar exercise, to make it harder for attackers to put malicious content on package managers.
E: Actually installing the package removes any kind of potential obfuscation that came with the code, which is one of the things that makes static analysis a little bit harder, and that helps offset the more expensive nature of dynamic analysis. And it was pretty successful.
E: I show a few case studies in my blog post that show just how powerful this is: being able to take an obfuscated block of code and show exactly what commands were executed and exactly what network connections were established. I think this is really promising for a few reasons. The first is that it gives that high signal. Yes, there are a lot of syscalls that happen whenever you install a package.
E: That's expected, right? But there are some things that would at least make us raise our eyebrows, so to speak, that would warrant further investigation. Network connections: in general, I think it's worth looking into why a package is reaching out somewhere...
E: ...just whenever I'm installing it. Most likely it's benign; it may just be installing some helper components, maybe running some other setup scripts, but that's always something to look at with suspicion. Or if it's accessing different places on the file system, that would make me concerned: for example, if something is trying to read SSH keys, or trying to access secrets on disk in common places, those are suspicious.
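The heuristics described here (network connections, shell invocations, reads of SSH keys or secrets) amount to a small classifier over captured syscall events. The sketch below, with an invented event shape and rule set, shows the idea; a real pipeline would parse Sysdig or eBPF output rather than hand-built structs.

```rust
// Illustrative filter over syscall events captured during a package
// install, in the spirit of the heuristics described above. The event
// format and rules are hypothetical, not Jordan's actual tooling.

#[derive(Debug, PartialEq)]
enum Verdict {
    NetworkConnection, // package reached out somewhere during install
    ShellInvocation,   // package spawned a shell
    SensitiveFileRead, // e.g. SSH keys or secrets on disk
    Unremarkable,
}

struct SyscallEvent<'a> {
    syscall: &'a str, // e.g. "connect", "execve", "openat"
    arg: &'a str,     // primary argument: path, program, or address
}

fn classify(ev: &SyscallEvent) -> Verdict {
    const SENSITIVE_PREFIXES: [&str; 3] = ["/root/.ssh", "/home/", "/etc/shadow"];
    match ev.syscall {
        // Any outbound connection during install is worth a look.
        "connect" => Verdict::NetworkConnection,
        // A setup script execing a shell is a classic red flag.
        "execve" if ev.arg.ends_with("/sh") || ev.arg.ends_with("/bash") => {
            Verdict::ShellInvocation
        }
        // Reads of key material or credential files.
        "open" | "openat"
            if SENSITIVE_PREFIXES.iter().any(|p| ev.arg.starts_with(p))
                && (ev.arg.contains(".ssh") || ev.arg.contains("shadow")) =>
        {
            Verdict::SensitiveFileRead
        }
        _ => Verdict::Unremarkable,
    }
}
```

Most events during a normal install fall through to `Unremarkable`, which is what keeps the signal-to-noise ratio high.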
E: ...many times. And so the research went well; I think it looks really promising. What I'm excited about with the OpenSSF is the next steps: that was a one-time study, and I talked in the blog post about how the benefit is doing this continuously, watching new packages as they update and doing this same type of analysis, so we can catch things as early as possible.
E: The benefit of doing this with the OpenSSF is that it's a central organization that brings everybody together in a room, and we can make this one capability accessible to any number of upstream package managers. Compare this with the other options. One option is that we ask each package manager to run this infrastructure themselves. That's difficult, both because we're repeating work and because, well, I have nothing but empathy and respect for all the people maintaining our package managers.
E: Many of them are volunteers on limited funding, limited budget, limited people, and I would rather this be done in a place where it's a centralized capability that everyone gets the benefits from. The other alternative is that I've had companies reach out to me saying: hey, I'd like to use your thing just at my company as part of my CI pipeline. I said: I don't think you should.
E: I see this as something that we could solve centrally for the ecosystem, not something that every company should be doing, because it's not really a special thing; we're all accessing the same open source packages. We might as well solve this, and solve it once. That's why the OpenSSF makes perfect sense: we're getting everybody together, we can have really close ties with maintainers of upstream package managers, and we can set up some kind of communication.
E: Honestly, where we figure out: what do we do whenever we see something suspicious? How do we triage it until we have a confidence level to bring it to the maintainers upstream? The goal being that we're not overwhelming people with false positives, and that we're not asking them to just investigate all of these themselves with the already limited time that they have. So where we are now is that the research is done.
E: I am starting to put together a roadmap of what things need to happen before this is considered robust. For example, I just started with PyPI; npm, RubyGems, you name it, are all great candidates. And I'm not the only one who's done dynamic analysis in the past; I think there are even multiple people on the call who have done work in this space really similar to what I've done.
E: So I'm excited that we can all put our heads together and figure out what this looks like. So yeah: making this continuous scanning for multiple package managers, and trying to increase the signal-to-noise ratio. Right now my process was very scientific: I got a whole bunch of data and then I just grepped a lot. That was pretty much it, because I didn't even know what it was that I was looking for.
E: I was like: let me think about what would be suspicious and see what I can find. It was very... I mean, there's a whole science-oriented thing here. We would prefer for people to not have to do that, and instead have it done for us in an automated fashion, and then use the expensive people-cycles where they can be used most and be the most beneficial.
E: So, all this together, that's why I'm here: just to chat with y'all and figure out whether this is even the right subgroup to be in, because there are multiple subgroups under the OpenSSF; to hear from y'all about your perspectives, what you think can go well and what you are worried about; and then just try to figure out what the next steps look like.
B: Oh, there he goes. Yeah, thank you, Jordan. This is really interesting. I love the idea of being able to take this and scale it out to other package managers and everything. I'm not sure where we go directly from here. I know Dan and I, on the Google side, are looking at some ways that we might be able to support this work. So that's where we are, yeah.
G: Sorry, I had a family member mention something to me; I apologize, this is living at work now. What I was going to say was: yeah, I love your thought process here as well. One of the things we stood up a few years ago was a capability which, at least on the kernel side, would try and build the latest tip and try and boot it on a huge number of machines, right? This was like the boot problem.
G: We call it our zero-day lab, and so we did this every night, and if we saw boot failures we'd send mail back to the kernel mailing list and say: hey, we bisected, and this is the patch that was causing problems. So I think, even without the research, what you've basically said is: this is not terribly scientific, but it's something that at least gives people an indicator of where to look and be suspicious, right?
G
If
somebody
wants
to
really
you
know
nail
into
this,
but
even
more
so
if
something,
if
a
new
package
or
a
revised
package
that
goes
into
you,
know,
pie,
pie,
you'd
want
to
say:
oh
look,
maybe
a
piece
of
email
that
would
poke
up
a
a
diff
right.
You
know-
and
that's
that's
you
know,
that's
the
sort
of
thing
that
an
organization
like
you
know,
google,
intel,
microsoft,
somebody
you
know
or
even
like
the
psf.
G: That's another interesting possibility, because the PSF, in theory, their goal is to try and improve overall Python capabilities like this, right? So collaborating with those guys as well, at least on the Python side. Then there's Node, PHP, Ruby, etc.; as you said, there are a lot of other package managers. But this shows real promise too, yeah.
E: There's another thing to mention, which is the question of: whenever we see weird things happen, what do we do? There's a couple of answers. One is that if there's something that's weird in the sense of actively harmful, there's an opportunity to let the package manager maintainers know so that the package can be removed, right? But there's another option too. I know that recently, with all of y'all doing all the work, the OpenSSF has released the Scorecard capability, and the goal there is to try to give people answers to the question: what's the risk of bringing this package into my organization? What are some heuristics that I can use to get a gut check on...
E: ...whether I need to care about it. This kind of information might be useful: hey, we're letting you know it runs these commands and makes these network calls anytime it's installed. It doesn't mean that it's bad; it just means that you may want to think about it, and I think that's useful. So the output doesn't have to go only to package manager maintainers.
E: It could also be giving people downstream more information about the things that they're installing and the things that they're using. And one super quick note: you mentioned the PSF. I did want to give a shout out, because we have Dustin on the call, and Dustin reached out shortly after the research was published. I don't want to speak for you, Dustin, but I think there's interest in just kind of what this might look like, right?
I: Absolutely. The PSF has a ton of interest; PyPI has a ton of interest. This has been a sort of ongoing issue with, like you said, this package manager and every other one as well. And I wanted to chime in: not only is having some form of analysis of every project on PyPI interesting, but also, if we had a way to sort of classify the behaviors at install time of a project, like "this package makes network requests", then users could potentially, in the future, have the ability to say: just don't install anything that makes network requests.
J: As you guys scale this out: we just did a syscall monitoring project for one of our commercial products at Smallstep that plugs in directly to eBPF and then has a collection stack that dumps the events into an Elastic cluster.
J: Like I said, it's part of our commercial product, but if we could help you guys scale this out in any way, I mean, minimally, we'd be happy to share experience, if not code.
E: That'd be great, yeah. I'll say that the hard part about this is the piece in the pipeline that installs the package and watches calls. Right now it's just an EC2 instance: I throw Sysdig on it, and then I spin up a couple of containers, one to watch traffic and one to actually install the package, and Sysdig is running on the host with some filters set up to only catch those syscalls, right?
E: ...to share what I know. So, great, that sounds perfect. Oh yeah, one thing I wanted to mention: another reason I'm excited to be here at the OpenSSF is that there was a question earlier, I think it was to Josh, about where funding comes from. That's hugely exciting to me, because right now my funding is my wallet, and it would be great to not have that be the case, because I think about that on an ongoing basis.
F: So let me jump in real quick. I can't speak for this entire working group, of course, but in my personal head at least, I think that the major repositories, PyPI, Node.js, are critical projects in their own right. I mean, if I wrote down a definition of what's critical and PyPI didn't show up there, or didn't at least meet that criteria, that would be kind of surprising to me.
F: So you could certainly feel free to talk to other OpenSSF working groups, but certainly I think this working group would be a perfectly reasonable place to discuss this. I guess there are several issues as far as your implementation goes, the whole analysis.
F: The one thing that concerns me is that there are several different kinds of malicious actors involved here. Some are just going to try to slip in, and they're attacking... you know, the person who wrote the code is actually getting attacked themselves.
F: Their passwords got stolen, or whatever. But then you have the ones who are intentionally and diligently inserting malicious code, who are really willing to put in effort, and so they may try to evade any detection mechanism you put in. So it seems to me that you want the general mechanism of gathering the data to be open source, and the tools to implement rules to be open source, and maybe the specific rules to be the hidden secret sauce that you hide a little bit somehow, so that you don't reveal what you're looking for; because if you reveal what you're looking for, some adversaries will try to get right around it. The disadvantage, of course...
F
of course, is that then you have a lot less cooperative help developing those rules. So maybe you can make some rules fully released and some not so much. I'm just kind of thinking about the pros and the cons, but I do very much like the idea of analyzing stuff in a common way, so that a malicious actor can't just upload random garbage that's dangerous.
E
Yeah, so I agree with you, and there are a couple of things I lean on whenever I've thought about evasion in the past. The first is that my goal is not necessarily to eliminate the problem entirely. It's very much just to raise the bar, to make it more costly and more difficult to introduce malicious code into any of these package managers, and at a low cost to us.
E
You know, if I'm spending a dollar and they now have to spend ten or twenty dollars, that's a pretty good trade-off. The other thing is, if we think about the level that we're monitoring at: we talk about the rules, but to me the rules are really just what we filter down to, to look at manually. Because if we look at the syscall level, as soon as you make a network connection I'm going to know, or as soon as you invoke a shell.
E
You know, an exec call, we're going to know, right? And even if it's probing the system to look for whether certain files exist, if it's kind of profiling the system, we have the capability to know that too, and that's a benefit. But there is the question of what we elevate for people to look at.
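A minimal sketch of the kind of filtering just described: given syscall events captured while a package installs, surface the behaviors worth a manual look. The event shape, the rule list, and the sensitive paths are all assumptions for illustration.

```python
# Map of event types that are suspicious during install, with the reason
# we would surface them for review (assumed rule set).
SUSPICIOUS_TYPES = {
    "connect": "made a network connection during install",
    "execve": "spawned a shell or subprocess during install",
}
# Paths whose mere probing during install is a red flag (assumed list).
SENSITIVE_PATHS = ("/.ssh/", "/.aws/", "/etc/passwd")


def flag_events(events):
    """Return human-readable findings for events matching the rules."""
    findings = []
    for ev in events:
        if ev["type"] in SUSPICIOUS_TYPES:
            findings.append(SUSPICIOUS_TYPES[ev["type"]])
        elif ev["type"] in ("open", "openat") and any(
            p in ev.get("path", "") for p in SENSITIVE_PATHS
        ):
            findings.append(f"probed sensitive path {ev['path']}")
    return findings


sample = [
    {"type": "openat", "path": "/home/user/.ssh/id_rsa"},
    {"type": "connect", "addr": "203.0.113.5:443"},
    {"type": "openat", "path": "/tmp/build/setup.py"},  # benign build file
]
print(flag_events(sample))
```

Note the rules here only decide what gets elevated for a human to look at, matching the point above; they are not an automatic block.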
F
And maybe one option here is to kind of split this up. I mean, if you do an analysis, here's what we see: you could download this 50-gigabyte file of every syscall, in JSON form and so on, and then stick a whole bunch of different tools on it to look at it.
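If such a dump were published, it would most likely need to be streamed rather than loaded whole. A sketch of consuming a JSON-lines dump one record at a time; the field names (`package`, `type`) are assumptions about what a published dump might contain.

```python
import json
from collections import Counter
from io import StringIO


def syscall_histogram(lines):
    """Count (package, event type) pairs from a JSON-lines stream,
    without ever holding the whole file in memory."""
    counts = Counter()
    for line in lines:
        ev = json.loads(line)
        counts[(ev["package"], ev["type"])] += 1
    return counts


# A tiny in-memory stand-in for the multi-gigabyte dump.
dump = StringIO(
    '{"package": "somepkg", "type": "openat"}\n'
    '{"package": "suspicious-pkg", "type": "connect"}\n'
    '{"package": "suspicious-pkg", "type": "connect"}\n'
)
print(syscall_histogram(dump)[("suspicious-pkg", "connect")])
```

Because the function accepts any iterable of lines, the same code works on an open file handle over the real dump.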
E
You know what I mean, that's one of the things that I considered doing, until I opened up my wallet and was like, there's not much here, so I can't give everyone access to like a terabyte of data on S3 and get away with it. But there are tons of opportunities there to team up with folks who want to go digging for stuff that we've missed.
A
So that kind of ties into the next step, and you mentioned this a little bit before: once you've found a malicious package, or a package we think is malicious, what do you do with that information? I know this kind of fits into the category, I guess, of typosquatting packages in my head, and I know from a presentation in one of the other working groups that the RubyGems people have actually had a whole bunch of trouble getting typosquatted packages filed as CVEs.
A
It's doing what it's meant to do: it's just meant to be malicious. So that itself is not technically a CVE, because it's not a vulnerability in that package; it's by design. And so it almost points to a need for a separate place to put these flagged things that are malicious by design.
I
I'll chime in on the long tail of all the projects on these package indices. Basically, anything we can identify as potentially malicious is going to be a valid use case for some project. So I think anything that blocks them up front, prevents them from being installed, anything like that, is just probably not going to work. But raising awareness to end users, so that they can sort of see the behaviors of a package that they're considering or evaluating.
I
A
And there are probably some that are clearly malicious, though, that you might consider taking down on the PyPI side, right? Like some of the typosquatting ones, I think, have been taken down in the past. Like if a package just uploads all the keys in your home directory or something like that when it's installed, and it's clearly camouflaging as something else.
F
And then at least there's a protection for Joe Average, or Joanne Average, who's just expecting something simple, and they type a dash instead of an underscore and didn't get what they were expecting.
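A simple sketch of a typosquat check along the lines of the dash/underscore confusion above: flag a newly uploaded name that is nearly identical to a popular package without being that package. The popularity list and the 0.85 similarity threshold are illustrative assumptions.

```python
from difflib import SequenceMatcher

# A stand-in for a real "most downloaded packages" list (assumed).
POPULAR = {"requests", "urllib3", "python-dateutil", "numpy"}


def typosquat_suspects(candidate, threshold=0.85):
    """Return popular package names that the candidate closely imitates."""
    c = candidate.lower()
    return [
        p for p in sorted(POPULAR)
        if p != c and SequenceMatcher(None, c, p).ratio() >= threshold
    ]


print(typosquat_suspects("reqeusts"))         # transposed letters
print(typosquat_suspects("python_dateutil"))  # underscore for dash
```

A real check would also normalize names the way the index does before comparing, so that equivalent spellings of the same project are not flagged against themselves.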
E
Yeah, there's so much room for us figuring out how opinionated we on the OpenSSF even want to be, in terms of what we want to give upstream. Do we just want to try to give high signal upstream and then let package manager maintainers do what they feel is best for their ecosystem? There's a lot of room in the middle, but I do want to do a time check.
E
I know that we're at time. I apologize for taking over 20 minutes. I'm happy to stop the discussion there; we can just work out next steps together offline.
B
F
You know, trying to do these sorts of things, because I think that PyPI would be more confident about, say, removing a package or not allowing it to install by default if they were very, very confident in the signal. And if they're not very confident in the signal, then I think it's quite appropriate to say, well, maybe I'll just let people find that information. But I don't want to hold folks back.
F
H
Proactive. I have a question: did you notice that malicious behavior only during install time, or did you try running some things too, like importing the package or calling its API?
E
Or something. Great question. So in my research I strictly installed the package; that to me was kind of the trust boundary. You could make an argument, but the assumption was that if I'm able to download it, then I can at least see what it is that I would be installing before I start using it. So there's a chance there, right?
E
The biggest scalability issue was my credit card; it does not scale well. But honestly, since it's not running in workloads, it's running on EC2 hosts, that was something I had to consider. For example, on the tail end of the analysis there were some weird issues where some packages happened to hang during install, or take longer than expected, and then that had some downstream effects: I'd have some orphaned sysdig processes that I'd have to go and clean up.
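One way to avoid the hangs just mentioned is to run each install under a hard deadline, so a stuck package can't stall the pipeline or leave orphaned capture processes behind. A sketch; the 300-second default and the pip command in the comment are assumptions, not the speaker's actual setup.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s=300):
    """Run cmd, returning True on success and False on failure or timeout.
    On timeout, subprocess.run kills the child before raising."""
    try:
        done = subprocess.run(cmd, timeout=timeout_s, capture_output=True)
        return done.returncode == 0
    except subprocess.TimeoutExpired:
        return False  # caller would then tear down the container and capture


# e.g. run_with_timeout(["pip", "install", "--no-input", package_name])
print(run_with_timeout([sys.executable, "-c", "pass"]))
```

Pairing this with a teardown step that also kills the matching sysdig process would address the orphaned-process cleanup described above.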
E
Overall, statistics-wise, this took about three to four days of the hosts running, churning through packages. I had about 12 to 15 micro-to-medium EC2 hosts running, and it cost me about 120 bucks all in, I think. So scalability-wise, I think there's always the opportunity to scale that middle section, the actual analysis, but that was running on all the packages. If we think about what a continuous pipeline looks like, those needs become smaller.