From YouTube: CHAOSS.Risk.July.28.2020
A: Okay, welcome to the Risk Working Group meeting. I will put a link to our meeting notes in the chat. I got real fancy and created a separate browser window for this, because then I'll share the notes. Here is the link to the notes, and I will share it.
A: These are the Risk Working Group notes. Today is July 28th; you can see that I just entered a fake name. I don't know if any of you know who Robin Yount is. He was a player for the Milwaukee Brewers. He started in the major leagues at the age of 18, was the MVP a couple of times, lost a World Series in 1982, etc., etc.
A: It'll be the shortest baseball season ever, but enough about COVID. The things we have on the agenda, which I carried forward from last time, are tag classification work, forks, and some discussions about pull requests, stakeholder influence, and code complexity. These are metrics we might be able to work on during the meeting. One thing we have is this spreadsheet of labels: how many times labels were used across nine different collections of repositories.
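The counting described here, how many times each label is used and in how many distinct repos it appears, can be sketched in Python. The (repo, label) pairs below are hypothetical stand-ins for the spreadsheet data, not the actual CHAOSS collections.

```python
from collections import Counter

def label_stats(repo_labels):
    """repo_labels: iterable of (repo, label) pairs, one per label application.

    Returns {label: (total_uses, distinct_repo_count)} so a label that is
    heavily used in one dominant repo can be told apart from one that is
    spread across many repos.
    """
    uses = Counter(label for _, label in repo_labels)
    # Deduplicate (repo, label) pairs so each repo counts once per label.
    distinct = Counter(label for _, label in set(repo_labels))
    return {lbl: (uses[lbl], distinct[lbl]) for lbl in uses}
```

For example, a label applied twice in one repo yields a total of 2 but a distinct-repo count of 1, which is exactly the "dominant repo" effect discussed later in the meeting.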
A: All those repositories have been updated, so some of these numbers are nearly double what they were at this point. Kate went through and did a classification: process check, solicitation, and a classification called affirmation. Don't confuse me, is that "information"? Yeah, affirmation.
A
Wider
while
it
seems
to
be
even
wider,
maybe
I
should
just
okay,
you
know
what
I'm
just
gonna,
wrap
format,
format,
text,
wrapping,
wrap,
okay,
and
so
this
was
kind
of
the
the
code
section
that
kate
developed
and
she
went
through
the
most
used
codes
and
and
put
them
in
a
bucket-
and
we
talked
through
this
at
one
of
our
prior
meetings
and
I
think
generally
agreed
with
the
classification,
but
one
of
our
questions
was
okay,
so
a
label
is
used
so
many
times,
but
sometimes
repos
are
dominant.
A: We could have some collections of repositories that include a small number of overlapping actual repos, right? More than one party might be interested in an analysis of Kubernetes, to throw that out there again. So essentially, this is on that collection; I have to look really quickly to see how many total repos are in it.
A: I believe there are 13,000 unique repos in this particular collection; it's just a sample. These are the labels that occur in the most distinct repos. And I'm not sure, let's see, I don't know if I'm able to split my screen.
B: One of the things I was wondering about when I was thinking about the repo counts is, if you look at CNCF and Kubernetes, they tend to all start within the same umbrella, and I'm wondering to what extent projects under the same umbrella tend to share the same cultures. So I'm just wondering if that's a dimension to think about here.
A: And to Kate's point, I think what you're suggesting is that we consider which of these buckets... well, are all of these considered CNCF projects, or?
B: Well, CNCF isn't quite all of them. I think these are parts of the whole ecosystem; it's an ecosystem mapping. But if you look at the CNCF projects, one second here, let me give you the link in the chat.
A: I think they've grayed out the ones that are not CNCF projects, if I'm looking at this correctly.
E: On that page, you can filter by their relationship. I'm not sure if that's how you got there, Kate, but on that landscape there's a filter on the left.
B: Yeah, basically it's been put under the CNCF and it has an incubation status, which is what's going to establish the set of cultures around it. That's why I was thinking there might be common behaviors that are prescribed for CNCF, and therefore common tagging. I do know that there are common behaviors for things that are being packaged for Debian and working inside the Debian ecosystem.
A: Yeah, and yet I know some of the places I pulled repositories from are probably... is that one under the CNCF? No, no, I wouldn't think so. Okay, so some of the places I'm pulling repositories from are not; I mean, I think there's a fair number of them that are not CNCF projects, but also a fair number that are.
A: I should look at that for sure and break it down. If we can put them into ecosystem cultures, it will be interesting to see the differences. I do think it's useful, though, because when I looked at the labels at the top by number of repos using them, like "bug" and "enhancement", I noticed some were like "kind/bug".
A: Because the questions that we wanted to answer with these labels, Kate, are focused on how effectively projects are using them: are they reinforcing practices? What subsets are used? Where?
B: You know, by the next meeting, all right. I'm sure I've got the links.
A: Yep, they're in the notes here; I'll just put them there, and then I guess maybe I should put that under another bullet that says "classification of...".
D: I have one question on the tags, especially the bugs and criticals. Are we ascertaining quality or quantity? Like, more bugs means a more risky project and fewer bugs a less risky project, or is it the type of bugs that factors into the riskiness of a project? In what terms are we thinking about those categories?
H: I have a question, because I'm brand new to this: do you have any sort of documentation of the methodology and how exactly you're pulling things? I know you mentioned that you're counting by URLs versus repo names; that was just one little thing where I'd like clarity on what exactly you're looking at, because that might help if I want to give any feedback around it.
H: I was just thinking, as we're talking about what could be indicators of risk: I will look at the methodology of how you pulled it, so I can know what to ask about, but I'm assuming there are certain file types or branches that might have more influence than others in terms of saying where the bugs are.
H: Versus something that's more executable, which potentially has a different profile. So I don't know how available that is in terms of what's being pulled: file type, location within a repo, and the functionality of what that thing does, to know whether or not it should be of higher priority.
A: Yeah, I think there's an important point you're making. The methodology is that we're using Augur to pull the data, and what I've done is taken a set of Augur repositories and created a sort of meta-aggregation database that pulls them all together.
A: Postgres has this feature where you can make tables that are connected to lots of different databases, and that's how I'm doing this; that's how I'm aggregating the 13,000 repos in this set, which may be larger when I actually go count again. But I think making that transparent is really important.
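The meta-aggregation idea (one view over many Augur instance databases, which the federated-tables feature of Postgres makes possible) can be imitated in plain Python for illustration. The per-instance count maps below are hypothetical; the real setup queries the connected Postgres tables directly.

```python
def aggregate_instances(instances):
    """Merge per-instance {repo_url: repo_count_metric} maps into one view.

    Repos that appear in more than one instance (e.g. two parties both
    tracking Kubernetes) are deduplicated; keeping the largest count seen
    is a conservative stand-in for 'most recently collected'.
    """
    merged = {}
    for rows in instances:
        for repo, count in rows.items():
            merged[repo] = max(count, merged.get(repo, 0))
    return merged
```

The number of unique repos in the merged view (13,000 in the meeting's sample) is then just `len(merged)`.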
A: Yep, I'm definitely going to give you access to that. I think providing a public user for that database is a good idea. I just provided the first public user for an Augur database when I released the community reports project earlier this morning.
A
That
it's
in
the
chaos
organization,
okay,.
A: Yes, and right now the read-only credentials I've put in there for show, so people can play around with it, are just for two CHAOSS repository databases. But I might not publish to the world, in a repository, the credentials for a read-only user on a meta-database that spans a whole bunch of Augur instances, because I don't want to have to buy two hundred thousand dollars in equipment.
A
But
I
think
sharing
it
within
a
working
group
is
something
that
can
be
easily
accomplished
without
sort
of
public
sharing
of
credentials,
make
transfer
and
publicly
available,
or
at
least
available
or
maybe
with
credentials.
A
And
so
I
mean
anybody
that
wants
the
credentials,
can
have
them
it's
it's
just
you
know
how
well
it
is
when
you
put
credentials,
you
can
read
only
stuff
out
there
on
the
internet.
Just
makes
me
nervous
and
so
there's
queries,
queries
that
I
I
use,
which
I
can.
I
can
share.
A
And
there's
also
yeah,
so
there's
some
other
things
we
do
with
auger
like
verify
that
we're
the
number
of,
for
example,
pull
requests
that
we
have
on
our
database
is
consistent
with
the
github
or
gitlab
metadata
about
the
number
of
pull
requests
or
issues
in
in
their
system.
You
know
so
we
kind
of
have
a
strategy
for
ensuring
data
completeness
as
well,
but
but
yeah.
I
think
I
think
I
don't
know
sophia.
Do
you
have
ways
that
you've
worked
that
make?
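The completeness check described (comparing counts in the local database against the platform's own metadata) reduces to a comparison like the following sketch. The actual GitHub/GitLab API call is omitted; `reported_count` stands in for whatever total the platform reports.

```python
def completeness(local_count, reported_count):
    """Fraction of platform-reported items present locally.

    1.0 means the local database matches the GitHub/GitLab metadata exactly.
    """
    if reported_count == 0:
        return 1.0 if local_count == 0 else 0.0
    return local_count / reported_count

def is_complete(local_count, reported_count, tolerance=0.01):
    """True when local and reported counts agree within `tolerance`."""
    return abs(completeness(local_count, reported_count) - 1.0) <= tolerance
```

A collection run would call this per repo for pull requests and issues, and re-collect any repo that fails the check.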
A
You
know
I've
worked
in
academic
settings
to
make
stuff
like
this
transparent
and
I'm.
What's?
What
do
you
think?
The
best
way
to
make
things
like
this
transparent
would
be
because
the
whole
auger
infrastructure
is
a
very
complex
data
collection
system
and
then
we're
pulling
data
out
of
that
complex
data
collection
system
and
the
complex
data
collection
system
is
transparent,
but
the
data
we
collected
is,
you
know
less.
H
Go
ahead,
I
think
I
mean
I'm
actually
working
on
a
similar
project
internally
and
it's
for
me.
I've
been
focusing
on
trying
to
just
sort
of
list
out
the
schema
and
sort
of
things
that
you
could
actually
look
at
within
the
database.
H
So
if
you
didn't
have
to
expose
the
tables
themselves,
but
knowing
the
types
of
tags
or
schema
or
categories
that
you
can
cory
against
and
then
generally
what's
in
it,
you
don't
necessarily
have
to
expose
the
database,
but
I
would
expose
potentially
the
query
so
that
you
can
see
how
you're
counting
and
what
you're
counting
and
then
knowing
the
query
itself
and
knowing
the
schema
and
schema
available,
then
I
can
ideally
extrapolate
what
else
what
other
things
you
could
run
on
top
of
the
same
database
like
I
think
for
me,
that
would
be
sufficient
to
get
an
understanding
of
what
it
is
and
how
you're
using
it
as
well
as
learn
from
that
and
potentially
expose.
H: Are there other things that we would want to aggregate into it? And then I'm less familiar with the limitations of what it could support, in terms of how you use that data set and what you want to aggregate, but I'm learning, so I've got to start somewhere.
B: ...guys offline, then.
A
And
so
the
other
things
is,
is
that
bernard
do?
Did
we
release
a
candidate
metric
for
forks
under
this
release
or
did
in
the
common
working
group,
or
did
we.
E: We didn't release forks; that one's in the early stages.
A: Okay. And another one that's been proposed is stakeholder influence.
H: Yeah, I've been thinking a lot about, while trying to mind the privacy lines here, mapping out dependency points within a project: say, the number of owners, maintainers, and approvers that are getting things moving forward. So essentially not the bus problem, but looking more executionally at how many people are in that path and how much that changes, as a way to detect whether or not losing a person will disrupt a workflow, or how many workflows, I guess.
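One way to make the "how many people are in that path" idea concrete is to count, per workflow, the distinct approvers who can move it forward, and flag single-person paths. This is a minimal sketch; the workflow-to-approvers mapping is a hypothetical input, not something Augur exposes under this name.

```python
def bottleneck_workflows(approvers_by_workflow):
    """approvers_by_workflow: {workflow_name: set of people who can approve}.

    Returns the workflows that depend on exactly one person: losing that
    person would disrupt the workflow, which is the risk being described.
    """
    return sorted(wf for wf, people in approvers_by_workflow.items()
                  if len(people) == 1)
```

Tracking how this list changes over time would capture the "how much that changes" part of the suggestion.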
H
Yeah
projects
slowing
down
or
getting,
I
guess
getting
hung
up
if
things
are
shifting
too
around,
but
maybe
this
is
just
a
hypothesis
that
I
haven't
looked
into
as
much
I'm
not
as
familiar
with
with
auger
and
the
chaos
tools
yet
so
I
think
for
me,
my
action
item
is
to
look
into
better
understanding
these
things
and
how
to
use
them.
I've
mostly
been
working
in
bigquery
and
github
data
sets
on
bigquery.
H
That's
what
I
have
accessible
and
that's
what
I
started
with,
but
I
realize
that's:
that's
not
the
only
point
of
information.
Actually,
your
comment
on
measuring
the
consistency
and
data
between
them.
We
know
that
the
archive
data
is
a
bit
lossy,
so
it's
not
necessarily
the
best
thing
to
look
at,
even
though
a
lot
of
my
initial
analysis
has
looked
predominantly
at
the
archive
event
stream,
and
I
I
can
tell
you
when
it
went
down
and
how
much
data
we
lost
recently.
So
I
know
it's
not
complete.
H: Yeah, going down is not the biggest problem; I think it just drops things. The amount of requests coming in sometimes maxes out the input, or however the input is being processed by the server. So we know that a certain amount of logs are dropped, and that is variable. At one point in time, some analysis found up to 20% of logs being dropped, but at other times I don't think it's that lossy.
H
So
we
just,
I
think
what
I've
been
mostly
looking
at
is
the
deviation
between
the
monthly
polls
and
the
yearly
equals,
and
we
are
seeing
deviation
of
the
hundreds
within
thousands
of
ratio
so
like
if
you're
looking
at
a
number
of
10
000,
then
in
the
monthly
database,
it
might
be
like
10
700.,
seeing
deviations
about
that
size.
Okay,.
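The deviation being described, hundreds per thousands between the monthly and yearly pulls, is a simple relative difference; for the 10,000 versus 10,700 example:

```python
def relative_deviation(yearly, monthly):
    """Relative deviation of the monthly figure from the yearly baseline."""
    return abs(monthly - yearly) / yearly

# The example from the discussion: 10,000 in the yearly pull,
# 10,700 in the monthly database, i.e. a 7% deviation.
dev = relative_deviation(10_000, 10_700)
```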
A: I think those are very risk-appropriate questions. I think we've historically looked at the likelihood of a project continuing to be sustained, but when projects are very important and evolving quickly, bottlenecks, I think, are another risk to progress. If I'm depending on a project continuing to make forward progress and it doesn't, then that's a problem.
A
I
encourage
you
to
look
at
the
the
metrics
we've
released
just
to
see
what
what
kind
of
metrics
exist
and
also
I
will
I'm
trying
to
think.
A
Inside
of,
I
think
the
risk
notes
afterwards
I'll
just
include
them
as
part
of
the
risk
group
minutes,
even
though
they're
not
part
of
our
meeting
and
then
email
you
all
when
I'll
just
email
the
chaos
mailing
list,
one
that
that's
back
online
when
I
get
that
done
probably
tomorrow,
given
my
to-do
list
today,.
A
So
any
other
questions
that
anyone
has
on
risk
I
mean
we
have.
I
guess
code
complexity
is
another.
Is
the
final
outstanding
metric
that
we're
developing
and
that's
one
that
we
have
measures
for
it
in
in
auger,
so
we
have
a.
We
use
a
tool
called
scc,
which
is
a
which
employs
a
kokomo
algorithm
for
counting
complexity,
and
it's
in
a
table
called
repo
labor
in
augur
I'll.
Just
illustrate
that
really
quick.
A
It
would
not
be
a
lot
of
work
for
me
to
build
a
query
that
summarized
this,
but
essentially
it
goes
through
every
single
file
in
a
repository
and
calculates
the
number
of
lines
which
one's
our
code,
which
ones
are
comments
which
ones
are
blank
and
then
a
code
complexity
number
which
is
very
often
zero,
indicating
that's
a
pretty
simple
file,
but
some
of
some
files
are
more
complex
and
I've
seen
javascript
like
giant
javascript
files,
get
super
high
numbers
these.
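The per-file counting described (code versus comment versus blank lines) can be sketched as below. This is a deliberate simplification of what SCC does, handling only `#`-style line comments; real SCC also handles block comments, strings, and per-language syntax.

```python
def classify_lines(source, comment_prefix="#"):
    """Count blank, comment, and code lines in one file's source text,
    mirroring the per-file tallies stored in Augur's repo_labor table."""
    blank = comment = code = 0
    for line in source.splitlines():
        stripped = line.strip()
        if not stripped:
            blank += 1                      # whitespace-only line
        elif stripped.startswith(comment_prefix):
            comment += 1                    # line starting with a comment
        else:
            code += 1                       # everything else counts as code
    return {"blank": blank, "comment": comment, "code": code}
```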
A
These
complexity
numbers
can
then
be
used
as
a
proxy
for
calculating
relative
increased
labor
cost
of
maintaining
a
piece
of
software
or
developing
a
piece
of
software,
and
essentially,
when
you
find
a
non-zero
or
a
non-one
level
of
code
complexity,
the
more
of
that
you
have
and
the
more
lines
of
code
that
are
involved,
the
higher
your
labor
cost
for
maintaining
it
and
you
we
can
actually
store
that
as
well.
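The labor-cost proxy can be made concrete with the basic COCOMO effort formula that SCC's cost estimates build on: effort in person-months is a * KLOC^b, with the standard published coefficients a = 2.4 and b = 1.05 for an "organic"-mode project. Feeding complexity-weighted line counts into it, as the discussion suggests, is an assumption beyond the basic model.

```python
def cocomo_effort(lines_of_code, a=2.4, b=1.05):
    """Basic COCOMO: estimated effort in person-months for an
    'organic'-mode project, from lines of code (converted to KLOC)."""
    kloc = lines_of_code / 1000
    return a * kloc ** b
```

For a 10,000-line repository this gives roughly 27 person-months, and the estimate grows slightly faster than linearly with size, which matches the intuition that more (and more complex) code costs more to maintain.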
A
So
well,
if
that's
kind
of
where
the
risk
group
is
today
any
other
questions
or
topics
that
anyone
wants
to
bring
up
before
we
call
our,
I
mean
our
meeting
has
technically
eight
more
minutes
if
we
want
to
use
them
or
seven
more,
but
I'm
okay,
not
using
all
the
time.
I
do
not
feel
like
all
the
time
needs
to
be
used.
A
Yeah,
I
think,
with
the
to
do
items
that
I
have
here
and
that
kate
has,
I
think
by
my
you
know,
in
a
few
days
I'll
have
some
of
mine
done
and
by
the
next
meeting
kate
we'll
have
some
of
hers
done
so
we'll
start
to
develop
a
clear,
clear
picture
of
that
was
not.
A
That
was
not
mine.
That
was
a
develop,
a
clear
picture
of
yeah,
of
where
the
data
comes
from,
which
will
be
helpful
and.
A
Start
the
development
of
a
couple
of
metrics
and
try
to
move
some
some
of
our
metrics
towards
release
during
the
interim
period,
because
I
think
it's,
I
guess,
we've
got
three
more
days
left
in
the
current
review
period
for
metrics
and
I
don't
believe
we
released.
A
Well,
if
there's
anything
else,
I
guess
I'll
stop
the
recording
say.
Thank
you
and.