From YouTube: CDF SIG MLOps Meeting 2020-05-07
A: So I invited a few other people who might join a bit later. So, there we go — Raúl, he's from CloudBees and has worked on various data science things. I wasn't sure whether other people would show up this week, Terry, so maybe before we start everyone could introduce themselves a bit, so I get to know all of you better. Terry, tell us — and tell Jesse and Raúl — about the stuff you've been working on and about you. Yeah.
B: Hey — thanks for the invite. I'm Jesse; I know Mike through the startup community, and I'm the founder of CodeLingo. We're interested in basically getting insights out of source code, and I just have a general interest in ways to improve the productivity of developers and development teams. We don't actively use ML or AI in CodeLingo at this stage, but it's certainly on our future roadmap, and yeah — I'm super keen.
B: It's just really helpful to get a bit of clarity around this definition, so it's lovely to be at least a fly on the wall in a think group like this. I really love a lot of the points that were made in that readme, and it's just lovely to get a bit of clarity around the terminology and the problem space. Yeah.
A: The CDF, for Jesse and Raúl's benefit, is a sub-sub-foundation of the Linux Foundation — so, sort of big in the open-source community — and it's a very, very small sister of the CNCF and other things like that. Raúl, do you want to mention some of the interesting stuff you encountered in the past? It sounded like you're pretty interested in the AI side.
D: I started with machine learning some years ago at Otto, working on the first implementation of the serving service — that was the early times of ML, you know, Spark Streaming was not ready yet — and basically, as a developer, I fell in love with the kind of new challenges that machine learning posed for me in building something useful. After that, some years ago, I started at CloudBees, where I'm working on the more DevOps side of things, on the CD team, so I'm very involved in releases.
D: That kind of thing. And now, when Michael told us about this, I thought: oh, this is a marriage of two amazing things — CI/CD and machine learning — and it also makes a lot of sense. So my interest comes from the CI side; there are a lot of missing things that can be done there, and it's quite interesting for me. So that's the reason why I'm here — thanks, Mike, for the invite.
A: Yeah, so I got interested in this through talking to Terry, and I guess my day-to-day interest is in applying some of this stuff. I'm a bit of a — I think the word is dilettante — with things like this: I dabble in things, and I've used different libraries and tools throughout the years. But lately I got my hands on a lot of data: we started collecting a lot of data for Jira issue tickets and, you know, that sort of thing, so I started looking at things we can do by analyzing it.
A: It's not so much about what you can do with ML or AI; it's more about how you get to production and operate in production — a lot of stuff around pipelines and that. So I think that's pretty much an accepted definition, but part of the idea of this special interest group is to narrow that definition down a bit, because, like Terry mentioned, there's interest from all sorts of interesting companies — I think there was Intel...
C: With machine learning applications in production environments, in the majority of cases teams are having to treat those things as special cases within the overall software development lifecycle. They're often having to build their own platforms to allow them to deploy those assets and, as a result, there's a lot of bespoke technology out there right now which doesn't sit cleanly with the rest of the learning that we've got from, you know, seventy-odd years of software development experiments and learnings.
C: So now we need to transplant a lot of the knowledge that we've gained from that software space back into the machine learning teams, who typically tend to come from a maths background rather than a software engineering background and, in many cases, haven't been exposed to the types of challenges that we've spent decades on in trying to run software in production environments.
A: That's what we sort of went over last week. In that document there was a table towards the bottom, and the first thing we talked about was the role of notebooks, so I think this week we're going to move down that list and talk about handling data and such. I had a few other things I wanted to bring up first, though, that I thought would be interesting, just quickly.
A: So there's this library that TensorFlow has — I mentioned it to Terry in an email — called AdaNet. I just pasted it in the chat, if you can see that. I came across it because I was using a service that used it under the covers, and I was curious about it. So I started looking into it a bit more, and I thought: this is like Terry talks about — you've got data scientists whipping out R code, or whatever code, and notebooks, and then you've got engineers trying to bring that to production.
A: But this is almost the inverse — the yin to that yang. You might have developers that, in my case, have access to a lot of data. I'm not a data scientist; I can do the training courses, and I knew a lot of that maths once, but I find tools like this interesting because they remove me from twiddling too many knobs — hyperparameters and things like that, or even choosing, you know, how many layers and things like that.
A: It's a tool that uses machine learning to select — not a reasonable approximation, but which model to use and how to tune it — and I've had some pretty good early results with it. So I thought that was an interesting tool, and I mentioned to Terry it's something we could add one day to a sort of library of quick starts, so that when a developer comes across this they can go: I can format my data this way, and I can normalize things that way.
A: But I don't really want to learn about all this stuff — we don't have data scientists on staff. So I thought that was an interesting tool, and something that Google are investing in. It's called AdaNet, and it uses a neural network to train other neural networks, and even ensembles — you know, an ensemble where you've got a bunch of different algorithms, or different types of neural networks, sort of glommed together — and that's something that's usually hard to do, just because of the sheer permutations you can try. And so this works.
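For reference, a minimal sketch of the kind of AutoML-style ensembling being described here, using the adanet package's AutoEnsembleEstimator from the TensorFlow Estimator era; the feature columns, step counts and input function are hypothetical placeholders, not anything from the meeting:

```python
import adanet
import tensorflow as tf

# Hypothetical feature columns for a simple tabular dataset.
feature_columns = [tf.feature_column.numeric_column("x", shape=(10,))]
head = tf.estimator.BinaryClassHead()

# AdaNet searches over a pool of candidate estimators and learns how to
# ensemble them, instead of asking the developer to pick one model and
# hand-tune its knobs.
estimator = adanet.AutoEnsembleEstimator(
    head=head,
    candidate_pool=lambda config: {
        "linear": tf.estimator.LinearEstimator(
            head=head, feature_columns=feature_columns, config=config),
        "dnn": tf.estimator.DNNEstimator(
            head=head, feature_columns=feature_columns,
            hidden_units=[64, 32], config=config),
    },
    # How long each AdaNet iteration trains before it considers
    # growing or re-weighting the ensemble.
    max_iteration_steps=1000,
)

# train_input_fn would be a standard tf.data-based input function:
# estimator.train(input_fn=train_input_fn, max_steps=5000)
```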
A: You might not have as much data science experience, but if you can throw computers at the problem at massively parallel scale, you can have it try a bunch of permutations for you. I thought that was a fascinating idea, and with, you know, the cloud having lots of stuff on top of it, I think it might take off. So that was an interesting one I wanted to bring up.
B: So what I'm thinking — just putting my traditional software development head on — we've got the old SDLC, but it seems like there's a similar lifecycle that needs to be acknowledged when we're looking at integrating models into production services. And that's everything from, you know, your initial idea, to actually building the model, tweaking the model, and then actually deploying it as well. I imagine there's a whole set of tools and mental models to help us grapple with each one of those stages.
B: CD has arisen as a concept and CI has arisen as a concept — and, for example, at CodeLingo we're trying to get across this idea of continuous insights, which is another story — but I'm wondering, and of course stop me here if I'm railroading, whether there's any work being done on, again, some kind of equivalent to an SDLC to help guide this practice. Let's put it like that — like, what's the...
A: I guess Jesse's sort of asking: if you use one of these things, how do you step back and see the big picture? Like, what is training — what's a training stage of a pipeline? What's the input to the training? If you were to draw a left-to-right diagram, the way you'd like to, where do the data scientists live? Do things like that exist, I think — sorry. So...
C: We need to demonstrate that there are ways of adding value to the machine-learning ways of working that exist today, without constraining those systems too much. Because part of the challenge is that we can't say "here's the best way of working in the machine learning space" — it's a technology that's in its infancy, and we have no understanding of how it will actually turn out. The techniques are evolving so fast that it's very difficult for any one person to keep abreast, I think.
A: Terry, I think Jesse comes from probably a similar position to me, where it's sort of the developer angle — here are the things I know; what are the things I don't know — whereas the more pressing problem is what Terry's saying, which is, you know, the notebooks and data science angle, because that stuff's already happening. But maybe just a dictionary somewhere would help — a dictionary of definitions, like: training is like this, its inputs are these, its outputs are these. What's a pipeline in the machine learning sense?
A: Is a model a deployable? You know, what even is a model? That's something that a lot of people in practice — probably even developers — just don't really know under the covers. It's like: well, a TensorFlow model is a bunch of config files and a binary protobuf file. Maybe there's a space for some definitions like that; I kind of just kicked that around in my head, somewhere I could sort of look things up, I think.
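A hedged illustration of what such dictionary entries could look like if written down as code — plain Python type signatures, with every name here hypothetical; the point is only that each term gets explicit inputs and outputs a developer can look up:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Dataset:
    """A versioned, immutable snapshot of training data."""
    uri: str        # e.g. an object-store location for the snapshot
    checksum: str   # content hash, so training runs are reproducible

@dataclass
class Model:
    """A deployable artifact: for TensorFlow, a SavedModel directory
    (config files plus a binary protobuf), not source code."""
    path: str
    metrics: dict

# "Training" is a pipeline stage: dataset + hyperparameters in, model out.
TrainingStage = Callable[[Dataset, dict], Model]

# "Deployment" consumes a model artifact, never the notebook it came from.
DeployStage = Callable[[Model], None]
```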
A: For developers, for sure — you know, models can be big; they might go on a phone, they might go on a server. There's a lot of code that goes into massaging data before it goes into a training stage, and, you know, when the training outputs a new model, you want to do the CD kind of rollout thing with it, just as if it were a jar file from a Java compiler. There's a lot of things like that.
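As a concrete illustration of "a model is a deployable" — a minimal sketch exporting a TensorFlow model as a SavedModel, which is exactly the bundle of config files plus binary protobuf mentioned above; the toy model and paths are placeholders:

```python
import tensorflow as tf

# A trivial stand-in model; in practice this comes out of the training stage.
model = tf.keras.Sequential([tf.keras.layers.Dense(1, input_shape=(4,))])

# Export the deployable artifact. The resulting directory is what a CD
# pipeline would version, promote and roll out -- analogous to a jar:
#   export/1/saved_model.pb   <- binary protobuf graph definition
#   export/1/variables/       <- trained weights
#   export/1/assets/          <- vocabularies, config files, etc.
tf.saved_model.save(model, "export/1")

# A serving system (e.g. TensorFlow Serving) picks up new numbered
# versions from this directory, enabling blue/green-style rollouts.
```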
A: So there's a change pending for that — you know, a pull request to change it — so I guess we could move on.
A: So you wrote that Jupyter notebooks are the tool used to train data scientists, as they can easily be used to explore — like, don't data scientists use Jupyter notebooks, not the other way around? So...
D: What I believe we are dealing with here, for sure — I have felt this impedance mismatch when a data scientist is talking with a developer. The feeling I had while you were discussing this is that we are having that exact problem: are we talking to data scientists, are we talking to developers, are we talking to both? And this connects to what you were saying before, I believe. So maybe some sort of impedance matching between the two vocabularies is needed, and that, you know, might make this more readable for both sides.
C: Yeah, I think you're right in that, but I think we also have another problem here, which is just a habitual one. Typically the universities are using a limited set of tools to teach machine learning, and they're doing it from an academic perspective. So what they teach is what you need to get an academic understanding of machine learning, but they don't teach any of the practical elements of how you use those assets in the real world.
C: Let me change that to "educate", and then that will be clearer — yep. Because we saw the same problem with the universities: initially everyone was being taught to program in C, and then we had to retrain everyone when they came out into the industry; then the universities switched over to Java, and everybody gets trained to be a Java developer, and then you've got to retrain them again when they come out into industry.
C: You know, the reality is that Jupyter notebooks are very inconsistent. You can execute part of a program in a Jupyter notebook and get a result which is complete nonsense, because you only executed part of your notebook — but it will still give you a result, and that result is meaningless.
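A small sketch of one common mitigation, assuming the papermill package: re-execute the notebook top to bottom in CI, so a result only counts if the whole notebook runs cleanly in order; the notebook names and parameter are illustrative:

```python
import papermill as pm

# Execute every cell from top to bottom with pinned parameters.
# If any cell fails, papermill raises and the CI job fails -- so results
# produced by out-of-order, partial execution never reach review.
pm.execute_notebook(
    "analysis.ipynb",        # source notebook (hypothetical name)
    "analysis.out.ipynb",    # fully executed copy, kept as a build artifact
    parameters={"data_version": "v42"},
)
```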
C: So the point here is really that what we need to do is flag the fact that other tools are needed, and encourage people who are working in those spaces to develop the capabilities to support the MLOps standard with their tooling. And obviously the likelihood is that we are going to see things like Visual Studio introducing new functionality that is better aligned to supporting data scientists, and at that point Jupyter notebooks will probably go back to being a more niche desktop tool, right? Yeah.
A: You do, because if you just do it in code and put comments in there, you can't remember why you did it; whereas when you flip it around — where the comments are rich and first-class and the code is in little paragraphs, like literate programming — then you've got this big trail of why you made the decision to drop that column, or why you normalized it in a certain way. And a lot of machine learning is exactly that.
A: So, I guess, looking at the next thing down — in fact, the next few things, which I think are all related: treat ML assets as first-class citizens of the DevOps process, providing mechanisms by which training sets, training scripts and models may be versioned and auditable across their lifecycle, and training sets are managed as assets. To me, the overarching answer — to jump to a solution — is what people call GitOps, I guess; I'm sure everyone would probably agree.
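A minimal sketch of what "versioned and auditable" can mean in GitOps terms: hash the training set and record a manifest that lives in Git next to the training script. All file names and hyperparameters here are hypothetical, and tools like DVC or Git LFS provide a production-grade version of the same idea:

```python
import hashlib
import json
import subprocess
from pathlib import Path

def sha256(path: Path) -> str:
    """Content hash of a data file, pinning the exact training set."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

# Record everything needed to audit or reproduce this training run.
manifest = {
    "training_set": {"path": "data/train.csv",
                     "sha256": sha256(Path("data/train.csv"))},
    "training_script_commit": subprocess.check_output(
        ["git", "rev-parse", "HEAD"]).decode().strip(),
    "hyperparameters": {"learning_rate": 0.01, "epochs": 20},
}

# The manifest is small and diffable, so it is committed to Git; the bulky
# data itself lives in object storage, referenced by its hash.
Path("manifest.json").write_text(json.dumps(manifest, indent=2))
```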
C: So we've mostly fleshed out this section — I think we have fairly broad coverage of a lot of the key problems in this space right now. What we're doing from this point on is working our way through each of these challenges, which we've already spelled out, and then looking at the technology requirements raised by those challenges, to try and spell out how we might develop new capabilities to address them.
C: So what we're doing is collaboratively fleshing out this document to increasing levels of detail, so that we're telling the story of our thinking process, which can then be, you know, shared across the whole community. So anyone interested in building a product in this space has got a very firm foundation to work from, and we're also looking to build out methods for solving some of these problems.
B: Is the impedance between the software developer and the data scientist just terminology, or is there also a philosophical difference there? And would it be useful, slash appropriate, to have a crack at addressing that? Because I'd certainly be quite interested to see, Terry, how you see the world from your perspective, and I could certainly give it a crack from a software developer's perspective.
B: An example could be one potential difference we were talking about earlier — you know, the works-on-my-machine experience with the Jupyter notebooks, that kind of proof point. I feel it's quite common — it's almost a religion amongst software developers — that the code doesn't work unless it's tested. It's really drilled into us, the importance of unit tests and integration tests, and just because it works for you there...
C: So that's one of the fields of testing that we do need to consider, and we also have a lot of awareness around things like bias and fairness, where we need to be able to establish that the decisions that models are making are not, you know, inherently biased in some way, and that they act with reasonable levels of fairness across the populations that the model is intended to serve.
C: So we are introducing a new level of testing complexity into the problem space, which introduces a new set of challenges in terms of how we actually test for it. So partly this question does need to discuss how we treat machine learning assets in respect of things like unit testing and integration testing, but also how we extend those paradigms to cope with things like detecting ethical problems or bias.
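A hedged sketch of what such a check might look like as an ordinary test — here a demographic-parity assertion over a protected attribute, with the toy data, group labels and 0.2 threshold purely illustrative:

```python
import numpy as np

def demographic_parity_gap(y_pred: np.ndarray, group: np.ndarray) -> float:
    """Largest difference in positive-prediction rate between groups."""
    rates = [y_pred[group == g].mean() for g in np.unique(group)]
    return max(rates) - min(rates)

def test_model_fairness():
    # In a real suite these would come from scoring a held-out
    # evaluation set with the candidate model.
    y_pred = np.array([1, 0, 1, 1, 0, 1, 0, 0])
    group = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])

    # Fails the build if outcomes skew too far across populations;
    # the threshold is an arbitrary placeholder.
    assert demographic_parity_gap(y_pred, group) <= 0.2
```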
A: One of the reasons this stuff comes up — it's quite topical in the news — is that people will say: well, how does the model explain how it made a decision? That's never easy to do, because models are often a black box, but that doesn't necessarily mean it's impossible. It's the same if you wrote a whole lot of, you know, messy if statements and loops and stuff, and then it makes a decision to approve a loan or not — how do you justify that? A lot of things are like that, you know what I mean. Yeah.
A: All real production systems are built under duress that way, but the reason it works is you can trace back through the changes in the code: who approved the request? Was it covered by testing? Did the testing cover these scenarios? Did it have acceptance testing? And it's kind of the same thing if you can do all that with a model. It's not like you have to crack open the many layers of neurons — that doesn't make sense. It's more like: what set of data trained it? Can we reproduce it?
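A small sketch of that trace-back idea: stamp the answers to "what trained this?" into the model artifact itself, so an auditor can walk from a deployed model back to data, code and pipeline run. Every field and value below is an illustrative placeholder:

```python
import json
from pathlib import Path

# Written next to the exported model at the end of the training stage.
provenance = {
    "model_version": "2020.05.07-1",
    "training_data_sha256": "<hash from the data-versioning manifest>",
    "code_commit": "<git commit of the training script>",
    "pipeline_run_id": "<CI/CD run that produced this artifact>",
    "test_reports": ["unit", "acceptance", "fairness"],
}
Path("export/1/provenance.json").write_text(json.dumps(provenance, indent=2))

# To reproduce: check out code_commit, fetch the dataset by its hash, and
# re-run the same pipeline stage -- the same questions we already ask of
# any other production artifact, like a jar from a Java compiler.
```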
A: If it's a recommendation system — you know, a movie thing, or, in Jesse's case, it might be suggesting some code, or a policy change or something like that, that it's spotted would work better this way — there's not really an issue there. It's just a recommendation, or a bit of automation. But, you know, in the grand scheme of things, it's super risky to not be able to explain things and trace them back.
C: So we're already seeing a strong skew against being able to use machine learning solutions in certain territories, and that means that the standards any AI solution will be held to will be much higher than the equivalent standard for a human worker in the same situation. And so the expectations for AI will in many cases be unrealistic in terms of compliance, because we have no mechanisms by which we could prove the same thing of the existing humans doing the role.
C: My consideration in that space is that most organizations are going to be required to prove legislative compliance in their AI solutions, and they will want to include that in the governance processes for their releases. So it really needs to be a feature of CI/CD platforms going forward that there are mechanisms built in that allow you to demonstrate having gone through that audit process.
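A minimal sketch of what such a built-in mechanism could look like: a gate script the CI/CD platform runs before promoting a model artifact. The file layout matches the hypothetical provenance.json above, and the required checks are placeholders:

```python
import json
import sys
from pathlib import Path

# Evidence an organization's governance process might demand (illustrative).
REQUIRED_EVIDENCE = {"unit", "acceptance", "fairness"}

def audit_gate(artifact_dir: str) -> None:
    """Refuse to promote a model unless its audit trail is complete."""
    prov = json.loads(Path(artifact_dir, "provenance.json").read_text())
    missing = REQUIRED_EVIDENCE - set(prov.get("test_reports", []))
    if missing or "code_commit" not in prov:
        sys.exit(f"audit gate failed: missing evidence {sorted(missing)}")
    print("audit gate passed: provenance complete, promoting", artifact_dir)

if __name__ == "__main__":
    audit_gate("export/1")
```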