Kubernetes Machine Learning Working Group, 12 Apr 2018

From YouTube: Kubernetes Machine Learning WG 20180412

A

Okay, so the agenda is empty, so maybe we can just have a quick meeting today. I had one thing that I wanted to discuss at a high level. There are lots of existing ecosystems that are trying to build some sort of ML solution, with or without Kubernetes. I've been asking myself how we can be effective and actually make a difference in this space. By "us" I mean the folks who are participating in this working group.

A

Maybe just to kick-start the process, I was thinking about what our goal could be. I think Connor summarized this pretty well in the umbrella issue that he filed: our goal is to make it easy for both power users and SaaS-like solutions, as in use cases that have some amount of scale, and in this case specifically ML, to succeed with Kubernetes. Is that fair? Would that then be the set of use cases that we go after, or are there other things that we should consider?

B

So, are you talking about amending the charter that we have?

A

The charter that we have is kind of broad, in that it says that for anything related to Kubernetes and ML, we start considering it and see if we have to solve it. When it comes to execution, that becomes a little harder because the scope is too wide. So I'm trying to find out if, as a group, we can just pick one problem that we all think is really important, make sure that works really well, and then move on to the next. Or we could do multiple things too. But if we can have some concrete, achievable goals, then we can structure a conversation just around those and start forming some execution plans.

B

So we said before that support for data sets, at a high level, was a good confluence point for a lot of different tooling. Balaji started a small project within Kubeflow called KVC, so that's one potential area. Balaji, what are your thoughts on how to take that to the next step?

D

Yeah. So currently, some of our internal users are using KVC, and some folks in the Kubeflow community are also trying to use it. We also have interest from the Pachyderm project. Pachyderm is another workflow and pipelining framework for running ML workloads. Pachyderm was trying to integrate with Kubeflow, and they are looking at KVC as the glue for data. So that's another user story that is emerging for KVC, or at least for data set management. But in terms of what KVC solves, it's kind of one part of the data story. There are some use cases that it solves, or some barrier-to-entry issues that it solves. However, there are many more problems that we can maybe address within the data set area or a traffic management area.

D

There is some background noise, but I'm going to repeat what you said, if I'm not wrong. You said that, as a group, we should concentrate. One of the goals we could concentrate on is improving the data set management problems that users face within the Kubernetes ecosystem. Is that correct?

A

Okay. I personally like the data set idea, in that consuming storage right now is a problem with Kubernetes, and it's something that ties a lot into the existing Kubernetes architecture itself. So I don't see it as an app that you can deploy on top of Kubernetes, because it needs some underpinnings within Kubernetes, at least. So conceivably that could be an area where we can invest our energy.

D

Definitely yeah. We are on the same page on that.

A

So if you can identify certain user journeys today where people suffer, and then come up with what we could do differently there, that would help. Or if there are specific use cases that are already known, those can be shared here. That will help justify, or actually help us understand, how we can make a difference in that area.

D

Okay. By templating, both Connor and I meant: how do we get from code to container to Kubernetes? At least, that's the templating idea. So probably we can make that clear. And that seems to be a more obvious issue for almost all data scientists and ML practitioners. Sorry, go ahead.

A

So you're saying that what was intended to be conveyed by templating was...

D

Source to image, and then to Kubernetes, yeah.

B

I think there are two related problems. That's definitely one, and there are a lot of tools that cropped up all at the same time in the last couple of months, like Skaffold and Draft. Intel just open sourced another one that's pretty similar, called MLT. It's fairly similar to something like Draft or Skaffold, but I think the novelty in that project is that it focuses on templates specifically for machine learning frameworks. So you can say "mlt init", and then say that you want the distributed TensorFlow template, and then you get a clean Git checkout with the scaffolding for what you need to deploy distributed TensorFlow.

So I guess it's in some ways similar to what Kubeflow is trying to do, but with the difference that it can also help you build the container. It's released as an initial alpha, just in the spirit of "release early and get feedback" as far as how the architecture works and how the code gets deployed. The main thing that we are concerned about is shortening the path from "I have this idea for a project" to "now I need to copy and paste all of my co-workers' best practices for running stuff on Kubernetes". So that's the part that we wanted to address.

A

If we can make whatever we build generic, it could apply to any different class of problems. If you can imagine a plug-in model where TensorFlow is just a plugin, then that same construct could apply to any other source-to-image problem that exists in the Kubernetes space. There are some other Googlers who have been specializing in just building containers and getting that working.

A

Because at least.

E

Especially with the problem there are specialized.

A

We can bring them in, so we can at least bring them together. There are some people who focus just on the user experience, which would be the developer experience in this case, and then we can get them to interact and understand...

E

Actually, we're having a conversation with the Skaffold team next Monday, so you guys could come to SIG Apps next Monday, April 16th, and ask the Skaffold team. They're going to demo Skaffold, and they're very interested in building containers, but I'm not entirely sure whether they're interested... well, we're going to talk about that in general, and what that looks like.

B

What date is the SIG Apps meeting? Just for the notes.

E

Next Monday, April 16th.

A

Yeah. Maybe we could just publish a doc that evaluates the existing solutions and goes over a user journey, or how one would do this with the existing tools. That could be a good starting point, in that we don't have to jump to a solution right away, but at least we know what the state of the ecosystem is and can then identify next steps. There are a few solutions out there which are good. I don't know if I'm saying it right: FloydHub was one, and RiseML also has a solution somewhat similar to this. There could be more. But I think, as a group, we should probably not be opinionated on what the experience should be, but rather just provide the primitives that people want to build on, and I hope that those solutions can actually...

D

That makes sense.

A

So we also need owners for each of these specific items. For data sets, who could be the champion to keep pushing things through?

I think having a simple proposal from the ML working group in the Kubernetes community repository, stating what the pain points are and what the user journeys could be, would be a great starting point, because we can then go to SIG Storage, for example, and have them...

D

Sure, makes sense. I'll also cross-reference the umbrella issue and make sure people can track all the issues from a single issue.

A

Are there any volunteers to at least take a first stab at looking at the state of the ecosystem?

B

Did someone speak up there? No? Okay, I can take this one. I can probably go tackle this with Nik Lentz. He's on a plane back from Europe right now, but otherwise he'd be here.

A

Okay. I just feel like once we get a direction on what to do, we can start executing.

D

Sure. So maybe one other thing to add to these templating user pain points is around some kind of similar interface for running on localhost and also in Kubernetes. For example, initially the users might want to develop on their laptops, or on some servers that are locally hosted, but later on they want to deploy this in production. They want a familiar interface, or similar interfaces, to do both. So that might be something we could look at as a part of templating.
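A minimal sketch of the "similar interface for local and Kubernetes" idea raised here, not any existing tool's API. It assumes a hypothetical `TrainingSpec` description that is rendered either as a `docker run` command line for a laptop or as a `batch/v1` Job manifest for a cluster; the class and function names are illustrative only.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class TrainingSpec:
    """Hypothetical, tool-agnostic description of one training run."""
    name: str
    image: str
    command: List[str]
    env: Dict[str, str] = field(default_factory=dict)


def to_docker_argv(spec: TrainingSpec) -> List[str]:
    """Render the spec as a `docker run` argument list for local use."""
    argv = ["docker", "run", "--rm", "--name", spec.name]
    for key, value in sorted(spec.env.items()):
        argv += ["-e", f"{key}={value}"]
    return argv + [spec.image] + spec.command


def to_k8s_job(spec: TrainingSpec) -> dict:
    """Render the same spec as a batch/v1 Job manifest for a cluster."""
    container = {
        "name": spec.name,
        "image": spec.image,
        "command": spec.command,
        "env": [{"name": k, "value": v} for k, v in sorted(spec.env.items())],
    }
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": spec.name},
        "spec": {
            "template": {
                "spec": {"restartPolicy": "Never", "containers": [container]}
            }
        },
    }
```

The point of the sketch is only that the user edits one description and picks a target; how a real tool would build images or submit the manifest is left open.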

D

Yeah, so there are three categories, I think. One is people, some scientists and data vendors, who are comfortable with a CLI. The other category is UI. The third category is people who want to work from the code itself. I can only think of those three right now, and there might be more, but each of these solutions is probably different. So maybe we want to think that over as well. That could actually spawn more issues when we start looking over all three categories.

A

Okay, so again I'm rephrasing just to make sure I understood you. You're saying that there will be different user personas or working styles, where the same app could be consumed via different UX primitives, like a CLI or a UI. So are you recommending that we focus on the app and the experience of consuming that app?

D

Yes, and especially looking at these three different categories. For example, in the graphical UI case, it's JupyterHub. In code-first, there are solutions like Metaparticle. In the CLI case, there are several solutions, including ksonnet, which underpins Kubeflow, and there might be other solutions out there.

A

So the Metaparticle idea would sort of fit into the source-to-image aspect of templating, because it's...

D

That's true. It kind of spans both those areas. Yeah, I don't have a very good example for code-first, I guess.

A

I don't know. It feels like this is already happening in the community, so maybe we could just let that happen by itself. I don't know if we can add additional value on that.

D

Sure, makes sense. I think Jeremy's here; maybe he can chime in.

F

I didn't quite understand the question about JupyterHub. What was the question?

A

The question was whether, in the Kubernetes community, we should invest in different apps which provide different modes of access for running ML workloads. One of the apps that was mentioned was Jupyter, and we were just mutually agreeing that there's enough momentum in the Kubeflow community around Jupyter. So we could just...

F

I think that makes sense. JupyterHub has its own Kubernetes community, which is most of what Kubeflow is actually leveraging. So, yeah.

A

And with JupyterHub, we'd make sure that they work well together. So is there anything left to be done there? Maybe I'm wrong, but just knowing what is lacking might also be good.

F

Yeah. Mostly, in Kubeflow, we provide very low-hanging fruit and syntactic sugar. We provide a form that allows users to easily spawn notebook images that we maintain and curate, which are a good set for ML-related workflows. So it's very low-level. We're not adding a whole lot on top of JupyterHub.

A

Well, Jeremy, I don't mean to stop us from investing in that. If you have some ideas on how JupyterHub itself could be extended in order to integrate better with Kubernetes, I think those are useful things to explore as well. I think someone who's not here had some ideas last year on further improving integration with JupyterHub.

E

Investing in Jupyter, yeah.

C

I don't know what's left to do. Problems like...

A

Multi-tenancy.

E

Investing in that, we just want to do enough to get it over the hump.

F

Yeah. I think the biggest question that I've come up with looking at JupyterHub is that, to some extent, they're building a platform for data scientists, which makes sense, because they don't necessarily want to assume you're running on Kubernetes. So some of the things that they do to support non-Kubernetes use cases potentially lead to some friction, in terms of differences from how you would architect it if you were making it only for Kubernetes and building it in a Kubernetes-native way.

E

They're not, yeah. I mean, that might change in the near, well, in the medium-term future, but as it is right now, Kubernetes is not the only way to...

F

I've talked to [name unclear] before, and I think what he told me was basically that where they see uptake of JupyterHub on Kubernetes is with people trying to scale massively. I think a common use case is MOOCs, or whatever the appropriate pronunciation is, where you want to have a whole bunch of people in a class using the same JupyterHub, and making that scale out is where Kubernetes then comes in.

E

...Kubernetes, and you have a bunch of different teams who are going to use it. That's a single... number two. But yeah, I mean, you don't necessarily need Kubernetes to run JupyterHub and add value.

A

Yeah, okay. So one of the other things that Balaji mentioned, like you said, is local versus remote. There's a project that I am planning to look at in the near future, which would be getting minikube to run with GPUs, not just CPUs. If that is a project that this working group would be interested in, I can drive it through this working group. I would love to get support on Windows and Mac; my expertise would be limited to Linux. As far as I know, Mac cannot handle external GPUs. So if I'm left to my own means, I would probably just add support on Linux, but if, as a community, we can collaborate there and get it working on Windows, and maybe on Mac, that would be awesome.

E

Is there any value with the current category of GPUs that are available, other than on the Mac Pro? And the Mac Pro is going to be older silicon. For OS X and Darwin there are eGPUs, which you can buy.

C

It into December I, don't think.

A

That works um well.

C

Maybe does something.

A

But I don't know whether it works through VMs, because the model for minikube is to run a VM. So that's where it gets a little tricky, and I don't know exactly how it works on Windows either. Windows in theory supports virtualization with GPUs, but the devil will be in the details, like getting it performant.

A

Whenever GPUs come into play, there are lots of gotchas.

E

Are people doing that? Are people buying an external GPU, plugging it into a laptop, and running on top of that? And if so, for what?

A

The typical use case is gaming.

E

But would you use this to run ML? It doesn't seem like a cost-effective way to run machine learning workloads.

A

It's less about being cost effective. It's more that everyone has a laptop as a developer, so it's like, okay, I just have to buy this external device. Again, I'm hypothesizing here; I don't have concrete data. To me, it's lowest in the priority. What I would like to see is minikube working for simple use cases, like a Linux laptop or an Alienware laptop.

A

Or for people who use something like the DGX workstation, which some people buy. That could keep extending; there could be more hardware in the future, and I think this would extend to ASICs and FPGAs too. So if there is enough interest in that area, then that could be another project that we can drive, and I'm happy to share that.

G

I think that's a very interesting problem. I'm working in [organization unclear] research, and we are a lab of sixty scientists. The typical workflow that you see is that everyone has a workstation. So it's not a laptop, but at least a machine with a local GPU, where you run the first test just to make sure that your model actually works. When you scale, you run it on a cluster, and that would be something like Kubeflow, because you don't want to hog GPUs for ten hours if, at the end of the day, it's going to crash, right?

D

I second that. I used to work at Argonne National Lab before this, and it was the same situation there: you have a workstation and then cluster access. So you would develop on your workstation with the GPUs, then deploy to whatever cluster is available to you. That might also be the preferred way of developing, deploying, and even testing for all sorts of ML practitioners and data scientists.

E

Yeah, I used to do that a lot with mobile workstations as well, but never with an external GPU plugged into my...

A

Yeah, it's out there, it's possible. I just don't know if it's feasible.

E

I don't know, I feel like every job I've ever had that was related to HPC, we always had some type of high-power workstation with a GPU attached, where we did local experiments before pushing anything to a compute cluster.

E

Especially when you're going to be launching a huge batch job using something like [unclear] or Slurm.

D

No, exactly. Some of the users have this HPC environment mentality, so they expect that kind of an environment.

E

That's how reservations work, right? You've got to take a whole GPU. You'd probably want to test it locally first, even if you're spinning up dynamic resources, before you spend that money. So, yeah.

A

Okay. So what I'm hearing is a resounding yes for investing in that area. Okay, I'm happy to take that on, and I can post a proposal.

So that would be just the local development. But in theory, the same local development environment could be replaced with a Kubernetes cluster, and you could submit the same pod and get the same set of results, if you can stream logs and if you can get metrics. There are other aspects around that, such as making logs and metrics available through familiar tools. For example, checking logs wouldn't work unless the pods are still lying around. I guess there are some usability issues there, specifically when it comes to jobs.

But in any case, I don't think I have the bandwidth to tackle that. For minikube, though, I'll probably post a proposal.
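To illustrate "submit the same pod" in the GPU discussion above, here is a small sketch of a Pod manifest that requests GPUs through the `nvidia.com/gpu` extended resource exposed by the NVIDIA device plugin. Whether a given minikube setup can satisfy this request is exactly the open question in the discussion; the helper name `gpu_pod` and the image name are assumptions for illustration.

```python
def gpu_pod(name: str, image: str, gpus: int = 1) -> dict:
    """Build a v1 Pod manifest that asks the scheduler for `gpus` NVIDIA GPUs.

    The same manifest could be submitted to a local single-node cluster or to
    a remote cluster, provided the node advertises the nvidia.com/gpu resource.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [{
                "name": name,
                "image": image,
                # Extended resources such as GPUs are requested under limits.
                "resources": {"limits": {"nvidia.com/gpu": gpus}},
            }],
        },
    }
```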

A

Connor, going over your points: the other one that you mentioned is providing an API for tracking experiments and metadata. Can you give some more idea of what that is?

B

Sure. I guess it's related to a proof of concept we have going internally, and maybe it doesn't make sense for it to cross into this working group. Basically it's a way to put a little more structure around the concept of an experiment for a data scientist, so they can have some result metadata: maybe pointers to their output directory for each job run on shared storage or blob storage, maybe one or two high-level metrics for each epoch of each job run, and then the ability to manage those as easily as they can manipulate regular Kubernetes objects.

Otherwise, their alternative is that each time they do some distributed hyperparameter optimization, they need to redesign the taxonomy of labels that they use to stitch things back together. If there's no consistency around this, it's hard to write tooling for those users, and it's also difficult for them to collaborate from project to project if they're rewriting some of the base mechanics every time.

A

Yeah, it makes sense, and I'm trying to relate that to the storage one, which makes sense in that the Jobs API kind of sucks for this use case, because we literally need stateful jobs. Ken, who is not here anymore, said that he was thinking about it, and he also said that implementing it as a CRD could be done much faster.

F

There's a lot of activity around this already in the community, right? There is Katib, which is basically a Vizier clone that somebody introduced into Kubeflow and is hosting under Kubeflow. It's built on ModelDB to provide hyperparameter tuning, using ModelDB for model browsing and experimentation. TensorBoard is also looking at this problem of trying to surface people's data and make it easier to surface, and you also have other projects like StudioML that are working in this space, I think. My view is that some of these projects might benefit from using CRDs under the hood, but to me they seem like apps in and of themselves that we would run on Kubernetes to provide this functionality.

D

I want to mention here that when things work as they're expected to, it's all fine. The problem comes when you're doing some hyperparameter optimization and something goes wrong: you need to get at the jobs and the logs of the jobs. So maybe there are frameworks, but we need to concentrate on debuggability, in terms of what went wrong if something goes wrong.

A

To go back to storage a bit: if you don't look at the Jobs API and instead look at the StatefulSets API, it is possible to have dynamically provisioned storage, and the default model is one where it is not even reclaimed.

So in theory, if we had a construct like stateful jobs, we could have the same workflow, where a storage object is created for you dynamically on whatever storage has been configured. Then, as long as the objects are available on the API server or some other external system, you can deterministically figure out where to find the data, including your checkpoints, for each and every run.

That's why I keep going back to stateful jobs: whether we do it as a CRD or as a built-in controller, I think we can add value there for any sort of framework or any sort of app.
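The "deterministically figure out where to find the data" point can be illustrated with how StatefulSets already name their storage: each PersistentVolumeClaim created from a volumeClaimTemplate is named after the template, the StatefulSet, and the pod ordinal. The sketch below only computes those names; the "stateful job" construct discussed above does not exist, so the StatefulSet and template names used here are hypothetical.

```python
from typing import List


def statefulset_pvc_names(statefulset: str, claim_template: str,
                          replicas: int) -> List[str]:
    """Return the PVC names a StatefulSet creates for one volumeClaimTemplate.

    Pods are named <statefulset>-<ordinal>, and each claim is named
    <claim-template>-<pod-name>, so the name of the claim holding a given
    replica's data is known in advance.
    """
    return [f"{claim_template}-{statefulset}-{ordinal}"
            for ordinal in range(replicas)]
```

For a hypothetical StatefulSet named `trainer` with a claim template named `checkpoints`, replica 1 would find its data in the claim `checkpoints-trainer-1`.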

F

I guess, if I understand the use case here, and this keeps coming up in a lot of cases in Kubeflow, basically you want a permanent archive of the jobs and models that you train, which persists after the job is finished or after the model is deployed, right? And yes, you could insert the API server in front of that storage layer. But it's not clear to me why you would do that, as opposed to just talking directly to an appropriate storage or database back end, possibly with a web app or API server in front of that.

B

Well, yeah, for the volume that results could generate, you definitely want a more scalable data store. But in general, if you were to keep track of experiments as a CRD, for example, there's usually at least one order of magnitude fewer experiments than jobs, so the scalability of that doesn't seem to be a concern.

F

To me, the advantage of the controller pattern is that you're managing some set of infrastructure. You have multiple resources that go through some states, and you have to manage those resources throughout their lifetime: keep them up, keep them healthy. Once your job is finished or your experiment is done, and you just have some record of that experiment that you want to persist and it's immutable, it's not clear to me why you really need a controller at that point anymore.

B

Well, at least internally, we're not even using controllers. We're just using the resource types, and the object graph that is afforded to us by owner references, just to make it easier to keep track of how jobs are related to experiments, for example.
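A small sketch of the owner-reference idea described here, under stated assumptions: the `Experiment` kind and the `ml.example.com/v1alpha1` group are hypothetical stand-ins for whatever custom resource the internal proof of concept defines, and the uid would normally be assigned by the API server rather than passed in. Only the `ownerReferences` field layout (apiVersion, kind, name, uid) is standard Kubernetes metadata.

```python
from typing import Dict, List

EXPERIMENT_API = "ml.example.com/v1alpha1"  # hypothetical group/version


def experiment(name: str, uid: str, output_dir: str) -> Dict:
    """A hypothetical Experiment custom object holding result metadata."""
    return {
        "apiVersion": EXPERIMENT_API,
        "kind": "Experiment",
        "metadata": {"name": name, "uid": uid},
        "spec": {"outputDir": output_dir},
    }


def owned_job(exp: Dict, run: int, image: str) -> Dict:
    """A batch/v1 Job whose ownerReferences point back at the experiment."""
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {
            "name": f'{exp["metadata"]["name"]}-run-{run}',
            "ownerReferences": [{
                "apiVersion": exp["apiVersion"],
                "kind": exp["kind"],
                "name": exp["metadata"]["name"],
                "uid": exp["metadata"]["uid"],
            }],
        },
        "spec": {"template": {"spec": {
            "restartPolicy": "Never",
            "containers": [{"name": "train", "image": image}],
        }}},
    }


def jobs_of(exp: Dict, jobs: List[Dict]) -> List[str]:
    """Names of the jobs that list the experiment as an owner (matched by uid)."""
    uid = exp["metadata"]["uid"]
    return [job["metadata"]["name"] for job in jobs
            if any(ref["uid"] == uid
                   for ref in job["metadata"].get("ownerReferences", []))]
```

With this shape, tooling can stitch runs back to an experiment by following owner references instead of relying on a per-project taxonomy of labels.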

A

I just want to make sure we're not confusing two issues. One is just handling storage, in that every time you launch a new job or a new experiment, you need to provision storage. That could be just scratch space that needs to be persistent, or it could be the actual model data that you're publishing.

In either case, once it goes beyond a single user, managing this data, managing access controls for it, and setting it up in such a way that you can accommodate different classes of users becomes a common Kubernetes storage problem. That's why I was saying stateful jobs: you get the rest of the familiar storage pipeline available for ML workloads too.

I don't deny that you could, in theory, have your own connectors and use databases and stuff like POSIX file systems or object stores. I mean, object stores are not even represented by the storage APIs, which is a separate problem. You could go down the route of using databases, but you'd sort of be on your own at that point. Kubernetes by itself is not going to help you or improve your life in any way there.

A

That was one, on storage. The second one is archival, and I think Jeremy brought up good points. There's a scalability limit implicit in that: there are scalability limits on how many CRDs we can have, so we have to limit how many actual objects we store, or throw against the API server. As far as I can tell, there is no archival solution available for Kubernetes. As far as I know, there is not even a single database that has been designed for this, is open source and available to people, and just archives all the objects. I completely agree with Connor that there are already enough primitives in the API to form a graph of how these objects relate to each other, as long as those API...

So, on the other hand, as a working group, should we be...

F

We would view it as a generic problem, and we would like to see it solved as a generic problem. But we have a proposal floating around in Kubeflow to solve this, and basically it follows the same pattern as cluster-level logging, where you just have a daemon in your cluster that monitors the API server and then emits the objects to some back end that it's configured to talk to. So it's very pluggable, very customizable.

I think one of the questions that's floating around is whether we should just emit the things as text and assume that you can use whatever logging back end you're currently using, or whether a columnar data store would be more appropriate for some of the data provenance and ETL sorts of analyses that we think we'd want to run. So, for the data provenance question: if you want to extract the graph structure to see data provenance, what's the best database or data structure for that?
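A rough sketch of the archival pattern described here, not the Kubeflow proposal itself: a loop consumes watch events (dictionaries with `type` and `object` keys, the shape the Kubernetes watch API uses) and emits each one as a line of JSON text to a pluggable sink. The `archive` function and the `Sink` type are hypothetical, and a real daemon would read events from the API server rather than from an in-memory list.

```python
import json
from typing import Callable, Dict, Iterable

Event = Dict[str, object]
Sink = Callable[[str], None]  # any back end that accepts one text record


def archive(events: Iterable[Event], sink: Sink) -> int:
    """Serialize each watch event to one JSON line and hand it to the sink.

    Returns the number of records emitted. Emitting text keeps the back end
    pluggable: the sink could append to a log, a file, or a database loader.
    """
    count = 0
    for event in events:
        record = {"type": event["type"], "object": event["object"]}
        sink(json.dumps(record, sort_keys=True))
        count += 1
    return count
```

A usage sketch: passing `print` as the sink writes records to stdout, where an existing cluster-level logging agent could pick them up, while passing the `append` method of a list collects them in memory for testing.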

A

Okay, thanks for that update, Jeremy. I still feel like the problem of archival is a generic one, and it's probably much easier to tackle it generically rather than trying to do it specifically for certain classes of apps within Kubernetes. I also feel like every Kubernetes provider would probably have their own choice of...

F

So I agree with you 100%, and I don't think anything that we would actually do or propose is specific to Kubeflow or ML. I think we would love to see it upstreamed and solved in Kubernetes. This notion that you can automatically persist an archive of all of your records and integrate that with audit logging and data provenance logging, that'd be fantastic. But it's mostly about speed of execution: it's easier to produce a prototype, and then we can always upstream later.

A

I was considering whether we, as this community, should invest or not. I'm basically seeking opinions on whether that is a really serious problem and whether we should help prioritize it.

F

I think it is a serious problem. If you look at a lot of different applications, like Spark or Argo, any place where you run batch jobs, a lot of the traditional systems, like Apache Airflow, have historical records of all of your jobs. If you're forced to delete these resources from the API server, because spawning lots of jobs is overloading the API server, then that becomes a limitation compared to these other non-K8s-native solutions.

So in that sense, I see it as a problem. As one concrete example, we use Argo for CI/CD in Kubeflow, and we're finding that we submit so many jobs that it eventually slows things down. In this case it's the UI, which I think is related to API server performance, and we end up having to delete these things. So it is an issue.

A

So I think we should probably file an issue against Q risk. Awareness on this actually is like try to get more opinions and data on this issue and yeah I can try to find out if any folks, the Google are are actually thinking about this problem and like have them participate in the open position.

A

Jeremy, do you want to file that issue, given that you seem to have the most context on that?

F

Yeah. Do you have any sense of what repo, or where, that should be filed?

A

Against Kubernetes. Okay.

A

Okay, I want to make sure that all the points you mentioned are actually received and digested. You also mentioned metrics, but metrics are going to be very close to each of the apps that are running, in this case each of those ML frameworks. So I don't know what Kubernetes can do in a generic manner there. There's already Prometheus integration, and it should be pretty easy to expose metrics as Prometheus data.

A

So maybe what is needed is adapters for these different frameworks to expose whatever metrics they have in the Prometheus format. Is that what you're going after, or what are you going after with metrics?

B

They're actually, you know, kind of progressive values that let the data scientists know how training is progressing, so things like loss, or some specialized F1 metric, or something like that. So yeah, I think you're right, it's totally external. That was just more illustrating how it would be used. Yeah.

A

I agree. I think showing, maybe with just a couple of popular frameworks, that it is possible to use Prometheus and expose these metrics would be awesome. Just having some very simple documentation on that.

D

Yeah, that would be very useful. Some sort of best practices guide for setting up the monitoring system for some workloads.
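
As a rough illustration of what such an adapter could emit, the sketch below renders training metrics like loss in the Prometheus text exposition format using only the standard library. The metric and label names (`training_loss`, `job`) and the function name are hypothetical; a real adapter would more likely use an existing client library such as `prometheus_client` and serve the output over HTTP, which is left out here.

```python
from typing import Dict


def _escape_label_value(value: str) -> str:
    """Escape backslash, double quote, and newline, as the text format requires."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_prometheus(metrics: Dict[str, float], labels: Dict[str, str]) -> str:
    """Render each metric as a gauge line in the Prometheus text format."""
    label_str = ",".join(
        f'{key}="{_escape_label_value(val)}"' for key, val in sorted(labels.items())
    )
    # Only emit the braces when there is at least one label.
    suffix = "{" + label_str + "}" if label_str else ""
    lines = []
    for name, value in sorted(metrics.items()):
        lines.append(f"# TYPE {name} gauge")
        lines.append(f"{name}{suffix} {value}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical values a training loop might report after some step.
    print(render_prometheus({"training_loss": 0.42}, {"job": "mnist"}), end="")
```

A training loop would call something like this after each evaluation step and expose the result on an endpoint that Prometheus scrapes.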

A

Okay, ask the hard question next.

A

Okay, all I hear is silence, so maybe we can just file an issue for now, or maybe document this need in the community repository and see if folks want to pick that up.

B

I'm note-taking, so I can file the issue at least.

A

Yeah, I think part of the problem is that whoever authors such a guide has to have a good understanding of the frameworks themselves, and finding those people in the Kubernetes community is sort of hard because they're mostly dealing with generic Kubernetes concepts. So I don't know if I'll be successful in finding a person to pick this up.

A

Okay, is there any other topic anyone wants to discuss?

A

Okay, should we have some action items for the next meeting, which should be two weeks from now?

A

So, we identified a bunch of things today: source image, datasets, Minikube, and there was also like now- she does really cube, and then there's stateful jobs and storage management for each and every job run. And then there was archival and monitoring. I think if we can at least start going deeper on any one of these topics, that would be awesome.

A

Okay, I'm gonna try to throw out names and see if we can find someone who can step up for the next meeting. Balaji, do you think you have enough data to start having a discussion on datasets?

D

Sure, I can at least start an issue and discuss why we created the KVC project. Maybe that will be a good starting point, and we can add more details to that later on. I think some of it was already presented, but we can go into more detail on each of those points.

A

I'll make sure storage folks come to our next meeting.

A

There may be, like other possibilities, yeah.

A

So then we can keep that as a main topic for next week, but it will also be great if we can hear more user stories and user pain points.

B

I gave a call-out to the channel, but I didn't get anything back yet.

D

What about, like, an email, maybe? I mean, send this request for user pain points to SIG Apps and SIG Node.

A

That would be the widest net I can think of. That might reach a lot more people who aren't really developers or participating in the community, but who might be interested in showing up.

B

You know, looking at the attendance of the first meeting, there were a lot of people there. We had eight or nine companies, so there should be plenty of people that have content.

A

Sub-Communities.

A

Until there is actual execution happening, people are not sure how they can engage. Hopefully the stuff that we have identified shows the rest of the community that we are actively working in this space.

A

The other thing we could do is, maybe a month or so from now, once we drill down a little bit more on the topics that we have identified, we could probably go and present at the main Kubernetes community meeting and sort of give everyone a heads-up that, hey, we are organizing ourselves through this forum.

D

Yeah, that's a great idea.

A

Okay, so is anyone gonna send out a Kubernetes announce email?

A

I'm happy to do it, but if anyone else wants to do it, I'm okay with that. Okay, see the fan.

G

And just, analogously, to find some pain points, maybe it's worth looking at the paper that Google released for Vizier.

G

A lot of it is about the scale they're trying to reach with this project, and we could see whether we think this column rank is right, and what would be missing to run something like that on Kubernetes. The paper is pretty far out, so let me give you the link. It might be useful for finding where it will break.

G

I can take a look and try to list where I think the main issues will be. I can start with that.

G

Get in the truck, if you want.

G

I should even mention we have a few people, I think three, in Kubeflow who are developing kind of a clone of Vizier, so I'll try to talk to them and see what they think about that, and what the real pain points are. Yeah.

A

We just need to get more input at this stage.

A

Okay, we've only got two more minutes, so thanks, everyone. We'll meet two weeks from now.

G