Ceph Ceph Code Walkthrough, 2 Aug 2018

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: 2018-08-02 :: Ceph Code walk-through: Intro to Teuthology (not strictly a code walkthrough)

Description

Presented by: Josh Durgin

Every month the Ceph Developer Community meets to discuss one aspect of Ceph code, to spread knowledge of how it works and why it works that way.

This monthly meeting will occur on the last Tuesday of every month via our BlueJeans teleconferencing system. Each month we alternate meeting times to ensure that all time zones have the opportunity to participate.

http://tracker.ceph.com/projects/ceph/wiki/Code_Walkthroughs

A

A

B

You hear me now: yep.

C

I, don't blame.

D

B

Let's get folks and make you drink.

B

Alright, let's get started, though, today we're gonna talk a bit about technology and stuff testing in general. Try give you guys an overview of fun.

B

What villagey he does, how you can use it to run tests and how to sort of looking at test results so I guess the basic and start off with his son pathologies well based around testing packages.

B

Currently, though, the first step to getting anything tested is to push your branch of the stuff like it repository to UM get up: daikon, /, f, /, SF, CI and any branch there will automatically get sent to ramen, that's F calm, which will go ahead and kick off, builds and generate packages for sent to us in bun in different combinations that we need to run tests on them. You can see kind of latest packages and builds going on over time.

B

That says, I come and once those are finished, you click any of those and it will end up seeing my output if something fails and I've to be basic builder done, packages are created out of those and you'll find those under the base.

B

Url shaman in kind of currency, and if you wanted to down download those manually, you could actually go and find out the URLs for the individual packages. But in practice you you don't gonna need to worry about that. You just push your branch or to FCI, um wait until the weavers our builds, and then you can start running pathology jobs which wish go down, go and grab those packages.

B

Before it pathology, testing stuff, basically um or most changes and stuff, whenever before we merge PR, you will things up into a test branch push this sei way for its build and then kick off a test suite launched by $2 G.

B

Basically, all you need to launch test suite is a check out of the technology repository.

B

You can see that more these on the CT lab on the theology that EPA that stuff, like a machine, even take a look at my home directory and see I, think I said up there, but basically on that machine. There's a simple machine wide! That's etiology that you know file, but you can think of all the settings needed for it to connect to the lab cluster and schedule jobs. There.

B

Though, and once you've read checked out on that machine and your branch is pushed to fci and Packers have built from it I'm trying to Sweden, he was saying you're, basically, just are just running a topology suite command and specifying which branch it read running basically off of and it'll add a bunch of jobs to like you, which we are. You else finds it um never say web interface. For that II. Do that's after I. Come take a quick look around there.

B

Turn my screen, so we didn't.

A

Look through that.

B

So poppy do has an emperor view, both the jobs that are scheduled as well as individual speedruns. So you can take a look at the current queue and you can see- and these are various suites that have been earned the queue right now. um The key was very simplistic. It's just friends pasted in order with the thirty field, and you can see who scheduled these jobs all these ones scheduled by ethology or just automated, runs.

B

And the CPU lab, we have a few different types of types of machines that might be used if Smith me, which are kind of best all SSD machines, OVH for his powerful machines and also mirror machines which are older, Hardware. All our disks.

B

So if you go- and um you can also kind of look at past- runs in copito, you can.

B

Filter out my past friends, for example, by suite, if you wanted to look at all of the say, friends of the art, pretty sweet, you can't go with it, go there and you can even just change that URL to this by the sweet parameter that to whatever you sweet, you're. Looking looking for.

B

Julie I'm into the results of a given sweet. So, for example, this one um you'll see the results of each job for the results of all the jobs are stored in the CPA cluster. There's a that is actually a stuff FS um file system, that's holding all of the job results, but he can also access them through this open your face here. If you even go into one of these failed jobs, you can give links to the two villagey that live, which is basically that the output from.

B

That same directory and pathology will collect all of the blog's massive cluster, which includes all the demons, and everything else is paying there. You can go inspect and accordance or any other kinds of test output.

B

So if we go I'm on a qaddafi machine this these are also accessible directly.

B

That's a file system.

B

And this emphasized file system for withholding all the results is mounted on the two-dollar machine under slash a.

B

C

B

If I NEX exact same job by going to the develop knowledge machine, slash a slash, the description of the of the suite, which is always like the the user, is running it in the date and the bit of parameters you can see if the web interface and then that you can go inspect the individual jobs there.

B

So, when you're looking at in the please.

B

Wait: three: six, nine to five. We can just go into that right three and check out what is here.

B

So one of the important.

B

B

B

The test Suites are all in, and the Ceph repository itself.

B

They're, all in f, QA beats.

B

B

You can see there's a number of different Suites here, and the primary ones are mostly based around individual components. You have like reduce our video wif s danceable.

B

um There are a few more generalized ones like there's a power cycle, speed which beam and do not run too frequently, unless we're doing here over these, since it has some hardware and there's a whole bunch of different upgrade suites for testing upgrades for in different scenarios.

B

Even within one of these folders there's a number of different sub sub Suites, so, for example, within redose there's, some is a manager, sub sweet, there's a monitor, thrashing sub sweet there's a general thrashings suite, which is basically randomized failures and randomized changes to OS DS, and you also see these suites have a verify fiction, which includes things like valgrind.

B

But but in general all you need to know to schedule. One of these feats is just then the name of the sweet, though, if I wanted to.

B

Okay, I can run the the radar suite here just by that's fine, the name burritos.

B

And to theology, when you're, when you say to the sweep we'll go check out a copy of this f+ from from the branch that you're scheduling against um I, get this the sweets from there generated jobs from that you, those jobs up and then run those jobs with.

E

Okay, so we execute these, do we need to have some pollution or anyone can be executed, can see? There is a let us say if you want to if I want to exclude some of the script which is already available. So what are the I mean? Is there anything how to Rickles.

B

So it's varies depending on that some of the tests how easy it is to execute a number of them can be executed to college environment, but some of them are dependent on burying setups with them. Similarly, to our test lab.

B

So, for example, if we take a look at.

B

One of these sweets here.

B

But I guess we can say uh yesterday looking at what what the structure of the do that input files is here. So basically, the sweets are composed of a number of my fragments that are all concatenated together and merged.

B

To form a list of tasks.

B

So the general format of the input is a bunch of email files or one yellow file has a list of tasks and, for example, this one uses. This task called the work unit task which basically just downloads some scripts from the Ceph repository and runs the shell scripts against a stuffed cluster that has already been set up by an earlier piece of this suite.

B

So these were these. These working scripts are one example of something that's very severe easy to run as I have a technology environment. You could run these against a visto cluster or another stuff cluster that you already have setup. They generally only assume that you have a stuff cluster with a kind of client key.

B

Some other tests that are more specific to pathology, that I actually use some more code, that's saying in ecology itself and are kind of orchestrating the how the cluster is operating, for example, about trashing of Sweden.

B

This is gonna, be a.

B

Little more complicated in the fashion sub suite, because it's using mainly this travesties task here, which is part of um run by technology, to orchestrate the cluster and inject various kinds of failures in the background, while other tasks are going on. So this kind of test is more difficult to run. Sio2, ecology.

B

But looking back to kind of what the overall configuration looks like, though, let's take a look at, for example, this original configuration your age to get that config that yan will file from this. This job and that'll give you an idea from what what's needed for it to run in a visual test, but the pathology though there it's a whole bunch of extra stuff for these Java scheduled mr. sweet.

B

um Most of this isn't necessary if you wanted to run a job manually, the things that are necessary for learning it manually are just a list of roles which basically means it. These are just like, what's going to be run on different machines, so this means that you have one machine running mother de Mond at sea and 3os T's. Second machine running another monitor a manager and Rios T's, a third machine which just has that client keyring career done. It.

E

Well, that means one mission there are two monitor is running and.

E

B

A

B

That this you know, syntax, is kind of funky with exactly a list of lists here. So the first annotation levels like one list and there's a few different ways to write this. You could have um instead of saying I like two dashes for the list. You could say: yeah Shannon said mind that a mom dad be there and more like I'd casein stylist a little bit clearer.

B

So there rolls is one aspect that you need. The second piece is the actual tasks to run. That's typically for pretty much every test. You have the install task, which goes and installs stuff packages.

B

It installs the yes, the all the software packages first F and it's its dependencies that it needs for its from the tests and it will install the version that's best to fight either as a option beneath install tasks or, in this case you've included. It has a top-level.

B

Override section, so this is over I Section is used kind of extensively within the test. We just to I'd be able to add extra configuration to various tasks without having to kind of duplicate that configuration everywhere. You can kind of override one setting in one yellow file and that it'll become it'll, be um combined later with another yellow file without having to duplicate that configuration.

B

The the general format other the overrides is that these are all these always could get merged into the configuration of tasks. So everything below the set this stuff label here would get merged into the configuration of the stuff tasks and, similarly, a little bit. We have an overrides to meet for the install task, which is telling us Billy installed tasks to add some extra packages that um namely RBD NBD in this case, and which version fun to install this could be by sha-1 or by branch name or by tag.

B

The other task that pretty much every test is going to use is the Steph task which goes and install and sets up the stuff cluster, though after the packages are installed, are installed everywhere, and this Steph task is the one who looks at the roles and sets up the monitors in the OSD. Is the managers and the client keys.

B

So once that's in place, you basically have a stuff cluster running and you're ready to run whatever tests. You need on talk about.

B

If this particular test, it's tit, it's doing, it's using a thread show us to use tasks to injection random failures to the asti's. In the background, and while that's happening, it's going to run this RBD fsx workload, which is an hour already, the kind of stress test.

C

Just I have questions if I submit. We are, for example, for me, my core luminous and I don't run. These us weeds today indicates somehow that they should have to be taken from the mimic or luminous branch, or the totality takes that from from that that a specific branch it'll.

B

Take it from the same branch that you're scheduling against. So if you say, let's take a look at the schedule command, you used to run a sweep.

B

Right, sweet general you'd use the Kassala g-gosh sweet command and is hold my interruption that it has, but the basics are.

B

That you, you would specify the.

B

It was like the running for boss mode just for extra key button, I'm so I'd resv. You testify this week to run so you might run that their rate of speed and then you'd specify the version of the branch that you want to run against they. This might be say if you're doing it backward like with luminous.

B

Exertion thing, and um just by specifying that branch it will brat a little which one it's a cue run as technologies, we command it will clone the suppository from that branch to um you know wherever you're running this command in order to read the QA speed from that branch and this path and specify that job that exact branch name in the job should submit its queuing up for the packages for that from that branch will be installed and also disinfect some sanity checking.

B

So if that branch doesn't have packages available, it won't schedule anything illy if you're nearer, so if it doesn't exist. Of course, it'll give you an error. If I tried during this right now, it'll is tell me this: this branch doesn't exist.

C

A

Quit I'm sorry, sir.

B

C

B

C

Was a different question: I have a bunch of questions that I've been ok down the once a test. Fails, that's a continue or it breaks in that point, and it.

B

Depends on the nature of the failure, and so if something there's some fit like, if there's a failure in a background sort of task, it will keep the foreground. Tasks will keep continuing until they're finished and then it will notice that it's failed at the end.

B

But if there's a if it's a foreground task like it's running a bunch of functional unit tests, and it fails that immediately than that, then that the job will will fail across the entire suite the the test that individual test runs and jobs are independent of each other. So if one job fails, the rest will still try to run.

C

So that's that explains why on failure or the explanation effects a single test right I mean there is no multiple failures in within a single okay, yeah.

B

Exactly and it's only detect and something I really listen to the first failure there. So the first thing that detects is wrong. So if there is, um for example, one of the things that checks is, it looks at the cluster log at the end of every of the entire test to see if there's any areas that aren't supposed to be there and there's the guy there's a wait list so that tests that are supposed to introduce errors can.

B

But if, if something crashes before that, then it the reason reported for failure will be that that crash, rather than um something that is in the cluster log. That may also be an issue, so it won't notice more than one failure may give a job.

C

So our looks as well as chord amps stored or only the lofts yeah.

B

Blocks and core dumps, and whatever else and it's in this archive directory on on the $2 G node, if you were developing a new test, as you were, writing files to this archive directory, they would be gathered as well.

B

Because all end up in it for a given job in this remote directory, we have a directory for each machine and then these are dug.

B

Work I'm gathered from that machine, so yeah the syslog and the log directory, which has all that stuff logs basically come from more like stuff.

B

If there were core dumps there would be a core dump directory that you.

B

And one of the thing I wanted to mention about and scheduling, is that um these, the the weight of these suites are structured and they're, made out of many different fragments of gamo files that are combined in all possible combinations. Basically, so there's kind of a combinatorial explosion of the number of jobs based on all those different that giant matrix of different settings. What we typically do is we sample that matrix and we run a subset of it at a time.

B

So, for example, for the rate of suite, if I tried to run the entire thing, it's probably um five hundred thousand jobs or anything like that. Typically, we use the subset parameter to run say um you can specify they run one out of thousands or one out of ten one hundred tubs not running a hundred thousand jobs, but you're running a reasonable sample of them. And if you went through 0 through 99 out of 100, you would have run exactly every single configuration possible.

B

The nerf or I scheduled a suite I always like to do it. I dry run first just to make sure I'm not going to UM schedule like thousands and thousands of jobs at once, and then you can adjust the subset and, as appropriate and further I give in sweet sweets. Don't need a subset, because they're small enough that you can just run all the all the configuration in them every time, but the radius feed is one of those where you, even if they want to every three tip, which is something to keep in mind.

C

The other day, I started a run and I tried to stop it with a little. He kill command and it seems I have it I had it had previously and it worked. But this time it required me to you, sudo I'm, not sure why so I couldn't stop the is that's.

B

Yeah, so technology kill and there's a few things, and so how these these kick me. They end up running eventually answer jobs are processed and pulled out of the queue they're run by a different worker on this machine, our different UNIX user. So you would need a sudo permissions to be able to kill those processes from that other user.

B

Whereas if it's, if there are tests that are in the queue already, but they aren't running it, then um you don't need any special permissions to remove them. Like you don't do the geology kill command, will both remove things from the queue and kill, try to kill, running processes to.

C

Okay, to afford for the selections of machines is there any suggestion on when you use with your mirror.

B

But I guess general: if you were doing some kind of performance related tests, its it's more important to figure out what you want to use. Smith, use, honesty and Miri is older, hard disk hardware and you also wanted taking into account them, and you can look at the current state of the queue and papito and see what's already queued up and you oftentimes there's a whole bunch of I got automatic, runs waiting for smithy machines, but you can um schedule against mirror machines and it look your Tesla's to kick off faster.

B

Something very urgent to run, and you can add me priority, feel then run run young test with say, if I already 100, which will um get, which means that your test look at run before the automated jobs, which I think are scheduled at priority. 1000, though, if you have anything urgent, you can do that then you're, just so kind of go to the front of the line.

B

Looking units are much faster because they're all SSD, so if you just in the urgent I, would recommend using them I.

C

Also seem that some tests are flapping or you I mean. If you have a look at the branch, the number of branching grid are by far you know much more than than the green ones. So is this I mean there is some you know, agreement on when a pass around can be considered as past or tail depending on which tests are failing.

B

Yeah, so there are certainly in a larger system like that. There's lots of race conditions, especially and and some of those are very difficult to fix or reproduce. Though there are some like known issues that you'll see very occasionally and.

B

But that will often be that they'll present with the same kind of back trace or our failure mode, and you can search in the tracker for that exact failure and see. Okay. This is a known issue. It's on unrelated to my change, because it's a it's a totally different system, that's I'm, unaffected by it.

B

And they're also you're also occasionally not failures for exam from the 11 infrastructure or from github um I'll go seasonings. Things like if you see something like um packages failing to install because the the mirrors timed out. That's obviously not the fault of your code know or if get downloading a test from github fails because get a write down or sorry to read them with us for some reason, then, that's obviously not I feel you're. The curator.

E

Take from there for stream directly.

B

It's installing that from the from the packages, but for some of the tests, it's it's a downloading some scripts from github to run the test.

E

So that means that you will take the packages from the existing repository and not from there, but how we doing then uses the current branch.

B

So when you're scheduling it tells it you tell it which branch to use and- and that's where the quiz to install the packages it's. But it was looks of that branch. I mean in shaman. Shaman has an API that it uses to find out where the where the repository is a birth packages, for this branch are, though configures them fiends and whether they're sent to us, or we went to to use that repository and installs packages from there.

C

Are there any guidelines on I mean how to relate the changes? One of the Belapur has made, which it's good, Iran or.

B

Yeah so and I guess in general, can use tend to mean made either to UM a few different areas so like like I was mentioning before there is that they're kind of sweets do different areas and stuff like their sweets for rbdr, so I profess for rgw um and, for example, within the greatest feed there is there ones for more specific, the monitor or more specific to the manager.

B

If it's something that may affect multiple systems, then just running it through, like that, a subset at the rate of sweet or if it's a, if it's a common core, that's used by, like with re d and self s for those tweets and make sense.

C

Just out of curiosity, have you ever run a coverage destined to check how I mean deep Deus and when testing is Corinne D? Yes,.

B

Time ago, pathology had support for cutting code coverage data as well.

B

We didn't really find that that useful, unfortunately, since I've didn't relate very well to functional aspects that were not covered, so it's kind of bit righted since then, and at this point I'm not sure exactly what the coverage ratio is.

B

Maybe we should look at how what, if something fails, where to look to figure out why it failed.

B

And so the most, the most basic failure reason you can see in papito um just lists the.

B

Like the last crash with a little log line, that was not expected and that's Sam, it's a good stuff place to start, but to see what kind of where that came from or if it's a command that failed for example, see why the command failed. You want to look at Iguodala G that log for that job.

B

So, whenever I open elegy, like the first thing, let's fail, the first thing I do is the latest search for I'm crates back because.

B

That's you time like, and so whenever there is, there was a failure, usually its duty like a command failing or something, and it generates a trace back in that tooth, ology log explaining um exception why this failed, though, in this case, have a command failed error, which is please the explanatory. We was running a command and that command exited in non zero.

B

In this case. um Yes, you may see that, like a bunch of kind of boilerplate for geology- and you can kind of ignore this address- you limits coverage stuff, that's kind of the coverage stuff in particular it's a relic of the code coverage that doesn't really function anymore, and then you have the actual command its retina, which in this case is the Ceph test of our BFS x. Rb, stress test.

B

So because this is a stress test, you can kind of look back a little bit through the log and see bear this test was last running, and but it was doing at the time it failed.

B

So basically- and we get the standard I've put in a standard error from I'll- make all these brands that are being run, and this particular case um I'm familiar with our our EDF a sex test, and this is the output from it here.

B

So you can go back up and see.

B

More about where it started the failure bill and okay what it was doing when it was feeling hey.

A

Josh, if you, if you unfamiliar with that particular case, how do you correlate to the output.

B

Yeah, so that's a good question because a lot of these tests, you would have to be a bit familiar with what they're doing so I understand what their output is again looks like.

B

Few of them, like the API tests and the more generic functional tests, are using kind of unit test frameworks. But you can often search for like the.

B

Word failed or error to see where that unit has framework before the failure and which guessed it was reporting a failure with.

D

Another trick is once you're at the trace back is to search backwards, first F version. So if there's a there's a crash, an assertion, failure or something like that, it'll usually appear in the log, just above where the command area doubt or whatever yeah.

B

That's the excellent point, but in this case it's just as a test man that doesn't do that trace like that: it's not pretty deliberate in stuff dude, um but if there was a back trace from got a demon or something I'm searching for separation, it's a good way to find that.

B

See if it's there a good example of that this run.

B

Okay, maybe it's not a great example about Miss run.

B

Me go to or a different run, real quick.

B

Okay, here we go.

B

Run, who that should the failure reason as look at the you can also look at the failure. Reason in this summary that Yan will file meets these haploid directories. This is the same thing that's displayed in papito. It reports it something times out waiting for. It happens like it to appear after OSD dead, three restart, and this typically means that LSD three crashed there should be a trace back and there in the geology, log or in the West II logs themselves. I.

B

Looking at this, I would search for trace back to see.

B

Their failed in this case, there's a isn't one fit one exception. While it was reconnecting the machines which isn't didn't cause, it has to fail at that sigh. Just remember. We tried afterwards and here's where got that error waiting for the gyro is c3e to be started if I search backwards from here for.

B

Again, this fact from the USD um showing which is hurt, we're hitting and where that came from.

B

Back that way, it's worth noting in the lugs here for the demon output like this, the prefix for each log line includes which team this is coming from they can tell this is from what was t3 and it was running on smithy 145.

B

So if you want to investigate further, you could take a look at the remote directory. First Matthew 145 um I need to see that cord up there.

B

And running in the final commands on it confirms that it came from Sophos key number 3, because then forget.

B

Quest III is logged and going to the end of that log, you should find that same trace back.

B

This case looks like the OST was not able to finish the blogging piece back before it. Crashed so does not appear in they always T log itself, but it didn't apology, like witness, often good enough.

B

All right any other questions too far. I think we've covered a lot of different things and I'm sure this yeah I have.

C

B

What could be explained clearer, yeah.

C

I assume that when a test fails, the cluster is decommissioned. So if is there a way to a stop that to break the test? There and I mean log in and have a look at the environment, yeah.

B

So the brought the jobs they're scheduled through the sweets and queued up, there's not a good way to do that, because I would keep those machines around and unavailable for too long. If you ran it, I guess we'd um that had a whole bunch of failures like that, and it could just lock up a whole bunch machines for a really long time. um Instead, what we usually do is we would would if there was a some kind of crazy bug like that, we need to go in and investigate that interactively.

B

We would done last couple machines manually um run that same UML file that that, for the job that failed manually on those machines and um and there's that and when you burn you're running it manually, there's a parameter, you can add to your yellow file called interactive on error. You said that the true, then the job tests will pause when it hits an error and you can go and inspect the machines. At that point,.

B

So Ryan something manually first, you want to do with some vodka. Coke machines insist, we can just say, run a simple test on one machine. For now the lock machine you use the to follow, Jesus, not command. um This form is just saying block, many say how many you wanted lock say one and you can say I, think it's machine type say lock. I'm your.

B

And they'll go ahead and output.

B

The machine that you've got locked and the hostess's age key that machine. This is something that you'd have in it. You want to put in it in a yellow file.

B

You you'd run your test with a little yellow file here.

B

Makena were gonna run on I.

B

Have some rolls for that machine? They had a couple of these another manager.

B

We need to ask so, let's say we're install on the Rhine stuff and.

B

If you have a test you wanna investigate manually, you could say interactive on error, true and if you're, if you're kind of developing a test you could also. This is an interactive task which just goes into a Python shell and until you exit the Python shell yeah.

A

B

Else would this be positive, running BAM on the machines.

B

So if we try to run something manually.

B

Use the technology command itself, which is the thing that ends up being used by the workers when they're, taking things off with the cue.

B

This has absolutely small number options, at least, and typically you just want to specify proposed modes ticket or in you look that way. Something goes wrong and you want to say I saved, but if you want to save the log files after the test is complete, as well as the output from the to theology command itself. Now you can add an archive directory.

B

B

Then the other, the other thing you need is your mo file, since we have all of our information about this test, both the machines that we're going to use the roles they're like those machines and the tasks in one file, that's all we need. If we had those more than one file, we could specify more files here.

B

I know you're running to.

B

But it a consequence of bringing against master to unstable branches, so running against master and seal branches is slightly different because master and other stable branches aren't in this FCI repository there in the regular get up to calm stuff stuff ah story, though now mo file we have, we want, we don't want.

B

They were install our testing branch or changed their repository. They were that we're going to use so that we can leave its SEF repo.

B

We can use a stable branch in this case or master.

B

And just to confirm that which, which variables this is need to sweep you I think you can take a look at an existing job like this. This one here.

B

It's been, the specification is actually repo for the packages and then sweet repo or where, where the pastor grab them.

B

And if it's archive directory, you um already exist, but it'll it'll, stop it so I just remove it. First.

B

There we go so it's when install stuff packages or apparently it's unable to connect. This machine so looks like this machine.

B

May not see, as you can see, it's fine to a Python frontier. This is a sample of one of those failures that were deposit to go into interactive mode, and this case since the the it failed. Don't even connect the Machine there's not much I can do I'm just going to control, be out of that Python front thing. I end the test. So let's see what's happening here, I try manually escaping into that machine.

B

And there's no route to host, so it sounds like that machine is dead, I'm going to.

B

Use different machine and mark that one down.

D

When you specify, when I'm, usually locking machines, I specify the destroy and I, have to wait, 10 minutes for fog to go and image it when you did that one I just give it to you right away, I wonder.

B

If it's not specifying uses the default of um I, think after what the default is and the vceo we didn't.

D

Have to wait for the wait wondering.

C

D

Behavior changed recently: if you don't specify destroy it, doesn't even image it or something or just gives you a strange.

A

You can't lock on Kentucky man line and not to lock machines manually just.

B

Yes, I. We can also my curious yes to use the lock locks option. Have the developed, ethology I go ahead and lock machine for us at the beginning of this test, instead of manually specifying it? Let's try that oh.

C

A

C

Lock them right after you use it right.

D

Yeah it'll clean up right afterwards, so you want to make sure you have interactive or interactive on air, so you can actually go inspect it. If you do that.

D

Also I think it that's why it was version in US release, yeah.

D

A

Is that does look? Do.

D

Sorry, oh that's type in origin, Oh.

A

Like sure Leo's it like melons.

B

In favors, okay,.

A

No I think I think it'll bring default to I. Think that's! That's my yeah and.

D

B

La commission machine with at the beginning here and unlock it when the exits, um maybe the mirrors, are having problems right now we try smoothy.

B

Block will beat until there's a machine available if you can't lock one immediately.

B

This one seal in the strange okay.

D

Haven't actually tried doing this in a long time, I always just go through sweep.

A

That bug that keifa was working on like from mimic acts, I brand that interactively like. So it's fine yeah.

B

There's a problem with clustering, Allen or something just this technology box.

B

In case, that's basically the way you can run something interactively and going in space. What's what's going on.

B

If you're unsure exactly what it's it's doing, it's also worth many pointing out where these tasks live, though there there are scams, generic tasks that live in it in the technology graphic repository there in pathology, live tasks.

B

These are all that basically Python files and still.

B

Either have a ask method, or they will be a class with a setup and teardown, which is their entry point, and the back string for that ask will have information about what kind of options it has, and example. This is the interactive tasks. It's just a single pass method which doesn't have any options.

B

And it simply runs a Python interactive, prompt.

B

The more steps for posit Ori under fqa sweets at their undertray tasks here you'll find things like the staff tasks. They install the stuffed Buster, you switch frayed deaf tasks. You can see all bunch of options that you can specify like an for final store.

B

You could specify which file system to use- and you can add different configuration and settings that you would be added to as if that comm file, that kind of thing, but in general, if you want to see more about what exactly is going on, you can go and look at what these tasks are doing, what options they take and go from there. So I think that's been about everything. I wanted to cover today. um I feel there's to lots of questions and feel free to we catch me after this as well.

B

If you have any other questions, any questions right now.

C

Yeah I have my two last ones. Are we gathering some metrics for the runs? I mean the CPU consumption memory or events have specific metrics.

B

Yeah, so we at least used to be gathering a bunch of those into a graph Anna. It's not sure that that still is working anymore. um We do have a performance suite which they had just last year, which does gather more of those in from bad stuff, based on like using crack bail, but in general, that's an area where we could improve a quite a bit.

B

For example, it would be awesome if we had.

B

Like that, the historical data, from a run that we could see in the stuff matrix type UI after the the run is complete. We could say okay at this point using a all the CPU, and this is where memory exploded during every year, I kind of think.

C

My very last question is I've seen this central platform porous. This is something that is going to be migrated to or.

B

I'm, not sure I, don't really use that century myself that much you don't find it that that useful um I'm, not sure if others do I think the idea is that it's just like collect with failures and um show you when you've seen the same failures but I think in the past. It hasn't been up to me at least.

C

Thank you very much. That's where Michael! Yes, no problem.

B

All right anybody else.

B

Okay, Thank You, Josh no problem and, like I, said before I feel free to reach out to me. If you have any questions in the future and this recording will be sent out once it's available thanks, everybody.

A

E

Thank you just says: thanks ray.