Jenkins Google Summer of Code Office Hours, 24 Aug 2022

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: 2022 08 24 Git Cache Maintenance

Description

Google Summer of Code project for automatic git cache maintenance in the Jenkins git plugin.

A

Welcome this is the 24th of august 2022. This is google summer of code, get cash maintenance rishikesh. What topics do we have.

B

uh We have a lot of topics to discuss today. So, okay uh mark. Can you like share your screen and you know open the detail. The plugin. Can you run the git plugin.

A

Sure can you bet so you want it. You want the current build running.

B

Yeah the latest one.

A

Okay, hang on just a minute, then I'm going to share my screen and let's, let's start that so share screen.

B

A

We go share, okay, good, so first, let's go to um get the latest build and I assume no changes in the get client plug-in, but I'd better double check that I've got no further changes. You've that the changes you had made were already enough. Let's see and the pull request. Do you remember the pull request number one.

B

And I'm not sure.

A

That's okay! I I can. I can certainly find it. I was just uh let's guess, 1310 and we'll go to github and find it nope. I.

B

Added a few tests also.

A

Okay, so let's go get this build.

A

All right, so what we want is we want this link address and we're going to go here. Jenkins, manage jenkins, manage plugins advanced and we are going to in that url paste it deploy.

A

And now just for safety, I think we should go. We should grab the get client plug-in most recent poll request version just in case I'm outdated there, and so following the same pattern.

A

Let's go find your build, which is right here. 862.

A

Okay and this one.

A

Advanced url, deploy.

A

B

A

Now this jenkins controller is at the moment relatively busy, so it may take a little bit for the restart coming soon.

A

We released a pat, a security patch or security fix for the git plug-in today, and so my controller has been busy adapting and making sure that the security fix is applied, that it's on all the right branches, etc.

C

uh What was the security fix concerned about? Mark.

A

Passwords with using the with credentials, workflow pipeline step were not being masked relatively low risk, but it was a security fix. Nonetheless,.

C

Not being lost.

A

Yeah, so they were being displayed as literal text in the build log.

C

And the very large.

A

Okay, coming soon really truly believe me.

B

So I've added tests in the you know get client plugin, so the detail, uh I've tested all the uh maintenance tasks created.

B

Okay for maintenance tasks, less than 2.30. I was able to write test because the way they work is completely different from how it works and versions greater than zero, because if you look into gc for versions less than 2.30, uh we are using the uh gc auto command, okay, which you know uh works based on. You know if I need to like it, works based on the status okay of whether it needs to execute or not so it is internally uh it's an internal logic of whether it needs to run or not.

B

Even though I run a get gc auto command, it's not a compulsion that it's going to. You know run a full gc, so testing that it was kind of like I couldn't do it so I'll have to look into that. But other tests have been written for versions greater than.

A

We should see those tests, those automated tests here in the commits or in the in the files changed. If we look come on, come on.

A

Slowly, slowly.

A

Well, either github's not very fast or my computer's, not very fast. I see this little blue line, yeah advancing but nothi, nothing that hints that it's actually doing the work. All right while we were waiting jenkins, is back. Let's check that we've got the right, plug-in versions.

A

Okay, 312 0 and 4-12-0; okay, those aren't obviously out of out of range. What would you like to show us rishikesh.

B

Oh, I would like to show the you know table which I have created.

A

Okay, so let's see for this, we go to get maintenance.

A

And so if we say this, one every minute commit graph is pretty lightweight.

B

Yeah, don't run the prefetch, because research has an issue, as we have seen. Oh.

A

Okay, do you want to run any of the others.

B

Yeah you can run the gc or incrementally pack every two minutes. Oh okay,.

B

C

Save the configuration.

A

Oh no did I make a mistake. There.

B

Not it's fine, it's, but it is okay.

A

Okay, data saved and now execute.

B

So we'll have to wait for a minute and then you can see the results in the statement.

A

B

Behind the scenes, this isn't very optimized because I don't know how exactly to read files so basically what's happening is whenever I try to add a record into a file into the file. What I am doing is I'm loading the entire file, adding this record to that linked list and then writing that language back into that file. So I'm.

C

B

Sure, if this is the way of doing it or do I have to read every line and then just append it to the end, because that kind of implementation, I wasn't finding it anywhere so.

A

And I'm not aware of a way to append to jenkins serialized xml files. I think you have to completely overwrite so the technique you're using as far as I know, because an xml file commonly has a beginning tag and an ending tag, and if you were to append you've now somehow added something after the end tag.

A

So I I don't expect okay, so no data available.

B

We have to I didn't refresh this page because I was not able to you know, get the data. You know.

A

And I guess it's possible that it's not run yet. Should we look at the logs just to be sure yeah? Yes, okay! So let's go look at logs system log.

A

Okay and the lugs will probably be cluttered with all sorts of interesting things, because okay.

A

So I'm not seeing anything in that log. Is there another place where I should go to? Should I create a log recorder.

B

A

Okay, new log recorder: let's call it maintenance or maybe get maintenance.

A

B

Oh we'll go for a task executed.

A

B

Yeah, uh you can add another one: okay, what a task schedule.

A

Okay and all log levels, anything else you want to add.

B

um I think that's fine.

A

Okay, why didn't it finish the task.

B

Oh, can we go and look into the deeper thing.

A

A

So dashboard manage jenkins, get maintenance, no entries yet.

B

Oh, do you have any caches on.

A

I this have many of them, let's, let's double check, but if I look.

A

Yeah, it has quite a number.

A

So maybe 10 or 15.

A

Let's check that um they have some whoops.

A

So there are these some things here that have have content like.

A

This here, let's just sort by.

A

Okay, so there are definitely repositories that are not empty and it looks like some of them have commit graphs.

B

So uh what was the log scene right now.

A

Okay, so tell ask your question again: the.

B

Logs, like it doesn't display and like the logs, also are not running well. Okay,.

A

B

A

So found get a modern git version running a commit graph.

A

Unlock the cache, so it looks like it's doing its task.

B

um But then this isn't running, I'm not sure. Why uh can can we.

A

B

Run the original maintenance thing from the terminal by running that nbn command sure.

A

You want to run this.

B

Command, oh, no! No, not that the ambient hp! I uh you know.

A

Okay, so you want to run a maven hpi run yeah.

B

A

So you want to run a fresh jenkins from a maven hpi run.

B

Yeah because uh the maintenance tasks are running here on the screen, it is showing its running, but then uh I'm not sure. Why is it not writing to a file.

A

Okay, so this because this thing is sitting inside of a docker container, it's more challenging for me to get to get into it. Would you mind if I do that from another system.

B

Yeah, that's fine.

B

Because I feel uh the file path is, you know the location of where the file is, you know, is having an impact here.

C

ah So rishikesh can't we can't we check the file where you're actually storing the data.

A

Okay, well, let's, let's try reading that.

B

I think the file's name is.

C

Is it at the parent directory.

B

C

B

A

No, no okay, so so you're saying that there should be an xml file here.

A

And yet I see q.xml and workflow flow execution and content mappings and I see a configuration. But I don't see any data file.

B

Maintenance records.xml.

A

Right is it in a sub directory.

B

No, it is in the uh it is actually in the plugins directory. You know it's not in the jenkins directory, so you know wherever that plugin is. I just stored it in that folder. Whenever I was developing it, so you know the plugins, like the git plugins root, directed.

A

Okay, well so let's go try to find it.

A

And you said the name was something a-I-n-t-e-n.

A

Like that log, okay,.

A

And you boldly gave it a space in the file name. Oh, I will punish later. Okay, good all right. So here is.

B

That what you were looking for, no, no, no.

B

The file's name is maintenance records.

B

Like if you look into the core of the you know of where I created the file, it's stored exactly in the git plugins directory. So it's not in the jenkins folder.

A

Well, but if it's this, so this, the directory, I'm looking at right now is the jenkins home directory.

B

uh Yeah, that's what I I didn't store it in that directory, because I wanted to ask which path. Where do I store it? So I just stored it in the kit plug-ins directory, so.

A

I I'm not sure what you mean when you say the get plug-ins directory.

B

Oh, like when I, when I was developing the you know this feature right. I have a good plugins directory right, so in that directory only yeah.

A

Okay, so you you explicitly stored it to slash home, slash, shrushi, 20, slash.

B

Something something.

A

B

A

Okay, so we will, we will certainly never find it here. Then, okay got it all right, so I'm looking in the wrong place, but so you probably didn't put it in my get plug-in directory either then, so, how would we find it? You should go, modify the source code.

B

No, I like, I didn't catch you like. If we are uh do we have don't. We have to build this thing again so that you know you can read it. You know that file gets created again.

A

We can certainly try so, let's.

A

Okay, so I think what you're saying is: let's build this and run it right. Yeah.

B

Yeah because I was not sure of where do I put it in the jenkins home directory, so I just put it as a you could temporarily.

C

So if we build the plugin, what a file we created at that point of time or at the point of time when we actually enter data.

B

Yeah, when we enter data into it that time the file is.

B

Basically I'll have a check to see if the file exists or not, and then if the file doesn't exist, I'll create that file.

A

Okay, so come on.

A

All right, so the thing that.

A

We probably need to do as a temporary dependency is depend on that.

A

Nope that didn't do it.

B

Don't we have that down little.

A

Yeah now well so, let's when I tried to do that.

A

So using the incremental build that's on that branch, rc 3100, which is, I suspect, out of date, yeah we're now at 32, 32 42, and it's it's now building, because it's probably not publishing that one to the to the incrementals, because it wasn't up to date with the master branch, so will be a while before that's ready for an incremental. So what I'm not sure how to go, find an incremental, maybe hang on, and I may be able to find one.

A

A

Okay, so there's an incremental that I was using.

A

We could try that one.

A

I don't know if this one actually has your changes in a fusion cache given as 3.12.0.

A

Let's try it and see.

A

A

Okay is this one finished.

A

It is not- and it's probably 30, minutes away, okay. So how do we get this, so we can do some quick diagnostics here.

A

So here is the branch that you're working on and when I did a maven clean.

A

B

Creates, oh, if you want, I could share my screen.

A

That may be the best yeah. Let's, let's do that because I'm obviously not being successful here. So, let's I'll stop sharing and let you share yours.

B

B

Oh wait: I don't wanna share that.

B

Can you see my screen not yet, but do I have shared access.

A

You do you you should anyway, the security panel says: participants are allowed to share screen.

A

We just saw something blink there.

A

Hershey cash you're still there.

C

A

Froze yeah, I think we may have lost him. He'll be back.

A

And in the interim, we're busily building.

A

The gate, client plug-in so we'll have an incremental that we can use.

B

Oh I'm not able to share my screen. I think there's some mature.

C

Have you given permission to zoom? Yes,.

B

Yeah yeah, but then you know my app is crashing and then.

A

Well, so let's see what what alternative could we do? We could certainly try to build.

A

I mean I built and pushed 3.12.0. or built 3.12.0.

A

And it goes into my repository, but so then why? Wouldn't it find that.

B

Like, oh, if you run our mdn hpi around command like it's not running, because we don't have that jar file right.

A

B

A

It's not running because it can't it can't resolve the it can't resolve the dependency on the get client plug-in that is declared.

A

So let me try it again, just to be sure hpi, let's do a maven clean, minus d skip tests install see if we can just we can compile the plug-in without using hpi run and it says, could not resolve 3.200.86.

A

A

A

Using the version of the palm that's on the tip of the branch it says could not resolve dependencies.

A

For 3.11.1-rc 3100, something or other so it can. It tried in repo jenkinsci.org public.

A

Why didn't it try in well? I guess public is the right place?

A

What is okay, so I I think right now, the place where I'm stuck at least, is that I'm waiting for the incremental build to be published for the git client plugin, and it is probably still 20 minutes away from being published because it's got to rebuild itself based off of the master branch.

A

Now, how could I? How can I use a local snapshot dependency? It just seems like that. Should work shouldn't it.

A

3.12 dot, zero.

A

A

Because that should resolve it locally.

A

And now it seems to be resolving it so for shikesh we may be able to do a a maven hpi colon run. It says it did it and installed it. You, okay, if I share my screen and we'll try it again.

A

Okay, so here is the build that I ran and now I'm going to just do this hpi colon run no need to skip tests. No need to do anything except that right.

A

Okay, now I need a tunnel that goes to that computer, and here is that tunnel, so it will be localhost.

A

8085., okay, so opening my web browser now.

A

Okay, here it is now this one has no caches.

B

You know the cash buying.

A

Oh right right: okay, good good suggestion; okay, let's do that even.

B

Yeah, you can go into a cash directory in the work folder.

A

Okay, so there make cache directory.

B

You know you can fake, I think it's cashes, um oh, is it yeah yeah.

A

Now does this need to be a bear repository.

B

Oh yeah, it's fine! It's fine! Yes!.

A

B

It okay, if it's not bad, no, no, no.

A

Okay, all right, so we now have a directory here. Git play clientplugin.get.

B

So now we can configure the linkedin starts in the ui.

A

Okay, get maintenance! Oh interesting: did you see that yeah.

B

We're missing a picture: yeah yeah, I don't know, and these versions is not supporting, but in the normal one. That's comes.

A

Oh, this is so old. Okay got it right, so notice that it's running an ancient version, 2.332.4.

A

Save execute right.

B

B

Can we have the logs as well.

A

Yes ah and there's an entry ah finally.

B

So very good yeah, so this is how data is appended into you know into that fight. So the thing, the reason why didn't work is because I think the path for where I am writing like to where I am writing is you know different like if you go into that folder right you'll find a maintenance report file like in the gift line in the plugin directory.

A

Oh, oh, that's a very you are very bold. You went up one up several levels. Okay, all right great.

B

Yeah, so this is the place where I started. That's why? I guess it wasn't.

A

Interesting, very okay, very good, but it makes it easy for you to diagnose and debug okay. So so we have a record there, and so, if I.

A

Create more more directories, for instance like this.

A

A

A

We now have four directories, and so we would expect eventually those directories will be touched.

B

A

And here we see an incremental repack.

A

And so I could give it lots more work to do by.

A

What shall we do something very large like jenkins, dot, io.

B

uh The thing is when you clone it right, the repository is already optimized when you clone it from the top. So.

A

Right, yeah yeah, I was being more.

A

I was actually trying to be a little more unkind here and was going to make it.

A

Jake, it's about.

A

I don't know okay, I give up on that one. I need to find something that I can clone easier. So how about the junit plugin.

A

Okay, so the question is: is this how's it doing and it's already been through four passes on git client plug-in? It doesn't seem to have yet detected. My others, though fushikash.

B

Yeah, it would take time because I think we added it into the queue right, so it was one minute two minute three minutes, so they were, they are all having you know, the previous.

B

So like, if we wait for another minute or so I guess we will be seeing that as well.

A

Okay, good, oh and now that's interesting, so in this case the the repo size, so it may have been that the the commit graph command initially run was somehow seeing an incomplete repository, and this one then sees the complete, is very interesting. Okay,.

B

And then there are the search functionalities as well, where you can search your those things are working.

A

So if I search for 18- oh very nice, 56.

A

Or this very magical number, that's great. I look for commit.

A

True, apparently, everything matches true, because every line has a status of true.

A

Very nice, so so that search facility is a natural part of the tables. The data tables that you included.

B

Yes, yes, yes,.

A

Uli dr hoffner will be so pleased well done.

B

uh The thing about it is now what what exactly is happening right now is I'm going and loading the entire data, like I'm reading that xml file creating a list and then displaying it there's no way of you know, lazy loading. It like you know not getting like only five chunks of file it out and each other assume.

A

There are like 200 300, because.

B

All 200 300 records are, you know, loaded into this table.

A

Well but but I think I think, that's very practical because because you read them, you're also discarding outdated records every time you rewrite it. Aren't you yeah so you're not allowing it to ever really grow large.

B

No, no, I didn't get you.

B

uh And the reason why the other caches aren't coming, I figured it out because uh you remember, we have a static hash set. You know a hash set in the abstract, get scm plugin that you know uh reads all the caches when we start the champions controller, okay and- and we only add the caches from the ui. So if we restart this, uh you know uh jenkins instance, then only we will be able to see those caching.

A

Okay, so restarting.

C

So rishikesh, there is no way for the and there's no way for jenkins to poll the updates in a file right. This has to be an operation where the plugin updates the file and that that, when we refresh we'll be able to see the results of those repositories.

B

Yeah yeah! Yes, yes, yes, there is no way of pulling right now. I I don't know. How do I do that polling mechanism? I tried looking into it. There was this ajax request, but I couldn't get the data from the java file.

C

B

Yeah so now I think we will be seeing other plugins as well.

A

Well then, and it's still it may be it. May I don't think there's any change here, so I assume we may have to wait. One minute before the commit graph will run again.

A

Now let me check as well did did I make some other mistake.

A

Okay, those all look like bear repositories.

A

ah There we go: here's elastic axis.

B

So yeah we bought the bit plug-in yeah so.

A

Yes and the git plug-in very good, let's give it well, but none of these because we're not doing oh, we now now, oddly here, there's no entry for garbage collection on any of these, even though this was trying to spread it every third minute.

C

There's a second page as well mark.

A

Oh, oh, I need to look. I need to look at more pages very good.

B

Oh, let's stand that is gcc.

A

Oh, oh right, is it well okay, so is it that it's.

A

Is it that it hasn't completed the incremental repack and the commit graph? No.

A

Now it's etc. I.

B

Think I think the reason behind this could be. uh Can you scroll up come on because every first minute we are adding a comment graph and every third minute? We are adding a gc, so I think there's a clash and only the comment graph is being added into the queue because of our front syntax, because if you think about it, the gc also is added into the queue. But uh you know only the comment graph is getting the chance of executing it.

B

A

So if I do it so you're thinking that if I did it every seven for commit graph, that would give gc an opportunity to execute yeah or let's, let's do, let's see one two three.

A

So if I, if I want a distinct bit every time, then what I need is 2 4, 8.

A

No no, but you said you think that it's that they're colliding with each other's definition.

B

Yeah, because uh what exactly is happening here, this comic graph is also being added every minute into the uh original queue right and gc also is added. So first, the comment graph only is being entered, even though the gc is present. So I think it is not getting an execution. It's you know, other than starvation states.

A

B

A

C

Now this is saying that um let's say I have four depositories and commit graph, while the third commit graph is running. Operation is running. My first gc for the first repository has come into the queue now once these commit graphs are over, should not the gc start to run, and then the other comment graphs get into the queue.

B

uh But it depends on the way you know the you know. Data is added into the queue. So uh basically, what I am doing is I'm just iterating through all the you know, caches or like iterating to all the maintenance tasks and then adding them. So if you think about it first, the comment graph is added at the first minute, then an incremental repack is added. Then again a common graph is added, okay and then again uh a gc also is added.

B

uh So you know, if you think about it, every you, if I'm adding it every alternate minute. I feel the comment graph.

A

And yet we're we're definitely seeing inter incremental repacks.

A

And right now we've got 27 entries, so it's it's not a not a trivial amount, but there are only well how about how about a different approach. Let's attempt to garbage collect every minute.

A

Oh now shikesh, I thought you had said that there was some some issue with gc or was it no there's an issue with prefetch.

B

uh Yeah, that was initial prefetch.

A

Okay, so so this should have redefined it so that it will garbage collect every minute and we could even go so far as to say hey, let's not incremental repack and let's not do commit graph.

B

A

Sorry say that again for shikesh, I missed.

C

We can see gc's, oh.

A

C

A

Oh well, that's! Oh there we go okay, very good. So, okay, so there's still an open question for me on how do we? How do we assure that all the tasks get run? So, if I do, if I now put commit graph every two and incremental repack every three.

A

B

If I now there will be a collision between incremental repackages.

A

And now I don't have a way the table doesn't give me a way to sort by execution sequence right, so I can't see which things executed most recently. Can I oh.

A

I can see how the duration so git plug-in spent 600 is that milliseconds 600.

B

A

So it spent 600 milliseconds running gc, whereas the node label parameter only spent 98.

A

But so we'll go ahead.

C

No, no! I'm sorry. Please continue.

C

No, you were saying something you can I'll. I can say.

A

Yeah and- and actually I apologize now- I don't remember so it's it's clearly getting late for me and I'm not I'm not thinking as clearly as I should reshop you go ahead.

C

I wanted to ask what this previous execution column is signifying I mean I see a question.

B

Like that last executed, that date, you know that date and time would be displayed. This is concentrating a random number. I put so.

C

Okay got it, and that makes sense.

A

Okay, so we're now at 37 entries and we have- we definitely have gc.

A

B

I think if we refresh the page right, you get the first five uh things as the latest ones, like the latest.

A

Okay, good yeah.

B

And without any sorting, these would be the latest ones.

A

Okay, so it performed a commit graph and.

A

Okay, that's a little surprising that it would show multiple gcs one right after another. On the same on the same repository.

A

huh Okay, how about let's look for gc.

A

There are already 20 entries with gc good.

B

Here I was thinking yes, if you look at this example, only the rate at which the file is very very fast. So is there any way of me cleaning this file because I didn't add any mechanism of cleaning you know, or you know, having a fixed size, because uh data would only be added into the file, but there's no way of you know restricting it.

A

But tell me what would what would because you're rewriting the file every time you every time you add new data and you're disposing the your you've got some disposal process for the data? Don't you so you're, you're you're, saying I'm only keeping this or are you keeping data infinitely.

B

Yeah yeah, for now it's like infinitely because I didn't add any way of you know removing the old data that mechanism has to be added, but I was not sure so, I'm not sure about how do I proceed with that.

A

Well, but isn't isn't the removal of the data just a matter of deleting it from the linked list and then, when it's saved to disk it will be, it will be gone.

B

Yeah but then how many records do I store? That was my question like I assume, if there are many many gentiles or caches okay and then, if we have a fixed amount of size of, like you, know, 100, I think we wouldn't even display data of other caches present, because you know all of them would cross 100, for example. So what would be a fixed amount or what would be the fictional.

A

Yeah good good question, so should a should be, do we ask the user to give us a value? Do we just choose the value ourselves.

C

Is there a way for us to um to not show each? I mean to club these uh to only publish the record for a repository when um whatever tasks were designated uh for them once the first batch of that has been executed, then we publish that instead of publishing each entry of the repository with each task.

A

Sorry ask your question again: richard.

C

So my question is that let's say I have a repository, the git plugin repository and I have uh I have commit graph and I have gc so once the first commit graph in first gc, and that is the batch of tasks that I'm going to run. uh You know that is, that is the first. uh I mean series of sequence of tasks that are going to be done for this repository, so once that is done, is it possible for us to then show the result instead of showing each record?

C

Because, with this approach we will have, I mean we can't we don't have the control there um of what rushikesh is trying to say right if you're going to delete if you're going to delete entries after a fixed amount of rows have been created, you cannot make sure that each repository which was present within the cache is going to be displayed on the table, because it is very well possible that, since comment graph was running every minute and there are, let's say, 20 repositories. It would only I mean the table would be filled.

C

uh You know within let's say 10 minutes with 100 entries, and then you have to make a delete, because that is how your optimization strategy, or whatever the disposal strategy has been said, so so go ahead. My my question is: how does user make sense of this data in the sense that if we, if we are able to batch, I mean if I'm able to see for a repository?

C

um What is the task that has been run and it can be multiple tasks and, along that the count of the number of times the task has been run, that what I mean still I mean it would only it would consume less I mean space within the table is what I'm trying to say within a row f, I might be able to show more data.

C

I don't know if that's possible or not, but I I guess that would make it more. um You know easy for us and easy for the user, because, right now, when we have collected- let's say 67 entries, how do I? um How do I make sense of this data.

A

Right, well so so could could I try a different analogy for me. I think I think we want some sort of. It would be nice if we had some form of a sequence number to tell us which thing was executed first and and which was executed later.

B

A

That's his previous execution or something else that might, I think, help people comprehend what the sequence was, but in terms of the shell, we limit to a fixed number of records. What if we? What? If we used a different limiting algorithm and said we will limit to not more than n records per the combination of repository name and task, so think of the repository name and task as a job in jenkins.

A

It's not but think of it. That way, and we say I'm gonna keep five I'll, keep seven, no matter how many there are. So if there are 10 000 cash repositories, we'll keep 50 000 records because we need to keep some record for every one of their of their repositories in every task that they ran.

A

If we, if we use that technique now rushikesh, that means you've got to do something more sophisticated as you delete things from the linked list, but but I think iterating a linked list and discarding things from it is not that that painful.

B

Yeah, so what you're saying is each repository we would hold like five records, for example of them you know, and and that of each type so assume elastic axis is a plugin five commit graphs up there, you know, and then five gc of that is what you're saying right.

A

That's what I was thinking, what do you does that sound reasonable to you? Does that sound like that? Might work for the user.

B

Yeah, that's that sounds reasonable. We could kind of store it in a hash map as well. You know.

A

Oh right there certainly there are other data structures that make that style of storage much easier. Aren't there.

B

A

So that that sounds very reasonable to me to say: okay, we're going to keep, because we think you care as a user, about the task that's being performed and the repository where it's being performed we should probably now now is there?

A

Is there a way with these, this very elegant data table to do some form of parent-to-child collapse where all commit graphs for a single for a single repository are grouped together automatically as parent. You know, I I don't know you, you can look at the data data tables and see what what uli has has made available. I'm not sure if it's got a grouping concept or not.

B

It has some concept of a collapse or I've seen that oh.

A

It does okay, it does.

B

It does so uh like. Can you explain like what was that uh feature about the collapse thing like grouping.

A

All I was thinking was okay. Today I see at the moment I see many rows with elastic axis plug-in gc and for visualization purposes. It might help me if those were an expandable thing where this shows up as one and older copies of older results of the same thing are hidden under it as a collapse and expand.

A

Now now, that's that is so completely not not required right. It's it's just okay! As a user, it might be easier for me to understand what's happening if I collapse and expand to see what the history looks like.

B

That actually makes sense that would even you know, be easier to read. I I would I would try seeing you implementing that even see how it fit in.

A

Yeah now now perfectly understood, if, if the ultimate is hey that doesn't work, yeah, it's or or gee that just doesn't make sense, that's a bad user experience. Don't do that, then I I completely understand that as well. This is this is actually really quite impressive. I mean look at this. I can sit here and search and there it is, and now the the 25 applies to my search.

A

Oh oohlie has done julie and his students have done amazing things here. This is great.

C

So, as a deliverable, we've achieved what we want to show to the user, and I think what we've recently discussed is an optimization that we could perform if that is possible.

C

B

Right yeah: oh there are few things like. If you go into that terminal right, I think you would find like if you open the terminal from which you have started, this you'll find it you will find get versions. No, can you open that dominant yeah yeah? You will find these okay. This is something I'm not sure. This thing keeps happening because this thing actually, when I want to get the get version of the underlying uh computer right to check whether it is or should I run legacy maintenance or the normal maintenance.

B

I need to call the underlying git version, so this thing keeps happening. Is there any way of stopping it? Because I couldn't.

A

I I think there must be, but we'll have to look at it and see. I it's not immediately obvious to me. Wouldn't we can't can't. We somehow remember that we found this version before. Is there a way to remember that?

A

Okay, there we go. That's that's kind of elegant. We see. Okay, there's there's my cue now we watch to see when it moves.

B

The thing why I didn't uh you know create a field to remember. It is because in jenkins we have a way of changing the you know, get executable right, the underlying so assume on the next cache. When I want to run uh the next dash, which is running like the get uh maintenance uh task. So then I would be using the uh version set in the you know: ui global configuration.

B

So that was one of the reasons why I didn't change. You know then store the get version.

A

And and that that that makes sense, at least at some level, because I could on my controller, I don't know why I would, but I could, on my controller, have multiple command line. Get versions installed right where I'm and I've got several different command line. Get tools for some specific need: okay, one two, three, four, five: six, so it just! I think it just completed more, and here we go 112 records.

B

This yeah, so that was one thing I wanted to discuss. There were a few other things. Okay, one thing is about documentation. Like do I document the code which I've written or when do.

A

The answer is yes, okay, so the documentation should go into this location. Here.

A

There's this readme and given, given the nature of this, that it's got a what I'd call a very nice ui component. You should probably take a screenshot and embed the screenshot.

A

Just as this this picture has a screenshot. You should probably put a section in there that describes it and has a screenshot look. This is how it how it looks.

A

B

Regarding you know uh documenting the code, do I do that as well or oh, you know the methods and the parameters of method takes.

A

So java javadoc for javadoc is highly recommended for public public methods. So yes, otherwise somebody else has to do it and if it's me I'll just make wild guesses as what your intent was and I'll write. Those wild, guesses and people will then complain. Mark you made a wild guess and you were wrong.

B

That was something I wanted to ask. Oh any.

A

Any suggestions on the ui.

B

Yeah, the ui footage.

A

I don't have any, I I find the cron syntax a little bit challenging, but it's very much. The way jenkins does things so you're. Absolutely consistent with the rest of jenkins, cron syntax is how it's done. I just uh I'd love to have a calendar picker. You know all sorts of exotic things like that, but the problem is, none of them are functionally rich enough to replace cron syntax because kron I can say at daily, I can say at hourly.

A

You know I can say yeah and then there are all sorts of now. I guess there is one that it would be nice if we could do a.

A

Could we get an online help available that would coach them on the cron syntax, because we don't have help icons here and, and that may or a help, help icon even for the commit graph? What is a commit graph and how does it help them? What is prefetch and how does it help, because we can certainly describe that in the online documentation in the readme here?

A

What are these tasks, but the user is accustomed to reading the help from a question mark right next to it,.

B

Oh because I tried adding the help files, but you know I was facing some kind of problem while adding the headphones so.

A

B

A

Yeah and we may have to- and we may have to request assistance from someone else, because my my success rate with adding help is far less than 100.

A

I have to work very hard to find the right place to put it in the in the ui elements.

B

So there's this commands, I was thinking of you- know, suggesting users to use commands such as hourly daily rapid. You know and.

A

B

Taxes, because uh the underlying architecture of our layers, assume, if I put common graph hourly and gc, also already both of them, don't run at the same time. You know there's a random and you know: there's a random. uh You know time selected at each hour and both of them are scheduled in such a way that you know jenkins is not overloaded.

B

So I was thinking of you know, adding that as well into the readme, so that you know it will be beneficial.

A

And I think that that is a very wise, that's a very wise thing for you to recommend, especially because it it avoids the risk of them, making a typographical mistake which causes them to run much more frequently than they wanted. There's there's it's not free to run these operations right.

A

The the execution time is a reminder to us, even on a perfectly packed repository and one that's as small as the get plug-in is it's something under 100 meg that still takes 600 milliseconds, I'm I'm sure if we did actually, we ought to just just to be absolutely obnoxious.

A

um Minus minus bear minus minus reference.

A

Just a minute bugs.

A

A

All right so cloning now.

A

Okay, now just to show you how embarrassing this is it's 105 megabytes.

A

So when that one runs and now that you said that the way we want we get that to be seen, is we restart.

B

B

Also regarding the prefetch, what is there like? We just commented or like what about that, because we have private repositories as well right. So how do we proceed with that?.

A

ah Right so prefetch pre-fetch will. How is the cash being popular? Oh there really. Isn't you don't know the credentials for that that repository right, you simply cannot because they must not be written to the disk if they're written to the disk, that's actually a very bad choice.

A

So so well, so maybe what the answer then is? Is we just skip prefetch? If it fails due to credentials, because we cannot, we can't do a pre-fit prefetch without authentication and in order to have authentication, we would have to somewhere record the credentials that were used to access that cache.

B

A

B

Way of knowing it before and that it's a private repository, because if once I call the command line, get right and then once I call the command line get and then I start scheduling running the prefetch command, then I have no control over the process. Okay, so.

A

So a technique you can use is you could do a git, you could make a call to ls remote and ls remote will fail if it's a if it's a privileged report. If it's a private repository.

B

So you're saying of get ls remote.

A

Yes, so there is a there is in the get client plug-in. There is a method that invokes ls remote and if you call that method, it will fail for you. Okay, if if the repository is is private and you have not provided credentials.

B

Okay, that kind of makes sense, then, using that method, I think I can, you know, skip the maintenance. uh You know for private repositories and run the prefetch.

A

B

Yeah, I think that was it.

A

All right, I am going to get some sleep, go, go ahead. Yes,.

B

Can we refresh the page and you know see whether we bought the cash or not.

A

Yes, let's do it, okay, get maintenance and let's look for jenkins dash bugs there. It is.

B

10 seconds.

A

And the execution time for that gc is longer than any other gc right. Let's yeah.

B

It's around 10 seconds.

A

Okay, so if we sort by execution time very good all right and that- and that is definitely let's look at- that- that is a no-op.

A

uh Because if we look in jenkins bugs get objects, there's nothing in objects except pack and in pack there is exactly one file: a pack and one bitmap.

A

Now there isn't a commit graph yet is there.

B

Elbindo objects.

A

Yeah so so it's run gc twice now, that's a little surprising that it's run gc twice but has not run commit graph. That may be back to my collision.

A

So, let's make it two.

A

Three: five save okay; now we refresh and now in jenkins-bugs.

B

So I think, with this we.

A

B

What we you know, the deliverables, what we wanted there.

C

B

Other things which you know can get better, so I think I would be working on those the test for the get client plug-in also has been written only for the legacy. I think the legacy commands those are not working. uh I didn't write the test for those okay, yeah I'll I'll I'll. Look into that as well, and.

A

Very good excellent work, rishikesh, really good.

A

So we will plan to meet again next next week. Now I have to warn you next week. I am arriving home next week on about 12 hours about 24 hours. Prior to our scheduled meeting, I leave alaska on an airplane to return home after having visited my grandchild, my new grandbaby was just born in alaska.

B

Congratulations.

A

And so I may not be when I, when we meet a week from today, I may not be, I may be even less functional mentally than I was today. I apologize for that in advance, but I may be very sleepy.

B

We can schedule the meet for next day as well on the thursday as well. You know.

A

So so, for me, I'd prefer tuesday, and then let's see if we need it or sorry for you what is wednesday I I must talk in your time zone, so the wednesday morning meeting actually works quite well. Just if we find wednesday when you, when we're meeting that I'm not not useful, then I may say: okay, let's try for another day.

B

Yeah, because if we have to meet you know, we can, you know, discuss and get things done. So it's fine if it's on a monday or a tuesday or a thursday or.

C

B

So yeah that's up to you so.

A

Great very good, well shikesh! Thank you very much. I'm going to go ahead and I assume we call an end to our session and I'll. I hope to post the recording tomorrow thanks very, very much. Thank you.