GitLab Sharding Working Group, 18 May 2020

Previous Meeting Next Meeting

⏯

youtube image

►

From YouTube: 2020 05 18 Database Sharding Working Group

Description

No description was provided for this meeting.
If this is YOUR meeting, an easy way to fix this is to add a description to your video, wherever mtngs.io found it (probably YouTube).

A

All right so first topic at one point in time we had a capacity planning issue where we were going to measure how much how much Headroom we had before we hit the scalability wall, and then we have some anecdotal information and the capacity planning issue itself was actually closed.

A

We don't have specifics that I'm, aware of and sadly I don't see, Alberto or Jose, who are probably best suited to answer those questions. So does anybody know if we have these specific measures in place like I've, seen anecdotally, like CPU or probably like double provision of what we need and I've seen measurements on other areas? But do we have one single source of truth that says how over-provision we are or how much Headroom we have for our primary database.

B

No I can bring an update on the effort that has been done so far, and actually it's been reinstated these days so basically in part also the migration the upgrade to 11. There was the idea of try to run the specific existing benchmark for gitlab GPG, if I recall correctly by the acronym right um to do first up regression, benchmarking for phosphorus 11 and then to scale it up for capacity planning purposes, but the DBT tool was was a discussion with the team behind it.

B

They said that it's not yet for these years, it's ready so for the upgrade we had to write our custom benchmarking tool that was used for only for unlikely designers from strength for performance regression on phosphorus 11 operate. Now this tool could be leveraged and and used more thoroughly for capacity planning. Basically, it recorded. It has like a simulation of the most frequent queries with random parameters, and they can be replayed at higher speeds. This tool can be elaborate for capacity planning and actually Alberto asked on the team to do this start doing.

B

Ideally, I'll get back to the GPT development team and and started work to also make this tool. Dick is already a tool design for this to be used for this, but man that will apparently take a lot of effort. So I guess short time. We can continue with the tools we developed and you know improve it. It will also take a bit that probably we're talking about this or a couple of weeks, more than anything else to try to get it to the point where we can do a composite blood capacity plan.

A

Their issues that you can link here that we can follow along with.

B

I'll link them there, I need to search for them that I'll link them there. Yes, okay,.

A

Thank you, but I guess. The unstated comment here is we're not hitting a wall right now. We feel like we're comfortable with Headroom I think a couple a month or so back. We said we felt like we still had probably a year or more horizon on growth. Given the current provision. Hardware and known growth statistics.

A

B

A

B

What to say there we're probably over-provision terms of view I'm, not that sure we are totally over proficient in all the areas. There are a lot of disk usage and sometimes you've hit the walls on the maximum disk speed. On the other side, there is potentially yeah we're running with very old CPUs right now and if at some point we will decide to give us the pleasure of having some modern CPU on the US East one, then we could also have more have more Headroom there.

C

Okay, just quickly what CPUs are you talking about of our because we've moving a lot of stuff to see cities in a degree in that region, so.

B

Yeah, so there's no second generation instances, there's we're running on n1 and one hike man 96 right now, and this is first generation. There is the second generation which, at least on any instances, is not present on the USS, the last time I checked in those couple of weeks ago and other than n2.

B

We are also we submitted a request that I think was not the proof in time for Pete, for this after 8:00 to 11:00, to also switch to modern, modern, er CPUs from n1 series, basically to jump from Haswell to skylake, and I believe we are still on hash. Well,.

B

But basically I'd sent two instances which are significantly more modern and can scare more on several regions, but not on this one.

D

Yes, unfortunately, we can't get into in all of the availability zones in US East one. We can only get it in two of the three that we use.

D

But we can get c2 in all three.

E

House, that is c2 of any said a said, a much better. Is it much better much more performance, C, 2 or C.

D

2 is is the same as n to accept higher clock rate and slightly more expensive, but I think if.

A

D

Difference in in what turbo boost frequencies are available to them to the nodes, but it's the same generation as n2.

B

So see to compare has well on n1, which is what I believe where we are right now. Yeah, the hospital scary late, but I believe we're still on hospital should provide significant difference. Yeah.

C

With Redis, we've seen a very big big step up and we're running on this very much everywhere.

E

But the yeah, the main point is that we know that we have some Headroom there, so not that we don't believe that we need it now. But it's good to know that you know it'll be a an easy fix, can kind of easy.

E

A

Anything else we want to cover topic before we move on to the next Andrew. You were gonna, say something about pricing. Favorite The Voice is correct. No.

C

I just wanted to say that there is a it's slightly more expensive, but, like our experience has been that because of the extra power that we've got, it's actually costing us less, because we have smaller fleets so, but that's obviously different from how you'd use it for for Postgres rides, but for web fleets it means we can do more with with less nodes. So it's actually a little bit cheaper, yeah.

D

I think for Postgres, it's gonna be improvement anyway, but simply because we need we don't know. We were like, like I've, said previously, we're way over scaled in terms of CPUs.

D

We could easily cut our number of cores by half just on n1 so going to see we could. We can easily cut the number of cores in half which would save us money overall.

E

Anyway, yeah so I didn't say anything on the beginning at the beginning of several was talking, but we we working on this horse a which off today, by the way so the office today and Congress throughout Iran myself we're working on this ice off made largely because you know Craig a myself with me that this is what's operating. We wanted to clarify this as soon as possible.

A

Then Christopher asked below about the table row size. I know we have an issue up everything.

F

Yeah, like another issue, where we're talking about table spaces and counters associate with those table spaces I think that they all got addressed and such that the overhead that were back on in a sane state but I just was checking word that something else when I'm monitored or periodically or if there's alarming, already set forth.

A

Thought there was alarming set for it, but maybe Alberto or Alvaro can correct me if I'm wrong in their idea do.

E

You know anything about that: I.

C

Far the biggest table there's been the major Kristoff's right and that's the most problematic and there is a quite slow going migration to move those two to object: storage, but I, don't know what the second biggest is, but as far as I know, that's the biggest by some stretch but I, don't I, don't have any of the details. Yeah.

F

So, that's that's all it's just you were asking which one should we be measuring? It was like that's another one.

A

A

Thank you. Yes, points.

G

There sure so for the for the lack of a better term, I've called that anti sharding features and I think this is something that we should talk about more. It's in I think there is product questions further than that and also organizational questions, but maybe I can trust. You try to explain what the intent of sharding features are. So I think something that that's been sort of accepted that we go with is a namespace sharding idea.

G

That's that's saying that many features and and their data they live inside a namespace and that that's what it enables us to to apply this idea to to get a product, but on the other hand, reality is also that it's not exactly aligned with that. So we have features that don't believe inside a name days at all or they they are being accessed from a different perspective. For example, users are global. They are not within the namespace, so they can.

G

They can work in many different namespaces and there are a couple of examples for that where we we know that those features they don't they're not going to play nicely with with namespace base charting and that's really another choice of whether or not we do it on the database level or on the application level, or it even surfaces with partitioning that you you're basically optimizing for one thing, and then you have a couple of other things that don't go well with that pattern and I think at this point it would be interesting to to identify those features more and then also talk about how we want to tackle those going forward, for example under users, since they are global.

G

Now we might want to ask is: do we want to go in that direction where those are confined to a namespace? So, as a user, can you only sign up within a particular namespace sort of reflecting the idea that that people only care what is inside their namespace? If the customer is inside a namespace, their users only care about their namespace? This is something that we want to go into the direction that we want to go into.

G

That might also I think it's very relevant for forget Lacombe, but perhaps there are also different trade-offs for self hosted. Installs.

C

Another mentioned like briefly in Arcola Lily undress, but it was just mentioning to people is anything. That's got a water increment key. That is one of those things as well, like you know, like a user ID or even a job ID on a pipeline job that has a globally incrementing water. Id needs to be considered in the same discussion.

C

F

Sorry, Christopher good Andre, so I was just gonna say that you know I think I think. Obviously the first step is to recognize these.

F

My suggestion is is and if it doesn't work or we think there's a better strategy- that's fine, but to raise it with the specific feature teams to see if we can get them to potentially start working issue, because you know the divi theme can scale out to solve everybody's problem. We've got to get these back to the feature teams. Potentially. From that perspective, you mange- and you know, like one of the I hate, to say this way, because it sounds incorrect, but a you know, thinking about it more.

F

It's probably the right way to be thinking about it. Anyways is, is you know if you have a feature that you want to run calm and it can't scale and that's you know sharding supposed to help solve right. So so it feels like that there really is future team's responsibility to make sure that they their thing in the framework so that they can scale effectively.

F

So maybe you know we wouldn't obviously not turn off any existing features, but we would probably part as it highly for them to go work that so that they could get it addressed. Mm-Hm.

G

And currently, where we're going with the audit log feature and even they're, even even if that's a that's a quite separate feature, quite a small feature in your lab- it's not very much entangled with everything else, but even there we we already recognize that there are different ways of using the audit log and I think. This is a really good example where we should, where we can work with with the second team, owning that feature figure out how to do that going forward.

G

If we apply a name space based partitioning idea, how do we deal with the sort of cross-cutting anti sharding access patterns because they come become more expensive for day they even break if we don't design them differently? um How.

F

Long, do you think it would be to create the list of the one, so we know about like you, think, that's something you get get back with a week within a week or is it like a couple weeks.

G

Well, I, don't know if that's gonna make for a complete list, but like identifying I was going to identify a couple of good examples. For those names like sharding features, um perhaps not a complete list, but it.

F

Doesn't have to be a complete it's trying to get a sense for. Are there a handful of these or are their tens, tens of these or their hundreds of these? That's that's. What I'm, trying to get at more than anything else, was called how we attack it. May change based on the answer to that question. At least.

E

F

Sense for you know, researchers I think.

G

The fundamental feature is the user is one because users is sort of on the same level as a namespace, its books, global, and that really makes it a hard problem. In that case, that's why I was wondering if we have a sort of strategy going forward to perhaps confine those users into the namespace.

G

If that is something that we can even do on get like, um where that would be acceptable.

A

You know one product rep had to just jump off for kickoff meeting so and then the other concern is even if we're trying to constrain it to namespaces.

A

That timeline doesn't really sync up with what we're trying to do with charting. If we're trying to implement charting now that solves some early scalability problems, then the timelines don't seem to sync, then we'd have to find a way to iterate on that maybe feature by feature and I'm not there'd, be a lot of discovery that would need to be done there so and part of this discussion.

A

Sorry, if I'm stepping on what you were gonna continue at thunder as part of this discussion came with our conversation with Sid last week when we talked about our approach with Auto bogs and focusing on the creative that date, and he was concerned that that's a local optimization, as listed here under the first bullet point d and use again using the specific audit log functionality. If we partitioned on namespace, then that would we believe it would break some functionality within gitlab.

A

Like the instance wide audit log view, it probably wouldn't work if we left everything else in place and we partitioned on namespace, whereas we pretty sure that performance and functionality would be greatly increased by focusing on partitioning on the date.

G

Yeah, so the for the audit log, the the date, is sort of ideal, with the features that we that we know already, and in that sense it's a good choice for for the audit log. But if we were, if we wanted to sort of strategic strategic lis explore, how do we deal with those those patterns like? How do we deal with those entire charting features and give and perhaps discover a good example for that?

G

You could also explore the idea of applying the namespace base, partitioning approach and then figuring out how to how to deal with the admin view how to deal with it by user view. I think that would be sort of valuable for us to to figure out, because it's such a such a good example that we are going to see more more of if we, if we follow that namespace based idea.

G

But it's certainly not the the ideal choice for for the audit log, as it's done at the moment, but I wonder if we, if we you know, should be rather like try to optimize this particular feature for the audit mark as it is now or do we want to assume that we want to go forward with namespace base charting and then explore that more and discover those problems. More kind of the point we're at at the moment.

H

A

H

H

Even for the auditioning I mean there might be cases that there are features that I do sharding, but are we blocked from making progress on the charting uh based on the namespace I'm, guessing that we can do some charting a war proof of concept here now when making some progress, while thinking about those additional reading features how we want to deal with that? That's just a my my thinking at this point.

H

We need to iterate on the ideas instead of a complete vision or a picture, what could be the blocking things that we cannot move forward? I mean at this point. Everything is it's based on the analysis, no fact to support.

G

Yeah, no we're not we're not blocked. In that sense, we can. We can execute on the by by time partitioning, and we can also execute on the name space based partitioning with the ladder. We know that some features are going to break so I think it's sort of we already know this is going to happen right and- and it's not not working well with the with the change that we would implement.

G

So that's why we wonder like how would we treat that from a product perspective, because we can't just let them break right, but it's it's not a blocker, but it's a good time to ask that question. I feel.

H

Cool thanks, yeah we. This is all the question we need to keep investigating. Meanwhile, we also want to make some improvements. I mean progress here, to connect the data, whether something doesn't work or something actually works, the prove it.

G

Yeah, like we could one thing that we discussed is like going at and sort of prototyping each of those paths like implementing the by time and by namespace, with the existing feature set and then doing some measurements. How things are going I think from from the existing data. We can also already anticipate how that looks like, and we that's why we, where we know that things are going to break, but is it something that we should do like prototype that forward, because that's nothing that sort of directly feeds back into the product right?

G

It would be more exploratory at this point in time. Yeah thanks.

A

Yeah that leads to the next question about being able to adequately test, and we have an issue that we're asking questions on Joshua's gonna follow up with our security folks to make sure that we're okay with the pathway identified for testing production like data prototyping, does make sense and want to make sure that people understand if we do go with the time partitioning, because we get that performance improvement, it still feeds into our I.

A

Guess our research into what implementation of partitioning and charting will require and what may be gotchas that we encounter along the way as we go to the namespace strategy partitioning and it gets us the advantage of getting that audit law and performance improvements as well. So it's not if we go down a different partitioning strategy in this case, the time base. That's not lost effort that that makes sense. It's just not a perfect alignment with the bigger picture of charting and partitioning on namespace across the entire database.

A

The other question there on that topic.

A

All right and I think we've in the conversations we've had we pretty well covered. What's been done, so you can read the epic on audit log on what we've identified, what we are and we have a meeting database team as a meeting for more specifics on what we're going to implement this week and then and that's in details there and what's happening next.

A

So anything else, this working group meeting I.

A

Guess one other thing I wish the air is Sid did ask for an update on what's been done so far, and rather than trying to read through this whole talk. I tried to summarize in the working group handbook entry here under the meeting recap and I will keep that going on a weekly basis.

A

All right there's nothing else. Let everyone go thanks. Everybody.