A
So thanks for going through these items already; it sounds like you already have a discussion here. First of all, I just wanted to summarize how this will be set up. To answer your question: I think between five and ten shards total across all environments, where each shard is a full Patroni cluster. Hopefully that helps with sizing and gives you an idea.
A
It's going to start out pretty basic. The first phase is probably going to be a replica: we'll create a new database cluster that functions as a replica, and then there'll be a feature flag that will start using the replica for certain tables, probably CI to start. Eventually we'll make it so that cluster is no longer a replica but functions on its own, independent of the main cluster.
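As a rough illustration of the flag-gated routing just described, here is a minimal Python sketch; the flag name, connection strings, and table list are hypothetical, not the actual implementation.

```python
# Hypothetical sketch of feature-flag-gated routing of certain tables to a
# replica shard. Flag name, DSNs and table names are illustrative only.
import psycopg2

FEATURE_FLAGS = {"route_ci_reads_to_replica": True}  # e.g. fetched from a flag service

MAIN_DSN = "host=main-patroni.internal dbname=gitlabhq_production"
CI_REPLICA_DSN = "host=ci-shard-patroni.internal dbname=gitlabhq_production"

CI_TABLES = {"ci_builds", "ci_pipelines"}  # tables served by the new shard

def connection_for(table: str):
    """Connect to the CI replica when the flag is on, else to the main cluster."""
    if table in CI_TABLES and FEATURE_FLAGS.get("route_ci_reads_to_replica"):
        return psycopg2.connect(CI_REPLICA_DSN)
    return psycopg2.connect(MAIN_DSN)
```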
A
So what is a shard? A shard is a Patroni cluster consisting of N Postgres nodes, plus a Consul cluster, plus PgBouncer and a load balancer, and I think that's it.
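As a rough aid for sizing, a small Python sketch modeling the per-shard components just listed; the node counts are placeholders, not agreed numbers.

```python
# Illustrative model of one database shard as described above.
# Node counts are placeholders, not agreed sizing.
from dataclasses import dataclass

@dataclass
class Shard:
    name: str
    postgres_nodes: int = 3      # Patroni-managed Postgres instances
    consul_nodes: int = 3        # Consul cluster backing Patroni's DCS
    pgbouncer_nodes: int = 2     # connection pooling
    load_balancers: int = 1      # fronts PgBouncer

    def total_vms(self) -> int:
        return (self.postgres_nodes + self.consul_nodes
                + self.pgbouncer_nodes + self.load_balancers)

# Five to ten such shards across all environments were mentioned above.
print(Shard(name="ci").total_vms())
```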
B
A
No, well, there's going to be a dedicated GCP project per environment, similar to what we have now. I've created these; they're called something like production-db and staging-db, and then there are some sandbox projects. Within each project there'll be N database shards, and each shard will have its own network space. The current idea is that we'll peer the VPC in each environment to the corresponding environment, so production-db will get peered to production, and then using firewall rules we'll give production access to individual shards.
B
Makes sense. So, going to my next question: my understanding is that this entire effort is driven primarily by the scalability limits of the existing Postgres cluster. We're starting out to get some scalability headroom, but obviously at some point we'll reach a limit with that as well, and I'm curious; obviously it's not going to be in a few months.
A
The forecast for how much runway we have for the database is sometime next year; I think the estimate was April or May of next year before it becomes critical that we have to do this. We're kind of doing the sharding now to get ahead of that, because we figure there's probably going to be lots of issues, and we don't know; it's possible that timeline could change, right. Okay, cool!
A
Great. So the reason why I scheduled this with you is because you're the most knowledgeable person on the monitoring end.
A
Okay, great, I was kind of on the same page there. I don't think it's going to be operationally difficult to have one per shard, but I thought it was overkill, so I figured one project.
A
What we've just described, yeah, okay, so that's simple. So, down to... well, let's jump to 2c. You made a comment here about indices. So I'm thinking... I don't know what I'm thinking here, to be honest; I have no idea. But maybe it's better not to use the existing index, so we don't mess it up, and we create a new one, something like pubsub-postgres for staging-db, or for the db shards. Yeah, I was thinking something like that.
B
So there's a few things to consider here. One thing is the volume of logs. One of the reasons we were splitting indices in the past was that the log volume was massive and Elastic just wasn't able to cope with it, so splitting it into separate indices allowed it to use more shards, but...
B
So if we had, for example, a Postgres index per GCP project, it would be easy to find the relevant logs. The reason I say that is because that was the naming convention we used in the past: component and then the GCP project. But that might not be the best fit; we might want to reconsider it for whatever reason. I'm just voicing what we were doing in the past.
B
It might be perfectly reasonable to say we're sending all Postgres logs from production to the single index, because we don't want to split them into separate indices, because there will be a transition period where the existing Postgres shard will still be running in the gitlab gprd project. So the question would be: how do you transition? Do you send to both of them, etc.?
A
Did we... I forget: does the index name use gprd and gstage as the environment names, or does it use production and staging as the environment?
A
So, given that we have env equals gprd-db and env equals gstage-db, should I just create new indexes using gprd-db and gstage-db, keeping everything else the same? Then we just use the project, because it's not quite the project name (the project name is production-db), but it's the same kind of thing; it follows the same convention.
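For illustration, a small Python sketch of the "component plus environment" naming convention being discussed; the exact index names such as pubsub-postgres-inf-gprd-db are assumptions based on that convention, not confirmed values.

```python
# Hypothetical helper following the "component + environment" index naming
# convention discussed above. Prefixes and environment names are assumptions.
def index_name(component: str, environment: str) -> str:
    return f"pubsub-{component}-inf-{environment}"

# Existing-style name vs. proposed per-DB-project names:
print(index_name("postgres", "gprd"))      # e.g. pubsub-postgres-inf-gprd
print(index_name("postgres", "gprd-db"))   # e.g. pubsub-postgres-inf-gprd-db
print(index_name("postgres", "gstg-db"))   # e.g. pubsub-postgres-inf-gstg-db
```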
B
How do you think, from the perspective of querying those logs, how difficult would that be?
B
Well, but it would still only be Postgres logs, so I wouldn't expect there to be... and presumably it's deployed using the same method, so it wouldn't be the case that in one environment or GCP project it's running in Kubernetes and in the other it's in Chef, and thus you've got different fields. So I guess it's something to figure out as we move along.
B
On the other hand, someone like the EOC might get alerted for Postgres performance, go to the... yeah, that's true, yes, and forget that there are multiple indices they need to check. I don't know; I think we'll need to, or you'll need to, figure it out as we go and basically make a decision here.
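One way the multiple-indices concern is often handled is with a wildcard index pattern at query time; a minimal sketch with the Python Elasticsearch client follows, where the host, index pattern, and field names are assumptions for illustration.

```python
# Minimal sketch: querying across per-project Postgres indices with a wildcard
# pattern, so the on-call doesn't have to remember each index name.
# Host, index pattern and field names are assumptions.
from elasticsearch import Elasticsearch

es = Elasticsearch("https://logs.example.internal:9200")

resp = es.search(
    index="pubsub-postgres-inf-*",          # matches gprd, gprd-db, gstg-db, ...
    query={"match": {"json.message": "duration"}},
    size=10,
)
for hit in resp["hits"]["hits"]:
    print(hit["_index"], hit["_source"].get("json", {}).get("message"))
```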
A
Yeah
I
mean
I
could
it's
it's.
We
can
probably
delay
it
like
we
could
do.
We
could
use
the
existing
index
on
staging
we're
not
going
to
do
this
on
prod
for
a
while.
Yet
so
we
do
use
the
existing
index
on
staging.
We
have
to
come
up
with
a
shard
label
for
the
main
cluster,
so
we'll
call
it
like
main
or
something.
A
All right, great. Okay, on to number three. So obviously we moved a lot of our monitoring stack into Kubernetes, but for this I'm thinking that's overkill.
A
Knowing what you know, do you tend to agree, or do you think you would try to get this into Kubernetes?
B
Running Kubernetes would be overkill... well, we would still be running... so, in point 2d you mentioned that you wanted to get the metrics into Thanos. That means that some components of Thanos will have to be running in that project, and Prometheus as well. So we've got at least one Thanos component, I'm thinking about the Thanos sidecar, plus some Prometheus.
B
Both of them need to be deployed, configured and maintained, so at some point we'll need to update them. My thinking is: what's the easiest way to update? Let's say we hit a bug in Thanos, which is something that happened in the past. We've had to upgrade monitoring components on multiple occasions, and on one occasion we had to patch the Thanos binary, and just having a single source of that binary was very...
A
Well, right now we're flirting with the idea of using Omnibus, but you know, we're not ready to marry it. Right now we're just doing Ansible on base Ubuntu with Omnibus installed, and we're doing that everywhere. Part of this is not using Chef at all.
A
So
I
don't
think
it
would
be
complex
to
like
configure
prometheus
and
thanos
and
even
pub
sub
with
ansible,
but
you're
kind
of
right
like
having
one
process
for
pushing
those
changes
with
gitlab
home
file
like
with
kubernetes
and
another
process,
configuring
it
with
ansible.
Maybe
it's
not
ideal.
B
Having said all of that, I don't know if we have a case where Prometheus is running inside of Kubernetes and scraping endpoints on GCE VMs, and I don't know what the networking implications of that would be. It might be trivial; it might just require the VPC peering between the Kubernetes networking and the VMs. It might be as simple as that, I don't know, so that's something to consider. So again, there's pros and cons to both, I think. What's your take on this?
A
My
take
is
like
for
expediency.
I
feel
like
it'd,
be
simple
for
infant.
Like
you
said
like
for
inventory
generation,
it's
really
simple
to
do
this
and
ansible,
because
I
can
just
write
out
the
prometheus
config.
I
don't
think
setting
up
thanos
and
pub
sub
is
gonna
be
tricky
either.
I
would
just
like
create
one
vm
and
install
all
of
these
components
on
it
and
then
be
done
with
it.
That's
my
that's
what
I
was
thinking.
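As a rough illustration of "just writing out the Prometheus config" from an inventory, here is a hedged Python sketch that renders a static scrape config; the host names, job names, and ports are made up, and in practice this would live in an Ansible template.

```python
# Minimal sketch: rendering a Prometheus scrape config from an inventory list,
# the way an Ansible template might. Hosts, ports and job names are invented.
import yaml

inventory = {
    "patroni": ["db-shard-ci-01.internal", "db-shard-ci-02.internal"],
    "pgbouncer": ["pgbouncer-ci-01.internal"],
}

scrape_configs = [
    {
        "job_name": job,
        "static_configs": [{"targets": [f"{host}:9187" for host in hosts]}],  # exporter port, illustrative
    }
    for job, hosts in inventory.items()
]

print(yaml.safe_dump({"scrape_configs": scrape_configs}, sort_keys=False))
```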
B
So I was updating Alertmanager the other day, and some of the Prometheus instances didn't really deal with that very well, and I had to go through the Prometheus config, basically go through the config on all the Prometheus instances, and the config is spread across multiple places. If we were to run this with Ansible, that's yet another way to configure it. I'm just voicing that.
B
I'm
not
saying
that
it's
a
bad
thing
or
a
good
thing,
necessarily
and
my
point
being
since
we
intend
to.
I
will.
I
don't
know
if
that's
still
the
case,
but
up
to
my
best
knowledge,
we
intend
to
move
chef
conflict
to
ansible
conflict.
So
perhaps
this
is
a
good
first
candidate
because
he
would
be
starting
from
scratch,
and
we
once
we
once
we
have
all
the
ansible
playbooks
for
prometheus,
then
moving
the
existing
prometheuses
that
are
managed
with
chef
would
be
much
easier.
A
I guess that's a possibility too, yeah. I don't know if we intend to do that, or to try to go fully to Kubernetes with the ability to scrape things outside of the cluster, right, yeah.
B
Yeah, because one of the things you could leverage, if Prometheus was running on a VM, is the Prometheus service discovery mechanism, so it can talk to GCE to discover scrape endpoints. I don't think that's possible... well, I don't know if that's possible in Kubernetes. It might be possible that you just tell a Prometheus instance running in Kubernetes: hey...
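For reference, Prometheus's GCE service discovery is configured through a gce_sd_configs block; below is a hedged sketch, rendered from Python for consistency with the other examples, where the project, zone, port, and relabeling are placeholder values.

```python
# Sketch of a Prometheus scrape job using GCE service discovery (gce_sd_configs).
# Project, zone, port and job name are placeholders.
import yaml

job = {
    "job_name": "patroni-gce",
    "gce_sd_configs": [
        {"project": "production-db", "zone": "us-east1-c", "port": 9187}
    ],
    "relabel_configs": [
        {
            # keep the GCE instance name as the standard instance label
            "source_labels": ["__meta_gce_instance_name"],
            "target_label": "instance",
        }
    ],
}

print(yaml.safe_dump({"scrape_configs": [job]}, sort_keys=False))
```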
A
What about... I guess if we create the Kubernetes cluster in these new environments, we might as well put Pub/Sub in them as well, right?
B
Well, not necessarily. If your intention is to use the existing indices, then you could just configure fluentd on the VMs to forward logs to a topic in another project.
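In practice that forwarding would be done by fluentd's Pub/Sub output, but as a minimal illustration of publishing to a topic owned by a different GCP project, here is a hedged Python sketch using the Cloud Pub/Sub client; the project and topic names are made up.

```python
# Minimal sketch: publishing a log record to a Pub/Sub topic owned by another
# GCP project (the VM's service account just needs publish rights on it).
# Project and topic names are illustrative.
import json
from google.cloud import pubsub_v1

publisher = pubsub_v1.PublisherClient()
# Topic lives in the shared logging project, not in production-db.
topic_path = publisher.topic_path("gitlab-production", "pubsub-postgres-inf-gprd")

record = {"message": "checkpoint complete", "shard": "ci", "tag": "postgres"}
future = publisher.publish(topic_path, data=json.dumps(record).encode("utf-8"))
print(future.result())  # message ID once the publish succeeds
```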
B
The existing Pub/Sub infrastructure would just pick up those logs and you wouldn't have to do anything there. You might need to resize some of the pubsubbeat deployments due to the increased volume, but I doubt it; Postgres doesn't log that much.
A
Does
make
it
simpler,
yeah,
okay,
yeah,
you're
right,
I
guess
we
use
the
same
index.
We
would
do
that.
A
Okay,
cool
thanos
yeah,
we'll
have
to
create
the
side
car.
If
we
deployed
to
vm
we'll
have
to
configure
it,
but
I
don't
imagine
that'll
be
too
complicated.
A
Thanos
stores,
independent
defender
sidecar
so
so
does
thanos
store,
have
to
like
be
in
the
same
project
or
how
does
that
work
now,
like?
We
have
final
store
running
in
each
environment's
kubernetes
cluster.
B
That's a good question, and the short answer is: I don't know. I think they managed to move all of it to Kubernetes, but I don't know; I've definitely seen some VMs named after Thanos components, but I'm not sure if they're still being used or what's running on them. The short answer is I don't know. I think the intention was to move all Thanos stores to run in Kubernetes, and that has been started.
B
So if you wanted to have the metrics discoverable in Thanos, that would be one way to do it: basically have a GCS bucket, Thanos store and Thanos compact per GCP project.
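For context, the sidecar, store, and compactor all share an object-storage configuration pointing at that bucket; here is a hedged sketch, again written from Python for consistency, of the kind of GCS objstore config Thanos expects, with a made-up bucket name.

```python
# Sketch: the object-storage config shared by Thanos sidecar, store and compact
# (passed via --objstore.config-file). The bucket name is a placeholder.
import yaml

objstore = {
    "type": "GCS",
    "config": {
        "bucket": "gitlab-production-db-prometheus",  # hypothetical per-project bucket
    },
}

with open("objstore.yml", "w") as f:
    yaml.safe_dump(objstore, f, sort_keys=False)
```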
A
Yeah,
okay
yeah:
this
sounds
to
me
like
the
yeah.
The
easiest
thing
would
be
just
to
have
a
kubernetes
cluster
and
you
know
just
have
to
figure
out
how
to
scrape
these
extra
because
we're
not
going
to
be
really
monitoring
nothing.
It's
great
writing,
including
it's
all
going
to
be
outside
like
petroni,
and
I
mean
I
guess
we
could
run
some
exporters,
like
maybe
a
stackdriver
exporter
or
something
in
kubernetes,
but
I
think
most
of
you.
A
Okay, b I think we already touched on, and c we already touched on as well. So I think I'm good. What I'll do is I'll spend some time thinking about how we can monitor and scrape endpoints outside the cluster.
B
I'm curious to know which way you'll go, because if you decide that you want to provide an Ansible module or play for managing Prometheus, that would be really helpful to know.
A
Yeah, the issue is...
B
Sorry I couldn't be of more help. If you've got any further questions, feel free to message me and I'll try...
A
To
help
you
yeah
sure,
yeah,
no
you've
been
a
lot,
a
big
help
and
yeah.
I
appreciate
you
taking
the
time
I'll
I'll,
let
you
know
how
it
goes
cool
all
right.
Thanks
all
right
talk
to
you
later
ciao.