From YouTube: 2023-04-20 Scalability Team Demo
A
So this is an incident where we crossed the tipping point, and what you can see here is a high-level, generic representation of our connection pool becoming saturated. Obviously, right here, that's an effect, not a cause. It cascades to the rest of the application stack in kind of the usual way when more people want to run queries than the system can handle. So I'm going to dig from here into some causes.
A
There are a lot of interesting things to show, and I'm going to resist the temptation to go too deep into the tracing.
A
Okay, so this is our incident, and what we're looking at here is: a few times a minute, we take a snapshot of a standard Postgres catalog view called pg_stat_activity. It has one row per Postgres backend, and some of the really interesting data it gives is what that backend is currently waiting on, if anything. If it's active and running on CPU and not currently in a wait state, then there won't be a value for the wait event; but in this case we're filtering to when the wait event is the lock manager.
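A minimal sketch of the kind of sampling query described here; the team's actual collector isn't shown in the talk, so the exact shape is an assumption:

```sql
-- Count backends currently stalled on the LockManager lightweight lock.
-- Sampled a few times a minute, as described above. Note: this wait event
-- is named 'LockManager' in PostgreSQL 13+, 'lock_manager' in older versions.
SELECT now() AS sampled_at,
       count(*) AS stalled_backends
FROM pg_stat_activity
WHERE wait_event_type = 'LWLock'
  AND wait_event = 'LockManager';
```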
A
Lightweight locks in general are... let me take a step back, just because the vocabulary can be a little confusing. Normally, when we talk about database locks, we're talking about what is formally called heavyweight locks. This is a lock on a table or an index that says: hey, I'm using this table, please don't change its schema at this very moment, for example. When we need to update in-memory data structures safely and concurrently, we use kind of a mutex mechanism called lightweight locks, and in this case the specific flavor of lightweight lock we're talking about is the one that guards the hash table that records who holds heavyweight locks. That's why this lightweight lock is called the lock manager.
A
So I just want to clear up the terminology, because that can really throw people off. As that description kind of implies, when you see contention over the lightweight lock that guards access to the heavyweight lock table, what we're really seeing is contention over the frequency with which backends are trying to change the state of which tables and indexes they have locked.
A
So the resource we're starving for here is access to this particular lightweight lock, and it's specifically driven by how often we acquire and release heavyweight locks on tables and indexes. I'm kind of jumping into the solution before I've gone through the whole problem, but I'm gonna roll with it. By extension, this means that when we partition a table, say into 10 pieces, we now need 10 times as many locks, and it's not just locks on the table.
A
It's
locks
on
all
of
the
indexes
of
the
table.
So
if
you
run
something
like
something
like
select
star
from
projects
where
ID
equals
one
you're
going
to
acquire
the
lock
on
the
table
and
every
index
for
that
table
and
each
one
of
those
each
one
of
those
heavyweight
lock
acquires
requires
acquiring
and
releasing
one
of
the
lock
manager
locks.
There's
only
16
of
these
locks,
every
table
and
index
is
guarded
by
an
arbitrary
one
of
those
16.
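You can watch this happen from a session; a small sketch, assuming a "projects" table like the example above:

```sql
-- Relation locks are held until commit, so inspect them mid-transaction.
BEGIN;
SELECT * FROM projects WHERE id = 1;

-- One row per heavyweight lock this backend holds: the table itself
-- plus every one of its indexes.
SELECT relation::regclass, mode, fastpath
FROM pg_locks
WHERE pid = pg_backend_pid()
  AND locktype = 'relation';
COMMIT;
```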
A
But
what
we're
looking
up
to
here
is
things
like
schema
changes
where
we
do
partitioning
or
adding
indexes
or
or
something
even
even
less
obvious,
where
we
perhaps
change
some
application
codes
so
that
it
needs
to
do
an
extra,
join
or
or
similar
kinds
of
things
where
we
do
refactoring
on
on
a
query
that
to
split
into
two
part,
two
queries
instead
of
one,
if
those
queries
happen
in
separate
transactions,
which
is
very
likely
for
the
way
that
we
we
we
manage
our
connection
pooling
we'll,
have
to
acquire
and
release
those
heavyweight
locks
separately
for
those
two
queries,
so
some
very
benign
and
often
helpful
changes
on
the
application
side
and
the
schema
side
can
alter
the
rate
of
of
acquisition
of
this.
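To make the transaction-splitting point concrete, a hedged sketch using the earlier example table:

```sql
-- One transaction: the relation locks on "projects" are taken once and
-- released once at COMMIT, even across multiple statements, because a
-- lock already held is found in the backend's local lock table.
BEGIN;
SELECT count(*) FROM projects WHERE id = 1;
SELECT *        FROM projects WHERE id = 1;
COMMIT;

-- Split into two transactions (the usual effect of transaction-level
-- connection pooling): the same relation locks are acquired and
-- released twice, doubling the lock manager traffic for this work.
BEGIN; SELECT count(*) FROM projects WHERE id = 1; COMMIT;
BEGIN; SELECT *        FROM projects WHERE id = 1; COMMIT;
```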
A
And that's, I think, what we don't have a good way to measure at this point. Query rate is the best surrogate that we have, and it's not enough. When I say it's not good enough, what I mean is: during the incident, we knew that each replica DB was getting about 60,000 queries per second. When we reintroduced the two additional replicas, that went down to about 50,000 queries per second, and we were okay at that point, and we were not okay at 60k. But historically, a month earlier, we had been perfectly fine at 60k, and I think the difference is the rate of how often we acquire these lock manager lightweight locks.
B
Can I interrupt you and ask some clarifying questions? Because I want to keep up with what you're saying. So these are replicas; they serve read traffic. But read queries still require heavyweight locks? These are not locks that are happening because we're modifying something, but read queries require them anyway?
A
That's true. I should also clarify that the same thing happens on the primary database, but what we starved for was on the replica databases. The primary is just as susceptible, though, so yeah.
A
Yeah, there are several modes that heavyweight locks come in, and the weakest mode is called access share, and it only conflicts with access exclusive.
A
There's a big caveat that I'm trying to find the right place to add. Because this is historically a known contention point, Postgres introduced what it calls a fast path for trying to acquire a heavyweight lock, which essentially says: under the right conditions, we can avoid having to update the shared memory data structure and only update a local hash table instead. And the requirements... I'm going to gloss over the requirements for it for now.
B
So naively, every query needs to take a heavyweight lock, and they've done lots of optimizations to reduce the pain of this, but it still happens. And then the other problem you're saying is that, depending on things like joins in queries, you don't know how many heavyweight locks a single application query needs. So you can get unlucky and run a bunch of queries that don't hit an optimized path and do need a whole lot of locks.
B
And then we end up in this zone where we have to update one of these 16 things too often, and that creates contention.
A
Yes, great summary, thanks. Yep, absolutely. So, back to the graph: there are two visual aids I wanted to kind of talk through. One of them is this graph, where we can see... this is obviously the incident that we're talking about, but this is a stacked graph, grouped by replica. Sorry, the vertical axis is counting how many backends, at the moments that we polled the state, were stalled waiting for this particular kind of lightweight lock.
A
So if we go to just a line graph, we can see that, on the most badly impacted replica, there were 659 stalled Postgres backends, all of which were waiting for this type of lock. And this is really, really serious: practically all of them were stalled waiting.
A
These lightweight locks are acquired and released on a scale of microseconds, so there are many, many of these events that we're not actually witnessing. We can take this as a very small, representative sample of the amount of contention that's happening. And I wanted to mention that mainly because these other points in time, these other kind of normal weekdays, also exhibit this kind of contention, and we're only seeing the tip of the iceberg for them as well. As a practical matter, it's normal to have contention over, you know, locks and lock-like data structures for short periods of time. It's really only a problem when the contention escalates to the point where it impacts latency and limits throughput. And the difference between the workload on this terrible day and these more normal days wasn't too huge. It really was just that we pushed a few more percentage points toward this replica by taking a few of its peers out of use. Like, we went from about sixty thousand to fifty thousand when we kind of backed off during this day, which is the same place this adjacent day was at, and you can see that we still have contention on those days when we've got 50k queries per second hitting the replicas, but it's tolerable; we're not violating the Apdex. We most definitely were violating the Apdex on this day.
A
We are in a dangerous position for having this come back, especially if we take one or both of those old replicas away. I think most of the folks here are probably aware, but all of these database servers are running kind of old VMs; they're the N1 machine family, and we want to switch them to N2, or the AMD variants of N2. And these last two, the nodes 101 and 121, are specifically those newer machine-type candidates, and there's been some hopeful talk about: maybe it will be okay when we upgrade all of the replicas, or rather all of the nodes, to these newer machine types. They definitely performed better during the incident, and I'll kind of show here that we didn't have any samples where they were overwhelmed. But I also want to point out that they also only got a slight bump in the query rate.
A
So yeah, sorry, I kind of glossed over the details. Prior to the incident... let's see... 04 is the primary, so I'm going to leave it out of the mix here. Prior to the incident, we had 01 through 03 and 05 through 07, and those are the old six, and then 101 and 102 were the seventh and eighth nodes. We had removed nodes 06 and 07, which are old nodes, the day before the incident, during the APAC shift, and...
A
The old ones, that's correct, yes. And then we put the old ones back in service to prevent a recurrence of the incident the next day, so yeah. Why did they do better?
A
I think they did slightly better, and it was just barely enough. But that's a fantastic question, and that's kind of where I was going. For purposes of this conversation, I'm going to talk about CPU speed really in terms of instructions per second, so assembly instructions per second. If we say, and I'm totally making these numbers up by the way, that these N2 machines were able to achieve 10 or 20 percent more instructions per second, then the amount of time spent holding each occurrence of this lock manager lightweight lock would be proportionally shorter and therefore less likely to be contended. So I think that this gives us marginally larger headroom, because you only get contention when the rate of acquiring the lock times the mean duration of holding the lock runs up close to the saturation point.
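A rough back-of-the-envelope form of that saturation condition (the symbols and numbers are illustrative, in the same made-up spirit as the speaker's):

$$ U \approx \lambda \cdot \bar{t}_{\text{hold}} $$

where $\lambda$ is the acquisition rate of one lock manager lock and $\bar{t}_{\text{hold}}$ is its mean hold time. For example, $\lambda = 3 \times 10^5$ acquisitions per second with $\bar{t}_{\text{hold}} = 2\ \mu\text{s}$ gives $U \approx 0.6$; queueing delay grows sharply as $U$ approaches 1, which matches the cliff-like tipping point described here.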
And I think that we're just barely avoiding that under normal circumstances on the old nodes, and the new nodes have a higher instruction throughput, so they are a little bit further away from the lock being held 100 percent of the time. Does that make sense?
C
Yeah, thanks. Then everything we're looking at here is suffering from this, like the resolution problem that you brought up. So even the new nodes that barely show up when you highlighted them could already be super close to the tipping point; they're just a little bit farther off than the older ones.
A
Yes. We only see spikes on this graph once we've reached that saturation point. So we could be 90 percent of the way there and not see anything here; it's not until we actually cross that line. And this is very susceptible to, I don't want to call them microbursts, but, you know, small-scale variations in the lock acquisition rate can cause spikes on this graph and lead to contention.
A
So, the other thing... I really do need to find my other issue; why did I not keep that tab open? I think this is... yeah, okay. So this is just kind of walking through the effects of the saturation; I'm going to gloss over that. This is the graph we were just looking at, zoomed in on that day, so it's a little bit easier to see the shape of it.
This is, I think, a two-week timeline, so we can see adjacent days and how it looks there. Let me just find a green version of what we were just looking at.
Yes, this one. Okay, so I mentioned that these particular locks, the lock manager lightweight locks, are acquired on a microseconds time scale.
I'm going to gloss over the details of how I captured this, unless someone's interested, because I find it very interesting, but I'm going to just focus on the data right now. So, for a 10-second time span, we captured every time we tried to acquire one of these 16 lock manager locks and it was not immediately available.
A
In other words, when it was contended, how long did it take to acquire the lock? This is the distribution we're seeing. So in 10 seconds we had maybe 45,000 or 50,000 points where it was contended, and most of them resolved in less than four microseconds. But the long tail is what really worries me. And this was just a random 10 seconds, by the way; it wasn't during an incident. This was just: I got up, I ran my capture utility, and this was the result. There's a lot of variation in this long tail at different points in the day, and I've seen this long tail in production go several times higher, where we're waiting for several milliseconds to acquire the lock. I suspect those are the stalls that we were actually witnessing in the spikes we were looking at on the other graph. So understanding the long tail, why we have these ridiculously long stalls, is...
A
This is one of the open questions that I'd like to try to answer, and I've got a few hypotheses about what could be causing it. Some of them I've ruled out, and some of them are still kind of in the running. For a while I thought that hyperthread contention might potentially play a role, and I think that's unlikely at this point, because... sorry, I shouldn't talk about discarded theories. So this is an open question. But I guess the main reason I wanted to show this distribution is to put some hard data in front of you. Oh, and I forgot to mention: there are two lock modes that come into play with lightweight locks, a shared mode, which is a reader lock, and an exclusive mode, which is really a writer lock. These lock manager locks are almost always acquired in exclusive mode.
There are cases where it's acquired in shared mode, but the analysis that we've done here really highlighted that it's the exclusive mode where the contention is happening. And if we capture every event, every call to LWLockAcquire, rather than just the contended ones, it highlights that we far more often acquire in exclusive mode rather than shared mode.
So this is the mode to focus on, and unfortunately that means that contention escalates very quickly. I'll stop sharing now, just because there's nothing interesting to see there.
A
I think this will make everyone much more comfortable in planning out our capacity requirements, as well as being comfortable making query changes or schema changes. We know that some of our tables are too big, and we know that the solution to this is to partition them, and there are lots and lots of great benefits to doing this partitioning.
But that is also something that can potentially increase the risk of this lock manager LWLock contention, particularly if we have any queries that don't do efficient partition pruning as part of the planning process.
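A quick way to see whether a query prunes partitions at plan time is to check its plan; a sketch, assuming a hypothetical "events" table partitioned by project_id:

```sql
-- If pruning works, the plan scans (and therefore locks) only the
-- matching partition; without the partition-key filter, every partition
-- and all of its indexes must be locked.
EXPLAIN (COSTS OFF)
SELECT * FROM events WHERE project_id = 1;
```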
A
Exactly, and to be perfectly honest, that's one of my biggest concerns: making developers afraid to move queries off the primary and onto the replicas. It's super helpful for them to do those migrations, because we can scale out the replica fleet, and we can't scale the primary without a major architectural change.
So yeah, I want them to feel comfortable doing that very important work. I'm glad you mentioned the fast path again. I'll put this in the issue, but I wrote up a really concise bullet-point list of the prerequisites for hitting the fast path, and many of our queries do hit it. So one of my ideas for follow-up is to try to identify queries that don't.
Like, one of the requirements is: each backend has 16 slots for recording fast-path locks, and if we fill up those 16 slots within the transaction, we can't use the fast path anymore for the rest of the queries in that transaction. And effectively, that's a hard-coded number, so... but...
B
And, like, to focus on good metrics, so that we can do the capacity planning. Yes.
A
Exactly. So that's kind of the way I'm leaning, although I also kind of want to be able to give advice about, you know, how to construct queries to hit the fast path more often. But really, I agree: I don't think it's practical to expect everyone to become an expert in this particular, really weird optimization goal.
A
Yeah, yeah, exactly. Kind of crystal ball time: I think one of the easiest ways for us to run into this problem is to add an index to a frequently accessed table. And perhaps, you know, some set of existing queries were already joining that table to one other table, and the total number of indexes across those tables...
So you've got two tables, and if you've got 14 indexes on those tables and we add one index to either of those tables, guess what: you no longer qualify for the fast path. So now, every time we run that particular query, it has to go to the shared memory lock table and compete for these LWLocks. And this is no one's fault. It wasn't a code change; it was a schema change, or vice versa.
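A rough pre-flight check for that failure mode; "projects" and "issues" are hypothetical example tables, and the threshold is the 16 fast-path slots mentioned above:

```sql
-- Count the relation locks a two-table query would need: each table
-- plus all of its indexes. A result above 16 means the query can no
-- longer take all of its locks via the fast path.
SELECT count(*) AS relation_locks_needed
FROM (
    SELECT 'projects'::regclass AS rel
    UNION ALL
    SELECT 'issues'::regclass
    UNION ALL
    SELECT indexrelid::regclass
    FROM pg_index
    WHERE indrelid IN ('projects'::regclass, 'issues'::regclass)
) t;
```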
A
I'm gonna show why the answer is no. And Bob, you may have seen this, but this is cool enough that I think it's worth 60 seconds, if I can find the thing in that short amount of time. Oh no, the zoom widget's in the way. Is this it? That's not it!
A
That's it! Okay, okay. So the way BPF instrumentation works is: you want to catch calls to a function, say the entry to a function like LWLockAcquire. The way that works is, in the binary, the first instruction that's part of entering that function gets replaced with a single-byte instruction called int3, which is effectively a call out to an interrupt handler.
Every call to that function is going to hit this instruction, and whatever the original instruction was, whether it was a single-byte or multi-byte instruction, we will run that instruction after calling the hook that the interrupt handler is handing off to. So what we're seeing in this trace is: we're observing with perf-based CPU profiling that's capturing a stack trace from our Postgres processes 99 times a second.
So this is just standard CPU profiling on a timer basis, and I ran that for 60 seconds. During that 60-second window I also ran the 10-second BPF tracer, and extracted only the stacks where that tracer was active, where we're calling the LWLockAcquire function. And you can see, just proportionally on the graph, this is how much time we spent in LWLockAcquire normally.
B
It's because eBPF runs in the kernel, so you cannot run your BPF program without going into the kernel and back. So, like, you never want to do extra function calls, and there's a reason this isn't... it's almost always terrible to make a user-space function into a system call, yeah.
B
Frequency, that's true. If it's something like you're doing writes on a socket, then that doesn't happen that often, and it's a system call anyway. Actually, then you wouldn't even have to, because it is a system call.
A
Yeah, exactly. So that's why we can't do this approach. There are a few other models we could maybe use. This is highly speculative and may not actually be practical, just to preface it with a caveat. The two kind of harebrained ideas I'm entertaining right now are: one, we could...
We could do the lightweight CPU profiling with perf that we usually do, and count how many times we observe LWLockAcquire being called in that timer-based sampling.
The big caveat is that doesn't tell us which kind of lightweight lock, and there are, I don't know, 80 or so different kinds of lightweight locks, so it's not always going to be this kind. But we can infer from the rest of the stack, because... oh yeah.
I think so. So that's a maybe. It's definitely lightweight enough to do. It may be useful or it may not be useful; I'm not sure yet, I haven't tried it.
B
The problem with any of these things is, if you want to have a utilization metric, like, what is 100 percent? Yeah.
B
Also, if we were able to get those histograms: you don't want high tail latency, but, like, what is the worst that it can get? So if we could measure how many things are waiting, how long is the queue allowed to be?
A
Yes, yeah, exactly. And the wait event sampling that we're doing from pg_stat_activity: once we've crossed that line into having contention, we can kind of estimate it from there, but before we reach that point, it's really hard.
A
So the other harebrained idea I have, which also may not work out, is: there's another catalog table called pg_locks that describes what heavyweight locks are currently held by open transactions, and one of the fields it includes is whether we acquired this lock through the fast path. So that can let us differentiate between whether we're acquiring the lock on the fast path or not.
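A minimal sketch of what that sampled metric could look like; the aggregation is my assumption, not a query shown in the talk:

```sql
-- Share of currently held relation locks that bypassed the shared
-- lock table via the fast path. Sampled like pg_stat_activity above.
SELECT count(*) FILTER (WHERE fastpath)     AS fastpath_locks,
       count(*) FILTER (WHERE NOT fastpath) AS shared_table_locks
FROM pg_locks
WHERE locktype = 'relation';
```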
Polling that the same way we poll pg_stat_activity, you know, a couple of times a minute, may not give us a lot of insight. I'm not really sure, but I figure it's worth a shot. So that's the other thing on my list to try out, to see if we can (a) get useful information from it and (b) come up with a way to make a metric out of it. That's as far as I've gotten so far on ideas for how we can make a metric to measure utilization, or...
A
And how many of the backends are holding more than 16 locks that were not on the fast path... or, I guess that's a little silly, because... never mind, forget that. If it wasn't fast path, it doesn't matter why it wasn't fast path. It wasn't fast path, yeah.
A
Yeah. So that's kind of the whirlwind tour; hopefully that was interesting. And, circling back to what we do about it: I think it's important to get a utilization metric, and it's also important to avoid incidents. I feel like we're currently in a dangerous spot where, if we lose a replica, whether it's planned or unplanned, we'll be in a state where we could potentially, you know, have an incident. So I think we should add at least one additional replica, like, nowish.
B
There's one more question I wanted to ask about this. You said this is an old problem in Postgres that they've been optimizing over time, yes? What do other people do about this?
B
Yeah: other big Postgres sites that run into this bottleneck, what did they do about it?
A
Well, the last time I ran into this bottleneck, at a previous company about six years ago, we did actually rewrite queries. We identified which queries were the greediest consumers, and we rewrote them with this as an optimization goal: reducing the acquisition rate for the lightweight lock.
In our case, we knew what the candidate queries were, and it was a small enough number that we could afford to manually look at them and do some tuning, and that got us most of the way out of the pinch. I also built a custom Postgres build that gave us more lightweight locks in the pool, and a few versions later the community did the same thing, which is why we have 16 now as a standard.
B
I guess that's... is adding replicas something people do?
A
Yeah, yes, I think so. I'll see if I can dig it up: I think I saw a write-up from one of the Postgres-as-a-service clones, it might have been Aurora, but I can't remember for sure, that was talking about how to interpret contention over lock manager lightweight locks in particular.
So that's exactly the problem we're talking about, and they were talking about increasing the number of replicas. Which, given that their Postgres-as-a-service makes more money if you run more replicas, I kind of take with a grain of salt, but it is good advice also. So yeah, great question. I guess that's kind of my half-baked answer: yeah, adding replicas, and trying to optimize queries to reduce the acquisition rate.
I guess, from our perspective: we do a lot of caching of query results. So from the application side, if there are frequently run queries that we could afford to cache more aggressively at any of the other layers, that's another way to mitigate, by just reducing the lock manager acquisition rate.
A
Cool. I've only documented about half the research I've done so far, and a lot of the flame graphs we looked at are not in the issue yet, but I intend to get them in there tomorrow. So if you happen to want to see more of that, give me a ping, or just wait a couple of days and they'll be there.