From YouTube: IETF112-ICCRG-20211108-1600
Description
ICCRG meeting session at IETF112
2021/11/08 1600
https://datatracker.ietf.org/meeting/112/proceedings/
B
All right, it's a minute past, so let's get started. Welcome to ICCRG at IETF 112. We are meeting after about eight months, so I hope you had a good eight months, and I'm looking forward to our session today. It's a pretty packed agenda and I hope to get through all of it.
B
I'll start off with the IRTF Note Well, which is pretty similar to the IETF one. If you're not familiar with it, you should read it carefully. At a high level, I will simply point out that by participating you are agreeing to follow these IETF processes and policies regarding the intellectual property of anything that you share. You are expected to file IPR disclosures on anything you contribute that might have IPR that you're aware of; otherwise, read the Note Well for more details.
B
Generally,
I
will
just
point
out
one
thing
on
the
code
of
conduct:
please
work
respectfully.
This
is
a
group
of
people
from
many
different
backgrounds
and
many
different
with
many
different
views
be
respectful
and
have
respectful
discussions
and,
finally,
the
goals
of
the
irdf,
specifically
pointing
out
that
the
irtf
conducts
research
and
it
is
not
a
standards
organization.
B
So as you engage, as you participate, and as you discuss items, bear in mind that we are not trying to standardize anything here. With that, let's get to the agenda. We have a number of things today. We're going to start with a different item than usual, which is source priority flow control in data centers from JK Lee, who's at Intel, and I will let him talk about this.
B
After that, it's going to be more things that you are used to. We're going to go on to a CCID for BBR for DCCP, followed by a presentation from Ayush Mishra on the game theory behind running CUBIC and BBR. You've seen Ayush in the past at ICCRG, and I'm looking forward to this presentation. Then we have updates on BBRv2 from Praveen, Neal, and Ian, and Praveen is also going to give us an update on rLEDBAT.
B
Yes, hi? Oh, there you go, wonderful. For all the speakers: you can present your own slides. You just have to go to the little start-slide-share button, which is right next to the raised-hand button on the left. There we go, and I will allow you to do that, which you should now be able to.
C
Thank you very much, everyone. I'm JK Lee, a principal engineer from Intel. This is my first time attending IETF, so I'm glad to be part of this forum. I'm going to talk a little bit about source flow control, or a somewhat simpler form of it, which is source priority flow control, source PFC. This has been a collaboration mostly within Intel, and recently we started to discuss it in the IEEE 802.1 community as well.
C
Oh sorry, I need to click here. Good. I don't think I need to go through what kinds of congestion conditions there are in a data center. There are multiple different types, but the one we are looking at right now is the incast condition, which is mostly caused by many-to-one traffic patterns. It mostly happens at the last-hop switch, that is, the receiver-side top-of-rack switch, and it drastically affects the tail latency because of the large queuing delay, or it can even cause packet drops. This tail latency is well known to have a large impact on application performance for at-scale metrics, especially when incast happens due to line-rate RDMA senders, because they really do start sending at line rate, and when messages and connections are small, a message can finish in a really short amount of time. That means it doesn't really give the congestion control many RTTs to detect, converge, and react to the congestion. So fast reaction, possibly at sub-RTT time, is preferable for RDMA incast in the data center.
C
So yeah, obviously for congestion, the community has been working on end-to-end congestion control for many, many decades. At a high level, it is end-to-end signaling: congestion information travels in the forward direction with the data packets and is echoed back by the receiver, so that the sender can adjust its transmission rate and congestion window. So it is designed to cope with ongoing congestion. If, for example, a new sender arrives at the tail of a heavily congested queue, then the first packet of this new flow has to wait until all the other queued packets are dequeued and finally hit the receiver, and only then can it be echoed back. There is also the nature of AIMD, the typical rate adjustment mechanism.
C
It
takes
multiple
rtts
to
actually
flatten
the
curve
like
if,
if
the
rate
adjustment
mechanism
is
a
cut
rate
by
half
open
the
reaction
architecture
of
congestion,
that
means
that
16
to
one
in
case
will
take
another
rtt
for
eight
to
one
four
to
one
down
to
one
to
one
and
eventually
we
want
to
really
cut
the
rate
down
to
zero
if
there
is
a
having
cast
so
that
we
really
flatten
the
curve.
That
means
that
many
rtt
times
will
be
required.
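As a rough back-of-the-envelope illustration of that point (my arithmetic, not the speaker's): halving the aggregate rate once per RTT, an N-to-1 incast needs on the order of

```latex
t_{\mathrm{react}} \;\approx\; \mathrm{RTT}\cdot\lceil \log_2 N \rceil
```

before the offered load matches the bottleneck rate; for example, 16:1 goes to 8:1, 4:1, 2:1, 1:1 in about four RTTs, and draining the queue that built up in the meantime takes additional time on top of that.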
C
At the same time, there are a number of flow control mechanisms, mostly at layer 2, the best known being IEEE PFC. They are really meant to prevent congestion packet drops from the beginning, by employing an XON/XOFF-style low-latency reaction mechanism: detection and reaction should happen within one microsecond, as required by the standard. But it is hop-by-hop flow control.
C
So although it can avoid congestion packet drops from the beginning, it effectively slows down the fabric, because of the many head-of-line blockings happening at the inter-switch links, and the backpressure can propagate from the congestion point towards the upstream switches and eventually to the senders.
C
So in this presentation we are going to stress the need for a new layer-3 flow control mechanism, which can give us immediate detection and reaction, and the layer is better to be layer 3 so that we can reach across the data center.
C
This is the one-slide summary of our proposal. The key idea is very simple: at the congested switch, we first compute the minimum time required to drain the target incast queue, which you can think of as something like the expected sojourn time of the queue. A signal packet is then generated carrying this information backwards to the incast senders, and there are two different ways for us to consume this information. The first one is at the sender side.
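As a rough illustration of the kind of computation the congested switch would perform (not taken from the draft; the function name and the target-depth choice are assumptions), a minimal sketch:

```python
def pause_time_us(queue_bytes: int, target_bytes: int, drain_rate_gbps: float) -> float:
    """Minimum time, in microseconds, to drain the incast queue down to a
    target depth (e.g. an ECN-marking threshold), assuming the egress keeps
    draining at full line rate and no new traffic arrives."""
    excess = max(queue_bytes - target_bytes, 0)
    drain_rate_bytes_per_us = drain_rate_gbps * 1e9 / 8 / 1e6
    return excess / drain_rate_bytes_per_us

# Example: 2 MB of incast backlog, 200 KB target, 100 Gbps egress
# -> roughly 144 microseconds of requested pause.
print(pause_time_us(2_000_000, 200_000, 100))
```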
C
The sender-side top-of-rack switch can convert this layer-3 signaling packet back to standard PFC to directly pause the sender NIC queues; we call that source PFC. Or the signaling information can be forwarded all the way back to the senders, and then the sender NIC hardware or host networking stack can directly consume that information and pause the offending flow; we call that source flow control, or SFC.
C
With that, the next slide is a simple diagram depicting the behavior of source PFC.
C
What we're looking at here is a simple cartoon data center with two senders and one receiver, and there's an incast happening. Typically, when we enable PFC end to end, the destination top-of-rack switch will generate PFC (priority flow control) frames, and this will pause the upstream aggregation or core switch links. Instead of that, what we do here is assume some simple mechanism at the switch ingress which can learn about the ongoing congestion at the switch egress.
C
So even before we forward the packet from ingress to egress, the ingress pipeline has the capability to generate a signaling packet back to the senders. Here, for source PFC, we are assuming, and we actually implemented, that the source-side top-of-rack switch can simply convert this layer-3 signaling packet to PFC frames, so that it can immediately pause the incast senders.
C
So the entire detection and reaction can happen within sub-RTT time, where the RTT is the congestion-free base RTT. And we are not really aiming to replace end-to-end congestion control here; this is more of an emergency brake, a reaction to heavy incast. Because we are not pausing any of the inter-switch links, there is no head-of-line blocking happening on the inter-switch links, and no PFC side effects are expected.
C
Of course, we could deploy this mechanism at every switch in the data center, but from our simulations and test beds, upgrading only the top-of-rack switches gives us most of the bang for the buck, because incast mostly happens at the last-hop switch, and the signaling between these top-of-rack switches can still give us a pretty good reaction to most of the heavy incast.
C
Meanwhile, while the heavily congested queue is being drained, the end-to-end congestion signal will eventually be received by the receiver and then echoed back to the sender. So the point here is that the source PFC reaction can be much faster than the end-to-end congestion control reaction to heavy incast, especially when the queue depth is pretty large.
C
This is a very simple testbed experiment to show the benefit of source PFC. Here we have two switches and multiple senders and receivers: senders mostly on the left-hand side and receivers on the right-hand side.
C
There are two incast flows happening at the same time. The experiment is designed such that the link between top-of-rack switch 1 and ToR 2 will be head-of-line blocked, mostly by the incast happening at receiver 2: PFC will pause the uplink port, which will in effect create head-of-line blocking also for the flows from sender 1 to receiver 1.
C
We have a case with remote PFC, or source PFC, enabled, and you can see that the queue depths at the congested links are pushed down drastically, sometimes by an order of magnitude or more. With that, you may wonder what the throughput performance would be, so this is the measurement of the flow completion time of more than a thousand flows of RDMA write requests.
C
We see in the CDF of the flow completion time that source PFC performs better than traditional PFC.
C
You may wonder what information we need to carry in this layer-3 signaling packet. The key idea is very simple, and we have some more detailed backup slides at the end on how we can convey this signaling information in IEEE 802.1Qcz, the draft where CIM means congestion isolation message.
C
Excuse me. So in the source PFC mode, we mostly just need to swap the source and destination IP addresses of the data packet for the newly generated signal packet. When we generate a signaling packet, we take the source IP from the data packet and use it as the destination IP of the signaling packet. With that, the signal packet will be forwarded back to the sender, and we can still carry the original destination IP of the incast traffic.
C
That can optionally be used to cache some of the pause time at the sender-side top-of-rack switch; I can talk about that more later. We should also carry something like the DSCP or VLAN PCP, whatever QoS priority information is needed to identify the actual PFC priority queue to pause at the sender NIC. And, most importantly, we carry the pause time duration, or expected sojourn time.
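A minimal sketch of how such a backward signaling packet could be assembled from the triggering data packet (the field names are illustrative assumptions, not the 802.1Qcz or draft encoding):

```python
from dataclasses import dataclass

@dataclass
class SfcSignal:
    src_ip: str        # congested-switch / destination side
    dst_ip: str        # original source, so the fabric routes the signal back to the sender
    orig_dst_ip: str   # original destination of the incast traffic (for optional caching)
    dscp: int          # QoS priority, used to pick the PFC priority queue to pause
    pause_us: float    # requested pause / expected drain duration

def build_signal(data_pkt: dict, pause_us: float) -> SfcSignal:
    # Swap source and destination so the signal travels back toward the sender,
    # and copy the priority information from the offending data packet.
    return SfcSignal(
        src_ip=data_pkt["dst_ip"],
        dst_ip=data_pkt["src_ip"],
        orig_dst_ip=data_pkt["dst_ip"],
        dscp=data_pkt["dscp"],
        pause_us=pause_us,
    )
```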
C
It should be smaller than or equal to the minimum drain time to reach the target queue depth. Here the target queue depth could be something like an ECN threshold, or slightly lower than that. When we tried different values, they didn't really make a big difference, because the SFC reaction is really fast. Optionally, we can carry some additional congestion-locator information, like switch or queue IDs, but that's really optional; even without that information the entire protocol behavior should be the same.
C
When we look at source flow control, the somewhat more advanced version of source PFC, because we are pausing the flow at the flow level at the transport, you may wonder how it looks different from something like ICMP source quench.
C
Just as a data point, there is actually a recent NSDI paper this year, something called OnRamp, which also implements a similar flow-level, connection-level flow control mechanism, implemented at the Linux qdisc. In order to react to and consume this information, we need some changes in the software or hardware stack.
C
We need to modify the RDMA hardware stack, and there's one example in the NSDI paper. Back to the question of how this differs from source quench, which was actually deprecated by an RFC, I think more than ten years ago: as I understand it, there are multiple reasons why source quench was deprecated. First, it didn't really specify which information to carry, or how to consume and react to that information at the sender side. In SFC we clearly specify that we just carry the pause time duration, or drain time duration, and we promote immediate flow control, so that incast senders can really stop sending immediately, rather than AIMD-style congestion control, especially for the data center. Also, source quench was really designed for, or promoted for, wide-Internet congestion handling, but we are promoting SFC for the data center, with a single administrative domain. In the case of a layer-2 data center there has been something called IEEE QCN, which is actually quite similar.
C
That answers one of the questions, and some additional questions were shared by the IETF community in separate emails. At a high level, how do we secure the protocol? We assume this will be for a single-domain data center with trusted switching devices, and I can make the argument that the signaling between switches for source PFC is similar in spirit to existing switch-to-switch protocols such as BGP. I understand that there is a BGP encryption mechanism, but in reality it hasn't really been used, for many reasons:
C
it cannot really solve the problem of a malicious or poorly implemented router, and it can actually cause additional headaches. So, as I understand it, operators have not really turned on the BGP encryption mechanisms. For the SFC (source flow control) signaling, where the sender transport reacts, we can see that this is quite similar to ECN marking, where the data center switches or intermediate routers provide some information in the actual data packet that is then consumed by the sender-side transport.
C
Here we are generating a new signaling packet instead of modifying or marking the in-band data packets, but the pattern, where the information is provided by the switches and then directly consumed by the end host, is pretty similar in my opinion; and ECN has been heavily used in data centers these days, for example with DCTCP.
C
In the end, we can simply implement ACLs at the domain boundaries, like the top-of-rack switches or maybe the gateway switches, so that this new form of signal packet cannot come from outside the domain.
C
Another question was: is it only for RoCE? Yes, RDMA is the primary use case, and RoCEv2 is the most popular transport today. But we see more new types of transport for RDMA, or RMA mechanisms, rising, and so we believe we can have many different RDMA transports scale in a similar way on a standard Ethernet fabric. We have some argument for how this can be a good fit for machine-learning training in the backup slides.
C
If you want, you can take a look. SFC can also be applied to non-RDMA use cases, similar to the OnRamp paper from NSDI, and we are currently performing some evaluations with TCP traffic. You may also wonder: the ToR-to-ToR signaling is still sub-RTT, but it can still grow in proportion to the network RTT as the network size grows, so isn't it too slow?
C
So we have a simple mechanism where we can cache the pause time per destination IP at the sender-side top-of-rack switch, and this information can be used to instantly pause other senders connected to the same switch that are heading for the same destination IP. They can be paused immediately, without waiting for their own signal to travel to the receiver-side ToR and come back.
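A hedged sketch of that caching idea at the sender-side ToR (data structure and names are assumptions for illustration; the signal object only needs the orig_dst_ip and pause_us fields from the earlier sketch):

```python
import time

class PauseCache:
    """Per-destination-IP pause state cached at the sender-side ToR."""
    def __init__(self):
        self._pause_until = {}   # dst_ip -> absolute time until which traffic stays paused

    def on_signal(self, signal):
        # Record/extend the pause window learned from a backward SFC signal.
        until = time.monotonic() + signal.pause_us / 1e6
        self._pause_until[signal.orig_dst_ip] = max(
            self._pause_until.get(signal.orig_dst_ip, 0.0), until)

    def should_pause(self, dst_ip: str) -> bool:
        # Any locally attached sender targeting a known-congested destination
        # can be paused immediately, without waiting for its own signal.
        return time.monotonic() < self._pause_until.get(dst_ip, 0.0)
```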
C
With that, this is a simple history of the mechanism. We first talked about this idea starting from April last year, in a number of presentations in the public domain, and also recently more in the IEEE.
C
Thanks for the opportunity. The plan at the IEEE is simply to extend the existing 802.1Qcz congestion isolation mechanism; it already has a layer-3 mechanism, so we can simply extend it to enable something like source PFC. But I can easily imagine that if you really want to do source flow control, for the transport to make use of this information, then the IETF may be a better forum to discuss it.
D
I was just pointing out that the basic multiplicative-decrease mechanism is an order log-N mechanism; that was from quite early in your presentation.
E
A question: there was other work recently presented in ICCRG called HPCC. Can you compare?
C
Yes, HPCC; I'm also part of that effort. It's still forward-direction signaling. You can imagine HPCC as a really multi-bit ECN: instead of just one ECN bit, it carries multiple pieces of information about the congestion, like queue depths and link utilization, but still in the forward direction with the data packets, echoed back by the receiver to the sender. So it still suffers from the same delay.
C
In our workloads, a large share of the traffic is incast, with fan-in on the order of a hundred-plus to one, so it can be a pretty severe incast condition in those cases. There we still see up to an order of magnitude improvement in flow completion time with SFC, and we could also keep the buffer occupancy smaller, with a slight improvement in goodput.
F
Yeah, okay, here we just want to double check. Hello, hello? Yes, okay. Hey JK, I have a particular question regarding your slide 6, the example. Normally in a data center, the ToR switch or the aggregation switch is going to have some ratio from the downlink to the uplink, not just the one you're showing in slide 6, where the downlink is 100 gig and the uplink is also 100 gig; normally it's something like a four-to-one ratio.
F
So it's not as congested as you're showing here. If you are using 100 gig for the downlink, you're probably going to use something like 400 gig, or even combine up to 1T, for the uplink. So in that case, are you still seeing so much improvement in your experiments? Thank you.
C
Yeah, that's a fair question, thanks. This topology has been intentionally designed to nail down the head-of-line blocking issue. When we ran a larger-scale simulation with 320 servers, we definitely created a full-bisection topology, without any oversubscription, and there we could still see a pretty good flow completion time improvement, even on top of DCQCN, actually an improved DCQCN, and also HPCC. And there was another presentation at the IEEE.
C
Sorry, yeah, this particular highlighted link, if you can click on this one: this was a measurement study done by Huawei a month after our initial presentation in IEEE 802.1. They quickly built a prototype and demonstrated that, when RDMA traffic is mixed with TCP, with, I think, normal oversubscription or maybe a small oversubscription,
C
the benefit was still pretty good in terms of tail latency. So my key answer is that there are multiple data points you can still look at.
B
All right, I have a question from the queue, from the floor, so I'll ask you this question, JK. I'm familiar with some work in the past, probably a few years ago: Timely, from the Google folks, who did some work on basically using RTT as a congestion signal within data centers, and, if I remember correctly, they did some work on an RDMA fabric.
C
Yes, yes, we are very well aware of such new congestion control algorithms designed for RDMA. HPCC is one of them, Timely, and recently Swift; I think Neal Cardwell is here today. Many of them really do much better congestion control than something like DCQCN or DCTCP, because they are really designed for low latency and high bandwidth.
C
But I think the fundamental difference is that incast can still happen, because you cannot really perfectly synchronize all the senders, especially with RDMA senders blasting at line rate. If they happen to collide within one or two RTTs, maybe no more than three or four senders, they can easily fill up the queue very quickly, and the end-to-end congestion control still has to have the signal pass through the congested queue, reach the receiver, and be echoed back.
C
So this source flow control, or what we call back-to-sender signaling, provides that incast information directly back to the sender within one base RTT, so it can handle such unintended synchronized incast. It may also handle the case where Timely or another RDMA congestion control happens to coexist with non-compliant congestion control, or where the congestion control has a much larger RTT so that it wouldn't react to the incast congestion signaling soon enough. The back-to-sender signaling really handles such different cases.
B
Understood. Well, thank you so much for that. With that, since we do not have any other folks in the queue, we need to move on to the next presentation. So thank you so much for your time, JK, and for your presentation to the folks in the group. I'll say that this is work that's going into the IEEE, and if there's any feedback that you'd like to pass along, I think it will be welcome, either on the research group mailing list or directly to JK. And with that I am going to move to the next presentation. Natalie, I hope I'm saying your name right, I'm not sure! Yes, I'm here!
B
Yes, I can hear you. I'm going to ask you to request presentation rights. Okay.
B
Yes, excellent, take it away then! Yes, okay.
G
So, first of all, the motivation to bring BBR to the DCCP protocol relies on the fact that right now, for DCCP, there are only three congestion control algorithms standardized, and all of them are loss-based. So we thought about bringing BBR precisely because it is a non-loss-based congestion control algorithm.
G
Once we finished our first implementation of BBR for DCCP, we started some evaluation in a controlled environment using a single-path and a multipath scenario. In this evaluation, we compared the performance of our implementation of BBR, which is CCID 5, with the performance of CCID 2, which is the default congestion control for DCCP.
G
So we started an analysis of this problem and we figured out what the cause was. The point is that BBR requires the restoration of the congestion window when it leaves the ProbeRTT phase, so it restores the congestion window from a quite low value to the value it had before entering this phase.
G
To solve this problem, we applied a temporary solution, which is that we trigger the sequence window synchronization, but we don't wait for the confirmation to update the local values. That means, as soon as we trigger the synchronization, we update the local value and proceed to restore the congestion window.
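A rough sketch of that interim behavior as I understand it from the talk (the names and the sizing rule are illustrative assumptions, not the CCID 5 code):

```python
def exit_probe_rtt(conn, saved_cwnd_packets: int):
    """On leaving ProbeRTT, restore the congestion window immediately.

    The restored cwnd may exceed what the current (small) sequence window
    allows, so a sequence-window renegotiation is triggered -- but, as the
    temporary workaround described above, the local value is updated
    without waiting for the peer's confirmation.
    """
    needed_seq_window = 2 * saved_cwnd_packets                # illustrative sizing rule
    if needed_seq_window > conn.local_seq_window:
        conn.send_sequence_window_feature(needed_seq_window)  # trigger the negotiation
        conn.local_seq_window = needed_seq_window             # don't wait for the confirm
    conn.cwnd = saved_cwnd_packets                            # restore cwnd right away
```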
G
Now, as I said, this is a temporary solution, and we would like to start a discussion about the best approach to solve this problem. Maybe a new or enhanced feature for the sequence window negotiation is necessary, or maybe there is a different approach that can help us solve this problem.
G
The second question is about the sequence window negotiation itself. We have described the problem, and we would like to receive some feedback about it and to start a discussion, but we are not sure what the right place to start that discussion would be: here in ICCRG, or in TSVWG.
B
Thank
you,
natalie.
That
was
a
wonderful
and
short
presentation
so
and
thank
you
for
that
last
slide.
In
particular,
I
think
it's
gauri's
in
the
queue
so
I'll
I'll.
Let
him
have
a
take
as
to
www.
H
That
we'd
be
able
to
discuss
this
on
the
mailing
list
and
make
some
progress
with
the
algorithm
and
the
proposal,
I'm
not
sure
where
the
home
would
be,
but
it
lies
between
these
two
groups
and
whatever
the
home,
you
shouldn't
be
discouraged.
Please
please
discuss
how
to
fix
it
and
please
discuss
the
issues.
H
It
was
a
great
presentation.
I
don't
know
what
the
outcome
will
be.
So
I'm
looking
forward
to
hearing
more
from
you
about
these
methods.
E
I
had
one
question
here,
so
you
refer
to
bbr
here
as
mature
right.
My
understanding
is
there's
quite
a
few
presentations
today
talking
about
bbr
v2,
which
is
the
next
evolution
of
bbr,
with
like.
E
With
cubic
etc
right,
so
my
question
is:
if
you're
going
to
place,
you
know
create
a
new
standard.
Would
it
be
better
to
wait
for
bbr
v2
to
mature?
Before
doing
this,
I
guess
that's
my
question
to
the
group.
B
All
right
so
we're
going
to
take
more
discussion
on
to
the
list
and
when
I
say
the
less
I
mean
I
see
crg
but
we're
going
to
talk
to
the
chairs
about.
Oh,
I
see
the
ad
is
in
there,
so
martin
go
for
it
well.
I
Not the AD for this, but yeah, I support this work as well, but I am a little concerned that this is racing a little bit ahead of the actual TCP BBR work, which I think probably has a little more data to support it. It seems like most BBR discussion at this point is happening in ICCRG, and you could certainly do a draft in ICCRG.
B
Yeah, that's my sense as well, but we'll take this discussion offline with the chairs of TSVWG and we'll get back on that. David, you'll have the last comment and then I'll let us move on to the next discussion.
J
Among
chairs
is
fine,
I
think
icc,
as,
as
I
think,
iccrg
is
the
right
place
to
discuss
the
technology,
and
it's
also
the
right
place
to
figure
out
appropriate
timing
when
the
timing
is
appropriate.
Tsvwg
is
almost
certainly
the
the
venue
to
work
on
the
directness
here
and
standardization,
but
need
to
get
the
timing
right
and
make
sure
that
it's
it's
it's
well
coordinated
with
bbr
as
a
whole.
B
Thank you so much, Natalie, and I hope to see you on the mailing list. People, please engage; it's good to have this mapping for DCCP as well. I will now move on to the next presentation, and that is Ayush. Ayush, are you here? I see you up there.
L
Okay, so yeah, hi everyone. I'm Ayush, a third-year PhD student at the National University of Singapore, and today I'll be talking about some very interesting work that we've been doing on studying the game theory behind choosing between running CUBIC and BBR on the Internet.
L
Okay, so since BBR was introduced in 2016, a lot of websites have made the performance-driven decision to adopt it and use it to send data for their websites, and companies like Google and Spotify and Dropbox have reported seeing lower delays and better throughput, especially in lossy networks, where loss-based algorithms like CUBIC are known to suffer. Clearly this trend has caught on, since we did a measurement study in late 2019.
L
We
found
that
close
to
eighteen
percent
of
the
alexa
top
twenty
thousand
thousand
websites
are
already
running
vbr,
and
this
18
metric
actually
goes
up
even
more
when
you
consider
the
more
popular
websites
or
websites
that
contribute
more
to
downstream
traffic,
like,
for
example,
video
streaming
websites.
L
So
the
question
we
want
to
ask
is:
where
is
this
transition
really
heading?
So
this
transition
in
the
internet's
congestion
control
landscape
is
definitely
not
a
new
thing.
We've
seen
in
the
past
that
renault
dominated
internet
in
the
early
2000s,
slowly
transitioned
into
an
internet
that
was
mainly
cubic,
dominant
and
much
like
gbr
does
today.
Even
back
then
cubic
basically
gave
you
better
throughput
and
better
utilization
guarantees
on
the
internet,
which
is
why
people
moved
on
to
it.
L
But
there
is
one
key
aspect
in
terms
of
which
this
transition
from
cubic
to
bbr
is
very
different
from
the
transition
that
we've
already
seen,
which
was
between
renault
to
cubic.
So
the
transition
between
render
to
cubic
was
essentially
between
two
window-based
loss-based
algorithms.
So
they
were,
they
both
had
the
same
congestion,
control
philosophy.
L
They
both
reacted
to
the
same
congestion
signal,
and
that's
why
you
know
we
didn't
really
face
a
lot
of
problems,
but
right
now,
as
you
have
more
and
more
websites
replacing
using
bbr
to
replace
the
existing
loss-based
algorithms.
What's
that
actually
doing,
is
it's
creating
a
paradigm
shift
in
how
congestion
control
is
done
on
the
internet?
L
So the question we want to ask is: given the performance improvement that BBR has given us so far, where do we actually expect this transition to go? Or, in other words, if you're seeing such good performance benefits, is it reasonable to expect everyone to switch from CUBIC to BBR at some point in the future?
L
So
this
is
a
question
that
we
discussed
in
a
recent
short
paper
at
epnet
21.
It
was
titled,
conjecture,
existence
of
nash,
equilibria
and
modern
internet
congestion
control
and
the
main
insight
we
found.
L
So
the
approach
we
had
to
analyze
this
entire
system
was
actually
to
calculate
the
nash
equilibrium
in
the
network,
where
the
senders
have
the
freedom
to
choose
between
cubic
and
bbr
to
maximize
the
throughput.
L
So,
let's
look
at
the
example
on
the
slide
here.
Let's
say
we
have
a
network
with
seven
senders
and
of
the
seven
centers.
Four
of
them
are
running
bbr
and
three
of
them
are
running
cubic,
and
given
this
network
configuration
and
congestion
control,
algorithm
distribution,
each
of
the
flows
are
getting
some
share
of
the
bottleneck
bandwidth.
L
L
So,
in
this
case,
we're
going
to
make
the
assumption
that,
when
alex
does
this
switch,
if
he
sees
a
better
throughput
he's
going
to
switch
to
the
algorithm
that
is
giving
him
better
throughput
or
basically
all
the
agents
in
our
network
are
going
to
make
a
performance
driven
decision
on
which
algorithm
they
want
to
run.
L
That
essentially
means
that
this
conjunction
control,
algorithm
distribution,
is
the
nash
equilibria
for
that
network,
or
basically,
this
is
the
fixed
share
of
cubic
and
bbr
flows.
We
have,
in
the
network,
there's
really
no
incentive
for
the
number
of
bbr
flows
to
increase
or
for
the
number
of
cubic
floors
to
increase.
L
Now
a
conjecture
in
the
paper
is
that
we
think
this
nash,
equilibrium
equilibria,
will
exist
in
all
kinds
of
networks
where
you
have
senders
and
senders
running
cubic
and
vbr
flows,
and
this
is
actually
quite
a
big
claim
to
make,
which
is
why
we
still
say
that
it's
a
conjecture,
but
we
have
good
reason
for
making
this
conjecture.
So
in
the
paper
we
go
over
the
exact
observations
that
we
made
based
on
how
cubic
and
bbr
interact
and
how
these
observations
actually
guide
us
towards
making
this
conjecture.
L
But
in
the
interest
of
time
I'm
only
going
to
discuss
the
key
observation
over
here,
which
will
hopefully
convince
you
guys
that
yeah
there
might
indeed
be
a
nash
equilibria.
When
you
know,
n
number
of
flows
compete
at
a
common
bottleneck,
so
over
here
I'm
going
to
plot
a
graph
for
a
system
where,
let's
say
we
have
symmetric
senders.
So
all
my
senders
have
the
same
rtt
and
they
only
differ
in
the
sense
of
which
congestion
control
algorithm
they
choose
to
run
now.
L
We
know
from
other
measurement
studies
that
when
you
have
a
very
small
number
of
bbr
flows
in
the
network,
they
can
get
a
disproportionately
high
share
of
the
bottleneck
bandwidth.
So
I'm
going
to
plot
this
as
point
a
in
the
graph
on
the
slide.
So
on
this
graph,
basically,
on
the
y-axis,
I
have
the
combined
throughput
of
all
the
bbr
flows
and
on
the
x-axis
I
have
the
percentage
of
bbi
flows
in
each
congestion
control,
algorithm
distribution.
L
So
we
can
plot
point
a
based
on
the
observation
made
by
other
measurement
studies.
We
can
also
plot
point
b,
which
basically
says
that,
when
all
the
flows
at
the
bottleneck
are
bbr
flows,
they
will
basically
use
the
entire
bottleneck
bandwidth,
which
is
really
a
no-brainer.
L
So
we
have
two
points
point
a
and
point
b
and
we
can
also
say
that
all
all
the
data
points
between
point,
a
and
point
b
will
lie
on
some
line
connecting
the
two
points
and
these
different
possible
lines.
I've
just
depicted
using
the
different
gray
squiggly
lines
of
the
slide.
L
So
the
interesting
thing
about
this
graph
is
that
when
you
actually
plot
out
these
values,
every
point
at
which
your
gray
line
intersects
the
fair
share
line
that
essentially
signifies
the
nash
equilibrium
point
in
the
network,
so
the
fair
share
line-
I'm
sorry,
I
didn't
go
over
it
earlier,
but
the
fair
share
line
is
basically
the
line
at
which
all
your
flows
get
the
fair
share.
So
if
bpr
was
getting
the
fair
share
in
this
network,
the
the
data
points
would
follow
the
fair
share
line.
L
So
let
me
actually
go
over
why
we
actually
claim
that
this
intersection
point
in
this
graph
is
going
to
be
the
nash
equilibrium.
So
to
do
so,
let's
zoom
into
one
of
these
intersection
points.
So
at
this
intersection
point
basically
what's
happening-
is
that
the
average
bandwidth
of
all
the
cubic
flows
equals
to
the
average
bandwidth
of
all
the
bbr
flows,
which
is
why
neither
of
them
wants
to
switch
to
the
other
kind.
L
But
why
do
we
say
that
this
is
actually
the
nash
equilibrium?
Well,
we
say
this
intersection
point
is
the
nash
equilibrium,
because,
let's
say
we
move
to
the
right
of
this
point,
which
would
signify
that
a
cubic
flow
in
my
current
configuration
wants
to
switch
to
running
dbr.
So
when
we
do
this,
we
will
actually
be
transforming
the
entire
system
into
a
regime
where
bbr
flows
perform
worse.
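Restating the argument compactly in notation of my own (not the paper's): with k of the n flows running BBR, let T_B(k) and T_C(k) denote the average per-flow throughput of the BBR and CUBIC flows. A distribution k* is a Nash equilibrium when no single flow gains by unilaterally switching:

```latex
T_C(k^*) \;\ge\; T_B(k^*+1) \quad\text{(no CUBIC flow gains by moving to BBR)}
\qquad
T_B(k^*) \;\ge\; T_C(k^*-1) \quad\text{(no BBR flow gains by moving to CUBIC)}
```

At the fair-share crossing point T_B(k) is approximately equal to T_C(k), and the trends on either side of that point, described next, give exactly these two inequalities.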
L
On
the
other
hand,
if
there's
a
bbr
flow
in
the
network
that
wants
to
switch
to
cubic,
that
would
move
the
distribution
to
the
left
into
a
regime
where
cubic
flows
perform
worse.
L
So,
in
both
cases
the
cubic
floor
does
not
switch
to
bbr,
because
that
would
mean
meaning
to
moving
to
a
region
where
bbi
performs
worse
and
similarly,
the
bbr
flow
does
not
want
to
switch
to
cubic,
because
that
would
mean
moving
to
a
regime
where
cubic
floors
perform
worse
and
because
there
is
no
incentive
for
any
floor
to
switch
to
the
other
strategy.
At
this
intersection
point.
This
by
definition,
becomes
our
nash
equilibrium
point
now.
L
The
graph
that
I
plotted
earlier
was
theoretical,
but
we
have
validated
these
predictions
through
actual
experiments,
so
we
had
20
flows,
running
through
different
length,
speeds
and
different
buffer
sizes
and
across
these
different
regimes
we
plotted
the
normalized
bandwidth
for
bbr,
and
we
did
actually
observe
that
at
the
intersection
point
where
the
line
crosses
the
fair
share
line
to
the
right,
pbr
actually
performs
worse
than
cubic
and
to
the
left
cubic
performs
worse
than
bbr.
L
So
to
so
in
the
paper,
beyond
the
observations,
we
also
use
these
observations
to
write
down
a
couple
of
equations,
which
we
use
to
come
up
with
the
exhaustive
proof
for
showing
that
a
nash
equilibrium
will
always
exist
when
two
floors
are
competing
and
the
two
floors
have
the
choice
to
run
either
cubic
or
bbr.
L
But beyond the exhaustive proof, we also wanted to empirically validate some of the claims of our conjecture, which says that the Nash equilibrium will always exist. So what we did was set up networks with six, nine, and twelve flows, where all these flows shared a common bottleneck, and in each experiment exactly one third of these flows had 20, 50, and 80 ms RTTs. This was basically to simulate flows of different RTTs competing with each other, and then we wanted to see how this actually impacts the existence of the Nash equilibrium.
L
So,
given
this
network
configuration,
we
basically
ran
all
the
two
power
n
combinations
of
different
flows,
running
cubic
or
bbr,
and
then
we
recorded
that
throughput
and
once
we
had
the
throughputs,
we
used
these
throughput
values
to
validate
if
any
of
those
congestion
control
algorithm
distributions
were
the
nash
equilibria.
L
So,
just
to
recap,
if
we
have
a
three-floor
system-
and
we
say
that
cbc
or
the
first
floor,
running
cubic
the
second
floor,
running
bbr
and
the
third
floor
running
cubic
again
is
the
nash
equilibrium
that
basically
means
that
when
your
distribution
is
bbc,
the
first
flow
gets
worse
throughput.
When
your
distribution
is
ccc.
The
second
flow
gets
worse
through
part
and
when
your
distribution
is
cbb,
the
third
flow
will
get
worse.
Throughput.
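A small sketch of that brute-force check (assuming you already have a measured or simulated throughput for every one of the 2^n assignments; the function names are mine, not the paper's):

```python
from itertools import product

def nash_equilibria(throughput, n_flows):
    """throughput(assignment) -> tuple of per-flow throughputs, where
    assignment is a tuple like ('C', 'B', 'C').  An assignment is a Nash
    equilibrium if no single flow can raise its own throughput by
    unilaterally switching algorithm."""
    equilibria = []
    for assignment in product('CB', repeat=n_flows):
        base = throughput(assignment)
        stable = True
        for i in range(n_flows):
            flipped = list(assignment)
            flipped[i] = 'B' if assignment[i] == 'C' else 'C'
            if throughput(tuple(flipped))[i] > base[i]:
                stable = False   # flow i has an incentive to deviate
                break
        if stable:
            equilibria.append(assignment)
    return equilibria
```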
L
So
before
I
actually
get
into
the
graphs,
there
were
a
couple
of
interesting
properties
that
we
found
while
actually
calculating
the
smash
equilibria
in
our
experiments.
L
Interestingly
enough,
we
found
out
that
in
all
our
experiments
there
was
exactly
one
nash
equilibria,
so
there
was
one
fixed
distribution
of
congestion,
control,
algorithms,
where
none
of
the
flows
had
the
incentive
to
switch
to
the
other
algorithm,
and
we
also
found
that
in
each
of
these
nash
equilibria,
the
congestion
control
algorithm
distribution
fell
in
such
a
manner
that
it
was
always
the
smaller
itt
flows
that
chose
cubic
and
the
large
rtt4
flows
decided
to
opt
for
bbr
so
later
on.
L
In
the
graphs,
when
I
say
that,
let's
say
that
a
sixth
floor
system
has
a
natch
equilibria
where
50
of
the
floors
are
running
cubic.
That
basically
means
that
220
ms
flows
and
150
ms
flow
is
running
cubic
and
150.
Ms
flow
and
two
atms
flows
are
running
bbr.
L
So,
while
actually
calculating
the
nash
equilibria,
we
experimented
with
different
link,
speeds
and
different
buffer
sizes,
and
the
entire
point
of
this
was
to
see
how
the
link
speed
and
the
buffer
sizes
impacted,
where
the
nash
equilibria
lied.
Predictably,
buffer
size
had
the
biggest
impact
on
the
on
the
distribution
at
the
nash
equilibria.
L
So
when
your
buffer
size
was
deeper,
you're
more
likely
to
have
floors
opting
for
cubic
rather
than
your
buffer
size,
when
your
buffer
size
is
shallower,
and
this
makes
sense
because
cubic
is
a
buffer
filling
algorithm
and
it's
likely
to
be
more
aggressive
when
you
have
deeper
buffers.
L
We
also
tried
changing
the
rtt
distribution
to
see
if
that
made
any
impact
on
where
we
earlier
saw
the
nash
equilibria,
and
we
found
that
there
was
very
little
in
impact
on
where
what
the
distribution
of
algorithms
actually
was
at
the
nash.
Equilibrium
point.
L
So,
to
summarize,
the
findings
of
our
short
paper
is
that,
despite
bbi's
current
throughput
benefits,
we
think
it's
unlikely.
That
cubic
is
going
to
disappear
soon,
and
this
is
because
we
think
that
dbr's
performance
benefits
that
we
see
on
the
internet
today
are
going
to
wane
as
more
and
more
people
on
the
internet
start
running
bpr.
L
And lastly, I would like to note that we can make all the fancy predictions of having different kinds of Nash equilibria, but the Internet does not actually follow economic game theory exactly, so it's not a given that the Internet will move towards a Nash equilibrium. But given the fact that a lot of people on the Internet are likely to make the decision between CUBIC and BBR based on performance, we think it's likely that even if we don't reach the Nash equilibrium, we are going to move in that direction.
L
Now,
obviously,
there's
a
lot
of
future
work
to
be
done
in
this
paper.
We
want
to
come
up
with
a
formal
proof
for
general
inflow
game.
We
also
want
to
look
at
the
effect
of
more
complex
network
utilities.
So
in
our
paper
we
assumed
a
very
simple
utility
function
where
every
flow
wanted
to
maximize
its
throughput.
But
obviously
that's
not
true
on
a
real
network
flows
are
likely
to
care
about
both
throughput
and
delay,
and
the
utility
function
is
likely
to
be
a
combination
of
these
metrics.
L
We
also
want
to
look
at
the
effects
on
the
congestion
control,
algorithm
distribution
at
the
nash
equilibria
in
the
presence
of
bbr,
v2,
multi-hop
paths
and
eqm's,
and
also
how
things
change
when
you
have
very,
very
deep
buffers
and
much
larger
number
of
flows.
L
But
there
is
one
aspect
in
which
the
large
flow
experiments
differ
from
the
experiments
that
we've
done
in
the
short
paper,
and
that
aspect
is
that
when
you
have
very
deep
buffers
and
very
large
number
of
floors,
things
are
not
as
nice
and
clean
as
having
one
nash
equilibrium
point
generally,
we
found
that
there
exists
a
region
or
a
window
within
which
you're
likely
to
get
a
nash
equilibria.
L
Okay,
so
thank
you
for
your
time.
That's
all
I
have
for
you
today
and
I'd
like
to
take
questions.
If
there
are
any
now.
B
Thank
you
so
much
for
your
time.
Irish.
This
is
a
very,
very
interesting
piece
of
a
number
of
people
in
the
queue
already,
but
I'm
going
to
ask
a
question
before
I
get
in
there.
I
actually
took
two
quick
questions.
One
of
them
is
that
you
seem
to
suggest
cbb
and
bbc
as
two
different
experiments,
and
I
want
yes
that
seems
to
me.
I
said
the
order
in
which
flows
entering
choose
makes
a
difference.
L
So
basically,
we
are
in
this
notation
we're
not
assuming
that
flows
are
symmetric
or
there
are
a
bunch
of
flows
that
have
this
that
have
same
rtts.
So
each
flow
here
is
distinct.
You
can
assume
each
flow
has
different
rdt
and
therefore
is
a
separate
entity.
B
I
see
yeah
okay.
Well,
maybe
I'll
ask
you
later,
but
I
can't
see
the
difference
then,
between
the
first
set
up
there,
the
first
experiment
then
the
third
one,
because
to
me
they
seem
to
be
this.
Okay,.
B
B
K
Hi
thanks
this
is
this
is
quite
interesting.
I
I
do
have
some
kind
of
extra
complexity
to
add
on
to
this
whole
thing,
which
kind
of
at
least
my
thoughts
on
how
this
replicates
or
does
not
replicate
the
real
world.
So
it's
it's
worth
it
noting
that
the
most
valuable
flows
are
commonly
the
short
ones
that
are
very
accurate.
K
The
conical
example
is
things
like
search
and
ads,
and
in
the
case
of
my
employer,
you
generate
a
lot
more
value.
You
certainly
want
more
value,
provide
something
like
youtube,
video
and
given
the
amount
of
value,
it's
actually
kind
of
incentive
compatible
to
make
sure
your
your
long-term
flows
are
not
too
aggressive
and
that
they
move
out
of
the
way
quickly
when
something
that's
high
value.
K
Like
start,
your
ads
comes
up,
and
the
the
other
thing
to
note
is
it's
not
uncommon
to
have
between
10
and
20
connections
for
a
single
page
load
on
the
internet
and
when
you're
dealing
with
that
kind
of
chaotic
environment
where
nothing
really
gets
out
of
startup
or
very
rarely
it's
it's
very
difficult
to
reason
about
the
congestion
control
performance.
Right
like
like
the
congestion
avoidance
phase,
is
basically
like
irrelevant.
K
You
can
largely
like
remove
it
from
the
congestion
controller
and
it
would
like
largely
work
the
same
for
search
and
a
number
of
other
major
websites.
It
obviously
matters
intensely
for
youtube
that
matters
intensely,
for
you
know
a
large
flow
like
an
uploader
or
download,
but
but
I
guess
even
for
a
given
provider,
it
might
actually
be
instead
of
a
compatibility
like
make
your
congestion
avoidance
scheme,
not
too
aggressive,
to
make
sure
that,
like
smaller
flows,
which
are
higher
value
like
our
favorite,
and
so
I
think
I
think
it's
complicated.
K
Made
sure
that
this
was
not
a
problem
and
make
sure
that
there
was
no
negative
impact
on
search
latency
when
we
launched
pvr
originally,
and
we
did
a
bunch
of
studies
and
couldn't
find
anything
so
as
an
anecdotal
yeah.
L
Yeah,
I
I
think
all
those
are
fair
points,
and
I
completely
agree
with
you
that
this
is
extremely
complex
problem.
In
fact,
you
mentioned
the
flow
durations
and
how
flows
of
different
iterations
might
have
different
metrics,
and
they
might
want
to
optimize
for
different
things.
L
So
yeah,
all
those
things
definitely
complicate
things
a
lot,
but
currently
from
what
we
are
working
on
is
the
assumption
that
all
your
flows,
that
all
the
flows
that
you
care
about
are
substantially
long
such
that
they
enter
congestion,
avoidance
mode,
and
then
we
want
to
see.
You
know
how
how
performance
is
going
to
change
for
these
considerably
longer
flows.
I
All
right,
martin,
duke
thanks,
it's
a
very
creative
way
of
approaching
the
problem,
but
there's
a
little
discussion
in
the
chat
because,
of
course,
the
term
rtt
is
a
little
overloaded
in
our
in
our
field,
right
where
sometimes
it
includes
the
buffer.
Sometimes
it's
not
so.
I
There
are
two
ways
to
look
at
it
that,
if
that,
if
the
two
two
intuitions
that
that
are
that
we're
applying
in
the
chat
one
is
that
that,
like
if
the
path
latency
aside
from
buffering
is
low,
the
cubic
is
favored
and
therefore
like
low
flow,
latency
flows
will
low
latency
pads
will
just
use
cubic
forever.
I
The
other
the
other
one
is
that
as
more
and
more
people
adopt
cubic
towards
the
nash
equilibrium
that
that
buffers
that
buffer
occupancy
drops
and
therefore
that
the
the
benefit
of
adopting
bbr
instead
of
cubic
lessons,
and
so
the
the
nth
person
to
adopt
cubic,
has
no
incentive,
because
the
other
bbr
people
have
already,
you
know,
reduced
the
buffer
occupancy.
So
I
don't
know
if
you
can
speak
to
either
of
those
intuitions
if
they're,
if
both
or
neither
or
one
of
them,
is
correct.
In
your
view,.
L
Yeah
so
generally,
we
have
seen
that
there
are
diminishing
returns
in
both
directions,
so
whether
it
be
for
more
and
more
people
to
adopt
cubic
or
more
and
more
people
to
adopt
bbr
and
we
are
in
the
process
of
actually
coming
up
with
a
model
that
can
reason
about
these
diminishing
returns,
and
I
won't
get
into
the
details,
but
at
a
very
high
level.
L
Basically,
why
we
think
this
is
happening
is
that
when
you
have
both
cubic
and
bbr
flows
competing
at
the
bottleneck,
what
they
do
is
that
they
section
off
different,
they
basically
section
of
different
regions
of
the
buffer.
L
So
there
you
have
one
box
that
belongs
to
bbr
and
one
box
that
belongs
to
cubic
and,
as
you
put
more
and
more
flows
into
the
cubic
foot
box
or
more
and
more
flows
in
the
bbr
box,
the
boxes
don't
increase
in
size
linearly
compared
to
the
number
of
flows
you're
putting
into
them,
which
is
why
we're
getting
diminishing
returns
and
which
is
why
you
know
the
the
rate
of
acceleration
when
you
reach
the
nash.
Equilibrium
point
keeps
on
reducing
in
terms
of
the
the
performance
benefits
that
you
get.
M
I
just
wanted
to
thank
you
first
off
for
this
work.
It's
really
interesting
and
it
seems
super
useful.
I
just
wanted
to
amplify
some
of
the
discussion
here
about
the
workload
that's
being
tested
here.
M
I
think
a
lot
of
us
are
thinking
it
would
be
really
useful
to
in
future
versions
of
this
work
to
include
a
sort
of
mix
of
short
and
long
flows
and
in
particular
the
kind
of
effect
I'm
interested
in
is
that
if
you
have
a
dynamic
mix
of
entering
short
flows,
then
often
every
time
somebody
enters
the
bottleneck,
they'll
cause
packet
loss
because
they
sort
of
try
to
figure
out.
M
You
know
what
what
bandwidth
and
how
much
buffer
space
is
available,
and
if
those
flow
entries
all
cause
packet
loss
and
those
flow
entries
are
close
enough
together,
then
that
can
basically
prevent
cubic
from
reaching
its
fair
share.
M
Because,
obviously,
it's
going
to
be
very
sensitive
to
how
far
apart
those
lost
points
are.
So
I
think
you
might
get
very
different
answers
to
in
the
question
of
what
cc
is
incentivized
if
there's
sort
of
a
mix
of
dynamically
entering
short
flows.
So
I'd
love
to
see
that,
in
a
future
version
of
this.
L
Yeah
yeah,
so
we're
definitely
considering
different
kind
of
workloads.
I
think
it's
a
great
suggestion
that
we
should
look
at
a
mix
of
short
flows
and
long
flows
and
see
how
things
are
changing.
In
fact,
another
thing
that
we
are
exploring
currently
is:
if
you
remove
right
now,
we
were
just
experimenting
with
long
flows,
but
we
also
want
to
experiment
with
what
happens
when
you
have
video
workloads
and
when
you're
dealing
with
video
workloads.
L
When you're dealing with video workloads, ideally the utility function that we'd be looking at would not be the throughput, but actually the QoE that your client is calculating. So yeah, thank you for your suggestion. All these different aspects of the problem, we have been trying to reason about them a lot, and in fact the biggest problem we're facing right now is really to come up with a nice systematic way to explore all the different things that can happen in this space.
E
Hey, great.
E
Hi, hey, this is great work. So I'm assuming this work was done measuring BBRv1 with CUBIC; it would also be interesting to include BBRv2. And the second comment I had was: yeah, you're right that the evolution here might not be just based on maximizing throughput as a utility function.
E
So
I
think
certainly
reducing
latency
is
one
of
the
goals
on
the
internet
right,
so
for
all
the
players
here,
the
utility
function
might
not
be
just
maximizing
throughput,
so
that
should
be
taken
into
account,
and
the
other
thing
here
is
that
there's
certainly
a
benefit
to
standardizing
on
one
algorithm
in
the
long
term.
So
when
you
look
at
it
from
purely
you
know,
engineering
efficiency
point
of
view,
there's
one
algorithm
that
can
give
you
better
throughput
and
lower
latency.
That's
what
everyone.
L
Yeah, yeah, I agree, but actually, from a design point of view, I think it's quite a hard problem to convince everyone to switch to that oracle algorithm just based on performance, because that would mean that, in this graph, we basically want our designed algorithm to always sit north of the fair-share line. You can obviously design based on whatever utility function you like; you can design a utility-based algorithm that only maximizes that utility, but yeah.
L
So
from
from
a
design
point
of
view,
I
definitely
agree
that
the
best
thing
is
to
have
everyone
on
the
internet
run
the
same
thing,
but
realistically
we
feel
that
it
that's
something
that
might
never
happen
and
we
might
have
to
you
know,
figure
out
a
way
to
work
around
with
these
zoo
of
algorithms.
B
On the last thing that you just said: I'd be very curious to see how you can extend this, if you can, to the zoo of algorithms that you found on the Internet. And with that I'm going to move on to the next presentation. Thanks again, Ayush, and please, this was announced on the ICCRG mailing list, so if you want to have any discussion on this, please take it there.
B
I
encourage
that
I'm
going
to
ask
praveen
to
come
back
on
so
that
he
can
start
his
presentation
next,
proving
we
are
running
behind.
So
I'm
going
to
ask
you
if
you
could
try
to
keep
it
tight.
E
Hello
everyone,
so
this
talk
is
slightly
different.
I'm
going
to
be
talking
about
implementation
experience.
Hopefully
you
know
talking
about
two
drafts
that
have
been
presented
to
the
iccr.
In
some
form
we
have
an
implementation
update
on
on
both
both
of
these
algorithms.
E
The key insight is to use the flow control mechanism to throttle the peer. In the TCP case, it's basically shrinking the TCP receive window, and growing it, based on running a sort of equivalent congestion control algorithm on the receiver side.
E
I
missed
books
there
when
I
said
shrink
so
yeah.
We
don't
shrink
the
advertise
window,
which
is
we
just
reduce
it
by
the
amount
of
bytes
we
received,
but
we
do
tune
the
window
over
time.
Depending
on
the
observed
events
from
the
network,
like.
E
Now, why is this important? One of the reasons why just a sender-side congestion controller is not good enough in practice is that a lot of software uses CDNs, and a lot of CDNs currently don't have, for example, LEDBAT++ support; it's harder to update all CDNs to have the right congestion controller, and proxies can prevent effective use of LEDBAT on the end-to-end path.
E
Also,
if
you
have
proxies
on
the
path
then
effectively
from
the
server
side,
you're
not
actually
measuring
the
right
bottleneck
and
are
able
to
basically
throttle
your
sanding
rate,
and
the
receiver
has
a
very
clear
information
about
which
download
it
things
are
background
downloads
compared
to
foreground
download.
So
there's
advantages
logistically
in
doing
it
on
the
receiver
side,
and
this
work
is
based
on
this
draft,
which
is
currently
active
in
iccrg.
E
On the algorithm: LEDBAT++ and rLEDBAT. Our implementation of rLEDBAT is based on LEDBAT++, so it includes all the additional mechanisms that were introduced in LEDBAT++, like using RTT measurements instead of one-way delay, slower-than-Reno congestion window increase with the adaptive factor, as well as the multiplicative congestion window decrease with the adaptive reduction factor.
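A heavily simplified sketch of the receiver-side idea (my own paraphrase of LEDBAT-style control applied to the advertised window; the constants and names are illustrative, not from the draft or the Windows implementation):

```python
TARGET_QDELAY_MS = 60.0   # illustrative queuing-delay target
MSS = 1460

def update_receive_window(rwnd_bytes: float, rtt_ms: float,
                          base_rtt_ms: float, loss_detected: bool) -> float:
    """Tune the advertised receive window like a receiver-side LEDBAT:
    grow slowly while queuing delay is below target, back off
    multiplicatively on loss or when delay exceeds the target."""
    qdelay = max(rtt_ms - base_rtt_ms, 0.0)
    if loss_detected or qdelay > TARGET_QDELAY_MS:
        rwnd_bytes *= 0.8                               # multiplicative decrease
    else:
        # slower-than-Reno additive increase, scaled by remaining delay headroom
        gain = 1.0 - qdelay / TARGET_QDELAY_MS
        rwnd_bytes += gain * MSS * MSS / rwnd_bytes
    return max(rwnd_bytes, 2 * MSS)                     # never drop below 2 MSS
```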
E
So
we
did
simplify
this
compared
to
ledbet,
plus
plus,
so
we
currently
are
doing
one
slowdown
period
per
basically
a
measurement
interval.
So
this
was
deliberately
done
to
simplify
the
code.
We
haven't
compared
one
approach
to
the
other,
but
this
just
a
simpler
implementation
and
we
also
simplify
the
base
delay
implementation
based
on
the
outlet
by
draft.
E
So
we
do
require
negotiation
of
time
stamps.
So
what
this
means
is
that
if
the
application
did
request
or
let
bat
and
the
timestamp
negotiation
may
fail
with
the
server
in
that
case,
we
need
to
reflect
that
up
to
the
application,
so
it
can
implement
its
own
fallback
logic
to
throttle,
for
example,
using
a
fixed
rate.
E
Currently
we
don't
take
an
action
if
a
data
packet
is
received
without
timestamps
after
establishment,
so,
for
example,
a
middle
box
is
stripping
timestamp
options.
We
are
currently
not
reacting
to
that.
That's
actually
a
the
standard
says.
You
know
the
receiver
should
drop
those
packets,
but
we
currently
don't.
This
is
an
area
where
we
would
like
to
continue
some
investigations
to
see
what
the
draft
should
recommend,
but
we
are
collecting
data
on
this
to
see
how
prevalent
this
is
in
the
wild.
E
The
other
problem
that
we
haven't
mitigated
is
that
the
rtt
is
measured
could
be
inflated
because
of
bursts
during
slow
start
on
the
sender
side.
There's
no
effective
mitigation.
We
can
think
of
this
on
the
receive
side.
If
there
are,
you
know
this
might
be
an
area
of
research.
E
So
one
of
the
things
we
observed
while
we
started
experimenting
with
this
in
in
production,
was
to
find
that
there
are
several
cdns
which
currently
do
not
enable
timestamps.
E
So
we
have
worked
with
many
cdns
to
enable
timestamps
when
the
client
requests
timestamps,
and
I
believe
I
think
the
coverage
is
much
higher
now
we
are
currently
doing
these
measurements
with
with
the
windows
update
downloads,
both
for
operating
system
updates,
as
well
as
store
downloads,
and
we
are
aiming
to
share
some
data
by
the
next
iccrg.
E
I'm
I'm
leaving
your
question
open
here,
I'm
a
co-author
on
the
outlet
red
draft.
This
work
is
this
presentation
was
about
the
implementation,
but
now
that
we
have
an
implementation
based
on
the
draft,
I
wanted
to
ask
janna
and
and
the
group
whether
you
know
we
should
consider
publishing
drafts
as
an
experiment.
E
We
can
take
that
during
the
q
a
I
will
go
on
ahead
with
the
presentation
because
limited
time,
so
I'm
also
going
to
talk
about
bbr
v2
and
our
implementation
of
bb
rb2,
so
bbr
v2.
A
quick
recap
is
a
model
based
condition,
control
algorithm.
E
The way the algorithm works is to continuously measure bandwidth, round-trip time, packet loss, and ECN markings from the network, and basically figure out a rate that the sender should be sending packets at. There are some notable additions in v2 compared to v1: the bandwidth-probing time scale is adaptive; loss and ECN have been incorporated into the network model; and even when we are application-limited, we want to adapt to loss and ECN information.
E
And,
finally,
because
there
is
significant
aggregation
in
networks,
we
want
to
adapt
this
event
based
on
estimating
the
amount
of
ack
aggregation,
that's
happening
in
the
network
and
finally,
the
the
computed
rate.
The
sender
will
basically
paste
the
packets
at
the
computed
rate.
E
A brief overview of how we implemented this: the code is actually open source, available at this link, so that's what we based our implementation on. We integrated it as a congestion control module in the Windows TCP stack. It's currently available as an experimental knob in Windows 11 Insider builds. The rate-based pacer was built into TCP, so we're not using a pacer that's outside the TCP module, and the way this works is that on each send we compute an allowance based on the time since the last send; effectively, if the allowance does not allow us to send the packet at that time, we schedule the pacing timer to send the remaining data.
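(A minimal sketch of that send-time allowance idea; the structure and helper names are illustrative, not the actual Windows pacer.)

```c
/* Sketch of a send-time pacing allowance: bytes "earned" since the last send
 * at the current pacing rate. Names are illustrative. */
#include <stdint.h>

struct pacer {
    uint64_t pacing_rate_Bps;   /* bytes per second from the model */
    uint64_t last_send_us;      /* timestamp of the previous send */
    uint64_t allowance_bytes;   /* bytes we may send right now */
};

/* Called on each send attempt with the current time in microseconds. */
static uint64_t pacer_allowance(struct pacer *p, uint64_t now_us)
{
    uint64_t elapsed_us = now_us - p->last_send_us;

    p->allowance_bytes += p->pacing_rate_Bps * elapsed_us / 1000000;
    p->last_send_us = now_us;
    return p->allowance_bytes;
}

/* If the bytes to send exceed the allowance, the caller sends what it can
 * and arms the pacing timer for the remainder (timer arming omitted here). */
```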
E
One of the challenges for us was that this code is also integrated into the Linux kernel, and because the code is not final and is still evolving, we wanted to integrate it but still leave most of the code intact, so that we can enable direct comparisons with future versions and be able to pull in those changes easily.
E
One of the simplifications we made is that we currently don't do any ECN handling; we assume there's no ECN marking happening on the network. That's just a simplification, and eventually we'd like to add that in. I would also like to say that this, what we called a reverse engineering approach, basically just looking at code and trying to implement a congestion control algorithm, is something we've done for the first time, and it was very, very hard.
E
Lack of a spec was a significant problem for us while developing this, but thankfully we had good support from Neal and Yuchung over email and we were able to get most of our questions answered. Longer term, though, it's probably not sustainable. Some of the early data: we see significant improvements in latency, particularly compared with CUBIC.
E
With CUBIC we see latency overshoot a lot beyond the base RTT, but in this case we're seeing up to 10x improvements in many cases, and some throughput improvements as well. These are test cases in the lab doing wide-area network emulation.
E
One of the interesting findings: since this is primarily aimed at reducing latency, we ran it in ultra-low-latency test cases, where you have back-to-back systems in the same rack, and in loopback test cases, and we see that there's actually a CPU usage bottleneck; the algorithm is executing more cycles compared to CUBIC.
E
This is something we would like to address with software optimizations. We also find that there are interactions between pacing and LSO. LSO (TSO) is basically sending a large segment out to the NIC to improve efficiency, and we find that because of pacing there are fewer opportunities to do so, and the size of the LSO is actually smaller.
E
We also did an inter-region test in the Azure cloud and we see about a 20% throughput improvement and not much difference in latency. This is a low-loss, not oversaturated network, so there's ample headroom, and we see throughput improvements but not much difference in latency in this particular test case.
E
There are still significant fairness issues: in all our lab tests we see that CUBIC dominates BBR v2 across a range of test cases. In BBR v1 we sort of had the opposite problem in some of the shallow-buffer cases, but here we find that maybe we have overcompensated a little bit, CUBIC is dominating BBR v2, and currently it doesn't seem incrementally deployable.
E
Of course, if you have a network and workload where you can guarantee that it's only going to be BBR v2, then it's certainly deployable, but otherwise, for any sort of incremental deployment, we currently have significant fairness issues. So, next steps: Neal did promise he'll bring a draft; I think his talk is next.
E
Looking forward to that, but basically we would like to help review that draft, adopt it and take it forward, and change our implementation according to community feedback. We'd like to resolve the fairness issues when CUBIC shares the bottleneck link; the CPU usage optimizations also need to be looked at; and finally, deployment in production. A big shout-out and thanks to Neal and Yuchung for all their help on this work.
B
Thanks so much, Praveen. I want to say we don't have time for questions, there is negative time for questions, but if you have a really quick question, ask it here, or I would recommend that you take it to the chat if you can, because we're already way behind on time. Do you have a burning question that you want to ask in person?
N
Not really, just a very quick comment about LEDBAT, the parameters, the gain factor and the target: I just wanted to say that we shipped these two last year in our TCP implementation, and we probably played with the gain factor and the target a little bit, because it wasn't working; the additive increase wasn't going as fast as I was expecting and the throughput was really suffering.
B
Yes, and by the way, even for conversations on clarification of implementations or various things, I would strongly encourage using the ICCRG as the place to have them. If you're able to include ICCRG, a lot of good things come out of it, because the community gets to see it and there's a record of it.
B
It's available for others to see later when they're doing their implementation work. So not just for this, but for Praveen and Neal: when you're having your conversations, if you're able to have them on the channel, they'll be very, very welcome as well. Thanks for your presentation, Praveen; it's exciting to see this work move forward. I want to now hand it over to Neal. Neal, I know that we are behind, so the rest of the time is yours.
B
So manage it as you see best, but go for it.
M
Okay, let's see: is everyone able to see the slides?
M
Can you hear my audio? Yep, visible? Okay, great. So I wanted to give a quick update about the BBR work going on inside our team; Ian will also be presenting some updates on the QUIC side, and I'll make this super quick. I just wanted to talk a little bit about the deployment status and code status, and then give a high-level overview of the internet drafts that I updated on the site last night.
M
You can find links on the ICCRG list and the bbr-dev list. The goal here is mostly to talk about the drafts, which are responding to requests from the IETF community and other transport stack maintainers implementing BBR v2.
M
Obviously it would be useful to have a draft documenting the algorithm; this was always part of the plan, and we apologize for it not happening sooner. We also, of course, want to invite the community to read the drafts and offer any kind of feedback: low-level editorial feedback, algorithm ideas, bug fixes, test results. Anything is useful and welcome. So, in terms of the deployment status of BBR at Google right now:
M
The default right now is BBR v2 using ECN, loss, bandwidth and RTT as signals, but we are doing a pilot, small but growing, of a BBR.Swift variant that uses a network RTT estimate as the primary congestion signal, in the manner of the Swift algorithm that was published at SIGCOMM in 2020.
M
Google external traffic is still using BBR v1 by default, but we're working on transitioning that to v2, looking at A/B experiments, QoE and latency data, and iterating to improve that for the launch. And of course we're continuing to iterate on some of the areas where we know we want to improve, including the issues that Praveen mentioned about coexistence with CUBIC.
M
I also wanted to mention, since Praveen mentioned CPU usage, that we do have a patch set introducing a fast path for BBR processing, which we found useful in bringing CPU usage to parity with CUBIC, at least for our production workloads; we will share that when we get time. Let's see, the status of the code: this is just a repeat, we have open-source versions.
M
You can find the links in the slides. The BBR functionality is split between two different drafts, as it was in the original release for BBR v1. The first draft is the delivery rate estimation algorithm, and that covers the bandwidth sampling mechanism that's used by both BBR version 1 and version 2; it's also just generically available in Linux TCP, so you can use those bandwidth samples no matter what congestion control is in place.
M
Basically, we realized that when the loss detection algorithm decides to retransmit something, that is another point where you need to look for bubbles of silence before that event, just as you would for cases where an application decided to send something after being idle. So take a look at the draft and give us feedback; it would be appreciated.
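(A sketch of the per-ACK delivery rate sampling idea: divide the bytes delivered over an interval by the larger of the send-side and ACK-side elapsed times, so bursts and idle "bubbles of silence" don't inflate the estimate. Field names here are illustrative, not the draft's exact pseudocode.)

```c
/* Sketch of a delivery-rate sample taken when an ACK arrives. */
#include <stdint.h>

struct rate_sample {
    uint64_t delivered_bytes;   /* bytes delivered over the interval */
    uint64_t send_elapsed_us;   /* time between the two bracketing sends */
    uint64_t ack_elapsed_us;    /* time between the two bracketing ACKs */
    int      is_app_limited;    /* sample taken while app-limited? */
};

static uint64_t delivery_rate_Bps(const struct rate_sample *rs)
{
    /* Use the longer of the two intervals so neither send bursts nor
     * ACK aggregation overstate the achievable rate. */
    uint64_t interval_us = rs->send_elapsed_us > rs->ack_elapsed_us
                               ? rs->send_elapsed_us : rs->ack_elapsed_us;
    if (interval_us == 0)
        return 0;
    return rs->delivered_bytes * 1000000 / interval_us;
}
/* App-limited samples are still computed, but a bandwidth max-filter would
 * typically only accept them if they exceed the current estimate. */
```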
M
The other piece of this puzzle is the BBR congestion control draft itself. That's also been updated to cover the current BBR version 2 algorithm. Right now it just includes the aspects relevant to the current public internet: the core model, the loss response, and the strategy for coexistence with CUBIC and Reno. The ECN part is only missing due to time limitations; it's still used at our site and still part of the long-term roadmap, we just haven't had time to put it in the draft, so I'll do that.
M
We'll work on that as soon as we can, and the algorithm is documented in its current state, and of course there are some known issues, like the one I mentioned here.
M
It corresponds to the issue Praveen mentioned in terms of CUBIC and BBR v2 coexistence, where CUBIC wins too often. And then I'm just going to zoom through some pictures. Actually, first, a quick outline: as you would expect, the draft covers first an overview and then a detailed rundown of the algorithm, the network path model, how it sets the control parameters, and then the state machine, as the algorithm decides to probe the network during its lifetime.
M
And then I posted a couple of pictures that, as pictures by themselves, won't have enough context, but I am hoping they'll be useful to folks making their way through the draft, reading the content, who could use a little picture to help put everything in context and make it a little clearer. So this is a sort of high-level block diagram.
M
And then the next picture that I think might be useful is just a picture of the parameters in the model and how they fit together. We don't have time to go into detail, but all of these are defined in the draft, and at a high level there's basically a set of parameters about the data rate that the algorithm thinks is appropriate, and then some parameters about the data volume, or amount of in-flight data, and a key part of that, of course, is the BDP estimate.
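(A rough summary of the volume-side relationship just described, with an illustrative gain symbol:)

```latex
% The BDP estimate anchors the in-flight (data volume) parameters.
\[
  \mathrm{BDP} \;=\; \widehat{\mathrm{bw}}_{\max} \times \mathrm{RTT}_{\min},
  \qquad
  \mathrm{inflight\_target} \;\approx\; g_{\mathrm{cwnd}} \cdot \mathrm{BDP}
\]
```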
M
This shows its evolution through the state machine as the connection starts up, with the level of in-flight data superimposed on top, so you can get a sense of how these things interact. Again, we don't have time to go into it, but this might help visualize what's going on in the text of the draft. So, in conclusion, we've updated the drafts to cover BBR version 2, and we look forward to feedback if people have time to read them: high-level feedback, low-level feedback.
M
Anything in between, it's all welcome. Thank you very much, and if there are any quick questions I can take some, but let's also leave time for Ian to give his update about the QUIC side.
B
Just a second, folks: line up in the queue if you want, but we'll wait until Ian is done before we take questions. Go for it.
K
How do I... oh, there you go, all right. I was just grabbing my slides.
K
All right, so I've been working on a variety of small changes to BBR v2 that don't substantively change the core algorithm and the approach that Neal outlined, but they do make some tweaks around the edges, and some of those tweaks may or may not end up being particularly relevant, particularly on the public internet.
K
I'm going to walk through three today; there are a number of others that I haven't had time to outline, and also a number of others for which I do not yet have good QoE results. The TL;DR is basically that BBR v2 is very, very close to achieving the same YouTube video QoE, as well as search latency, as BBR v1, with these tweaks and a few others, but these are the most substantive ones.
K
Probably, in the pilot. And there are still some differences between the two; in particular, we expect that there will be a bandwidth regression between the two algorithms.
K
Just because that's kind of the intent, and I think that's hopefully deemed acceptable; the key issue is that the rebuffer rate and those other metrics are not seriously harmed as a result. Search, conversely, has actually been a little bit easier so far, it looks like; possibly due to the fact that BBR v2 is a little bit less aggressive, search latency seems to be pretty robustly close to neutral.
So one challenge today is that when you exit startup due to loss, you set inflight_hi to the BDP. That means that, unless you are extremely nicely ACK-clocked in an extremely smooth manner, you are going to be limited by inflight_hi before you ever achieve the max bandwidth that resulted in that BDP. And similarly, once inflight_hi is low, it can be very difficult or even impossible to grow it substantially.
K
So this can result in a sort of bandwidth crash, where you have a certain bandwidth, and then you crank down inflight_hi, and then future bandwidths actually keep going lower, because inflight_hi is so low that you can't actually achieve the bandwidth that you first achieved.
K
So my proposed fix is relatively simple: you track the maximum bytes delivered in a round, not the maximum in flight, to be clear, the max bytes actually delivered. That should indicate that the pipe is at least that large, because those bytes actually got ACKed when you put that many in flight in a round, and as a result you have a lot less of a bandwidth crash when aggregation is present, as well as when you have excessive loss.
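(A minimal sketch of that bookkeeping, with illustrative names; this is not the actual patch.)

```c
/* Sketch: track the most bytes actually delivered (ACKed) within any one
 * round trip, and never let inflight_hi be cut below that, since the path
 * demonstrably carried that much. */
#include <stdint.h>

struct bbr2_round_state {
    uint64_t delivered_this_round;   /* bytes ACKed since the round started */
    uint64_t max_delivered_in_round; /* running max across rounds */
    uint64_t inflight_hi;            /* current upper bound on in-flight data */
};

static void on_ack(struct bbr2_round_state *s, uint64_t acked_bytes,
                   int round_start)
{
    if (round_start) {
        if (s->delivered_this_round > s->max_delivered_in_round)
            s->max_delivered_in_round = s->delivered_this_round;
        s->delivered_this_round = 0;
    }
    s->delivered_this_round += acked_bytes;
}

static void on_loss_cut_inflight_hi(struct bbr2_round_state *s,
                                    uint64_t proposed_inflight_hi)
{
    /* Floor the cut at the max bytes delivered in a round. */
    if (proposed_inflight_hi < s->max_delivered_in_round)
        proposed_inflight_hi = s->max_delivered_in_round;
    s->inflight_hi = proposed_inflight_hi;
}
```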
K
This change, at least in our experience, improved QoE and had almost no downside; the change in retransmit rate was extraordinarily small. I think Neal actually has plans to start experimenting with this at some point, but it's one of many smaller changes.
K
There are a few other spots in the code where one could potentially start using this max-delivered-in-a-round, or rely more on bytes delivered rather than in-flight and other metrics, and I have some experiments with those, but the results are less clear than they are with this one, which was a pretty clear win.
K
The next one, an issue that Neal has talked about a few times, is early probe exit, and similarly the lack of inflight_hi growth during probing, which is related. PROBE_UP exits early due to queueing; the queueing criterion is that you exit PROBE_UP if it's been at least a min RTT in PROBE_UP (to be clear, PROBE_UP in these slides means ProbeBW:UP from Neal's slides) and the bytes in flight are greater than 1.25 times BDP plus 2 MSS.
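(A sketch of that exit check as stated, with illustrative names and integer arithmetic for the 1.25 factor.)

```c
/* Sketch of the current ProbeBW:UP exit check described above. */
#include <stdint.h>

static int probe_up_should_exit(uint64_t time_in_probe_up_us,
                                uint64_t min_rtt_us,
                                uint64_t bytes_in_flight,
                                uint64_t bdp_bytes,
                                uint64_t mss_bytes)
{
    /* i.e. at least one min RTT in PROBE_UP and inflight > 1.25*BDP + 2*MSS */
    return time_in_probe_up_us >= min_rtt_us &&
           bytes_in_flight > bdp_bytes + bdp_bytes / 4 + 2 * mss_bytes;
}
```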
K
If you're not in PROBE_UP, you don't really increase inflight_hi; there are some ways it can increase, but typically it's quite rare, so again you never re-achieve the max bandwidth.
K
The simple solution that we currently have in our code, enabled by default, is that you wait at least one round instead of a min RTT; in cases where there's a lot of aggregation, the min RTT can be an order of magnitude smaller than the smoothed RTT, and that ends up being necessary, and then you add the extra ACKed onto the in-flight check. This isn't perfect.
K
This does increase retransmit rates somewhat measurably; it's still massively less than BBR v1, but this extra criterion is in some cases a little bit aggressive. It is at least proof, though, that there are solutions out there that avoid this early exit and don't cause a huge amount of collateral damage.
K
A newer idea that I wrote relatively recently is, instead of looking at the extra ACKed, what about looking for a persistent queue over the course of the round?
K
So the code says: if you've been in PROBE_UP for at least a round, and your minimum bytes in flight are greater than the 1.25 BDP number that we're checking against, then you exit. In theory, this might allow us to skip the application-limited check in various spots.
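(A sketch of the persistent-queue variant: the same threshold as above, but applied to the minimum in-flight seen over a full round, so the queue must never have drained during that round; names are illustrative.)

```c
/* Sketch of the persistent-queue exit check for ProbeBW:UP. */
#include <stdint.h>

static int probe_up_should_exit_persistent(uint64_t rounds_in_probe_up,
                                           uint64_t min_inflight_this_round,
                                           uint64_t bdp_bytes,
                                           uint64_t mss_bytes)
{
    /* Exit only if in PROBE_UP for a full round AND the in-flight level
     * never dipped below the queueing threshold during that round. */
    return rounds_in_probe_up >= 1 &&
           min_inflight_this_round > bdp_bytes + bdp_bytes / 4 + 2 * mss_bytes;
}
```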
K
There are some spots in startup and elsewhere where we do have app-limited checks; if you were app-limited and you still couldn't get your queue under the target, probably something's not going great, so making the code a little less sensitive to those checks is a potential side benefit of this. And the last one was excessive time in PROBE_RTT; this one's pretty simple.
K
Basically, when you're coming out of quiescence and you're in PROBE_RTT, you don't leave until a full round has passed, because you need an ACK to kick yourself out of PROBE_RTT, and that means you're sending at well less than half the bandwidth for a full round. We discovered this independently; apparently it was anticipated in QUIC, so apparently it was a good idea, and TCP already has this fix in as well, but it's worth noting just because it does come up.
K
It can increase the amount of time in PROBE_RTT when you're doing app-limited traffic, and we noticed there are a number of YouTube flows where you'd get a chunk, then stop for a while, then get a chunk, and so on and so forth, and when you finished the last chunk you'd still be in PROBE_RTT. Cool, so that's it.
K
I want to open up the floor for questions; we have seven minutes left.
B
Sweet, thank you. All right, Jonathan, you're up. Neal, do you want to join?
D
Yeah, I was wanting to ask about the ECN, since it isn't in the draft yet. Could we have a brief summary of how ECN information is incorporated into the algorithm and how it differs from how loss information is incorporated?
M
Right, the ECN information is interpreted in a manner that's very similar to DCTCP.
M
So I think that tells you all you need to know. Right now it's not L4S specifically, just because it's not tied to the ECT(1) codepoint and it's not integrated with Accurate ECN, but the intent is to allow it in the future to be L4S compliant down the road.
M
Well, I don't know if we have time to go into the specifics, but the reaction to ECN is very similar to DCTCP: in every round trip in which there is ECN marking, there's a multiplicative decrease that's proportional to the exponentially weighted moving average of recent ECN marks. So hopefully that's a quick summary. There are also details about it in previous ICCRG slides, I think, but I'll also try to update the draft to discuss the ECN part as soon as we get cycles.
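(A sketch of a DCTCP-style reaction of the kind described: an EWMA of the per-round marked fraction drives a proportional multiplicative decrease, in the spirit of RFC 8257. The gain and exactly how the result feeds the BBR model are illustrative, not BBR v2's actual code.)

```c
/* Sketch of a DCTCP-style ECN response driven by a per-round marked fraction. */
struct ecn_state {
    double alpha;   /* EWMA of the ECN-marked fraction, in [0, 1] */
};

#define ECN_EWMA_GAIN (1.0 / 16.0)   /* illustrative EWMA gain */

/* Called once per round trip with that round's byte counts. */
static double ecn_round_update(struct ecn_state *s, double marked_bytes,
                               double acked_bytes, double cwnd_bytes)
{
    double frac = acked_bytes > 0 ? marked_bytes / acked_bytes : 0.0;

    s->alpha = (1.0 - ECN_EWMA_GAIN) * s->alpha + ECN_EWMA_GAIN * frac;
    if (marked_bytes > 0)
        cwnd_bytes *= 1.0 - s->alpha / 2.0;   /* decrease proportional to EWMA */
    return cwnd_bytes;
}
```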
B
That's probably a good idea to put in the draft. Vidhi, you're up next.
N
I was actually going to ask which form of ECN we are using, but Neal already answered that, and you said there are plans to use Accurate ECN, right, if Accurate ECN is supported. So that's good. Oh, so the...
M
No, I was just going to confirm that: yeah, I think if and when Accurate ECN makes it into Linux or other OSes, then the plan would be to use that signal.
N
Okay, the second question I had was regarding the points that Ian noted about inflight_hi. Is inflight_hi not set again after the loss? I was assuming it would be set again: once you have a loss you set it to BDP, but then you probably have another stage where you would increase inflight_hi, or am I misunderstanding it?
M
Yeah, that's right, and you're correct that the flow probes again, but the tricky issue is that there's a coupling between the inflight_hi value that's bounding the volume of data you're willing to put in the network, and your bandwidth estimate, which is the delivery rate that you can achieve. Once you've decided you can only fit a certain amount of data inside the network, that implicitly bounds the rate that you're able to achieve, and because it's bounding the rate you're able to achieve, that bounds everything downstream.
M
Sometimes, if there's a packet loss early on, that can limit your sense of the safe volume of data, which limits your bandwidth estimate, which limits your probing, and you can get stuck in these cases. There are a couple of different ways you can fix that, and Ian and I are both continuing to experiment with that.
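(A rough way to see the coupling being described: the in-flight cap limits the delivery rate you can measure, which in turn caps the next BDP estimate used for probing.)

```latex
% inflight_hi caps the measurable rate, which caps the next BDP estimate.
\[
  \widehat{\mathrm{bw}} \;\lesssim\; \frac{\mathrm{inflight\_hi}}{\mathrm{RTT}},
  \qquad
  \mathrm{BDP}_{\mathrm{next}} \;=\; \widehat{\mathrm{bw}} \cdot \mathrm{RTT}_{\min}
  \;\le\; \mathrm{inflight\_hi}
\]
```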
M
It gets a little tricky, though, because once you fix that, which I think is fairly straightforward to do, it impacts your coexistence behavior with CUBIC and Reno, because then you end up causing packet loss more often, and so I think the algorithm might need to be a little more shrewd about how it schedules its bandwidth probing once it's more robust about pushing up to encounter loss in these cases.
E
Yeah, hey, thanks, Neal, I really appreciate the new drafts; I will certainly go over them and provide my feedback. One quick question on the logistics: do these drafts reflect the code that's in the out-of-kernel repo that we are using as the basis, or is this reflecting work that is not part of that code? That was one question. And I also had a suggestion for Ian: whatever enhancements you've made, please bring them to the draft as well.
K
So that is the intent: as soon as these things are proven to work well in both TCP and QUIC, we're going to make sure they make it into the draft, but so far we've been a little bit conservative and tried to make sure that any change we make really works well in a variety of circumstances.
K
Neal has a variety of simulations and test scenarios that I don't easily have available to me, and vice versa to some extent, so we need to make sure that something I changed that works great for YouTube doesn't destroy, say, data center applications. We do need to go back and forth, and I think that'll take some time.
M
On the question about which version: this should correspond very closely to the code that's currently on GitHub. There may be one or two small differences where there are two different behaviors available in the GitHub code and the draft documents the one that we currently recommend, but otherwise it should be very similar.
M
I'll shorten my question: you're implying that the QUIC and TCP implementations are diverging a little bit, and obviously there's a practical issue there, but are the differences solely related to the applications involved, or are you seeing protocol specifics that are driving these divergences?
M
I mean, I'll let Ian give his perspective, but my perspective is that we just have a team with multiple people who are doing experiments on a continuous basis, just in different code bases, and QUIC is probably able to get experiments pushed to front-end servers faster because it's user-space code.
K
We do observe some of this. As you probably know, Martin, ACK decimation, ACK frequency, whatever you'd like to call it, is quite widespread.
K
ACK frequency and similar approaches are fairly widespread for TCP too, but the way they manifest themselves is a little bit different, and that, along with some of the scheduling being in user space, means that I think the traces and the ACK lines tend to look a little bit different, and I think QUIC suffers a little more from the aggregation effects on the public internet due to a variety of these factors. My experience with BBR v1 was that QUIC was impacted more by aggregation; yeah, that's definitely a difference.
M
Yeah, although it's interesting, because in the data center case TCP also sees massive degrees of aggregation: when your RTTs are super small in the data center, the aggregation that the NICs are doing, or your software is doing, becomes massive in the traces at these very tiny RTTs. So I think there's plenty of aggregation on both sides, and I'm sure we'll be able to coalesce on a single final algorithm.
B
Thank you for that, and I want to thank Neal and others for pushing the new drafts. I will also say that if you don't, I'm going to start requiring you to have all of your discussions on the mailing list, so that somebody else can copy them and turn them into a draft.
B
But thank you for all the presentations, and thanks, Neal, Ian and Praveen, for shortening your discussions and allowing the first-timers to take some more time to present their work. I want to thank everybody for being here; this was a fantastic session and I look forward to more discussions. I've just created a Slack channel, so use that as well, and we'll see you next time.