Description
Lightning Talk: State of IPFS Public DHT - presented by @guseggert at IPFS þing 2022 - Content Routing 1: Performance - https://2022.ipfs-thing.io
A
Hey
everyone,
I'm
gus,
I'm
a
I'm
a
ipfs
steward.
I
mostly
work
on
kubo,
so
we're
gonna
talk
about
the
the
public
dht
so
to
review.
What
what
juan
was
talking
about.
The main functions of the DHT are content routing, like converting a CID into a set of peer IDs (we actually look it up by the multihash). The DHT also holds peer addresses and IPNS records.
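Since the talk only mentions it in passing, here is a small sketch of what "we actually look it up by the multihash" means in practice: two CIDs that differ only in version or codec carry the same multihash, so they map to the same DHT key. A minimal illustration using the go-cid library (the example CID string is arbitrary):

```go
package main

import (
	"fmt"

	"github.com/ipfs/go-cid"
)

func main() {
	// Decode a CIDv0, then re-wrap its multihash as a CIDv1 with the raw
	// codec. The two CIDs print differently but share one multihash.
	c0, err := cid.Decode("QmdfTbBqBPQ7VNxZEYEj14VmRuZBkqFbiwReogJgS1zR1n")
	if err != nil {
		panic(err)
	}
	c1 := cid.NewCidV1(cid.Raw, c0.Hash())

	// The DHT keys provider records on the multihash alone, so a lookup
	// for either CID lands on the same records.
	fmt.Println("cid v0:", c0)
	fmt.Println("cid v1:", c1)
	fmt.Println("same multihash:", c0.Hash().B58String() == c1.Hash().B58String())
}
```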
And these are the main implementations: about 70% is Kubo, 5% the Hydras, which I'll talk about in a second as well. The Hydras are something that PL runs to help speed up content routing. Basically, we flood the whole network with a bunch of peers that share a database storing content routing records, so that a lot of peers in the network know a lot about all the other records.
So yeah, there are two main clients in Kubo that we use: the traditional client, which does the lookups Juan was talking about, the logarithmic hops; and we've also got the accelerated DHT client, which caches the entire routing table in memory. That means lookups are really fast, because you don't have to do any extra hops, but it also means you've got to cache the entire routing table in memory, and one of the big downsides is the bootstrap time before it's even usable.
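To make that trade-off concrete, here is a self-contained sketch (toy 64-bit IDs, not real libp2p peer IDs or the go-libp2p-kad-dht API) of why the accelerated client answers in one hop: with the full routing table in memory, the closest peers to a key are one local sort away, while the standard client has to discover them iteratively, one round trip per hop:

```go
package main

import (
	"fmt"
	"math/bits"
	"sort"
)

// Toy peer IDs; real Kademlia keys are 256-bit hashes.
type peerID uint64

func xorDist(a, b peerID) uint64 { return uint64(a ^ b) }

// closestFromFullTable models the accelerated client: the entire table is
// cached locally, so finding the k closest peers to a key is an in-memory
// sort followed by a single round of direct requests.
func closestFromFullTable(table []peerID, key peerID, k int) []peerID {
	sorted := append([]peerID(nil), table...)
	sort.Slice(sorted, func(i, j int) bool {
		return xorDist(sorted[i], key) < xorDist(sorted[j], key)
	})
	if k > len(sorted) {
		k = len(sorted)
	}
	return sorted[:k]
}

// expectedHopsStandard models the traditional client: each query returns
// peers roughly half the remaining XOR distance away, so a lookup over a
// network of n peers costs about log2(n) sequential round trips.
func expectedHopsStandard(n int) int {
	return bits.Len(uint(n)) // ~log2(n)
}

func main() {
	table := []peerID{0x1111, 0x2222, 0x9999, 0xabcd, 0xf00d}
	fmt.Println("closest to 0xa000:", closestFromFullTable(table, 0xa000, 2))
	fmt.Println("standard-client hops over ~20k peers:", expectedHopsStandard(20000))
}
```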
So this is a stat that we monitor that shows the time it takes a random node in the network to look up some content. You can see that over the past year we did some work on the Hydras, so it went down a lot; currently, to look up a single record, it's about 160 milliseconds.
So yeah, that's with the standard client. If you use the accelerated DHT client, everything goes a lot faster once it's bootstrapped, because you only have to make a request to one peer; you don't have to do hops. And provides, or if you do a big bulk of provides, are a lot faster too, because you already know exactly who to provide those records to.
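As a concrete, hedged illustration of the bulk-provide point: in go-libp2p-kad-dht the accelerated client lives in the fullrt package, which exposes a bulk-provide entry point. The sketch below follows that package's API around the time of this talk; names like NewFullRT, DHTOption, and ProvideMany may have drifted since, so treat it as an approximation rather than a definitive recipe:

```go
package main

import (
	"context"
	"fmt"

	"github.com/libp2p/go-libp2p"
	dht "github.com/libp2p/go-libp2p-kad-dht"
	"github.com/libp2p/go-libp2p-kad-dht/fullrt"
	"github.com/multiformats/go-multihash"
)

func main() {
	ctx := context.Background()

	// A vanilla libp2p host; real deployments configure transports etc.
	h, err := libp2p.New()
	if err != nil {
		panic(err)
	}

	// The full-routing-table client crawls the network up front (the
	// expensive bootstrap discussed in the talk), then serves lookups and
	// provides without iterative hops.
	frt, err := fullrt.NewFullRT(h, dht.DefaultPrefix,
		fullrt.DHTOption(dht.BootstrapPeers(dht.GetDefaultBootstrapPeerAddrInfos()...)))
	if err != nil {
		panic(err)
	}

	// With the whole table cached, the target peers for every record are
	// known locally, so a big batch of provides needs no per-key walks.
	var keys []multihash.Multihash // multihashes of the CIDs to advertise
	if err := frt.ProvideMany(ctx, keys); err != nil {
		fmt.Println("bulk provide failed:", err)
	}
}
```

In Kubo itself this client is switched on through configuration (an experimental option at the time) rather than wired up by hand like this.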
These are some links; I can send these slides out afterwards if you're interested in looking at them.
These are different groups of people who collect metrics on the DHT. So Dennis has this nice crawler called Nebula, and he publishes a bunch of statistics, I think daily now, about who's on the network, how big it is, and different performance characteristics. And then there's Max from libp2p, who maintains the Rust libp2p implementation.
I wanted to show this too, let's see: we also have this Grafana dashboard for the Hydras, so you can get an idea of how much work the Hydras are doing. For example, these are per second, I believe; there's average requests per second, so you know, we process a lot of requests on the Hydras, millions per second. And you can see down here, this is the number of records that we've cached: we've got around a billion records in our database over the last week.
Going back here: yeah, we've still got a bunch of work we can do on the accelerated DHT client as well. Like I mentioned, it caches everything up front, which kind of sucks; I've actually run it a few times and it's killed my router, because it very aggressively looks up every peer in the network. That's fine if you're running some dedicated infrastructure for it, but if you're running it locally on your laptop or something, it's really painful. But I think we can get to a middle ground where it's basically just a cache of records, so we can build the cache as we go; we don't have to do it all up front.
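A minimal sketch of that middle-ground idea, with entirely hypothetical types (this is not the go-libp2p-kad-dht API): instead of crawling the whole network at startup, record the peers you learn about as a side effect of normal lookups and serve from the cache once it knows enough:

```go
package main

import (
	"fmt"
	"sync"
)

// lazyTable is a hypothetical incremental routing-table cache: it starts
// empty and is populated opportunistically from lookup traffic, rather
// than by an aggressive up-front crawl of every peer in the network.
type lazyTable struct {
	mu    sync.Mutex
	peers map[string][]string // peer ID -> known multiaddrs
}

func newLazyTable() *lazyTable {
	return &lazyTable{peers: make(map[string][]string)}
}

// observe records peers discovered as a side effect of ordinary queries.
func (t *lazyTable) observe(peerID string, addrs []string) {
	t.mu.Lock()
	defer t.mu.Unlock()
	t.peers[peerID] = addrs
}

// lookup serves from the cache when it can; a real client would fall back
// to the iterative DHT walk on a miss and observe() whatever it learns.
func (t *lazyTable) lookup(peerID string) ([]string, bool) {
	t.mu.Lock()
	defer t.mu.Unlock()
	addrs, ok := t.peers[peerID]
	return addrs, ok
}

func main() {
	t := newLazyTable()
	t.observe("QmPeerA", []string{"/ip4/203.0.113.7/tcp/4001"})
	if addrs, ok := t.lookup("QmPeerA"); ok {
		fmt.Println("cache hit:", addrs)
	}
	if _, ok := t.lookup("QmPeerB"); !ok {
		fmt.Println("cache miss: would do a normal DHT walk, then cache the result")
	}
}
```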
So that's one area where we can definitely improve. And then there are a bunch of constants in there; we didn't spend a whole lot of time on them when writing the accelerated DHT client, so there are a lot of constants that we can research and probably improve on.
Also, the records are split apart into two different pieces; like, one answer can say "oh, that machine over there has this hash," but it doesn't also give you that machine's addresses. If you could batch "who has it" together with their addresses, like right here, you could reduce the chatter a little bit. And then there's also, I think we talked about it earlier in stand-up, this big lookup table that we use for bootstrapping the routing table; it's like 400 kilobytes, and that kind of sucks.
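To make the batching idea concrete, here is a hypothetical response shape, invented purely for illustration (the real DHT wire format is defined by go-libp2p-kad-dht's protobufs): the provider answer carries the providers' addresses in the same message, so the requester avoids a second round of address lookups:

```go
package main

import "fmt"

// ProviderRecord pairs "who has it" with "how to reach them" so both travel
// in one reply. Hypothetical shape, not the actual DHT message schema.
type ProviderRecord struct {
	PeerID string   // peer that has the content
	Addrs  []string // that peer's multiaddrs, batched into the same reply
}

// ProviderResponse answers "who has this multihash?" in a single round trip.
type ProviderResponse struct {
	Key       string // multihash being resolved
	Providers []ProviderRecord
}

func main() {
	resp := ProviderResponse{
		Key: "mh-of-some-cid",
		Providers: []ProviderRecord{{
			PeerID: "QmPeerA",
			Addrs:  []string{"/ip4/198.51.100.9/tcp/4001"},
		}},
	}
	// One message: no follow-up "find this peer's addresses" query needed.
	fmt.Printf("%+v\n", resp)
}
```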