From YouTube: IPFS Cluster - @hsanjuan - IPFS Implementations
Description
IPFS Cluster - presented by @hsanjuan at IPFS Thing 2022 - IPFS Implementations - https://2022.ipfs-thing.io
It was good to see a lot of new stuff. This is a bit of old stuff, because it started soon after go-ipfs. So I'm going to tell you about IPFS Cluster: we will go very quickly over the features, and then I will try to explain a little bit of the internals and how it uses the go-ipfs stack.
I don't think we should call it the Kubo stack anyway. Under the hood, IPFS Cluster does data orchestration against a swarm of Kubo daemons, running as a sidecar. What it does is, given a replication factor for a pin, select which Kubo daemons in your cluster should be the ones pinning it, tell them to pin it, and then make sure they actually pin it; if they don't, it retries, and so on.
The whole idea here was that, if you're going to deploy Kubo, you might as well deploy the IPFS Cluster daemon next to it, more or less for the same price and in a very similar fashion. It's also a daemon, it's configured more or less in the same way, you run it more or less in the same way, and you get IPFS coordination on top almost for free.
What does it do? Pin replication; CAR imports and IPFS adds, but to multiple Kubo peers at the same time; balanced allocations; APIs. I can go into a little more detail, but the idea is that every peer in the cluster has a shared pin set: they are all tracking a pin set, they all share that state between them (we will see how it is coordinated), and every pin in that pin set has several options, among them the replication factor.
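As a rough illustration of what submitting such a pin looks like programmatically, here is a sketch using the Go REST API client that ships in the ipfs-cluster repository; the package paths, the Pin signature and the PinOptions field names are written from memory and may differ between releases.

```go
package main

import (
	"context"
	"fmt"

	"github.com/ipfs-cluster/ipfs-cluster/api"
	client "github.com/ipfs-cluster/ipfs-cluster/api/rest/client"
)

func main() {
	ctx := context.Background()

	// An empty Config targets the local cluster peer's REST API (:9094).
	c, err := client.NewDefaultClient(&client.Config{})
	if err != nil {
		panic(err)
	}

	ci, err := api.DecodeCid("bafybeigdyrzt5sfp7udm7hu76uh7y26nf3efuylqabf3oclgtqy55fbzdi")
	if err != nil {
		panic(err)
	}

	// Each pin carries its own options; here we ask for 2-3 replicas and
	// let the cluster decide which peers hold them.
	pin, err := c.Pin(ctx, ci, api.PinOptions{
		Name:                 "example",
		ReplicationFactorMin: 2,
		ReplicationFactorMax: 3,
	})
	fmt.Println(pin, err)
}
```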
It also balances the allocations. Originally, Cluster would select an allocation based on the amount of free space reported by Kubo, but now it's a little more elaborate, because if you tag your cluster peers, you get allocations that you can distribute around the world. You can say: I want to replicate this pin in three places, but please do it so that those three places are not in the same data center and not in the same availability zone; and once you have selected places that are distributed around the world, please select the peers that do not have very high pinning queues; and among the ones that are not pinning too many things, please select the ones with the most available space. This is user-configurable to your needs.
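The ordering just described (spread over distinct tags first, then prefer peers with short pinning queues, then peers with the most free space) can be pictured with a small illustrative Go sketch; this is not the actual ipfs-cluster allocator, just a toy version of the idea with made-up names.

```go
package allocator

import "sort"

// candidate holds the metrics a cluster peer might report about itself.
type candidate struct {
	ID        string
	Region    string // e.g. a "region" tag configured on the peer
	PinQueue  int    // pins currently queued on that peer
	FreeSpace uint64 // bytes free as reported by the IPFS daemon
}

// allocate picks up to "replicas" peers, round-robining over regions so the
// copies land in different places, and inside each region preferring the
// peer with the shortest queue and then the most free space.
func allocate(peers []candidate, replicas int) []string {
	byRegion := map[string][]candidate{}
	for _, p := range peers {
		byRegion[p.Region] = append(byRegion[p.Region], p)
	}
	for _, group := range byRegion {
		sort.Slice(group, func(i, j int) bool {
			if group[i].PinQueue != group[j].PinQueue {
				return group[i].PinQueue < group[j].PinQueue
			}
			return group[i].FreeSpace > group[j].FreeSpace
		})
	}
	regions := make([]string, 0, len(byRegion))
	for r := range byRegion {
		regions = append(regions, r)
	}
	sort.Strings(regions) // deterministic iteration order

	var out []string
	for len(out) < replicas {
		progressed := false
		for _, r := range regions {
			if len(out) == replicas {
				break
			}
			if len(byRegion[r]) == 0 {
				continue // region exhausted
			}
			out = append(out, byRegion[r][0].ID)
			byRegion[r] = byRegion[r][1:]
			progressed = true
		}
		if !progressed {
			break // fewer candidates than requested replicas
		}
	}
	return out
}
```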
This is used, for example, for distributing package repositories. Someone runs a collaborative cluster and says: you can follow this; and by following this cluster you're going to get the latest pin set, which lets you pin all the packages.

Now for the internals: a cluster peer is made of a bunch of components.
Each cluster peer runs a libp2p host, and all the cluster peers form a private libp2p network between themselves. They communicate through an RPC interface, so they have RPC endpoints that they can call on each other. This allows, for example, the following: if I ask one cluster peer "what is the status of this pin?", it doesn't know the status of a pin that is actually stored by some other cluster peer, but it can use the RPC to go and ask that other peer: hey, do you have this pin, or is it still queued? It has that peer's address, so it makes the request and then comes back to the user request on the API saying: yes, this pin is queued there, and it is pinned somewhere else.
This is done through go-libp2p-gorpc, an RPC library over libp2p that we wrote. Essentially, we took the standard Go RPC library and made it libp2p-enabled, and then we added features on top. So now it's not only calling methods and getting a response: we can also do streaming, so we can pass channels to methods, stream everything over a channel, and get a channel back as a response to stream things out. It comes in quite handy and works on top of libp2p.
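A minimal sketch of that pattern, assuming go-libp2p-gorpc's net/rpc-style conventions; import paths and exact signatures are from memory and may vary between versions, and the channel-based streaming calls mentioned above are left out for brevity.

```go
package main

import (
	"context"
	"fmt"

	"github.com/libp2p/go-libp2p"
	rpc "github.com/libp2p/go-libp2p-gorpc"
	"github.com/libp2p/go-libp2p/core/peer"
	"github.com/libp2p/go-libp2p/core/protocol"
)

// StatusService is a toy stand-in for cluster's "what is the status of this
// pin?" endpoint.
type StatusService struct{}

type StatusArgs struct{ Cid string }
type StatusReply struct{ State string }

func (s *StatusService) Status(ctx context.Context, in StatusArgs, out *StatusReply) error {
	out.State = "pinned" // a real peer would check its local Kubo daemon
	return nil
}

func main() {
	ctx := context.Background()
	proto := protocol.ID("/cluster-demo/rpc/1.0.0")

	// Two libp2p hosts standing in for two cluster peers (errors elided).
	serverHost, _ := libp2p.New()
	clientHost, _ := libp2p.New()

	server := rpc.NewServer(serverHost, proto)
	_ = server.Register(&StatusService{})

	_ = clientHost.Connect(ctx, peer.AddrInfo{ID: serverHost.ID(), Addrs: serverHost.Addrs()})
	client := rpc.NewClient(clientHost, proto)

	var reply StatusReply
	err := client.CallContext(ctx, serverHost.ID(), "StatusService", "Status",
		StatusArgs{Cid: "bafy..."}, &reply)
	fmt.Println(reply.State, err)
}
```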
The idea is that each cluster peer has to agree on what the pin set is, and they all have the ability to update that pin set, sometimes very heavily if they're ingesting many pins at the same time. Therefore, in order to manage that pin set, which is a shared state among all the cluster peers, we're using go-ds-crdt, which is an implementation of Merkle CRDTs. A Merkle DAG is essentially like a blockchain, except you don't need consensus, because all the objects in that blockchain can actually be merged together.
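As a rough sketch of how that shared state gets wired up (constructor names from memory and possibly different across go-ds-crdt versions): you need a local datastore, a DAG service to store and fetch the Merkle-DAG nodes (in practice provided by ipfs-lite, described next), and a pubsub broadcaster to announce new heads to the other peers.

```go
package pinset

import (
	"context"

	ipfslite "github.com/hsanjuan/ipfs-lite"
	ds "github.com/ipfs/go-datastore"
	dssync "github.com/ipfs/go-datastore/sync"
	crdt "github.com/ipfs/go-ds-crdt"
	pubsub "github.com/libp2p/go-libp2p-pubsub"
	"github.com/libp2p/go-libp2p/core/host"
)

// newSharedPinset builds a CRDT-replicated datastore: keys written by any
// peer become nodes in a Merkle-DAG, new heads are broadcast over pubsub,
// and every peer merges what it receives -- no consensus round needed.
func newSharedPinset(ctx context.Context, h host.Host, lite *ipfslite.Peer) (*crdt.Datastore, error) {
	store := dssync.MutexWrap(ds.NewMapDatastore()) // local backing store

	psub, err := pubsub.NewGossipSub(ctx, h)
	if err != nil {
		return nil, err
	}
	bcast, err := crdt.NewPubSubBroadcaster(ctx, psub, "cluster-pinset")
	if err != nil {
		return nil, err
	}
	// The ipfs-lite peer acts as the DAG service that persists and fetches
	// the CRDT's Merkle-DAG blocks.
	return crdt.New(store, ds.NewKey("/pins"), lite, bcast, crdt.DefaultOptions())
}
```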
Of course, that means go-ds-crdt needs an IPLD DAG service, and this is provided by ipfs-lite, which is just a minimal implementation of a DAG service using the go-ipfs stack: the libp2p host plus the DHT, plus a datastore, plus a blockstore; you wrap it in a caching blockstore, you wrap it in a bloom-filter-enabled blockstore, you add bitswap to it, you get a block service, you add a reprovider, and now you have a fully functional DAG service. You can grab anything you want from any IPFS network.
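A condensed sketch of that wiring with the (2022-era) go-ipfs libraries, which is roughly what ipfs-lite packages up for you; import paths and helper names are from memory, and several of these modules have since moved (much of this now lives in boxo).

```go
package dagservice

import (
	"context"

	bitswap "github.com/ipfs/go-bitswap"
	bsnet "github.com/ipfs/go-bitswap/network"
	blockservice "github.com/ipfs/go-blockservice"
	ds "github.com/ipfs/go-datastore"
	blockstore "github.com/ipfs/go-ipfs-blockstore"
	ipld "github.com/ipfs/go-ipld-format"
	merkledag "github.com/ipfs/go-merkledag"
	"github.com/libp2p/go-libp2p/core/host"
	"github.com/libp2p/go-libp2p/core/routing"
)

// newDAGService stacks the pieces from the talk: datastore -> blockstore ->
// cached, bloom-filtered blockstore -> bitswap-backed block service -> DAG
// service that can fetch any block from the network.
func newDAGService(ctx context.Context, h host.Host, rt routing.ContentRouting, dstore ds.Batching) (ipld.DAGService, error) {
	bstore := blockstore.NewBlockstore(dstore)

	// Wrap with an ARC cache and a bloom filter so repeated Has/Get checks
	// don't always hit the datastore.
	bstore, err := blockstore.CachedBlockstore(ctx, bstore, blockstore.DefaultCacheOpts())
	if err != nil {
		return nil, err
	}

	// Bitswap exchanges blocks with the rest of the network over libp2p,
	// using the routing (DHT) to find providers.
	exch := bitswap.New(ctx, bsnet.NewFromIpfsHost(h, rt), bstore)

	bserv := blockservice.New(bstore, exch)
	// A reprovider would normally be added here to keep announcing our
	// blocks on the DHT.
	return merkledag.NewDAGService(bserv), nil
}
```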
This is essentially the list of libraries that we use from the stack. In the end, if you notice, IPFS Cluster is a distributed peer-to-peer database with just one purpose, coordinating pinning, and it takes advantage of essentially every piece that the go-ipfs stack, which is what was there as we were building this, had to offer; we added the missing pieces.
So we added the CRDT part and we added the go-rpc part, but all the other pieces made the tasks here a lot easier, because we get pubsub for free, we get content addressing and the ability to receive blocks from anywhere in the network for free, et cetera; we only had to cover the small bridges between them.

Pain points: things that bother me right now. A lot of things bothered me in the past, but right now, being a sidecar to Kubo means you pay a price.
There are many limits to fight with, like the total number of pins that the cluster can track, or the total number of pins that the cluster can ingest. So how can we ingest faster? But also, how can we increase the speed at which we pin content, and how can we increase the speed at which we add content? Those are different paths, and they have different solutions and different optimizations. How do we maximize performance? What is the right size of a cluster peer when you're hitting all those limits?
What is the right number of parallel pin and add calls? What are the right settings for Kubo, for Badger, for the file system on the nodes that you're running, for the layout? What is the right balance between disk size and memory? How many nodes do you want to run? Can you optimize and run fewer nodes, or not? These are the questions that, when you're hitting the limits, are relatively hard to resolve; you have to try something else and keep trying to improve.
As for the roadmap, we're working on an IPFS operator for Kubernetes that will deploy IPFS and Cluster swarms on Kubernetes, for the first time in a way that we officially support: not just something random that someone did, but something where we make sure it works properly and that you get Kubo daemons that are publicly accessible, correctly advertised on the DHT, et cetera. And I want to go towards embedding Kubo in the cluster peers, or running it as a secondary IPFS host directly inside the cluster peer.