From YouTube: Infrastructure Orchestration from the Ground Up
Description
Slides from the workshop: https://docs.google.com/presentation/d/1VJ09Bz4NQ6ud3XIH_0RMWrC3THqt1A-iZhoKgjo-FN8/edit#slide=id.p
The Graph is an indexing protocol for querying networks like Ethereum and IPFS. Anyone can build and publish open APIs, called subgraphs, making data easily accessible.
Follow The Graph on social media
Twitter: https://twitter.com/graphprotocol?s=20
Instagram: https://instagram.com/graphprotocol
LinkedIn: https://www.linkedin.com/company/thegraph/
GitHub: https://github.com/graphprotocol
Website: https://thegraph.com
A: Welcome, everyone, to another MIPs workshop. Today we'll be talking about infrastructure orchestration from the ground up. Overall this is quite a big and complicated workshop — the content is quite large. So, next slide, please.

A: On Tuesday we talked about indexer best practices, and most of that had to do with software configuration and the specifics of the application stack. In today's workshop we'll try to cover all aspects of setting up your indexer: from picking hardware, to spinning it up, to picking storage, all the way to day-to-day operations.
A: I do want to note that this is one perspective among many — drawn from research into the many different options out there, from running an indexer, and also from the community. I also want to welcome anyone to challenge any of the information we'll be talking about, and feel free to jump in with questions at any point.
A: Our agenda follows the structure of what you should think about when navigating a sea of options in a very complex environment. We'll start with a very quick review of the indexer software stack — this is something you've seen before; we'll just remind you, so that you know what we're talking about. Then we'll talk a bit about hardware, the decisions to make, and how to pick that hardware, and then about understanding the different types of blockchain nodes.

A: We'll then cover the importance of storage when it comes to indexing, and then move on to orchestration: you've picked your hardware and your storage, and now you need to make things happen. Here we'll talk about declarative versus imperative infrastructure. We'll also briefly touch on running containers versus self-compiling, then look at Docker versus Kubernetes, then quickly cover the tools available and give a quick intro to SRE and what that means. At the end we'll have more time for Q&A. Next slide, please. Okay — quick review of the indexer software stack.
A: Let's quickly remind you how this diagram works. As the indexer operator — the guy with the sunglasses emoji — you first have to decide which subgraphs you'll be indexing. Once you've decided that, to interact with the orchestrated indexer stack you use the indexer CLI to instruct the indexer agent on which subgraphs to allocate to, as well as their cost models; these details and metadata are stored in the Postgres indexer database.

A: The index node then stores indexed data in a PostgreSQL subgraph database. At this point the subgraph data is indexed and available for querying. In the indexer infrastructure block of this diagram you can also see the different software components that you'd be expected to run. Now, on the next slide — next slide, please — you can see which of those software components are stateful, that is, require persistence: the blockchain node and the PostgreSQL databases.

A: This is important because later on we'll talk about storage, and this is where storage comes into play. Next slide, please.
A: Then, in order to scale — to be able to maintain all of those chains — you need the stateless pieces of your application: graph-node for the index node, graph-node for the query node, and the indexer service. You need to scale those as well to sustain the capacity of multiple chains, and you would ideally scale them horizontally. Next slide, please.

A: As for the indexer agent, you will only ever need one instance of it, even when you are indexing multiple chains and as your stack grows. So, next slide.
A: Now that we've reminded you what the actual stack looks like, let's talk a bit about hardware and the decisions to make, based on the software stack that's available. Next, please. Okay — here we'll be talking about the cost of doing business.

A: The cost of doing business, because running an indexing operation is running a business, and anyone who has run a business at C-suite level will know the terms capex and opex. Capex, or capital expenditure, refers to investment in the actual physical resources behind the technology — for example installing servers, network pipes, or hardware, in the old, traditional IT world. Opex, operating expenses, are the expenses of running the day-to-day business. Think of opex as how cloud computing services are procured, and capex as the standard model for traditional IT procurement — building a data center where you own the servers. And, as Chris reminds us, capex is generally a one-off that happens at the beginning of the business.
A: Okay. Here we have a diagram of the hardware options with initial versus ongoing cost. If you decide to pick self-owned bare-metal servers, those have a really high capex, because you buy them upfront — they'll cost you a certain amount of money — and then every month from there on your only costs are things like electricity and rent, if you pay rent where you are. So: very high capex (initial cost) and very low opex (ongoing monthly cost).
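To make the capex/opex trade-off concrete, here is a minimal break-even sketch. All of the prices are made-up, illustrative numbers, not quotes from any real provider:

```shell
# Hypothetical figures for illustration only.
capex=6000      # one-off cost of buying the server outright
own_opex=80     # monthly electricity + space when self-owned
rent_opex=380   # monthly cost of renting a comparable dedicated server

# Months until the upfront purchase beats renting.
months=$(( capex / (rent_opex - own_opex) ))
echo "break-even after ~${months} months"   # prints: break-even after ~20 months
```

After that point the self-owned machine keeps winning on monthly cost — but weigh that against the depreciation and hardware-failure risk discussed under self-hosted servers.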
A
If
for
looking
at
bare
metal
cloud
and
by
bare
metal
cloud
here,
I
specifically
mean
servers
that
you
are
renting
from
a
provider.
Think
of
there
are
so
many
providers
out
there
digital
ocean.
A
I
think
someone
was
pointing
out
to
vulture
or
something
like
that
just
yesterday,
so
there
are
many
options
out
there.
So
for
that
option
for
bare
metal
cloud
where
you
are
just
granting
servers
on
a
monthly
basis,
you
have
no
capex
to
begin
with,
but
your
opex
is
will
be
fairly
high,
a
lot
a
lot
higher
than
than
the
opex
for
cellphone
bare
metal,
and
we
specifically
didn't
put
any
numbers
here,
because
the
cost
variables
depend
on
provider
depend
on
the
location.
A
Your
servers
are
hosted
in
and
on
the
hardware
spec
as
well,
and
the
last
option
is
public
cloud
which
refers
to
refers
to
managed
usually
virtual
machines
somewhere
in
a
data
center,
and
it
has
its
advantages,
we'll
cover
them
in
a
bit,
but
that
test
stands
to
have
a
higher
cost
higher
opex
cost
than.
A
A
So let's talk about managed public cloud — your GCloud, your AWS, and so on. Some of the pros: you're not exposed to hardware failure. You don't have to worry if a server fails; these platforms are usually set up in highly available ways, so even if there is a failure you don't have to fix it — someone else fixes it for you, and it resolves fairly fast. Managed public clouds usually offer an amazingly seamless user experience.

They have automated live migration of your VMs between physical servers. They provide strong SLAs — service-level agreements — for uptime and support, which will matter when we talk about latency. They're low-maintenance from the user's point of view, very easy and quick to provision, and most of the big cloud providers have really good documentation.
A: Some of the cons are limited capability and flexibility: because these are managed servers, you don't have the same level of customizability as you would with, say, a self-hosted server or a server rented from another provider. They also have very high opex — they tend to cost quite a bit more than the alternative, which is renting servers.

A: I'm just going to close the chat, because it's distracting me. I'll pause in a bit and we can go through some of the questions. Let's go to the next slide, please.
A: Okay, so then: bare-metal cloud. This is where we talk about rented dedicated servers, and there are many providers out there — I'll try not to name them, so that we don't influence people. Some of the pros: they are highly customizable, a lot more so than managed public cloud.

Again, they have no initial capex, and low opex — though the opex varies from provider to provider. You get remote control over the physical hardware: while you can't physically go and turn the server on and off, you do have remote control over it via SSH or other means. They're fairly easy to provision — not as easy as public cloud, but close.

And they're better value for money. In terms of cons: they have higher maintenance than public cloud, you're in charge of the security, and you're exposed to the risk of hardware failure. Let's go to the next slide, please. Okay — the last option here, in terms of picking hardware, is self-hosted servers. Self-hosted servers are highly customizable.
A: They have the lowest opex: as I said, once you decide to go self-hosted you only worry about electricity and rent. On the cons side, they require a lot of expertise to set up and a lot of knowledge to operate on an ongoing basis. You're also in charge of security, networking, power, and redundancy. A very important thing to mention: self-hosted servers depreciate over time, they have a high capex to begin with, and you're exposed to the risk and cost of hardware failures.

A: I'm going to stop here really quick — Chris, I see you're typing. Do you have anything else to mention that I haven't said?
C: No, no, I think you've covered it well — sorry for distracting you with all the chat messages. I just mentioned that there is one more option, co-location, which is kind of a combination of this slide (self-hosted servers) and bare-metal cloud: you basically buy the server, but then you rent space in a third-party data center to host it. So you buy the server but rent everything else — the power, the space, the networking, everything.

A: Yes, thank you. And someone else mentioned data-center connectivity — I'm sorry, I'm not going to try to pronounce your name, because I won't do it justice, but thank you for your contribution as well. Next slide, please.
A: Okay, now that we've spoken a bit about hardware, let's talk about understanding the different blockchain nodes, because there can be some confusion there and it's worth quickly covering. Keep in mind this diagram — this is where the blockchain node sits. Next slide, please.

A: First there is the concept of a light node, which is only useful for sending transactions. That means it's not useful for indexing: with a light node you cannot index any type of subgraph at all.
A
Then
we
have
a
full
node
which
keeps
all
log
history
but
prunes
everything
else
that
this
historical
state
and
other
data,
so
you
can
use
a
full
node
to
send
transaction
and
to
sub
graph
and
to
index
subgraphs
that
only
consume
events
and
don't
need
to
access
historical
data.
Then
we
have
archival
nodes,
the
archival
nodes.
Let
you
send
transaction,
let
you
sell
index
sub
graphs
that
only
consume
events,
but
also
index
sub
graphs
that
query
chain
state,
so
they
keep
everything
a
full
node
would
keep
plus
historical
state.
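One practical way to see the difference is to ask an endpoint for state at a very early block. This is a sketch — the URL and the zero address are placeholders — but on a pruned full node a request like this typically fails with a missing-state error, while an archive node returns the historical balance:

```shell
# Placeholder endpoint; point this at your own node's JSON-RPC port.
curl -s -X POST http://localhost:8545 \
  -H 'Content-Type: application/json' \
  -d '{"jsonrpc":"2.0","id":1,"method":"eth_getBalance","params":["0x0000000000000000000000000000000000000000","0x1"]}'
```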
C: I guess maybe the distinction worth making is that we're talking about EVM nodes — so that's Ethereum, Gnosis Chain, Optimism, Arbitrum, and many of the networks that will come to the MIPs program.

A: Yes, thank you very much. For the next question: does this apply to Gnosis Chain? Yes. For MIPs, what kind of Eth node will we need? For MIPs it depends on the kind of subgraphs you'll be querying — Chris?
C: Yeah — the bare minimum to be able to index a subgraph is a full node, but if you'd like to seriously compete, you probably want to be able to index whatever subgraphs have lots of curation or are receiving many queries, and in that case you'll want an archival trace node, to be able to index everything.
A: Moving on to hosted RPC services. The pros: it's really easy and quick to set up — you don't need any infrastructure expertise around setting up servers, deploying applications, and so on. The caveat is that you still need to run the other components: you're buying a hosted RPC service to replace only the blockchain node in the diagram we showed earlier. Another pro: you don't have to worry about the security of the blockchain node. In terms of cons: very, very high opex — the monthly operating expense is high.

Then, you're likely to have latency issues between the graph-node location and the blockchain node location. To put that into perspective: cumulative latency can be up to 100 times lower when graph-node and the blockchain node are on the same server, compared to using an RPC service — and, as we said in the previous presentation, latency is very important.
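As a back-of-the-envelope illustration of why that matters (the per-call latencies and the call count here are assumptions for illustration, not measurements): graph-node makes a huge number of sequential RPC calls while syncing, so per-call latency multiplies.

```shell
calls=100000      # assumed number of RPC calls to sync a subgraph
local_us=300      # assumed ~0.3 ms per call, node on the same server
remote_us=30000   # assumed ~30 ms per call, remote hosted RPC

echo "same server: $(( calls * local_us  / 1000000 )) s cumulative"
echo "hosted RPC:  $(( calls * remote_us / 1000000 )) s cumulative"
echo "ratio: $(( remote_us / local_us ))x"
```

With these made-up numbers the same sync work takes 30 seconds against a local node versus 50 minutes against a remote one — the "100 times" ratio quoted above.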
A: Another point to make: on many providers, archival state and trace data are premium add-ons — not on all of them, but on most. I had a look this morning with Chris and, for example, QuickNode has...

A: So with a hosted RPC service you're not contributing to the bigger ecosystem, but at the same time, if you seriously have the cash to spend on a hosted RPC service and you want to get started, you might look at it. But the best approach is to run your own nodes — and save money at the same time. You may also need to use different providers for different chains: for example, Infura and Alchemy do not support Gnosis Chain. So not every chain is supported by every provider.
C: Thanks, Ana — next slide, please. I'll also note there are a few people on this call who have far greater storage expertise than me, so please do chip in; this is the only slide we have, so there's definitely a bit of time to discuss other perspectives when it comes to storage. We've already seen from the diagram that there are three components in particular that are stateful and where storage matters. Two of those components are particularly performance-sensitive: the blockchain node, which synchronizes with the chain and keeps up to date with all of the latest blocks, and the Postgres database that holds the subgraph data. For both of these use cases, high-speed physical storage media is really high-leverage — it makes a very big impact — and generally storage is one of the most important and largest cost components of running the infrastructure, so it's worth thinking about.

For blockchain nodes in particular, any storage solution that offers high random-IOPS performance is going to be particularly well suited, because as blockchains sync they do a lot of random state accesses. So yeah — random I/O performance is important.
C: Similarly, when subgraphs are indexing, they're constantly writing to the database, and at the same time you're issuing queries against the database as queries come in from the gateway. Again — and just to recap, I'm talking now about the Postgres DB storage that actually holds the subgraph data — this is an area where high-performance storage has a high impact: it lets you both index subgraphs faster and answer queries faster. So that's the physical media. I see XFS is mentioned in the chat — it's a file system I've personally never used; please do check it out. And actually, yeah — I'm hesitant to try saying your name for fear of butchering it.
C: But if you're open to saying a few words about XFS, please do — I'll just get to the end of the next section first. Other than the physical media, the other thing that matters is how you actually use these devices, particularly for some of the larger chains. For Gnosis Chain, for example, a full archival trace node is, I think, on the order of three to five terabytes, depending on whether you're using compression and things like that. So you may not have a single physical disk that can hold all of that state — and with a single disk you obviously have a single point of failure.

So generally it makes sense to use some technology to aggregate individual physical disks into a larger volume, and at the same time get some other features around redundancy — the ability to tolerate a drive failure. RAID or LVM are great beginner options for that, but ZFS is also a fantastic option that comes with some other benefits as well.
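For a flavor of what that looks like with ZFS — the device and pool names are placeholders, and you'd want to experiment on scratch disks first:

```shell
# Mirror two NVMe drives into a pool named "tank" (placeholder devices).
zpool create tank mirror /dev/nvme0n1 /dev/nvme1n1

# Alternatively, raidz2 across six drives survives any two of them failing:
#   zpool create tank raidz2 /dev/sda /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf

zfs set compression=lz4 tank       # cheap compression, discussed below
zfs snapshot tank@pre-upgrade      # instant point-in-time snapshot
```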
C: LVM and ZFS both offer quality-of-life features like snapshotting that are definitely worthwhile, particularly when it comes to the operational tasks associated with your indexer — taking backups, moving nodes between machines, things like that. Having those features really does make a difference. And yes, we do actually have a couple of people running Btrfs in the ecosystem — Cohen is one of them; I don't know if he's on the call. At GraphOps we're using ZFS, and I know a few people use it quite successfully.
A: And Vince from Notify said that Btrfs leaves performance to be desired. Vince, would you like to say a few words, if you've used it?

C: He can't speak — no worries; thanks for sharing your perspectives anyway, Vince. Not to put people on the spot — it's totally okay if you can't speak — but if Jim has any thoughts... Jim's a storage master.
B: That's not true — I'm not a storage master; I'd say I'm a storage enthusiast, maybe. Can you hear me okay?

B: You know, the one thing I think is interesting about ZFS is all the features that come with it — I think you mentioned that, Chris, but it really can't be overstated how often features like snapshotting, and the fact that it's a copy-on-write file system, save you. If things go really wrong — some kind of really awful thing happens on your server — ZFS generally has your back, because it's copy-on-write.
B: If you were to lose power suddenly, the likelihood of actually corrupting something in the pool without it auto-correcting is super, super low. And then of course there's snapshotting, and compression — compression is free space and free performance on ZFS; just enabling it, even with uncompressible data, can improve your performance and IOPS quite considerably. But as with all things, ZFS is not magical, right?

B: It doesn't really matter what type of low-level file system you decide to go for — there's always going to be that trilemma. There are three things you're planning for, trying to architect for. The first is capacity: how much usable space will I have on my arrays?
B
How
performance
is
my
array?
Is
it
you
know?
Is
it
suitable
for
a
really
high
read
read
streaming
intensity?
Is
it
is
it
performant
for
random
rewrite?
Is
it
going
to
match
the
types
of
workload
that
I
need
to
do
with
my
stack
right
and
then
the
last
one,
of
course
is
integrity?
B
How
much
of
your
hardware,
your
very
expensive
micron!
You
know
nvme
u.3
ssds,
that
you
paid
huge
amounts
of
money
for
or
huge
amounts
of
to,
rent.
How
many
of
them
do
you
want
to
sacrifice
in
the
you
know,
for
the
sake
of
making
sure
that
you
have
a
very
resilient
zfs
pool,
you
know,
do
you
want
to
do?
Do
you
want
to
mirror
your
drive,
so
you
have
an
exact
copy
or
multiple
exact
copies?
B: Do you want to do RAID-Z2, where maybe you can afford to have two drives in part of the pool fail? Or do you want to go balls-to-the-wall, forget about all of that, and just go for pure performance — put loads of drives in parallel and take a really serious hit to your wallet?

B: Those are all things to consider, and what I'd say is: if you're really serious about this, you want to play with this stuff — even if you're going down the co-hosting or the cloud route, you want to have something at home where you can play with these things, experiment, benchmark, so that you make all your mistakes on a system that does not matter. Take some cheap drives, some cheap SSDs, and play around with them in an old computer at home.
B: Don't go straight into production with what you think is going to work, because ZFS lets you organize things in a near-infinite number of ways — same with any sort of low-level file system. Don't go in assuming your theory is right, and don't just believe what the benchmarking tool is telling you — fio is the tool we often use for benchmarking. Do some experimentation at home before you spend the big bucks, either on your own servers or on servers you're renting in the cloud.
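For reference, a typical fio run for the random-read pattern discussed earlier might look like this — a sketch only; the file path, size, and runtime are arbitrary choices, and per Jim's advice, treat any single benchmark number with skepticism:

```shell
# Random 4k reads at queue depth 32, roughly the access pattern of a
# syncing blockchain node. Creates a 4G scratch file at the given path.
fio --name=randread --filename=/tank/fio-scratch --size=4G \
    --rw=randread --bs=4k --iodepth=32 --ioengine=libaio \
    --direct=1 --runtime=60 --time_based --group_reporting
```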
C: Yeah — thank you so much for sharing, Jim, some great stuff in there. A couple of questions in the chat; at least some of them have been answered there, but Christopher from Lemonade asked: wouldn't compression decrease throughput and thus negatively affect performance? It's really interesting to see how ZFS impacts workloads — and ZFS with Postgres in particular.

C: If your queries are reading large amounts of data, compression can actually significantly increase your query performance. Compression is obviously not free — there's a CPU trade-off you're making — but if you're pulling large amounts of data off the disk, the benefit of reading and writing less data can outweigh the CPU cost. So yeah.
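A commonly cited starting point for a Postgres dataset on ZFS looks something like the following — a sketch only; the pool and dataset names are placeholders, and the right values depend on your workload, so benchmark before adopting them:

```shell
# Postgres writes 8k pages; a small recordsize limits write amplification,
# lz4 trades a little CPU for less physical I/O, and atime updates are noise.
zfs create -o recordsize=16k -o compression=lz4 -o atime=off tank/postgres
```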
C: There are many configurations and, as is being said in the chat, with ZFS you often want to tune it per workload if you want to eke out maximum performance. One last thing I'll say: people make a career out of managing ZFS — and indeed out of many of the components in this stack — but you don't need to run some crazily optimized ZFS setup in order to be a successful indexer. There are many indexers that operate well with just a bog-standard ext4 file system. One of the fantastic things about this ecosystem is that there's so much room for technical growth — don't feel you need to go all-in at every layer from day one; it's something you can level up on as you get more comfortable with the stack. Okay — if anyone has anything else to say about storage, please say it now; otherwise I'll hand back.
G: Sorry — I'm really sorry to interrupt, but I have a question. I'm wondering a little bit about the pattern of reads and writes. Is it consistent, because blocks are produced consistently, or does it follow more of a burst pattern?
C: It's hard to answer precisely — there are aspects of the workload that are consistent, and aspects that are highly variable. If you're indexing a large number of subgraphs, then with each block that comes in your graph-node is consuming and transforming data and saving it into the database, so there's going to be some consistent write workload that scales roughly linearly with the number of subgraphs you index.

C: The side that's less easy to answer is the query side, and that's not just a matter of what demand exists in the market for different types of data — you can also express different preferences to the market about which queries you serve, using cost models. For example, you might want to optimize for serving queries that are otherwise massive and super-expensive for other indexers to run — maybe they have to scan over very large amounts of sequential data.
A: ...how you're going to spin it up, how you're going to automate the provisioning, and so on. So, next slide, please.
A: Here we talk about infrastructure as code. I'm sure many of you have heard this term; basically, it means that instead of going around clicking, or running one-off commands ad hoc to set something up, you have everything written in code — whether that's YAML with something like Ansible, or Terraform with HCL, or even bash scripts. It still counts as infrastructure as code as long as you have a repeatable way of recreating the same thing.
A: With imperative infrastructure, by contrast, you specify a list of commands to run to create the resources you want. Imperative infrastructure can be bad here because it can introduce configuration drift and because it's very specific — for example, running a command with the AWS CLI.
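To make the contrast concrete: a declarative tool is given the desired end state and converges on it, rather than replaying commands. A Terraform-style sketch — the resource type and arguments here are invented for illustration; real providers each define their own:

```hcl
# Hypothetical resource; real cloud providers define their own types.
resource "example_server" "indexer" {
  name   = "indexer-0"
  plan   = "16c-64g-2tb-nvme"
  region = "eu-west"
}
```

Applying this twice changes nothing the second time — the tool reconciles its recorded state with reality, which is what prevents configuration drift.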
A: As Chris said, we strongly recommend the approach of version-controlled, declarative infrastructure as code. Examples of declarative infrastructure-as-code tools: Terraform, which maintains a state file and is very widely used — I think it's the preferred tool for many SREs when it comes to provisioning infrastructure — and CloudFormation, which is an AWS product. My personal opinion is that Terraform is much better than CloudFormation. Then there's Ansible, which is configuration management but can be used for infrastructure as code as well. Again: use version-controlled, declarative infrastructure as code to provision your infrastructure.
A: Any questions about this before we move to the next slide?

A: As Chris said, Launchpad — our Kubernetes toolkit for indexers — is built on Ansible, Kubernetes, Helm, and helmfile, which are all declarative infrastructure as code. In case you're wondering why Ansible and not something else: we're only using the Ansible part to install Kubernetes across your different nodes, plus a few other bits of initial setup, once you've installed the operating system. There will be a link for Launchpad in the next few slides.
A: Okay, next slide, please. I wanted to quickly address the whole Docker-versus-Kubernetes debate, and there are many ways to address it. First: Docker is a suite of software development tools for creating, sharing, and running individual containers, where containers are standardized packages for microservice applications with all their needed dependencies inside. Kubernetes, on the other hand, is a system for operating containerized applications at scale, where an application consists of many containers; to orchestrate them in a production environment — over many servers, many data centers — you'd use Kubernetes.
A: What I want to point out here is that "Docker" has recently become the word people use when they talk about containers at all. Think of the Google example: we don't say "I'm going to do an internet search", we say "I'm going to google this". That's the equivalent of what people mean when they talk about Docker — they just mean containers. But the reality is that there are many Docker tools: there's Docker for Mac, there's Docker Compose, there's Docker Swarm, there's Docker as a container runtime, and so on.

A: Colson — IaC means infrastructure as code; I should have made that clearer.
A: So, going back to Docker versus Kubernetes: a fair comparison would be Docker Swarm versus Kubernetes, since Docker Swarm is also a system for operating containerized applications at scale. As Chris is saying, the key difference is that Docker by itself doesn't provide any host abstraction or clustering, but Kubernetes does. Docker Swarm does provide host abstraction, but it has been deprecated, and there's also Docker Compose, which doesn't provide clustering either. Okay, next slide.

A: Maybe it would have been better if I'd switched these slides around, but here we are. We've talked about containers, but let's quickly touch on self-compiling. Your options are running containers — which run from an image, a specification of what you're going to install in that container and how — or self-compiling applications...
A: ...self-compiling packages on an operating system. Using containers is highly recommended, because containers come with a 100% reproducible environment, and because it's very hard to replicate the same behavior across many different servers and nodes outside of containers — especially when different machines and environments have different dependencies, and especially when running on hardware sometimes means there's something really specific to one machine that you don't see on the others.

A: So again: please run containers instead of self-compiling. We know that Cohen, I think, self-compiles everything — I'm not sure if he's here to comment, but maybe we'll get his comments in the future, to have a more balanced view of the pros and cons. Next slide, please.
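As a minimal illustration of the container route, here is a fragment of a Docker Compose service definition. This is a sketch, not a working setup: a real graph-node container also needs environment variables pointing at Postgres, IPFS, and your chain RPC, which are omitted here.

```yaml
version: "3"
services:
  graph-node:
    image: graphprotocol/graph-node:latest   # pin a specific version in practice
    restart: unless-stopped
```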
A: Okay. We were initially going to do an intro to SRE and day-to-day operations in general, but we've run out of time. Still, even the fact that you have to think about running infrastructure as code, and running everything version-controlled — even that is day-to-day SRE. You also want to make sure that you have monitoring systems.
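As a sketch of where a monitoring setup can start, here is a minimal Prometheus scrape config. The target host names and ports are assumptions about your deployment — check which metrics port your graph-node build actually exposes:

```yaml
scrape_configs:
  - job_name: graph-node              # application visibility
    static_configs:
      - targets: ["graph-node:8040"]  # assumed metrics port
  - job_name: node-exporter           # hardware/OS visibility
    static_configs:
      - targets: ["indexer-host:9100"]
```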
A: You want visibility into what your applications are doing, but also into how your hardware is behaving. So, to wrap up this whole conversation, a reminder of the different indexing stacks available. There's the Docker Compose setup from StakeSquid, which has recently been updated to support MIPs. However, the limitation of the Docker Compose setup is that you can't scale it beyond one host — one server — so when it comes to supporting multiple chains, you can only get so far. Then, soon, we'll be revealing the Kubernetes Launchpad from GraphOps, which a lot of work has gone into. The purpose of it is that you give it an IP address and a way of connecting to your servers, and it takes those servers and creates clusters for you.
C: Maybe — I don't know whether the FAQ will be in the recording. We'll hand it back to Abel — over to you, Abel.

E: Sure thing, yeah. I'd say we should keep recording through the FAQ. People should feel free to drop off if you're busy, but I think it's valuable, so we'll keep the recording going for the FAQ. So keep going — but obviously try not to keep it for too long.
C: Yeah — and indeed, to everybody who's shared their opinions and perspectives on the call and in the chat: I really appreciate the diversity of perspective; keep contributing. I hope everybody can feel that that's a big part of this community.

A: What is VictoriaMetrics? I've never heard of that.
F
It allows you to horizontally scale your storage for metrics, and it's more efficient and faster. It doesn't require as many resources as Prometheus and is less CPU-intensive, so it's easier to use.
A
And have you used that?
F
Yes, there's the StakeMachine stack that has been running it; I've seen it since genesis. We don't use Prometheus; it's fully reliant on VictoriaMetrics storage.
A
And okay, cool. So, most of the Helm charts, and in general the Kubernetes setups out there for different applications, usually support Prometheus out of the box.
Do you need to do a migration, or are you able to use Prometheus metrics with VictoriaMetrics?
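For context on that question: VictoriaMetrics is designed as a drop-in Prometheus replacement. Exporters keep exposing metrics in the normal Prometheus format, and either Prometheus itself or vmagent scrapes them and remote-writes into VictoriaMetrics, so no application-side migration is needed. A minimal sketch (the hostname is a placeholder; 8428 is single-node VictoriaMetrics' default port):

```yaml
# prometheus.yml fragment: forward everything Prometheus scrapes
# into VictoriaMetrics over the standard remote_write protocol.
remote_write:
  - url: http://victoria-metrics:8428/api/v1/write
```

Grafana can then point at VictoriaMetrics as a Prometheus-type data source, since its query API is PromQL-compatible.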
F
A
So now we have a name for you: we'll call you StakeMachine. Okay, anyone else? Any questions, anything?
E
Everyone should have the ability to unmute, so feel free to do so. It looks like we have a comment in the chat, so, Ana or Chris, feel free to grab that.
C
I'll grab that. Hey Dan, hoping your name is Dan; it's not a huge stretch, right? So the question is: as a newbie, I'm confused as to whether the intention is for MIPs participants to run through every chain that is specified in the program, or, if you think you can only support one chain, whether you should just pick one and forget the rest. You can participate in whichever chain phases you would like to.
C
As far as I understand, and I'm sure this won't be new information: through the MIPs program there will be a number of phases, and you can think of each chain as having its own phase. A phase has a cycle that starts at testnet, and so MIPs participants are expected to participate in testnet and meet mission and QoS objectives there (QoS is quality of service, for anyone unsure). Then, once testnet QoS gets to a certain threshold...
C
But each chain phase is basically largely independent from any other.
C
So you can pick the chains that you want to participate in the program for, but the important part is that you finish the complete phase, so you get from testnet all the way through to mainnet for every chain. And as for whether it makes sense to set up and then tear down for every chain phase: to me that doesn't make sense, because the objective of the program is obviously, you know, to add sustainable indexers to the network.
C
Follow-up question from Dan: I might not be ready for the Gnosis chain, but I can start getting things ready in order to jump on a future phase. Yes, that's correct, so long as you applied before the deadline and submitted all your KYC. But yeah, you can pick the chains that you would like to participate in.
A
Any more questions? And since we're waiting for questions, just a request for feedback: what was missing from this presentation? Obviously, there are more workshops coming up, but is there something that you need more help with in terms of understanding everything?
A
The name of the node: it's an archival trace node, to index all sorts of subgraphs, Colson.
C
Maybe just to clarify: the node client is Nethermind, but you can run Nethermind in those modes that were covered, so as a light client, as a full client, as an archive, etc. So you want to run Nethermind for the Gnosis chain, and if you would like to support any subgraph you need to run it in archival trace mode, which, yeah, I'm sure there is documentation on, and we will actually cover this in our Launchpad workshop.
C
But yeah, you just pass a config flag to Nethermind saying xdai_archive, and that should be what you want.
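To make that concrete, a sketch of the invocation using Nethermind's predefined config selector (the data directory path is a placeholder, and config names have changed across releases, so double-check against current Nethermind docs; newer versions name this config gnosis_archive):

```shell
# Start Nethermind with its predefined xDai/Gnosis archive configuration,
# which enables the archive + trace support needed for indexing subgraphs.
./Nethermind.Runner \
  --config xdai_archive \
  --datadir /data/nethermind-gnosis
```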
A
And is there a way to sync your Gnosis archival node without having to wait a few days? I don't think there is, unless you're using an RPC node. But again, we are not recommending using RPC providers.
B
Someone the other day said that they were uploading a snapshot; I think it was Sun Tzu. I don't know if he's on the call, but maybe one of the community members might be providing a snapshot; we'll see. There was also some talk about, I think, Gnosis providing them, but whether that actually came to be or not, I'm not sure.
C
Yeah, there are a few efforts to organize snapshots. Honestly, one of the biggest challenges with snapshots isn't so much making them, it's distributing them, because, yes, as StakeMachine said, a 1.7-terabyte snapshot is just a lot of bandwidth to pay for if 100, 150, 500 people are downloading it.
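To put that distribution cost in perspective, a quick back-of-the-envelope calculation; the per-GB egress price here is purely an assumed illustration, since it varies widely by provider:

```python
# Rough egress estimate for serving a 1.7 TB snapshot to many indexers.
snapshot_tb = 1.7      # snapshot size in terabytes (from the discussion)
downloaders = 150      # mid-range of the 100-500 figure mentioned on the call
egress_per_gb = 0.05   # assumed USD per GB of egress; illustrative only

total_tb = snapshot_tb * downloaders            # total data served
cost_usd = total_tb * 1000 * egress_per_gb      # using 1 TB = 1000 GB

print(f"{total_tb:.0f} TB of egress, roughly ${cost_usd:,.0f}")
```

Even at the mid-range estimate that is hundreds of terabytes of egress, which is why distribution, not creation, is the hard part.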
C
A
And to all of these questions... question two: is there a recommended list of regions where we should start the servers? It's hard to provide recommendations on locations, especially because once we start recommending locations, the chances are that most indexers might go towards the same location.
A
C
Yeah, no, I think spot on. You know, we don't really want to recommend locations, but it's not rocket science: think about where the major markets are for cryptocurrency. It's not a stretch to think that the United States generates a lot of query demand.
C
Also, parts of Asia generate a lot of demand, so you can kind of just think about it from that perspective. It's also worth saying that GraphOps, in collaboration with Edge & Node, is working on a gateway quality-of-service oracle, and this will, you know, give all of us a bit more quantitative data to understand.
C
A
So Lex Crime has shared StakeMachine's map with the indexer locations, and he's saying to look at this map and choose the location with the lowest concentration of indexers, but also remember that...
A
Oh sorry, I don't know what I was going to say; my mind went blank.
A
StakeMachine is also saying that during MIPs you will get synthetic load, so latencies to gateways will be very, very important.
B
Look at the map and you'll get that joke.
E
I think so too, so I think it might be a good opportunity to wrap up. Thank you, everyone, for all of your participation. This has certainly been a very educational and informative conversation. So yeah, thank you, Ana; thank you, Chris; thank you to everyone. This has been an absolute pleasure, and we look forward to seeing you all on Tuesday for the next IOH/MIPs workshop. Thank you all. Take care.