From YouTube: The Graph's Core Devs Meeting #6
Description
The Graph’s Core Devs Meeting #6
This video was recorded: Tuesday, August 3 @ 8am PST, 2021.
Sections:
0:00 Intro
1:15 Introducing Figment to The Graph community
3:50 GIP & Governance Updates
19:45 Streaming Fast- Integration update & Firehose
27:35 Semiotic- Automated prosocial negotiations in The Graph protocol
45:05 Q&A
The Graph's Media:
Twitter: https://twitter.com/graphprotocol?s=20
Instagram: https://instagram.com/graphprotocol
LinkedIn: https://www.linkedin.com/company/theg...
Website: https://thegraph.com
A
Very good morning, good afternoon, everyone, wherever you are. Welcome to Core Devs Meeting number six. Today we have an exciting agenda with a lot of interesting subjects, and we will start out by sharing with you the latest announcement on the grant with Figment, who have joined us as the third core dev team at The Graph.
B
Awesome, thank you so much, Oliver. I just wanted to take a few minutes to discuss with the community how excited we are about decentralizing our core developers and our ecosystem. We launched The Graph with the initial Edge & Node team, StreamingFast has now been welcomed in, and now Figment, and I couldn't be more excited to see the diverse caliber that each of these teams brings. I think it's very easy to think we're all competing, or we're all self-interested, but there is a reality where you can align multiple teams around one mission on a protocol, and what Edge & Node, StreamingFast, Figment and, you know, Semiotic and Sommelier all bring is very unique. I think it's really going to set The Graph forward for the long term. So with that we'll hand it back to Oliver.
A
Yeah, I would like to give Joseph an opportunity to just say a few words from the Figment side.
C
Hello, everyone. I'm Joseph, and I am the core dev PM at Figment, and, as was just said, we are super excited and happy to be joining forces with The Graph. For the last couple of months I've been working alongside some community members, and that showed me how special this community is and how lucky we are to have it. So, as you know, the decision about our 5 million grant was the easiest decision we've made, and one of our proudest.
C
We are really happy to be joining forces here, and just to give you a bit of color on what we're going to be bringing to The Graph ecosystem: for the last couple of years we've been indexing multiple blockchains, and, as you know, The Graph as of today is indexing Ethereum. So we're going to take this process and the lovely technology that everyone loves, and bring it to other blockchains to enable more applications to be built in Web3.
C
We really look forward to doing this, and for everyone interested in knowing more and taking a deeper dive into our work, we invite you to join Indexer Office Hours, which is happening in two hours. Our lead engineer will be there to give you a deep dive, and you can see the amazing work that we're going to be bringing. Exciting times.
A
Thank you, awesome, and welcome. For those of you who don't know about Indexer Office Hours, they happen in Discord; there's a dedicated channel there, Jim Cousins hosts them, and today should be a very exciting session, with an hour dedicated to this announcement, so tune in; I highly recommend joining. Okay, let's move on to GIP and governance updates. You can see on the screen everything that is currently in the pipeline.
A
What
is
mostly
actively
discussed
in
the
community
is
around
curation
and
no
surprise.
We've
just
launched
curation
last
month
and
there's
a
number
of
things
that
are
different
stages
of
of
the
discussion
and
I'd
like
to
start
with
the
batch
gns
transactions,
which
is,
in
other
words,
the
deploy
and
signal
proposal,
which
has
also
been
reviewed
by
the
council
last
week.
And
you
can
see
that
in
the
forum
notes
and
it
is
fairly
far
along.
D
Hello, and thank you for being here for another core devs call. As Oliver said, this is one of the pieces of feedback we received from curation, now that many people are creating subgraphs and curating subgraphs.
D
Today it is possible to create a subgraph and then have someone else signal on it before the app developer does. So there's a PR that is open now in the repo, and anyone can take a look at it, that batches different GNS transactions by using a multicall.
D
That means you can batch an arbitrary number of different transactions: you can signal on multiple different subgraphs, and at the same time it enables deploying and signaling together. So I think it's a pretty good addition to the GNS, but I invite everyone to take a look at the implementation and send any feedback on the PR. Okay.
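To make the batching idea concrete, here is a minimal, purely illustrative Python sketch of the multicall pattern from a client's point of view; the `Call` structure, the placeholder address, and the encoded-call descriptions are assumptions for illustration, not the actual GNS or multicall interface.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Call:
    """One ABI-encoded contract call to execute inside the batch."""
    target: str      # contract address, e.g. the GNS
    calldata: bytes  # ABI-encoded function call

def multicall(calls: List[Call]) -> List[Call]:
    """Group an arbitrary number of calls so they can be submitted as a
    single transaction instead of one transaction per action."""
    return calls

# Hypothetical usage: publish a new subgraph version and signal on it in
# the same transaction, so nobody can signal in between the two steps.
gns = "0xGNS..."  # placeholder address
batch = multicall([
    Call(target=gns, calldata=b"<encoded publish-new-version call>"),
    Call(target=gns, calldata=b"<encoded mint-signal call>"),
])
```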
D
The other one is something that has also been raised in some of the feedback from the community: changing the ownership of a subgraph. It happens with some teams that they used a different address and wanted to move a subgraph to a multisig or some other place, or people created a subgraph and wanted to transfer it to the official team, but that cannot be done easily today.
D
So there's an investigation where we are looking into an implementation of ownership transfer for subgraphs, so that whenever you create one, you can easily transfer it to someone else. These two are the main items around curation: one is already implemented, the other is a candidate implementation where we are looking into the best way to implement it. And then we have the stable delegation yield, which is something we talked about in the previous call; it's under review by indexers and delegators, it's already audited, and it's also in a PR in the repo. Okay, now I think Brandon is going to talk about the curation tax, right, Oliver?
A
Yes, that is correct. This proposal is new; Brandon has posted it in the forum over the last month. So Brandon, why don't you quickly introduce what this proposal is about?
E
Sure. Yeah, so this proposal, funnily enough, is to lower the curation tax, which, as I'm sure most of you are aware, is a tax paid on deposits to the bonding curve for a subgraph. The impetus for writing this was feedback that came to me from a number of places that the total costs to a subgraph developer were a bit onerous, especially with respect to upgrades and changing versions.
E
What's interesting about this proposal is that the GIP for it, which you can find in the forums, also outlines a previously undisclosed economic attack that we call the subgraph withholding attack; one of the primary functions of the curation tax was making this attack unprofitable. Basically, the way this attack goes is that an attacker signals on a brand new subgraph and is then the first and only indexer on that subgraph, because they do not actually release the subgraph manifest for that subgraph to the network.
E
So no other indexer can come and index it; they monopolize indexing rewards on that subgraph for as long as the attack goes on, and other indexers are unable to come in and dilute the profits, if you will, from that attack. There's a mitigation for this attack that we've already had in the protocol basically from day one.
E
And then the curation tax also mitigates this attack by essentially penalizing someone for trying it, because the only cost of trying this attack, in addition to the gas costs, is the curation tax. Right now we've monitored the subgraph oracle that's running on the live network, and it catches most attacks within about eight to nine minutes, which is quite good.
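As a rough illustration of why the curation tax and a fast oracle together make the withholding attack unattractive, here is a back-of-envelope sketch; every number below is made up for illustration and is not a protocol parameter.

```python
# Back-of-envelope sketch of the withholding-attack economics.
# All numbers are illustrative, not protocol parameters.
signal_deposit = 10_000          # GRT the attacker self-signals on the hidden subgraph
curation_tax   = 0.025           # 2.5% tax paid on the deposit
detection_time = 9 / (60 * 24)   # ~9 minutes until the oracle disables rewards, in days
daily_rewards  = 100             # hypothetical indexing rewards per day on that subgraph

attack_revenue = daily_rewards * detection_time   # rewards monopolized before detection
attack_cost    = signal_deposit * curation_tax    # curation tax (gas ignored here)
print(f"revenue={attack_revenue:.2f} GRT, cost={attack_cost:.2f} GRT, "
      f"profit={attack_revenue - attack_cost:.2f} GRT")
```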
E
So we have quite a bit of margin of error in our analyses, and with respect to that attack specifically, we could lower the curation tax from where it is today in the network. I think Josh, I believe from StreamingFast, pointed out that that doesn't necessarily prove that we should lower the curation tax, and there might be other reasons to think that this could have knock-on effects.
E
One knock-on effect, for example, would be that if indexers in equilibrium had a baseline level of curation tax that they wanted subgraph developers to pay, it could just be that the amount signaled per subgraph increases in response, and in absolute terms the amount that's paid as curation tax doesn't end up changing much. I suspect that wouldn't be the dynamic, but there are counter-arguments to be made here if you want to play devil's advocate.
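To make that devil's-advocate argument concrete: if indexers effectively require some fixed absolute tax payment per subgraph, a lower rate could simply be offset by more signal. A tiny sketch with hypothetical numbers:

```python
# Hypothetical equilibrium: indexers "expect" a fixed absolute curation-tax
# payment T per subgraph, so lowering the rate just raises the signal needed.
T = 250  # GRT of curation tax effectively required (made-up number)
for rate in (0.025, 0.01):
    print(f"tax rate {rate:.1%} -> equilibrium signal = {T / rate:,.0f} GRT")
```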
E
I think this is definitely one that needs a lot more input in the forums, and we'll probably end up doing a Snapshot vote. I'd especially like to see input from curators and subgraph developers on how you feel about this change, given that there could be dynamics that increase the equilibrium signaling amount, which decreases the curation tax relatively speaking but could increase the cost of capital and opportunity cost. So yeah, we can follow that up in the forums, and feel free to ask questions in the Q&A.
A
Yeah, thank you for that. That's a great segue into the next topic, which is the dynamic curation tax. Let me just set that up. We have probably one topic right now in the curator community that seems to be the most burning one, and it presents itself as an issue where we have bots front-running the bonding curve. The incentive seems just too high right now to instantly signal without doing a curator's job, which is to validate the quality and economic value of a subgraph, and there have been a number of discussions around that.
F
One is the front-running part, and also, as Oliver says, being first on the bonding curve carries the highest amount of potential reward while also carrying the least amount of risk, so it encourages a behavior where curators are only trying to be first on the bonding curve without doing their job of assessing the subgraph. And then the second challenge is the high volatility around launch, where people lose interest and it becomes less engaging, because with this high volatility around launch some people get burned and some people earn.
F
The dynamic curation tax seeks to address all three of these challenges, and right now it is being defined and heavily scrutinized to see if it is something that is interesting to present in a more refined form to the community. That's where it is right now.
A
Very good. For those who haven't read the proposal: Brandon just talked about the curation tax and lowering it from two and a half percent to one percent; let's call that the steady-state curation tax. What Slimchance is proposing applies at the time a subgraph is newly deployed.
A
We start out with a curation tax that is not one percent (per the new proposal) or the current two and a half percent, but a value that is much higher, call it something like 50 percent, and what it should do is deter the wins to be gained from very quickly signaling on the bonding curve. Then, as time goes by, with each epoch that curation tax goes down.
A
There are two proposals out there in terms of how that curve would look. There's one with a straight line where it reduces, I think, by seven and a half percent every day until it hits that two and a half percent, which I think is around day seven, and from that point it just stays at two and a half percent. So as a curator, first of all, it's supposed to give you time to do your job, which is one thing we currently don't seem
A
to be able to provide, and then, depending on how strong your confidence in that subgraph is, you can decide at what point in time to move in, knowing that the earlier you move in, the higher the taxation will be. So you start to create a risk-reward profile, so to say, based on the time that you join. I find it certainly interesting; provide feedback in the forum.
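For readers who want the decaying-tax idea in one place, here is a minimal Python sketch of the straight-line variant described above; the 50 percent starting value, the 7.5-percentage-point daily decay, and the 2.5 percent floor are the rough figures mentioned on the call, not finalized parameters.

```python
def dynamic_curation_tax(days_since_deploy: float,
                         initial_tax: float = 0.50,    # rough figure from the call
                         daily_decay: float = 0.075,   # 7.5 percentage points per day
                         steady_state_tax: float = 0.025) -> float:
    """Straight-line decay: start high right after deployment and step down
    each day until the steady-state curation tax is reached."""
    return max(initial_tax - daily_decay * days_since_deploy, steady_state_tax)

# A curator weighing when to signal can read off the tax they would pay:
for day in range(8):
    print(f"day {day}: {dynamic_curation_tax(day):.1%}")
```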
A
There are going to be more discussions that I know Slimchance is having within the curator community. To me this seems to be the most pressing issue right now, and this proposal seems to be the one that addresses it in the best way possible. If there are other ideas out there, please come forward and share them with us in the forum, and if you like this idea or have other thoughts on it, please provide feedback in the forum as well.
A
There's one other item on the list for GIP and governance that I missed putting on here; Brandon just reminded me that it's the arbitration charter. Do you want to give us an update on that, Brandon?
E
Yeah, this one will be quick; I just want to make sure folks are kept in the loop. Basically there are two main pieces of engineering work that are kind of blocking fully adopting the arbitration charter and, like, the subgraph feature versioning in the GIPs. One is actually tweaking graph-node to produce a valid POI in the case of a failed subgraph; right now it just returns null. This was a change to the arbitration charter that came out of discussions in the forums, where
E
basically the community reached a consensus that it did make sense for indexers to be able to receive rewards for failed subgraphs, but right now graph-node just isn't equipped to produce POIs for failed subgraphs, so there's work ongoing right now to change that. The other piece is feature detection. We have a GIP that the arbitration charter depends on that outlines how to tell if a subgraph is supported in the protocol for indexing rewards, disputes and other features, but there's currently no way of actually detecting those features.
E
So the graph-node team is working on feature detection for a number of different features that will impact support in the protocol, and I think you can also expect some of that feature detection to be incorporated into some of the ecosystem tooling, like Subgraph Studio and the Explorer, where you would be able to see in the UI, hey, this subgraph uses unsupported features that may make it ineligible for indexing rewards, which is something an indexer would want to know up front.
E
That subgraph oracle we mentioned earlier, another one of its jobs is to disable indexing rewards for subgraphs that are ineligible for those rewards, and so, as an indexer, before you do the work of spending hours indexing something, that's helpful context to have. Both of those are coming in the near term, so I would say just stay tuned for that; you should probably have more updates by the next core devs call.
A
Very interesting, okay, thank you. Now, from Alex, the CTO at StreamingFast.
G
Happy to have recently joined this ecosystem to bring our focus, let's say, on increasing indexing performance; that's the short-term aspect. So, on integration progress, we can say that we have worked hard on bringing a few things directly into graph-node recently. We have a few PRs up and running there, upgrading libraries, some very low-level work.
G
Actually, some futures and tokio library updates are there to enable the next stage of performance increases in graph-node, so we've been doing some work there, and at the same time integrating the Firehose as a source of data. Maybe I could just switch over to the Firehose quickly here. We've posted on the forum; if you go to the forum and search "firehose", I think it's just a sticky name, right; it's a nickname.
G
It's a pattern rather than a product, because it's your product too; it's our thing together. We know what we mean when we say it. But we posted some of the rationale, the ideas around it. It's a slightly different way to produce and consume blockchain data, and there's a list of, I think, fair points that have excited those who have read it.
G
There are also increases in linear indexing performance, but I don't know if it's appropriate, with the sort of people we have here, to get into the details. The reasons for that: we have flat files, unpacked in some threads, so the data is always ready to be processed and indexed in memory when it arrives. So it's very fast; it's a fast way to push data down to the indexer instead of the indexer going to fetch it.
G
And so it reduces the risk of non-deterministic output, because you get fed with data and the data itself is deterministic, and there's much less network round-trip when we use such an approach.
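A rough way to picture the difference the push-based pattern makes, sketched in Python; the `rpc` and `firehose` objects are hypothetical stand-ins, not real client libraries.

```python
from typing import Iterator

def poll_for_blocks(rpc, start: int) -> Iterator[dict]:
    """Pull model: the indexer repeatedly asks a node for each block,
    paying a network round-trip (and possible retries) per request."""
    height = start
    while True:
        block = rpc.get_block(height)   # hypothetical RPC client call
        if block is None:
            continue                    # block not available yet, ask again
        yield block
        height += 1

def stream_blocks(firehose) -> Iterator[dict]:
    """Push model (Firehose-style): flat files of pre-extracted, deterministic
    block data are streamed to the consumer, ready to index on arrival."""
    for block in firehose:              # hypothetical stream of decoded blocks
        yield block
```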
G
Yesterday I had a great conversation with some of you guys, understanding some of the reasons why the speed is important, like very crucial for us to get there. An example I heard was that SushiSwap was taking so much time to index that at some point it was risky to even try, because it couldn't fit within the 28-day window, so it changes the dynamics there. So by all means,
G
if I'm all wrong on that, please correct me; come to me and make sure we fit all these things together so we work in collaboration. But that's what I understood yesterday: having that be really fast is actually important, even in the short term, so that will influence a little bit of our short-term focus.
G
So, performance improvements: for that, first of all, we're going to be open-sourcing all of that stack. Some of it is not yet open, so we're going to make sure it's easily consumable, and I want to work with some of the indexers so that you guys run Firehoses and get an understanding of what it is.
G
I think Kuhn was telling us that the BSC archive node was close to 10 terabytes on NVMe, getting closer to the limits of what he can have on NVMe, and that the packed version of the data, which we use for indexing when we do the PancakeSwap thing, is close to 1.2 terabytes. So it's much smaller, and also much less cost, because it's just on storage, and if you don't read those files you can put them on cold storage, on icy storage, on low-Kelvin storage.
G
I don't know, these cloud providers have funny names for it, right. So we're going to bring some of that to you as soon as possible. My goal is to get feedback from you guys as soon as possible, so when you get the Firehose, if you could try it and give us some feedback, see what the impact is.
G
Then we can discuss together what the true impact is going to be and decide what's the best way forward. And because I think the people part is going to be the hardest, I want to make sure all the tech is pushed out and you guys can try it, and we'll see what sticks and what doesn't stick, what people want and don't want, and what the fastest path to speeding things up is. We've also been discussing with a few of you guys the improvements that were demonstrated with PancakeSwap.
G
Does that make sense? Is that appropriate as a first core dev thing?
A
Very appropriate.
G
Yes, thank you, I appreciate your support there.
A
Of
course,
and
if
there's
anything
that
we
can
help
with
just
to
to
connect
with
folks
that
you
want
to
get
feedback
on
reach
out
to
me,
I'm
more
than
happy
to
facilitate
that.
If
we
want
to,
you
know,
maybe
schedule
a
workshop
session,
you
know
where
you
can
have
deeper
discussions
around
that
you
know.
We've
got
different
ways
where
we
can
get
feedback.
H
Thanks, Oliver. Hi everyone, it's great to be here today. I'm Sam Green, the CTO of Semiotic AI, and I'm joined by Matt Dibel, who is a research scientist at Semiotic, and today we're going to be talking about automated prosocial negotiations in The Graph protocol.
H
Okay, so reinforcement learning is a branch of artificial intelligence that is focused on making automated decisions in uncertain environments, and The Graph has many opportunities where we can apply RL. For example, we can build tools for indexers to automatically select prices which lead to the best long-term, profitable outcome for the indexers.
H
We can also build RL agent-based tools for indexers to automatically optimize their infrastructure, again leading to long-term profits, and we can build tools for consumers to find high-quality indexers and pick query prices that will lead to their lowest cost.
H
So why this care for prosocial behavior in our RL agents? Well, we certainly don't want to be deploying agents that are maximally extractive at the expense of the health of the protocol, and on the other hand, we need robust behavior to give our agents the ability to defend against adversaries that may exist inside the protocol.
I
All right, so as part of this reinforcement learning effort, we wanted to do a deeper investigation into how we can actually get the behaviors we want in the agents that we train. To do that, we took a look at the paper "Prosocial or Selfish", which explored a sort of generalized negotiation format where two deep neural network agents negotiate over the inclusion or exclusion of six different clauses.
I
At the beginning of one of these negotiations, each agent is given a randomly generated utility vector with six numbers. Some of them will be negative, denoting that the agent would actually get punished if that clause were included in the final deal, and some of them are positive, meaning the agent would benefit from that clause being included in the final deal.
I
It's important to note that both of these are randomly generated for every single negotiation. Sometimes the utility vectors will be very similar, there will be a lot of common ground between the agents, and it will be a lot easier for them to come to a good agreement; and sometimes they'll be very different, in which case it will be very hard for them to come to an agreement that works for both of them.
I
So in this case agent A sent over offer 0, in which it wants to include clause 4 and clause 6. Agent B can then look at that offer and propose a counteroffer; in this case, agent B proposed that clause 2 and clause 4 be included in the final deal. Once again, agent A can look at that and propose a counteroffer. If at any point agent A or agent B proposes the same offer that they were just given,
I
then that is an agreement, the negotiation ends, and the final contract is the offer that they agreed on. If, however, they continue going back and forth and don't reach an agreement, eventually they hit a deadline that we imposed, which in this case is when they have each made 15 offers.
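As a concrete reading of the setup just described, here is a small sketch of the environment's bookkeeping; the uniform random utilities and helper names are assumptions for illustration, not the paper's exact implementation.

```python
import numpy as np

N_CLAUSES, DEADLINE = 6, 15   # six negotiable clauses, 15 offers per agent
rng = np.random.default_rng(0)

# Each agent gets a random utility vector: negative entries are clauses that
# hurt it, positive entries are clauses it benefits from.
utility_a = rng.uniform(-1.0, 1.0, N_CLAUSES)
utility_b = rng.uniform(-1.0, 1.0, N_CLAUSES)

def payoff(offer: np.ndarray, utilities: np.ndarray) -> float:
    """An offer is a 0/1 vector over the clauses; an agent's payoff is the
    sum of its utilities over the clauses included in the offer."""
    return float(offer @ utilities)

def is_agreement(offer: np.ndarray, previous_offer) -> bool:
    """The negotiation ends when one side repeats the offer it was just given."""
    return previous_offer is not None and np.array_equal(offer, previous_offer)
```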
I
Now for the exciting part: the reward functions. This is actually the only thing we need to shape these agents and get very different behaviors. The paper set out two different reward functions that are fairly similar but produce very different behaviors.
I
The first one is a selfish reward function. This one is simply the utility that an agent is getting out of the deal: if a deal is made, the reward is the total utility that the agent gets out of the deal, and if no deal is made, then a small punishment is given to disincentivize stalemates.
I
The prosocial reward function on the right here is very similar, with the only change being that a prosocial agent only gets rewarded for the utility it's getting out of the deal if the deal is optimal. So a prosocial agent still wants to maximize its own benefit, but it will not do so at the cost of the other participants in the environment, which, as you can imagine, is very desirable in a network whose overall health we want to maximize.
I
The first one we proposed is the total reward function, which is the total reward of all participants in the network. In this bilateral negotiation case, the total reward for an agent is the selfish reward of that agent plus the selfish reward of its opponent.
I
We also explored an adversarial reward function, which, as you can see, is the selfish reward for an agent minus the selfish reward of all other agents. So this agent is trying to maximize its own gain while actively trying to minimize all other agents' gains.
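Putting the four reward functions side by side as code makes the differences easier to see; this is a minimal sketch of the definitions as described in the talk, with an illustrative no-deal penalty rather than the paper's exact constant.

```python
NO_DEAL_PENALTY = -0.1  # small punishment for stalemates (illustrative value)

def selfish_reward(own, opp, deal_made, deal_is_optimal):
    """Reward = the agent's own utility from the deal."""
    return own if deal_made else NO_DEAL_PENALTY

def prosocial_reward(own, opp, deal_made, deal_is_optimal):
    """Own utility, but only credited when the agreed deal is optimal."""
    if not deal_made:
        return NO_DEAL_PENALTY
    return own if deal_is_optimal else 0.0

def total_reward(own, opp, deal_made, deal_is_optimal):
    """Combined welfare of both negotiators."""
    return (own + opp) if deal_made else NO_DEAL_PENALTY

def adversarial_reward(own, opp, deal_made, deal_is_optimal):
    """Own gain minus everyone else's gain."""
    return (own - opp) if deal_made else NO_DEAL_PENALTY
```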
I
As you can imagine, if this adversarial agent is successful in maximizing this reward function, it means it is extracting maximal value from the protocol while leaving others with nothing, or even worse off than they started, which is obviously not what we want at all.
I
I also show the optimality percentage, so the percentage of deals that are optimal; the agreement percentage, which is the percentage of negotiations that actually end in an agreement and don't end with no deal being made; as well as the average dialogue length, which can be interpreted as sort of how agreeable they are, how long they take to come to a deal.
I
On the right I actually have a visualization of this negotiation. It graphs the total utility that each agent is getting in the negotiation over time; at the top you will see the current offer highlighted in the color of the agent proposing it, and on the left and right you will see the utility vectors that have been assigned to each agent, as well as the proportion of the maximum utility they could get that they are currently getting out of the current offer.
I
Now, if I play this video, you can see the back and forth as the agents take turns proposing offers to each other. In all of these visualizations you will see a very jagged result, because the agents are taking turns proposing offers that are better for themselves.
I
One important thing to note here is that the selfish agent is reaching much higher peaks all along, because it only cares about itself; it's trying to make offers that will maximize its own gain, while the prosocial agent is making offers that are perhaps not the best for itself but actually reach an optimal deal for both of them, and you can see that in this particular case at time step 20.
I
This one is very interesting. You can see in this animation that they are a lot less agreeable; actually, for most of the beginning they are just going back and forth, proposing the same set of offers to each other. It's not until the very end, when there is about to be a deadline, that they actually start trying to work with each other, and eventually, on the last time step, they do come up with a deal they can make.
I
These statistics here are somewhat interesting. You may expect that two selfish agents would not be able to agree much, but actually, when they are trained against each other, they learn that if they don't compromise they're not going to get their selfish reward, because no agreement will be made. So when you train these two selfish agents against each other, they do learn to compromise a bit, and as a result you actually get a pretty decent optimality rate and agreement rate.
I
If you watch this sample negotiation here, this one is actually one of the longer ones; usually they end after only a few time steps, because these two agents are able to come to an agreement very quickly on what they want, and you can see in this case that they did come to a good agreement for both of them, as they're both getting positive utility.
I
As we said before, these total agents are completely selfless and just want to maximize the total benefit of everyone within the network, and as a result, when you match two of them up, you get pretty much a 100 percent agreement rate; they always come to a deal. You also get the highest optimality rate we've seen, correspondingly the highest combined average score as well, and the shortest average dialogue length, which shows that they're even more agreeable than the two prosocial agents.
I
If we play one of these sample negotiations, it's very quick. You can see, interestingly, that it ended with the first total agent getting no utility out of the deal, but it doesn't care, because the other agent got a ton of utility out of the deal, so it's still very happy: overall, the network really benefited from this negotiation.
I
The picture is a little less rosy when we compare this total agent against an adversarial agent. If you look at the statistics here, it's pretty terrible: the total agent is only getting 0.02 on average, while the adversarial agent is getting 0.88.
I
It's not until the very end, when there's a deadline, that they actually try to find some common ground, and in this case you can see that the adversarial agent gets a huge utility and the total agent gets a tiny utility, once again illustrating that the total agent can really get taken advantage of by this adversarial agent.
I
Lastly, I want to look at a prosocial agent versus an adversarial agent. As I highlighted before, the prosocial agent is trying to be prosocial, but it's not selfless; it does care about maximizing its own utility as well as the overall health of the network, and you can see that in these statistics.
I
The prosocial agent is still losing out overall, but it's much less extreme than the total agent versus the adversarial agent, and you can also see, when we play this negotiation, that there's a similar sort of stalemate, but they start trying to negotiate with each other much quicker, and in this case they're actually able to reach an agreement where both benefit, actually both benefit quite a bit, which is a good illustration of some of the strengths and weaknesses of these prosocial and total reward functions.
I
So that was an overview of how these different reward functions can create different behaviors in this abstract negotiation setup. Moving back to applications within The Graph protocol, we can imagine a few different applications of these reward functions. As you all probably know, right now the gateway acts as an interface between consumers and indexers, and we can imagine a world in which we train that gateway via the total reward function to maximize the total utility received by all consumers and indexers.
I
We can also look forward to a future where consumers and indexers negotiate directly with each other, and in that scenario we can train agents for a consumer and an indexer to negotiate directly with each other on the price of a query, or even on a long-term service-level agreement, a long-term relationship for serving a type of query.
I
Those are just a few examples. Overall, we see a ton of applications of reinforcement learning and, more specifically, these goal-driven reward functions. At the end of the day, we think a lot of these tools will have a very large impact on the overall health, and the profitability for everyone, of The Graph protocol.
I
I think we have a little bit of time for questions now. If we don't get to something, or there's a more in-depth question, feel free to reach out to me or Sam at our emails, which are listed at the bottom there.
A
Thank you. We do have a couple of questions. That was a great presentation, thank you for that, Matt and Sam. Now the question went away; here we go, I have to pull up the chat.
I
No, the utility vectors just define how each agent values each clause if it is included in the final deal. If the deadline is reached with no deal, then each agent just gets a flat, small punishment, to sort of disincentivize that wasting of negotiation resources.
I
Sure, so we've already done a little bit of work with the ISA, formulating it as an RL environment, or an RL agent, so moving forward with that would be quite interesting. I do think there's a lot of potential in training that with this sort of total-good-of-the-environment reward, because the gateway should be impartial, and maximizing the total good of the environment is a little harder to take advantage of in that scenario. The sort of direct negotiations are also quite interesting, because that moves further towards decentralization.
I
So if we can facilitate that with the same framework, that would also be awesome in the future.
E
I think, to Jonah's point, one utility function that we've discussed, which I think came up in the office hours last week, was basically valuing the decentralization of the network, so that could be an example of a prosocial utility function that one of these agents is trained on. And more specifically, on his point, one of the selection criteria that had been considered for the ISA was what we were calling negotiation efficiency.
A
Any other questions anyone has, please post them in the chat. Anyone who's here on the panel, feel free to ask any other questions you might have.
I
Yeah, so in practice, instead of making this sort of a hard, either-or cutoff, we may want to make it a more continuous incentivization for shorter dialogue lengths.
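One way to picture that continuous alternative, as a hypothetical tweak to any of the reward functions sketched above (the penalty value is made up):

```python
def with_length_penalty(base_reward: float, rounds_used: int,
                        penalty_per_round: float = 0.01) -> float:
    """Instead of a hard deadline, gently discount the reward as the
    dialogue grows longer, nudging agents toward shorter negotiations."""
    return base_reward - penalty_per_round * rounds_used
```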
A
Very good, thank you again, Matt and Sam. I don't see any more questions at this point, and also no other questions for the call itself outside of Semiotic, so we can actually leave a little early today, nine minutes left, and give you back the time. Thank you, everyone, for joining! You are going to see the notes posted here in the next couple of days, and we'll try to be as fast as we can with posting the video on YouTube as well. Thank you all for joining.