Description
This time the ad-hoc topic is the NebulaGraph cache design, for now and the future, by Wen Hao, our contributor in Storage.
A: Today's ad-hoc topic will be shared by windhawk, our community contributor. She will share the cache design, for now and the future, in NebulaGraph.
A: One thing to mention is that we are actually changing the frequency of the community meeting: I will make it every four weeks instead. We will see how it goes at this new frequency, and if you have any suggestions or new ideas, just let us know.
A: You can find everything in this shared pad or in our community repo, nebula-community, under the vesoft GitHub organization. We don't have new members to introduce today. We will have this meeting every four weeks; we will go through specific topics that anyone would like to bring, and have open discussion in this stage. Anyone who would like to bring up ideas can let us know in Slack, in GitHub Discussions, or by sending emails to us.
A: In the last four weeks we have had a bunch of updates touching the ecosystem. The first one is that our contributor helped create a MyBatis integration for NebulaGraph, and the repo is under the nebula-contrib organization. This is the link, so if you are interested, just feel free to check out this repo. We also received some contributions regarding the Flink connector; one of the big things is getting started on supporting Flink SQL, by Spike Liu and liuxiaocs7.
A: They are working together in this domain, and there is actually one PR merged this week. With the help of this work, we now support Flink up to 1.14. The other one is that this week we just merged a PR to help users leverage PySpark with the nebula-spark-connector; that one was contributed by me. Then I will briefly preview some of the contents of the 3.2.0 release.
A: On this page they are all enhancements. The first one is that we actually revisited our default configuration values: we added a bunch more to the default configuration, so that users don't have to dig into the system to figure out some of the configurations, and we changed some of the default values to ones that make more sense. You can also see that we made a bunch of optimizations on specific operators in the query engine.
A: There is also newly added syntax support, which is the extract() function in the MATCH query, so you can do some regular-expression work with the help of this function.
A: The final one is that we are optimizing memory allocation with an arena allocator, which is another improvement in performance. And you can see there are a bunch of other updates that I will not dive into here. Then I want to bring in windhawk, our community contributor, for the sharing regarding the NebulaGraph cache.
B: In NebulaGraph we use an adjacency list to represent a graph, because an adjacency list is good for getting the neighbors of a vertex.
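As an illustrative sketch (plain Python, not NebulaGraph's actual C++ storage code), an adjacency list makes "get the neighbors of a vertex" a single map lookup:

```python
# Minimal adjacency-list sketch: each vertex maps to the list of its
# out-neighbors, so a neighbor query is one dictionary lookup.
graph = {
    "v1": ["v2", "v3"],
    "v2": ["v3"],
    "v3": [],
}

def neighbors(vid):
    # O(1) to locate the list, O(degree) to return it
    return graph.get(vid, [])

print(neighbors("v1"))  # ['v2', 'v3']
```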
B: Now let me briefly discuss the issues that we are going to solve with the NebulaGraph cache. Actually, the issues that we are going to address come from our findings on graph-database storage access patterns. The first finding is that vertices in a graph database usually have low spatial locality. Let's take the example of a simple query: get the neighbors of a vertex.
B: After retrieving the edges, basically the keys of the edges, we can easily get the destination IDs of these edges, which point to the neighboring vertices. And we know that in NebulaGraph we use hashing to partition the vertices. That means the neighboring vertices and the source vertex may or may not end up in the same partition.
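A toy sketch of that point (the partition count and the hash here are illustrative, not NebulaGraph's actual scheme): because vertex IDs are hashed into partitions, a vertex and its neighbors usually scatter across different partitions.

```python
# Hash-partitioning sketch: vertex IDs are assigned to partitions by a
# hash function, so neighboring vertices scatter across partitions.
NUM_PARTS = 4

def partition_of(vid: str) -> int:
    # stable toy hash; NebulaGraph uses its own hash over the vertex ID
    return sum(vid.encode()) % NUM_PARTS

src = "v1"
neighbors = ["v2", "v3", "v4", "v5"]
parts = {v: partition_of(v) for v in [src] + neighbors}
# The neighbors typically do not all share the source's partition.
```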
B: And that means they may not reside on the same storage host. This process continues if we go on to retrieve the properties of neighbors which are more than one hop away. So essentially, in the diagram here we have a lot of vertices, and because we are using a hash function to partition them, these vertices may end up on different storage hosts.
B: What this brings about is that retrieving the properties of multiple vertices will usually require accessing multiple partitions. And if we want to traverse the graph and retrieve the properties of vertices which are n hops away, with n greater than one, then the number of random vertex accesses will increase exponentially with the number of hops.
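A quick back-of-the-envelope sketch of that growth (assuming, purely for illustration, a uniform average out-degree d): an n-hop traversal touches on the order of d + d² + … + dⁿ vertices.

```python
# Random vertex accesses grow exponentially with hop count.
# avg_degree is an illustrative uniform average out-degree.
def accesses(avg_degree: int, hops: int) -> int:
    # vertices reached within n hops: d + d^2 + ... + d^n
    return sum(avg_degree ** k for k in range(1, hops + 1))

print(accesses(10, 1))  # 10
print(accesses(10, 2))  # 110
print(accesses(10, 3))  # 1110
```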
B: Therefore the vertices will usually have low spatial locality. And we know that in RocksDB we use the block cache to provide some of the caching capability, so that means the vertices in the block cache in RocksDB have low spatial locality. The second key finding is about empty-key accesses. We know that in a graph database the data is schema-less, which means the schema of the data in a graph database is not fixed, and the way this is achieved in Nebula is by using tags.
B: So let's look at an example here. We have a vertex and we have three tags: person, student, and athlete. The vertex can be associated with one or multiple tags, so a vertex can be a person, can be a student, can be an athlete, or any combination of these three. By this means, vertices in the graph database can have different schemas.
B: Now assume we have a query like this: it means retrieving the properties of vertices which have the tag person, but returning the properties of such a vertex with all possible tags, and we already know that a vertex may be associated with one or multiple tags. The way we accomplish this in NebulaGraph is by concatenating the vertex ID with all the possible tag IDs, constructing all the candidate vertex keys, and then trying to retrieve the properties from RocksDB with all the possible keys. If a key doesn't exist in RocksDB, then we know that the vertex is not associated with that particular tag ID.
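A toy sketch of that probing pattern (the key layout here is simplified; Nebula's real vertex key also encodes the partition ID and more): candidate keys are built as (vertex ID, tag ID), and every miss is an empty lookup that only tells us the vertex lacks that tag.

```python
# Sketch of vertex-key probing: the store keeps one key per (vertex, tag)
# pair, and a lookup with an unknown tag set must try every candidate key.
store = {
    ("v1", "person"): {"name": "Alice"},
    ("v1", "student"): {"school": "X"},
}
all_tags = ["person", "student", "athlete"]

def fetch_all_tags(vid):
    hits, empty_lookups = {}, 0
    for tag in all_tags:
        props = store.get((vid, tag))   # one point lookup per candidate key
        if props is None:
            empty_lookups += 1          # key absent: vertex lacks this tag
        else:
            hits[tag] = props
    return hits, empty_lookups

hits, misses = fetch_all_tags("v1")
# "athlete" is probed but absent, producing one empty lookup
```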
B: So what this brings about is that it causes a lot of empty accesses in RocksDB. These are actually the two key findings that we came across in Nebula's storage access patterns.
B: How we improve on this is by designing the Nebula storage cache. This is the architecture of the Nebula storage cache: it has a component inside the RocksDB part and components outside the RocksDB part. Inside RocksDB, we still provision a block cache, which is essential for things like filter blocks and index blocks. The block cache can also hold edges, because edges usually have better locality than vertices in a graph database.
B: Outside RocksDB, we provision a cache space using CacheLib, and we further divide the cache space into two pools: one is the existing-key cache pool and the other is the empty-key cache pool. The existing-key cache pool is mainly used to store the keys and properties which exist in RocksDB, and the empty-key cache pool is mainly used to cache the empty keys, which were queried but do not exist in RocksDB.
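A minimal sketch of the two-pool read path (illustrative Python; the real implementation sits in the C++ storage engine on top of CacheLib): the empty-key pool acts as a negative cache, so repeated lookups for absent keys skip RocksDB entirely.

```python
# Two-pool storage-cache sketch: hits are served from the existing-key
# pool, known-absent keys from the empty-key pool; only cold keys fall
# through to the backing store (a dict stands in for RocksDB here).
existing_pool, empty_pool = {}, set()
rocksdb = {"k1": "props1"}
backend_reads = 0

def get(key):
    global backend_reads
    if key in existing_pool:          # positive cache hit
        return existing_pool[key]
    if key in empty_pool:             # negative cache hit: known absent
        return None
    backend_reads += 1                # cold: go to the backing store
    value = rocksdb.get(key)
    if value is None:
        empty_pool.add(key)           # remember the miss
    else:
        existing_pool[key] = value
    return value

get("k1"); get("k2")                  # two cold reads hit the backend
get("k1"); get("k2")                  # both now served from the pools
```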
B: And this is the configuration of the storage cache, so let me briefly talk about the options one by one. This is the main switch: enable_storage_cache is the main switch for the storage cache, and this is the total capacity that we allocate to the storage cache. Pay attention here that the block cache size is outside this section; it is an existing configuration option in our configuration file.
B: So the configuration here only manages the storage-cache space, the part which we implement using CacheLib. And the next configuration here is a very important one, which is very sensitive to performance.
B: It requires you to put in the estimated number of cache entries on this storage node, as a base-2 logarithm. If the number you put here is too low, meaning it is much lower than the actual number of entries on the storage node, you may suffer from low performance. Then these two sections are the configurations for the existing-key cache pool and the empty-key cache pool, respectively.
B: The first section has the switch for the existing-key (vertex) cache pool, the capacity for that pool, and the TTL for the items in it. The second one manages the empty-key cache pool: the switch, the capacity, and the TTL as well.
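Putting the options just described together, a hedged sketch of what the storaged configuration section could look like (the flag names and values below are illustrative, patterned on the description above; check the release's own nebula-storaged.conf for the exact names):

```
# Main switch and total CacheLib-managed capacity (MB) for the storage cache
--enable_storage_cache=true
--storage_cache_capacity=1024
# Estimated number of cache entries on this node, as a base-2 logarithm;
# setting this far below the real entry count hurts performance
--storage_cache_entries_power=20
# Existing-key (vertex) pool: switch, capacity (MB), item TTL (seconds)
--enable_vertex_pool=true
--vertex_pool_capacity=512
--vertex_item_ttl=300
# Empty-key pool: switch, capacity (MB), item TTL (seconds)
--enable_empty_key_pool=true
--empty_key_pool_capacity=256
--empty_key_item_ttl=300
```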
B: So here is the performance improvement that we can achieve by using the Nebula cache. First, with this GO one-step query with a tag, which means we explicitly specify which tag we are going to access, there will be no empty keys when running this query, so we can achieve a 20% latency decrease directly with the existing-key cache pool.
B: Similarly, if we run this fetch of neighbors' properties with a given tag, again the tag is specified, so there will be no empty keys and we only provision the existing-key cache pool; we can achieve a 16% latency decrease. The next two queries try to access the data with all the possible tags.
B: So there will be a lot of empty keys. For GO one step, if we provision both the empty-key cache pool and the existing-key cache pool, we can achieve a 49% latency decrease and a 77% QPS increase, and for GO two steps we can achieve an even larger latency decrease and QPS increase. This is because it is more than one hop: as I discussed earlier, if we have more than one hop, the number of random accesses for vertices increases exponentially.
B: So the more steps you have, the higher the potential performance improvement you can achieve with the Nebula cache.
B: Okay, let me briefly talk about our future projects around the Nebula cache.
B: In our next few projects, we are going to provide in-memory caching in the cloud to improve performance, and also a local storage cache to improve performance for deployments that put data in object storage in the public cloud. We are also going to provide caches for other layers in the Nebula architecture; for example, we can provide a cache for the query result, and we are also going to provide a cache for the graph structure.
A: Well, thank you so much for the excellent sharing. This is actually the first time that we have tried to invite a contributor to share with the community on different domains of NebulaGraph.