From YouTube: How we test NebulaGraph
Description
Jingchun Xia, Head of QA & DE, talked about how we test NebulaGraph this time.
Intro

A: We'll introduce ourselves first, then we'll come to the project updates. In that section today we'll have a topic brought by Jingchun, and he will walk through a whole view of how we do testing, across multiple layers and from multiple perspectives, in the NebulaGraph team. Finally, we'll have the open floor. You always see me in these meetings, and I see John is on the line, so John, could you introduce yourself? Later Jingchun will introduce himself with his slides.

A: Thanks. Okay, so I will quickly go through the project updates before today's topic.
A: The first one is that we finally pushed the field-filtering condition down to the storage side, so if you are interested, you can check out the PR. We also brought the UNWIND semantic to native nGQL: you can unwind a list — that is, transpose it into rows — and, combined with the pipeline, you can take every item of the list and query it in the next step after the pipe. I recommend checking that PR as well; there are examples and tests both in the code and in the PR description.
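A rough illustration of the UNWIND semantic mentioned above, assuming the nebula3-python client; the statement below is a generic example, not one taken from the PR:

```python
from nebula3.Config import Config
from nebula3.gclient.net import ConnectionPool

# Connect to a graphd endpoint (address and credentials are placeholders).
pool = ConnectionPool()
pool.init([("127.0.0.1", 9669)], Config())
session = pool.get_session("root", "nebula")

# UNWIND expands a list into one row per element, which later pipeline
# stages (or a RETURN) can then consume.
result = session.execute("UNWIND [1, 2, 3] AS n RETURN n * 10 AS x")
print(result)  # three rows: 10, 20, 30

session.release()
pool.close()
```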
A: Those are the other major changes in the last month.

A: Maybe after my part, before the next topic, could you introduce yourself to the community, Goran? Is that okay?

A: Okay, cool, thank you. And yeah, we had a rather high-priority crash issue, and it is fixed in the new release and on master.

A: Apart from the core, one piece of news is that in the last few days we announced another front-end infrastructure project to be open-sourced. It's called NebulaGraph VEditor; it's a library that we use underneath to build our proprietary offering, NebulaGraph Explorer. What this project does is render graph diagrams and related things — I'm not an expert on the front end.
A: If you're interested, just check out the repository. Our Studio is now at 3.4.1, and the major change is the connection and session handling in the back end of Studio. The two branches of Dashboard have also been upgraded with new releases; the v3 branch comes with more newly introduced metrics that can be managed and shown in the dashboard.

A: And here are some of the pictures: we invited many of our friends-and-family members, and we had quite a good party. I want to quickly go through some of the projects from it, so maybe in the future we can put them on the main branch. The first one is about the NebulaGraph academy.
A: Not all the teams are R&D teams. One of them will introduce a LinkedIn badge system for all the NebulaGraph-related certifications, so please stay tuned: we will have a couple of certifications released, there will be some courses, and afterwards you get a LinkedIn badge you can share on your profile. Another one was created by one of the contributors on our test team, who shared what she had made.

A: Another thing — I was involved in that team — is some talks and integrations between NebulaGraph and DGL, the Deep Graph Library, which is pretty cool. And another one is called NebulaGraph Merak; basically, they implemented vector search in NebulaGraph.

A: That code is not open source yet, so I will check with the team to see whether they can put it somewhere public — stay tuned. And that's most of my part, before we move to the session on how we test NebulaGraph.
C: Yeah, sure. My name is Goran Sujanovic; I work at ReversingLabs in Croatia. I'm a senior software architect, and for almost two years now I've been experimenting with graph infrastructure in our company. We put NebulaGraph 1.2.1 into production, and it's operational and running with a huge amount of our data.

C: It's about 60 billion nodes and 350 billion edges covering our use cases. Currently we've started testing Nebula 3.2 and preparing a new production cluster for it. The testing cluster is operational, synchronizing data, and everything works fine. I had some issues with importing the historical data, but I'll try to resolve that and post a report on the community pages somewhere.

C: We had some discussion about it recently on Slack, so you're informed about that.
A: Yeah, and I'm really looking forward to the report you'll share.

C: Yeah, yeah, we'll see. Sorry, one question — sorry for interrupting your presentation — but are there any plans for a new point release of 3.2 in the near future?

C: Yeah, I mean 3.2.1 or something — a new patch version. I saw that there are some improvements and some crash fixes, so I suppose those patches will be ready in the near future?

A: Yes — in case there are major or severe issues, we will have the dot releases, the minor hotfixes. We will do that when needed.

A: Yeah, in the long run the every-three-months release cycle is fixed, but when needed we will ship those minor releases.
A: And Goran is an old friend of the community — a NebulaGraph hacker, I would say. He always answers tricky questions on Slack, and he also took part in our last hackathon and made a super awesome project related to security.

A: Okay, thank you so much. I think we can finally introduce Jingchun. Could you share your screen? Yeah.

B: I hope I can actually share in Zoom.

B: Yes, yes, I don't need to be the host. Okay, sorry for that — let's start, and I hope you can see my screen.
B: You can see it? Yes. Of course, I'm still working on this, so let's just start with it. Okay, yeah. So here's the topic. My name is Jingchun, and this is my email. This is what I want to share today: starting with introducing myself, then a little bit of an overview of the testing landscape in NebulaGraph, then a dive into some of the areas, and finally some future work.

B: You can see how I'm connected to this community: I'm currently leading the QA & DE group — quality engineering and delivery — at Vesoft, which is, of course, the company behind NebulaGraph that runs this community. I've actually been in the software industry for more than 20 years already, and prior to Vesoft I spent more than 10 years working as a group leader on a product, HP ALM/QC, or Quality Center.
B: It may ring a bell if you are familiar with the testing business, because these names were once very famous in the industry. Micro Focus now owns these three products: ALM, LoadRunner for load testing, and UFT for automated functional testing — they came from HP Software through a big M&A. So I did have some experience in the testing business, and that's also why I came to this role.

B: My team is not a very big one, but we really have a lot of talented — I put the label here, "excellent" — members. I think we already mentioned one of them, the one behind the GQL fuzzer, right? That came out of the hackathon. So for all the great stuff I'm going to share, it's them who made it happen.
B: Okay, let's take a look at the testing landscape. Although I have some experience, it's mostly on the application-testing side — I've never been a QA engineer or a tester before. But when it comes to testing a graph database, the complications — things like stability testing and performance testing — already blew my mind, I have to say.

B: Take a look at this diagram from the bottom up: there are all these kinds of test types — unit tests, functional, stability, performance and, of course, security. The unit tests we mainly run in GitHub Actions, because we use GitHub a lot, and we developed our own platform called NGTP, the NebulaGraph Testing Platform, to support the automated functional testing, including some long-running LDBC SNB workloads.
B: It's also good for us to do penetration testing, mainly vulnerability scanning. From left to right is the frequency at which we run all of these tests. The leftmost is per PR: each pull request in GitHub runs the unit tests, including the Google Test suites, the pytest cases and some TCK test cases — I'll dive into the details in a minute. Then every night we run the unit tests again, plus all the other stuff like the functional testing.

B: Of course, for those test cases we use relatively small data sets to keep them fast. The weekly runs are still in progress, I have to say, but the plan is to run these cases — the stability and functional parts I showed here — against a larger, much larger data set, like one terabyte or more, to really exercise the database.
B: And for every single release candidate we run all of these tests. Of course the regression cycle is important: new features are tested along the release development, and at the end we run the regression tests, which are also automated — or will be automated — in the NGTP platform. Apart from the benchmark, we also run stress testing: we try to push the concurrency up to 10,000 or 20,000 or even more, to really test the extreme limits of a cluster.
B: For the unit tests we use Google Test, the most popular test framework for C++ developers. pytest, in Python, is something in the middle, I would say — some developers write tests with it, it's around 50 cases, so it's not that popular even inside the company, even inside Vesoft. And the last one is what we call TCK. That's something we borrowed from Neo4j — we took the good parts — and it stands for Technology Compatibility Kit. The beauty of it is that you only need to write Gherkin features.
B: If you are familiar with BDD — behavior-driven development — Gherkin is actually the most popular language for BDD. So in NebulaGraph you can write simple feature files. Like this example: you give a Background, let's say a space named "nba", and then you define the scenarios with all the test steps. You give an action, like this one that executes a GO one step from "Tim Duncan" over serve, and then you just write the expectations in the Then clause.
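A rough sketch of the kind of feature file being described — the step wording is an approximation of the TCK style rather than text copied from the real suite, held in a Python string only so that all examples here stay in one language:

```python
# Approximate shape of a TCK-style Gherkin scenario for NebulaGraph.
# The space "nba", the vertex "Tim Duncan" and the edge type "serve" come
# from the talk; the exact step phrases and the expected row are illustrative.
NBA_FEATURE = '''
Feature: Go one step over serve

  Background:
    Given a graph with space named "nba"

  Scenario: one step from Tim Duncan over serve
    When executing query:
      """
      GO 1 STEP FROM "Tim Duncan" OVER serve YIELD dst(edge) AS team
      """
    Then the result should be, in any order:
      | team    |
      | "Spurs" |
'''
```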
B: So it's quite simple — not only developers can do it; testers, our documentation team and our UI testers can too. For example, we have a product called NebulaGraph Explorer, which is Enterprise-only and of course runs a lot of queries, so to make sure it stays compatible we add those queries to the TCK test suite as well. We have documentation that introduces this, and it's open for the community to contribute to. Okay, and next is the main part.
B: NGTP stands for NebulaGraph Testing Platform. Let's start with how the tests are organized in NGTP. Every test case is a Docker image — each case is implemented as a Docker image — and a suite, a test suite, is a group of test cases. A suite defines the target version of NebulaGraph, whether that's a branch, a commit ID or a release package — this is the target to test — and it can also configure the NebulaGraph cluster you want to test, including how many metad, storaged or graphd instances you want, and you can also change the configurations, I mean the tuning side. A test plan is a group of test suites, so we can configure a test plan and run it manually or trigger it with a timer. As for how a test suite actually runs in NGTP: it's a pipeline of tasks.
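A purely hypothetical sketch of the kind of information a suite definition carries, based only on the fields mentioned in the talk — NGTP is an internal platform and its real format isn't public:

```python
# Hypothetical illustration only: field names and values are invented to
# mirror what the talk describes (target version, cluster shape, tuning
# overrides, and test cases packaged as Docker images).
suite = {
    "name": "ldbc-functional-nightly",
    "target": {"repo": "vesoft-inc/nebula", "ref": "master"},  # branch, commit ID or release package
    "cluster": {
        "metad": 3,
        "storaged": 3,
        "graphd": 2,
        "config": {"rocksdb_block_cache": "8192"},  # tuning overrides
    },
    "cases": [
        "registry.example.com/ngtp/case-tck:latest",   # each case is a Docker image
        "registry.example.com/ngtp/case-ldbc:latest",
    ],
}

# A test plan would then be a list of such suites, run manually or on a timer.
plan = {"name": "nightly", "suites": [suite], "trigger": "cron: 0 2 * * *"}
```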
B: The pipeline deploys the target according to the cluster configuration — how many metad instances and how many of the other services — and then runs a data-generation step; the data generator is packaged in a Docker image as well. For the big data sets — as I just mentioned for the weekly tests — we of course don't want to generate the data, or even import it into the cluster, every single time. So we have a very good solution: we keep some already-generated, very big data sets, let's say LDBC SF1000, which is about one terabyte with three replicas. When NGTP reaches this step, it can simply make hard links to point the NebulaGraph cluster at the already-existing data, so it finishes in seconds.
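A minimal sketch of the hard-link trick just described, assuming the pre-generated data and the new cluster's data directory live on the same filesystem (paths here are hypothetical):

```python
import os

def hard_link_tree(src_root: str, dst_root: str) -> None:
    """Recreate src_root under dst_root using hard links instead of copies.

    Hard links only add directory entries, so "copying" a ~1 TB pre-generated
    data set into a fresh storaged data directory takes seconds and uses no
    extra disk space. Both paths must be on the same filesystem.
    """
    for dirpath, _dirnames, filenames in os.walk(src_root):
        rel = os.path.relpath(dirpath, src_root)
        target_dir = os.path.join(dst_root, rel)
        os.makedirs(target_dir, exist_ok=True)
        for name in filenames:
            os.link(os.path.join(dirpath, name),
                    os.path.join(target_dir, name))

# Hypothetical paths:
# hard_link_tree("/datasets/ldbc_sf1000/storaged0", "/clusters/run-42/storaged0/data")
```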
B: Next, the pipeline runs the Docker image defined in the test suite to execute the real tests against the Nebula cluster you defined and the data set you chose. Lastly, it produces a report, and if everything is okay — I mean the tests passed — the resources are released. But when a test fails, we prefer to keep the resources so that our testers can do some checks. So, putting it all together, this is the architecture of NGTP: everything runs inside a large Kubernetes cluster.
B: Behind the scenes we have a dozen or so powerful physical servers. Starting from the front: the web UI of NGTP is developed in Python using the Django framework, and for the pipelines we use something called Tekton, which is a popular CI/CD framework for Kubernetes — and of course we can also integrate the Tekton dashboard and the Kubernetes dashboard itself.

B: So that's where all the tests run: as we saw, a test case runs inside a pod — a container — and the Nebula clusters are also running in containers. On the right-hand side, we use Graylog as our log center — this part is still in progress — to collect all the logs, both logs from the Nebula cluster and logs from the Tekton pipeline executions. We also have Prometheus and Grafana set up to monitor the Nebula cluster. So that's basically how it works.
B: One thing to mention is how we develop test cases in NGTP. There are three layers. For the top two we developed a framework called Knife; it's based on the Ginkgo test framework, which is the most popular testing framework for the Go language. The first layer is the simplest one: you can write, again, in Gherkin — we love Gherkin a lot — to test some queries in certain scenarios, especially those coming from customers, like the example I already showed you before.

B: The second layer is that you can write tests in the Go language. This is essentially pure Ginkgo, but with support for things like the graph client — sessions and connections to NebulaGraph. The Knife framework can only work with the graph service through sessions; that's how it works.

B: The last layer is what we call "native": you can write in any language to test the most complicated functionality — just give me a Docker image and a command line for how to trigger the test inside it. We developed a lot of stability tests this way, to do, let's say, scaling or disaster recovery and so on. So that's Knife. I also want to introduce the stability testing written as native cases.
B: On the left side is our chaos-engineering stack. Chaos Mesh is, of course, the most famous one in the Kubernetes world, and we use it to simulate faults like disk I/O errors; we also plan to use it to simulate some Linux kernel faults. But we found it didn't fit the NebulaGraph scenarios so well, especially in cases like network partitioning, network delay or packet loss, and also for doing load testing on the APIs in between — so we developed our own component called Nebula Ladder.
B: The name comes from the line "chaos is a ladder" — it's not a bad thing here; it's a ladder to help us climb, to make Nebula stronger and more stable. On the right side is the fuzz testing — the GQL fuzzer we already mentioned. It basically connects to a NebulaGraph instance, reads the schema, and uses the parser grammar to generate all kinds of nGQL queries. The purpose, of course, is that NebulaGraph should never crash — that's the goal of the stability testing.
yeah
and
let's
move
to
a
meter,
so
some
very
basic
concepts,
so
I
mean
it's.
You
you'll
find
you'll,
find
it
familiar
with
the
ngtp,
so
we
also
have
has
the
test
cases,
but
this
time
it's
only
only
on
June
code.
Okay
and
a
suite
contains
the
multiple
tester
case
for
sure
and
in
the
suite
you
can
specify
the
version
to
test
okay
and
also
configure
the
customer.
B
B
Then
you
can
execute
the
test
in
a
meter
given
a
certain
number
of
virtual
use,
which
is
the
actually
the
concurrent
users
right
and
the
test
results
includes
the
QPS
of
this
scenario,
the
latency
and
the
response
time,
and
this
result
can
be
saved
as
a
baseline.
You
know
for
later
use
for,
like
every
diet,
every
night
we
run
the
test,
also
compare
it
to
a
already
released
version
as
a
baseline,
so
we
will
know
whether
or
not
we
got
to
improved
or
you
know,
some
degradation
of
the
performance.
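A minimal sketch of the kind of baseline comparison being described — the thresholds, field names and report shape below are invented for illustration:

```python
# Hypothetical illustration of comparing a nightly run against a saved
# baseline release. "Higher is better" for QPS, "lower is better" for latency.
baseline = {"qps": 12000.0, "p99_ms": 45.0}   # e.g. numbers saved for a released version
nightly  = {"qps": 11000.0, "p99_ms": 52.0}   # tonight's run on master

TOLERANCE = 0.05  # flag anything more than 5% worse than the baseline

def regressed(metric: str, higher_is_better: bool) -> bool:
    base, cur = baseline[metric], nightly[metric]
    change = (cur - base) / base
    return change < -TOLERANCE if higher_is_better else change > TOLERANCE

report = {
    "qps_regressed": regressed("qps", higher_is_better=True),
    "p99_regressed": regressed("p99_ms", higher_is_better=False),
}
print(report)  # {'qps_regressed': True, 'p99_regressed': True}
```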
B: This is what it looks like from the architecture perspective. We developed a Meter web application to manage all of this — the test cases and the suites — and behind the scenes, on the back end, there is a master node that keeps all the jobs in a queue, plus multiple workers that build the Nebula binaries, deploy the cluster, import data when necessary and, of course, run the tests.

B: All the tests run on top of k6, which is also a popular load-testing tool and one we use a lot. So that's the architecture of Meter, and the purpose is to produce this kind of report. This is just one example, for a GO-one-step query yielding an edge property.
B: The test results look like this: you have the QPS, as I already mentioned, the latency — with P90 and P99 as well — and the response time, and it also gives the row size of the query result. And this part is the comparison against the baseline, so we can easily tell whether this version, v3.1, is better than the baseline or worse. So that's how it works, and again, the goal is to make NebulaGraph faster and faster.

B: Lastly, some future work we plan to do. Of course we're happy to get more and more test cases into NGTP — I believe everyone in Vesoft can contribute to it, and we also hope the community can contribute. The more cases we have, the better NebulaGraph will be. You've already noticed that Meter has some similarities with NGTP, so in the future we would like to containerize Meter and integrate it into NGTP, so that there is one single platform.
B: We also hope to have more data sets and more varied scenarios for Meter, so the results can be used as a reference beyond just the social-network benchmark. And we already have a cloud offering, our DBaaS, so in the future we can also run NGTP, or even Meter, against the cloud — NGTP is basically a cloud-native solution anyway. We've also just started on the security side, and we will put more time and more investment into that.

B: Lastly, don't forget manual testing — it always exists. We are about to launch a tool for it, a test-management tool: we'll put all our test cases into this management system, and it will be integrated with our CI system and with NGTP to collect the automated-testing results. All in one, so we have the whole picture of the quality of NebulaGraph. That's it — I hope I'm not cutting into the time.

B: I'll just stop here. Any questions — or do we still have time for that?
A: Yeah, thank you so much. I had high expectations, but I'm still surprised — amazed; it's fantastic, thank you. And with NGTP I'm surprised that we use so many state-of-the-art solutions; we didn't choose the easy way, so it's awesome. The most curious part for me is the Ladder — or however it's pronounced, I don't recall.

A: So the Ladder — from my understanding, is it a chaos tool specific to NebulaGraph, or are we doing something more general?

B: It's not open-sourced yet. I think it has the potential to be open-sourced, and it's not necessarily very specific to NebulaGraph — that's my understanding; I will check. But things like the network partitioning are areas where we added behavior that's more suitable for NebulaGraph. That's the idea.

A: Okay, and yeah, my next question would have been whether we plan to open-source it — okay, we can talk offline.
A: Another thing is: do we have any plans, at some level, to make at least part of these tools public? Do we have that concept or plan in mind, or...?

B: Yeah, a similar question — but I think it's a good one, and we had some discussions about it before. You know we have the Community Edition of NebulaGraph, but we also have the Enterprise Edition, and at this point the platforms and tools are mainly for the Enterprise side. Of course we run the tests against the Community Edition as well, every day, but it may rely on some features that are not available to the community.
B: I mean in the code — the testing code — as well. So we can of course try to work out how to make it partially available, for the Community Edition only; in that way, maybe in the future we will open-source it as well. But right now it's not in the plan.

A: Got it, got it — and it makes sense. It's really good to know that behind the scenes there's a Superman team maintaining everything.

C: Yeah, I also have one question, about the tools for importing data. Do you test the tools you have in your repositories — for example, Nebula Exchange or Nebula Importer?
B: That's a very good question. I forgot to mention one thing: all these slides are mainly about the core, not the tools. So I guess you mean testing the functionality of Importer?

C: Or Exchange, yeah — though not exactly that. When you have a new version, for example, people usually use those tools to import their data first.

C: As for us — we are not using any particular platform for scaling; we just use plain Go, importing data from files. It's much simpler to scale that way, because we have enough machines, and with Docker or any other virtualization you can spin up as many workers as you need to load data — if you have shared storage or something, it's very easy. So that's the way we are using it, and usually when I test a new version or anything, I'm trying to make it more stable or to find the limits.
C: I run the existing tools that come with the specific platform — for example, for Nebula I'm using Nebula Importer, and it's a very good tool; it has worked for us, it's open source, and it works with the open-source version.

C: So it's a good fit, and I think it would be a good addition to the testing scenarios to run some tests through those tools and see how they behave. Usually I'm trying to see how fast I can load some dump from another database or something like that, and to find the limits for whatever cluster size I've set up for Nebula.

C: Currently we have a four-node testing cluster on commodity hardware — though it's actually very good hardware: 64-core machines with half a terabyte of RAM and a few SSDs each, maybe six of them, with a few terabytes — no, a few thousand terabytes — of disk, and our production system has almost 300 terabytes of NVMe disks. I'm trying to see how much power I can squeeze out of that hardware with NebulaGraph at the back end. Usually I can get, for example, 100K inserts per second on the four-node cluster.

C: So that's about 25,000, maybe 30,000 inserts per second per node, and that's actually a bit at the limit already. It should be possible to stretch it a little more, because utilization of the NVMe disks is very low.
B: So you care not only about the functionality, right — not just whether it works.

C: Yeah, performance is a crucial part here, because when you have a lot of data it should be performant, within the limits of current technology — and Nebula is a good performer here; it works okay. We have, for example, deep graphs of more than 50 levels, so scanning one, two, three levels is easy to reach and it is performant.

C: But when you go deep into the graph it can be a little slower, and we also have hotspot nodes — millions of nodes connected with millions of edges between some nodes. That's usual when you have attributes or properties with text values that are common across the graph: when you filter on them, you get hotspots in the system.
C: And yeah, it works fine and it's stable, but I would like to see more performance in write speed — the throughput, the number of records that can be inserted — because I think it can be a bit faster or improved there. Or maybe we're just not tuning it right: it is hard to tune the system, in the end, for a production or a testing environment. We need to tune parameters at the OS level, or some parameters of the back-end database it uses, so I think that can also be improved. We played with some of the RocksDB parameters on the back-end side, and it would help if this were shown in a transparent way — what the limits are and what works best, whether you test parameter sets that are good for large systems, for small systems, for huge graphs with a lot of nodes, and so on.
B: Regarding the data-importing performance — maybe you can reach me in the Slack channel and we can have some discussion. I'm also leading the delivery team, and we use Importer and Exchange a lot in our customer testing, sometimes for proofs of concept and so on, so we also try to squeeze the performance as far as it will go, because we need to compete with other competitors. So maybe we...

C: Yeah, it's very important, because you are integrating the graph database platform with the existing database systems you have in a company — it's normal to integrate it in some way — and you will also have historical data to load, a lot of it. Currently, we needed to load 350 billion edges into a NebulaGraph system, and it took 40 days to load it.
C: Forty days, yes. So it is a huge amount of data, and you can parallelize a lot, but at the end we started to hit timeouts, which introduced problems and dropped the throughput. So I pushed it to the limit and stayed there for 40 days — that was the approach. Okay, maybe there is some better way to do it, so I'll be glad to chat with you about it on Slack.

B: Awesome, yeah.

A: Goran's question brings me to another question. When you were introducing NGTP, you mentioned the data-gen step.
A: Instead of going through the write path, you generate the underlying files and make a hard link — that is awesome. I celebrated it in my mind, then forgot about it, and now it comes back to me. So you actually did it — and is it on local storage, or are you using some centralized storage where you just construct how the files are laid out underneath and mount it? Are you using the Kubernetes operator, or your own deployment on Kubernetes?
B: No, we don't use the operator yet. The story is that we import the data up front, so we basically have the meta information for the spaces — actually there is more than one space in this big data set. When you create a new cluster, I think today it needs to have the same topology as the one that originally imported the data, so there are some limitations. It's essentially like migrating a cluster from one place to another, but using hard links — the Linux hard-link way. And thanks to the way RocksDB works, those SST files, if you're familiar with them, are effectively read-only: if a test then writes to the database, it just creates new SST files instead of writing into the existing ones. That's the idea.
A: Awesome — so you have some sort of copy-on-write. So you could have one set of files mounted to different nodes at the same time?

B: To different nodes — yeah, that would also be possible, although we haven't done it before. It requires a lot of disk space, but in theory it's possible: from a central storage you could have links to different clusters, and the different clusters would each write their own new SST files, respectively, without touching the original data — so we can reuse it in tests again and again.
C: Yes, yeah — it's common practice to have links like that to the implementation underneath. I had an issue with the Nebula Exchange part: it uses Hadoop for direct SST file generation. My idea was to generate the SST files directly and then import them — there are keywords in the nGQL syntax for that, so you can ingest SST files directly into Nebula, as I understand it. I didn't try it, because it uses Hadoop underneath.

C: We are not users of Hadoop, and it's old technology we wanted to reduce in our systems.
C: We are using a shared, BeeGFS-based storage system, which is highly compressed with LZ4 to squeeze a little more out of the storage side. It would be very practical, after you generate those files, to copy them to the local NVMe disks and ingest the SST files into NebulaGraph.

C: That way everything could be prepared for the specific cluster — the number of nodes, the distribution of the files, everything — and that would also, I think, be an improvement in the speed of loading a large amount of data. But yeah.
B
On
yeah-
and
that
was
our
original
idea
to
do
this-
for
ingestion
yeah.
C
I,
don't
know
why
yeah
yeah,
yeah
I
suppose
that
you
are
using
Hadoop
in
a
company.
So
that's
that's
the
reason
why
nebula
exchange
is
set
up
like
that
and
supporting
that.
But
there
is
some
party
in
exchange
which
are
supporting
local
file
system
also
or
some
other
distributed
NFS
or
something
on
the
Linux
so
yeah.
Maybe
it
is
not
hard
to
implement,
also
support
for
for
some
other
distributed
file
system
so
or
or
a
remote
file
system.
Whatever.
A
A
Had
brought
it
to
The
Exchange,
our
teams
and.
B: Or, the other way around, allowing Importer to read data sources other than CSV files — I think we also discussed that before.

C: Yeah — maybe you know the New York taxi rides data set. It's mostly used with analytical databases, but it can also be used in graph systems — you know, driving around the streets of New York and analyzing shortest paths or something like that. It is a large data set, and it might be more representative than, I don't know, the basketball-players set currently used, which is a very, very small data set — you cannot cover much with that.
C: With it, even — I don't know — a few partitions or whatever, you will not populate enough data to cover, say, a partition crash or something like that. It should be a decent amount of data; it doesn't need to be huge, but, I don't know, a million records is a good starting point. I'm usually testing with around 10 billion records on any database, and most modern databases are good at dealing with 10 billion records — the problems start at 100 billion these days.

C: Yeah, so 10 billion is actually a normal data set these days, and it's good to test with something like that. For example, our loads of a few million or a few hundred million records are easy — there is no concurrency problem.
C: There are no delays in the background compaction or anything like that. But when you start to load more than a billion, or a few billion, records, then the system shows some stalls and some problems with timeouts and things like that. I'm dealing with that right now, because on the old 2.1.2 version of Nebula we're using, graphd just accepted the connections and I didn't get any timeouts under a normal concurrent load, but on 3.2 I do now, so I have to experiment a bit with the timeouts and what can be set.

C: I'll get to it when I have time — but I decided, it's Sunday, it's summer time, so I had to find some time to join and meet you.
C: Yeah, it is interactive, and you see the people behind the scenes. We all have some experience by now — I've been involved for maybe a year and a half, at least one year, with Nebula, and a lot of things have happened during that period, so we should share our experiments, our experience, and try to improve everything.
A: Okay, I think we can call it a day — this is a good time. We're showing that we can have this kind of meeting; a lot of insightful things came up, and we will have more discussions, I think in our async way, in the future. Yeah.

C: Okay, very good. It was every second week or something like that at the start — that was too often, I think. Yes, so once...