From YouTube: 1. #everyonecancontribute cafe: QuestDB
Description
QuestDB Introduction, live demo and AMA with Vlad Ilyushchenko, Nicolas Hourcard, David G. Simmons, Niclas Mietz, Michael Friedrich, Michael Aigner, Nico Meisenzahl
Blog: https://everyonecancontribute.com/post/2020-09-23-cafe-1/
QuestDB: https://questdb.io/
A
But yeah, that's it. I'm happy to welcome everyone to our first English iteration, or English version, of the "everyone can contribute" coffee chats, which I've called "cafe". I would like to start with a short introduction round before we dig into today's topic, which will be QuestDB. A little bit about myself: I'm a Developer Evangelist at GitLab. I joined in March 2020; before that I was in the open source monitoring area for around 10 to 11 years, maintaining an open source monitoring software, and I've been looking into metrics, logs, events, and traces all over. I'm eager to learn new things, today and in the future.
B
Yeah, thank you Michael. My name is Vlad; just don't worry about the last name, most people can't handle it anyway, so it's all good. I'm CTO at QuestDB, and I've also been writing QuestDB since 2014. My day job at the time was algorithmic trading in various financial institutions, primarily banks, fairly big banks, and I did that for a few years. That is effectively where the inspiration for QuestDB came from.
A
Thanks. Just pick someone on your Zoom screen who should continue.
B
Again, I'm gonna pick. Can I pick another Michael?
C
So hi, my name is Michael Aigner, I'm from Austria. I'm a product manager and software engineer at a company called ZKW. We have been part of LG for roughly two years now. My day job is mainly software programming, C++, and has been for roughly 15 years now, and I have more and more problems with time series analysis and all that kind of stuff. So I'm really interested to hear about QuestDB.
D
Yeah, my name is Niclas Mietz. I'm not from Austria but from Germany, and I'm currently working as a cloud engineer.
D
Currently I'm glitching a little bit, because I'm not able to get a proper webcam; it's only my iPhone streaming my picture. I work as a cloud engineer doing mostly operations stuff: working with Kubernetes, monitoring systems like Prometheus, and in some cases also infrastructure. I was really curious about this new time series database, QuestDB, and I already tested it a little bit to find out how it works and where it could fit.
E
Hey guys, so I'm co-founder and CEO of QuestDB, and just like Vlad I come from a financial services background, so mostly banks, such as Rothschild, and also Nasdaq prior to that. Super excited to go through QuestDB with you guys today, so yeah, I'll pass it on. There is another Nico, right? So two Michaels, two Nicolases.
F
Yeah, so my name is Nico Meisenzahl. I'm also from Germany, near Munich, so nearly the southern part of Germany. I'm doing consulting around Kubernetes, just helping our customers get started with all the combinations, mainly focusing on Azure, but in the end I'm doing Kubernetes, so it's the same everywhere. So I didn't plan to join this round.
F
At least I planned to, but I thought I wasn't able; now I am able. I'm pretty sure I'll have to go to listen-only mode in some minutes, but I'm really interested in the stuff and hopefully learning some new things.
G
Well, we'll see about the "least" part. I'm David Simmons, I'm head of developer relations at QuestDB. I was previously at Influx, so I've got some history with time series databases, and I guess I'm the only one in the US right now on this call, so I'll represent North America.
B
Actually, not knowing how this was going to go, we prepared some slides. Not a lot of them, just very short slides with the kind of subjects we can cover, so you guys get to know QuestDB a little more. I can just whiz through them quickly, so it shouldn't take very long, and then hopefully it lays some foundation for more questions. It would be good to make it super interactive if possible.
B
Cool. Can I share my screen?
B
Very cool. Right, can you see this? Cool. So this is QuestDB, live. You know who we are. I just wanted to cover basically three subjects here: what QuestDB is, what QuestDB is not, and various integration aspects. Then perhaps we can introduce you to the demo that we have, if you haven't seen it already; you might have, actually. And that's it, so just whizzing through it.
B
So, what QuestDB is: generally, we incepted QuestDB as a column-based store. I guess you guys must be familiar with column-based and row-based approaches. We picked column-based because it made sense for time series data. The reason I say it makes sense for time series data is that usually the data with the most volume is time series data, and the column-based approach helps partition this data right from the start.
B
So, basically, data is sliced into columns, and if data is not needed for a query, it doesn't have to be lifted from disk at all, whereas in a row-based approach that is a little bit more complicated to achieve. And then the data that's stored: QuestDB stores data partitioned by time. Time partitioning helps us slice data even further, into more meaningful chunks.
B
The way it works in QuestDB is that columns are files on disk and the time partitions are directories containing those columns. With the directories we're trying to be very friendly with the file system: if you were to list a QuestDB data directory, you should be able to work out which files belong to which columns and which columns belong to which dates, that kind of stuff. We heavily use time partitioning, which I can explain further if you're interested: we use it to find time intervals very efficiently, and we also use time partitioning as another horizontal dimension.
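To make the layout concrete, here is a minimal sketch of the idea described above: one directory per time partition, one file per column inside it. The file names and month-granularity paths are illustrative assumptions, not QuestDB's actual on-disk format.

```python
from pathlib import Path
import tempfile

def write_partitioned(root: Path, rows):
    """Append (timestamp, fare) rows into per-month partition directories,
    writing each column to its own file."""
    for ts, fare in rows:
        part = root / ts[:7]                      # "2018-03" -> partition dir
        part.mkdir(parents=True, exist_ok=True)
        for col, value in (("ts", ts), ("fare", fare)):
            with open(part / f"{col}.col", "a") as f:
                f.write(f"{value}\n")

root = Path(tempfile.mkdtemp())
write_partitioned(root, [("2018-03-01T10:00", "8.5"), ("2018-04-02T11:00", "12.0")])
print(sorted(p.name for p in root.iterdir()))     # two partition directories
```

A query that only needs `fare` for March would open just `2018-03/fare.col` and leave every other file untouched, which is the "don't lift unneeded data from disk" point from above.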
B
If we want to process data concurrently, then it's important to know that QuestDB is an append-only system, as opposed to databases where you can mutate data via either UPDATE or DELETE. Append-only makes the system relatively simple to implement, and especially simple to implement repeatable consistency, which comes next.
B
The way we implement repeatable consistency is: if you can imagine, there's the data, and then there's a position in the data beyond which any reading code does not go. As data is appended, this pointer moves forward atomically, enabling reading systems to pick up all of the appended data consistently and atomically as well. That's how we deal with consistency.
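The published-position idea can be sketched in a few lines. This is an illustrative analogue, not QuestDB's actual code: the writer appends first, then advances a watermark, and readers only ever look below the watermark, so they never observe a half-written row.

```python
class AppendOnlyLog:
    """Toy append-only store with a reader-visible watermark."""

    def __init__(self):
        self._data = []
        self._published = 0          # readers never look at or past this index

    def append(self, row):
        self._data.append(row)       # write the row first...
        self._published = len(self._data)  # ...then atomically publish it

    def snapshot(self):
        # A repeatable, consistent view: everything below the watermark.
        return self._data[:self._published]

log = AppendOnlyLog()
log.append({"ts": 1, "v": 10})
log.append({"ts": 2, "v": 20})
print(log.snapshot())
```

In a real multi-threaded implementation the publish step would be an atomic store with the appropriate memory ordering; the ordering of "write data, then move the pointer" is what gives readers a consistent view without locks.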
B
So if you consider, for example, data that is not necessarily observability related, you could just store changes as appends, one record after the other, and then there's a query which says "I want the latest for this id", and it would return the last row fairly efficiently, and you have a history of all the changes as well.
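The "store every change, query the latest per id" pattern described above can be sketched as follows. This is an illustrative Python analogue of the query behavior, not QuestDB's implementation; the field names are made up.

```python
def latest_by(rows, key):
    """Return the most recent row per key, given rows in append (time) order."""
    latest = {}
    for row in rows:
        latest[row[key]] = row     # later appends overwrite earlier states
    return list(latest.values())

history = [
    {"id": "a", "price": 1},
    {"id": "b", "price": 5},
    {"id": "a", "price": 3},       # a newer state for id "a"
]
print(latest_by(history, "id"))
```

The full `history` list is the audit trail of all changes; `latest_by` is the "give me the current state" view over it.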
B
And the last point: I guess our unique value proposition is performance. We try to optimize both data ingestion and retrieval through various means. "Highly optimized" is a bit generic, but what we do is, first, the partitioning I mentioned: partitioning lends itself to parallel query execution, so we can process chunks of data simultaneously on multiple threads. That's one thing.
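Partition-parallel aggregation can be sketched like this: each partition's chunk is reduced on its own thread, then the partial results are combined. The partition contents and thread count are illustrative only.

```python
from concurrent.futures import ThreadPoolExecutor

# Three "partitions" of a numeric column, e.g. three months of data.
partitions = [list(range(0, 100)), list(range(100, 200)), list(range(200, 300))]

with ThreadPoolExecutor(max_workers=3) as pool:
    partials = list(pool.map(sum, partitions))   # one partition per worker

total = sum(partials)                            # combine the partial sums
print(total)                                     # same as a serial sum
```

The point is that a sum, count, or min/max decomposes cleanly across partitions, so the time-based slicing from earlier doubles as the unit of parallelism.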
B
If we need to aggregate a large chunk of data, you probably know that SIMD is something that lets you parallelize execution within a single core, in hardware, without using threads, so we can process multiple values with a single instruction at the same time.
B
That's another optimization technique. And lastly, all of the I/O around the system is non-blocking, generally. The system does not have locks or critical sections or anything that would make it possible to lock it up, other than having it process a monumental amount of data, maybe down some inefficient paths; in that case it might take time, but otherwise it's a lock-free and wait-free system.
B
I guess that's what QuestDB is. If we move to the next slide, I suppose it's important to also understand that QuestDB is not an online transaction processing system. I say that in the sense that our transactionality, for example, is so far implemented per table, so in QuestDB it's impossible to commit several table updates in one transaction.
B
We might implement that in the future, but for now that's not the case. Likewise, we're not optimized for delete and update workloads, and you can have cases where people absolutely have to delete a record from the database, or update a field in the database. We're not fully optimized for that: it's possible to update a value, generally, but it's not very efficient, and likewise for delete.
B
That covers deletes and updates. And the last point: QuestDB is not a data lake, in the sense that we haven't built connectors to work with cloud storage yet, so all the data for QuestDB needs to be located locally to the computing hardware, which obviously places some limitations on the amount of data that can be processed: it has to fit the physical server.
B
So that's the summary, and at this point I would like to welcome any questions, if you guys have some so far.
B
Okay, cool, so I'm going to move on to the next slide, which is integrations. What do we have that might help? We saw the Influx line protocol as an interesting proposition. First, there's a lot of tools that send data in this format, and what I personally found pretty cool about what Influx has done with this protocol is that you can add a column on the fly, so in some cases you don't really need to add the column up front before your data arrives. In a typical relational database, adding a column is a release procedure, generally, whereas the Influx line protocol makes it really easy and instant, and QuestDB supports it to the fullest.
B
It's not a free operation, but there is no data backfill: if you add a column to an existing table, it's not trying to backfill the data with nulls or anything like that. We use the Influx line protocol for ingestion, and on top of it we enhanced it a little bit by adding optional authentication to the protocol. It's totally optional, but if you need it you can have it, and this authentication is a secure challenge-response authentication.
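The wire format being discussed is plain text: a table name, comma-separated tags, space, comma-separated fields, space, timestamp. A minimal sketch of building such lines follows; the table, tag, and field names are made up, and real ILP escaping rules are ignored. Note how the second line simply introduces a new field (`humidity`) with no schema migration, which is the "add a column on the fly" point above.

```python
def ilp_line(table, tags, fields, ts_ns):
    """Format one Influx-line-protocol-style record (simplified, no escaping)."""
    tag_part = ",".join(f"{k}={v}" for k, v in tags.items())
    field_part = ",".join(f"{k}={v}" for k, v in fields.items())
    return f"{table},{tag_part} {field_part} {ts_ns}"

line1 = ilp_line("weather", {"city": "nyc"}, {"temp": 21.5}, 1600000000000000000)
line2 = ilp_line("weather", {"city": "nyc"},
                 {"temp": 21.0, "humidity": 60}, 1600000000000000001)
print(line1)
print(line2)   # "humidity" appears mid-stream; no ALTER TABLE needed
```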
B
Sorry, next: the Postgres wire protocol. We see Postgres wire as a really convenient feature for getting a lot of tools integrating with QuestDB without having to change those tools or program new interfaces.
B
We support most of the PostgreSQL protocol, to the point where you can pick an off-the-shelf driver for any language, and you can definitely execute queries and insert data through the driver.
B
There are some limitations where we don't support all of the features of PostgreSQL: we don't support file upload yet (remote file upload via the wire), we don't support script execution, and some of the metadata queries that target Postgres may not work correctly, even though we're chipping away at implementing all these queries to make sure you can get a list of tables, a list of columns, and basically navigate the database using Postgres tools. The Postgres wire is actually the instrument that allows you to use Grafana against QuestDB, using the built-in Postgres adapter in Grafana.
B
Basically, this Postgres plugin for Grafana has two parts: there's one that lets you visually build the query.
B
But on the other hand, the plugin also lets you write free-text SQL, and there you can write whatever query you need, and you can plot the data using the Postgres plugin.
B
So those are the integrations, I guess. On the Influx line protocol, I didn't mention: we actually support both TCP and UDP versions of the protocol. So you can send data traditionally over TCP, or you can use UDP, depending on the environment.
B
Some environments prioritize differently. For example, there's a system that sends its own metrics out for storage, and they prioritize the performance of the sending part over data loss: they would rather send data quickly than have the TCP protocol create back pressure into the application and slow the application down just because it cannot send the metrics out. So we've got both, as a choice of which one to use, and both of them are fairly high-performance implementations.
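The fire-and-forget property of UDP is easy to see with a tiny loopback example. To keep it self-contained, this sketch sends an ILP-style line to a local socket we control; a real setup would target the database's ILP UDP port instead, and the line content here is made up.

```python
import socket

# A local receiver standing in for the database's UDP listener.
recv = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
recv.bind(("127.0.0.1", 0))                  # OS picks a free port
port = recv.getsockname()[1]

send = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
line = b"trips,vendor=a fare=8.5 1600000000000000000\n"
send.sendto(line, ("127.0.0.1", port))       # returns immediately: no back pressure

data, _ = recv.recvfrom(1024)
print(data.decode().strip())
send.close(); recv.close()
```

`sendto` never blocks waiting for the receiver, which is exactly the trade-off described above: the sender stays fast, at the cost of possible silent loss.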
B
We tested the UDP Influx line protocol, and for a very typical message we easily process over three hundred thousand messages a second, on a single thread. Genuinely. So, do you guys have any questions?
B
I think it's about the version: what's the difference between version one and version two?
D
I'm not quite sure myself where the difference is. I was trying to get some Prometheus metrics into QuestDB, transforming my metrics from Prometheus to Influx, and I tried to use Vector for it. Vector is also an interesting tool for metrics, and it can also handle log files, and they stated that they support version two but not version one. So I'm currently trying to pin down where the real difference is.
D
I'm currently also checking what the difference between the two protocols is, and where the compatibility is currently failing.
B
Okay, I see. We implemented this a while ago; I'd say it should be version one at the very least. But to be honest, I don't know the difference between one and two myself. The whole protocol, generally, is name-value pairs: there's a table name, then tag values, then field values, and the timestamp, all as text. We can probably take it offline, and we'll investigate this particular method of sending data from Prometheus to QuestDB and see where the issue is.
B
And we'll try to figure out basically what it sends. One important point here is that we wrote the whole of QuestDB ourselves: we don't use libraries for anything, and we're fully in control of the Influx line protocol parser. If something is amiss, we usually investigate pretty quickly.
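The name-value structure just described can be sketched with a toy parser. This is illustrative only: it ignores the real protocol's escaping and typing rules, and the sample line's names are made up.

```python
def parse_ilp(line):
    """Split a simplified ILP line into (table, tags, fields, timestamp)."""
    head, fields_part, ts = line.rsplit(" ", 2)
    table, *tag_pairs = head.split(",")
    tags = dict(p.split("=", 1) for p in tag_pairs)
    fields = dict(p.split("=", 1) for p in fields_part.split(","))
    return table, tags, fields, int(ts)

parsed = parse_ilp("weather,city=nyc temp=21.5,wind=3 1600000000000000000")
print(parsed)
```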
D
I've also got a second question on that: is there already support for authentication in the protocol, so that I can enter a username and password, for example?
B
Yes, it's a challenge-response authentication. It's not a username and password; it's more a certificate-based authentication, but yes, there is support for that.
B
For dev builds we do not, but this is something we're working on. Right now, again, we're fairly new to the scene, and one of the things we're working on is a proper CI system to build on multiple operating systems and such. Publishing a dev Docker container is going to be part of it, but this system is incomplete.
A
Yeah, no worries. I'm just thinking about when I want to try it out: as you mentioned, some features are only in master right now, and nowadays it's super convenient to just have a Docker container and not have to install a long list of dependencies manually. I know it needs a lot of effort to get right in CI/CD, but I think developers will ask you for that if they want to contribute actively.
B
Yeah, definitely. But the thing is, there are a lot of databases out there, and some of them are pretty complicated to install from source. Our database builds into a single file; it's a single file that you run.
B
It's about a three-megabyte file; you can literally double-click on it and it runs. You don't need to install any dependency other than Java. You do need to have Java locally, but other than that it's fairly easy to get going.
A
No, I was just thinking you might have Java already installed on your systems because of desktop environments or whatever. So it's a fair point to say it's easier to install from source and to contribute than to have whatever Python or MySQL installed just to make the application run. It should be easier to contribute.
B
Yeah, the whole of our development process, and actually the product, is about making it super simple to use from every angle, including building it. To build it you need Maven, though; it's one Maven command, and it produces everything you need to run, and the file itself is very small.
B
Yes, we do, but Homebrew is for releases. For dev we don't build the Homebrew package for the nightly build, which makes sense; this is for released versions. But yes, we do have Homebrew now.
A
For me, it's just that when I want to try things out, I'm hesitant to curl and pipe to bash or whatever. I want either Homebrew, or a Docker container, or a single binary which gets started, because oftentimes your system gets messy after installing lots of things. That's why I'm asking.
A
That's awesome. I'm also currently looking a bit into the GitHub repository; the building-from-source documentation is really clear and should make it easy to get started.
B
Oh, thank you very much. And not only that: this is usually Nic's line, but we're very keen to build a community, and our ethos is to help you guys do stuff. So we also have a Slack channel that you can join, and if there are any questions, we're available there to help with anything, whether building or using it.
B
So we've got the demo here. Have you seen the demo, by the way? Who hasn't seen it? Let's start with that.
B
Oh, and this is the demo. Can you see this demo screen? Yes? So this is the URL, it's try.questdb.io, and it runs on port 9000. I'm just going to reload the page. So it's there, right? So what we have here is a bunch of queries that you can run.
B
There are 1.6 billion-plus records in the database. It's not the biggest database in the world, but it's okay for a demo, I guess. One thing I mentioned was SIMD instructions; for example, there is this query that calculates a sum over a single column of a table, and this is the execution time.
B
Here, I'll select it: 211 millis, and this is the result. This is not a canned query, or rather not a canned result, so you can run it on any other column. If we run it again, it's probably faster because the data is a bit warmer.
B
This also illustrates the partitioning. For example, pick this query: this is a filter, and the filter takes all the data that belongs to this one day, or one month rather, and this is the performance of finding all these records: 2.7 milliseconds. And you can tweak it.
B
For example, for the sake of argument, find another month, say month three; it doesn't get much slower. You can change this, or you can get an entire year, for the sake of argument.
B
That's the number of records it returns: 110 million, in 17 milliseconds.
B
Yeah, so you can do, say, month two, and then, I don't quite remember the syntax, I think you add one month from that start, so it's going to do months two and three together. So this is month number two, and if I go to the bottom...
B
There's the bottom: this is month three. I don't know if you can see this, it's 2008-03. So basically this is a format for an implicit range: this is the starting point, and this is the interval length. Or you can do it conventionally; for example, you can do this.
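The implicit-range idea, where a timestamp prefix like "2008" or "2008-03" stands for a whole interval, can be sketched by expanding a prefix into a half-open [lo, hi) range. This is a simplified illustration (only year and month granularity); real parsers handle days, hours, and explicit interval lengths too.

```python
from datetime import datetime

def prefix_to_interval(prefix):
    """Expand a 'YYYY' or 'YYYY-MM' prefix into a [lo, hi) datetime interval."""
    if len(prefix) == 4:                          # "2008" -> the whole year
        lo = datetime(int(prefix), 1, 1)
        hi = datetime(int(prefix) + 1, 1, 1)
    else:                                         # "2008-03" -> the whole month
        y, m = int(prefix[:4]), int(prefix[5:7])
        lo = datetime(y, m, 1)
        hi = datetime(y + (m == 12), m % 12 + 1, 1)
    return lo, hi

print(prefix_to_interval("2008-03"))
```

The shorter the prefix, the wider the interval, which is why filtering on "2008" sweeps a year while "2008-03" sweeps one month.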
B
2020, so we can run this; this is going to pick four months. If we go down to the bottom, yeah, this is month number four, and so on. So let's test the query: the execution time is still low. What this basically does is two things.
B
First, the data is partitioned by month, and when the optimizer sees the date range, it first works out which partitions need to be lifted into memory, say from the top of partition one to the bottom of partition two. It knows how many rows are in each partition, and it just starts lifting data without even searching for things; that's the time partitioning. And if you add a day, for example, say 03, and a time, say 22:00, just for the sake of argument, this will go inside the time column and do a binary search to see where the data begins, and that's the execution time; it's slightly longer.
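The binary search just described can be sketched with Python's `bisect` module: given a sorted timestamp column, the start and end of a time range are found without scanning the rows. The timestamps below are made-up integers standing in for a partition's timestamp column.

```python
from bisect import bisect_left, bisect_right

timestamps = [100, 105, 110, 110, 120, 130, 140]   # sorted, like a time column

lo = bisect_left(timestamps, 110)    # index of the first row at or after t=110
hi = bisect_right(timestamps, 130)   # one past the last row at or before t=130
print(lo, hi, timestamps[lo:hi])
```

Two O(log n) probes bound the whole interval, and everything between `lo` and `hi` can then be lifted sequentially, which is why adding a time-of-day bound only costs a little extra over the pure partition lookup.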
B
It's 20 milliseconds, because it just does a bit more work. And this is the start of the interval, basically where it began; that's the interval search. And then you can do things like, I think you can do this with Influx too, for example this query... I'll just pick one. Okay, so this is a good one.
B
What this query does is take a seven-day interval and sample counts by one hour within that interval. It's effectively aggregation of the data by time, and you specify the time interval like this.
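The hourly sampling above (SAMPLE BY in QuestDB's SQL) amounts to truncating each timestamp to its bucket and counting rows per bucket. A Python analogue, with made-up epoch-second data:

```python
from collections import Counter

def sample_by(epochs_sec, bucket_sec=3600):
    """Count rows per time bucket by truncating each timestamp to the bucket start."""
    return Counter(t - t % bucket_sec for t in epochs_sec)

rows = [0, 100, 3599, 3600, 7300]            # seconds since some origin
print(sorted(sample_by(rows).items()))       # hourly counts
```

Changing the bucket size is just changing `bucket_sec`, which mirrors how the demo later switches from one hour to four hours to one day without touching anything else in the query.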
B
If I run it, that's the execution time, a bit longer, 64 millis, but you get hourly counts, and this is something you can plot with, sorry, with Grafana. You can have Grafana run this query and it will plot these values. And you can vary this: one hour is not a fixed entity, so you can say, do it by four hours, just for the sake of argument.
B
Oh well, it's a bit longer, actually; there was some network latency here, 18 millis for some reason. But yeah, this does it by four-hour intervals. I don't know if you can see my screen well in terms of font size; if you can't see the numbers I'm highlighting, do shout. And it doesn't have to be hours, so you can do, say, one day.
B
It's basically the same thing, so you get fewer rows, obviously, because the interval is seven days. Actually it adds seven here, so it's eight; it's a bit of a weird mask, so you can do six plus six, and you get seven rows including the first one. That's the count. And generally, other databases, like for example Postgres...
B
They would go, and I think TimescaleDB posted an article saying this, and pre-calculate all the different aggregations: you calculate by month, you calculate by day, you calculate by whatever. This is real time: you don't really need to pre-calculate anything. These are the results you get on the fly, with responses generally sub-100, often sub-10 milliseconds, so it's pretty easy to not pre-calculate stuff. So that's the sampling query. And there are other queries, like this interesting time series query. Let me just close this window, it's dangling unnecessarily. What this does is take one day of data, and then join it.
B
This is called an ASOF JOIN, and it joins weather to this data. What the ASOF JOIN does: say, for example, the ride was at 10 a.m. in the morning, and your weather reading is not necessarily at exactly 10 a.m.; it could be at 9:55 a.m. The ASOF JOIN effectively takes the very latest reading from that table.
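The ASOF semantics ("latest reading at or before each ride's timestamp") can be sketched with a two-pointer walk over two time-ordered inputs, which is one common way such joins are implemented. The data is made up for illustration.

```python
def asof_join(rides, weather):
    """For each (ts, ride), attach the newest weather reading with ts <= ride ts."""
    out, j = [], -1
    for ride_ts, ride in rides:
        while j + 1 < len(weather) and weather[j + 1][0] <= ride_ts:
            j += 1                      # advance to the newest reading <= ride_ts
        out.append((ride, weather[j][1] if j >= 0 else None))
    return out

rides = [(1000, "ride-a"), (1010, "ride-b")]      # time-ordered
weather = [(955, "cloudy"), (1005, "rain")]       # time-ordered
print(asof_join(rides, weather))
```

Because both sides are time-ordered, the weather pointer only ever moves forward, so the whole join is a single linear pass rather than a per-row search.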
B
This is in part thanks to the one-day selection: if we select, say, a year, it will take a little longer, but not too much. Actually, it didn't take any longer, so there you go. And this is actual data you can scroll through; I'm just sitting on the page-down button here, so it's scrolling.
B
So it's the actual data, you can see it. That said, not all of the queries are super optimal; we still need to do some work on the optimizer.
B
We need to do some work on the filtering. For example, if you filter by fare amount here, fare amount over the whole table, it will take longer, because the access to the data for that filtering is row-based. It's not multi-threaded: single-threaded, row-based. But in upcoming releases we're breaking every query into pieces, and we're going to make the filter multi-threaded.
B
So it's going to execute pretty much like the queries I'm showing you. It's not all ideal. What I'm trying to say, effectively, is that we've got some queries we implemented that use row-based access to data and are single-threaded, and some other queries that have been tweaked to use concurrency and SIMD.
B
The ones with concurrency and SIMD are going to be way faster than the row-based queries. I guess that's what I'm trying to say. So this demo, as I say, is live; you can just run exactly the same queries yourselves. And if you have any questions so far, I'd be really happy.
A
I have one: what's hiding underneath the Chart tab, next to the Grid?
B
Yeah, so I just switched there; it builds the chart automatically from the query you have. It took a while to build because we included a year here, a year's worth of data, 110 million rows, so it took a few seconds to plot this chart. But this shows...
B
I guess the colors here are a little bit off. It shows fare amount, temperature, and wind direction on the same chart. It's a bit strange, but you can, for example, make it less dense, so yeah, let's make it like one day, or 20-second buckets or so.
B
And then if I draw this, it's a little bit faster, but oh yes, it's not very good. Basically, I didn't prepare the chart, but the idea is you can pick the values from a query, and it will put the timestamp on the x-axis and all of your non-timestamp values on the y-axis, and here you can see all three things drawn on the same scale.
A
Maybe show the weather example, like 10 years of New York City weather data. I think that graph is a little better to showcase, maybe.
B
Cool, so yeah, this data at least is comparable in terms of scale, so it draws well. This chart is not super sophisticated, to be honest, and if you do need more sophisticated charts, this is why we're trying to connect QuestDB to Grafana, because Grafana is a lot better at drawing charts than we are.
B
This just provides another angle on the data, rather than grids, you know. So it's not perfect, but this is where the data looks pretty cool, just because it's on the same scale. And we need to sort out this timestamp: we're printing these timestamps with a lot of zeros, so we'll fix that.
A
Yeah, I think this really adds something to the interface, because personally I can read the long grid with columns and data, but after some time you get tired of it and you want to see some graphical representation. For quick debugging or quick analysis, I would say having a simple interface is best, because in my opinion Grafana sometimes has too many options and you get lost in the interface.
A
You just want to have a quick look at the data. And I could imagine that if you, for example, had that available with a persistent URL, you could embed the charts into GitLab issues or something else for incident management as well, without having to install Grafana; you'd have your own charts. I do know that Prometheus also has a simple web UI, based on React I think. And maybe think of exporting a PNG or even a PDF; this could be super useful.
A
If you could just query the API and say, hey, I want a PNG right now with the latest time series set, which is defined via a query, then you wouldn't need any external dependencies; it's just QuestDB and nothing else.
D
I've also got another question, regarding the notification block: is there currently an option in the UI to block notifications when I press the run button? Because when you press it a lot of times at first, your browser gets flooded by all the notifications reporting how long each query took.
B
Cool. But yeah, I didn't have a chance to tell you that we're very grateful to you for organizing this.
B
It's really good to see that you guys are even a little bit interested in what we do, and thanks for tuning in. We basically want to help you in some way, and if QuestDB can be helpful, whether something works or something doesn't, we're happy to just go ahead and implement things. If you didn't know, the SQL stuff is also part of our code base.
B
It's not a library; we're fully in control of the syntax, the error reporting against it, everything and anything here. So we think this sort of time aggregation is a little bit simpler than what you would do in other systems, in the sense that you generally write less text. That's just the goal: if we can do something to reduce the text that you write, it's something we can do. Just for the sake of argument, what we can do is this.
B
This whole "select ... from" is optional, in case you didn't know, so you can just run this and, oh, it goes into the chart. It just selects from this table. We can manipulate the syntax in a way that generally makes your life easier, and there's also the optimizer.
B
We can tweak the optimizer; there's quite a lot of funky stuff, like moving things around. Basically, the goal of the optimizer is... okay, let me just rephrase it. The way a query is executed is as a daisy chain of code.
B
So there's one piece of code that sources the data, there's another piece that takes input from the first, and a third piece that takes input from the second one, like a data chain, right? And the goal of the optimizer is to reduce the input from the first source of data, so that the subsequent pieces of code process less data. That's the idea. It's fairly aggressive; it can restructure the SQL quite heavily to achieve that, generally. So yes, that's the thing.
B
We can work on this optimizer to take things even further. And also, one of the things we wanted to do is remove the headache of... if you have a SQL query, you then need to hint it, create indices and whatever, right? All of the execution you saw here doesn't rely on indexes. It's just data, generally; you don't need to build indexes, or remember to build indexes, that kind of stuff.
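The daisy-chain execution and the optimizer's job can be sketched roughly like this (an illustrative Python toy, not QuestDB's actual code): each stage feeds the next, and "optimizing" means pushing the filter down toward the source so every downstream stage sees fewer rows.

```python
# Toy model of a query pipeline as a chain of generators.
# Names and structure are illustrative, not QuestDB internals.

def source(rows):
    # Stage 1: sources the data.
    yield from rows

def filter_stage(rows, predicate):
    # Stage 2: passes through only matching rows.
    for row in rows:
        if predicate(row):
            yield row

def aggregate_stage(rows):
    # Stage 3: consumes whatever reaches it.
    return sum(row["value"] for row in rows)

data = [{"id": i, "value": i} for i in range(100)]
keep_even = lambda row: row["id"] % 2 == 0

# Naive plan: the filter sits in the middle of the chain,
# so the source still hands 100 rows downstream.
naive = aggregate_stage(filter_stage(source(data), keep_even))

# "Pushed-down" plan: the predicate is applied at the source,
# so later stages only ever see the 50 matching rows.
pushed = aggregate_stage(source([r for r in data if keep_even(r)]))

assert naive == pushed  # same answer, less data flowing through
```

Both plans return the same result; the pushed-down one simply makes every subsequent link in the chain cheaper, which is the restructuring Vlad describes.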
C
I had a question about the, I would say, unusual way of not using any libraries. What are the main reasons? You mentioned that you can build it really easily, but do you also avoid, for example, the standard container stuff from the C++ standard library to optimize it more?
B
Yeah, so, well, the bulk of the code is actually written in Java. This is Java, and there's a C library built in there as well. The reason we try not to use libraries is to avoid coupling issues, right? You could use a library, but it doesn't necessarily take the data in the format you have it in, so you've got to...
B
Okay, you transform the data from the format you have into what the library understands. That's the sort of rationale: so that we don't need to do these transformations. If I have the data here, whatever code I write can use this data as it is; it doesn't have to be moved anywhere, right? And that allows us, in most cases, to avoid copying data. Generally, no copying and no transformation: that's the main goal of the tightly coupled interfaces between the components that we have.
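The no-copy argument can be shown in miniature with Python's `memoryview` (just an analogy for the idea, not QuestDB's mechanism): code that consumes data in the layout it already has avoids the transform-and-copy step that adapting to a third-party library's format would force.

```python
# Tiny analogy for the zero-copy rationale: a memoryview is a window
# onto an existing buffer (no copy), while adapting data for a library
# that wants its own format forces a transformation plus a copy.
buf = bytearray(b"timestamp,price,qty")

view = memoryview(buf)[0:9]   # no copy: window onto the buffer
copied = bytes(buf[0:9])      # copy: new object, new memory

assert view.tobytes() == b"timestamp"
assert copied == b"timestamp"

# The view tracks the underlying buffer; the copy does not.
buf[0:9] = b"TIMESTAMP"
assert view.tobytes() == b"TIMESTAMP"
assert copied == b"timestamp"
```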
B
And for the rest, yeah, so basically in C we have two things. Java usually comes with its own libraries for doing IO, like if you need to read a file, write a file, interact with the network, that kind of stuff, right? And those libraries are horrible, as far as we can tell, generally. So we use a very thin C layer to provide Java with direct access to the operating system.
B
So we bypass that. Java is very big on frameworks: you've got the operating system, and then they write their own framework to deal with the operating system, right? So we kind of imploded the entire framework, and we let Java call methods like read file and write file, and we keep the signature of the method consistent across multiple operating systems.
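The thin-layer contract, one signature with the platform-specific choice made once underneath, might look like this as a sketch (hypothetical names; QuestDB's real layer is a thin C library called from Java, not Python):

```python
import os
import sys
import tempfile

# Hypothetical sketch of a thin OS facade: identical function
# signatures on every platform, bound once at startup. QuestDB's
# actual layer is C exposed to Java; this only illustrates the idea.

def _read_file_posix(path: str) -> bytes:
    fd = os.open(path, os.O_RDONLY)
    try:
        chunks = []
        while chunk := os.read(fd, 4096):
            chunks.append(chunk)
        return b"".join(chunks)
    finally:
        os.close(fd)

def _read_file_windows(path: str) -> bytes:
    # On Windows, binary mode matters; the signature stays identical.
    fd = os.open(path, os.O_RDONLY | getattr(os, "O_BINARY", 0))
    try:
        chunks = []
        while chunk := os.read(fd, 4096):
            chunks.append(chunk)
        return b"".join(chunks)
    finally:
        os.close(fd)

# Bind once; call the same name everywhere after that.
read_file = _read_file_windows if sys.platform == "win32" else _read_file_posix

with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"hello")
    name = f.name
content = read_file(name)
os.unlink(name)
assert content == b"hello"
```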
B
Have you heard of him? No? All right, this gentleman is, I think, a professor at a university somewhere, Denmark or something like that. But anyway, he wrote this one library that we use, called the vector class library, which is effectively C++ templates for vectorized arithmetic, right? And because it's basically inline templates, it's barely a library at all, generally.
B
So it's just code that gets basically inlined into your binary, right? And we use that for SIMD, generally. We use the library because, if you dig into SIMD, oh my god, it can become complex really, really quickly because of the CPUs. This library also lets us determine things on the fly. For example, say you have a method that does some stuff, right?
B
When we build the C code, we compile several versions of this method, for all the instruction sets that we support: SSE2, AVX, AVX2 and AVX-512. There's four of them, right? And we determine which one to call at runtime. When you start, we go: okay, this is an AVX-512-compatible CPU, and all the calls are going to go to the AVX-512 routine, right? And the vector class library that he created...
B
Basically lets you template this really easily: you effectively write the same code and just compile it with different flags. It will compile for SSE2, and the same code will compile for a different instruction set. So it's really, really good. It's not complete, you can't do everything with it generally, so there are a few things we needed to extend it for, but generally it's amazing, an amazing piece of code, and I just can't recommend it enough if you want to use it.
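The runtime-dispatch pattern, several builds of one routine with the best supported variant chosen once at startup, can be mimicked in miniature (the CPU-feature probe is faked here for illustration; real code queries CPUID):

```python
# Miniature model of instruction-set dispatch. The three "builds" are
# stand-ins for the SSE2/AVX2/AVX-512 compilations of the same source;
# the cpu_features set is a fake for illustration (real code uses CPUID).

def sum_sse2(values):
    return sum(values)      # stands in for the SSE2 build

def sum_avx2(values):
    return sum(values)      # stands in for the AVX2 build

def sum_avx512(values):
    return sum(values)      # stands in for the AVX-512 build

# Ordered best-first; the first supported variant wins.
VARIANTS = [("avx512", sum_avx512), ("avx2", sum_avx2), ("sse2", sum_sse2)]

def bind(cpu_features):
    for name, fn in VARIANTS:
        if name in cpu_features:
            return name, fn
    raise RuntimeError("no supported instruction set")

# Bind once at startup; every later call goes straight to the routine.
chosen, fast_sum = bind({"sse2", "avx2"})
assert chosen == "avx2"             # best available wins
assert fast_sum(range(10)) == 45
```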
C
That was my next question, because I'm in the situation now that I have to build some instructions for a high-performance computing system, and that was the question: what do you use, and how do you deal with this?
B
Here, this is his name, and this is what it's called.
C
I think I found it, okay: on GitHub, vectorclass.
B
One other thing we use C++ for: we basically stole an idea from Google's Swiss table. I don't know if you've heard of it; it's a quite well optimized version of a hashmap, and we use it for aggregation, generally. The way it's optimized... how to explain it... so, when you store your keys in a hash map, what it does basically is this.
B
It stores, in a dense way, a hash code of every key that you have in it, in blocks; for example, you've got 16 hash codes, right? Then, if you need to find something in the map, you calculate the hash code of the key and, using SIMD instructions, you search 16 hash codes all at the same time to see which ones match, and then you check only those that match.
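That probing trick can be modelled without intrinsics: keep one hash byte per key in a dense control block, compare a whole block of 16 against the probe's byte "at once", and verify full keys only for the hits (here a comprehension stands in for the single SIMD compare):

```python
# Model of a Swiss-table-style probe. A dense block of 16 one-byte
# hash tags is scanned "in one go" for the probe's tag; only matching
# slots are checked against the full key. The list comprehension
# stands in for the single SIMD compare instruction.

BLOCK = 16

def tag(key) -> int:
    return hash(key) & 0x7F     # 7-bit tag stored per slot

def probe(tags, keys, key):
    t = tag(key)
    # One "SIMD compare": which of the 16 slots carry this tag?
    candidates = [i for i in range(BLOCK) if tags[i] == t]
    # Then verify only those few candidates against the real key.
    for i in candidates:
        if keys[i] == key:
            return i
    return -1

tags = bytearray(BLOCK)
keys = [None] * BLOCK
for slot, key in enumerate(["ts", "price", "qty"]):
    tags[slot] = tag(key)
    keys[slot] = key

assert probe(tags, keys, "price") == 1
assert probe(tags, keys, "missing") == -1
```

The win is that 15 of 16 slots are usually rejected by the cheap tag compare, so the expensive full-key comparison runs only a handful of times per block.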
B
Yeah, so this would use it: average distance per passenger count, right? We also use it in a multi-threaded way, so we'll build basically 20 hash maps at the same time, pretty much, and then merge them together. That's the thing, and that's the execution time. It's not super fast, but it's about half a second to do that. And it's 1.6 billion rows, again, you know.
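The many-maps-then-merge aggregation looks roughly like this as a toy (QuestDB does this natively with its own optimized map; names here are illustrative):

```python
from concurrent.futures import ThreadPoolExecutor
from collections import defaultdict

# Toy sketch of parallel aggregation: each worker aggregates its own
# shard of rows into a private map (no locking), and the partial maps
# are merged at the end. Illustrative only; QuestDB does this natively.

rows = [(i % 4, float(i)) for i in range(1000)]   # (passenger_count, distance)

def aggregate(shard):
    partial = defaultdict(lambda: [0.0, 0])       # key -> [sum, count]
    for key, dist in shard:
        acc = partial[key]
        acc[0] += dist
        acc[1] += 1
    return partial

def merge(partials):
    total = defaultdict(lambda: [0.0, 0])
    for partial in partials:
        for key, (s, n) in partial.items():
            total[key][0] += s
            total[key][1] += n
    return {key: s / n for key, (s, n) in total.items()}

shards = [rows[i::4] for i in range(4)]           # 4 shards, 4 private maps
with ThreadPoolExecutor(max_workers=4) as pool:
    averages = merge(pool.map(aggregate, shards))

# Same answer as a single-threaded pass over all rows.
single = merge([aggregate(rows)])
assert averages == single
```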
B
So that's the best thing; it's pretty good, and we have our own implementation of it. I think this map comes from one of Google's projects. What I found is that, I mean, you can use it verbatim, but the project sort of serves mostly the needs of Google, right? The stuff they templated into it... oh my god, we don't need half of it.
B
If you look at it, Jesus Christ, this is a lot of code, you know. So we just implemented our own that, again, suits our data, and doesn't need to conform to Google's formats, that kind of stuff.
B
Cool, that's all I really have to show, and if you guys have any more questions, I'm obviously happy to answer any of them, if I can.
B
Yeah, so clustering is in progress. We're basically implementing it; it sits on a branch called replication. It's not fully ready, but it's coming, generally.
B
We aim to build simple replication right now. It's not clustering in the sense of sharding, so it's not going to put half the data on one node and half on the other; it's just going to replicate the data across multiple nodes. And we wanted to build it super efficiently, and when I say super efficiently, here's the idea behind what we're building for clustering.
B
When we ingest data now, we ingest it in a row format, one row after the other, right? For clustering, we're building column-first ingestion: if you, for example, have 20 columns, it will ingest 20 columns at the same time. It will basically write those columns independently, and if you have partitions, it will write the partitions independently. And we'll build a mechanism for you to...
B
We will send basically all the data, or as much data in parallel as possible, and we're going to leverage this mechanism to ingest it simultaneously into the table. So we just want to make it super fast; that's why it's taking a while, you know.
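The column-first idea, each column (and each partition) written independently so the work parallelizes, could be caricatured like this (hypothetical structure, not the replication branch's actual code):

```python
from concurrent.futures import ThreadPoolExecutor

# Caricature of column-first ingestion: the incoming batch is split
# by column, and each column's values are appended to its own store
# independently, so the writes can proceed in parallel. Hypothetical
# structure, not QuestDB's actual replication code.

columns = {"ts": [], "price": [], "qty": []}

batch = [
    {"ts": 1, "price": 10.0, "qty": 5},
    {"ts": 2, "price": 11.0, "qty": 7},
]

def append_column(name):
    # Each worker touches one column only: no shared state, no locks.
    columns[name].extend(row[name] for row in batch)
    return name

with ThreadPoolExecutor() as pool:
    list(pool.map(append_column, columns))

assert columns["price"] == [10.0, 11.0]
assert columns["qty"] == [5, 7]
```

With partitions added, the same split applies per partition as well, which is where the "five columns times two partitions" parallelism mentioned later comes from.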
A
I think we discussed last week maybe showing the roadmap for next year or something like that, giving everyone a little heads-up and an outlook: what's coming, what's cooking.
B
Oh good, I've prepared it, but I need to dig it up. Basically, I guess one thing that's on the roadmap is replication, because this is something that is highly anticipated in the database community. We know that it is really, really needed, and this is what we're building first and foremost, yeah.
B
Yes, Jesus Christ, this is hilarious, guys, because out of order is something I've been working on this morning and am continuing to work on; I even forgot about it, you know. So right now, out of order takes a while. The limitation that we have is that the data you ingest needs to be in timestamp order. That's the limitation, which we consider to be a pretty serious one, generally. And what we're building is the ability to insert data in any order, right? And what that means...
B
What that effectively means is: you've got your target table, you've got your source data, and your source data can be messed up in any way you want; when you insert it, it will be reordered and put into time series order as it lands in the table. That's the idea, and this is roughly 50% complete, and we're really keen to finish it. Nick, if you have any other reminders, I'd welcome those, you know, because I'm like, you know...
A
Sorry, I was just referring to the out-of-order idea, because I think we had that problem in the past. So, for instance, you have a monitoring system which writes metrics, and then the node is shut down, and after a while you have some replicated data, so you want to keep the older events and you also want to insert them into a time series database. With RRDtool...
A
This isn't possible, because you depend on the time series being inserted in order. With Graphite you can do it, but Graphite is, well, a little slow in that regard, and I think it could be the same with QuestDB. Am I right?
B
Yes, QuestDB wouldn't let you do that generally; it would force you to insert in timestamp order. Right now it's going to refuse: you cannot insert vectors out of order. That's the issue, yeah. We're building something to overcome that completely, and it's going to be transparent.
B
You would insert data by whatever means you currently use, like InfluxDB line protocol or anything else, programmatically, Postgres, anything, and all of these would use the out-of-order system. Also, we're building it in such a way that it will not impose limitations on what kind of data goes in. For example, you would be able to insert ten-year-old data together with last-second data, and it would all go in, generally.
B
That's the thing, and not only that, we're building it to be hugely parallel as well. It's going to insert multiple partitions and multiple columns, with columns split into, say, three pieces for the sake of argument. So if you've got five columns and two partitions, it will parallelize this to a degree of 30.
A
Sorry, I would recommend putting a strong marketing focus on that, because I can see there's a business need there, with having a replay log or something like the MySQL binlog. There will always be the possibility of some old data which needs to be inserted, and I think this is a key feature which your customers and your users will love, I would say.
B
Yeah, well, the thing is, it's been a little bit of the bane of our lives, this out-of-order stuff. That's why there's not much marketing at all; we're just excited to have it. And the reason I'm so super excited about it is because I've been working on it for like two months non-stop, you know. I just can't wait to see it done.
B
You know, but the way we ended up doing it was a bit of trial and error, and I'm very excited to do it really fast. Basically, what we aim for: we want to be able to insert a million records in about under 100 milliseconds, totally out-of-order records, right? So that's...
B
There's no penalty afterwards: when you retrieve the data after that, there's no tax whatsoever; it's going to be as fast as this demo. The data isn't going in unsorted, like other databases put it in; we reorder it on the fly, and it lands basically neatly in timestamp order.
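The behaviour described, arbitrarily shuffled input landing in timestamp order with no read-time cost, amounts to a merge on write. A minimal sketch (single rows here for clarity; QuestDB merges whole column vectors):

```python
import bisect

# Minimal sketch of out-of-order ingestion as a merge on write: rows
# arrive in any order, but each insert places the row at its correct
# timestamp position, so reads never pay a sorting cost. Illustrative
# only; QuestDB merges whole column vectors, not single rows.

table = []  # rows kept sorted by timestamp at all times

def insert(ts, value):
    bisect.insort(table, (ts, value))

# Ten-year-old data arriving together with last-second data:
insert(1_600_000_000, "new")
insert(1_300_000_000, "ancient")
insert(1_599_999_999, "recent")

assert [ts for ts, _ in table] == sorted(ts for ts, _ in table)
assert table[0] == (1_300_000_000, "ancient")
```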
E
Oh yeah, I mean, you know, more generally, we're building features and integrations which will be part of our...
E
You know, enterprise offering, which we call Pulsar for now, and this is going to be part of a sort of open core model approach. The idea really is, one, to push QuestDB open source as widely as possible and really turbocharge usage of the free product, and then this enterprise offering will use QuestDB as a library and will be more suited for, you know, super large deployments at scale, which, you know...
E
We don't currently serve as of today. So that's the sort of idea, and there's a lot around security, monitoring, and, you know... maybe you want to touch on that, Vlad: a different type of replication, which will be...
B
So the replication we're thinking of for the enterprise product is basically reliable-UDP-based replication. It's more geared towards a setup where you've got servers close together, generally, and they service quite a lot of requests together. Rather than sending data over TCP and getting all of the extra TCP traffic, right, the UDP replication would multicast data to the nodes and deal with NAKs generally, and this is going to facilitate faster data propagation in an environment where the servers...
B
Basically, they're connected to the same, for example, switch, right? The same fast switch. This is fairly unique; I don't know if you'll ever need something like that, but this takes us to maybe the 18-month roadmap, or maybe even 24, you know. Our 12-month roadmap is open source, too.
A
If not, I would like to maybe wrap up, or round up. Thanks for attending today and sharing all the amazing insights. For me it's late now, so I'm not trying it today, but I might do tomorrow or next week. And also thanks for the insights into the roadmap, what's coming next, how to contribute, and other things; really appreciate it.
A
Maybe we just do it again next time in a couple of months and see how the progress is, or something like that, so let's stay in touch, yeah. And if someone else is watching right now, just reach out on Twitter, at @questdb or @gitlab or our handles, like @dnsmichi, yeah, and just let us know how it goes when you test-drive QuestDB. And with that...