From YouTube: Developing Applications with Apache Cassandra
Description
Christian Carollo is the director of cloud and alternative platform development at GameFly. He is focused on availability, reliability, and scalability in cloud computing, and on how mobile, tablet, and other non-traditional platforms can leverage cloud-based services.
Previously, Christian worked at Fandango as the Director of Data Systems, and at several other internet companies over the last 15 years.
Follow him on Twitter at @supernaut.
A: I'd like to thank you guys for inviting me tonight to speak to you about Cassandra and how we use it at GameFly.
A: I'll start off with a quick little overview of what GameFly is, in case some of you don't know; a little bit about what I do there; and sort of how that's evolved over time and how that's led to us actually using Cassandra.
At GameFly specifically, we actually built something that's social in nature using Cassandra, and then I'll dig deeper into Cassandra and kind of give a broad overview of the features that were appealing to us.
A: Specifically, we have focused in the past on the console systems and then, later on, the portable systems; we've moved most recently into digital downloads for PCs, and now we're starting to look at mobile platforms (phones, iOS, Android tablets, that sort of thing), trying to figure out where we can either build out a rental business or a sell-games business in that space. And then we have a couple of other properties.
A: You may have heard of Shacknews and MobyGames; they're sort of our content arms, and we use those to fill out what we can provide around the gaming experience: cheat codes, other bits of information about gaming, screenshots, that sort of thing, just to give you a greater feel for the product that GameFly is.
A: So, about two and a half years ago now, around the first of the year in 2010, I came in to work there specifically on mobile games and mobile applications. I had worked previously for a number of years...
A: ...at Fandango, where I was the Director of Data Systems, basically working on making sure that we could sell movie tickets on a Friday when Spider-Man or Batman or whatever big comic-book movie came out. I was looking for something new; I had actually done the Fandango iPhone app initially, right when it came out in 2008, and I was really looking to move in that direction. Unfortunately, it didn't work out for me at Fandango, and I found GameFly.
A: They didn't have anything having to do with mobile (obviously tablets weren't around back then), so we sort of got together, one thing led to another, and that was the start of what we do today. Initially there was nobody in that group, and now there are 10 people doing mobile development, whether it's phones or tablets or even website development specifically tailored to mobile browsers.
A: So when we were kind of looking at what we could do, we were looking at this mobile space and thinking: well, GameFly is whatever it is today; maybe we can reimagine it. Maybe we can interface with new customers using phones and eventually tablets, and, you know, we have looked at televisions and tried to figure out ways in.
A: If you were on Twitter, if you were on Facebook, if you were maybe buying a ticket on Fandango, let's say... In our space, you could manage your queue, and you could read about a game, and you could watch a video, but it was sort of lacking, and we heard feedback from customers saying that they wanted to be able to engage in the gamer community. So we thought about working with the Twitters and the Facebooks of the world, but we're kind of a smaller company; we're not a Netflix or a Facebook, or a Twitter for that matter, and so we decided we'd just try to build it ourselves.
A
We
were
specifically
talking
about
working
with
just
the
gamer
community,
so
you
know
the
next
question
really
was:
okay,
we're
going
to
do
social.
We
know
what
it's
going
to
be.
We
want
it
to
be
the
social
stream
where
information
is
coming
in
pretty
much
all
the
time
that
gamers
can
communicate
with
each
other
about
anything
it
could
be.
A: It could be whatever they would normally say on Twitter, or it could be about a news article that we had on Shacknews, or about a video game that they're excited about that's coming around the corner, or about a trailer that they hate for a video game. It could be contextual or it could be just generic. We kind of knew that was what we wanted to mold our social stream around.
A: So we looked around, based on our prior experiences doing other products at other properties (the Fandangos of the world, as an example), and we knew there were a couple of things we might run into that might be problems: how do you scale systems like that? How do you do things of that nature when you haven't done them in the past? Because GameFly was a retail business, a commerce business; it didn't have the same constraints that a social platform might have.
A: The closest thing we had was Shacknews, which had a forum-type system, but even then it was more long-form, not a lot of back and forth; it was more like a chat system, for lack of a better way of thinking about it. So we looked at Facebook and Twitter and LinkedIn, Reddit, and some others like MySpace, and we tried to understand how they built their systems.
A
What
were
the
things
that
were
pain,
points
for
them
and
what
were
the
things
that
they
did
right
or
that
they
learned
the
hard
way
and
we
hopefully
wouldn't
have
to
learn
and
the
one
big
takeaway
was
massive
data.
Massive
usage
leads
to
sort
of
massive
scale
problems,
and
so
we
looked
at
how
they
scale.
A: Interestingly enough, as a side note, Cassandra came out of Facebook, but Cassandra was actually built off of white papers from Google and from Amazon, so it's sort of living on top of the learnings of lots of other companies in the long run. Facebook was actually using it for a while as the backbone for their mail product within the platform.
A: They've actually since moved on; they bought a couple of companies that were HBase experts and sort of pushed Cassandra out over time. That doesn't mean Cassandra isn't a good product; it just means they have expertise in a different area. So anyway: how do they scale?
A: That was sort of the core question, and we didn't know anybody; we didn't know how to ask that question of anybody else down here. MySpace was, I guess, maybe the biggest similar player; I actually have a friend who works there.
A: We knew we kind of didn't want to go that route. GameFly is a Microsoft shop in its history, but we knew we just didn't want to pick those technologies. So again, going back to the other companies that I mentioned earlier: they all tended to look for creative solutions to these problems.
A: We knew that for scaling social there were a couple of things we cared about: we wanted to be nimble, fast, flexible, scalable, and available. So what did those things mean to us? Well, since we didn't really know what to expect, what nimble meant to us was that we needed to be able to get into the code, potentially in real time, make a change if need be, and deploy that change without having to bring down all the infrastructure along the way.
A: We knew that we needed to be scalable. For us, that really meant that if we got a big spike in user traffic and we needed another database server to help us handle the load, we could supposedly just add one on, so to speak, and everything would be hunky-dory.
A: But for us, available meant that if we're going to do upgrades to the software, whether it's the database or the operating system or what have you, we could do it so that we could always have rolling upgrades as the preferred strategy. So we're never down, we're never losing data; things are always working.
A: Social doesn't necessarily mean what the Reddits and the Twitters and the Facebooks made of it, but, not knowing how they did it, that's what we were looking to do. So the philosophy for us, from a software perspective and a hardware perspective, was that we needed to always be able to add (that is, add or subtract; we never really thought too much about subtracting, but we wanted that flexibility). We wanted to be able to say that, if need be, we could always move forward without a lot of pain.
A: ...how stable that piece of hardware is going to be; and then you take it down to your data center, and, oh wait, it's 2 a.m. and you need to call the IT guy and get him down there. So that was a challenge, thinking about the hardware piece. But then, let's say we could solve the hardware challenge: then we had the software challenge, which is all the pieces I just mentioned. How do we do that?
A: Where do you get hardware on demand? Well, up until recently, that wasn't really feasible, but things have changed. So now you could either go the route of procuring hardware from a Dell or an HP, waiting for five or six weeks, and then hoping and praying that it can get into the data center and racked and all that in a short period of time; or you could potentially use this thing called the cloud. And so, you know, that's where we started. We had no experience in the cloud.
A: We had no idea who to use, or how successful it would be, so we started looking into that. Another architectural challenge for us was: okay, if we want to be nimble and flexible, the first two tenets that we described earlier, how do we do that in the software? Is it easy for us to make a change? Okay, we're deploying a DLL...
A: You've got to stop IIS; or, you know, maybe you're putting up a JAR, and you've got to do some other stuff where you stop and start something else, and you've got to compile it, and you can't do it right on the server, potentially (or if you do, maybe it seizes something while the server is doing that). So we were looking at statically typed languages versus dynamically typed languages, trying to see what was the most flexible and best fit our needs. That was the second challenge, and then the third challenge was: okay...
A: ...these database systems: how do you get them to scale, whether it's a relational database, or a Cassandra database, or Oracle? How do you get them to scale, hopefully without it costing a lot of money along the way? And then, lastly, looking at scaling, we wanted to make sure that we weren't going back and re-architecting every couple of months as things started to grow exponentially.
A: We just wanted to be able to add more hardware and expand the software across that hardware. Traditionally, you might start with one database server, and then let's say you wanted to add another one. If you're in a relational world, you may not have configured your schema in such a way that it can be replicated easily, so now you've got to go refactor the schema so that you can add another machine, so that you can take on more traffic and more load in your infrastructure.
A: So, you know, when we're talking about the hardware decisions again: it was about procuring hardware from a vendor, and that takes time; then you get it shipped, and then you get it into your office, and then some IT guy has to become available, he has to put down the operating system, drive it down to the data center. All that stuff has to happen. That was the traditional way we worked at GameFly, and we knew that was going to be probably impossible due to resource constraints within our company.
A: So we talked about deploy our own, build our own, or try the cloud, and after spending a little bit of time looking at the vendors that were out there, we pretty much decided the cloud was for us. We looked at these three: Amazon, Heroku, and Rackspace. But the maturity and the feature set that we could get from Amazon just drew us into that space.
We needed infrastructure that could scale, which means we needed to be able to get these things at any time. We needed to be able to get small ones, big ones, super-big ones, configured in different ways with X number of drives. All of that became available to us as soon as we went with the cloud solution; you could kind of design it on the fly if you wanted to. We also needed it to be in more than one location.
A
Eventually,
we
haven't
actually
gotten
to
this,
but
we
like
the
idea
that
you
could
actually
scale
your
infrastructure
from
just
being
a
one
data
center
to
three
or
four
data
centers
and
then.
Lastly,
this
point
about
horizontally
or
vertically.
It
just
meant
that,
like
if
our
infrastructure
needed
more
cpus
inside
three
machines,
instead
of
going
to
30
machines,
we
had
that
option
just
as
well
as
we
had
the
option
to
add
more
hardware,
just
additively
buying
more
boxes,
so
the
flexibility
that
the
cloud
offered
really
catered
to,
what
we
needed
at
the
time.
A: So in the software space, you know, we were obviously looking at cheap stuff; I mentioned that earlier. We were trying to stand on top of those who had done some amazing things before us, and to cut costs wherever we could, so we spent a lot of time looking at open-source products.
A: We also wanted the software to be as flexible as the hardware solution we found. We wanted to make sure that we could change it very easily (this was that nimble piece I mentioned earlier), and, as new people came on board (because again, we were only two people), we wanted to be able to say: if we documented this fairly well, hopefully you could come in, take that software that we've hopefully documented very, very well, and there would be a low learning curve to making you actually productive in our infrastructure.
We realized that we were going to go with a dynamic language, one that you could change pretty much on the fly, anytime, anywhere, inside a little piece of software that's running on the third server or the fifth server within your web tier. We knew that meant we were probably going to sacrifice a little bit of performance, but we were willing to make that trade-off: if we had slightly lower performance over here, we might just add another server at a later date.
So, you know, we did some experiments with Java, but we kind of quickly moved away from that, and that was predominantly the only language we looked at that wasn't a dynamically typed language. We looked at Ruby and Python. We liked those more than PHP because, at the time, we were really playing around with the interactive shells. (I found out later that there was a PHP shell that came out of Facebook; I didn't know that at the time.) So we were really playing with Ruby.
A
We
actually
really
loved
ruby,
but
it
was
difficult
to
do
some
things
that
we
wanted
to
do.
We
wanted
to
have
the
ability
to
do
threading
like
behaviors,
without
having
to
write
threads,
and
there
was
something
called
event
machine
that
was
sort
of
this
thing
that
could
kind
of
do.
G
A: But it was, like, poorly documented, and there was one guy who seemed to know everything; it was really hard to understand. So we ended up going toward Python, and then, specifically, we ended up using something similar to EventMachine called Tornado, which is an event-loop web server. It allows you to do asynchronous communications, sort of like a threading model, but you don't have to, like, get into the weeds of doing threading.
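(For context, here is a minimal sketch of the event-loop style Tornado encourages: one process, one loop, no explicit threads. The handler and route are hypothetical, not GameFly's actual code.)

```python
# Minimal Tornado sketch: a single event loop serves many connections.
import tornado.ioloop
import tornado.web

class StreamHandler(tornado.web.RequestHandler):
    def get(self):
        # A real handler would fetch posts from the data layer here.
        self.write({"posts": []})  # dicts are serialized as JSON

application = tornado.web.Application([(r"/stream", StreamHandler)])

if __name__ == "__main__":
    application.listen(8888)
    tornado.ioloop.IOLoop.current().start()  # run the event loop
```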
A: There we go: Oracle. Yeah, they were like three hundred thousand dollars or something like that when we used them at a prior company of mine, and we had actually done a prototype at that time with Solr, and it did everything we needed, and we still went and bought Endeca.
A: So we decided again to use open source, and so we took Solr. And then again, we spent some time looking at databases, but we had a lot of experience on the team of two people working on relational systems, and we knew some of the cons that we didn't really want to face. So we were really looking for alternatives to relational systems: things that had scalability sort of as a core tenet of their design, which obviously meant there were probably going to be some sacrifices along the way.
I had been following Cassandra for a while, because I knew about the Google Bigtable and Amazon Dynamo models that it sort of leveraged to grow out of and build off of, so it was pretty much ingrained in me, I think, at some point, that I really just wanted to try this and see if I could make it work. And so that's what led me to Cassandra. So, like I said, the Bigtable and Dynamo white papers were very interesting.
A
They
accomplished
a
flexible
data
model
and
sort
of
the
horizontal
scalability.
They
each
did
these
one
thing
very
well
and
facebook
when
they
built
cassandra.
Just
basically
said:
let's
see
if
we
can
put
these
two
things
together.
That
really
appealed
to
me.
I
liked
having
the
flexibility
at
the
data
layer
and
I
like
having
the
flexibility
of
being
able
to
add
servers
and
and
bring
another
machine
up,
have
it
replicate
sort
of
distribute
the
data
that
it
had
on
two
machines?
A
A
A
I'm not doing it; I'm bringing the machines up and saying "you know about you, and you know about you, and you know about you," and the rest happens within the infrastructure itself, within the Cassandra infrastructure. That was really appealing. And then, later on, they introduced the ability to do that in a wide-area network. So now you could have a data center in Los Angeles with three machines, and a data center in New York with three machines, and that same thing I described when you bring up the three machines locally...
A
Could
happen
across
a
land,
there
will
obviously
be
the
speed
of
light
and
preventing
you
from
getting
your
copy
over
there
and,
however
much
time
that
takes
to
get
from
la
to
new
york,
but
your
data
will
get
there
and
it'll
happen
all
without
you
really
having
to
do
much,
which
again,
like
I
said,
if
you
need
to
have
that.
That
was
really
appealing
at
the
time
that
we
were
first
looking
at
cassandra.
It
was
a
cassandra
zero.
Six
now
they're
up
to
one
twos
in
beta,
it
had
very,
very
fast
rights.
A
Its
rights
were
actually,
I
think,
four
times
faster
than
its
reads,
which
is
not
traditionally
how
a
database
works,
which
just
seemed
amazing.
I
was
that
sort
of
just
drew
me
just
for
that
alone,
but
the
second
question
was:
why
are
your
reads
so
slow
and
they
couldn't
really
give
a
legitimate
answer
to
that.
It
was
more
because
the
rights
were
just
amazingly
fast
and
but
eventually
in
around
1.0
of
cassandra
they
actually
had.
H: When you said you write the data across the... when is that? In async mode or synchronous mode?
A: They don't really think of it as asynchronous and synchronous; it has to do with what's called a consistency level, and everything in Cassandra is what's called eventual consistency, so it doesn't follow the traditional ACID model. We're getting a little off track, and I'll touch on this later, but I'm happy to bring it up now. When you say, "I have a row of data and I want that row of data to be in three places within my architecture"...
A: ...there won't be loss, because of the way it writes: it writes to memory and to a commit log, in an append-only manner, on the one system that got it, before it gives you a response. So you'll get that data somewhere, and then the eventual-consistency part will happen at a later date; "later" being, you know, less than a second.
You can have an insane amount of data, quite frankly, because you just have to put some disks behind it, and you just have to make sure that you have enough servers to hold all the data that you want. And, depending on what you're actually doing with the data, you probably won't have it all in memory, but you'll be able to have access to it pretty quickly.
A: That was a limitation that was a concern, but it can have billions and billions of rows. So, the data model: let's talk about that a little bit. Traditionally, in a relational database, you have a table.
A: Columns: the columns are defined at creation time of the table, and they're fixed. Every row looks the same; if it's got 10 columns defined, every row has 10 columns, and if you don't populate those columns, you usually put null data or an empty string or whatever your choice is, I guess. The rough equivalent to that in Cassandra is a column family.
A: So what is a column family? Well, a column family has a primary key, a row-lookup ID, just like a traditional database does, and then it has basically name-value pairs that make up these rows and these columns. The ten columns that we talked about in the table? You can have those same ten columns in the Cassandra column family. However, let's say one row is a user-profile row, and it has a username and a password and a couple of other data points.
A
Let's
say
the
second
row:
you
don't
have
a
username,
yet
it
doesn't
create
a
field
called
username
and
put
a
no
value
in
it.
Just
doesn't
exist,
so
you
end
up
with
one
row
that
might
have
ten
columns
and
one
row
that
might
have
two
columns.
So
you
get
you
know
variability
in
the
shape
of
the
data
that
you're
storing
inside
one
column,
data
type,
the
key,
the
rokin
it
can
be
variable.
You
can
define
it
in
advance.
So,
in
the
definition
of
a
column,
family,
there's
a
there's,
you
can
define
the
column
names.
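(A quick sketch of that sparseness using Pycassa, the Python driver mentioned later in the talk; the keyspace, column-family, and row names here are made up.)

```python
# Sparse rows in one column family: rows need not share a shape.
import pycassa

pool = pycassa.ConnectionPool('SocialKeyspace', ['localhost:9160'])
users = pycassa.ColumnFamily(pool, 'UserProfile')

# One row with several columns...
users.insert('user-1', {'username': 'gamer42',
                        'password_hash': 'abc123',
                        'city': 'Los Angeles'})
# ...and another row with only two; no placeholder is stored for the
# columns that are missing.
users.insert('user-2', {'city': 'New York', 'referrer': 'shacknews'})

print(users.get('user-1'))  # OrderedDict with three columns
print(users.get('user-2'))  # OrderedDict with two columns
```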
A: This is jumping ahead a couple of steps, but you can also define the row key as being a certain kind. It could be an integer type, or there's something called the TimeUUID, which is sort of a supposedly globally unique value, so oftentimes in Cassandra we use that as opposed to a sequence.
C: What does it default to?
A: There is no default.
A: You have to define it, yeah. You can also do composite keys, which traditionally might look something like an integer, a colon, another integer, a colon, and so on; but you can actually create a type that defines the structure of this multi-part key. One of the more interesting things about a column family is that you have these dynamic rows, these different-shaped rows.
A
You
can
actually
think
of
it
as
being
both
a
data
structure
that
can
be
statically
defined,
meaning
you
can
always
enforce
to
have
a
row.
Look
the
same.
Every
row
look
the
same
as
you
want
exactly,
but
you
have
to
enforce
that
when
you're,
inserting
the
records,
the
other
side
of
that
coin,
is
you
can
have
it
be
dynamically?
A: Each row can have a different shape. The major, or the traditional, example case for that is timeline or time-series data. I don't know off the top of my head how you'd do this in a traditional database, but you would probably have some field called date, and you'd be ordering off of that, and the data might not necessarily get stored in that particular order...
A: ...unless you have, like, a sorted-on-disk index on that. But you could be writing data into that in every column, and you could have hundreds and hundreds of thousands, whatever number of rows you want. In Cassandra, the way you would do that is to make one row, which has much better read I/O, and you would just write out that data as different columns and hang it off of that: each column name would actually be a timestamp, and then off of that you'd actually put into it whatever you want.
A: So let's say you had 400 tweets, just as an example, over three years. That would be one row with 400 columns, and the timestamp for each tweet would be a column name; then, whatever you want to hang off of that (the post, a picture, a video URL, whatever you want) would all be sort of bundled into the column values. The column name is the timestamp, and the column value can be a string, an integer, or it can be a complex object.
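(Here is roughly what that timeline pattern looks like in Pycassa; it assumes a column family whose comparator is TimeUUIDType, and all names are illustrative.)

```python
# Timeline pattern: one row per user, one column per post, TimeUUID
# column names so the columns sort chronologically.
import json
import time
import pycassa
from pycassa.util import convert_time_to_uuid

pool = pycassa.ConnectionPool('SocialKeyspace', ['localhost:9160'])
timeline = pycassa.ColumnFamily(pool, 'UserTimeline')  # comparator: TimeUUIDType

def add_post(user_id, post):
    # Column name = a TimeUUID for "now"; column value = serialized post.
    ts = convert_time_to_uuid(time.time())
    timeline.insert(user_id, {ts: json.dumps(post)})

add_post('user-1', {'text': 'Hyped for the new trailer!', 'video_url': '...'})

# Read the 20 most recent posts by walking the columns newest-first.
recent = timeline.get('user-1', column_count=20, column_reversed=True)
for ts, raw in recent.items():
    print(ts, json.loads(raw))
```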
A: No, we haven't had that problem. But I actually built our public timeline that way: the index for the row key is actually a date, and then it goes out this way. So you might have, you know, 100,000 posts one day, 55,000 the next day, 30,000 the day after that; that sort of thing.
A: It was totally unnecessary, though, for what we did; it could have all been one row, if it weren't for the two-billion-columns-per-row limit.
When you have this dynamic data structure, you're going to have the opportunity to have it actually sort those posts that we were talking about; you can actually sort them by the column names. So there's this thing called the comparator: as you define a column family, you define the row key and the data type of the row key, and you also define the data type for the column names, so that it can actually be sorted in a particular way.
A: If you want to read it from the earliest to the latest, or vice versa, you can do it either way, and you can use a timestamp data type or a TimeUUID data type to actually do that. So the sortable column names come in handy quite often. So, you know, rows can have static lists of columns (that's where, like I was saying, you can just write out exactly what you want it to be every time and it can look consistent) or the dynamic lists of columns.
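(The comparator is fixed when the column family is created; a sketch with Pycassa's SystemManager, all names hypothetical.)

```python
# Create a column family whose column names are TimeUUIDs, so they
# sort chronologically on disk.
from pycassa.system_manager import (SystemManager, TIME_UUID_TYPE,
                                    UTF8_TYPE)

sys_mgr = SystemManager('localhost:9160')
sys_mgr.create_column_family(
    'SocialKeyspace', 'UserTimeline',
    comparator_type=TIME_UUID_TYPE,       # sorts the column names
    key_validation_class=UTF8_TYPE,       # row keys are UTF-8 strings
    default_validation_class=UTF8_TYPE)   # column values default to UTF-8
sys_mgr.close()
```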
A: It can be sort of like a timeline, and again, the rows can have variable lengths: a variable number of columns in each row. And then the data types: for the row keys and the column names, you can use ASCII, UTF-8, LexicalUUID, TimeUUID, byte types... there are like 12 of them, too many to put on here, and they can be used for columns and for row keys. And then, lastly (I'll just touch on it; you can actually see it down here): composite types.
A: Composite types are what I was talking about with being able to model complex objects. It's a little bit more advanced data modeling, but it allows you the ability to basically create, like, hashes of hashes, or a dictionary within a dictionary, in a way.
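(For the curious, a sketch of what a composite comparator looks like in Pycassa; the types and names here are illustrative.)

```python
# A composite column-name type: each column name is an (int, str) pair,
# which behaves like a dictionary within a dictionary, flattened into
# one sorted column space.
from pycassa.system_manager import SystemManager
from pycassa.types import CompositeType, IntegerType, UTF8Type

comparator = CompositeType(IntegerType(), UTF8Type())

sys_mgr = SystemManager('localhost:9160')
sys_mgr.create_column_family('SocialKeyspace', 'NestedData',
                             comparator_type=comparator)
sys_mgr.close()
# Columns can then be addressed as (outer_key, inner_key) tuples.
```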
A: You can do things like get_range and a couple of other features, but you don't have a lot of query power. That would be sort of the one thing you have to be aware of when you're building your architecture using Cassandra: you're not going to be able to do joins or ORDER BYs; you're not going to be able to do any of that stuff using the hardware you have at the database level. All of that is going to have to happen up in your application.
A: So you have all the data in the exact format you want it in, in whatever your language is that you're taking it up to on the middle tier or the web tier, and you're going to do the ORDER BY up there, you're going to loop over it and do some join; none of which are good ideas. But that was in the beginning, and now they've introduced something called CQL, which stands for Cassandra Query Language, which is very similar in nature to SQL, minus a few limitations: no joins, and there are some limitations on the WHERE predicate. Those are sort of the two biggest ones. But you can do SELECT * FROM any column family, just like a table. And they actually, I believe, just recently put in some ORDER BY functionality, which I haven't actually used yet. So for those who know SQL, who understand SQL, you don't have to kind of get down into the whole business of using these more primitive drivers that leverage Thrift and do all this stuff.
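(A sketch of what that looks like from Python, assuming the early stand-alone cql driver that shipped alongside Cassandra 1.x and exposed a DB-API-style interface; the keyspace and column-family names are made up.)

```python
# CQL from Python via the early DB-API-style `cql` driver.
import cql

conn = cql.connect('localhost', 9160, 'SocialKeyspace')
cursor = conn.cursor()

# Looks like SQL, minus joins and with a restricted WHERE clause.
cursor.execute("SELECT * FROM UserProfile WHERE KEY = 'user-1'")
for row in cursor.fetchall():
    print(row)

cursor.close()
conn.close()
```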
A: So that's the data design, and you kind of understand what the model is like and how you're going to build things in it. So now it's like: well, okay, now how do I build an architecture? Well, the first and probably the most important thing is: don't think of it like a relational database. You know it doesn't have joins.
A: You don't really want to do the ORDER BY, because you can end up doing it at your application tier for the most part. So the big thing is: think of it more like a denormalized data system, if you're familiar with that, a denormalized relational database.
Well, you have a couple of choices. One choice is that you can write multiple versions. Think of them as, like, indices; think of the column families like building a materialized view of your data at write time. I want to look at the orders in my system by date; I want to look at them by user; I want to look at them by product. Traditionally, those would be three queries; now, they are three writes into your system.
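(A sketch of that write-time fan-out in Pycassa; the three column families and the order fields are hypothetical.)

```python
# Denormalization at write time: one order, three column families,
# three writes, one per access pattern.
import pycassa

pool = pycassa.ConnectionPool('ShopKeyspace', ['localhost:9160'])
orders_by_date = pycassa.ColumnFamily(pool, 'OrdersByDate')
orders_by_user = pycassa.ColumnFamily(pool, 'OrdersByUser')
orders_by_product = pycassa.ColumnFamily(pool, 'OrdersByProduct')

def record_order(order_id, user_id, product_id, day, payload):
    # Instead of one normalized row plus three queries at read time,
    # do three denormalized writes now; reads become single lookups.
    orders_by_date.insert(day, {order_id: payload})
    orders_by_user.insert(user_id, {order_id: payload})
    orders_by_product.insert(product_id, {order_id: payload})

record_order('o-1001', 'user-1', 'p-42', '2012-06-01', 'shipped')
```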
A: So again, you've got lots of servers, probably lots of disk space behind it, and disk space is fairly cheap these days. You're making a trade-off: you want to have the scalability of the system in exchange for having to do a little bit more work at write time.
A: Yeah; when you create the equivalent of CREATE TABLE, which is called creating a column family, you don't actually define any column names at that time. All you do is define the comparator, which is used to sort the column names, so you have to stick within the confines of a data type.
A: So that's one way you could do it. There were two ways to do this that I mentioned: one is at write time, where you write all the views you want. The other one is (and this is jumping ahead a little bit, but it's a good question to bring up) that Cassandra has this sort of ring topology, which allows you to have multiple nodes, servers, that are all part of this replication process.
A: The ring still exists as one entity, but they sort of slice it up, and they treat what's in LA as one thing. So you'll have a replication of three in here, and you'll have a replication of three in New York, but your replication factor is still three; that's the rule: you have to have three copies. When you divide the data center up into these two parts, what happens if LA-1 disappears, right? It goes into the ocean.
A: You still need to have three over there. The reason why that is important is that you don't have to have physically disparate data centers to take advantage of virtual data centers. What you can do is create several virtual data centers, if you want, inside one physical data center; you can make copies to that second virtual data center, and there you can do all sorts of ad-hoc querying, analytics, whatever you want to call it.
A: DataStax, which is the company that's taking Cassandra and sort of turning it into a commercial product, has extended Cassandra in this way: the core feature is taking Hadoop and bringing it in. So now, if you write your data one way, you can use, like, Hive or Pig, or write your own MapReduce stuff if you want, in whatever your language of choice is, and you can manipulate that data on your own however you want. It's sort of like what you're saying: you're going to pull it out, do some ETL-type thing, and then shove it back in. That sort of thing has to happen in a traditional relational system, oftentimes.
A: You have to pull it out into a data warehouse, and you might be changing it, putting it in an analytics cube, doing whatever you're doing. Here, sometimes when you're doing that, you don't have to change the data to actually do the query, because Hive and Pig are pretty powerful and can do some things on their own, where you don't have to take it out and shove it back in in a different form; and sometimes you will have to.
G: I might chime in up there, because I've been dealing with both. The reason for using the big-data, NoSQL solutions is mostly that you're throwing an enormous amount of data at them; just running the regular reports that you would run against a MySQL database might not be feasible there at all, because you might have terabytes or petabytes of data. Think of it as: okay, you have this Apache log there, you know...
G: If you want to run some kind of statistics, then you need to run an analysis script on that. The normal way people do it is to run some kind of MapReduce using Hadoop, or some other kind of parallel-analysis tool on it, just to collect statistics: okay, I need these stats (you know, how many female users in Alaska do I have?). You normally would not run the reporting tool directly off the data set.
A: And then the really cool thing, to answer your question about preference: because the replication happens to this other virtual data center, I now have, let's say, two machines where I can run some really nasty Hive MapReduce job, peg the CPU, and let it run for, whatever, an hour, 24 hours, with no impact on my customer-facing data servers. None. They're still getting the replicas, the copies, coming out in real time to the analytics servers. So I can be doing analytics in basically near real time, without me doing any ETL or anything else.
When you've got, like, multiple copies of your data, maybe in multiple locations, with lots of servers supporting it... And, you know, if you're not using the DataStax product and you're just using Cassandra, it's freaking free; if you use DataStax, there's a support contract involved. But I like that, coming from my days doing transactional sales on a single database, when we would have huge spikes and we would grin and bear it and pray.
A: So, we talked a little bit about the replication already. The one really interesting thing here is the replication factor. When you are putting a column family into Cassandra, you get to define, for that one particular data object, the replication factor, or the number of copies of any given row within it that you want. So you can have table A (or column family A) and column family B, and they can have different replication factors.
A: We don't practice that; we just have a standard (we want three copies of everything), but you have a choice there. You can do that at what's called the keyspace level. The keyspace is like the catalog in your traditional databases: the thing that holds all your tables is a catalog, and in Cassandra they call it a keyspace, and it holds all your column families. So you can set it at the keyspace level, and then you can override it at the column-family level.
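(Setting the replication factor at the keyspace level might look like this in Pycassa; the keyspace name is made up, and the factor of three mirrors the talk.)

```python
# Create a keyspace whose rows are replicated to three nodes.
from pycassa.system_manager import SystemManager, SIMPLE_STRATEGY

sys_mgr = SystemManager('localhost:9160')
sys_mgr.create_keyspace(
    'SocialKeyspace',
    replication_strategy=SIMPLE_STRATEGY,
    strategy_options={'replication_factor': '3'})  # three copies of every row
sys_mgr.close()
```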
A: If I have three copies as my mandate, I've got to get three copies, right? So these two mechanisms are sort of built into the Cassandra replication system, and they make sure of that. When the server's down and comes back up, hinted handoff basically says, "hey, you missed all this data," and it basically hands over the data, because it's been saved on another node, and that node is aware that another set of copies is waiting to be made to another server...
A: ...when that server comes online. And then read repair: let's say you're doing a query and you need a consistency level of one (think of it like a dirty read in a relational database), but you still need three copies of it. When it does the read, it recognizes: hey, I'm going to give you this row back, but, oh, by the way, it hasn't made it to all three of the servers.
A: The other really interesting thing about consistency is that it's tunable at query time. What that really means is that at write or read time (both being queries), you can define the level of consistency you want. The best way to think of that is at write time: if I want a consistency level of one, I just need to make sure one machine gets a copy of that data, and I'll get a response back.
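(In Pycassa, that tunable consistency is a per-call option; a sketch with illustrative names.)

```python
# Per-query consistency: write at ONE, read back at QUORUM.
import pycassa
from pycassa.cassandra.ttypes import ConsistencyLevel

pool = pycassa.ConnectionPool('SocialKeyspace', ['localhost:9160'])
posts = pycassa.ColumnFamily(pool, 'Posts')

# Fast write: acknowledged as soon as one replica has it; the other
# replicas catch up via eventual consistency.
posts.insert('post-1', {'text': 'hello'},
             write_consistency_level=ConsistencyLevel.ONE)

# Stricter read: a majority of the replicas must answer.
row = posts.get('post-1',
                read_consistency_level=ConsistencyLevel.QUORUM)
```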
C: No, three copies, so that if one gets corrupted...
C: If one copy goes bad (like, one bit flips in one), you look at the corresponding locations of the two others to see which is the correct one, you're saying?
A: Yes. But back to the consistency level: when you're writing a record, maybe you only care that it gets to one location. That still means you're going to have a replication factor of three, in the example we're talking about, so you're eventually going to get that copy to three other machines (or two other machines, sorry), but initially it only has to be successful at one. So you'll have a faster write, with a response back to your application tier. The other side of that coin is that you can do a read... Yes?
D: Just a question regarding that. Let's say one of your servers is down... if you do a query for...
E: ...that copy, you said you would get the response back only once all three replicated, or...?
A: Right. But what is the case if one of those targets is down? If you have, say, three machines, and only three machines, and your requirement is "I have three copies of the data," and you have one machine go down, then, when you do that write, you will, if I remember correctly, actually have a problem with that.
A: We don't typically run with just three machines and a copy factor of three, so you have the ability to hopefully not have more than one server down at a time without somebody, you know, getting on it and taking care of it. So you can actually have three copies on, say, five machines, so they're spread out, sort of striped around the ring. In the scenario you're describing, if you have three machines and three copies, you'll have a problem; you'll have a system-down problem.
A: So in that scenario, if you have to have three copies and you only have three machines, you don't really have a lot of flexibility in case one server goes down. So maybe you want to start off with four machines and maybe a replication factor of two, so that you have flexibility. You have to think about those things as you're designing it.
A: Maybe I can explain it afterwards a little bit better, but, I mean, ultimately, if you demand three and you don't even have three servers, you're just not going anywhere. You're going to get errors back from Cassandra saying it wasn't able to do that write, and then your application is going to do whatever it does to tell the user it was unsuccessful.
A: Not the way they architected it. I mean, I don't know the details at that low a level, how they're doing that piece that you're referring to, but it's not an issue. They use a TimeUUID to actually do a timestamp of every record at a particular point in time, and they're able to coordinate that in such a fashion that that doesn't happen. But as for the actual inner workings of how that works, I'm not aware of the inner system.
A: Right, but you're going to have two different timestamps for the two records that were written, so I'm assuming... I'll take the example a little bit further. Let's say I'm updating my user record: say you're updating it in New York and I'm updating it in LA, right? So it's one record you're...
A: So it's tied into replication. Take an example where you have five machines in this ring, but your replication factor (which, for argument's sake, is three) applies to every data model throughout the system. So let's say you have five column families: every row in all five column families has to have three copies, a replication factor of three, but you have five machines. So with the partitioning (meaning I have five machines), there's going to be some set of three over here, and there's going to be another set of three over here.
A: It's architecture that you can fuss with (that's probably the best way to put it), in that you can control the partitioning, but you don't really need to do that. There are some values that you can set up at the beginning, and once you've done that in the config file and you start up the server, you're done. Every data model, every data object or table, whatever you want to think of it as, doesn't need to have partitioning defined at the schema level.
A: You don't have to do it that way; that's one way you can do it. You can also, like, inject yourself between two nodes. So if the ring topology goes from zero to a thousand, and these tokens are such that I'm at, you know, point 100, 250, 450, 650 and, whatever, 900, right? I can bring in another machine between the 250 and the 450.
A: I can manually do this and enter it into the system at, like, 325 in the ring. Now the ring isn't balanced; it's out of balance, because now there are three machines participating in an area where two used to participate, so they won't have an equal amount of partitioned data.
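(The arithmetic behind a balanced ring is simple. With the classic RandomPartitioner, the token space runs from 0 up to 2**127, and a balanced N-node ring places node i at i * 2**127 / N; a sketch:)

```python
# Balanced initial tokens for an N-node ring under RandomPartitioner.
def balanced_tokens(n_nodes):
    return [i * (2 ** 127) // n_nodes for i in range(n_nodes)]

print(balanced_tokens(5))
# Wedging a sixth machine "between" two existing tokens, as described
# above, leaves the ring unbalanced until the tokens are moved.
```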
A: So there are mechanisms now (specifically, if you're using DataStax's product, they have something called OpsCenter), and OpsCenter can basically do all this configuring for you; you just check the box and it does the rest. So you can manually do it if you want to, or you can have it set up automatically, and there's no reason, in my opinion, to go to the level of doing it manually. That was sort of how it was done before, and now it can do this automatically if you tell it to.
H: Let's say you fire up a new server to handle the load, balancing some of the peak level. Can you increase the CPU and RAM on the new Cassandra server you bring up, or do they have to be identical from one to the next?
A: So we kind of touched on this a little bit already with the I/O performance: everything writes fast, and then eventually they got fast reads. But the two interesting things to take away from the I/O performance are that they do all their writes sequentially: they write to memory, and they write to the append log that's on disk. And then what ends up happening, at some predetermined time in the future...
A: ...is that what's written in memory actually gets flushed to disk into these things called SSTables, and they're immutable; once they're there, they're done. If you're doing a delete, you're not necessarily deleting the record out of there right away: if you do a delete, it gets marked in another file, or in memory, depending on where it's happening, as "you've deleted a row," and at some point later there's what's called a compaction phase, and the compaction phase will actually eliminate the data during the compaction of these SSTables. But that stuff all happens sort of as part of the background process and doesn't actually put too much load on the system.
A: There are major compactions (they can put some load on the system, but you don't normally run those), and there are minor compactions that kind of sweep the system, kind of like garbage collection if you want to think of it that way, and those will keep the system in pretty good shape on an ongoing basis.
A: They keep your data in a consistent fashion. So that's something that, from an architectural standpoint, you want to keep in mind when you're developing your application. If you don't, theoretically you could run into some problems; it might not always work so well in sort of transactional systems, for banking and things of that nature, so you might have to architect those in a different way than your traditional relational design.
A: Well, it'll replay them all in the order that they were written to the append log; you'll just replay it in that order. So as long as you've kept that in mind in however you've written your application, it should be fine. And it's used for, like, when a node goes down and it needs to replay something off that commit log, you can do so. And, you know, again, with the writes being write-once: it goes into memory, it goes into the commit log...
A: You can add machines and you can remove machines; they can get integrated into the ring that we talked about earlier, and so your system's flexible at this point. On a heavy day, you could add more CPU on a bigger machine, or just add four more machines if they were small machines; it's up to you. So you can get your hands on the hardware, and again, with this multi-data-center topology...
A: Just a couple of notes about what we're doing, to kind of wrap this up. So, we chose Python, and Tornado is our web framework. We use something called Pycassa, which is an official Python driver for Cassandra; there's a phpcassa that's also an official driver for PHP.
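(Wiring Pycassa to a small cluster is mostly a matter of listing the nodes; the host names here are placeholders, not GameFly's real topology.)

```python
# A connection pool spread across a few Cassandra nodes.
import pycassa

pool = pycassa.ConnectionPool(
    'SocialKeyspace',
    server_list=['cass1:9160', 'cass2:9160', 'cass3:9160', 'cass4:9160'],
    pool_size=8)  # connections are distributed across the listed nodes

posts = pycassa.ColumnFamily(pool, 'Posts')
```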
G: And we've talked about some future plans?
A: So right now we have four Cassandra nodes, two search nodes, and two Hadoop nodes.
A: I think that's right. And it's all in one data center; we haven't done the multi-data-center configuration yet.
A: Yeah, we are actually on EBS... actually, no, no: we were, and we're not anymore. We went to ephemeral drives; we were on EBS and we changed that, like, a release or two ago. We've been kind of in this little tumultuous state where we've been upgrading a lot, because Cassandra's been moving quite quickly recently, and all.
A: So we've done, like, three upgrades and a couple of hardware swap-outs. We used to run m1.larges and we went to m1.xlarges, and then we added the Solr, which actually ran as a separate component (like traditional Solr, with master-slave), and now we've done this one, which is no longer the master-slave architecture.
C: ...is server testing. So I'm testing my code, and it needs to hit some data layer.
A: Environments, I guess, is the best way to think of them, at AWS. So we have dev, QA, and production infrastructures, and we actually currently copy our data down. Our Cassandra database isn't so large that we can't copy it, although it's painful to do; it takes, you know, a...
A: ...a while to get it all over and down, and we don't do it very often, but the developers hit those directly. And what was the second question?
C: For unit tests. Absolutely typically, you create a completely new table, and then you want to have multiple instances of that running, because developers are checking to see if the code works. So if everyone's hitting the exact same development instance... yep, it's going to be complex, yep. So what do you do in that case? Do you have a mock instance that they could use, or are they actually hitting a real one?
A: Kind of on a case-by-case basis. Like, my laptop right now is actually running Cassandra, and I'll spin up objects in there and do some work against it, and, you know, do whatever I need to do. I mean, it's obviously not against a lot of data, but I can do the development that way. And we have... we actually did, at one point.
A: Now we actually give developers their own account at AWS, and we set up AMIs that they can just deploy, so they can set up a small cluster of their own and do whatever they need to do up there. Basically, all of our team that does back-end development is using Amazon as much as we can; we've stopped trying to do it in-house in any way, shape, or form.
A: So, just some future use cases. We have all of our user-generated content at GameFly actually still in relational databases, and we're thinking of moving a lot of that out to the cloud, to our Cassandra implementation.
A: On our team, we want to replace a cloud service we use for push notifications (they're a little pricey), so we've been experimenting with building a socket architecture that can use Cassandra as a back-end message-persistence layer. That looks pretty promising. And then, lastly, we are going to eventually do the multi-data-center deployment.
A: Just a couple of other things. If you guys are interested in data modeling, Matt Dennis, who works at DataStax, gives a really good presentation on data modeling and availability.
A: He can probably fill you in on some more details than I can; he's been working there for over two years. And then, if you really want to get down into the nitty-gritty of data modeling, the way that column families came to be, so to speak: they're based on Bigtable, so there's a good article from Google.
A: I don't remember what year it is (2005, maybe); it's at that URL. And then the CTO of AWS wrote a really interesting article on replication and how Dynamo did it. This is the replication, the ring topology, that we're talking about, and these are the core foundations of Cassandra.
A: So those are some good articles. And then, lastly, there's a new meetup group starting (a Cassandra meetup that we mentioned earlier), starting the 18th... sorry, the 16th, in LA. CallFire is going to be hosting it this time, and I'll be there.
A: Well, you could do both. We don't do the latter; we do the former right now, and it's kind of manageable with our eight machines.
A: And then they query that at startup time to figure out what other nodes are in the system, and then they modify that text file, because the text file is part of Cassandra; Cassandra needs it to know how to do what it needs to do. So they use that SimpleDB thing as, like, an Active Directory type of directory-lookup system, to inform the rest of their architecture.
E: You in the back? I'm sorry. Is this presentation going to be available?
A: Yeah, I can figure out something, definitely, whether it's SlideShare or... I don't know if there's an email list, but we'll figure it out, yeah.
A: It's deployed on our iOS and Android applications, and we have a digital client that's installable on your desktop that runs on an Adobe framework (the AIR framework), and we're looking to bring it to the web soon, lastly. But yeah, it's basically just a messaging architecture. We actually have a second use for it, where we do some click-stream-type tracking.
A: ...small, for one particular product that we have, one specific small use case. It actually generates more data than our social system, though, which is kind of funny. But...
A: That's not by choice; we actually have a functioning mobile web product, when our product team will let us, as well.
A: That is the messaging thing that I mentioned two slides ago; that's what we're actually trying to do. This is a little off subject, but we did all of our mobile web stuff in Node (Node.js), and so we've been experimenting with doing Socket.IO. That didn't really work as well as we'd like it to at scale, so now we're actually looking at doing Node with just keep-alive connections, and that seems to be working pretty well inside the Node architecture.
A: So we have a client that does the connection, and then we just push out through that keep-alive session, each socket. And then, two other questions: can you talk about latency?
E: One of the big open problems when I was working on it (this was, again, a couple of years ago) was that some node links were much smaller than the others; first of all, the talking between the nodes was very heavy.
A: I don't know the details on that, but I can also say that our system (gamers tweeting, for lack of a better way of thinking about it, on mobile phones) is not anywhere near the scale that you're probably talking about. Our scale is much, much lower than whatever it was that you were working on a couple of years ago, I'm guessing.
A: We're not on the web yet, so, you know, it just doesn't have that type of throughput; we're not pushing the type of throughput that I think you came up against.
A: Well, there's key-level caching and there's row-level caching, and it depends on which one you're looking to use. Like, the row cache will cache whole rows, and you'll probably run out of memory pretty fast, so we tend to use only key-level caching. Key-level caching is fast, but you still sometimes have to go to disk if the row isn't currently in memory, as opposed to pinning it into memory with the row cache.
C: Yeah... well, let's get back to the latency question: what do you do for monitoring? When you monitor your system (as you know, AWS instances aren't very reliable at times), sure, when one goes down, you should see some graph that says: oh, one node is down; here's my latency, it goes up. Okay?
A: Well, we use the OpsCenter product that is free from DataStax, but what we actually use is CloudWatch, and we use... I can't get the name of it here; it's another product that we outsource to, which we use to hook into the systems and get email alerts when we reach thresholds.
A: For the system, we wanted full flexibility everywhere, and we felt that a dynamically typed language gave us a little bit more flexibility than a static language, something that you have to compile and then upload. You know, we can make a change in Tornado and, if you have it in debug mode, it'll automatically recognize that the change occurred and it will restart on its own; you don't have to, like, compile things.
A: ...running debug mode in production, I'm not saying to do that, but you know you have those options and that kind of flexibility, and we were concerned that, with our limited resources, we needed all the flexibility we could get.
A: Technologies? We've pretty much covered it all for what we're doing. I mean, we use Linux; we use Ubuntu Linux at AWS.
A: Just ELB, down to, you know, sort of a round-robin DNS, down to each web server. Each web server has Nginx, which then actually hands the work off, and we have four Tornados running on each machine.