From YouTube: Walmart: From bricks to clicks - Cassandra at Walmart
A: The goal today is just to give you an overview of Cassandra at Walmart: how we brought it in, some of the challenges we faced, and some mistakes we made. Then we're going to go into a particular use case that Chad and I spent the last few years working on in a little more detail; that's the dynamic data model. We'll have time for questions at the end, so if you could hold your questions until then, I think we'll have plenty of time to address them.
B: A little bit about Walmart; these are just some interesting facts. Over 260 million customers are in our stores and online each week. We span 28 different countries, employ roughly 2.2 million associates around the globe, and operate over 11,000 retail units, which includes Walmart stores and Sam's Clubs, plus 12 different ecommerce websites.
A: We have an engineering presence in Bentonville, Arkansas, which is actually where Chad and I work from, as well as here in the Bay Area, and we recently opened offices in Reston, Virginia and Bangalore, India. We've got over 35 production clusters, and that's over 500 nodes; most of those are physical hardware, so bare metal. We started looking at Cassandra back in the 0.7 days, around late 2010 or early 2011, and things were really kind of primitive back then: there was no CQL, no transactions, and the drivers were really bad. We had a lot of Hector code, so yeah, it was really bad. There were actually a lot of different groups in Walmart looking at Cassandra. We're a big organization, with a lot of different engineering groups both in Bentonville and out here at walmart.com, and there was just a lot happening; it really was kind of a grassroots thing. Developers were getting it, trying it out, seeing how it worked and where they might use it.
B: Yeah, so you can imagine, with the size and the number of engineers we have, we have a lot of use cases that fit Cassandra really well. In particular, right now we're making a pretty big investment in modernizing all of our systems: we're taking all these nightly batch processes and moving them into more real-time applications, and as we do that, we're seeing a huge uptake of Cassandra in general.
A: That's pretty big. A company our size, with the resources we have (although we do try to keep things low cost), has access to the full menu of databases and technologies out there, and we have a lot of them already implemented, so we're able to try a lot of different things. And so this one, with updates coming in from stores and clubs...
A: I mean, you saw the eleven-and-a-half-thousand retail units. Being able to ingest that data and make it available for reads to everybody in real time is a huge deal for us, and Cassandra is just the only thing we found that can actually do that. But it was hard, and I don't mean technically hard. I mean we had challenges, and we'll go into what some of those challenges were and the mistakes we made, but those are technical problems; we can solve those. Politically, just getting through...
A: Everybody has their bias, right? But most of these people were genuinely worried about us losing data or not keeping data safe, and you kind of have to recognize that they're just looking out for the company and for the company's information. So it took some extra work on our part to put together this plan: what are you going to do about backups and recovery, and how are you going to deal with this eventual consistency?
A: "We've never had to deal with that before." Well, if you actually think about it and go look, we've always had some form of eventual consistency; it's just a little different now. And non-ACID compliance: people's hair catches on fire when you say you're not ACID compliant. At least back then, they would just kind of assume, oh, it's a database, it's just like every other database. But Cassandra was different.
A: Another one is batch workloads. We actually had one instance where there was this team processing a bunch of data. They had all this data in Hadoop, and they thought, wouldn't it be cool if we could just ETL that data into Cassandra, use Cassandra to do some more processing, and then load all that data back out into Hadoop again? Well, that just didn't work at all.
A: That was a complete failure. And sort of along with that, we do get use cases sometimes where developers come up to us and say: okay, we're going to do this on Cassandra, and every night we're going to ETL some data from the data warehouse or somewhere, load it in, and make it available for reads. And that's like their primary use case.
A: One thing you tend to see there, a quick way to evaluate a model that somebody gives you, is whether it has a ton of secondary indexes. We've seen that a lot, where, like, half the columns have a secondary index, and you've got to explain things and work through that: well, what queries are you really going to run, and how do we make this model perform?
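(For illustration, a minimal sketch of that smell and the query-driven alternative; all table and column names here are hypothetical, not Walmart's actual schema:)

```
-- Smell: secondary indexes sprinkled across half the columns of one table.
CREATE TABLE orders (
  order_id uuid PRIMARY KEY,
  store_id int,
  status   text,
  updated  timestamp
);
CREATE INDEX ON orders (store_id);
CREATE INDEX ON orders (status);

-- Query-driven alternative: a table shaped for the actual read
-- ("all orders for a store"), with no secondary indexes needed.
CREATE TABLE orders_by_store (
  store_id int,
  order_id uuid,
  status   text,
  updated  timestamp,
  PRIMARY KEY (store_id, order_id)
);
```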
A: Another one is tombstones. This one actually kind of bit us in the project we were working on. You want to make sure you're using TTL to your advantage; that's a really handy feature if you can use it. But really, you've got to understand what your delete workload is going to be, and one thing we didn't realize is that when you update a column to null, that's actually a tombstone. We were able to go back...
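(A minimal sketch of both points, using a made-up table: setting a column to null is a delete in disguise, while TTL at least makes expiry, and its tombstones, predictable:)

```
CREATE TABLE session_data (
  id   text PRIMARY KEY,
  note text
);

-- Looks like an ordinary update, but writes a tombstone for note:
UPDATE session_data SET note = null WHERE id = 'abc';

-- Inserting an explicit null has the same effect:
INSERT INTO session_data (id, note) VALUES ('def', null);

-- Using TTL instead lets the data expire on a schedule you chose:
INSERT INTO session_data (id, note) VALUES ('ghi', 'temp') USING TTL 86400;
```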
A: So there's one particular data model, or particular use case, that we want to talk about: the dynamic data model. Back in 2011 there was a lot of stuff going on in the industry, and that really triggered us to start looking at Cassandra. We got put on this project; it was really kind of a transformational, greenfield-type project, which is a really cool place to be. That doesn't happen a lot, but this was sort of a perfect storm for being able to go out and look at everything.
A: There was really a lot happening in the industry, and Cassandra was part of that. So we looked at a lot of options; we looked at HBase and Hadoop and other things, and Cassandra ended up coming out on top. After we got through it all and got to play with everything, Cassandra was really kind of a no-brainer for us. So, the problem we had to solve: we had this handful of entities, different types of records that we were dealing with.
A: Each entity would have maybe dozens to even hundreds of attributes, and it was really sparse. By that I mean that record to record you would have a wildly different set of attributes, so very sparse attribution. New attributes were coming up regularly, and we needed full-text search, so DSE with Solr really worked out for us. And then we would also have frequent, intensive maintenance, where customers would be able to go in and initiate some maintenance process.
B: Then, non-functionally, we wanted one logical database. Historically we've sharded our databases by country, so imagine that for all those countries we're in, we'd have a duplicate database in that particular country. We wanted to bring all that together into one database. And of course, the data we were storing needed to be available one hundred percent of the time (no downtime was acceptable), and it had to be fast.
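(One logical database spanning countries is exactly what Cassandra's replication settings express; a sketch, with keyspace and data center names made up for illustration:)

```
-- One keyspace, replicated to every data center that needs local
-- reads and writes; no per-country database copies.
CREATE KEYSPACE global_stuff
WITH replication = {
  'class': 'NetworkTopologyStrategy',
  'dc_us': 3,
  'dc_uk': 3,
  'dc_in': 3
};
```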
A: ...you don't count. So this is in Thrift; the schema looked something like this. There's not a lot here: we've got this column family called stuff, and there are two columns defined, a type and a user key, and then down at the bottom you can see the default validation class is BytesType.
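(For readers who never saw the pre-CQL world, a definition like the one described might have looked roughly like this in the old cassandra-cli; this is a reconstruction from the description, not the actual slide:)

```
create column family stuff
  with comparator = UTF8Type
  and key_validation_class = UTF8Type
  and default_validation_class = BytesType
  and column_metadata = [
    {column_name: type,     validation_class: UTF8Type},
    {column_name: user_key, validation_class: UTF8Type}
  ];
```

Everything outside those two declared columns is just bytes, which is what made the model "dynamic": the application could write any column name it liked, and only the application knew what the bytes meant.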
A: Then we started getting a little bit fancy: okay, let's make it a little dimensional. So we would do it like this; in this case this is a language code, so we've got a different description for another language. We started adding all this stuff, and it got really complex really fast, and we had some significant serialization logic in our code. Everything was locked away in the code, and so it got pretty complex.
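(The "dimensional" trick amounted to packing dimensions such as the language code into the column name itself; something along these lines, in cassandra-cli syntax with entirely hypothetical values. Only the application's serialization code knew how to split the names back apart:)

```
set stuff['item42']['description:en_GB'] = 'A grey widget';
set stuff['item42']['description:en_US'] = 'A gray widget';
```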
B: Then, anybody can guess what happened next: along came CQL. Our first impression of CQL was: hey, you're breaking our dynamic model, you're killing us here, you're putting training wheels on our database that we've had successful implementations with. But we did see the community moving towards CQL, and it immediately had a lot of valuable features that we wanted to take advantage of. So we got on board, but...
A: ...how to get there? We were using Hector and everything, and as part of the migration we did move to the DataStax driver and CQL. When CQL came along, it really pointed out this deficiency we had: we were getting into this world of schemaless, which is a very dangerous place to be. The application defined the schema, so if you really wanted to know how things worked, you'd have to go into the code, and maintainability suffered because of this.
B: One particular instance where that really bit us: as you remember from the use case, we needed real-time, full-text search on all of the entities and all of the attributes. So we got into a huge heap of trouble. We put a ton of data in, and everything was working; all of our unit tests were successful. So we wanted to put a little stress test, a load test, on this data.
B: We quickly identified a huge spike in latency and long GC pauses, and the cluster was dying. We weren't able to restart our Solr nodes; we tried clearing all the commit logs and increasing the heap to 16 gigs, and they would not start; they ran out of heap. At that point we said, hey, let's get DataStax involved. We called a few people, they got engaged, and they uncovered that we had literally billions of fields that were trying to be indexed in Solr.
A: So this is sort of what we did in CQL. It looks an awful lot like what you might define in a relational model to do a dynamic database. The difference is, this will actually work. We actually have a history of trying this type of dynamic model in a relational database, and it just does not perform well.
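(The slide isn't reproduced in this transcript, but based on the description that follows — an entity key, an attribute name, a context column for dimensionality, and a text value — the table was presumably something like this sketch; the column names are assumptions:)

```
CREATE TABLE stuff (
  id        text,   -- entity key
  attribute text,   -- attribute name, defined on the fly
  context   text,   -- encodes dimensions such as language
  value     text,   -- all values serialized to text
  PRIMARY KEY (id, attribute, context)
);
```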
A: Here's what some data in that table might look like. You can see that the context column is what's giving us the dimensionality now. This isn't perfect, right? That context still has to be parsed and generated consistently, and in the same way, the value is just text, so you have to manage that consistently as well. But it's a lot better place to be, and you can actually express meaningful queries.
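(Against a table shaped like the sketch above, "meaningful queries" become plain key lookups; illustrative values only:)

```
-- All attributes of one entity:
SELECT attribute, context, value FROM stuff WHERE id = 'item42';

-- One attribute in one specific context:
SELECT value FROM stuff
WHERE id = 'item42' AND attribute = 'description' AND context = 'lang=en_US';
```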
A: And one thing we're not really showing here is that there's more metadata to this. That's really important when you're doing this type of work, where you need to define attributes on the fly: there's a whole schema for dealing with the attributes themselves. What's their data type? What are they? That's also really critical to get right.
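(That attribute metadata could itself live in a small table; a purely hypothetical sketch of what "a whole schema for the attributes themselves" might contain:)

```
CREATE TABLE attribute_defs (
  attribute  text PRIMARY KEY,
  data_type  text,     -- how to interpret the text value (int, date, ...)
  searchable boolean,  -- whether it should be indexed for full-text search
  added_on   timestamp
);
```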
B: So we started with multiple relational databases; remember, we were sharded by country. Some of the results that we saw (this isn't all of them, but a few high points): when we transitioned to Cassandra, we could consolidate all those different databases into one. They're not completely sunsetted yet, but they're on their way out.
B: We did see service response times drop from 800 milliseconds to 50 milliseconds, so a huge win for the consumers of all of our services; they really appreciated that. And of course, the zero downtime: we've done, I don't know, ten or twelve upgrades, even refreshed the hardware completely in production, with zero downtime, so that was a huge, huge success.
A: And no joke there: actually zero downtime. It's literally never gone down in production. We also kind of learned that Cassandra is not just for time series. Those golden use cases that Cassandra is a really good fit for are good, but there are also more use cases that Cassandra works really well for, and that set of use cases is actually growing as features in the product mature. Eventual consistency is a worthwhile trade-off.
A: It really is. I mean, if you're telling me I can get an always-on, active-active database where I'm always writable and always readable in multiple data centers, and it's fast, I'm going to live with eventual consistency. For most use cases it's just immaterial; it doesn't matter. You just have to know where it does matter and plan appropriately for it. I don't think we've run into a use case yet where we haven't been able to satisfy the requirement because of eventual consistency.
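("Planning appropriately" usually means tuning consistency per operation; in cqlsh, for instance, something like this, reusing the hypothetical stuff table from earlier:)

```
-- Default to fast, eventually consistent local reads...
CONSISTENCY LOCAL_ONE;
SELECT value FROM stuff WHERE id = 'item42';

-- ...and raise the bar only for queries where staleness actually matters.
CONSISTENCY LOCAL_QUORUM;
SELECT value FROM stuff WHERE id = 'item42';
```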
B: One common thing that you're going to hear, and I've already heard it in several sessions, is that data modeling is critical. I can't express that enough to the teams that come up to us and ask for help and assistance. Your data model is so key to your application and to your cluster; if it's not correct, it's more than likely going to fail. So pay special attention to that data model.
A: CQL's expressiveness really does make modeling easier. It is a mind shift if you're coming from a relational background, and you kind of have to carry people along. We're having conversations all the time like: whoa, you can't join these two tables, you just can't. And they're stuck at that point: well, what am I going to do? And then you show them: oh well, there are these collection types.
B: Then, for those use cases that require full-text search: just like the data model, be intentional with your Solr schema. Really put some thought behind it. Only index the fields you really need indexed, and leave the other ones out.
D: Hello, hey, thank you. I have a question. As you mentioned at the beginning, there were a lot of concerns about security and keeping data safe. So I'm wondering, in your stack, if there are certain attributes that are sensitive, how do you prevent them from being globally accessed? How do you solve those problems?
F: Thank you for the presentation; it's really good, because Walmart has the size and scale that we're dealing with in other places. My question is around looking at the data model of an existing application. One of the first questions that gets asked is: "I have to redo the data model? You've got to be kidding." And you go, "well, yeah." So what is the level of effort? I know it's a very general question, but what is the methodology you adopted?
A: Obviously, if somebody's coming to you with a data model, the first thing I say is: okay, thank you for the model; now tell me about your queries. Because that's not documented in the relational model, and I've got to know what your read and write patterns are in order to transition it into a Cassandra model. It can be tedious; I've seen some massive relational models with hundreds of tables and everything, and I don't have a silver bullet.
A: Going one-to-one is almost never the right way to do it, unless it's just a trivial model. But I'll tell you, the data modeling class from DataStax is very helpful. It's very methodical in how you go through and document everything: do your ERD, document your query patterns, and really lay it out. We've gotten a huge benefit from that, but it just takes the time to go in there. And to your point about "now I'm going to have to redo everything"...
B: But then, when you get into the use cases where I have lots of entities that relate to other entities, and all these different queries that I've got to serve, that's where it's a little more in depth, and you're meeting with the customer several times to get it right and working through some prototyping. So it takes some time, depending on how complex the data model is.
C: Yeah, my question is more on the zero downtime that you mentioned. Can you touch a little bit on that? Because I would assume some of the models have to be rewritten: you might have built some tables based on your query patterns, then figured out, hey, this is no longer the right model, and the application is probably expecting the data from that set of tables, and now you have to rip that into something else. Conditions like that would come up in almost every solution.
A: We would check the new table to see if the data was there, and if it wasn't, we'd fall back to the old table. That way we did a slow migration by doing these dual writes, and then you can think of it like a read repair: oh well, it's not in the new table, let me read it from the old table, then update and move that data into the new table. We've done that a few times, and it's actually worked out pretty well for us.
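(In CQL terms, the migration pattern described might look like the following; the table names and layouts are hypothetical, and the fallback logic lives in the application, not in CQL:)

```
-- During the migration window, the application dual-writes every update:
INSERT INTO stuff_v2 (id, attribute, context, value)
VALUES ('item42', 'description', 'lang=en_US', 'A gray widget');
INSERT INTO stuff_v1 (id, attr_ctx, value)
VALUES ('item42', 'description:en_US', 'A gray widget');

-- Reads try the new table first:
SELECT value FROM stuff_v2 WHERE id = 'item42';

-- If that comes back empty, the application reads the old table,
-- serves the result, and writes it into stuff_v2: an application-level
-- "read repair" that migrates rows lazily, with zero downtime.
SELECT value FROM stuff_v1 WHERE id = 'item42';
```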
A: For that particular use case I can't get too specific, but the problem came down to a handful of entities, different kinds of things, and each thing may have dozens to hundreds of attributes, and they're all very different; not sure how else to describe it, but yeah, this sparse attribution of information. And this all goes into one table. We basically built services around the database: just a RESTful interface into it, JSON in, JSON out.
A: So back to that CQL table: that's not the only way to do it; this is just one way. To the keen observer, this really won't work in Solr as-is, because of what a CQL row is: if you search, you're going to get a single CQL row back instead of all the rows, all the attributes, for the ID. So what we actually do there is...
A: ...but what that lets us do is keep it all in one CQL row, which means it's one Solr document, and we can actually put some filtering and transformation in between to make Solr indexing work a little better. For example, we can discard fields that we know we don't need to index; that's an optimization you can do there, and it actually works really well for Solr.
A: No, I wouldn't say we have that. I think the way we would solve that would be further up the stack, at the service layer. We're kind of big, at least Chad and I are, on microservices. So if it was coming from a relational database, it would probably be further up in the stack, and you'd have a service that would combine those.
H
Have
situation
we're
in
a
table?
We
have
to
kind
of
data,
one
is
like
attribute
data
which
we
don't
know.
How
is
going
to
come,
there
won't
be
one
product
might
have
ten
attribute
or
might
have
like
100
and
the
other
data
where
we
want
to
have
the
roll
I
mean
column
level,
access
that
you
know.
Somebody
should
be
able
to
see
only
this
much
data.
D
Okay,
so
one
more
question,
so
in
this
data
model
you
mention
the
using
contacts,
the
text
to
save
extra
dimensions,
and
you
said
it
as
a
a
clustering
key.
So
my
question
is:
if
you
have
multiple
dimension
information,
you
want
to
save
into
this
context,
and
you
want
to
search
on
certain
dimension.
Then,
in
that
case,
when
you
are
acquiring,
are
you
performing
a
key
look
up
or
are
you
performing
a
search
because
you
have
multiple
dimensions
on
a
map
together
in
this
column,
single
column,
right.
A: In this case it sort of works out for us, because those dimensional attributes, everything we're wrapping up in that context, are provided to the service. So that's part of the lookup, part of the key lookup that you're doing. There are kind of a couple of ways to read it: one, you're querying the data and you know the context, so...