youtube image
From YouTube: C* Summit EU 2013: From CQL to Time-Series Event Tracking and Aggregation Using Cassandra and Hadoop

Description

Speaker: Mick Semb Wever, Programmer at FINN.no
Slides: http://www.slideshare.net/planetcassandra/c-summit-eu-2013-from-cql-to-timeseries-event-tracking-and-aggregation-using-cassandra-and-hadoop
FINN.no's is a classifieds website and Norway's busiest website. This session will go through various product development where c* has shown to be the best choice, focusing on our primary c* use-case: our in-house tracking solution that's collects raw time-series data in c* and aggregates minute-by-minute it using hadoop into various new datasets from advert-centric statistics to user-centric behavioural analysis. I'll cover the final technical design chosen after a number of development iterations touching on technologies: scribe, thrift, kafka, hadoop, pig, mahout; the hurdles faced along the way, and the throughput and performance of today's systems.