Apache Cassandra / Cassandra Summit 2013

Add meeting Rate page Subscribe

Apache Cassandra / Cassandra Summit 2013

These are all the meetings we have in "Cassandra Summit 2013" (part of the organization "Apache Cassandra"). Click into individual meeting pages to watch the recording and search or read the transcript.

18 Jul 2013

Thomas J. Glazier, Senior Big Data Architect at Accenture
Nothing is more frustrating than knowing you may have the right solution to a problem, only to have the rug pulled out from under you and your project gets derailed for unknown reasons or for unclear understanding of your solution. While you may not see your job as being a business champion, the fact is that you can be a powerful force to stop your company from making the wrong choice.
  • 2 participants
  • 37 minutes
technologists
analytics
innovate
economists
enterprises
scientists
challenges
thinking
survey
accenture
youtube image

28 Jun 2013

Speaker: Jason Brown, Senior Software Engineer at Netflix and Apache Cassandra Committer
Slides: http://www.slideshare.net/planetcassandra/6-jason-brown
This talk focuses Cassandra's anti-entrpoy mechanisms. Jason will discuss the details of read repair, hinted handoff, node repair, and more as they aide in reolving data that has become inconsistent across nodes. In addition, he'll provide insight into how those techniques are used to ensure data consistency at Netflix.
  • 13 participants
  • 54 minutes
consistency
replication
protocols
connectivity
integrity
distributed
databases
inconsistency
serially
servers
youtube image

28 Jun 2013

Speaker: Jonathan Ellis, Apache Cassandra Chair and DataStax CTO
Slides: http://www.slideshare.net/jbellis/cassandra-summit-2013-keynote
Keynote for Cassandra Summit 2013
  • 1 participant
  • 37 minutes
cassandra
benchmarked
performance
mongodb
databases
querying
cache
robust
replication
increasing
youtube image

28 Jun 2013

Lightning Talk Presentations (Chronological):
John Wrobel, Director at SanDisk
Scaling Cassandra on SSDs

Yuki Morishita, Apache Cassandra Committer & Software Engineer at DataStax
How to Contribute to Cassandra

Nate McCall, Development Lead at Apigee
Adding Your Own Thrift Method in 5 Minutes

Yue Cathy Chang, Sr. Director of Business Development at Impetus
Impetus: Proven Practices in Leveraging Big Data's Competitive Advantage

Eyal Reuveni, Software Engineer at Eventbrite
Cassandra at Eventbrite

Joey Jablonski, Director of Product Management at Dell
Redefining Security for Big Data

Brian Hawkins, Senior Software Engineer at Proofpoint
KairosDB: Bob's Story

Joaquin Casares, Software Engineer at DataStax
Introduction to DataStax Enterprise

C. Scott Andreas, Engineer at Boundary

Jeremy Hanna, Senior Support Engineer at DataStax
Troubleshooting Cassandra
  • 12 participants
  • 52 minutes
cassandra
computing
gigabyte
capacity
throughput
servers
performance
datastax
flash
streaming
youtube image

27 Jun 2013

Speaker: Michael Kjellman, Software Engineer at Barracuda Networks
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-hindsight-is-2020-mysql-to-cassandra-by-michael-kjellman
A brief intro to how Barracuda Networks uses Cassandra and the ways in which they are replacing their MySQL infrastructure, with Cassandra. This presentation will include the lessons they've learned along the way during this migration.
  • 8 participants
  • 54 minutes
cassandra
takes
concern
oracle
backing
soon
transition
conversation
barracuda
blog
youtube image

26 Jun 2013

Speaker: Darshan Rawal, VP of Engineering at Openwave Messaging
Slides: http://www.slideshare.net/planetcassandra/1-darshan
Darshan Rawal leads the development of hybrid cloud based messaging products for global Tier 1 Telcos. Darshan has been working in Silicon valley since 2000, building nimble, cost effective products/services, handling millions of users and billions of transactions per day. Previous to Openwave Messaging, Darshan held engineering positions @ SS8 networks, Yahoo, DE Shaw, yp.com and has a M.S in Software Engineering from Carnegie Mellon University.
  • 2 participants
  • 52 minutes
cassandra
telcos
datacenters
capacity
subscribers
deployments
management
services
techcrunch
transition
youtube image

26 Jun 2013

Speaker: Manish Sood, CEO & Founder at Reltio
Slides: http://www.slideshare.net/planetcassandra/3-manish-sood
The Life Sciences industry is undergoing significant changes in how companies do business due to recent legislative changes. The evolving landscape is forcing the Pharmaceutical companies to change how their entire Sales model and move from Prescriber based sales model to an Account based sales model, which has a downstream impact on Sales team organization, Field sales alignment, Incentive compensation and Marketing. This changing reality also requires that the Pharmaceutical companies understand and drive the changes to business strategy on insights driven by data about Prescription Sales, Medication Adherence, Claims, etc. to name a few categories of data sources. The required insights are derived from the convergence of data from multiple sources that include numerous internal applications, 3rd party data sources and social media. In this session, learn how Reltio is helping various Pharmaceutical companies cope with the evolving business landscape with a data driven strategy by leveraging the Reltio data science engine that runs on Cassandra.
  • 2 participants
  • 27 minutes
cassandra
services
database
leveraging
pharma
mismanagement
insights
enterprise
crowdsourcing
hadoop
youtube image

26 Jun 2013

Speaker: Manish Sood, CEO & Founder at Reltio
Slides: http://www.slideshare.net/planetcassandra/3-manish-sood
The Life Sciences industry is undergoing significant changes in how companies do business due to recent legislative changes. The evolving landscape is forcing the Pharmaceutical companies to change how their entire Sales model and move from Prescriber based sales model to an Account based sales model, which has a downstream impact on Sales team organization, Field sales alignment, Incentive compensation and Marketing. This changing reality also requires that the Pharmaceutical companies understand and drive the changes to business strategy on insights driven by data about Prescription Sales, Medication Adherence, Claims, etc. to name a few categories of data sources. The required insights are derived from the convergence of data from multiple sources that include numerous internal applications, 3rd party data sources and social media. In this session, learn how Reltio is helping various Pharmaceutical companies cope with the evolving business landscape with a data driven strategy by leveraging the Reltio data science engine that runs on Cassandra.
  • 2 participants
  • 27 minutes
cassandra
realtio
services
leveraging
database
crm
enterprise
insights
pharma
market
youtube image

26 Jun 2013

Speaker: Stefan Piesche, Chief Technology Officer at Constant Contact
Slides: http://www.slideshare.net/planetcassandra/data-stax-presentation-stefan-1-0
During this presentation Stefan Piesche, Chief Technology Officer at Constant Contact, will discuss how he and his team were able to grow and scale Constant Contact's technology infrastructure by aligning technology with horizontal business growth to improve performance and reduce costs. He will share some of the lessons learned, best practices, and recommendations for other technology executives looking to transform their technology infrastructure to business.
  • 3 participants
  • 27 minutes
contact
continuous
persistence
customers
manage
cto
servers
dbt
data
cassandra
youtube image

26 Jun 2013

Speaker: Eric Lubow, CTO and Co-founder at SimpleReach
Slides: http://www.slideshare.net/planetcassandra/2-eric-lubow
Having many different technologies within an organization can be problematic for developers and operations alike. Structuring those systems into discrete modules not only abstracts away a lot of the complexity of a heterogeneous architecture, it also allows the evolution of systems using common access and storage patterns. This session will discuss how to think about, architect, and maintain a service architecture for a big data system.
  • 1 participant
  • 29 minutes
capacity
data
big
server
clients
information
important
simple
methodology
dashboard
youtube image

26 Jun 2013

Speaker: Mark Davis, Principal Engineer at Dell
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-big-data-analytics-realize-the-investment-from-your-big-data-clusters-by-mark-davis
The term "big data" seems to be everywhere these days. With the ever growing number of attendees at big data and Hadoop events, it's clear big data is here to stay. But what does that mean for the analytics market, and how does big data fit into the picture? This session, featuring Mark Davis, Sr. Product Architect at Dell, will explore what big data means in a practical sense to the IT department. It will also explore the many ways that big data affects an organization's picture of performance. Plus, see how big data analytics, using technologies like Cassandra and Hadoop, will converge with traditional business intelligence to create a complete picture of the enterprise's information assets, thereby giving the business a complete and insightful view of its operational efficiency.
  • 2 participants
  • 30 minutes
cassandra
dell
servers
netflix
users
data
analyst
escalating
talk
big
youtube image

26 Jun 2013

Speaker: Jay Patel, Technical Architect at eBay
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-buy-it-now-cassandra-at-ebay-by-jay-patel
This session will cover various use cases for Cassandra at eBay. It'll start with overview of eBay's heterogeneous data platform comprised of SQL & NoSQL databases, and where Cassandra fits into that. For each use case, Jay will go into detail of system design, data model & multi-datacenter deployment. To conclude, Jay will summarize the best practices that guide Cassandra utilization at eBay.
  • 2 participants
  • 29 minutes
cassandra
ebay
database
oracle
dbas
servers
deploying
workload
inventory
terabytes
youtube image

26 Jun 2013

Speaker: Colin Charles, Chief Evangelist at Monty Program Ab
Slides: http://www.slideshare.net/planetcassandra/5-colin-charles
The Cassandra Storage Engine allows access to data in a Cassandra cluster from MariaDB. Learn what the Cassandra Storage Engine is and how to make use of it, how we implemented it using dynamic columns in MariaDB. Also, we'll look at CQL, data and command mapping, use cases and benchmarks.
  • 1 participant
  • 28 minutes
cassandra
maria
mysql
sql
maury
query
intermediary
enterprise
mac
deployments
youtube image

26 Jun 2013

Speaker: Aaron Morton, Apache Cassandra Committer
Slides: http://www.slideshare.net/aaronmorton/apachecon-nafeb2013
  • 1 participant
  • 22 minutes
cassandra
api
server
thread
protocols
database
interfaces
configuration
abstractions
execution
youtube image

26 Jun 2013

Speaker: Boris Wolf, Lead Engineer CMB Project at the Comcast Silicon Valley Innovation Center
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-cmb-an-open-message-bus-for-the-cloud-by-boris-wolf
The Comcast Silicon Valley Innovation Center has developed a general purpose message bus for the cloud. The service is API compatible with Amazon's SQS/SNS and is built on Cassandra and Redis with the goal of linear horizontal scalability. This presentation offers and in-depth look at the architecture of the system and how they employ Cassandra as a central component to meet key requirements. Latest feature enhancements and performance data will also be covered.
  • 2 participants
  • 54 minutes
cmb
comcast
services
cmv
protocols
cassandra
proxy
configuration
message
subscribers
youtube image

26 Jun 2013

Speaker: Rick Branson, Infrastructure Engineer at Instagram
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-cassandra-at-instagram-23756207
Cassandra is a critical part of Instagram's large scale site infrastructure that supports more than 100 million active users. This talk is a practical deep dive into data models, systems architecture, and challenges encountered during the implementation process.
  • 1 participant
  • 25 minutes
redis
difficulties
fail
memory
users
cassandra
concerns
instagram
io
big
youtube image

26 Jun 2013

Speakers: Feng Qu, Principal DBA and Anurag Jambhekar, Senior Manager of Database Infrastructure
Slides: http://www.slideshare.net/planetcassandra/5-feng-qu
We have seen rapid adoption of C* at eBay in past two years. We have made tremendous efforts to integrate C* into existing database platforms, including Oracle, MySQL, Postgres, MongoDB, XMP etc.. We also scale C* to meet business requirement and encountered technical challenges you only see at eBay scale, 100TB data on hundreds of nodes. We will share our experience of deployment automation, managing, monitoring, reporting for both Apache Cassandra and DataStax enterprise.
  • 2 participants
  • 48 minutes
oracle
enterprise
database
cassandra
servers
ebay
infrastructure
eb
workloads
vm
youtube image

26 Jun 2013

Speakers: Renat Khasanshyn, Founder and CEO at Altoros and Cornelia Davis, Senior Technologist at Pivotal
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-cassandra-on-cloud-foundry-by-renat-khasanshyn-and-cornelia-davis
Coupling Cassandra with a Platform as a Service may significantly simplify the process of deploying Cassandra and applications that utilize it, reduce the cost of managing Cassandra within the organization, and to allow infrastructure service providers a simple path to offering database as a service to their customers. Attendees will learn why and when use Cassandra atop of Cloud Foundry, the history of Cassandra service within Cloud Foundry, the State of Cassandra integration with Cloud Foundry, how to create and manage Cassandra nodes on Cloud Foundry and what to expect in the next 6 months.
  • 3 participants
  • 27 minutes
pivotal
cassandra
enterprise
services
cloud
vmware
foundry
deployable
developer
platform
youtube image

26 Jun 2013

Speaker: Matt Kennedy, Big Data Solutions Architect at Fusion-IO
Slides: http://www.slideshare.net/planetcassandra/1-matt-kennedy-23155838
Flash Memory technology, deployed as server-side PCIe or solid state disks (SSDs), is emerging as a critical tool for performance and efficiency in data centers of all scales. This presentation will discuss how the use of Flash impacts Cassandra deployments in terms of configuration, DRAM requirements and performance expectations. Ideas on leveraging C*'s cutting-edge data-center awareness to blend flash and disk storage nodes for cost and workload efficiency will also be shared. Flash media itself will be examined from a physical perspective to understand endurance issues. Data on write amplification under bulk-load and operational workload conditions will be presented to explain the impact to Flash of C*'s Log Structured Merge Tree architecture and the associated compactions. Finally, we will examine strategies to make Cassandra more Flash-aware using both conventional techniques as well as emerging Non-volatile memory (NVM) programming capabilities. Lessons learned from real-world customer deployments will be shared to complete this presentation.
  • 2 participants
  • 53 minutes
flash
capacity
cassandras
switching
deploying
disks
backup
databases
io
vmware
youtube image

26 Jun 2013

Speaker: Sameer Farooqui, Freelance Big Data Consultant and Trainer
Slides: http://www.slideshare.net/planetcassandra/cassandra-vsh-base
Have you wondered what actually happens when you submit a write to Cassandra? This vendor agnostic technical talk will cover the internals of the read and write paths of Cassandra and compare it to other NoSQL stores, especially HBase so you can pick the right database for your project. Some of the topics mentioned are consistency levels, memtables/memstores, SSTables/HFiles, bloom filters, block indexes, data distribution partitioners and optimal use cases.
  • 1 participant
  • 50 minutes
nosql
database
cassandra
sql
querying
mysql
hbase
hadoop
mongodb
discussions
youtube image

26 Jun 2013

Speaker: David Leimbrock, CTO at Riptide IO
Slides: http://www.slideshare.net/planetcassandra/data-driven-retail-how-one-megaretailer-drove-down-energy-costs-across-7000-stores-by-david
How do you keep up with the velocity and variety of data streaming in from all the smart devices that run the physical environments of 7,000+ stores? What about getting analytics that tell you exactly where energy waste is happening in real-time? In this talk, Riptide IO, describes their blueprint for collecting, organizing and deriving real-time operational intelligence from smart devices such as lighting, HVAC, sensors and more. Learn how this retailer gained a dramatic boost to their sustainability program, and solved some of the major bottlenecks in managing countless devices across thousands of stores.
  • 1 participant
  • 25 minutes
technology
operationally
industry
companies
enterprises
efficiency
utilize
considerations
utility
consumption
youtube image

26 Jun 2013

Speakers: Rich Hammel, Director of Advanced Manufacturing at Brocade and Vivek Ganesan, Principal Architect at Impetus Technologies
Almost 10 years ago in a hotel room in Asia his first parser was born. That parser and its offspring have supported the development of world-class networking products at Brocade. This discussion will include how big data will change manufacturing, the essential ingredients for success in greenfield big data projects, and what it's like to be obsessed with quality.
  • 3 participants
  • 32 minutes
brocade
advanced
conference
data
ibm
consulting
cassandra
agilent
enterprise
presents
youtube image

26 Jun 2013

Speakers: Matt Pfeil, Vice President of Customer Solutions at DataStax; Rick Branson, Infrastructure Engineer at Instagram; Adrian Cockcroft, Cloud Architect at Netflix
In today's world, data is growing faster than ever. For online apps, two things matter more than anything else for the database: uptime and performance. The intersection between data growth and online requirements results in interesting technology choices. This panel will discuss the implications - and approaches - to maximize revenue via technology decisions.
  • 5 participants
  • 27 minutes
consistency
consistently
commitment
reliable
substantial
inconsistent
understood
thinking
eventual
strong
youtube image

26 Jun 2013

Speaker: Matthias Broecheler, CTO at Aurelius
Slides: http://www.slideshare.net/planetcassandra/distributed-graph-computing-with-titan-and-faunus
This presentation introduces Titan, Faunus, and scalable graph computing in general. We present a case study of how Pearson builds an education social network on top of Titan, Faunus, and Cassandra to support learning in the 21st century. Titan is an open source distributed graph database build on top of Cassandra that can power real-time applications with thousands of concurrent users over graphs with billions of edges. Faunus is an open source global graph processing engine build on top of Hadoop and compatible with Cassandra that can analyze graphs, compute graph statistics, and execute global traversals. Titan and Faunus are components of the Aurelius Graph Cluster which enables scalable graph computation and powers applications in social networking, recommendation engines, advertisement optimization, knowledge representation, health care, education, and security.
  • 8 participants
  • 43 minutes
titan
oracle
cassandra
discussion
soon
summit
conference
intelligence
support
backends
youtube image

26 Jun 2013

Speaker: Christos Kalantzis, Engineering Manager of Cloud Persistence Engineering at Netflix
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-eventual-consistency-hopeful-consistency-by-christos-kalantzis
This session will address Cassandra's tunable consistency model and cover how developers and companies should adopt a more Optimistic Software Design model.
  • 9 participants
  • 28 minutes
replication
consistency
cassandra
persistence
manage
integrity
data
parallel
eventually
transactions
youtube image

26 Jun 2013

Speaker: Jesse Young, Director of Research at Zonar Systems
Slides: http://www.slideshare.net/planetcassandra/2-jesse-young
Come learn about how Zonar Systems uses Cassandra for logistics use cases such as tracking fleets of school buses and other fleet management services. Zonar uses Cassandra because because of its ability to scale horizontally, its continuous availability and operational ease. This talk will cover details about the implementation and our 3 year journey that got us here, including the challenges along the way.
  • 2 participants
  • 20 minutes
logistics
vehicles
cassandra
fleet
gps
telematics
dbms
sonar
consulting
applications
youtube image

26 Jun 2013

Speaker: Andy Cobley, Lecturer at University of Dundee
Slides: http://www.slideshare.net/planetcassandra/5-andy-cobley-raspberry-pi
The raspberry Pi is a credit-card sized $25 ARM based linux box designed to teach children the basics of programming. The machine comes with a 700MHz ARM and 512Mb of memory and boots off a SD card, not much power for running the likes of a Cassandra cluster. This presentation will discuss the problems of getting Cassandra up and running on the Pi and will answer the all important question: Why on Earth would you want to do this!?
  • 3 participants
  • 24 minutes
cassandra
raspberry
raspbian
computing
pc
geeky
laptop
oracle
java
question
youtube image

26 Jun 2013

Speaker: Aaron Stannard, Founder and CEO at Marked Up Analytics
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-high-throughput-analytics-with-cassandra-by-aaron-stannard
Building analytics systems is an increasingly common requirement for BI teams inside companies both big and small, and a feat made even more challenging when analytic results have to be produced in real-time. In this presentation the team from MarkedUp Analytics will show you techniques for leveraging Cassandra, Hadoop, and Hive to build a manageable and scalable analytics system capable of handling a wide range of business cases and needs.
  • 1 participant
  • 28 minutes
app
apps
application
developers
clients
workflow
sophisticated
monitoring
opscenter
microsoft
youtube image

26 Jun 2013

Speaker: Axel Liljencrantz, Backend Developer at Spotify
Slides: http://www.slideshare.net/planetcassandra/8-axel-liljencrantz-23204252
At Spotify, we see failure as an opportunity to learn. During the two years we've used Cassandra in our production environment, we have learned a lot. This session touches on some of the exciting design anti-patterns, performance killers and other opportunities to lose a finger that are at your disposal with Cassandra.
  • 6 participants
  • 59 minutes
cassandra
spotify
server
services
postgres
databases
manage
performance
streaming
cluster
youtube image

26 Jun 2013

Speaker: Aaron Morton, Apache Cassandra Committer
Slides: http://www.slideshare.net/aaronmorton/cassandra-sf-2013-in-case-of-emergency-break-glass
The design of Apache Cassandra allows applications to provide constant uptime. Peer-to-Peer technology ensures there are no single points of failure, and the Consistency guarantees allow applications to function correctly while some nodes are down. There is also a wealth of information provided by the JMX API and the system log. All of this means that when things go wrong you have the time, information and platform to resolve them without downtime. This presentation will cover some of the common, and not so common, performance issues, failures and management tasks observed in running clusters. I'll discuss how to gather information and how to act on it. Operators, Developers and Managers will all benefit from this exposition of Cassandra in the wild.
  • 4 participants
  • 50 minutes
cassandra
servers
configuration
dashboard
replication
servicing
throughput
datarow
upgrade
batches
youtube image

26 Jun 2013

Speakers: Michael Figuiere and Patrick McFadin, Principal Solutions Architect at DataStax
Slides: http://www.slideshare.net/planetcassandra/cassandra-summit-data-stax-java-driver
Cassandra 1.2 finalizes CQL3 and introduces a new binary protocol for client/server communication. These two components are the foundation of the new line of drivers developed by DataStax. Based on years of experience with Cassandra, these new drivers for Java, .Net and Python come with an asynchronous and lightweight architecture, a clean and simple API, a standardized way to discover nodes and to manage load balancing and fail over. This presentation will give an in depth look at these new drivers which will make your Cassandra-based applications even more robust, efficient and simple to write.
  • 4 participants
  • 57 minutes
cassandra
cql
cass
implementation
advanced
insight
database
schemas
thread
casting
youtube image

26 Jun 2013

Speaker: Matt Stump, Senior Backend Engineer at KISSMetrics
Slides: http://www.slideshare.net/planetcassandra/1-matt-stump
The ability to manipulate and query very large datasets in realtime is a pressing need for most large data enterprises. Recently, we've seen an explosion of tools such as Impala or Druid, but all of these tools suffer from single points of failure or can't deliver the sub 1 second query times necessary for realtime results. Together we'll explore how to break down these seemingly intractable problems. We'll learn how to build horizontally scalable query engines with Cassandra, capable of sub-second query times across multi-billion row datasets.
  • 4 participants
  • 52 minutes
analytics
google
server
users
backend
optimizing
query
cassandra
throughput
companies
youtube image

26 Jun 2013

Speakers: Ameet Chaubal, Technologist and Fausto Inestroza, Architect at Accenture
Slides: http://www.slideshare.net/planetcassandra/ameet-chaubal/
The presentation aims to highlight the challenges posed by large scale and near real-time data processing problems. In past, such problems were solved using conventional technologies, primarily a database and JMS queue. However these solutions had their limits and presented serious problems in terms of scale and redundancy. The new breed of products - a la Cassandra & Kafka, being innately distributed in their design, aim to tackle such challenges in a very elegant manner. The presentation will showcase some of the use cases of this genre from the industry and describe the solutions which have been increasing in their sophistication.
  • 2 participants
  • 29 minutes
workflows
problem
clients
data
transaction
analyze
throughput
scalability
planning
queries
youtube image

26 Jun 2013

Speaker: Sam Heywood, Sr. Director of Products at Gazzang
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-lock-it-up-securing-sensitive-data-by-sam-heywood-23124858
As adoption of NoSQL solutions like Apache Cassandra grows, so too does the likelihood that organizations will use it to capture and analyze sensitive data. Enterprises that don't take every precaution to protect this data leave themselves exposed to risk of a data breach, and depending on the regulatory nature of the data, fines for noncompliance. This session will discuss how transparent data encryption and advanced key management protect data at-rest and in-flight, so regardless of where the data resides — either on premises or in the cloud -- it remains garbled and unreadable to all people, processes and applications that don't require immediate access. The session will also cover DevOps automation tools that ensure rapid distributed deployment of big data security across thousands of nodes.
  • 1 participant
  • 22 minutes
monitoring
administration
database
concerns
bezang
discussion
robust
disclose
stuff
cryptographers
youtube image

26 Jun 2013

Speaker: Adrian Cockcroft, Cloud Architect at Netflix
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-netflix-open-source-tools-and-benchmarks-for-cassandra
Netflix has updated and added new tools and benchmarks for Cassandra in the last year. In this talk we will cover the latest additions and recipes for the Astyanax Java client, updates to Priam to support Cassandra 1.2 Vnodes, plus newly released and upcoming tools that are all part of the NetflixOSS platform. Following on from the Cassandra on SSD on AWS benchmark that was run live during the 2012 Summit, we've been benchmarking a large write intensive multi-region cluster to see how far we can push it. Cassandra is the data storage and global replication foundation for the Cloud Native architecture that runs Netflix streaming for 36 Million users. Netflix is also offering a Cloud Prize for open source contributions to NetflixOSS, and there are ten categories including Best Datastore Integration and Best Contribution to Performance Improvements, with $10K cash and $5K of AWS credits for each winner. We'd like to pay you to use our free software!
  • 2 participants
  • 60 minutes
netflix
cassandra
cloud
streaming
microservice
launch
speakers
presentations
native
abstracted
youtube image

26 Jun 2013

Speaker: Dave Gardner, Senior Engineer at Hailo
Slides: http://www.slideshare.net/planetcassandra/no-whistling-required-cabs-cassandra-and-hailo-by-dave-gardner
Hailo has leveraged Cassandra to build one of the most successful startups in European history. This presentations looks at how Hailo grew from a simple MySQL-backed infrastructure to a resilient Cassandra-backed system running in three data centres globally. Topics covered include: the process of migration, experience running multi-DC on AWS, common data modeling patterns and security implications for achieving PCI compliance.
  • 5 participants
  • 45 minutes
cassandra
sandra
considerations
advanced
halo
aspirations
summit
success
migrated
meetup
youtube image

26 Jun 2013

Speaker: Charles Lamanna, MetricsHub Founder & Developer Lead and Ricardo Villalobos, Senior Cloud Architect at Microsoft
Slides: http://www.slideshare.net/planetcassandra/optimizing-the-public-cloud-for-cost-and-scalability-with-cassandra-the-metricshub-story-by-charles-lamanna
MetricsHub is a monitoring and scalability service for public clouds, allowing companies to continuously gather data from their systems and auto-scale their deployments to optimize service costs. Taking advantage of Cassandra rapid ingestion rates, reliable replication model, and easiness of deployment, Metrics Hub can handle billions of datapoints per day. During this session, you will learn about the architecture supporting this service, which combines the power of the PaaS + IaaS on the Windows Azure platform.
  • 4 participants
  • 37 minutes
monitoring
analytics
dashboard
servers
manage
database
capacity
balancer
meta
deployments
youtube image

26 Jun 2013

Speaker: Albert P Tobey, Tech Lead, Compute and Data Services at Ooyala
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-practice-makes-perfect-extreme-cassandra-optimization-by-albert-tobey
Ooyala has been using Apache Cassandra since version 0.4. Our data ingest volume has exploded since 0.4 and Cassandra has scaled along with us. Al will cover many topics from an operational perspective on how to manage, tune, and scale Cassandra in a production environment.
  • 2 participants
  • 58 minutes
cassandra
manage
devops
yella
performance
dashboard
computing
technical
databases
recommending
youtube image

26 Jun 2013

Speaker: Terrell Deppe, CTO at HealthCare Anytime
Slides: http://www.slideshare.net/planetcassandra/7-terrell-deppe
HealthCare Anytime provides Web-based portal solutions that assist healthcare organizations in achieving meaningful use, optimized operations, and increased patient and staff satisfaction. During this speaking session, HealthCare Anytime CTO Terrell Deppe will discuss the challenges his company faced when processing an "avalanche" of patient records and how he utilized DataStax's Cassandra-based big data platform to improve their product's performance while reducing costs.
  • 3 participants
  • 26 minutes
provider
clinicians
healthcare
patient
regulatory
meaningful
data
broadcast
taking
portal
youtube image

26 Jun 2013

Speaker: Tim Moreton, CTO at Acunu Ltd
Slides: http://www.slideshare.net/planetcassandra/tim-moreton
Data modeling for Cassandra presents a new set of challenges, especially for developers with a background in relational data modeling. And there are added complexities in modeling for analytic applications which need to enable statistical functions over the data, but a good data model, exploiting Cassandra's strengths, can make all the difference to a successful project. This tutorial will examine a number of real-world customer data modeling examples and draw out some hints and tips that will benefit hnot just the Cassandra newbie, but also the more experienced data modeler.
  • 1 participant
  • 32 minutes
cassandra
modelling
cql3
data
customers
implementation
conference
currently
cto
knowledge
youtube image

26 Jun 2013

Speakers: DeWayne Filppi, Technical Account Manager at GigaSpaces
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-realtime-big-data-with-storm-cassandra-and-inmemory-computing-by-dewayne-filppi
This session will describe how to resolve the processing limitations by placing the streaming and data store interfaces in-memory as well, through an in-memory computing platform, and also how to resolve the complexity challenge by implementing a DevOps approach that abstracts all the underlying infrastructure and provides single-click management of all the application tiers and services, on any environment (private/public cloud, bare metal...). And the best news is that all this optimization can be implemented seamlessly, with no code change to your apps.
  • 1 participant
  • 29 minutes
cassandra
computing
hadoop
data
scalable
memory
terabytes
cloudify
real
time
youtube image

26 Jun 2013

Speaker: Evan Chan, Ooyala
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-realtime-analytics-using-cassandra-spark-and-shark-by-evan-chan
This session covers our experience with using the Spark and Shark frameworks for running real-time queries on top of Cassandra data.We will start by surveying the current Cassandra analytics landscape, including Hadoop and HIVE, and touch on the use of custom input formats to extract data from Cassandra. We will then dive into Spark and Shark, two memory-based cluster computing frameworks, and how they enable often dramatic improvements in query speed and productivity, over the standard solutions today.
  • 1 participant
  • 25 minutes
cassandra
cloudera
streaming
managed
query
analytics
problems
hadoop
luella
mission
youtube image

26 Jun 2013

Speaker: Les Hazlewood, Co-Founder & CTO of Stormpath and Apache Shiro PMC Chair
Slides: http://www.slideshare.net/planetcassandra/infinite-sessionclusteringwithapacheshiro9x16-23252714
In this session Les Hazlewood, the Apache Shiro PMC Chair, will cover Shiro's enterprise session management capabilities, how it can be used across any application (not just web or JEE applications) and how to use Cassandra as Shiro's session store, enabling a distributed session cluster supporting hundreds of thousands or even millions of concurrent sessions. As a working example, Les will show how to set up a session cluster in under 10 minutes using Cassandra. If you need to scale user session load, you won't want to miss this!
  • 1 participant
  • 28 minutes
shiro
apache
security
server
proxy
sas
jboss
enterprise
project
share
youtube image

26 Jun 2013

Speaker: Jason Rutherglen, Senior Big Data Engineer at DataStax
Slides: http://www.slideshare.net/planetcassandra/dse-solr-realtimeanalytics
The presentation demonstrates how Solr may be used to create real-time analytics applications. In addition, Datastax Enterprise 3.0 will be showcased, which offers Solr version 4.0 with a number of improvements over the previous DSE release. A realtime financial application will run for the audience, and then a detailed look at how the application was built. An overview of Datastax Enterprise Solr features will be given, and how the many enhancements in DSE make it unique in the marketplace.
  • 5 participants
  • 31 minutes
solar
data
analytics
datastax
server
capacity
cassandra
enterprise
applications
hadoop
youtube image

26 Jun 2013

Speaker: Eddie Satterly, Chief Big Data Evangelist at Splunk
Slides: http://www.slideshare.net/planetcassandra/cassandra-summit2013-eddie
The session will demonstrate Splunk integration with Cassandra today and discuss more concepts for an integrations to come in the future.
  • 2 participants
  • 14 minutes
splunk
datastore
cassandra
server
slug
dashboard
dbm
oltp
hadoop
enterprise
youtube image

26 Jun 2013

Speaker: Chris "Mac" McEniry and Igor von Nyssen, Systems Architect at Sony Network Entertainment
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-stepping-through-the-lifecycle-of-a-service-offering-with-cassandra-by-igor
It's a fine line to walk for incorporating new technologies in an organization with 15+ years of legacy software. In this presentation, we'll look at the lifecycle and adoption of Cassandra from a skunkworks project to a full fledged service in a legacy organization.
  • 4 participants
  • 29 minutes
cassandra
enterprise
conversations
mike
server
management
having
transactions
advance
takes
youtube image

26 Jun 2013

Speaker: Ken Krugler, Big Data Consulting at Scale Unlimited
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-suicide-risk-prediction-using-social-media-and-cassandra-by-ken-krugler
In this presentation, Ken will describe a portion of an early-phase project that uses social media data (tweets, Facebook posts, etc.) from service personnel to predict suicide rates. There's a lot of motivation to provide better data for military psychologies, since more military wind up taking their own lives than are killed in the line of duty. By analyzing social media data that is voluntarily provided by personnel, plus a predictive analytics system, we can provide assessments that help mental health workers focus their time and energy on the most at-risk individuals. This project uses Cassandra as the scalable storage system for this social media data, which is then analyzed in a distributed environment using Hadoop. The project also uses the Solr search support from DataStax Enterprise to provide ways for users to dig into the underlying data, which is critical when understanding the assigned risk levels.
  • 2 participants
  • 45 minutes
having
thinking
hadoop
big
boat
talked
oracle
audience
presentations
plan
youtube image

26 Jun 2013

Speaker: Renato Javier and Lewis John McGibbney
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-taking-bytes-from-cassandra-clients-by-renato
Since early 2012 Gora has been proudly participating as an honorary Incubator post-grad within the ASF. This presentation provides Renato and Lewis' perspective on a phenomenon they refer to as the "big datastore client wars", which is a real life challenge they've discovered whilst attempting to integrate several big data backends (Accumulo, Cassandra, HBase, MySQL, HSQLDB, Amazon's DynamoDB, MongoDB) under one common persistence layer and, in the process, obtain optimal results over Gora operations. They emphasize their approach to addressing this problem by discussing a pluggable Cassandra client infrastructure (Hector-client, Datastax java driver, intravert-ug, etc) adapted specifically for the gora-cassandra module.
  • 5 participants
  • 49 minutes
cassandra
conversation
enthusiast
mysql
oracle
apache
inquisitive
consultancy
luis
going
youtube image

26 Jun 2013

Speaker: Sylvain Lebresne, Apache Cassandra Committer and Engineer at DataStax
Slides: http://www.slideshare.net/planetcassandra/cassandra13-state-of-cql
Since its inception, the Cassandra Query Language (CQL) has grown and matured, resulting in the 3rd version of the language (CQL3) being finalized in Cassandra 1.2. Compared to the legacy Thrift API, CQL3 aims at providing an API that is higher level and more user friendly but still fully assumes the distributed nature of Cassandra and it's storage engine. This presentation will present CQL3, describing the reasoning and goals behind the language as well as the language itself. CQL's relationship with Thrift will be touched on, along with the CQL binary protocol that has been introduced in Cassandra 1.2. This presentation will wrap up by discussing the future of CQL.
  • 2 participants
  • 57 minutes
cql3
sql
secure
discussed
advanced
cassandra
declaring
protocol
talkback
row
youtube image

26 Jun 2013

Speaker: Patrick McFadin, Principal Solutions Architect at DataStax
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-the-worlds-next-top-data-model-by-patrick-mcfadin
You know you need Cassandra for it's uptime and scaling, but what about that data model? Let's bridge that gap and get you building your game changing app. We'll break down topics like storing objects and indexing for fast retrieval. You will see by understanding a few things about Cassandra internals, you can put your data model in the spotlight. The goal of this talk is to get you comfortable working with data in Cassandra throughout the application lifecycle. What are you waiting for? The cameras are waiting!
  • 11 participants
  • 1:07 hours
cassandra
models
advanced
dbas
mike
dashboard
data
discussions
insight
relationally
youtube image

26 Jun 2013

Speaker: Mohit Anchlia, Architect at Intuit
Slides: http://www.slideshare.net/planetcassandra/3-mohit-anchlia
This session talks about Intuit's journey of our Consumer Financial Platform that is built to scale to petabytes of data. The original system used a major RDBMS and from there, we redesigned to use the distributed nature of Cassandra. This talk will go through our transition including the data model used for the final product. As with any large system transition, many hard lessons are learned and we will discuss those and share our experiences.
  • 3 participants
  • 40 minutes
turbotax
manages
processing
transactions
services
cfb
application
discussion
problems
database
youtube image

26 Jun 2013

Speakers: Jake Luciani and Carl Yeksigian, Quantitative Strategists at BlueMountain Capital Management
Slides: http://www.slideshare.net/planetcassandra/jake-luciani-and-carl-yeksigian
This session will focus on our approach to building a scalable TimeSeries database for financial data using Cassandra 1.2 and CQL3. We will discuss how we deal with a heavy mix of reads and writes as well as how we monitor and track performance of the system.
  • 3 participants
  • 18 minutes
cassandra
database
time
models
dashboard
throughput
bottleneck
pooling
market
serialization
youtube image

26 Jun 2013

Speaker: Mike Heffner, Engineer & Co-Founder at Librato
Slides: https://speakerdeck.com/mheffner/time-series-metrics-with-cassandra
Librato's Metrics platform relies on Cassandra as its sole data storage platform for time-series data. This session will discuss how we have scaled from a single six node Cassandra ring two years ago to the multiple storage rings that handle over 150,000 writes/second today. We'll cover the steps we have taken to scale the platform including the evolution of our underlying schema, operational tricks, and client-library improvements. The session will finish with our suggestions on how we believe Cassandra as a project and its community can be improved.
  • 1 participant
  • 32 minutes
cassandra
data
advanced
dashboard
liberado
monitoring
volumes
schema
epoch
iteratively
youtube image

22 Jun 2013

Speaker: Isaac Rieksts, Software Development at Health Market Science
Slides: http://www.slideshare.net/planetcassandra/1-isaac-rieksts
Over the past few years, Health Market Science has transitioned from traditional relational databases and enterprise systems to a massively scalable Big Data platform that combines Cassandra and Storm to ingest thousands of feeds of data from the health market industry to produce a single high-quality masterfile. Come hear the "Why?", "What for?" and "How?" of that evolution.
  • 2 participants
  • 25 minutes
manage
management
data
prescribe
practitioners
pharmacies
claims
transactions
processing
schema
youtube image

19 Jun 2013

Speaker: Andrew Noonan, Developer at Gnip
Slides: http://www.slideshare.net/planetcassandra/c-summit-2013-dude-wheres-my-tweet-taming-the-twitter-firehose-by-andrew-noonan
Gnip ingests and must serve out hundreds of millions of social activities every day and social platforms are only growing. This makes the scalability of applications essential for Gnip. Enter Cassandra. Problem solved, right? Not exactly, Gnip's relationship with Cassandra was not all rainbows and unicorns. In this session we will walk you through why we began looking at Cassandra as a data store in the first place and the valuable lessons we with Cassandra that has made it an invaluable part of our infrastructure.
  • 1 participant
  • 30 minutes
twitter
tweets
cassandra
chat
currently
network
hooking
pinging
client
streaming
youtube image

19 Jun 2013

Speakers: Derek Bromenshenkel and Jeff Smoley, Infrastructure Architects at NativeX
Slides: http://www.slideshare.net/planetcassandra/native-x
NativeX (formerly W3i) recently transitioned a large portion of their backend infrastructure from Microsoft SQL Server to Apache Cassandra. Today, its Cassandra cluster backs its mobile advertising network supporting over 10 million daily active users that produce over 10,000 transactions per second with an average database request latency of under 2 milliseconds. Come hear our story about how we were successful at getting our .NET web apps to reliably connect to Cassandra. Come learn about FluentCassandra, Snowflake, Hector, and IKVM. It's a story of struggle and perseverance, where everyone lives happily ever after.
  • 5 participants
  • 49 minutes
native
workflow
datacenters
cassandra
plan
enterprise
backend
clients
infrastructure
app
youtube image

19 Jun 2013

Speaker: Eric Evans, Apache Cassandra Committer and Chief Architect at OpenNMS
Slides: http://www.slideshare.net/planetcassandra/4-eric-evans
A discussion of the recent work to transition Cassandra from its naive 1-partition-per-node distribution, to a proper virtual nodes implementation.
  • 6 participants
  • 37 minutes
cassandra
virtual
nodes
distributed
replication
dht101
complexity
hash
data
property
youtube image