10 Apr 2013
Speaker: Edward Capriolo
SlideShare: http://www.slideshare.net/slideshow/embed_code/18573117
The ColumnFamily data model and wide-row support provides the ability to store and access data efficiently in a de-normalized state. Recent enhancements for CQL's spare tables and built-in indexing provide the capability to store data in a manner similar to that of relational databases. For many use cases hybrid approaches are needed, because complete de-normalization is appropriate for some access patterns whereas more structured data is appropriate for others. At times a single logical event becomes multiple insertions across multiple column families. Likewise a user request might require a several reads across different column families. This talk describes some of these scenarios and demonstrates how advanced operations such multiple step procedures, filtering, intersection, and paging can be implemented client side or server side with the help of the IntraVert plugin.
SlideShare: http://www.slideshare.net/slideshow/embed_code/18573117
The ColumnFamily data model and wide-row support provides the ability to store and access data efficiently in a de-normalized state. Recent enhancements for CQL's spare tables and built-in indexing provide the capability to store data in a manner similar to that of relational databases. For many use cases hybrid approaches are needed, because complete de-normalization is appropriate for some access patterns whereas more structured data is appropriate for others. At times a single logical event becomes multiple insertions across multiple column families. Likewise a user request might require a several reads across different column families. This talk describes some of these scenarios and demonstrates how advanced operations such multiple step procedures, filtering, intersection, and paging can be implemented client side or server side with the help of the IntraVert plugin.
- 7 participants
- 39 minutes

10 Apr 2013
Speaker: Nathan Milford
SlideShare: http://www.slideshare.net/nmilford/the-automation-factory
Automation Stack at Outbrain
SlideShare: http://www.slideshare.net/nmilford/the-automation-factory
Automation Stack at Outbrain
- 3 participants
- 43 minutes

10 Apr 2013
Speaker: Michael Figuiere
SlideShare: http://www.slideshare.net/planetcassandra/nyc-tech-day-new-cassandra-drivers-in-depth-17867623
Cassandra 1.2 finalizes CQL3 and introduces a new binary protocol for client/server communication. These two components are the foundation of the new line of drivers developed by DataStax. Based on years of experience with Cassandra, these new drivers for Java, .Net and Python come with an asynchronous and lightweight architecture, a clean and simple API, a standardized way to discover nodes and to manage load balancing and fail over. This presentation will give an in depth look at these new drivers which will make your Cassandra-based applications even more robust, efficient and simple to write.
SlideShare: http://www.slideshare.net/planetcassandra/nyc-tech-day-new-cassandra-drivers-in-depth-17867623
Cassandra 1.2 finalizes CQL3 and introduces a new binary protocol for client/server communication. These two components are the foundation of the new line of drivers developed by DataStax. Based on years of experience with Cassandra, these new drivers for Java, .Net and Python come with an asynchronous and lightweight architecture, a clean and simple API, a standardized way to discover nodes and to manage load balancing and fail over. This presentation will give an in depth look at these new drivers which will make your Cassandra-based applications even more robust, efficient and simple to write.
- 3 participants
- 51 minutes

10 Apr 2013
Speaker: Dave Finnegan
SlideShare:
An April 2012 InformationWeek special report entitled "Why NoSQL Equals No Security" began by stating: "If it seems security is an afterthought at best in the big data ecosystem, you're right." DataStax Enterprise 3.0 overcomes this perception and is the first big data platform in the NoSQL industry to bring the type of enterprise security used in traditional RDBMS's to secure systems and important data to the big data/NoSQL market. This presentation will describe each aspect of DataStax Enterprise 3.0's security feature set. Note that all security features are optional; the administrator can decide to use none, some, or all of them depending on their specific application. Features described include Internal Authentication, External Authentication, Permission Management, Transparent Data Encryption, Data Auditing, Client to Node Encryption
SlideShare:
An April 2012 InformationWeek special report entitled "Why NoSQL Equals No Security" began by stating: "If it seems security is an afterthought at best in the big data ecosystem, you're right." DataStax Enterprise 3.0 overcomes this perception and is the first big data platform in the NoSQL industry to bring the type of enterprise security used in traditional RDBMS's to secure systems and important data to the big data/NoSQL market. This presentation will describe each aspect of DataStax Enterprise 3.0's security feature set. Note that all security features are optional; the administrator can decide to use none, some, or all of them depending on their specific application. Features described include Internal Authentication, External Authentication, Permission Management, Transparent Data Encryption, Data Auditing, Client to Node Encryption
- 3 participants
- 37 minutes

10 Apr 2013
- 6 participants
- 36 minutes

9 Apr 2013
Speakers: Matt Pfeil(DataStax), Eddie Satterly(Splunk), Edward Capriolo(Media6Degrees), Matt Conway(Backupify), Russell Bradburry(SimpleReach), Jake Lucianni(BlueMountain Capital)
- 16 participants
- 53 minutes

8 Apr 2013
Speakers: Jake Luciani and Carl Yeksigian, BlueMountain Capital
SlideShare: http://www.slideshare.net/planetcassandra/nyc-tech-day
This talk will focus on our approach to building a scalable TimeSeries database for financial data using Cassandra 1.2 and CQL3. We will discuss how we deal with a heavy mix of reads and writes as well as how we monitor and track performance of the system.
SlideShare: http://www.slideshare.net/planetcassandra/nyc-tech-day
This talk will focus on our approach to building a scalable TimeSeries database for financial data using Cassandra 1.2 and CQL3. We will discuss how we deal with a heavy mix of reads and writes as well as how we monitor and track performance of the system.
- 9 participants
- 47 minutes

8 Apr 2013
Speakers: Brian O'Neill and Taylor Goetz
A successful Big Data platform combines distributed processing and polyglot persistence into a single cohesive infrastructure. Over the past few years, Health Market Science has transitioned from traditional relational databases and enterprise systems to a massively scalable Big Data platform that combines Cassandra and Storm to ingest thousands of feeds of data from the health market industry to produce a single high-quality masterfile. Hear how we applied event processing and NoSQL to deliver real-time analytics, while accommodating structural change over time, and fuzzy/geospatial search.
A successful Big Data platform combines distributed processing and polyglot persistence into a single cohesive infrastructure. Over the past few years, Health Market Science has transitioned from traditional relational databases and enterprise systems to a massively scalable Big Data platform that combines Cassandra and Storm to ingest thousands of feeds of data from the health market industry to produce a single high-quality masterfile. Hear how we applied event processing and NoSQL to deliver real-time analytics, while accommodating structural change over time, and fuzzy/geospatial search.
- 2 participants
- 51 minutes

8 Apr 2013
Speaker: Rick Branson of Instagram
It's upsetting whenever we hear that we can't have things that we want. It'd be nice to live in a world where it was possible to have things ACID transactions, uniqueness guarantees, and sequential counters that were globally and always available. What makes this worse is that when we're told we can't have them, people just wave their arms around in the air and shout things like "CAP theorem." In this talk, I'll walk through some of these "ponies" and demonstrate the points at which things start falling apart with practical, real-world examples.
It's upsetting whenever we hear that we can't have things that we want. It'd be nice to live in a world where it was possible to have things ACID transactions, uniqueness guarantees, and sequential counters that were globally and always available. What makes this worse is that when we're told we can't have them, people just wave their arms around in the air and shout things like "CAP theorem." In this talk, I'll walk through some of these "ponies" and demonstrate the points at which things start falling apart with practical, real-world examples.
- 8 participants
- 49 minutes

8 Apr 2013
Speaker: Thomas Pinckney, Senior Director of Engineering at eBay
SlideShare: http://www.slideshare.net/planetcassandra/e-bay-nyc
Recommendation and personalization systems are an important part of many modern websites. Graphs provide a natural way to represent the behavioral data that is the core input to many recommendation algorithms. Thomas Pinckney and his colleagues at Hunch (recently acquired by eBay) built a large scale recommendation system, and then ported the technology to eBay. Thomas will be discussing how his team uses Cassandra to provide the high I/O storage of their fifty billion edge graphs and how they generate new recommendations in real time as users click around the site.
SlideShare: http://www.slideshare.net/planetcassandra/e-bay-nyc
Recommendation and personalization systems are an important part of many modern websites. Graphs provide a natural way to represent the behavioral data that is the core input to many recommendation algorithms. Thomas Pinckney and his colleagues at Hunch (recently acquired by eBay) built a large scale recommendation system, and then ported the technology to eBay. Thomas will be discussing how his team uses Cassandra to provide the high I/O storage of their fifty billion edge graphs and how they generate new recommendations in real time as users click around the site.
- 9 participants
- 48 minutes

8 Apr 2013
Speaker: Eddie Garcia, VP of Development at Gazzang
Topic: Security at Gazzang (DataStax Partner)
Topic: Security at Gazzang (DataStax Partner)
- 1 participant
- 5 minutes

8 Apr 2013
Speaker: John McCann, Senior Director of Applications Engineering at Comcast
SlideShare: http://www.slideshare.net/planetcassandra/nyc-tech-day-using-cassandra-for-dvr-scheduling-at-comcast
Comcast is developing a highly scalable cloud DVR scheduling system on top of Cassandra. The system is responsible for managing all DVR data and scheduling logic for devices on the X1 platform. This talk will cover the overall architecture of the scheduling system, data model, message queue and notification software that have been developed as part of this ambitious project. We'll take a deep dive into the details of our data model and review the implementation of Comcast's open-source, Cassandra-based clones of Amazon SQS and SNS.
SlideShare: http://www.slideshare.net/planetcassandra/nyc-tech-day-using-cassandra-for-dvr-scheduling-at-comcast
Comcast is developing a highly scalable cloud DVR scheduling system on top of Cassandra. The system is responsible for managing all DVR data and scheduling logic for devices on the X1 platform. This talk will cover the overall architecture of the scheduling system, data model, message queue and notification software that have been developed as part of this ambitious project. We'll take a deep dive into the details of our data model and review the implementation of Comcast's open-source, Cassandra-based clones of Amazon SQS and SNS.
- 4 participants
- 42 minutes

8 Apr 2013
Speaker: Jonathan Ellis, Apache Cassandra Project Chair and CTO/Co-Founder of DataStax
SlideShare Presentation: http://www.slideshare.net/planetcassandra/nyc-jonathan-ellis-keynote-cassandra-12-20
SlideShare Presentation: http://www.slideshare.net/planetcassandra/nyc-jonathan-ellis-keynote-cassandra-12-20
- 2 participants
- 47 minutes

20 Mar 2013
Speaker: Eric Lubow
SlideShare: http://www.slideshare.net/planetcassandra/big-data-revolution-is-an-evolution-ny-cassandra-20130320
Dealing with data doesn't only require a data store, it requires an infrastructure. At SimpleReach, we have 5 data storage layers to service all of our data needs. These range from high volume, high velocity data ingestion with real-time analytics to ad-hoc style historical analysis with search capabilities. To communicate effectively between applications, data stores sit behind a service architecture for consistent data access patterns and failover/redundancy. This talk is a story of how we came to this architecture and some of the lessons we learned along the way.
SlideShare: http://www.slideshare.net/planetcassandra/big-data-revolution-is-an-evolution-ny-cassandra-20130320
Dealing with data doesn't only require a data store, it requires an infrastructure. At SimpleReach, we have 5 data storage layers to service all of our data needs. These range from high volume, high velocity data ingestion with real-time analytics to ad-hoc style historical analysis with search capabilities. To communicate effectively between applications, data stores sit behind a service architecture for consistent data access patterns and failover/redundancy. This talk is a story of how we came to this architecture and some of the lessons we learned along the way.
- 5 participants
- 40 minutes
