youtube image
From YouTube: New York Times: IoT & Cassandra - Topic Wildcards in Retained Storage

Description

Speaker: Michael Laing, Systems Architect

The Internet of Things uses Topics to tag information.
Topics are segmented named channels that are attached when information is sent or stored
IoT Brokers use Retained Storage to persistently store information by Topic.
Retained Storage is a data store that is searchable using Wildcards in Topics.
Wildcards are reserved characters that match a single level or multiple levels in a Topic
By externalizing Retained Storage to Cassandra, IoT broker instances can autoscale, potentially handling 10's of millions of clients.
This requires efficient queries using Wildcards in Cassandra to access Retained Storage.

I will present strategies for implementing fast Wildcard queries composed of sub-strategies such as:
- Cluster key shuffling and auto-inversion to help determine partition key and narrow the row slice
- Sparse secondary indexes to minimize filtering
- Stratio's Cassandra Lucene Index to augment or replace other sub-strategies
illustrated by comparative benchmarks.
I will further discuss integration with IoT brokers at scale.