Category Archives: Cassandra

Reading Nested Objects Modeled with Composite Key from Cassandra

My earlier post was about storing nested objects modeled with composite key in Cassandra. Well, we need to be able to read the data back as objects and that’s the topic for this post. This post will focus on rest … Continue reading

Posted in Big Data, Cassandra, Data Model, NOSQL | Tagged | 2 Comments

Storing Nested Objects in Cassandra with Composite Columns

One of the popular features of MongoDB is the ability to store arbitrarily nested objects and be able to index on any nested field. In this post I will show how to store nested objects in Cassandra using composite columns. … Continue reading

Posted in Big Data, Cassandra, Data Model, NOSQL | Tagged , , , | 19 Comments

Cassandra Range Query Made Simple

In Cassandra, rows are hash partitioned  by default. If you want to data sorted by some attribute, column name sorting feature of Cassandra is usually exploited. If you look at the Cassandra slice range API, you will find that you … Continue reading

Posted in Cassandra, NOSQL | Tagged , , | 7 Comments

Data Loader for NOSQL Databases

In one of my recent projects, I had to load product data from a CSV file into HBase and also to index it for search purpose.. I decided to separate out the loader part of the project as a stand … Continue reading

Posted in Cassandra, HBase, Indexing, NOSQL | Tagged , , , | 3 Comments

Cassandra Secondary Index Patterns

We all know that any real application needs to do query based on attributes other than the primary key or row key in case of Cassandra. Cassandra version .7 onwards provides native secondary index support. But there are several limitations. … Continue reading

Posted in Cassandra, Indexing, NOSQL | Tagged , | 8 Comments

Easy Cassandra Data Access

This post is about a simple no nonsense data access API for Cassandra. I did not start with a grandiose plan for yet another high level Cassandra API. I was implementing a Cassandra based BPM that I blogged about earlier. … Continue reading

Posted in Cassandra, Java, NOSQL | Tagged , | 4 Comments

Cassandra secondary index to the rescue

In an earlier blog, I discussed Cassandra data model for a BPM system. I used an eCommerce order processing system driven by a BPM as an example. In this post, we will discuss some of the data access usage scenario … Continue reading

Posted in Cassandra, Indexing, NOSQL | Tagged , , , | 1 Comment

Roll out your own BPM with some help from Cassandra (Part 2)

In Part 1 we talked about BPM architecture in  general. My focus in this post will be a data model for our BPM using Cassandra. Well, you might ask why not stick with MySQL that we all love. Why Cassandra … Continue reading

Posted in BPM, Cassandra, Data Model, NOSQL | Tagged , , | 2 Comments

Roll out your own BPM with some help from Cassandra (Part 1)

BPM stands for Business Process Management. In simple terms it’s a framework for managing long running business transaction. A long running business transaction manages a business process e.g., order processing in a eCommerce web site. A business process is generally … Continue reading

Posted in BPM, Cassandra, eCommerce, NOSQL | 1 Comment

Cassandra and Hadoop

I was always interested in mining patterns and knowledge from data. While working on a data mining project some time ago, I ran into a road block when dealing with very large data set. Most data mining algorithms are monolithic … Continue reading

Posted in Cassandra, Hadoop and Map Reduce, NOSQL | Tagged , | 1 Comment