Following on from my previous post Getting started with Spring Data Cassandra we will look into using multiple keyspaces within an application. This will be a relatively short post due to most of the content being covered in the earlier post allowing us to focus on the code needed to allow multiple keyspaces and reasons why you might want to switch from a single one to using multiple.
As mentioned a minute ago, Getting started with Spring Data Cassandra contains fundamental information required in this post, including dependencies and code snippets that are not shown here.
Firstly lets start off with why you might want to use multiple keyspaces. When creating a multi tenant application there are a few ways to separate each tenant’s data from each other; including a column in the partition key to distinguish each tenant, creating individual tables for each tenant or using a keyspace for each tenant. Each come with their advantages and disadvantages for example sharing a table between tenants is efficient and easy to use but more active tenants will take over all the key and row caches due to the volume of data they have and the higher number of queries they run.
As mentioned earlier we will be focusing on using multiple keyspaces which come with the advantages of allowing each tenant to have different schemas as well as replication settings. It also comes with the disadvantages of requiring a larger amount of tables when compared to sharing tables between tenants which in turn increases the number of SSTables and MemTables that are created leading to more memory being used. The advantages and disadvantages described for separate keyspaces are very similar to those of sharing a keyspace but with the exception of not being able to set different replication factors.
Now that there is some context we can carry on and look at how to implement multiple keyspaces when using Spring Data Cassandra.
There isn’t to much code to add upon my previous post as it just requires a few tweaks to the beans used. That being said I was hoping it would be a bit more elegant to write but hopefully the solution I produced is good enough. Anyway lets get on with looking at the code and you can decide how it looks.
If you did view my previous post this class might look familiar. This was originally the main configuration class for a application that was designed to work for a single keyspace. To make the application suitable for multiple keyspaces I turned it into an abstract class and removed the
@EnableCassandraRepositories annotations. It is then extended by other configuration classes which provide alterations to a few beans to allow them to point to their chosen keyspace. It is worth mentioning that
getEntityBasePackages is defined here so that all entities can be defined in a single place where they can be used from whatever keyspaces are defined.
This configuration class is the smaller of the two shown in this post (the other can be found further below) as I chose to allow it to use the beans defined in
AbstractCassandraConfiguration that are created due to extending
CassandraConfig. The only configuration actually included in this class is the definition of the keyspace it is using.
getKeyspace is used in the session bean created in
AbstractCassandraConfiguration which leads onto a important point about how I have gone about writing this code.
In this post I chose to tie each keyspace to a session, so as the number of keyspaces increases the amount of sessions running also goes up. This could become problematic when connecting to a large amount of keyspaces due to the overhead of running so many independent sessions. It is also possible to use a single session for multiple keyspaces (which I might cover in a later post) by using the
CassandraTemplate and specify the keyspace name in the query, but this requires you to write your own query implementations as you cannot use the inferred queries that the Spring Data repositories provide.
Continuing onto the other keyspace configuration.
This class is still pretty similar to the previous configuration but obviously it has some additions to it. Firstly it defines two beans which are also created in
AbstractCassandraConfiguration. A new implementation of
session is added that uses the
super implementation but provides the bean with a new name of “keyspaceBSession” instead of “session”. If this name was not provided it would get mixed up with the other session used in
KeyspaceACassandraConfig. As before
getKeyspaceName is used in the original
session method meaning that it doesn’t need to be explicitly set inside this class. The other created bean is the
CassandraAdminOperations which is an interface that
@Qualifier needs to be used here to specify that the session created here is the one that is used and not the session tied to the other keyspace. The
cassandraTemplateRef property inside
@EnableCassandraRepositories defines what
CassandraTemplate bean is used for all
CassandraRepository methods and is crucial for allowing multiple keyspaces to be used. If this property was not set the repositories would look for a bean named “cassandraTemplate” which in this scenario is tied to a different keyspace.
After these configuration classes are finished the only task remaining is creating some repositories. Due to wanting to provide the same methods in each keyspace a base repository has been created to reduce duplication.
To be used by the keyspace repositories, for example.
This allows the keyspaces to diverge in functionality if required but keeps any shared requirements in a central location. Remember what I mentioned a minute ago, setting the
cassandraTemplateRef in the
@EnableCassandraRepositories annotation specifies which
CassandraTemplate to use allowing the repositories themselves to be left bare and not require any extra configuration or annotations to make them work.
Now when each repository is used their queries will be directed towards the correct keyspaces.
That brings us to the end of this shorter post. We looked at how to configure the Spring Data Cassandra repositories to allow multiple keyspaces to be used and briefly touched on why you might want to use more keyspaces within your application. The method used in this post revolved around creating a new session and template for each keyspace which can be set up without to much extra configuration when compared to using a singular keyspace.
The code used in this post can be found on my GitHub profile.