Wednesday 25 July 2018

NoSQL: Awesome Morning, What is Apache Cassandra?

The Cassandra was developed by Facebook for Inbox Search, and it became open-source in July 2008.
In March 2009, Apache accepted Cassandra, and now Cassandra is one of the high-level projects of the Apache.
The Apache released a new version of Cassandra 3.5 on 2016-04-13.
It is open source, meaning its source code is freely available for anyone to study, modify and use.
NoSQL Cassandra
The Cassandra is a highly scalable and high performance distributed NoSQL database.
It is a type of Column Stored NoSQL database.
I already posted about, What is a Column Stored NoSQL Database?

The Apache Cassandra is a peer to peer architecture.
Some of the database system with high availability has Master – Slave format in which, if Master server is failed, the other Slave server also may fail.
In the peer to peer architecture, all nodes are equal and there are no any dependency on each other. Very easily, We can add or remove any node in the cluster.
Cassandra has distributed architecture, and it can handle a huge amount of data with high availability.
There are very rare chance of failure of the data because multiple nodes are running with same copies of data.
If one node is crashed, the data can be available in the second nearest node. You can define Replication Factor value (number of nodes) by the time of creating a Keyspace.
Cassandra offers solutions for a problem where your requirement is to have a very heavy write system, and you want to have a quite responsive reporting system on top of that stored data.
A Cassandra data structures are very similar to any RDBMS data structures, and it’s faster than RDBMS. The RDBMS people can easily learn and migrate in Cassandra.
Some of the Cassandra Features are:
It is Masterless Architecture,
Flexible and Dynamic Data Model,
Good with Data Compression,
Having a Cassandra Query Language (CQL),
It has Fault Detection and Recovery option, Having a good algorithm for Data Protection, Replicate data into multiple servers.

Cassandra is the best option for applications like Mobile Messaging Services, Internet Provider, Social Media Analytics, and other Data Mining related systems.
Some of the popular companies like Facebook, Twitter, Netflix, Cisco, eBay, Reddit, Comcast, Adobe are using Cassandra for storing a large amount of data.

0 comments:

Post a Comment