Difference between Cassandra and Hadoop:
Parameters | Cassandra | Hadoop |
---|---|---|
CAP theorem | AP | CP |
Architecture | Peer-to-peer distributed architecture in which all nodes are equal. | Master/slave. The NameNode acts as the master and the DataNodes act as worker nodes. |
Read and write design | Read-and-write-anywhere model. | Write-once, read-many access model. |
Area of utilization | Real-time online transaction processing (OLTP). | Batch-oriented analytical solutions. |
Mode of accessing data | Cassandra Query Language (CQL) and command-line interface tools. | MapReduce for read/write operations. |
Data storage model | Keyspaces and column families store the data, with primary and secondary indexes for data lookup. | File system. Large files are split into small blocks, and each block is copied to several DataNodes. |
Fault tolerance | High availability: no single point of failure (SPOF), because all nodes in the cluster are equal and any of them can handle a request. | The master node is a single point of failure: the cluster is vulnerable when the master node goes down. |
Storage schema | Combines ideas from Google Bigtable and Amazon Dynamo. | Physical file system schema. |
Communication | Gossip protocol | RPC/TCP and UDP |
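
The Hadoop side of the "Data storage model" row (large files split into blocks and copied to many DataNodes) can be illustrated with a small sketch. This is a toy model, not Hadoop's actual code: the function names `split_into_blocks`, `place_replicas`, and `readable_after_failure` are hypothetical, the node names are made up, and the round-robin placement is an assumption chosen for simplicity rather than the placement policy HDFS really uses.

```python
def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int) -> dict[int, list[str]]:
    """Assign each block to `replication` distinct nodes.

    Assumption: simple round-robin placement, used here only for illustration.
    """
    if not 1 <= replication <= len(nodes):
        raise ValueError("replication must be between 1 and the number of nodes")
    return {
        block: [nodes[(block + r) % len(nodes)] for r in range(replication)]
        for block in range(num_blocks)
    }


def readable_after_failure(placement: dict[int, list[str]], failed_node: str) -> bool:
    """Return True if every block still has a replica on a surviving node."""
    return all(
        any(node != failed_node for node in replicas)
        for replicas in placement.values()
    )


if __name__ == "__main__":
    blocks = split_into_blocks(b"x" * 10, block_size=4)
    nodes = ["dn1", "dn2", "dn3", "dn4"]  # hypothetical DataNode names
    placement = place_replicas(len(blocks), nodes, replication=3)
    for block, replicas in placement.items():
        print(f"block {block} ({len(blocks[block])} bytes) -> {replicas}")
    print("readable with dn2 down:", readable_after_failure(placement, "dn2"))
```

With three replicas per block, losing any single DataNode leaves every block readable. This sketch only covers the data blocks; it says nothing about the master node, whose failure the "Fault tolerance" row treats separately.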