09 Columnar Databases
Slides: https://webeep.polimi.it/mod/resource/view.php?id=49240 Exercises: https://webeep.polimi.it/mod/resource/view.php?id=48804
Pages
01 Introduction
In recent years there has been an ever-growing need for technologies capable of handling large-scale data analysis. This need arose because datasets of “Big Data” size often have very different…
02 Cassandra
Apache Cassandra is a highly scalable, high-performance distributed database designed to handle large amounts of data, providing high availability with no single point of failure. It’s a column-based…
03 HBase
HBase Table: split into multiple regions, replicated across servers. One Store per ColumnFamily (a subset of columns with similar query patterns) per region. One Memstore for each Store: in-memory…
04 Cassandra Query Language
To query the data stored in Cassandra, a dedicated query language named Cassandra Query Language (CQL) was developed. CQL offers a model similar to MySQL in many respects. It is used…