Question : You are working in an Investment Bank, and they have subscribed data feeds from BloomBerg, which delivers data in various different format e.g. CSV file, Direst database access either RDBMS or NOSQL(Cassandra) and daily incremental data. Which of the following you can use in Cassandra to load data or migrate data from one Cassandra cluster to another. A. CQL INSERT command B. CQL COPY command C. Loading CSV file directly. D. sstableloader
1. A,B,C 2. B,C,D 3. A,C,D 4. A,B,C,D
Correct Answer : 4 Explanation: Data is inserted using the CQL INSERT command, the CQL COPY command and CSV files, or sstableloader. But in reality, you need to consider how your client application will query the tables, and do data modeling first. The paradigm shift between relational and NoSQL means that a straight move of data from an RDBMS database to Cassandra will be doomed to failure.
Question : A file to which Cassandra appends changed data for recovery in the event of a hardware failure is called.
1. commit log
2. memtable
3. SSTable
4. redo log
Correct Answer : 1 Explanation: A sequentially written commit log on each node captures write activity to ensure data durability.
Question : Cassandra addresses the problem of failures by employing
1. Master-slave Architecture
2. peer-to-peer gossip communication protocol
3. peer-to-peer TCP communication protocol
4. peer-to-peer UDP communication protocol
Correct Answer : 2 Explanation: Cassandra is designed to handle big data workloads across multiple nodes with no single point of failure. Its architecture is based on the understanding that system and hardware failures can and do occur. Cassandra addresses the problem of failures by employing a peer-to-peer distributed system across homogeneous nodes where data is distributed among all nodes in the cluster. Each node frequently exchanges state information about itself and other nodes across the cluster using peer-to-peer gossip communication protocol.