Datastax Cassandra Administrator Certification Questions and Answer (Pratice Questions and Dumps)

Question-: Which of the following statement/s is/are correct for Cassandra?
A. Cassandra support linear performance when scaling nodes in a cluster.
B. You can tune the consistency in Cassandra DB
C. Every row in a table can have different number of columns.
D. Across the Data center it has single point of failure.

Answer: A, B, C
Exp: Following are the features of the Cassandra database you should keep in mind.

- Cassandra support JMX (Java Monitoring Extensions), to get the Matrics for performance, latency, system usage etc. And this matrics you can pass to other application as well for further analysis.
- Cassandra DB has flexible data model, where each row in a partition can different number of columns.
- Tunable consistency: If we want to replicate data across three nodes, we can have replication factor set as 3. While writing the data we have option whether to wait for all the three node acknowledgement for the write or as soon as data is written to first node should be acknowledged. As per the need we can tune the consistency. However, finally data will be stored all three nodes and that is known as eventual consistency.
- Linear performance: If your 3 node Cassandra cluster support 6000 writes per second than 4 node will support 8000 writes, similarly adding 2 more nodes (total 6) will have 12000 writes per second. As many nodes you would have in the cluster it would linearly increase the performance.
- Continuous available: As Cassandra has peer-to-peer replication of data across the nodes. And there is no single master. Hence, there is no single point of failure, even your cluster span across the data center.
In the event of operation tie
1. Deletes will take precedence over inserts/updates.
2. If there are two updates, the one with the lexically larger value will wins.

Admin and Dev both

Question-: Which of the following statements are correct for querying data from a Cassandra table?
A. In where clause you first provide the partition key and then you can have clustering columns.
B. You cannot perform equality or range queries on clustering columns.
C. All equality comparisons must come before inequality comparisons.
D. Range searches are binary search.

Answer: A, C, D
Exp: We must know the following things while querying the table which has partition key and clustering column.
- While querying in where clause we should first use the partition key.
- Then in the where clause we can have Clustering columns.
- We can perform equality (=) and range queries (<,>) on clustering columns.
- Also keep in mind that while querying data equality comparisons must come before inequality comparisons.
- As we define clustering columns then data would always be sorted on the disk. And while searching the data if there is a range search it would do the binary search and then apply the linear search.
Admin Only

Question-: Which of the following statements are correct for the storage layer of the Apache Cassandra node?
A. Cassandra node must have locally attached storage.
B. Cassandra node can work with Direct Attached Storage.
C. Cassandra best work the SAN (Storage Area Network)
D. Cassandra can work with the SSD
E. Cassandra works well with the Network attached storage.

Answer: B, D
Exp: Cassandra database works well with the locally attached storage but this is not the must. It can have direct attached storage as well. Hence, option-2 is correct.

Cassandra should never be setup with SAN (Storage area network). Hence, option-3 is not at all correct.
Yes, it is a good choice having an SSD for storage layer. Hence, option-4 is correct.

Cassandra is does not perform better with the network attached storage and must be avoided.

Admin and Dev Both

Related Questions

Question-: Which of the following are valid for the vnodes in a Cassandra cluster?
A. There are by default 128 vnodes for each node.
B. There are by default 8 vnodes for each node.
C. To configure you have to set num_tokens in Cassandra.yaml file.
D. To configure you have to set num_vnodes in Cassandra.yaml file.
E. When we use the vnodes Cassandra automatically assigns the token range.

Question-: Which of the following statement true with regards to Cassandra cluster gossip protocol?
A. In cluster every node initiates a gossip round every second and can choose between 1 to 3 nodes for gossip.
B. Gossip can only be initiated by the seed nodes.
C. Nodes can gossip with any other node in the cluster but they favor the seed node.
D. Node always track which node they have gossiped. To avoid the re-gossiping.
E. Gossip protocol transfer client data from one node to another node.

Question-: In Cassandra snitch is used for
A. To find all the slow performing nodes in the big Cassandra cluster.
B. This is a protocol for Gossiping in the Cassandra cluster.
C. It finds the seed nodes in the cluster.
D. All of the above
E. None of the above

Question-: You have node cluster, initially you kept the replication factor as and then after discussing with the architect you added more nodes in the cluster and changed the replication factor to , then which of the following would happen?
A. A single node in the cluster now would be able to handle more number of tokens.
B. Same token would be handled by more than one node.
C. Overall storage requirement would increase
D. None of the above

Question-: Cassandra database best fit which of the following from CAP theorem?
A. Consistency, Availability
B. Availability, Partition tolerance
C. Partition tolerance, Consistency
D. Consistency, Availability, and Partition tolerance.

Question-: You have node cluster setups within datacenters. Each datacenter has nodes and dynamic snitch is configured. Also, you have RF= and consistency level is set as LOCAL_QUORUM. What does that mean?

A. A read or write request will be acknowledged to the client once it has achieved quorum from each data center.
B. A read or write request will be acknowledged to the client once it has achieved quorum from data center it is talking to.
C. A read or write request will be complete once it has achieved quorum across all the data centers.
D. It will check all the copies of data in the cluster before read replies to the client and latest copy of the data would be returned.