Datastax Cassandra Administrator Certification Questions and Answer (Pratice Questions and Dumps)

Question-: Which of the following are correct statement for the sstableloader command?
A. This Tool help in loading the external data to an existing cluster.
B. This tool help in loading the existing SSTable to the cluster.
C. When you load the data from SSTable then previous cluster and new cluster should have the same number of nodes.
D. Loading the data using this tool required that previous cluster and new cluster have same application strategy or partitioner.

Answer: A,B

Explanation: The sstableloader command is used to load the existing SStable into the cluster. There is no hard and fast requirement that previous cluster and new cluster have the same number of nodes or the same replication strategy, this can be different. So when data is loaded using the sstableloader on the new cluster, then partition strategy or replication strategy would be rearranged as per the new cluster configuration.

Question-: You have many csv files with almost million records across all the files , there is a one column in the csv file which has a date data in it. But before loading the csv file you want to change the format of date column. Which of the following is most suitable solution for this requirement?
A. You would be using CQL copy command
B. You would be using sstableloader
C. You would be using DSBulk tool
D. You would be using spark

Answer: D

Explanation: There are multiple options to load data in Cassandra Database, as mentioned previously you should use different tool based on the different requirement. In this question it is given that you wanted to format the date before uploading the data in the Cassandra database. So if you want to modify the data or pre-process the data then Spark is the best solution for this. Using the spark programming you can modify the format of the data as per your requirement and then directly save the data in Cassandra database. You can use parallelism to speed up this process. There are a lot of Optimization possible in this process and various tuning parameters are available through Spark as well.

Question-: Which of the following statement are correct for the DSBulk tool?
A. This can be used to export data from Casandra database.
B. This can be used how to load JSON file in database.
C. This can be used to load csv file in database
D. This can be used to rebalance Cassandra cluster.
E. This can be used to repartition Cassandra cluster.

Answer: A,B,C

Explanation: DSBulk tool it is one of the best tool available for importing and exporting huge volume of data to the Cassandra cluster. These tools support Both csv and JSON format. Yes, this tool is available as a command line interface. This tool can not be used for rebalancing or repartitioning the Cassandra database.

Related Questions

Question-: You have batch processing on every midnight which runs for - hours, based on the data volume. And this batch process writes this data in a Cassandra database table. Which of the following Compaction strategy fits for this requirement?
A. Leveled compaction
B. Size Tiered compaction
C. Time window compaction
D. Batch window compaction

Question-: Which of the following event Trigger the compaction process?
A. Compaction happens every time when Memtable flushes to an SSTable.
B. Compaction happens when manual flush happens
C. Compaction happens during the bootstrap process
D. Compaction happens when Memtable or commit log become too large.
E. Compaction happens every midnight by default
F. Compaction happens with every delete row command issued by a client

Question-: Which of the following statements are correct with regards to Tombstones compaction strategy?
A. If there are no eligible buckets for size tiered compaction then this may be initiated.
B. If there are expired tombstone records are more than 20% then this would be initiated.
C. In case of Tombstone compaction largest SSTable would be chosen first.
D. SSTable must be at least one week older before it is chosen for Tombstone compaction.

Question-: In your Cassandra cluster you have a heavy write, then which of the compaction strategy most suitable?
A. Size tiered compaction strategy
B. Leveled compaction strategy
C. Tombstone compaction strategy
D. Expired data compaction strategy

Question-: When you issue “nodetool compact�? it will
A. Initiate major compaction.
B. Initiate minor compaction.
C. Initiate Tombstone compaction
D. Creates one large SSTable
E. This would result in high IO

Question-: Size tiered compaction triggers compaction process based on the number of tables?
A. True
B. False