Datastax Cassandra Administrator Certification Questions and Answer (Pratice Questions and Dumps)

Question-: Suppose there are two Database tables in your Cassandra cluster, HE_VISIT and HE_SIGNUP. As you can see that HE_VISIT table has a high throughput, frequent write operations for this table. Which of the following statements are correct with regards to memtable flush for each table?
A. Table HE_VISIT’s memtable fills up rapidly and gets flushed frequently the table HE_SIGNUP.
B. Table HE_SIGNUP’s memtable fills up slowly and rarely gets flushed.
C. When commit log reaches to its maximum size then it forces HE_SIGNUP’s to flush.
D. If the commit log space and memtable space are same size than table HE_Signup’s memtable would flush every time table HE_VISIT is flushed.

Answer: A,B,C,D
Exp: Suppose you have two tables table A and table B, where write to the table A is very high and table B has a very low write. In this case the commit log would have write from both the table. And memtable of table A get filled very rapidly so it got flush frequently. But in case of table B table fills slowly and flushed very rarely. Now the role of the commit log is to keep data written from both the table. And commit log has its own define Max size, if it reaches to that maximum size then it forces all the table’s memtable to be flushed to the disc as SSTable. Reason being commit log is common for all the table. And as you know commit log itself is divided into the segments, so when commit log reaches its maximum size it has to delete oldest segment for the commit log as well.

Admin/Dev both

Question-: Which of the following is shared across all the tables?
A. Memtable
B. SSTables
C. Commit logs
D. Both memtable and SSTable

Answer: C
Exp: Memtables and SSTables are maintained for each table. And commit log is shared across all the tables on the node. Remember SSTables are immutable and once written they cannot be changed, only a bigger SSTable can be created by merging the various memtables.

Question-: You have been given below detail from the directory stored on one of the nodes in the Cassandra cluster.

data/hadoopexam/course_fee-a5g22x211gf422l7790c34ad987777d3d/xx-1-bti-Data.db
Can you please map the following?

A. HadoopExam
B. Course_FEE
C. BTI
D. XX

1. Keyspace Name
2. Table Name
3. Access Mostly Uused Products by 50000+ Subscribers
4. Version of the SSTable

Answer: A-1, B-2, C-3, D-4
Exp: In the given example you can easily identify what is the name of the table, what is the keyspace name, what is the unique identifier of the table and what format SSTable is using etc.

In this case HadoopExam is the name of the keyspace, which is created under the data directory. Hence for each keyspace you would have one directory created on the data directory. And the next is the directory, which contains the initial as a table name and next Big hexadecimal string is a unique identifier for the table. In this directory various .db files are created, where initial is the version of the SSTable and format of the SSTable.

Dev/Admin both

Related Questions

Question-: Which all of the below are correct for the “nodetool tablehistograms�? command?
A. It provides the current table level statistics.
B. It provide the detail of an individual table read and write latency since last 15 minues.
C. It provide the information about the entire clutser.
D. It provides the latencies of a node.
E. It provides all the tables name which are not in sync across the cluster

Question-: Please map the followings
A. logback.xml
B. system.log
C. debug.log
D. gc.log

1. You can set the log levels and location of the log files in this.
2. In this you can check when the nodes leaving and joining
3. Access Mostly Uused Products by 50000+ Subscribers
4. You can check the application pause time in this log

Question-: Which of the following information you can check with the “system.log�? ?
A. All the gc pauses
B. When the nodes are leaving and joining
C. Metadata ownership change information
D. Index redistribution
E. Which node was attacked by hacker?

Question-: Which of the following are the causes for GC pauses in Cassandra database?
A. lack of JVM space
B. Mistuned JVM GC algorithm
C. Slow disk
D. High volume of data
E. high number of nodes in Cassandra cluster

Question-: While analyzing the Cassandra cluster you found that one of the nodes in the ring is responding quite slow to query. Hence, you decided to debug the system and wanted to enable the log level to Debug only on that particular node. And once analysis is done, you wanted to reset the log level to INFO. Which of the following option you would use, so that you don’t have to restart the cluster?
A. You would be changing the log level in logback.xml file. Which would be effective for next query
B. You would be updating the log level by changing the JVM options.
C. You will be using the nodetool setlogginglevel option.
D. It is not possible, without node restart

Question-: You have setup Splunk process as well as Geneos alert system to analyze the logs. And they both can read the logs from a particular directory only. Now you need to make sure that the logs can be created in that directory only. How can you achieve this?
A. You would be providing the log path at Cassandra.yaml file
B. You will be providing the log path using the jvm.options
C. You would be updating the log4j.properties file
D. You would be updating the logback.xml file