Question-: In which of the following situation you would be using DSBulk tool? A. If we have a lot of files which needs to be loaded in different tables then we should use DSBulk tool. B. If you wanted to take the backup of data from the Cassandra cluster then we should use this tool. C. If data model changes for our existing data and we wanted to move it to the new data model then you should use this tool. D. While migrating data from one Cassandra database to another Cassandra database. E. While converting JSON data to csv data.
Answer: A, B, C, D Exp: DSBulk tool can be used for loading huge volume of data to Cassandra database. And it supports gigabytes of data load/unload. So, this is one of the convenient tools for loading bigger volume of data. Even if you are using multiple files which needs to be loaded into many tables in the same Cassandra database then you should use this tool. If you wanted to take the backup of your present Cassandra database then just download the data or export the data using this tool and create a backup file.
At the later stage if you have found that your data model is not efficient and you need to change this data model so what you can do you download your existing data from the existing cluster and Same data can be uploaded into the new data model using the DSBulk tool.
Question-: To get the backup of the data you run the snapshot command; it would create a snapshot for the data which are in the as SSTable as well as in the memtable? A. True B. False
Answer: B Exp: When you are going to take the snapshot of the existing data in the database. It will create the snapshot from the SSTable files only and does not consider the data which is in the Memtable. So if you want to take the backup of the current data in the memtable as well then you should first flush the data to SSTable, then take the snapshot.
Question-: When you take up Snapshot for one of the SSTable on a particular node then it would not take a backup of the entire cluster? A. True B. False
Answer: A Exp: When you take a backup using the snapshot then it would create a snapshot of SSTable only for the data which exist on that node. It would not consider the data which are residing on any other node in the cluster, even for the same table.