Explanation: In this case, course_name is filtered with the equality operator, so it needs to be in the partition key portion of the primary key. publish_date is filtered with an inequality operator, so it should be a clustering key. course_name alone is enough to uniquely identify the course.
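The rule above can be illustrated with a toy in-memory model. This is not Cassandra itself, only a sketch of the idea: rows are grouped by a key that is looked up by equality and kept sorted inside each group by a second column, so an inequality filter becomes a contiguous slice. The function names and sample rows are hypothetical.

```python
from bisect import bisect_left, insort
from collections import defaultdict

# Toy model: partition key -> list of (clustering value, row), kept sorted.
partitions = defaultdict(list)

def insert(course_name, publish_date, row):
    # course_name plays the role of the partition key (equality lookup);
    # publish_date plays the role of the clustering key (sorted order).
    insort(partitions[course_name], (publish_date, row))

def courses_published_since(course_name, start_date):
    # Equality on the partition key selects one partition; the inequality
    # on the clustering key becomes a slice of the sorted rows.
    rows = partitions.get(course_name, [])
    start = bisect_left(rows, (start_date,))
    return [row for _, row in rows[start:]]

# Hypothetical sample data, for illustration only.
insert("cassandra-101", "2016-01-10", "first edition")
insert("cassandra-101", "2017-03-05", "second edition")
insert("hadoop-101", "2016-06-01", "first edition")

print(courses_published_since("cassandra-101", "2016-06-01"))
```

In this sketch the equality column decides where the rows live and the inequality column decides how they are ordered, which mirrors why the two columns take different roles in the key.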
Question : Your data size is 10 TB and your schema design will duplicate this data 10 times, with replication factor = 5. How much data will you end up with in total? 1. 500TB
Explanation: Your data size is 10 TB and the schema design duplicates the entire data set 10 times, so it becomes 100 TB. The replication factor is 5, so the total data size becomes 100 TB × 5 = 500 TB.
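The arithmetic in this explanation can be checked with a short sketch. The function name and parameter names are made up for illustration; the inputs are the 10 TB data size, the 100 TB duplicated size (10 copies), and the replication factor of 5 from the explanation.

```python
def total_storage_tb(raw_tb, schema_copies, replication_factor):
    # The data set is first multiplied by the schema-level duplication,
    # then every duplicated copy is stored replication_factor times.
    return raw_tb * schema_copies * replication_factor

print(total_storage_tb(10, 10, 5))  # 500
```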
Question : You have designed the following table:
Comment_By_course
  comment_id    K
  course_id     C
  Title
  description
For each comment_id, you will be copying the entire course detail again. Is this an issue?
Explanation: Yes, this is an issue. There is no limit on the number of comments a user can post, so copying the entire course info for each comment is a concern. The growth is a multiple of the number of comments, which is not a constant value. If a user posts 1000 comments, the entire course info is copied 1000 times, and that is not good for database growth.
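A rough sketch of that growth, assuming a hypothetical 2,000 bytes of course detail per row (the real size is not given in the question):

```python
def duplicated_course_bytes(num_comments, course_info_bytes):
    # In Comment_By_course every comment row carries its own copy of the
    # course detail, so the overhead grows linearly with the comment count.
    return num_comments * course_info_bytes

# 2_000 bytes per course record is an assumed figure for illustration.
for n in (1, 100, 1000):
    print(n, duplicated_course_bytes(n, 2_000))
```

Because the multiplier is the number of comments rather than a fixed constant, the duplicated volume has no upper bound.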