Question : What is the benefit of the Hadoop framework 1. Can process PetaBytes of data volume 2. Can store PetaBytes of Structured and Unstructured data 3. Access Mostly Uused Products by 50000+ Subscribers 4. All of the above
Question : In which of the below scenerio Hadoop framework should not be used 1. Working with small and structured datasets 2. If there is a requirement of transaction 3. Access Mostly Uused Products by 50000+ Subscribers 4. 1 and 2 5. 1 , 2 and 3
DistCp (distributed copy) is a tool used for large inter/intra-cluster copying. It uses Map/Reduce to effect its distribution, error handling and recovery, and reporting. It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list. Its Map/Reduce pedigree has endowed it with some quirks in both its semantics and execution. The purpose of this document is to offer guidance for common tasks and to elucidate its model.