Explanation: Technically, it is possible to run all the Hadoop daemons on a single node; however, it is not advisable.
Refer to HadoopExam.com Recorded Training Modules 2, 14 and 16.
Question : MapR services produce metrics that can be written to an output file. Which of the following metrics can be collected?
A. CLDB
B. File Servers
C. Hive Query Statistics
D. Flume job statistics
1. A,B
2. B,C
4. A,D
5. B,D
Correct Answer : 1
Explanation: By default, the CLDB and FileServer metrics are sent via unicast to the Ganglia gmon server running on localhost. To send the metrics directly to a Gmeta server, change the cldb.servers property to the hostname of the Gmeta server. To send the metrics to a multicast channel, change the cldb.servers property to the IP address of the multicast channel.
Question : MapReduce v2 (MRv2/YARN) is designed to address which two issues?
A. Single point of failure in the NameNode.
B. Resource pressure on the JobTracker.
C. HDFS latency.
D. Ability to run frameworks other than MapReduce, such as MPI.
E. Reduce complexity of the MapReduce APIs.
F. Standardize on a single MapReduce API.
1. Pig is more powerful and allows certain types of data manipulation that are not possible with MapReduce.
2. Pig has the same capabilities as MapReduce. Pig programs are executed as MapReduce jobs via the Pig interpreter.
4. Pig provides the additional capability of controlling the flow of multiple MapReduce jobs and chaining them, which is not possible with MapReduce alone.
1. Input file splits may cross line breaks. A line that crosses file splits is ignored.
2. The input file is split exactly at the line breaks, so each RecordReader will read a series of complete lines.
4. Input file splits may cross line breaks. A line that crosses file splits is read by the RecordReader of the split that contains the end of the broken line.
5. Input file splits may cross line breaks. A line that crosses file splits is read by the RecordReader of the split that contains the beginning of the broken line.
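The options above differ on which reader handles a line that straddles a split boundary. The following is a minimal Python simulation of the "beginning of the line" convention, not Hadoop's actual RecordReader code: the function name `read_split`, the sample data, and the split offsets are all hypothetical and chosen only for illustration. It assumes byte-offset splits and `\n` line endings.

```python
def read_split(data: bytes, start: int, end: int) -> list[bytes]:
    """Return the lines owned by the split [start, end) of data.

    Simplified model: a line belongs to the split that contains its first byte.
    """
    pos = start
    if start > 0:
        # Back up one byte and discard through the next newline, so a line
        # that began in the previous split is left to that split's reader.
        nl = data.find(b"\n", start - 1)
        pos = len(data) if nl == -1 else nl + 1
    lines = []
    # The reader may run past `end` to finish a line that started before it.
    while pos < end and pos < len(data):
        nl = data.find(b"\n", pos)
        stop = len(data) if nl == -1 else nl
        lines.append(data[pos:stop])
        pos = stop + 1
    return lines


# Hypothetical input: the second line crosses the boundary at byte 10.
data = b"alpha\nbravo charlie\ndelta\n"
splits = [(0, 10), (10, 20), (20, len(data))]
for start, end in splits:
    print((start, end), read_split(data, start, end))
```

In this model the first split reads past its end to finish the broken line, the second split skips the partial line it starts in, and every line is read exactly once.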
1. Increase the parameter that controls minimum split size in the job configuration.
2. Write a custom MapRunner that iterates over all key-value pairs in the entire file.
4. Write a custom FileInputFormat and override the method isSplitable to always return false.
1. A SequenceFile contains a binary encoding of an arbitrary number of homogeneous Writable objects.
2. A SequenceFile contains a binary encoding of an arbitrary number of heterogeneous Writable objects.
4. A SequenceFile contains a binary encoding of an arbitrary number of key-value pairs. Each key must be the same type. Each value must be the same type.