Correct Answer : Get Lastest Questions and Answer : Explanation: It is just take data as input and convert into key-value pair. For each record, it chomp the last character (\n), and create a new key and value variables as output to the split method (split on \t). While printing "key:" followed by the key and "value:" followed by the value to standard out. Here key and value strings are separated by the \t, and the key-value pair is terminated by a \n. For intermediate results of the streaming job.
Question : Which is the following is/are correct way to debug streaming job
1. We need to check mapper and reducer script can run on its own by feeding it input on standard in
Correct Answer : Get Lastest Questions and Answer : Explanation: General approaches to debug hadoop streaming code We must make sure that mapper and reducer script can run on it's own by feeding it input on standard in. As with any program, we should always test with bad data for instance data is ' not formatted according to what map and reduce scripts expect. Test the map and reduce functions in the hadoop framework by using "identity" mapper and reducers accordingly.
Question : How to monitor streaming jobs using counter
1. update counters from within your map and reduce scripts, with the string "reporter:counter"
2. update status from within your map and reduce scripts, with the string "reporter:status"
1. Binary data can be used directly by a map-reduce job. Often binary data is added to a sequence file 2. Binary data cannot be used by Hadoop framework. Binary data should be converted to a Hadoop compatible format prior to loading 3. Access Mostly Uused Products by 50000+ Subscribers 4. Hadoop can freely use binary files with map-reduce jobs so long as the files have headers
1. The Hadoop administrator has to set the number of the reducer slot to zero on all slave nodes. This will disable the reduce step. 2. It is impossible to disable the reduce step since it is critical part of the Map-Reduce abstraction. 3. Access Mostly Uused Products by 50000+ Subscribers 4. While you cannot completely disable reducers you can set output to one. There needs to be at least one reduce step in Map-Reduce abstraction.