Q1. What is a Sequence File?

a. A Sequence File contains a binary encoding of an arbitrary number of homogeneous Writable objects. b. A Sequence File contains a binary encoding of an arbitrary number of key-value pairs. Each key must be of the same type. Each value must be of the same type. c. A Sequence File contains a binary encoding of an arbitrary number of heterogeneous Writable objects. d. A Sequence File contains a binary encoding of an arbitrary number of WritableComparable objects, in sorted order.

Ans. A Sequence File contains a binary encoding of an arbitrary number of key-value pairs. Each key must be of the same type. Each value must be of the same type.

Q2. What is the input to the Reduce function?

a. One key and one value b. Multiple keys and multiple associated values c. Multiple keys and one associated value with each d. One key and associated values.

Ans. One key and associated values.
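To make the answer concrete, here is a minimal sketch in plain Python that imitates the grouping step before a reduce call. It is not the Hadoop API; the names `shuffle`, `reduce_sum` and the sample data are hypothetical, chosen only for illustration. The point is that each reduce call receives exactly one key together with all the values emitted for that key.

```python
from collections import defaultdict

def shuffle(pairs):
    """Group mapper output by key, as the framework does before reducing."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_sum(key, values):
    """Hypothetical reducer: one key in, all of its values in, one result out."""
    return key, sum(values)

# Assumed mapper output for a word count: (word, 1) pairs.
mapped = [("apple", 1), ("pear", 1), ("apple", 1)]

# Each reduce call sees a single key and the list of its associated values.
results = [reduce_sum(key, values) for key, values in sorted(shuffle(mapped).items())]
```

In this sketch the reducer for "apple" is called once with the values [1, 1], never with several keys at the same time.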

Q3. Which of the following is the implementation language of the Hadoop MapReduce framework?

a. Big Data b. Hadoop c. Java d. C++

Ans. Java

Q4. How would you process a 20 GB file with an application that only has access to 4 GB of memory?

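Ans. Load the file in chunks that fit in memory and process one chunk at a time. If we need analytics over the whole file, we compute a partial result for each chunk and then combine the partial results into the final answer.

For example, suppose we need the average of all marks in the file. We split the file into chunks that fit comfortably in memory (each smaller than 4 GB) and record the sum and the count of marks for each chunk. The final average is the total of the sums divided by the total of the counts. Averaging the per-chunk averages directly is correct only when every chunk holds the same number of marks.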
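The chunked average can be sketched as follows. This is a minimal illustration, assuming the file holds one numeric mark per line; the function name `chunked_average` and the `chunk_lines` parameter are hypothetical. It keeps a running sum and count instead of averaging per-chunk averages, so chunks of unequal size do not skew the result.

```python
def chunked_average(path, chunk_lines=1_000_000):
    """Average one number per line, holding at most chunk_lines values in memory."""
    total = 0.0   # running sum across all chunks
    count = 0     # running number of marks across all chunks
    chunk = []
    with open(path) as f:
        for line in f:          # the file object streams lines; it is never fully loaded
            line = line.strip()
            if not line:
                continue        # skip blank lines
            chunk.append(float(line))
            if len(chunk) >= chunk_lines:
                # Fold the finished chunk into the partial results, then free it.
                total += sum(chunk)
                count += len(chunk)
                chunk = []
    # Fold in the last, possibly smaller, chunk.
    total += sum(chunk)
    count += len(chunk)
    return total / count if count else None
```

For a single running average the chunk list is not strictly needed, since the sum and count could be updated line by line. It is kept here to mirror the chunk-by-chunk approach described in the answer, which carries over to work that needs a whole chunk in memory at once.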

