BigData - Interview Questions and Answers for 'Hadoop' | Search Interview Question - javasearch.buggybread.com
Javasearch.buggybread.com

Search Interview Questions


 More than 3000 questions in repository.
 There are more than 900 unanswered questions.
Click here and help us by providing the answer.
 Have a video suggestion.
Click Correct / Improve and please let us know.
Label / Company      Label / Company / Text

   



BigData - Interview Questions and Answers for 'Hadoop' - 17 question(s) found - Order By Newest

 Q1. What is the difference between namenode and datanode in Hadoop? BigData
Ans. NameNode stores MetaData (No of Blocks, On Which Rack which DataNode is stored etc) whereas the DataNode stores the actual Data.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   at&t     Asked in 2 Companies


 Q2. Which is the default Input Formats defined in Hadoop ?

a. SequenceFileInputFormat
b. ByteInputFormat
c. KeyValueInputFormat
d. TextInputFormat
Ans. TextInputFormat

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   hadoop input format


 Q3. Which of the following is not an input format in Hadoop ?

a. TextInputFormat
b. ByteInputFormat
c. SequenceFileInputFormat
d. KeyValueInputFormat
Ans. ByteInputFormat

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   hadoop input format


 Q4. Which of the following is a valid flow in Hadoop ?

a. Input -> Reducer -> Mapper -> Combiner -> -> Output
b. Input -> Mapper -> Reducer -> Combiner -> Output
c. Input -> Mapper -> Combiner -> Reducer -> Output
d. Input -> Reducer -> Combiner -> Mapper -> Output
Ans. Input -> Mapper -> Combiner -> Reducer -> Output

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   map reduce


 Q5. MapReduce was devised by ...

a. Apple
b. Google
c. Microsoft
d. Samsung
Ans. Google

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   map reduce


 Q6. Which of the following is not a phase of Reducer ?

a. Map
b. Reduce
c. Shuffle
d. Sort
Ans. Map

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   map reduce


 Q7. How many instances of Job tracker can run on Hadoop cluster ?
a. 1
b. 2
c. 3
d. 4
Ans. 1

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   hadoop cluster


 Q8. Which of the following is not the Dameon process that runs on a hadoop cluster ?

a. JobTracker
b. DataNode
c. TaskTracker
d. TaskNode
Ans. TaskNode

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   hadoop cluster


 Q9. What is a Sequence File?

a. A Sequence File contains a binary encoding of an arbitrary number of homogeneous writable objects.
b. A Sequence File contains a binary encoding of an arbitrary number key-value pairs. Each key must be the same type. Each value must be of same type.
c. A Sequence File contains a binary encoding of an arbitrary number of heterogeneous writeable objects.
d. A Sequence File contains a binary encoding of an arbitrary number of Writable Comparable objects, in sorted order.
Ans. A Sequence File contains a binary encoding of an arbitrary number key-value pairs. Each key must be the same type. Each value must be of same type.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   bigdata   big data


 Q10. What is the input to the Reduce function ?

a. One Key and One Value
b. Multiple Keys and Multiple associated Values
c. Multiple Keys and One associated values with each
d. One key and associated values.
Ans. One key and associated values.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   bigdata   big data   map-reduce   map reduce   reduce function


 Q11. Which of the following is the implementation language for Map Reduce Framework ?

a. Big Data
b. Hadoop
c. Java
d. C++
Ans. Java

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop   bigdata   big data   map-reduce   map reduce framework


 Q12. What are the two main components of Hadoop System ?BigData
Ans. Distributed file system and Map Reduce system.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     hadoop  bigdata  hadoop system components        frequent


 Q13. What is Hadoop Framework ?BigData
Ans. Hadoop is an open source framework , written in java by apche software foundation. This framework is used to write applications to process vast amount of data. Processing happens in parallel on large clusters which could have 1000 of computers. It processes data in a very reliable and fault tolerant manner.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     Hadoop framework  BigData


 Q14. What is the use of Combiners ?BigData
Ans. Combiners are used to increase the efficiency of a Map Reduce program. They are used to aggregate intermediate map output locally on individual mapper outputs. Combiners can help you reduce the amount of data that needs to be transferred across to the reducers.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     Hadoop Systems  BigData   hadoop combiners     Asked in 1 Companies


 Q15. how many reducer task runs on a hadoop cluster ?BigData
Ans. Generally one reducer runs for all mappers, but it can be increased as per requirements.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     Hadoop  BigData     Asked in 1 Companies


 Q16. Difference between hadoop 1.x and 2.x?Apache Hadoop
Ans. Hadoop 1.x

It supports only MapReduce (MR) processing model.
It has limited scaling of nodes. Limited to 4000 nodes per cluster.
It has single Namenode to manage the entire namespace.
It has Single-Point-of-Failure (SPOF)
Works on concepts of slots – slots can run either a Map task or a Reduce task
MR has to do both processing and cluster resource management.

Hadoop 2.x

It supports MR as well as other distributed computing models like Spark, Hama, etc
It has better scalability. Scalable up to 10000 nodes per cluster.
Works on concepts of containers. Using containers can run generic tasks.
It has Multiple Namenode servers manage multiple namespace.
YARN does cluster resource management and processing is done using different processing models.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     


 Q17. Difference between group and cogroup?Apache Hadoop
 This question is still unanswered. Can you please provide an answer.


 Help us improve. Please let us know the company, where you were asked this question :   

   Like         Discuss         Correct / Improve     



Help us and Others Improve. Please let us know the questions asked in any of your previous interview.

Any input from you will be highly appreciated and It will unlock the application for 10 more requests.

Company Name:
Questions Asked: