Java - Interview Questions and Answers for 'Bigdata' | Search Java Interview Question - javasearch.buggybread.com
Javasearch.buggybread.com
Share

Search Java Interview Questions


 2098 questions in repository.
 There are more than 200 unanswered questions.
Click here and help us by providing the answer.
Label / Company      Label / Company / Text

   



Interview Questions and Answers for 'Bigdata' - 21 question(s) found - Order By Newest

 Q1. What is the difference between namenode and datanode in Hadoop? BigData
Admin
info@buggybread.com
Ans. NameNode stores MetaData (No of Blocks, On Which Rack which DataNode is stored etc) whereas the DataNode stores the actual Data.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     hadoop   at&t     Barclays  AT&T


 Q2. What is Apache Kafka ?BigData
admin
info@buggybread.com
Ans. Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a messaging system through messages being written to logs.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka   bigdata   message queue     American Express  Digital Reasoning  HMS Holdings  MarkMonitor


 Q3. What is a broker in Apache Kafka ?
admin
info@buggybread.com
Ans. Kafka is run as a cluster comprised of one or more servers each of which is called a broker

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka   bigdata   message queue


 Q4. What is a Topic in Apache Kafka ?
admin
info@buggybread.com
Ans. A topic is a category or feed name to which messages are published

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka   bigdata   message queue


 Q5. Have you used Kafka in your project ? If Yes, for what ?
admin
info@buggybread.com
Ans. We were using Kafka as a replacement for JMS Message Queue for better throughput. We were just using a simple Java multi threaded client as order of message consumption didn't matter to us.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka   bigdata   message queue   java


 Q6. Does Kafka uses ZooKeeper ?
admin
info@buggybread.com
Ans. Yes

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka   bigdata   yes-no   yes no


 Q7. What is a Sequence File?

a. A Sequence File contains a binary encoding of an arbitrary number of homogeneous writable objects.
b. A Sequence File contains a binary encoding of an arbitrary number key-value pairs. Each key must be the same type. Each value must be of same type.
c. A Sequence File contains a binary encoding of an arbitrary number of heterogeneous writeable objects.
d. A Sequence File contains a binary encoding of an arbitrary number of Writable Comparable objects, in sorted order.
Anonymous
Ans. A Sequence File contains a binary encoding of an arbitrary number key-value pairs. Each key must be the same type. Each value must be of same type.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     hadoop   bigdata   big data


 Q8. What is the input to the Reduce function ?

a. One Key and One Value
b. Multiple Keys and Multiple associated Values
c. Multiple Keys and One associated values with each
d. One key and associated values.
Anonymous
Ans. One key and associated values.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     hadoop   bigdata   big data   map-reduce   map reduce   reduce function


 Q9. Which of the following is the implementation language for Map Reduce Framework ?

a. Big Data
b. Hadoop
c. Java
d. C++
Anonymous
Ans. Java

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     hadoop   bigdata   big data   map-reduce   map reduce framework



Do you think these are the Best Java Frameworks ?

OpenXavaSPRING MVCApache StripesCheck everything
that is Best in Java

Click Here



 Q10. Can we have multiple threads consuming message stream from a single partition ?

Ans. Yes, by having multiple Consumer Groups.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     apache kafka  kafka  kafka consumer  kafka topic partitions  bigdata  consumer group


 Q11. Can we have multiple threads consuming messages from a single partition if we have single Consumer Group ?

Ans. No, we can only have max 1 thread per partition in a single Consumer Group.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     apache kafka  kafka  kafka consumer  kafka topic partitions  bigdata  consumer group


 Q12. If we have more threads than partitions in a kafka consumer, How can we model it efficiently ?

Ans. We will have to use multiple consumer groups in that case as threads will remain idle if we use single consumer group. A more sophisticated algorithm could be required with multiple groups if we have to ensure the order of consumption.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     apache kafka  kafka  kafka consumer  kafka topic partitions  bigdata  consumer group


 Q13. What are the two main components of Hadoop System ?BigData2016-10-17 09:56:41

Ans. Distributed file system and Map Reduce system.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     hadoop  bigdata  hadoop system components        frequent


 Q14. What is Hadoop Framework ?BigData2016-10-17 09:58:36

Ans. Hadoop is an open source framework , written in java by apche software foundation. This framework is used to write applications to process vast amount of data. Processing happens in parallel on large clusters which could have 1000 of computers. It processes data in a very reliable and fault tolerant manner.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     Hadoop framework  BigData


 Q15. Can you narrate some sample usage for Bigdata ?BigData2016-10-17 10:02:43

Ans. One common usage is predictive analytic using huge current or past data. For example - Using recent medical data ( diagnosis and procedure ), one can identify the pattern of diseases or the procedures that has to eventually applied upon certain diagnosis. This analysis might help in predicting the diseases that might occur to a patient. the other usage could be to identify the future spending patterns of the population by analyzing the past and current habits.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     BigData


 Q16. What is the use of Combiners ?BigData2016-10-17 10:03:41

Ans. Combiners are used to increase the efficiency of a Map Reduce program. They are used to aggregate intermediate map output locally on individual mapper outputs. Combiners can help you reduce the amount of data that needs to be transferred across to the reducers.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     Hadoop Systems  BigData   hadoop combiners     Barclays


 Q17. How would you process a 20 gb file with an application only having access to 4gb memory ?BigData2016-12-13 11:07:23

Ans. Load the file in chunks and then process. If we need to do analytic, we can process analytic information for those chunks and then reprocess the processed information from each chunk.

For example - we need to average all marks in the file. We can divide the file and load into 5 chunks and calculate average for each chunk. Then we can collect averages for all 5 chunks and then calculate the final average.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     bigdata  processing big data     Alibaba


 Q18. how many reducer task runs on a hadoop cluster ?BigData2017-03-03 07:45:50

Ans. Generally one reducer runs for all mappers, but it can be increased as per requirements.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     Hadoop  BigData     Wipro


 Q19. Have you ever heard of Kafka connect ?BigData2017-04-20 08:57:40

Ans. Yes, it's a project by confluent that provides in built mechanism for streaming records from bigdata data source to apache kafka message queues and vice versa. It provides a variety of source and sink connectors to achieve this.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka  kafka connect



Do you think these are the Best Java Frameworks ?

OpenXavaSPRING MVCApache StripesCheck everything
that is Best in Java

Click Here



 Q20. What are the source and sink connectors in Kafka connect ? BigData2017-04-20 08:59:40

Ans. Source connectors are the connectors that are used to get information from the source whereas sink connectors are used to deploy information to the destination.

 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     kafka connect


 Q21. Is there any schema in mongo DB ?MongoDB2017-08-27 20:30:13

 This question was recently asked at 'Tapp Me'.This question is still unanswered. Can you please provide an answer.


 Help us improve. Please let us know the company, where you were asked this question :   

   Like      Discuss      Correct / Improve     mongodb  nosql  bigdata     Tapp Me




Subscribe to Java News and Posts. Get latest updates and posts on Java from Buggybread.com
Enter your email address:
Delivered by FeedBurner



comments powered by Disqus
 

Help us and Others Improve. Please let us know the questions asked in any of your previous interview.

Any input from you will be highly appreciated and It will unlock the application for 10 more requests.

Company Name:
Questions Asked:
         

X Close this

X Close this

Help Us Improve.
Please share your
interview experience.

Company Name:   


Questions Asked: