|Bigdata - Interview Questions and Answers for 'Alibaba' - 1 question(s) found - Order By Newest|
|Ans. Load the file in chunks and process each chunk in turn. For analytics, compute a partial result for each chunk and then combine those partial results into the final answer. |
For example, suppose we need the average of all marks in the file. We can split the file into 5 chunks and compute the sum and the count of marks for each chunk. The final average is the total of the 5 sums divided by the total of the 5 counts. Averaging the 5 per-chunk averages directly gives the same result only when every chunk holds the same number of marks.
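The chunked average described above can be sketched as follows. This is a minimal illustration, not a definitive implementation: the helper names `read_in_chunks` and `chunked_average`, the chunk size, and the sample marks are all assumptions made for the example. It keeps only a running sum and count, so no more than one chunk is held in memory at a time.

```python
from typing import Iterable, Iterator, List


def read_in_chunks(marks: Iterable[float], chunk_size: int) -> Iterator[List[float]]:
    """Yield successive lists of at most chunk_size marks (hypothetical helper)."""
    chunk: List[float] = []
    for mark in marks:
        chunk.append(mark)
        if len(chunk) == chunk_size:
            yield chunk
            chunk = []
    if chunk:
        # The last chunk may be smaller than the others.
        yield chunk


def chunked_average(marks: Iterable[float], chunk_size: int) -> float:
    """Average all marks by combining per-chunk sums and counts."""
    total = 0.0
    count = 0
    for chunk in read_in_chunks(marks, chunk_size):
        # Partial result for this chunk: its sum and its size.
        total += sum(chunk)
        count += len(chunk)
    if count == 0:
        raise ValueError("no marks to average")
    return total / count


# Sample data assumed for illustration; chunks are [70, 80, 90], [60, 50, 100], [40].
sample_marks = [70, 80, 90, 60, 50, 100, 40]
print(chunked_average(sample_marks, chunk_size=3))  # 490 / 7 = 70.0
```

For a real file, `marks` could be a generator that reads and parses one line at a time, so the whole file never has to fit in memory.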
|What is Apache Kafka ?|
|What is a broker in Apache Kafka ?|
|What is a Topic in Apache Kafka ?|
|Have you used Kafka in your project ? If Yes, for what ?|
|Does Kafka use ZooKeeper ?|
|What is a Sequence File?|
|What is the input to the Reduce function ?|
|Which of the following is the implementation language for Map Reduce Framework ?|
|Can we have multiple threads consuming message stream from a single partition ?|
|Can we have multiple threads consuming messages from a single partition if we have single Consumer Group ?|