Apache Kafka - Interview Questions and Answers for 'Apache'

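Q1. Is Apache an application server or a web server?

Ans. A web server. The Apache HTTP Server serves content over HTTP; it is not a Java EE application server, although it is often placed in front of one.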

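Q2. How can we make sure that a string contains only numerals?

Core Java

Ans. We can use NumberUtils.isDigits from Apache Commons Lang, which returns true only when every character is a digit. NumberUtils.isNumber, also in Commons Lang, is less strict: it accepts any valid Java number, including decimals, signs, exponents and hex literals such as 0x1F, and recent Commons Lang releases deprecate it in favor of NumberUtils.isCreatable.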
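The same check can also be written without any library. The following is a minimal sketch, assuming that "numerals" means the ASCII digits 0 to 9 and that an empty or null string should be rejected; the class and method names are hypothetical, not part of Apache Commons.

```java
final class DigitCheck {
    // Returns true only when s is non-empty and every character is an ASCII digit 0-9.
    // Assumption: signs, decimal points, whitespace and hex prefixes all count as non-numerals.
    static boolean isDigitsOnly(String s) {
        if (s == null || s.isEmpty()) {
            return false;
        }
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c < '0' || c > '9') {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(isDigitsOnly("12345")); // all digits
        System.out.println(isDigitsOnly("12.5"));  // contains a decimal point
        System.out.println(isDigitsOnly("0x1F"));  // hex literal, not only numerals
    }
}
```

A check like this treats "12.5" and "0x1F" as non-numeric, which is the stricter reading of the question.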

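Q3. Can we have multiple threads consuming the message stream from a single partition?

Ans. Yes, by placing the threads in different consumer groups. Each consumer group receives its own full copy of the stream, so every such thread sees every message in the partition.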

Q4. Can we have multiple threads consuming messages from a single partition if we have a single consumer group?

Ans. No. Within a single consumer group, a partition is assigned to at most one consumer at a time, so at most one thread reads it.

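Q5. If we have more threads than partitions in a Kafka consumer, how can we model it efficiently?

Ans. Within a single consumer group the extra threads stay idle, because each partition is assigned to only one consumer. Putting the extra threads in other consumer groups keeps them busy, but every group receives every message, so the work is repeated rather than shared, and a more sophisticated algorithm is required if the order of consumption must be preserved across groups. To share the work, we can increase the number of partitions, or keep one consuming thread per partition and hand the records to a pool of worker threads.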
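The idle-thread effect inside a single consumer group can be illustrated with a toy model. This is only a sketch under stated assumptions: it deals partitions to consumers round-robin, which is not Kafka's actual assignor, and the class, method and consumer names are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class AssignmentModel {
    // Toy model of one consumer group: partitions 0..partitions-1 are dealt
    // round-robin, and each partition goes to exactly one consumer.
    static Map<String, List<Integer>> assign(int partitions, int consumers) {
        if (consumers <= 0) {
            throw new IllegalArgumentException("need at least one consumer");
        }
        Map<String, List<Integer>> result = new LinkedHashMap<>();
        for (int c = 0; c < consumers; c++) {
            result.put("consumer-" + c, new ArrayList<>());
        }
        for (int p = 0; p < partitions; p++) {
            result.get("consumer-" + (p % consumers)).add(p);
        }
        return result;
    }

    // Counts consumers that received no partition and therefore sit idle.
    static long idleConsumers(Map<String, List<Integer>> assignment) {
        return assignment.values().stream().filter(List::isEmpty).count();
    }

    public static void main(String[] args) {
        Map<String, List<Integer>> a = assign(3, 5);
        System.out.println(a);
        System.out.println("idle consumers: " + idleConsumers(a));
    }
}
```

With 3 partitions and 5 consumers in this model, two consumers end up with an empty assignment, which is why adding threads beyond the partition count does not add throughput within one group.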

Q6. What is Apache Cassandra?

Cassandra

Ans. Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts of data while providing high availability with no single point of failure.

Q7. What is the difference between the Java SE Map and the Apache Commons MultiMap? How can we implement functionality similar to MultiMap using a Java SE Map?

Core Java

Ans. A MultiMap can hold several values under the same key, whereas a java.util.Map holds at most one value per key; putting a second value under an existing key replaces the first.

To get similar behavior from a Java SE Map, we can make the value a collection of values, for example a Map<K, List<V>>, instead of a single value.
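A minimal sketch of the "Map with a collection of values" idea, using only java.util. The SimpleMultimap class and its method names are hypothetical and are not the Apache Commons API; the choice of List as the value collection (which keeps duplicates and insertion order) is an assumption.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class SimpleMultimap<K, V> {
    // Each key maps to a list, so one key can carry several values.
    private final Map<K, List<V>> store = new HashMap<>();

    // Adds a value under the key, creating the list on first use.
    void put(K key, V value) {
        store.computeIfAbsent(key, k -> new ArrayList<>()).add(value);
    }

    // Returns a read-only view of the values for the key (empty if the key is absent).
    List<V> get(K key) {
        return Collections.unmodifiableList(
                store.getOrDefault(key, Collections.emptyList()));
    }

    public static void main(String[] args) {
        SimpleMultimap<String, String> m = new SimpleMultimap<>();
        m.put("fruit", "apple");
        m.put("fruit", "banana");
        System.out.println(m.get("fruit"));
        System.out.println(m.get("vegetable"));
    }
}
```

Using a Set instead of a List as the value type would drop duplicate values per key, if that is the behavior wanted.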

Q8. How can we make sure that we don't receive duplicate messages from a Kafka topic?

Apache Kafka

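Ans. With at-least-once delivery, Kafka can redeliver a message after a retry or a consumer rebalance, so duplicates are usually handled rather than ruled out. On the producer side, enable.idempotence=true stops retries from writing the same record twice. On the consumer side, we can make processing idempotent, for example by skipping messages whose unique ID has already been handled, or use Kafka transactions together with isolation.level=read_committed.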
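One common way to tolerate duplicates is to make the consumer's processing idempotent. The following is a minimal sketch, assuming each message carries a unique ID chosen by the application; the class and method names are hypothetical, and the in-memory set stands in for what would normally be a durable store.

```java
import java.util.HashSet;
import java.util.Set;

final class DedupingHandler {
    // IDs of messages already processed. Assumption: kept in memory here;
    // a real system would persist them so they survive restarts.
    private final Set<String> seenIds = new HashSet<>();
    private int processedCount = 0;

    // Processes the message once; returns false when the ID was seen before
    // and the message is skipped as a duplicate.
    boolean handle(String messageId, String payload) {
        if (!seenIds.add(messageId)) {
            return false;
        }
        processedCount++; // placeholder for the real side effect
        return true;
    }

    int processedCount() {
        return processedCount;
    }

    public static void main(String[] args) {
        DedupingHandler h = new DedupingHandler();
        System.out.println(h.handle("order-1", "create"));
        System.out.println(h.handle("order-1", "create")); // redelivery of the same message
        System.out.println(h.processedCount());
    }
}
```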

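Q9. How can we make sure that a message has been consumed properly from Kafka?

Apache Kafka

Ans. We can have the consumer send a confirmation back to Kafka only after it has processed the message, so that the offset is committed only after the confirmation is received. In practice this means disabling automatic offset commits (enable.auto.commit=false) and committing the offset explicitly once processing succeeds.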
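The confirm-then-commit idea can be modeled without a broker. This is a sketch only: the list stands in for a partition log, the field stands in for the committed offset, and all names are hypothetical rather than Kafka client API.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Consumer;

final class CommitAfterProcessing {
    // Offset of the next record to read; survives a failed poll in this model.
    private long committedOffset = 0;

    // Reads from the committed offset onward and advances the offset only
    // after the handler returns normally for a record.
    void poll(List<String> log, Consumer<String> handler) {
        for (long offset = committedOffset; offset < log.size(); offset++) {
            handler.accept(log.get((int) offset)); // may throw; offset then stays put
            committedOffset = offset + 1;          // "commit" after successful processing
        }
    }

    long committedOffset() {
        return committedOffset;
    }

    public static void main(String[] args) {
        CommitAfterProcessing consumer = new CommitAfterProcessing();
        consumer.poll(Arrays.asList("a", "b", "c"), System.out::println);
        System.out.println("committed offset: " + consumer.committedOffset());
    }
}
```

If the handler throws on a record, the offset is left pointing at that record, so the next poll retries it instead of skipping it. The cost of this ordering is that a record may be processed more than once.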

Q10. Kafka delivers at least once by default. What other delivery policies can be configured with Kafka?

Apache Kafka

Ans. Exactly once and at most once.
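As a rough illustration, the policies map onto client settings along these lines. The property keys are standard Kafka client configuration names, but the grouping into helper methods and the class name are hypothetical, and a working client needs further settings (bootstrap servers, serializers, group ID) that are omitted here.

```java
import java.util.Properties;

final class DeliveryConfigs {
    // Consumer with manual offset commits. Committing before processing gives
    // at-most-once behavior; committing after processing gives at-least-once.
    static Properties manualCommitConsumer() {
        Properties p = new Properties();
        p.setProperty("enable.auto.commit", "false");
        return p;
    }

    // Producer side of exactly-once: idempotent, fully acknowledged, transactional writes.
    static Properties transactionalProducer(String transactionalId) {
        Properties p = new Properties();
        p.setProperty("enable.idempotence", "true");
        p.setProperty("acks", "all");
        p.setProperty("transactional.id", transactionalId);
        return p;
    }

    // Consumer side of exactly-once: read only records from committed transactions.
    static Properties readCommittedConsumer() {
        Properties p = manualCommitConsumer();
        p.setProperty("isolation.level", "read_committed");
        return p;
    }

    public static void main(String[] args) {
        System.out.println(transactionalProducer("example-tx-1"));
        System.out.println(readCommittedConsumer());
    }
}
```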

Q11. What are the different delivery policies in Apache Kafka?

Apache Kafka

Ans. Exactly once, at least once, and at most once.

Q12. What is the difference between Hadoop 1.x and 2.x?

Apache Hadoop

Ans. Hadoop 1.x

It supports only the MapReduce (MR) processing model.
It has limited scalability: up to about 4000 nodes per cluster.
It has a single NameNode to manage the entire namespace.
The NameNode is a single point of failure (SPOF).
It works on the concept of slots; a slot can run either a Map task or a Reduce task.
MR has to do both the processing and the cluster resource management.

Hadoop 2.x

It supports MR as well as other distributed computing models such as Spark and Hama.
It has better scalability: up to about 10000 nodes per cluster.
It works on the concept of containers; a container can run generic tasks.
It can have multiple NameNode servers managing multiple namespaces.
YARN does the cluster resource management, and the processing is done by the different processing models.

Q13. What is the difference between group and cogroup?

Apache Hadoop

This question is still unanswered.
