Skip to content

Big Data Interview

The Interview Hacker and Technical guide
  • Home
  • Blogs
  • About Us
  • Contact Us
  • Privacy Policy

Blog

  1. Pages:
  2. «
  3. 1
  4. 2
  5. 3
  6. 4
  7. 5
  8. 6
  9. 7
  10. 8
  11. 9
  12. »

What is Catalyst Optimizer in Spark?

May 13, 2019 admin Leave a comment

One of the common questions in Spark interview. There is a lot about this Catalyst Optimizer. Here in this blog…

Continue Reading →

Posted in: Catalyst Optimizer in Spark, Spark

What is Bucketing in Hive?

admin Leave a comment

What is Bucketing? Why do we need Bucketing? How it is going to improve query performance?   Bucketing Bucketing is…

Continue Reading →

Posted in: Uncategorized

What are the optimization techniques available in Hive?

admin Leave a comment

Similar questions: How can we optimize a Hive job? As we deal with data of size terabytes and petabytes the…

Continue Reading →

Posted in: Uncategorized

What is partitioning in Hive?

May 12, 2019 admin Leave a comment

Why do we need partitioning in Hive? What are the types of partitioning in Hive? Above are example questions that…

Continue Reading →

Posted in: Uncategorized

How do you create immutable object in Java?

admin Leave a comment

Most frequently asked interview question, when you say you have very good knowledge in Java. Below are similar questions: Can…

Continue Reading →

Posted in: Uncategorized

How do you optimize a Spark job?

admin Leave a comment

What are the techniques to optimize a Spark job? This is a super important question for a Big data developer…

Continue Reading →

Posted in: Uncategorized

What is the difference between Map and FlatMap in spark?

admin Leave a comment

One of the most common interview questions in big data developer interviews. I was asked this question in almost all…

Continue Reading →

Posted in: Uncategorized

What is DAG scheduler in Spark?

admin Leave a comment

DAG – Directed Acyclic Graph A DAG comprises of edges and vertices, in which edges represent rdds and  vertices represent…

Continue Reading →

Posted in: Uncategorized

What is RDD in Spark?

admin Leave a comment

Why RDD is immutable? What is the need of RDD in spark?   RDD – Resilient Distributed Dataset What is…

Continue Reading →

Posted in: Uncategorized

What is the need of large block size in Hadoop?

May 11, 2019 admin 1 Comment

Why the block size is large in Hadoop? What is the use of having large block size in Hadoop?  …

Continue Reading →

Posted in: Hadoop
  1. Pages:
  2. «
  3. 1
  4. 2
  5. 3
  6. 4
  7. 5
  8. 6
  9. 7
  10. 8
  11. 9
  12. »

Post navigation

Page 8 of 9
← Previous 1 … 7 8 9 Next →

Follow Us

Contact Us

  • Email
    sparkandbigdatainterview@gmail.com
Privacy Policy
Copyright © 2023 Big Data Interview