Skip to content
Big Data Interview
The Interview Hacker and Technical guide
  • Home
  • Blogs
  • About Us
  • Contact Us
  • Privacy Policy

Category: Uncategorized

What is Paired RDD in Spark?

May 14, 2019 admin Leave a comment

Spark Paired RDD is nothing but an RDD that contains key, value pairs. Key-value pairs are linked data items. Keys…

Continue Reading →

Posted in: Uncategorized

How to create Singleton class in Java?

admin 3 Comments

How do you or can you create a Singleton class in Java? Design patterns are generalized solutions for common development…

Continue Reading →

Posted in: Uncategorized

What are Companion object and Singleton object in Scala?

May 13, 2019 admin Leave a comment

Singleton Object Singleton objects are the objects that are defined using the keyword object before it and we don’t need…

Continue Reading →

Posted in: Uncategorized

What is Whole-Stage CodeGen in Spark?

admin Leave a comment

Whole-Stage CodeGen is also known as Whole-Stage Java Code Generation, which is a physical query optimization phase in Spakr SQL…

Continue Reading →

Posted in: Uncategorized

What is Bucketing in Hive?

admin Leave a comment

What is Bucketing? Why do we need Bucketing? How it is going to improve query performance?   Bucketing Bucketing is…

Continue Reading →

Posted in: Uncategorized

What are the optimization techniques available in Hive?

admin Leave a comment

Similar questions: How can we optimize a Hive job? As we deal with data of size terabytes and petabytes the…

Continue Reading →

Posted in: Uncategorized

What is partitioning in Hive?

May 12, 2019 admin Leave a comment

Why do we need partitioning in Hive? What are the types of partitioning in Hive? Above are example questions that…

Continue Reading →

Posted in: Uncategorized

How do you create immutable object in Java?

admin Leave a comment

Most frequently asked interview question, when you say you have very good knowledge in Java. Below are similar questions: Can…

Continue Reading →

Posted in: Uncategorized

How do you optimize a Spark job?

admin Leave a comment

What are the techniques to optimize a Spark job? This is a super important question for a Big data developer…

Continue Reading →

Posted in: Uncategorized

What is the difference between Map and FlatMap in spark?

admin Leave a comment

One of the most common interview questions in big data developer interviews. I was asked this question in almost all…

Continue Reading →

Posted in: Uncategorized

Post navigation

Page 3 of 4
← Previous 1 2 3 4 Next →

Recent Posts

  • Save action in Spark takes too long time/Save operation spills huge data on to disk and fails with the error “No space left on device”
  • How to set configuration to start Reduce jobs after completion of certain proportion of the Map jobs in Hive or Hadoop?
  • HDFS commands

Recent Comments

  • curry 7 sour patch on Spark groupByKey vs reduceByKey vs aggregateByKey
  • jordan 4 on Hive – Order By vs Sort By vs Cluster By vs Distribute By
  • louboutin shoes on Spark RDD vs Dataframe vs Dataset

Archives

  • August 2021
  • June 2021
  • May 2021
  • January 2021
  • December 2020
  • October 2020
  • July 2020
  • May 2020
  • April 2020
  • March 2020
  • November 2019
  • July 2019
  • June 2019
  • May 2019

Follow Us

Contact Us

  • Email
    sparkandbigdatainterview@gmail.com
Privacy Policy
Copyright © 2023 Big Data Interview