Skip to content
Big Data Interview
The Interview Hacker and Technical guide
  • Home
  • Blogs
  • About Us
  • Contact Us
  • Privacy Policy

Category: Uncategorized

Difference between Vector and ArrayList in Java? (or) ArrayList vs Vector in Java.

May 19, 2019 admin Leave a comment

It is a very common question in Java interviews. But if you has mentioned Java along with your Big data…

Continue Reading →

Posted in: Uncategorized

How to allocate resources for a Spark job? (or) Resources allocation in Spark.

admin Leave a comment

This will be must and should question in Spark interviews. Understanding the resource allocation for a Spark job is very…

Continue Reading →

Posted in: Uncategorized

What is Spark SQL?

May 18, 2019 admin Leave a comment

Spark SQL in one of modules available in Spark that runs on top of Spark Core module for structured data…

Continue Reading →

Posted in: Uncategorized

Difference between fold and reduce (or) reduce vs fold.

admin Leave a comment

Not a common question but lot of developers are confused between these two functions including me. So I thought of…

Continue Reading →

Posted in: Uncategorized

Spark RDD vs Dataframe vs Dataset

admin 1 Comment

Similar questions What is the difference between RDD, Dataframe and Dataset? Compare RDD, Dataframe and Datasets.     Spark release…

Continue Reading →

Posted in: Uncategorized

Why reduce() is implemented as an action and reduceByKey() is as a transformation?

May 17, 2019 admin Leave a comment

Similar questions: What is the difference between reduce() and reduceByKey()? reduce() vs reduceByKey()   The main difference between reduce and…

Continue Reading →

Posted in: Uncategorized

Explain Spark architecture (or) What happens when submit a Spark job?

admin Leave a comment

This is one of the most common interview questions. As a Spark applications developer we must have the idea of…

Continue Reading →

Posted in: Uncategorized

What is the difference between ClassNotFoundException and NoClassDefFoundError? (or) ClassNotFoundException vs NoClassDefFoundError

May 16, 2019 admin Leave a comment

Both ClassNotFoundException and NoClassDefFound will occur when some class is not available at run time. But they occur in different…

Continue Reading →

Posted in: Uncategorized

What are the deployment modes in Spark (Client vs Cluster modes)?

May 15, 2019 admin Leave a comment

Similar Questions What is the difference between Spark cluster mode and client mode? Cluster mode vs Client mode in Spark?…

Continue Reading →

Posted in: Uncategorized

Spark groupByKey vs reduceByKey vs aggregateByKey

admin 1 Comment

Similar questions: What is the differences among groupByKey, reduceByKey and aggregateByKey? goupByKey – While using groupByKey we must be careful…

Continue Reading →

Posted in: Uncategorized

Post navigation

Page 2 of 4
← Previous 1 2 3 4 Next →

Recent Posts

  • Save action in Spark takes too long time/Save operation spills huge data on to disk and fails with the error “No space left on device”
  • How to set configuration to start Reduce jobs after completion of certain proportion of the Map jobs in Hive or Hadoop?
  • HDFS commands

Recent Comments

  • curry 7 sour patch on Spark groupByKey vs reduceByKey vs aggregateByKey
  • jordan 4 on Hive – Order By vs Sort By vs Cluster By vs Distribute By
  • louboutin shoes on Spark RDD vs Dataframe vs Dataset

Archives

  • August 2021
  • June 2021
  • May 2021
  • January 2021
  • December 2020
  • October 2020
  • July 2020
  • May 2020
  • April 2020
  • March 2020
  • November 2019
  • July 2019
  • June 2019
  • May 2019

Follow Us

Contact Us

  • Email
    sparkandbigdatainterview@gmail.com
Privacy Policy
Copyright © 2023 Big Data Interview