Skip to content
Big Data Interview
The Interview Hacker and Technical guide
  • Home
  • Blogs
  • About Us
  • Contact Us
  • Privacy Policy

Month: May 2019

How to the update records in Hive?

May 25, 2019 admin 1 Comment

Most of us might have worked on Hive in projects. But we never get a requirement to update records in…

Continue Reading →

Posted in: Big Data, Hive Filed under: Hive update delete

Repartition vs spark.sql.shuffle.partitions

May 22, 2019 admin Leave a comment

Similar questions: Coalesce vs. spark.sql.shuffle.partitions Coalesce vs repartition vs spark.sql.shuffle.partitions   Not a frequently asked question but there are chances…

Continue Reading →

Posted in: Uncategorized

Explain about fold() method in Spark.

May 20, 2019 admin Leave a comment

Fold method is not commonly used in Scala as we have reduce method, which gives the same functionality. When we…

Continue Reading →

Posted in: Uncategorized

Sqoop Interview Questions and Answers (Part-2)

May 19, 2019 admin Leave a comment

1). How to import large object such as BLOB and CLOB using Sqoop? There are no arguments available for direct…

Continue Reading →

Posted in: Uncategorized

Sqoop Interview Questions and Answers (Part-1)

admin Leave a comment

Sqoop is a common data migration tool used to import data from RDBMS tables to other Hadoop components. It is…

Continue Reading →

Posted in: Uncategorized

Difference between Vector and ArrayList in Java? (or) ArrayList vs Vector in Java.

admin Leave a comment

It is a very common question in Java interviews. But if you has mentioned Java along with your Big data…

Continue Reading →

Posted in: Uncategorized

How to allocate resources for a Spark job? (or) Resources allocation in Spark.

admin Leave a comment

This will be must and should question in Spark interviews. Understanding the resource allocation for a Spark job is very…

Continue Reading →

Posted in: Uncategorized

What is Spark SQL?

May 18, 2019 admin Leave a comment

Spark SQL in one of modules available in Spark that runs on top of Spark Core module for structured data…

Continue Reading →

Posted in: Uncategorized

Difference between fold and reduce (or) reduce vs fold.

admin Leave a comment

Not a common question but lot of developers are confused between these two functions including me. So I thought of…

Continue Reading →

Posted in: Uncategorized

Spark RDD vs Dataframe vs Dataset

admin 1 Comment

Similar questions What is the difference between RDD, Dataframe and Dataset? Compare RDD, Dataframe and Datasets.     Spark release…

Continue Reading →

Posted in: Uncategorized

Post navigation

Page 1 of 4
1 2 … 4 Next →

Recent Posts

  • Save action in Spark takes too long time/Save operation spills huge data on to disk and fails with the error “No space left on device”
  • How to set configuration to start Reduce jobs after completion of certain proportion of the Map jobs in Hive or Hadoop?
  • HDFS commands

Recent Comments

  • curry 7 sour patch on Spark groupByKey vs reduceByKey vs aggregateByKey
  • jordan 4 on Hive – Order By vs Sort By vs Cluster By vs Distribute By
  • louboutin shoes on Spark RDD vs Dataframe vs Dataset

Archives

  • August 2021
  • June 2021
  • May 2021
  • January 2021
  • December 2020
  • October 2020
  • July 2020
  • May 2020
  • April 2020
  • March 2020
  • November 2019
  • July 2019
  • June 2019
  • May 2019

Follow Us

Contact Us

  • Email
    sparkandbigdatainterview@gmail.com
Privacy Policy
Copyright © 2023 Big Data Interview