Big Data Interview
The Interview Hacker and Technical guide
Category: Uncategorized

ROWID vs ROWNUM (or) What is the difference between ROWID and ROWNUM?

March 8, 2020 admin Leave a comment

This post explains the difference between the Oracle pseudocolumns ROWNUM and ROWID. ROWID: 1). ROWID is a…

Posted in: Uncategorized

How to write word count program in Hive? (Or) Write word count program in Hive?

November 19, 2019 admin 11 Comments

In most of your interviews, you might have come across the question of writing a word count program in MapReduce or…

Currying Function in Scala

June 25, 2019 admin Leave a comment

Currying in Scala is a technique of transforming a function that takes multiple arguments into a function that takes only…

Higher Order Functions in Scala

June 24, 2019 admin Leave a comment

Higher-order functions take other functions as parameters or return a function as a result, i.e., passing functions as parameters to other…

What are the differences between Spark 1.x and Spark 2.x?

June 9, 2019 admin Leave a comment

Even though Spark is much faster than Hadoop, Spark 1.6.x has some performance issues that were corrected in Spark…

What happens if we add or delete partitions manually in Hive? (or) What is MSCK repair command in Hive?

June 5, 2019 admin Leave a comment

We never think about these kinds of scenarios when we work with Hive tables in our projects. But in…

Repartition vs spark.sql.shuffle.partitions

May 22, 2019 admin Leave a comment

Similar questions: Coalesce vs. spark.sql.shuffle.partitions; Coalesce vs. repartition vs. spark.sql.shuffle.partitions. Not a frequently asked question, but there are chances…

Explain about fold() method in Spark.

May 20, 2019 admin Leave a comment

The fold method is not commonly used in Scala because we have the reduce method, which gives similar functionality. When we…

Sqoop Interview Questions and Answers (Part-2)

May 19, 2019 admin Leave a comment

1). How to import large objects such as BLOB and CLOB using Sqoop? There are no arguments available for direct…

Sqoop Interview Questions and Answers (Part-1)

admin Leave a comment

Sqoop is a common data migration tool used to import data from RDBMS tables into Hadoop components. It is…

Contact Us

  • Email
    sparkandbigdatainterview@gmail.com
Copyright © 2023 Big Data Interview