Skip to content

Big Data Interview

The Interview Hacker and Technical guide
  • Home
  • Blogs
  • About Us
  • Contact Us
  • Privacy Policy

Blog

  1. Pages:
  2. «
  3. 1
  4. 2
  5. 3
  6. 4
  7. 5
  8. 6
  9. 7
  10. 8
  11. 9
  12. »

Difference between createOrReplaceTempView and registerTempTable (or) createOrReplaceTempView vs registerTempTable.

June 18, 2019 admin Leave a comment

All the functions mentioned below are more or less same functionally, but there very minor differences among them. createOrReplaceTempView createTempView…

Continue Reading →

Posted in: Spark, Spark SQL

map vs mapValues

June 16, 2019 admin Leave a comment

mapValues – This function works with PairRDDs only. So this function always requires an RDD of type RDD[(a,b)]. mapValues functions…

Continue Reading →

Posted in: Scala, Spark

Hive – Order By vs Sort By vs Cluster By vs Distribute By

June 14, 2019 admin 2 Comments

Hive has so many clubbing operations like Order By, Sort By etc. Each clause it’s own uses, advantages and disadvantages.…

Continue Reading →

Posted in: Hive

What are the differences between Spark 1.x and Spark 2.x?

June 9, 2019 admin Leave a comment

Even though Spark is very faster compared to Hadoop, Spark 1.6x has some performance issues which are corrected in Spark…

Continue Reading →

Posted in: Uncategorized

Hive query to get sum of all the positive values and negative values of a column into two different columns

admin Leave a comment

Assume that we have a table as below: Column_name 1 -2 3 -4 5 Need to write a query to…

Continue Reading →

Posted in: Hive, Spark Filed under: Hive

LATERAL VIEW in Hive

admin 1 Comment

Some you will be asked a question in Hive like, we have a table in which one of the columns…

Continue Reading →

Posted in: Hadoop Filed under: Hive, Lateral View

Difference between SparkContext and SparkSession (or) SparkSession vs SparkContext

admin Leave a comment

One of the most commonly asked interview questions. If you are mid-level experienced professional this will be compulsory question. In…

Continue Reading →

Posted in: Spark

Why do companies prefer Cloud platforms in Bigdata processing?

admin Leave a comment

Now a days almost all the companies are using Cloud platforms despite any technology. There are lot reasons behind it.…

Continue Reading →

Posted in: Cloud computing, EMR, Hadoop, S3, Spark

What happens to number of partitions when we union two RDDs?

June 6, 2019 admin 5 Comments

Assume that we have an RDD of m partitions and another RDD of n partitions. When we perform union operation…

Continue Reading →

Posted in: RDD, Spark, UNION

Hive – Miscellaneous interview questions.

June 5, 2019 admin Leave a comment

1). How to convert an external table to internal table? We can achieve this by using below command: ALTER TABLE…

Continue Reading →

Posted in: Hive
  1. Pages:
  2. «
  3. 1
  4. 2
  5. 3
  6. 4
  7. 5
  8. 6
  9. 7
  10. 8
  11. 9
  12. »

Post navigation

Page 5 of 9
← Previous 1 … 4 5 6 … 9 Next →

Follow Us

Contact Us

  • Email
    sparkandbigdatainterview@gmail.com
Privacy Policy
Copyright © 2023 Big Data Interview