This post will explain the difference between the SQL functions rownum and rowid. ROWID : 1). ROWID is a…
In most of your interviews you might have came across the question to write word count program in MapReduce or…
Currying in Scala is a technique of transforming a function that takes multiple arguments into a function that takes only…
Higher Order functions take other functions as parameters and return function as result, i.e., passing functions as parameters to other…
Even though Spark is very faster compared to Hadoop, Spark 1.6x has some performance issues which are corrected in Spark…
We will never think of this kind of scenarios when we work with Hive tables in our projects. But in…
Similar questions: Coalesce vs. spark.sql.shuffle.partitions Coalesce vs repartition vs spark.sql.shuffle.partitions Not a frequently asked question but there are chances…
Fold method is not commonly used in Scala as we have reduce method, which gives the same functionality. When we…
1). How to import large object such as BLOB and CLOB using Sqoop? There are no arguments available for direct…
Sqoop is a common data migration tool used to import data from RDBMS tables to other Hadoop components. It is…