Blog - Page 9 of 10 - Big Data Interview

What is the difference between Map and FlatMap in spark?

May 12, 2019 admin Leave a comment

One of the most common interview questions in big data developer interviews. I was asked this question in almost all…

DAG – Directed Acyclic Graph A DAG comprises of edges and vertices, in which edges represent rdds and vertices represent…

admin Leave a comment

Why RDD is immutable? What is the need of RDD in spark? RDD – Resilient Distributed Dataset What is…

May 11, 2019 admin 1 Comment

Why the block size is large in Hadoop? What is the use of having large block size in Hadoop? …

admin Leave a comment

What is ORC file format? How ORC is better than RC file format? ORC stands for Optimized Record Columnar…

admin Leave a comment

Tell me something about Avro file format. What do you know about Avro files? These are the common question…

May 10, 2019 admin Leave a comment

You may have come across questions like below during any of your spark interview. So to get full knowledge on…

May 8, 2019 admin Leave a comment

Many of us might be thinking is really Java required for a Big Data/Spark/Data engineer interview? If yes, what all…

May 7, 2019 admin Leave a comment

Similar and related questions: How do you cache dataset in Spark? How many ways to cache the data in Spark?…

admin Leave a comment

How can you set number of reducers for Sqoop job? How many reducers did you use for your Sqoop job?…