In this article, we will see the list of questions asked in EY Company Interview for 2+ year of experience candidate in big data field.
Let’s see the Questions:
- What distinguishes wide transformations from narrow transformations in Spark?
- How would you execute an anti join in Spark?
- Could you describe what a semi join is in Spark?
- What are the different types of anti joins?
- How do cache and checkpoint differ in Spark?
- What role does a DAG play in Spark?
- How does speculative execution work in Spark?
- Where is data stored when caching occurs in Spark?
- What are the key components of Spark’s architecture?
- Is it possible to uncache a DataFrame after caching it? How?
- Why is it important to remove cached data in Spark?
- What are the consequences of forgetting to uncache data in Spark? How does it impact performance?
- What are RDDs in Spark?
- In SQL, what is the specific purpose of window functions?
- Can you explain the difference between ROLLUP and CUBE in SQL?
- Which PySpark operator can you use to verify if two DataFrames are identical? What is a fast and specific function for this?
- How does a CTE differ from a temporary table in SQL?
I hope these questions assist anyone preparing for their interviews.
Check out the given link for knowing about this company: EY
Check out the given link for knowing about this company rating on Glassdoor: EY |GlassDoor
Check out the given link for this company profile on LinkedIn: LinkedIn
Thank you for reading this post.