1. Explain about your projects
2. What and how will optimise spark jobs in your work(Spark related questions)
3. How will justify reliability and durability of data
4. What is Lineage graph and check points in Spark
5. Explain Strategic of Spark
Coding
1. Input is an array of integers. There is a sliding window of size W which
is moving from the very left of the array to the very right. Each time the sliding window moves rightwards by one position. Find the maximum number from each window.
Input array is [1, 3, -1, -3, 5, 3, 6, 7] and Sliding window (W) is 3.
Output array is [3, 3, 5, 5, 6, 7]
2. Write a query to provide number of times an employee got increment and max increment he has got along with columns emp_id, emp_name, joining_date, dept_name
Additional info:
a. An emp may not be tagged to any dept
b. An emp may not have got any hike
Emp table
emp_id | emp_name | joining_date | Dept_id
Dept table
dept_id | dept_name | Dept_Location | Dept_Manager
salary_increase table
emp_id | increment_date | inc_amount
3. Write a Python program to find element in a list from current element to next index
list is ["Mon","Tue","Wed","Thu","Fri","Sat","Sun"]
If current element is “Wed” and next index is 2, result is “Fri”
If current element is “Sat” and next index is 23 (cyclic), result is “Mon”