Problem: o A traveler flies to many cities (airports) in an unbroken chain of flights with no loops i.e never revisiting an airport. o For every flight, she has a boarding pass with only a From (City) and To (City) printed on it but no date/time. o At the end of her journey, she hands you all her boarding passes but they’re shuffled, so you don’t know the starting or the ending city. Can you: o Write logic or pseudocode to print her whole journey in sequence. It should print e.g. (Starting) City1 -> City2 ->….-> (Ending) CityX o State the time complexity of your solution. o you’re given a Set of BoardingPass objects as input. o there could be as many as hundreds of thousands of unique cities/airports. o memory is no concern (i.e. you have infinite memory!). Optimize for execution time (time complexity).
Senior Data Engineer Interview Questions
2,556 senior data engineer interview questions shared by candidates
What is conformed dimension How many executors in Spark
1. Previous Works 2. Joining of Datasets and Questions in Relation to CDC Logics. 3. SCD-related questions and backfill scenarios 4. Architecture of Abinitio (since my tool was Abinitio)
- What is the difference between shallow and deep copy in Python?
I was asked about data pipeline information in my current project
Give an example of when you have used data to improve a business process or technical system.
What would have to happen in the first 30 days so you would want to quit?
How can we scale a banking system beyond RDBS replication capacity?
Explain how you would design an ETL pipeline to move and transform data from multiple sources.
Questions involved: Current Roles and Responsibilities Past Projects Tech Stack Sister Tools and technologies Marketing Domian knowledge Analytical Approach to solve a real time problem SQL Big Data Concepts Data Pipelines - Key things (Data Validation, Quality Checks) BI & Data Science concept and data consumption
Viewing 191 - 200 interview questions