How many joins are there? Please explain each
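One way to approach this question is to run each commonly cited join type (inner, left, right, full outer, cross) against two tiny tables. The sketch below uses Python's built-in `sqlite3` module; the `employees` and `departments` tables and their rows are hypothetical, made up for illustration. Right and full outer joins are emulated with `LEFT JOIN`, because older SQLite builds do not support them directly.

```python
import sqlite3

# Hypothetical sample data, not from the original question.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (name TEXT, dept_id INTEGER);
    CREATE TABLE departments (dept_id INTEGER, dept_name TEXT);
    INSERT INTO employees VALUES ('Ana', 1), ('Ben', 2), ('Cy', NULL);
    INSERT INTO departments VALUES (1, 'Data'), (3, 'Ops');
""")

def run(sql):
    return conn.execute(sql).fetchall()

# INNER JOIN: only rows with a match on both sides.
inner_rows = run("""
    SELECT e.name, d.dept_name
    FROM employees e INNER JOIN departments d ON e.dept_id = d.dept_id
    ORDER BY e.name
""")

# LEFT JOIN: every employee, with NULL where no department matches.
left_rows = run("""
    SELECT e.name, d.dept_name
    FROM employees e LEFT JOIN departments d ON e.dept_id = d.dept_id
    ORDER BY e.name
""")

# RIGHT JOIN, emulated by swapping the table order of a LEFT JOIN.
right_rows = run("""
    SELECT e.name, d.dept_name
    FROM departments d LEFT JOIN employees e ON e.dept_id = d.dept_id
    ORDER BY d.dept_name
""")

# FULL OUTER JOIN, emulated as the left join plus unmatched departments.
full_rows = run("""
    SELECT e.name, d.dept_name
    FROM employees e LEFT JOIN departments d ON e.dept_id = d.dept_id
    UNION ALL
    SELECT NULL, d.dept_name
    FROM departments d LEFT JOIN employees e ON e.dept_id = d.dept_id
    WHERE e.dept_id IS NULL
""")

# CROSS JOIN: every employee paired with every department.
cross_rows = run("SELECT e.name, d.dept_name FROM employees e CROSS JOIN departments d")

print(inner_rows)  # [('Ana', 'Data')]
print(left_rows)   # [('Ana', 'Data'), ('Ben', None), ('Cy', None)]
print(right_rows)  # [('Ana', 'Data'), (None, 'Ops')]
```

A self join is sometimes listed as a separate type, but it is an ordinary join of a table with itself, so the count an interviewer expects may vary.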
Data Engineer Interview Questions
Data engineers are IT professionals who are needed in almost every industry. Data engineers monitor data trends to determine the best next steps for companies. A critical part of a data engineer's job is turning raw data into usable data by building data pipelines and data systems.
Top Data Engineer Interview Questions & How To Answer
Question #1: Can you describe in detail your level of expertise with programming languages?
Question #2: Explain data engineering in your own words.
Question #3: Can you describe your experience working with Apache Hadoop and cloud data management environments?
20,166 data engineer interview questions shared by candidates
SQL and Python
You are given an integer list coins representing coins of different denominations and an integer amount representing a total amount of money. Return the fewest number of coins that you need to make up that amount. If that amount of money cannot be made up by any combination of the coins, return {}. You may assume that you have an infinite number of each kind of coin.
Example 1:
Input: coins = [1, 5, 6, 9, 15], amount = 31
Output: {15:2, 1:1}
Example 2:
Input: coins = [1, 5, 6, 9, 15], amount = 100
Output: {15:6, 9:1, 1:1}
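A minimal sketch of one possible solution, using bottom-up dynamic programming rather than a greedy pick of the largest coin. The function name `fewest_coins` and the helper arrays are illustrative choices, not part of the original question. When several combinations tie for the fewest coins (as can happen for amount = 100), this sketch may return a different but equally short combination than the example output.

```python
from collections import Counter

def fewest_coins(coins, amount):
    """Return {denomination: count} using the fewest coins, or {} if impossible."""
    if amount < 0:
        return {}
    INF = float("inf")
    # best[a] = fewest coins needed to make amount a
    best = [0] + [INF] * amount
    # last[a] = denomination of the last coin in one optimal solution for a
    last = [None] * (amount + 1)
    for a in range(1, amount + 1):
        for c in coins:
            if c <= a and best[a - c] + 1 < best[a]:
                best[a] = best[a - c] + 1
                last[a] = c
    if best[amount] == INF:
        return {}
    # Walk back through the recorded coins to rebuild the combination.
    result = Counter()
    a = amount
    while a > 0:
        result[last[a]] += 1
        a -= last[a]
    return dict(result)

print(fewest_coins([1, 5, 6, 9, 15], 31))  # {15: 2, 1: 1} in some key order
```

Dynamic programming is used here because a greedy strategy is not always optimal: with coins [1, 5, 6, 9] and amount 11, greedy picks 9 + 1 + 1 (three coins) while 5 + 6 needs only two.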
If you do a xor b xor a, what is the end result?
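The answer follows from three properties of XOR: it is commutative and associative, `a ^ a == 0`, and `x ^ 0 == x`. So `a ^ b ^ a` can be regrouped as `(a ^ a) ^ b`, which is `0 ^ b`, which is `b`. A small check in Python, where the function name `xor_roundtrip` is only illustrative:

```python
def xor_roundtrip(a: int, b: int) -> int:
    """Compute a XOR b XOR a, which cancels a and leaves b."""
    return a ^ b ^ a

# Spot check with two arbitrary values.
print(xor_roundtrip(0b1011, 0b0110) == 0b0110)  # True

# Exhaustive check over a small range of non-negative integers.
always_b = all(xor_roundtrip(a, b) == b for a in range(64) for b in range(64))
print(always_b)  # True
```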
Typical data engineering questions
They gave me driver data, passenger data, and booking data. You will be asked to run some aggregations on this data and publish the results.
1. Imagine a table keeping track of all the rides a driver undertakes in a given day. How do you query that table to find the times when the driver did not have any rides? For example, the table may record the driver picking up a first ride, then picking up another, and then dropping both off. After a while the driver picks up a few more rides, which are also recorded in the table. Given all that, how do you find the periods when the driver does not have any rides?
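The underlying logic can be sketched in Python before translating it to SQL: sort the rides by pickup time, track the latest drop-off seen so far, and report a gap whenever the next pickup starts after it. The sketch assumes each row carries a pickup and a drop-off timestamp; that schema, the function name `idle_periods`, and the sample times are all hypothetical, since the question does not specify them.

```python
from datetime import datetime

def idle_periods(rides, day_start, day_end):
    """Return (start, end) periods with no active ride.

    rides: iterable of (pickup, dropoff) datetime pairs for one driver.
    """
    gaps = []
    busy_until = day_start  # latest drop-off seen so far
    for pickup, dropoff in sorted(rides):
        if pickup > busy_until:
            gaps.append((busy_until, pickup))
        # Overlapping rides extend the busy window instead of opening a gap.
        busy_until = max(busy_until, dropoff)
    if busy_until < day_end:
        gaps.append((busy_until, day_end))
    return gaps

def at(hour, minute=0):
    # Hypothetical helper: a time on an arbitrary sample day.
    return datetime(2024, 1, 1, hour, minute)

rides = [
    (at(9, 0), at(9, 40)),    # first ride
    (at(9, 10), at(9, 30)),   # second ride, overlapping the first
    (at(11, 0), at(11, 20)),  # later ride
]
gaps = idle_periods(rides, day_start=at(8), day_end=at(18))
for start, end in gaps:
    print(start.time(), "to", end.time())
```

In SQL the same idea is usually expressed with window functions, for example by comparing each pickup time against a running maximum of earlier drop-off times.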
# Q2: Same as Q1, but now assume there can be duplicates. The output should not have duplicates.
# Ex:
# l1 = [1,2,3,4,5,5,5]
# l2 = [1,5,7,11,100]
# result = [1,5]
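Q1 is not shown here, but the example suggests the task is to return the elements common to both lists. Under that assumption, a minimal sketch that keeps the order of `l1` and emits each common value once (the name `common_elements` is illustrative):

```python
def common_elements(l1, l2):
    """Return values present in both lists, without duplicates, in l1 order."""
    in_l2 = set(l2)   # O(1) membership checks
    emitted = set()   # values already added to the result
    result = []
    for x in l1:
        if x in in_l2 and x not in emitted:
            result.append(x)
            emitted.add(x)
    return result

l1 = [1, 2, 3, 4, 5, 5, 5]
l2 = [1, 5, 7, 11, 100]
print(common_elements(l1, l2))  # [1, 5]
```

If output order does not matter, `sorted(set(l1) & set(l2))` gives the same values more compactly.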
What percent of all products in the grocery chain's catalog are both low fat and recyclable?
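A hedged sketch of how such a query might look, run through Python's built-in `sqlite3` module. The question does not give the schema, so the `products` table, its `is_low_fat` and `is_recyclable` columns, the 'Y'/'N' encoding, and the sample rows are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical schema and data; the real catalog table is not shown in the question.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (
        product_id INTEGER PRIMARY KEY,
        is_low_fat TEXT,
        is_recyclable TEXT
    );
    INSERT INTO products VALUES
        (1, 'Y', 'Y'), (2, 'Y', 'N'), (3, 'N', 'Y'), (4, 'Y', 'Y');
""")

# Count rows meeting both conditions and divide by the total row count.
# Multiplying by 100.0 forces floating-point division.
pct = conn.execute("""
    SELECT 100.0 * SUM(CASE WHEN is_low_fat = 'Y' AND is_recyclable = 'Y'
                            THEN 1 ELSE 0 END) / COUNT(*)
    FROM products
""").fetchone()[0]

print(pct)  # 50.0 for the sample rows above
```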
Write a query that returns product_family, units_sold, percentage of promoted.