- Dimension modelling
- Difference between etl and elt, and state the process for each
- Given a business problem, how would you construct the data model
- how spark or mpp database joins data under the hood - shuffling, broadcasting, hash join, sort merge join, nested loop join etc.
- https://datalemur.com/