Core Skills Required:
✔ Advanced SQL (query optimization & performance tuning)
✔ Hands-on experience with Trino/Presto (Starburst / Galaxy)
✔ Experience with data lakehouse technologies (Apache Iceberg, Hive, Delta Lake)
✔ Strong Python skills (pipeline automation & API integration)
✔ Cloud experience (AWS S3 / Azure ADLS / GCP GCS)
Nice to Have:
✔ Kubernetes, Docker, Terraform
✔ Distributed query performance tuning
✔ Data security & governance (Apache Ranger)
✔ Orchestration tools (Airflow / Dagster)