Role Summary:
The Data Engineer will be responsible develop semantic models on top of Data lake/ Data warehouse to fulfill the self-service BI foundation requirements. Data extraction from the various data sources and integration into the central data lake / data warehouse using enterprise platform like Informatica iPaaS.
Responsibilities:
Designing data warehouse data model based on the business requirements
• Designing, developing, and testing both batch and real-time Extract, Transform and Load (ETL) processes required for the data integration
• Ingesting of both structured and unstructured data into SMBU data lake / data warehouse system
• Designing and Developing Semantic Models/Self Service Cubes.
• Performing BI administration and access management to make sure access and reports are properly governed.
• Performing Unit Testing and Data Validation to ensure Business UAT are successful
• Performing ad-hoc data analysis and presenting results in a clear manner
• Assessing data quality of the source systems and proposing required enhancements to achieve satisfying level of the data accuracy
• Optimizing ETL processes to ensure execution time is meeting the requirements
• Maintaining and architecting ETL pipelines to make sure data is loaded on time on regular basis.
Requirement :
• 5 to 8 years of overall experience.
• Proven experience in development of the dimensional models in Azure Synapse with strong SQL knowledge
• Minimum of 3 years working as a Data Engineer in Azure ecosystem specifically using Synapse, ADF & Data bricks.
• Preferable 3 year of experience with data warehousing, ETL development, SQL Queries, Synapse, ADF, PySpark, Informatica iPaaS for data ingestion & data modeling.