We are looking for proficient Data Engineers who can work well in a modern agile software engineering environment and knowledge of application architecture patterns. The role will be a technical solution development function working with technical architects.
Roles and Responsibilities:
- Create and maintain optimal data pipeline architecture,
- Assemble large, complex data sets that meet functional / non-functional business requirements.
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
- Expertise and hands-on experience in Python/Java – Must Have
- Expertise knowledge on SparkQL/Spark Dataframe/HiveQL – Must Have
- Good knowledge of SQL – Good to Have
- Good knowledge of Shell script/kafka/sqoop/mapreduce – Good to Have
- Good Knowledge of one of the Workflow engine like Oozie, Autosys – Good to Have