Seeking a Cloud Data Engineer that will assist in maintaining and monitoring infrastructure as well as build or assist in building data transformation pipelines.
Candidate must have prior experience with AWS or Azure and extensive knowledge of python for ETL development. Additional Cloud-based tools experience is important (see skills section)
Additional desired skills include experience with the following:
Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
Extensive experience leveraging Python to build data transformation pipelines (ETL) and experience with libraries such as pandas and NumPy.
Experience with pyspark.
Experience using AWS Glue and EMR to construct data pipelines
Experience building and optimizing 'big data' data pipelines, architectures, and datasets.