Found Description
What you will do
Design, develop, and maintain ETL pipelines to extract, transform, and load data across various data sources (cloud storage, databases, APIs);
Use Apache Airflow for orchestrating workflows, scheduling tasks, and managing pipeline dependencies;
Build and manage data pipelines on Azure and GCP clouds;
Design and support Data Lake;
Write Python scripts for data cleansing, transformation, and enrichment using libraries like Pandas, PySpark;
Analyze logs and metrics from Airflow and cloud services to resolve pipeline failures or inefficiencies.

Must haves
Experience (2+ years) writing efficient and scalable Python code, especially for data manipulation and ETL tasks (using libraries like Pandas, PySpark, Dask, etc.);
Knowledge of Apache Airflow for orchestrating ETL workflows, managing task dependencies, scheduling, and error handling;
Experience in building, optimizing, and maintaining ETL pipelines for large datasets, focusing on data extraction, transformation, and loading;
Familiarity wit...
Ready to Apply?
Submit your application for Python Data Engineer (Junior/Middle) Id31594 (México) at Link-Worldwide