Office hour: Monday – Friday (9:00 AM – 6:00 PM)
- Build and operate batch and near–real-time data pipelines integrating POS/ERP/WMS systems, e-commerce, marketing platforms, and external data sources.
- Orchestrate and monitor workflows using Airflow, Prefect, or Dagster, ensuring defined SLAs and reliable incident management.
- Design and maintain scalable schemas and core domain data models (Products, SKUs, Inventory, Sales, Suppliers, Warehouses).
- Implement data quality frameworks including validation, reconciliation, freshness checks, and observability.
- Develop, optimize, and maintain cloud data warehouses (BigQuery, Snowflake, or Redshift) with efficient dbt modeling.
- Collaborate with ML/DS teams to deliver feature stores, training datasets, and production-ready model pipelines.
- Establish and enforce data governance, security standards, documentation practices, and reusable data contracts/APIs.