Lead Data Engineer

Lead Data Engineer

1 Nos.
126775
Full Time
6.0 Year(s) To 7.0 Year(s)
16.00 LPA TO 18.00 LPA
Job Description:

Role: Lead the design, development, and optimization of data pipelines and data platform components for an e-commerce business. Work across ingestion, transformation, modeling, governance, and performance to support large-scale Data & AI use cases.


Key Responsibilities

  • Build and maintain scalable batch and streaming data pipelines from transactional systems, clickstream, and third-party sources.

  • Implement and optimize data lake/lakehouse/warehouse environments on cloud platforms (GCP/AWS/Azure).

  • Develop data models and support medallion-layer architecture (bronze/silver/gold) for analytics, BI, and AI workloads.

  • Ensure end-to-end data quality, reliability, lineage, and performance.

  • Apply best practices in data governance, privacy, PII protection, and security standards.

  • Collaborate with architects, analysts, ML teams, and product stakeholders to translate business needs into scalable data solutions.

  • Optimize SQL, ETL/ELT jobs, orchestration workflows, and cloud infrastructure for performance and cost efficiency.

  • Contribute to defining data engineering standards, coding practices, and technology improvements.


Required Skills – Lead Data Engineer

  • 6 -7+ years of experience in data engineering, with strong exposure to cloud-based data platforms.

  • Hands-on expertise with data pipelines and streaming technologies (Kafka/PubSub, Spark, Flink).

  • Strong SQL and Python programming skills.

  • Experience building and optimizing data lake/lakehouse/warehouse platforms on GCP/AWS/Azure.

  • Solid understanding of data modeling: dimensional modeling, data vault (or similar), and medallion architecture.

  • Familiarity with lakehouse/warehouse tools (Databricks, BigQuery, Snowflake, Redshift, ClickHouse).

  • Experience with orchestration tools (Airflow/Cloud Composer, Databricks Workflows, etc.).

  • Strong background in data governance, data quality frameworks, cataloging, lineage, RBAC/ABAC, and compliance (GDPR/CCPA).

  • Experience in e-commerce, digital, or consumer-facing data environments is a strong advantage (clickstream, customer 360, orders, marketing).

  • Good communication skills; able to work with cross-functional teams and support architectural decisions.


    keyword skill 

    • AWS

    • Azure

    • Google Cloud Platform (GCP)

    • Data Pipelines

    • Streaming (Kafka / Spark)

    • SQL & Python

    • Data Lake / Lakehouse

    • Medallion Architecture

    • Databricks

    • Snowflake

     

Company Profile

A ---nology-driven consulting and solutions provider specializing in Generative AI, cloud infrastructure, data platforms, and automation solutions. Focused on helping businesses modernize operations with scalable, secure, and intelligent digital systems that improve efficiency, analytics, and business growth.

Apply Now

  • Interested candidates are requested to apply for this job.
  • Recruiters will evaluate your candidature and will get in touch with you.

Similar Jobs