Senior AI / Data Science Engineer – Data Engineering
1 Nos.
91148
Full Time
8.0 Year(s) To 13.0 Year(s)
20.00 LPA TO 35.00 LPA
IT Software - System Programming
IT-Software/Software Services
Job Description:
A Day in the Life
- Collaborate with data analysts, data scientists, and business stakeholders to understand data requirements and translate them into efficient and scalable data solutions.
- Design and develop end-to-end data pipelines encompassing data ingestion, transformation, storage, and delivery.
- Utilize strong programming skills in Python to write clean, maintainable, and optimized code for data processing tasks.
- Leverage expertise in Apache Spark and PySpark to distribute data processing across clusters and handle large datasets efficiently.
- Possess a deep understanding of SQL (Oracle/SQL Server) and NoSQL databases (Big Data) to manage and query data effectively at scale.
- Design and implement data pipelines on cloud platforms like AWS, Azure, or Snowflake for scalability and cost-effectiveness, leveraging services like S3, Blob Storage, or Data Lake Storage.
- Orchestrate data pipelines using tools like Airflow or similar solutions to ensure smooth data flow, automation, and reliable scheduling.
- Build and maintain integrations with RESTful and SOAP web services to facilitate seamless data exchange between systems.
- Monitor and troubleshoot data pipelines to ensure data quality, consistency, and timely delivery.
- Champion best practices for data engineering and maintain a high standard of code documentation.
- Stay up-to-date on the latest advancements in data engineering tools and technologies, including Big Data frameworks and cloud platforms.
Must Have
Job Responsibilities
- 8+ & 14+ years of experience in data engineering with a proven track record of designing and implementing data pipelines.
- Strong programming skills in Python with proficiency in libraries like Pandas, Spark, and PySpark for data manipulation and analysis.
- In-depth knowledge of SQL (Oracle/SQL Server) and NoSQL databases (Big Data) for data storage and retrieval at scale.
- Experience with data pipeline orchestration tools like Airflow or similar solutions.
- Experience designing and implementing end-to-end data solutions, from data ingestion to consumption.
- Familiarity with cloud platforms (AWS, Azure, Snowflake) for data storage, processing, and services.
- Understanding of web service protocols (REST, SOAP) and experience building data integrations.
- Excellent problem-solving and analytical skills with a passion for building efficient data infrastructures.
- Effective communication and collaboration skills to work effectively with cross-functional teams.
Minimum Qualification
- Bachelors / Master’s of Engineering in Computer Science, Statistics, Mathematics, or a related technical field (PhD is a plus).
- 8+ & 14+ years of Software industry experience.
Principal Working Relationship
- Reports to the AI / Data Science Manager.
- Collaborates with data scientists, business analysts, data analysts, and other data engineers.
Nice to Haves
- Experience with real-time data processing frameworks (Apache Kafka, Apache Flink).
- Experience with data governance and data security best practices.
- Experience with data visualization tools (Tableau, Power BI) for data exploration.
- Experience with DevOps practices for continuous integration and deployment (CI/CD) of data pipelines.
Company Profile
A global healthcare technology leader — boldly attacking the most challenging health problems facing humanity with innovations that transform lives.
Apply Now
- Interested candidates are requested to apply for this job.
- Recruiters will evaluate your candidature and will get in touch with you.