Site reliability engineer
Job Description:
Core Responsibilities
• Engineering Execution: Lead and execute planned infrastructure projects (e.g., cluster upgrades, automation of provisioning workflows, migration of legacy services)
• Infrastructure as Code: Build and maintain reproducible environments on AWS using Terraform
• Platform Health: Manage and optimize Kubernetes (EKS) workloads, ensuring high availability and performance
• Legacy Stability: Maintain and troubleshoot Linux-based legacy systems
• Automation: Eliminate manual "toil" by developing robust scripts (Bash, Python, or Go) for system maintenance and deployment
Required Technical Skills
• Cloud: Professional-level experience with AWS (VPC, EKS, EC2, S3, IAM)
• IaC: Strong proficiency with Terraform (module design, state management)
• Orchestration: Deep understanding of Kubernetes (workloads, services, ingress, and config management)
• Legacy Systems: Solid Linux Administration skills (troubleshooting Java/Tomcat stacks)
• Database: Fundamental knowledge of MongoDB, PSQL and other DB engines (monitoring connectivity, basic metrics analysis)
• Scripting: Competent with at least one scripting language (Python, Bash, or Go) for automation
Key Skills :
Company Profile
Client is a leading name in offering a full spectrum of software outsourcing solutions. Equipping businesses to keep up with the diverse software needs, the client is committed to provide software development services to start-ups and enterprises of all scales and sizes.
Apply Now
- Interested candidates are requested to apply for this job.
- Recruiters will evaluate your candidature and will get in touch with you.