Site Reliability Engineer

SRE

1 Nos.
32046
Full Time
8.0 Year(s) To 12.0 Year(s)
Not Disclosed by Recruiter
IT Software- Application Programming / Maintenance
IT-Software/Software Services
Job Description:

Work timing: 3-12 AM

 

Responsibilities: 

  1. Candidate will be part of the SRE team and lead technical role to determine
  2. Reliability Engineering needs of mission critical systems and business processes
  3. Candidate will assess high level architecture and design issues relating to platform enterprise software interactions with other systems
  4. Application development infrastructure and middleware teams to ensure stability and reliability of the system Engineering will proactive detect issues within the applications platform network.
  5. Candidate should have familiarity with Internet protocols such as HTTP DNS TCP and UDP and Linux development environment and well versed with DevOps.
  6. Candidate will identify anti patterns optimization and support development of self-healing capabilities
  7. Responsibilities Create operational tooling for monitoring self-healing infrastructures and testing
  8. Design and create controlled in production systems
  9. Work across teams identify and fix issues that affect systems reliability and performance
  10. Dive into system and latent reliability issues service performance and capacity modeling of distributed systems at scale
  11. Partner with development team to identify anti patterns and optimization strategies create fallback options and help develop self-healing capabilities across the enterprise in a sustainable manner
  12. Requirements A passion for creating reliable applications and a systematic problem solving approach coupled with a strong sense of ownership and drive
  13. 3+ years of hands on experience with cloud-based technologies and tools in configuration management deployment monitoring and operations
  14. Experience with Engineering tools such as Terraform, Ansible, Consul and Linux development environment.
  15. Experience in Application Performance Managing Real User Monitoring infrastructure monitoring and log analysis tool such as Apica Nagios Sensu and Sumologic NewRelic with DevOps Continuous Delivery
  16. Expertise in working in partnership with colleagues throughout the firm and in leading collaborative teams to achieve common goals
  17. Experience in an Agile delivery environment
  18. Experience as a hands on software engineer so you understand the core principles of the engineering work
  19. Experience in communication and organization in large distributed teams 
  20. A Bachelor s degree is required 

Behavioral Attributes

  1. Strong problem-solving skills, ability to drive and implement structured solutions
  2. Ability to quickly identify and prioritize right issue areas
  3. Good writing and verbal communication skills
  4. Self-directed and proactive approach to tackling problems and leveraging resources 

 

Company Profile

We are a specialized IT services company with re-usable technology assets in the DevOps, Cloud, Automation, Digital, Service Delivery and Agile Analytics domains. It helps global organizations achieve frictionless business by transforming their Infrastructure, Applications and Data to provide business scale, operational efficiency and deliver superior customer experience.

Apply Now

  • Interested candidates are requested to apply for this job.
  • Recruiters will evaluate your candidature and will get in touch with you.

Similar Jobs