Arista is hiring a Remote Site Reliability Engineer
Job Description
Responsibilities:
- Ensure the scalability, performance, and resilience of our suite of products
- Work with the development and product team to establish the right monitoring and alerting strategy
- Develop build, test, and deployment automation that seamlessly targets multiple cloud regions
- Define and implement standards and best practices for system architecture, service delivery, metrics, and the automation of operational tasks
- Optimize the telemetry platform to identify customer-impacting events while providing relevant data to drive debugging
- Partner with the engineering team to optimize the performance of services for cloud architecture
- Debug live-site events and conduct follow-up post-mortems and root cause analysis (RCA)
Qualifications:
- B.E./B.Tech. in Computer Science or equivalent
- 5 to 7 years of relevant experience
- Proficiency in scripting languages such as Bash and Python
- Operational experience managing applications in AWS/GCP
- Experienced in automating software build, deployment, and server configuration management using tools such as Puppet, Chef, and Jenkins
- Hands-on experience with Linux/Unix Administration
- Good understanding of containerization concepts and tools: Docker, ECS, EKS, Kubernetes
- Experience with build tools such as Jenkins
- Working experience with databases such as MongoDB (NoSQL) and PostgreSQL (relational)
- Understanding of basic networking concepts