Playson is a B2B game provider with 10 years of experience in the market. Since 2012, we have built worldwide recognition in the industry. Today, our main focus is on regulated European markets, and we operate in 20+ jurisdictions. As of 2023, we continue to enhance our portfolio, adopting best practices to meet the highest standards of technology, design, support and interoperability.
We are looking for a Site Reliability Engineer. This position is in the Platform Tribe, SRE Stream, FireX Squad, responsible for automation and high-load infrastructure maintenance.
To succeed in this role, you should have:
• Strong understanding of Kubernetes (K8s) - Including deployment, scaling, troubleshooting, and managing containerized applications.
• Proficiency in AWS services - Specifically, expertise in Amazon Elastic Kubernetes Service (EKS), EC2, RDS, S3, VPC, IAM, CloudWatch, and other relevant services.
• Infrastructure as Code (IaC) - Experience with tools like Terraform, CloudFormation, or Pulumi for infrastructure provisioning and management.
• Containerization technologies - Knowledge of Docker, including creating and managing Docker images and containers.
• CI/CD - Familiarity with continuous integration and deployment tools like Jenkins, GitLab CI/CD, or GitHub Actions.
• Monitoring and observability - Experience with monitoring tools like Prometheus and Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack), or AWS CloudWatch.
• Distributed tracing - Experience with tools like Jaeger, Zipkin, or AWS X-Ray to diagnose and optimize microservices-based applications.
• Networking - Strong understanding of network concepts like DNS, load balancing, and firewalls, as well as network protocols like TCP/IP, HTTP, and HTTPS.
• Scripting and programming languages - Proficiency in at least one scripting language (e.g., Python, Ruby, Bash) and one programming language (e.g., Go, Java, Node.js).
• Configuration management - Experience with tools like Ansible, Puppet, or Chef for managing server and application configurations.
• Version control systems - Proficiency in using Git or other version control systems.
• Incident management - Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps.
• Security best practices - Knowledge of security principles, including securing applications, infrastructure, and data.
• Strong problem-solving and troubleshooting skills - The ability to diagnose and resolve complex technical issues.
• Strong experience with issue handling (RCA and postmortem practices).
• Strong ownership, proactiveness, persistence, and passion for maintaining one of the biggest online gambling platforms.
It would be beneficial to know: