Playson is a B2B game provider with 11 years of experience in the market. Since 2012 we have ambitiously built worldwide recognition in the industry. Today our main focus is on regulated European markets, and we operate in 20+ jurisdictions. As of 2023, we are continuously enhancing our portfolio and adopting best practices to meet the highest standards of technology, design, support and interoperability.
We are looking for a Site Reliability Engineer. It is a position in the Platform Tribe, SRE Stream, FireX Squad, responsible for the automation and high-load infrastructure maintenance.
To succeed in the advertised role, you have:
― Strong experience with issue handling (RCA, postmortem practices).
― Strong understanding of Kubernetes (K8s) — including deployment, scaling, troubleshooting, and managing containerized applications.
― Proficiency in AWS services — specifically, expertise in Amazon Elastic Kubernetes Service (EKS), EC2, RDS, CloudFront, and other relevant services.
― Infrastructure as Code (IaC) — Terraform is a must-have.
― Containerization technologies — Knowledge of Docker, including creating and managing Docker images and containers.
― CI/CD — Familiarity with continuous integration and continuous deployment tools like Jenkins, GitLab CI/CD, or GitHub Actions.
― Monitoring and observability — Experience with monitoring tools like DataDog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS CloudWatch.
― Networking — Strong understanding of network concepts like DNS, load balancing, and firewalls, as well as network protocols like TCP/IP, HTTP, and HTTPS; gRPC is a big plus.
― Scripting and programming languages — Proficiency in at least one scripting or programming language (e.g., Python, Node.js, Go).
― Configuration management — Experience with tools like FluxCD/ArgoCD.
― Version control systems — Proficiency in using Git or other version control systems.
― Incident management — Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps.
― Strong problem-solving and troubleshooting skills — The ability to diagnose and resolve complex technical issues.
― Strong ownership, proactiveness, persistence, and passion for maintaining one of the biggest online gambling platforms.
It would be beneficial to know:
― Ticketing systems: Jira
― Understanding of event-driven architecture
― Understanding of the ITIL framework
― Security best practices — Knowledge of security principles, including securing applications, infrastructure, and data