Site Reliability Engineer Remote Jobs

8 Results

+30d

Site Reliability Engineer

PlaysonRemote job, Remote
10 years of experienceterraformB2BDesignansiblegitrubyjavadockerelasticsearchkubernetesjenkinspythonAWSNode.js

Playson is hiring a Remote Site Reliability Engineer

????Playson is a B2B game provider with 10 years of experience on the market. Since 2012 we have ambitiously developed worldwide recognition in the industry. Nowadays, our main focus is on regulated European Markets and we operate in 20+ different jurisdictions. As of 2023, we are continuously working on enhancing our portfolio, encompassing best practices in order to meet the highest standards of technology, design, support and interoperability.

We are looking for a Site Reliability Engineer. It is a position in the Platform Tribe, SRE Stream, FireX Squad, responsible for the automation and high-load infrastructure maintenance.


To succeed in the advertised role, you have:

???? Strong understanding of Kubernetes (K8s) - Including deployment, scaling, troubleshooting, and managing containerized applications.

???? Proficiency in AWS services - Specifically, expertise in Amazon Elastic Kubernetes Service (EKS), EC2, RDS, S3, VPC, IAM, CloudWatch, and other relevant services.

???? Infrastructure as Code (IAC) - Experience with tools like Terraform, CloudFormation, or Pulumi for infrastructure provisioning and management.

???? Containerization technologies - Knowledge of Docker, including creating and managing Docker images and containers.

???? CI/CD - Familiarity with continuous integration and deployment tools like Jenkins, GitLab CI/CD, or GitHub Actions.

???? Monitoring and observability - Experience with monitoring tools like Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack), or AWS CloudWatch.

???? Distributed tracing: Experience with tools like Jaeger, Zipkin, or AWS X-Ray to diagnose and optimize microservices-based applications.

???? Networking - Strong understanding of network concepts like DNS, load balancing, and firewalls, as well as network protocols like TCP/IP, HTTP, and HTTPS.

???? Scripting and programming languages - Proficiency in at least one scripting language (e.g., Python, Ruby, Bash) and one programming language (e.g., Go, Java, Node.js).

???? Configuration management - Experience with tools like Ansible, Puppet, or Chef for managing server and application configurations.

???? Version control systems - Proficiency in using Git or other version control systems.

???? Incident management - Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps.

???? Security best practices - Knowledge of security principles, including securing applications, infrastructure, and data.

???? Strong problem-solving and troubleshooting skills - The ability to diagnose and resolve complex technical issues.

???? Strong experience with issues processing (RCA, Postmortems practices).

???? Strong ownership, proactiveness, persistence, and passion for maintaining one of the biggest online gambling platforms/


It would be beneficial to know:

  • FluxCD/ArgoCD.

  • Ticket systems: Jira.

  • Understanding of event-driven architecture.

  • Knowledge of ITSM and ITIL Frameworks.

See more jobs at Playson

Apply for this job

+30d

Senior Site Reliability Engineer

TenableRemote, United States
agileBachelor's degreeterraformjavadockerkubernetespythonAWSNode.js

Tenable is hiring a Remote Senior Site Reliability Engineer

Description

Who is Tenable?

Tenable® is the Exposure Management company. 40,000 organizations around the globe rely on Tenable to understand and reduce cyber risk. Our global employees support 60 percent of the Fortune 500, 40 percent of the Global 2000, and large government agencies. Come be part of our journey! 

What makes Tenable such a great place to work? 

Ask a member of our team and they’ll answer, “Our people!” We work together to build and innovate best-in-class cybersecurity solutions for our customers; all while creating a culture of belonging, respect, and excellence where we can be our best selves. When you’re part of our #OneTenable team, you can expect to partner with some of the most talented and passionate people in the industry, and have the support and resources you need to do work that truly matters. We deliver results that exceed expectations and we win together!

Your Role:

Have you heard of Tenable.io? Our cloud-based vulnerability management platform built for today’s dynamic IT assets, like cloud, containers and web apps? Well, that’s what you’ll be working on in this role.  You will need to continue to quickly build out the platform, scale it automatically, and make it more self-managing for our cloud customers!

Your Opportunity:

  • Responsible for taking the code and functionality of Tenable.io and making it function in private cloud environments
  • Responsible for responding to support escalations which involve troubleshooting complex technical problems and resolving data/configuration issues within defined service level objectives
  • Managing customer segregation across multiple geographical regions and zones to provide high performance, reliability, and availability
  • Responsible for developing software, tools, and scripts to automate deployment, management, and monitoring of production systems in all environments
  • Provide strategic and thought leadership among peers on complex projects
  • Collaboration with cloud engineers in understanding new cloud technologies, assessing impact to security services operations, and proposing solutions to existing business problems
  • Collaboration in the software development lifecycle to develop detailed enhancement/bug definitions, write functional requirements, translate the requirements into solution designs, and navigate the functional requirements through to Production deployments
  • Proactively look for ways to create efficiencies within operations as it pertains to the tools and technology used by Tenable to support their customer base
  • Manage, participate in, or directly work on any additional projects, assignments, or initiatives assigned by management
  • Create/maintain documentation for operational procedures
  • Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization
  • Participate in an on-call rotation and support 24x7 availability of production application systems

What You'll Need:

  • U.S citizen required
  • 5+ years of related SRE experience
  • Bachelor's Degree or Master's degree in a technical field such as Computer Science, Information Technology Engineering or equivalent work experience
  • Strong experience with the Agile software development methodology and collaboration with internal teams to deliver software and configuration artifacts
  • Strong background in bash scripting in addition to experience in higher-level scripting languages like Python or Node.js
  • Experience with Docker or similar container solution
  • Experience with orchestration tooling such as Kubernetes and Docker Swarm
  • Experience with Terraform or similar IaC technologies
  • 2+ years deploying Amazon Web Services (AWS) public cloud infrastructures preferred including administering managed AWS services (OpenSearch, MSK, EKS, Batch, etc.)
  • 1+ years of operational experience with industry-leading "big data" services technologies
  • Be an enthusiastic learner, user, and advocate of our technologies
  • Has desire to win as a team – make big things happen by working together and being open and willing to try new ideas
  • Strong interpersonal and communications skills (written, verbal, & virtual) with ability to work in a team-oriented, collaborative environment
  • Must have high degree of personal integrity and ability to maintain strict confidentiality
  • Must have a strong drive, be self-motivated, logical, and have a keen attention to detail

And Ideally:

  • Experience deploying distributed, microservice oriented applications
  • Experience with Java build tools including Gradle
  • Experience with Helm/Tiller
  • Experience with Go and Java/Kotlin

If you’ve reached this point, and you’re still not sure if you should apply…..Just do it! We’re human and we don’t fit a perfect mold. Having diverse backgrounds, experiences and perspectives, that’s a good thing! If you’re coming from outside of the cyber industry - great! If you’re looking to try something new - awesome! All we ask is you bring passion to all that you do, crave creativity and innovation, and embrace the hard work of gaining new skills and accepting big challenges.

We’re committed to promoting Equal Employment Opportunity (EEO) at Tenable - through all equal employment opportunity laws and regulations at the international, federal, state and local levels.

The base salary range for this position is $128,000.00 - $170,000.00 USD. Compensation for the role will depend on a number of factors, including the candidate's qualifications, skills, competencies, location and experience, and may fall outside of the range shown. Employees are also eligible for variable compensation in addition to base pay (commission for sales roles, bonus for non-sales roles), depending on company and individual performance. Tenable also offers a variety of comprehensive and competitive benefits which include: medical, dental, vision, disability and life insurance; 401(k) retirement savings with company match; an employee stock purchase plan; an employee referral program; flexible spending accounts; an Employee Assistance Program (EAP); education assistance; parental leave; paid time off (PTO); company-paid holidays; health and wellness events; and community programs.

#LI-Remote

See more jobs at Tenable

Apply for this job

+30d

Site Reliability Engineer (Remote USA)

Open Systems AGRemote , California, United States
agileterraformDesignkubernetespython

Open Systems AG is hiring a Remote Site Reliability Engineer (Remote USA)

Are you an engineer passionate about building software and systems that improve the everyday work life of people around the world? Read on, this job might be for you! 

About Open Systems


Open Systems delivers cybersecurity beyond expectations. We partner with organizations to boost the security performance of their digital transformations. Our award-winning Managed Detection and Response (MDR) and Secure Access Service Edge (SASE) services connect and protect customers today, while increasing their security maturity for tomorrow

Open Systems’ Mission Control SOCs and NOCs are staffed by certified, outcome-obsessed engineers who provide 24x7 global coverage. They leverage a platform backed by data science and years of finetuning complex processes to better understand and reduce attack surfaces.

Deployed in nearly 10,000 locations across 184 countries, Open Systems has earned an out of this world 97% retention rate. No wonder our customers call it crazy good cybersecurity.


Discover more at www.open-systems.com. 


Join us and empower our ambitious Site Reliability Engineering team as: 

Site Reliability Engineer (80% - 100%) Your mission:

As a Site Reliability Engineer, you empower Open Systems to deploy and operate a reliable, distributed service at scale. You will: 


  • Work closely with engineering teams, product owners, and other stakeholders to define service operations, identify operational issues early and prevent them
  • Help define Service Level Objectives to assess release readiness of all services 
  • Develop software, tooling, and processes to automate our operations 
  • Measure and optimize system performance 
  • Participate in incident management on-call rotation and drive root cause analysis


As part of your SRE responsibilities, and through the Open Systems training and courses, you will become certified as a Mission Control Engineer, providing you with knowledge in a wide area covering networking and security topics. This will give you the opportunity to visit our Mission Control NOCs in Redwood City and Honolulu for deployments. 


Your qualifications:

You are strong in either Software Engineering or Networking and Security Operations and have an interest in developing your skills and solving problems at the intersection of both. You are motivated to learn new skills and expand your existing theoretical and practical knowledge in training programs offered by internal domain experts and team colleagues, and ideally, you bring some of the following skills to the table: 


  • University degree in Computer Science, or equivalent professional experience
  • At least 2 years of software development, DevOps, or security automation
  • Strong conceptional understanding of scalable system design
  • Full ability to design, test, and release code in general-purpose languages such as Go or Python and scripting (Bash)
  • Familiarity with GitOps, Terraform, Kubernetes, Prometheus, as well as major clouds
  • Interpersonal skills, ability to collaborate and build trust across teams to design and deliver shared solutions


What we offer:

You will join our growing SRE team, and work with agile method in coordination with software and product engineering teams. You will have the opportunity to work remotely from Switzerland, Germany, Austria, the Netherlands or the UK, or be based in one of our offices in Zürich, Bern, Düsseldorf, Vienna. You will have the option to work full-time or part-time 80%.


Open Systems will offer you interesting challenges in the dynamic and global environment of SD-WAN and cybersecurity. You will be in a work environment in which innovative solutions, rapid development times, creativity, and open communication are practiced and continuously fostered. The pursuit of technical advancement is at the center of our attention. Our employees are known as enthusiastic, humorous, and passionate individuals. It’s all about people because it’s them who make us stand out in the marketplace, not our technology. 

We look forward to receiving your online application (please note that you have to compress your application into two attachments).  Only direct applications will be considered.


Come as you are! We search for amazing people of diverse backgrounds, experiences, abilities, and perspectives. Open Systems welcomes and encourages diversity in the workplace regardless of race, gender, religion, age, sexual orientation, disability, or veteran status. 

Get the word out!


See more jobs at Open Systems AG

Apply for this job

Numbrs Personal Finance AG is hiring a Remote Site Reliability Engineer - Remote

Site Reliability Engineer - Remote - Numbrs Personal Finance AG - Career Page

See more jobs at Numbrs Personal Finance AG

Apply for this job

+30d

Site Reliability Engineer - (100% remote)

Giant SwarmRemote
agileterraformDesignansiblekubernetespython

Giant Swarm is hiring a Remote Site Reliability Engineer - (100% remote)

Your Job
We are looking for a Site Reliability Engineer (m/f/d). You will be a key member of a tight-knit group of talented Engineers who are responsible for keeping ours and our customer’s Kubernetes clusters operational and healthy. You’ll also have a key role in the development of the product itself, working together with our Platform Engineers to deliver the greatest Kubernetes service possible.

Giant Swarm is a fast-growing open-source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products. We are a diverse, fully remote (since 2014) and experienced team that is growing and spread across Europe - with a headquarters in Cologne.

  • You maintain, operate and upgrade our own and our customer’s Kubernetes clusters.
  • You will design, configure, build, and maintain our core infrastructure, from kernel parameters to the cloud provider templates.
  • You understand how servers and systems work and you tweak their behavior to your needs.
  • You will be responsible for our monitoring, logging and alerting.
  • You will help resolve incidents on our own and our customer’s clusters.
  • You participate in the on-call support schedule
  • You are a go-to person in case our developers need advice regarding infrastructure.
  • You will automate all the things, and the thought of Terraform doesn’t make you cry.
  • We (and the majority of our customers) are currently mostly distributed around Europe (around UTC), thus, your main time zone should be somewhere between +/-2UTC to ensure better communication.
Requirements
  • You have deep hands-on knowledge of the inner workings of a Kubernetes cluster
  • You must be able to configure all cluster components from the ground up with no automated deployment tools (think Kubernetes the Hard Way)
  • You’re comfortable debugging systems at all levels, from kernel fundamentals right up to workloads running on Kubernetes.
  • You’re happy troubleshooting a wide variety of issues and you’re not afraid to parse thousands of lines of logs in pursuit of an answer.
  • You have good coding skills (preferably Go, but Python or similar is fine as well)
  • You have experience with maintaining infrastructure with code and you know the pros and cons of various automation tools (We use Terraform & Ansible but Chef, Puppet and the lot is also a good start).
  • You are fluent with Cloud Native Tools running on top of Kubernetes (prometheus, grafana, ingress controller, …) you know how to use them and how to configure them.
  • You automate all the things by writing code. Using bash scripts makes you sad :)
About us

Every new team member changes the team. We love to learn from each other and we are looking for people who know things we don’t. 

  • Becoming part of Giant Swarm means that, by extension, you also become part of the Cloud Native community. We actively contribute to upstream projects and our quarterly hackathons will give you space to work on out-of-the-box projects. Occasionally, when we, as a team, want to fully focus on one project, we scratch all meetings and routines for a certain time to better focus during our hive-sprints.

  • Continuous learning is important to us - we foster this through bi-yearly personal development talks, a budget for training/certifications/coaching as well as regular feedback talks and workshops. Our teams are cross- functional and collaboration is key. 

  • Nothing crazy, but useful Basics: We currently operate on a 32 hour workweek (or 4 day workweek, you decide!). We don't count holidays but set a minimum number; You choose your own hard- and software; As a company that has almost, if not more, kids than employees, family-friendliness is crucial to us and paid parental leave is a no-brainer; We pay monthly perks that cover your costs for working remotely; We meet twice a year as an entire company and (if possible) see conferences as an important place to catch up with team members; We aim to be fully transparent (finance, salaries) unless it hurts people and trust you, based on this to make the best decisions

We failed in exactly describing our way to approach important company elements that can be described with ‘buzzwords’ such as agile mindset, cross-functional teams, self-organization, value of the individual or trust & teamwork. However, we truly care about them, we live them and we constantly iterate on them. Some snippets about how we do this are posted in our blog but by far not all of them. 

Important note: We are not hiring job descriptions. We hire humans. :) We welcome applications from everybody, regardless ethnic or national origin, religion, gender identity, sexual orientation or age.

+30d

Site Reliability Engineer

2 years of experienceagileuijavatypescriptlinuxangularjenkinspythonAWS

Evertz Microsystems Limited is hiring a Remote Site Reliability Engineer

Site Reliability Engineer - Evertz Microsystems Limited - Career Page

See more jobs at Evertz Microsystems Limited

Apply for this job

+30d

(Senior) Site Reliability Engineer (f/m/d)

terraformnosqlDesignmobilemetaldockerelasticsearchkuberneteslinux

The Jodel Venture GmbH is hiring a Remote (Senior) Site Reliability Engineer (f/m/d)

Company Description
Jodel was launched in 2014. The idea of Jodel started when we realised that despite countless products in tech, there was no fast and simple way to connect with people around us.

The people behind Jodel are as diverse, creative and friendly as our users. We are a 90+ people strong team with over 25+ nationalities represented in our Berlin HQ and all over the world. Each individual brings a different perspective and approach to the table - but all have the same drive and motivation to create the world’s hyperlocal community. Together, we are building the most successful social media platform in Europe, while still enjoying ourselves in this exciting journey.

Now we are looking for our next colleague to join the Jodel team.
Your mission
  • Define, own and execute on our infrastructure and systems design strategy to support our ambitious growth plans.
  • Become an integral part and play a key role in our migration to AWS.
  • Work within a cross-functional product team and closely collaborate with stakeholders from across the company.
  • Consult other engineers on technical topics and enable them to take full ownership of their infrastructure.
  • Improve your team’s efficiency and reduce toil by creating reliable automation for recurring tasks.
  • Ensure the architecture is extensible, cost-efficient and future proof by establishing and asserting standards.
  • Own KPIs like cost, uptime or average response time.
  • Develop and practice disaster recovery scenarios.
  • Assess, manage and mitigate security risks.
Your profile
  • Solid experience with AWS.
  • Profound knowledge of Linux, CI/CD, Docker, Kubernetes, NoSQL and relational databases, networking, security, Terraform, configuration management, well-known monitoring, logging and alerting solutions e.g. ElasticSearch, Grafana, Prometheus.
  • Ability to analyze and write source code in at least major programming language.
  • Former experience in pragmatically migrating bare metal infrastructure into the cloud is a plus.
  • Team player with a DevOps mindset.
  • Strong analytical skills.
  • Structured, self-sufficient & solution-oriented approach. You will be the one in the driving seat!
  • Strong written and verbal communication skills in English.
What we offer
  • SOME OF OUR BENEFITS
    • Learning & Development Budget - 600 EUR
    • Company Funded Pension
    • BVG transportation ticket discount for Berlin
    • Company trips and offsite events
    • Nilo - employee Mental Health Support & Personal Development sessions
    • Kima Ventures Network
    • Hybrid working set-up
  • Career building is part of the deal - you don't join to just write tests and contribute to the product - you also join to improve your career. We pay special attention to your personal development and make sure you're focusing on the skills that matter the most to you.
  • United in diversity - with people from all over the world, from Tunisia to France, from India to Poland, we are multicultural by default and proud to be so. We all come from different walks of life and cultural backgrounds, and we continue to push for diversity in our team!
  • We're building a team, not just making money - we solve hard problems together but we also relax and have fun. From team cooking to going on company trips, from office parties to go-karting, we'll make sure you have a good work-life balance.
  • No one else is doing what we're doing - it's that simple. We're pioneering local communication and since communication is a basic human need, our work is just very very exciting. On top of that, we are one of the few large scale European Social Networks. There’s not that many and we're very proud to be "Made in Europe".
About us
Jodel is the world’s hyperlocal community.

Through its state-of-the-art mobile platform, it enables its users to discover, follow, and participate in-real-time in the most relevant conversations with people nearby. We have millions of active users across the Nordics, DACH and middle-east and continue to expand globally.

Our vision is that you can open Jodel at any time and anywhere in the world, to easily talk and connect with people around you.

You will be able to fully explore the city you live in, listen in on all its vibrant communities and connect with your peers. No matter if it’s other pet lovers nearby or the local techno crowd: Jodel allows you to share memes and jokes, get news, ask questions and simply have fun. You can discuss everything from new hypes to modern-dating, organize help for social causes. And by doing all that you might even get to know new amazing people in your area.

Jodel will be closing the gap to easily share a togetherness with all the people you see in the physical world. As this works in your hometown, so it does when moving somewhere else. Whenever you’re travelling, you can be certain there are people around you that wonder about the same things. Is it still worth it to go to the Full Moon party here in Ko Pha-ngan or what are the best tricks for negotiations on the Marrakech market? And maybe, one day, you think about moving to New York so you teleport to its local feed and explore.

Come as you are! 
At Jodel everyone is welcome, regardless of gender identity, nationality, age, disability status, sexual orientation or religion. Jodel is an equal opportunity employer and believes that a great working environment reflects a diversity of backgrounds, experience, talent and thoughts. We will not tolerate discrimination or harassment based on any of these characteristics. All you need is a passion for local communities and a desire to be part of a fast growing startup.

See more jobs at The Jodel Venture GmbH

Apply for this job

+30d

Site Reliability Engineer, evertz.io (Poland)

2 years of experienceagile3 years of experienceuiscrumjavatypescriptlinuxangularjenkinspythonAWS

Evertz Microsystems Limited is hiring a Remote Site Reliability Engineer, evertz.io (Poland)

Site Reliability Engineer, evertz.io (Poland) - Evertz Microsystems Limited - Career Page

See more jobs at Evertz Microsystems Limited

Apply for this job