ansible Remote Jobs

217 Results

+30d

Copy of Senior Site Reliability Engineer - Brazil

PodiumRemote, Brazil
Bachelor's degreeterraformDesignansibleazurerubydockerkuberneteslinuxpythonAWS

Podium is hiring a Remote Copy of Senior Site Reliability Engineer - Brazil

At Podium, our mission is to help local businesses win. Our lead conversion platform, powered by AI and integrations, helps local businesses convert leads faster, communicate easier, and make more sales. Every day, thousands of local businesses utilize our review management, communication, marketing, and payments products. 

Our work and focus on helping local businesses thrive has been recognized across the industry, including Forbes’ Next Billion Dollar Startups, Forbes’ Cloud 100, the Inc. 5000, and Fast Company’s World’s Most Innovative Companies.

At Podium, we believe in fostering a culture that thrives on hiring and developing exceptional talent. Our operating principles serve as a compass, guiding daily behavior and decision-making, and ensure we hire people who will thrive at Podium. If you resonate with our operating principles and are energized by our mission, Podium will be a great place for you!

The Role:

A Site Reliability Engineer borders the worlds of software engineering and systems engineering. At Podium, the SRE team drives our products to success by building a stable, scalable, sustainable, and slick system. We permanently sit and sup with the product engineering teams to address all of their needs, and work as an SRE guild to build a world-class platform for our products to run on. We're currently targeting a senior SRE to come in and deliver impact from day one.

What you will be doing: 

  • Work with the following technologies: Kubernetes, Helm, Docker, AWS, Terraform, Datadog, Prometheus, Ansible, StrongDM, Python, Go, Ruby, GitLab and GitLab CI.
  • Engaging with Podium's engineering community to identify potential areas of improvement or pain points and making Podium's systems safer and more pleasant to operate.
  • Participating in an on-call rotation for the services the team owns, triaging and addressing production as well as development issues.
  • Working cross-functionally with different teams to make sure that there is no down time for our products.
  • Mentoring junior engineers on the team.

What you should have: 

  • Bachelor’s degree in a technical field or relevant work experience.
  • 4+ years experience working alongside a production system in either a software engineer or systems engineer type role
  • 3+ years deploying, operating and debugging server software on Linux
  • Curiosity and the desire to learn
  • Ability to take a rotating on-call shift

What we hope you have: 

  • Experience with distributed systems and microservices
  • Practical knowledge of system design
  • Cloud computing, such as AWS, GCP, or Azure
  • SOC2, HIPAA, PCI, or other regulatory or compliance standards
  • Building and maintaining a CI/CD pipeline
  • Heavy Infrastructure experience 

See more jobs at Podium

Apply for this job

+30d

DevOps Infrastructure en alternance H/F

DevoteamLevallois-Perret, France, Remote
DevOPSagileterraformansibleazuredockerAWS

Devoteam is hiring a Remote DevOps Infrastructure en alternance H/F

Description du poste

Participer à notre démarche DevOps via les tâches suivantes :

 

-Mise en place d'une démarche Infrastructure as Code pour les environnements Cloud Azure & AWS
-Administration du tenant Azure DevOps
-Mettre en place et faire évoluer les « use cases » 
-Travailler en mode agile via des sprints
-Travailler conjointement avec l'équipe sécurité dans une démarche DevSecOps
-Automatiser les tâches d'exploitation des équipes infrastructures
-Participer à la mise en place de pipeline de déploiement "Cloud" (Infra as Code, Kubernetes/Docker, cloud AWS)
-Participer au développement des automatisations via Terraform
-Développer les tests automatisés

Qualifications

Un fort intérêt pour les technologies suivantes est souhaitable :

 

Intégration continue et gestionnaire de code : Azure DevOps, GitHub, GitLab CI
Analyse Qualité et sécurité (SAST) : SonarQube
Automatisation : PowerShell, Terraform et Ansible
Langages : YAML
Plate-formes et Runtimes : Container (Docker & K8S)

See more jobs at Devoteam

Apply for this job

+30d

Senior Software Engineer (K8)

AcquiaRemote - India
DevOPSEC2golang9 years of experience6 years of experience3 years of experienceterraformDesignansibleazurerubydockerkubernetesjenkinsAWSPHP

Acquia is hiring a Remote Senior Software Engineer (K8)

Senior Software Engineer, Observability Team

Department: Engineering

Acquia is an open source digital experience company. We provide the world's most ambitious brands with technology that allows them to embrace innovation and create customer moments that matter. At Acquia we believe in the power of community and collaboration - giving our customers the freedom to build tomorrow on their terms.

Headquartered in Boston, we have been named as one of North America’s fastest growing software companies as reported by Deloitte and Inc. Magazine, and have been rated a leader by the analyst community and named one of the Best Places to Work by the Boston Business Journal. We are Acquia. We are building for the future of the web, and we want you to be a part of it.

Acquia’s products run 100% on Amazon Web Services using EKS, EC2, CloudFormation, Terraform and various other technologies and best practices. Since each product is built and maintained by its own engineering team, the ideal candidate for this position would need to be proactive in familiarizing themselves with those services and have the ability to coordinate and collaborate with multiple teams.

About the Team: The Observability team plays a pivotal role in ensuring the smooth functioning and performance optimization of all our systems. We are a dynamic team of engineers dedicated to providing centralized Observability solutions to empower all teams within the company. We are seeking a highly skilled and experienced Senior Software Engineer to join our team. As a Senior Engineer, you will play a key role in designing, implementing, and maintaining systems and tools to ensure the reliability, performance, and scalability of our infrastructure and applications.

As a Senior Software Engineer, you will…

  • Lead the design and implementation of observability solutions, including monitoring, logging, and tracing systems along with a wide range of core internal systems. Work with your team to develop far reaching modules that have scalability and availability at their core
  • Collaborate with cross-functional teams in deciding, developing integrations with other subsystems and best practices for both current and future infrastructure needs at a scale.
  • Develop and maintain monitoring dashboards and alerts to provide actionable insights into the system health and performance.
  • Automate the observability process to improve efficiency and scalability.
  • Conduct in-depth performance analysis and troubleshooting to identify and resolve issues proactively, ensuring minimal impact on operations.
  • Maintain an understanding of system functionality and architecture, with a strong focus on the operational aspects of the service (availability, performance, change management, emergency response, capacity planning, etc)
  • Stay abreast of industry trends and emerging technologies in observability, and make recommendations for adoptions to enhance our systems.
  • Provide product support to internal and external stakeholders
  • Work in a team environment where your team owns and operates the services you build

You’ll enjoy this role if you…

  • Like solving complex challenges for scalable, low latency systems
  • Enjoy solutioning for a Cloud native environment
  • Enjoy collaborating with multiple stakeholders
  • Have a passion for DevOps & SRE practices

 

What you’ll need to be successful…

  • Have 5+ years of software development experience with time spent working on Cloud technologies (AWS, Google Compute, Azure) at large scale. AWS with Kubernetes is greatly preferred.
  • Proficiency in programming languages such as Golang, PHP, Ruby or similar.
  • Comfortable navigating & troubleshooting unix/linux based operating systems.
  • Strong understanding of monitoring and logging technologies, such as Prometheus, OpenTelemetry, Fluentd, Collectd, Grafana, ELK Stack, or similar.
  • Familiarity with Sumo Logic, New Relic, Dynatrace, Cloudwatch, Splunk, Nagios.
  • Strong interest in building and operating distributed systems and/or service oriented architectures.
  • Passion for Devops processes and tools (Jenkins), distributed configuration management systems (Ansible, Puppet) and maintaining infrastructure as code (Terraform, Cloudformation)
  • Excellent problem-solving skills, attention to detail, and ability to work independently as well as part of a team.
  • Strong communication and collaboration skills, with ability to effectively interact with stakeholders across different teams and levels.

 

Extra credit if you…

  • Certifications in relevant technologies (AWS, CKAD, CKA, etc)
  • Have hands on experience with Docker, K8s or equivalent
  • Have a mindset to automate repetitive tasks  

 

Acquiais an equal opportunity (EEO) employer. We hire without regard to age, color, disability, gender (including gender identity), marital status, national origin, race, religion, sex, sexual orientation, veteran status, or any other status protected by applicable law.

 

See more jobs at Acquia

Apply for this job

+30d

Ingénieur cloud - Summer Job Dating

DevoteamTunis, Tunisia, Remote
DevOPSterraformansibleazuregitdockerkuberneteslinuxjenkinspythonAWS

Devoteam is hiring a Remote Ingénieur cloud - Summer Job Dating

Description du poste

???? Missions

  • Accompagner les équipes de développement par la mise en place de pipelines CI/CD, d’Infra As code et de conteneurs ainsi que des micro-services,

  • Participer à la conception de solutions techniques et fonctionnelles sur des environnements On-Premise ou de Cloud public (principalement AWS et dans une moindre mesure GCP),

  • Mise place et configuration d’outils d’automatisation pour accélérer le déploiements des applicatifs

  • Industrialiser et améliorer l’architecture technique, les outils et les processus,

  • Participer à l’amélioration continue des plateformes/architecture

  • Participer à la mise en place ou configuration outils d’observabilité en fonction de l’environnement du client

  • Sensibiliser les équipes sur les démarches d’Intégration Continue et de Déploiement Continu et à la philosophie DevOps en tant que tel

  • Tenue d’entretiens techniques pour aider Revolve à grandir

  • Participation en shadow à des avant-ventes afin d’aider à la conception de solutions techniques pour nos futurs clients

Stack technique :

  • Cloud : AWS,GCP, Azure
  • Scripting : Python, Bash,Powershell
  • Programmation : Python,
  • Infrastructure As Code : Terraform, Cloudformation
  • Configuration management : Ansible, Packer
  • CI/CD : GitlabCI, Jenkins, AWS CodePipeline, CodeDeploy etc…
  • Containers : Docker, Kubernetes
  • Versioning : Git
  • Bonne connaissance en systèmes (Linux ou Windows)
  • Bonne connaissance en réseau, notamment sur le modèle TCP/IP

Qualifications

???? Compétences

  • Vous avez une formation supérieure (Bac+5) et vous disposez d’une  expérience de 3 ans dans un environnement Cloud, incluant infrastructure, déploiement, migration, et DevOps.
  • Une certification Cloud (GCP, AWS, Azure, RedHat) serait un atout supplémentaire.
  • Vous êtes doté(e) d’un excellent relationnel et d’un sens prononcé du service et de la qualité.
  • Vous appréciez le travail en équipe.
  • Curieux(se), autonome et à l’écoute, vous possédez un réel esprit d’analyse.
  • Vous êtes désireux(se) de vous investir dans des projets challengeants.

N’attendez plus et postulez à l’offre!

See more jobs at Devoteam

Apply for this job

+30d

Distributed Cloud | AWS DevOps Engineer

DevoteamLisboa, Portugal, Remote
DevOPSS3EC2agilejiraterraformDesignansiblegitdockerkubernetesjenkinspythonAWS

Devoteam is hiring a Remote Distributed Cloud | AWS DevOps Engineer

Job Description

  • As a DevOps / Systems Engineer, you will be responsible for our core AWS infrastructure and constantly improve it in terms of automation, resilience and robustness. Your tasks and responsibilities will include:
  • Contribute to the design of a secure, scalable and robust infrastructure code base
  • Continuously improve our CI/CD automation to speed up the deployment cycles
  • Continuously improve our cloud infrastructure logging/auditing capabilities
  • Collaborate with our Engineering team to identify and solve issues and tasks that can
  • be automated
  • Ensure the security of our cloud infrastructure and services, including reliable backup and real-time monitoring
  • Continuously improve and test our backup/recovery strategy
  • Contribute with your input to the definition of our internal roadmap

Qualifications

  • Strong AWS experience especially in the following services: VPC, EC2, IAM, Terraform, S3, Route53, Security
  • Good Knowledge of IaC Terraform
  • Strong background in DevOps related practices such as CI/CD, infrastructure automation, and infrastructure as code
  • Experience in Configuration Management tools (Ansible preferred) and scripting languages (bash or python, etc)
  • Experience with continuous integration tools, like GIT, JIRA, Jenkins, Maven, Docker, Kubernetes, Openshift;
  • Knowledge of web architectures and services (HTTP, SOAP, REST, JSON, etc.);
  • Knowledge of agile development methodologies (nice to have);
  • Ability to think creative;
  • Strong attention to detail;
  • Strong time management skills;
  • Excellent interpersonal skills;
  • Proficiency in English (both spoken and written).

See more jobs at Devoteam

Apply for this job

+30d

Application Performance and Security Solutions Engineer

SalesDjangoterraformsqloraclelaravelDesignansibleazuregraphqlgitrubyangularAWSjavascript

Cloudflare is hiring a Remote Application Performance and Security Solutions Engineer

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

Application Performance and Security Solutions Engineer

 

About Us

At Cloudflare, we have our eyes set on an ambitious goal: to help build a better Internet. Today the company runs one of the world’s largest networks that powers trillions of requests per month. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare have all web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was recognized by the World Economic Forum as a Technology Pioneer and named to Entrepreneur Magazine’s Top Company Cultures list.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!  

 

What you'll do as a Application Performance and Security Solutions Engineer

You will work with our clients to understand their business needs and understand their current applications landscape with the goal of helping them develop a security posture that meets their requirements while also keeping their applications and assets safe according to the most recent standards in cyber security, with high levels of reliability and performance.

You will work closely with our Account Executives (AEs); generalist SEs, as well as every other team at Cloudflare, from Sales and Product to Engineering and Customer Support. 

 

Responsibilities

You serve as a trusted advisor to our customers; helping them find the best solution for their business needs with Cloudflare.

  • Guide qualified prospects through the Solution Design and Proof of Concept stages of sales engagements; showing them the key capabilities of the Cloudflare One portfolio
  • Work with your Sales Partners to consistently achieve Pipeline and Revenue goals
  • Understand our customers IT landscape and security requirements and develop an architecture for our customers to support their business needs
  • Articulate the benefits of the Cloudflare application performance and security product portfolio vs competing solutions
  • Promote retention by capturing and communicating gaps in product or features
  • Collaborate and coordinate with account sales/pre-sales teams on opportunities and liaise with product management and support through the deal cycle.

 

Skill Requirements

  • Curiosity and Learning Agility
  • 3+ years of prior Technical Sales, Systems Engineering, or related experience
  • Experience interacting with senior level/executives to communicate a message of network and security transformation
  • Familiar with trends and attacks in cyber security (e.g. DDOS, SQL Injection, Cross Site Scripting, etc)
  • Hands-on experience with Cloud based architectures (e.g AWS, Azure, GCP)
  • Detailed understanding of workflow from user to application including hybrid architectures with Azure, AWS, GCP.
  • Knowledge of the common web application frameworks and practices (JAMstack, Micro-services, GraphQL, event-driven architecture…)
  • Knowledge of one or more common web frameworks (Django, Ruby on Rails, Laravel, Angular, React…)
  • Experience with web performance testing and optimization such as RUM, Synthetic testing and deep understanding of the protocol stack (DNS, TCP, HTTP, SSL…)
  • Experience with Application, Security, Performance and Reliability products such as WAF, Bot Management, DDOS, Firewall, DNS and CDN.
  • Experience with sophisticated Botting technologies and technics (e.g Residential proxies, Headless browsers, fingerprinting rotation, click-farms)
  • Fundamental understanding of customer network and application architectures. 
  • Experience with JavaScript, Restful APIs, JSON,  and code versioning solutions (e.g. Git, Gitlab, Bbitbucket) and scripting languages
  • Hands-on experience with infrastructure-as-code (e.g Terraform, Pulumi, Ansible, Puppet, Chef)
  • Fundamental understanding of the networking concepts (e.g BGP, TCP, Anycast, IP)
  • Stay up to date not only with Cloudflare’s specific products, but with industry trends.
  • Ability to jump in with hands-on configuration when needed
  • Ability to manage a project, work to deadlines, and prioritize between competing goals

 

Other desirable skills areas include:

  • Industry Certification, e.g. CISSP, CISA, CEH or OSCP
  • Cloud Provider certification e.g AWS, GCP, Azure, OVH, Oracle OCI
  • Experience in one or two industry verticals (Industry, Retail, Banking, Automotive, Supply-chain, Insurance…)
  • Certification with any of the major Cloud Provider

 

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

Path Forward Partnership: Since 2016, we have partnered with Path Forward, a nonprofit organization, to create 16-week positions for mid-career professionals who want to get back to the workplace after taking time off to care for a child, parent, or loved one.

1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer.  We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness.  All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

See more jobs at Cloudflare

Apply for this job

+30d

Senior Systems Engineer (Pre-Sales)

AristaDelhi, India, Remote
SalesDesignansiblepython

Arista is hiring a Remote Senior Systems Engineer (Pre-Sales)

Job Description

Who You'll Work With

When you join Arista as part of the Sales Engineering team, you're not just stepping into a role; you're becoming part of a team of industry experts and technical leaders. Typically reporting to the System Engineering Manager, you'll be working alongside some of the most skilled field engineers in the industry. Our team prides itself on not just understanding the technology but also the business impact and aligning our solutions with the larger goals of our clients. In collaboration with our Product Management and Software Development teams, you will play a pivotal role in steering product developments, offerings, and strategic direction to best serve our customers and prospects. Supported by the expertise of our world class Arista TAC, cutting-edge proof-of-concept resources, and support of the executive team, you are well-positioned to lead and innovate within the industry.

What You’ll Do

We are experiencing tremendous growth and have an immediate need for a collaborative, self-motivated Senior Systems Engineer to partner with our Account teams to provide pre-sales technical systems engineering support for our enterprise/commercial customers in the Pittsburgh area.

  • The Systems Engineer is a critical component of the Arista Sales team with the key responsibility of acting as a trusted advisor for our customers to gather requirements and identify opportunities with existing and new customers.

  • You will partner with the Arista Account Managers to understand customer pain points and conduct white board network architectural reviews in addition to conducting Arista product presentations of Arista’s Open Networking Data Centre and Cognitive Campus (including Wi-fi) networking solutions, CloudVision (Network Automation), Security (Network Detection & Response), Endpoint Security and Real-time Fabric Monitoring solutions.

  • You will architect, design and propose Arista Data Centre & Campus network solutions using leaf-spine architectures (VxLAN, EVPN) and network overlays to capture additional sales.

  • Perform hands-on tests to validate customer proof-of-concept setups, Data Centre and/or Campus network designs, and network deployments using new products and features

  • Put together design guidelines and recommend improvements to customers for the networks they support

  • Partner with Sales Team to respond to RFP/RFQs

  • Provide feedback to Product Management and Engineering

  • Represent Arista at SDN and Open Networking industry events and conferences

  • Keep up-to-date on competitive solutions, products, and services

  • Author white papers on technology and product

Qualifications

  •  
  • BE/BS/CS/CE technical degree required
  • Network Industry Certification preferred (e.g. CCIE (R&S), JNCIE)
  • You possess a minimum of 10+ years of L2/L3 networking design and implementation experience with a focus on Data Centre and Campus networks. 
  • You possess expert level expertise in routing and switching including L2/L3 protocols (Cisco, Juniper, Extreme, Aruba) 
  • Demonstrated work experience as either a Sales Engineer, Solutions Architect, Pre-Sales SE or Network Consulting Engineer preferred
  • Previous experience with network overlays preferred. 
  • Expert knowledge in three or more of the following areas: Ethernet, RSTP/ MSTP, VLANs, IP Routing, TCP/IP, BGP, eBGP, VxLAN, EVPN, Multicast, Spanning Tree, QoS
  • Expert-level knowledge of industry-standard CLI
  • Experience with SDN and Network Function Virtualization (NFV) highly desired. 
  • Previous experience building network automation using Python and Ansible desired.
  • Knowledge of competitive products, solutions, and services
  • Ability to write white papers a plus
  •  

Apply for this job

+30d

Production Operations Engineer (5216)

DevOPSBachelor's degreeDesignansibleazurec++elasticsearchlinuxjenkinspythonAWS

MetroStar Systems is hiring a Remote Production Operations Engineer (5216)

As Production Operations Engineer, you’ll  be a passionate and experienced ProdOps Lead who thrives in a leadership role.  The ideal candidate will have a deep-seated passion for technology and excel in managing operations at scale. This role requires a strategic mind that can balance day-to-day operational responsibilities with long-term improvement initiatives.

We know that you can’t have great technology services without amazing people. At MetroStar, we are obsessedwithour people and have led a two-decade legacy of building the best and brightest teams. Because we know our future relies on our deep understanding and relentless focus on our people, we live by our mission: A passion for our people. Value for our customers.

If you think you can see yourself delivering our mission and pursuing our goals with us, then check out the job description below!

What you’ll do:

Leadership and Team Management:

  • Lead, mentor, and manage a dynamic DevOps and ProdOps teams
  • Collaborate with cross-functional teams including Development and Product Management to ensure seamless delivery and support processes
  • Oversee Kanban teams to ensure efficient workflow and task management

Technical Expertise:

  • Administer and optimize Linux servers
  • Develop, maintain, and troubleshoot scripts using Bash and/or Python
  • Manage and automate configuration using Ansible
  • Implement and manage GitOps workflows using GitLab
  • Oversee CI/CD pipelines with Jenkins for reliable and efficient software delivery

Cloud and Infrastructure:

  • Design and manage cloud infrastructure (AWS, Azure, GCP)
  • Implement and manage certificate management processes
  • Implement robust monitoring and alerting systems using Elasticsearch and modern tools
  • Manage elastic search clusters and ensure optimal performance

Incident and Change Management:

  • Lead incident management processes, ensuring timely resolution and root cause analysis
  • Manage change control processes to ensure stability and reliability of production environments
  • Develop and maintain documentation for best practices, operational procedures, and incident reports

Client and Stakeholder Management:

  • Interface with clients and stakeholders to understand requirements, constraints, and deliverables
  • Communicate effectively with clients to provide regular updates and status reports

What you’ll need to succeed:

  • 4+ years of experience in Linux, Bash, Python, Elasticsearch, certificate management, GitLab, Jenkins, Ansible, and Incident Management
  • Demonstrated expertise in managing cloud environments (AWS)
  • Proficient in GitOps methodologies and tools
  • Extensive hands-on experience with CI/CD pipelines
  • Strong scripting and automation skills
  • Proven ability to lead and inspire technical teams
  • Excellent problem-solving abilities and strategic thinking
  • Strong communication and collaboration skills
  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent work experience
  • Ability to obtain Public Trust clearance

Like we said, we arebig fans of our people. That’s why we offer a generous benefits package, professional growth, and valuable time to recharge. Learn more about our company culture code and benefits. Plus, check out our accolades.

Don’t meet every single requirement? 

Studies have shown that women, people of color and the LGBTQ+ community are less likely to apply to jobs unless they meet every single qualification.  At MetroStar we are dedicated to building a diverse, inclusive, and authentic culture, so, if you’re excited about this role, but your previous experience doesn’t align perfectly with every qualification in the job description, we encourage you to go ahead and apply.  We pride ourselves on making great matches, and you may be the perfect match for this role or another one we have. Best of luck! – The MetroStar People & Culture Team

What we want you to know:

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.

MetroStar Systems is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. The statements herein are intended to describe the general nature and level of work being performed by employees and are not to be construed as an exhaustive list of responsibilities, duties, and skills required of personnel so classified. Furthermore, they do not establish a contract for employment and are subject to change at the discretion of MetroStar Systems.

Not ready to apply now?

Sign up to join our newsletter here.

"EEO IS THE LAW MetroStar Systems, LLC (MetroStar) invites any employee and/or applicant to review the Company’s Affirmative Action Plan. This plan is available for inspection upon request by emailing msshr@metrostar.com."

See more jobs at MetroStar Systems

Apply for this job

+30d

Cloud NetOps Engineer

In All Media IncArgentina - Remote
DevOPSS3EC2LambdaterraformDesignansiblelinuxpythonAWS

In All Media Inc is hiring a Remote Cloud NetOps Engineer

Job Summary:

We are seeking a highly skilled Cloud NetOps Engineer to design, deploy, and manage our scalable, secure, and high-availability AWS cloud infrastructure. The ideal candidate will have extensive experience in network engineering, security solutions implementation, automation, scripting, system administration, and monitoring and optimization.

Key Responsibilities:

Cloud Infrastructure Management:

  • Design, deploy, and manage scalable, secure, and high-availability AWS cloud infrastructure.
  • Optimize AWS services (EC2, VPC, S3, RDS, Lambda, etc.) to ensure efficient operation and cost management.

Network Engineering:

  • Configure, manage, and troubleshoot network routing and switching across cloud and on-premises environments.
  • Implement and maintain advanced network security solutions, including firewalls, VPNs, and intrusion detection/prevention systems.

Security Solutions Implementation:

  • Develop and implement end-to-end network security solutions to protect against internal and external threats.
  • Monitor network traffic and security logs to identify and mitigate potential security breaches.

Automation and Scripting:

  • Automate infrastructure provisioning, configuration management, and deployment processes using tools such as Terraform and Ansible.
  • Develop custom scripts and tools in Python to improve operational efficiency and reduce manual intervention.
  • Implement automation strategies to streamline repetitive tasks and enhance productivity.

System Administration:

  • Perform system administration tasks for Linux servers, including installation, configuration, maintenance, and troubleshooting.
  • Manage and integrate Active Directory services for authentication and authorization.

Firewall and Security Management:

  • Administer and troubleshoot Palo Alto firewalls and Panorama for centralized management and policy enforcement.
  • Manage Cisco Meraki wireless and security stacks, ensuring robust network performance and security compliance.

Monitoring and Optimization:

  • Implement monitoring solutions to track performance metrics, identify issues, and optimize network and cloud resources.
  • Conduct regular performance tuning, capacity planning, and system audits to ensure optimal operation.

Collaboration and Support:

  • Work closely with cross-functional teams, including DevOps, Security, and Development, to support infrastructure and application needs.
  • Provide technical support and guidance to internal teams, ensuring timely resolution of network and system issues.

Documentation and Compliance:

  • Maintain comprehensive documentation of network configurations, infrastructure designs, and operational procedures.
  • Ensure compliance with industry standards and regulatory requirements through regular audits and updates.

Continuous Improvement:

  • Stay updated with the latest trends and technologies in cloud computing, networking, and cybersecurity.
  • Propose and implement improvements to enhance system reliability, security, and performance.

Qualifications:

  • Bachelor’s degree in computer science, Information Technology, or a related field.
  • Proven experience as a Cloud Engineer, Network Engineer, or similar role.
  • Strong knowledge of AWS services and cloud infrastructure management.
  • Proficiency in network engineering, including routing, switching, and security solutions.
  • Experience with automation tools such as Terraform, Ansible, and scripting languages like Python.
  • Solid system administration skills, particularly with Linux servers.
  • Experience managing firewalls and security solutions (e.g., Palo Alto, Cisco Meraki).
  • Strong problem-solving skills and the ability to work in a collaborative environment.
  • Excellent documentation and communication skills.

Preferred Qualifications:

  • AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified SysOps Administrator).
  • Familiarity with DevOps practices and tools.
  • Knowledge of regulatory requirements and compliance standards (e.g., PCI, CIS).

See more jobs at In All Media Inc

Apply for this job

+30d

Ingénieur DevOps | Summer Job Dating

DevoteamTunis, Tunisia, Remote
DevOPSterraformansible

Devoteam is hiring a Remote Ingénieur DevOps | Summer Job Dating

Description du poste

Vos principales responsabilités en tant que Devops CI/CD Engineer

Voici une liste non exhaustive de vos missions au quotidien, nous vous faisons confiance pour les prendre en main et les enrichir à votre façon ????

  • Accompagner nos clients dans la mise en pratique de la méthodologie DevOps : versionning et stratégie de développement, intégration continue, déploiement continu, Infrastructure as Code.

  • Implémenter chez nos clients les outils nécessaires à la mise en place des pratiques et outils DevOps (Terraform, Ansible, Puppet, Chef, Gitlab CI, ... ).

  • Concevoir et mettre en œuvre des solutions techniques éditeurs ou open source dans des environnements Cloud Hybrides et veiller à l’efficacité de ces dernières.

  • Intervenir dans des écosystèmes techniques DevOps et des plateformes de CI/CD complexes pour des milliers d’utilisateurs.

  • Contribuer à des missions intégrées aux équipes Client pour développer des applications adaptées à la méthodologie DevOps.

Où réaliserez-vous vos missions ? Chez des clients grands comptes de la banque, de l’assurance, de l’industrie, du retail, de la défense, du luxe ou encore de l’énergie, porteurs de projets innovants.

Qualifications

???? Compétences

Quels atouts pour rejoindre l’équipe ?

Diplômé.e d’une école d’ingénieurs ou d’un Master 2 en informatique, vous êtes doté.e d’un excellent relationnel, d’un sens prononcé du service et de la qualité.

Vous avez minimum 4 années d’expérience professionnelle en tant que DevOps/SRE, et êtes issu.e du monde du développement ou de l'administration système.

Vous êtes passionné.e par l’automatisation et l’amélioration continue et avez développé des compétences en scripting.

Vous avez déjà expérimenté la mise en place d’outils de l’écosystème DevOps CI/CD, idéalement en production.

Alors, si vous souhaitez progresser, apprendre et partager, rejoignez-nous !

See more jobs at Devoteam

Apply for this job

+30d

Ingénieur cloud/Ingénieure cloud - Summer Job Dating

DevoteamTunis, Tunisia, Remote
DevOPSterraformansibleazuregitdockerkuberneteslinuxjenkinspythonAWS

Devoteam is hiring a Remote Ingénieur cloud/Ingénieure cloud - Summer Job Dating

Description du poste

Missions

  • Accompagner les équipes de développement par la mise en place de pipelines CI/CD, d’Infra As code et de conteneurs ainsi que des micro-services,

  • Participer à la conception de solutions techniques et fonctionnelles sur des environnements On-Premise ou de Cloud public (principalement AWS et dans une moindre mesure GCP),

  • Mise place et configuration d’outils d’automatisation pour accélérer le déploiements des applicatifs

  • Industrialiser et améliorer l’architecture technique, les outils et les processus,

  • Participer à l’amélioration continue des plateformes/architecture

  • Participer à la mise en place ou configuration outils d’observabilité en fonction de l’environnement du client

  • Sensibiliser les équipes sur les démarches d’Intégration Continue et de Déploiement Continu et à la philosophie DevOps en tant que tel

  • Tenue d’entretiens techniques pour aider Revolve à grandir

  • Participation en shadow à des avant-ventes afin d’aider à la conception de solutions techniques pour nos futurs clients

Stack technique :

  • Cloud : AWS,GCP, Azure
  • Scripting : Python, Bash,Powershell
  • Programmation : Python,
  • Infrastructure As Code : Terraform, Cloudformation
  • Configuration management : Ansible, Packer
  • CI/CD : GitlabCI, Jenkins, AWS CodePipeline, CodeDeploy etc…
  • Containers : Docker, Kubernetes
  • Versioning : Git
  • Bonne connaissance en systèmes (Linux ou Windows)
  • Bonne connaissance en réseau, notamment sur le modèle TCP/IP

Qualifications

  • Vous avez une formation supérieure (Bac+5) et vous disposez d’une  expérience de 3 ans dans un environnement Cloud, incluant infrastructure, déploiement, migration, et DevOps.
  • Une certification Cloud (GCP, AWS, Azure, RedHat) serait un atout supplémentaire.
  • Vous êtes doté(e) d’un excellent relationnel et d’un sens prononcé du service et de la qualité.
  • Vous appréciez le travail en équipe.
  • Curieux(se), autonome et à l’écoute, vous possédez un réel esprit d’analyse.
  • Vous êtes désireux(se) de vous investir dans des projets challengeants.

N’attendez plus et postulez à l’offre!

See more jobs at Devoteam

Apply for this job

+30d

Site Reliability Engineer - Brazil

PodiumRemote, Brazil
Bachelor's degreeterraformDesignansibleazurerubydockerkuberneteslinuxpythonAWS

Podium is hiring a Remote Site Reliability Engineer - Brazil

At Podium, our mission is to help local businesses win. Our lead conversion platform, powered by AI and integrations, helps local businesses convert leads faster, communicate easier, and make more sales. Every day, thousands of local businesses utilize our review management, communication, marketing, and payments products. 

Our work and focus on helping local businesses thrive has been recognized across the industry, including Forbes’ Next Billion Dollar Startups, Forbes’ Cloud 100, the Inc. 5000, and Fast Company’s World’s Most Innovative Companies.

At Podium, we believe in fostering a culture that thrives on hiring and developing exceptional talent. Our operating principles serve as a compass, guiding daily behavior and decision-making, and ensure we hire people who will thrive at Podium. If you resonate with our operating principles and are energized by our mission, Podium will be a great place for you!

The Role:

A Site Reliability Engineer borders the worlds of software engineering and systems engineering. At Podium, the SRE team drives our products to success by building a stable, scalable, sustainable, and slick system. We permanently sit and sup with the product engineering teams to address all of their needs, and work as an SRE guild to build a world-class platform for our products to run on. We're currently targeting a senior SRE to come in and deliver impact from day one.

What you will be doing: 

  • Work with the following technologies: Kubernetes, Helm, Docker, AWS, Terraform, Datadog, Prometheus, Ansible, StrongDM, Python, Go, Ruby, GitLab and GitLab CI.
  • Engaging with Podium's engineering community to identify potential areas of improvement or pain points and making Podium's systems safer and more pleasant to operate.
  • Participating in an on-call rotation for the services the team owns, triaging and addressing production as well as development issues.
  • Working cross-functionally with different teams to make sure that there is no down time for our products.
  • Mentoring junior engineers on the team.

What you should have: 

  • Bachelor’s degree in a technical field or relevant work experience.
  • 4+ years experience working alongside a production system in either a software engineer or systems engineer type role
  • 3+ years deploying, operating and debugging server software on Linux
  • Curiosity and the desire to learn
  • Ability to take a rotating on-call shift

What we hope you have: 

  • Experience with distributed systems and microservices
  • Practical knowledge of system design
  • Cloud computing, such as AWS, GCP, or Azure
  • SOC2, HIPAA, PCI, or other regulatory or compliance standards
  • Building and maintaining a CI/CD pipeline
  • Heavy Infrastructure experience 

See more jobs at Podium

Apply for this job

+30d

Solutions Architect

GremlinRemote, based in the US
SalesDevOPSansibleazurejavakuberneteslinuxjenkinspythonAWS

Gremlin is hiring a Remote Solutions Architect

Today’s complex, fast-paced systems have become a minefield of reliability risks—any of which could cause an outage that costs millions and destroys customer confidence. That’s why high-availability teams use Gremlin to find and fix ‌reliability risks before they become incidents. The Gremlin Reliability Platform helps software teams proactively monitor and test their systems for common reliability risks, build and enforce reliability standards, and automate their reliability practices organization-wide. As the industry leader in Chaos Engineering and reliability testing, we work with hundreds of the world’s largest organizations where high availability is non-negotiable.

About the Role of Solutions Architect

Gremlin’s team is growing, and we’re seeking a passionate Solutions Architect to help prove the value of Reliability Management to customers. In this pre- and post-sales role, you will have the opportunity to demonstrate Gremlin Reliability Management and offer guidance on best practices for building reliable architectures.  As customers convert to a paid subscription, you will advise on how to design and implement experiments to activate customers for their reliability journey.

In this role, you'll get to:

  • Demonstrate Gremlin in customer calls and webinars
  • Partner with sales team to drive technical wins and grow Gremlin’s customer base
  • Participate and lead proof-of-concepts with potential customers
  • Educate potential customers on Reliability and Chaos Engineering
  • Work with existing customers on technical projects and assist in troubleshooting
  • Consult with customers on the resiliency of their applications and architecture, diagnose gaps and recommend solutions
  • Participate in technical workshops and conferences

Collaborate with different functions of the company including Product Marketing, Support, and Engineering

We'll expect you to have:

  • 5+ years of experience as a Solution Architect in a tech company 
  • Excellent verbal and written communication skills
  • Strong problem-solving skills
  • Hands on experience with:
    • Kubernetes Platforms, Managed and Unmanaged
      • AKS, EKS, GKE
      • OpenShift, Rancher
      • Certified k8 Administrator is a plus
      • Linux - Shell scripting, Certified Linux Administrator is a plus
      • Container and Container Runtimes
    • Operating Systems concepts (CPU, Memory, and networking)
  • Working knowledge of :
    • Observability solutions - Application Performance Management
    • Load Testing solutions (e.g JMeter, LoadRunner, Grafana K6)
    • CI/CD and Automation Tools (e.g. Jenkins, Ansible)
    • Service Mesh (e.g. Istio), REST APIs and related tools
    • Familiarity with one or more programming Languages - Python, Java, Go
  • Certification and experience with one or more public cloud providers including Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP)

Bonus experience:

  • Experience in a SRE or DevOps role resolving production outages
  • Knowledge of modern DevOps and SRE tools
  • Integration into ITSM Tools

*The role does not offer sponsorship employment benefits. 

**If you don't think you meet all of the criteria below but still are interested in the job, please apply. Nobody checks every box—we’re looking for candidates that are particularly strong in a few areas, and have some interest and capabilities in others.

Gremlin offers a competitive total rewards package, which includes:

  • Base salary
  • Equity
  • Healthcare, dental, and vision benefits
  • 401(k) with employer match.
  • Variable compensation for specific roles.

Compensation is based on the candidate’s skills and qualifications.

About Gremlin:

Gremlin is a team of industry veterans and people eager to learn from one another. We set the standard for reliability and equip leading organizations with the mindset and expertise needed to drive reliability improvements that move the world forward. We’re backed by top-tier investors Index Ventures, Amplify Partners, and Redpoint Ventures. Our customers love us, and we’re thrilled to be a partner in their success.

What Do We Care About:

  • We Care about our People

People are our critical differentiators. The company strives to treat our people with respect, empathy, and dignity. We expect that our people will treat each other similarly. In both cases, we will assume good intent. All are welcome at Gremlin. We know our differences make us stronger and that our best ideas and contributions can come from anyone at any level.

  • We Care about Collaboration

Gremlin is strongest when we come together as one team with shared goals. Be the glue, not the glitter. But as a remote company, teamwork and collaboration won’t happen by accident. We approach every challenge as a shared challenge. We rely on each other for diverse perspectives and creative ideas. We celebrate our wins as a team.

  • We Care about Results

Be high productivity, low drama. Results matter. To keep our pace, everyone owns the outcomes of their actions and takes action when needed. We reward speed over perfection. We empower each other to iterate and experiment.

You are welcome at Gremlin for who you are. The more voices and ideas we have represented in our business, the more we will all flourish, contribute, and build a more reliable internet. Gremlin is a place where everyone can grow and is encouraged. However you identify and whatever background you bring with you, please apply if this sounds like a role that would make you excited to come into work everyday. It’s in our differences that we will find the power to keep building a more reliable internet by building and designing tools used by the best companies in the world. 

Visit our website to learn more -https://www.gremlin.com/about

See more jobs at Gremlin

Apply for this job

+30d

Site Reliability Engineer - II

Live PersonHyderabad, Telangana, India (Remote)
terraformnosqlpostgressqlansiblemongodbazureelasticsearchMySQLkuberneteslinuxjenkinsAWS

Live Person is hiring a Remote Site Reliability Engineer - II

LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.

At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success, nd reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about.

Overview:

LivePerson is looking for a Site Reliability/DevOps Engineer for the GPT (Global Product & Technology) Division. You will be part of the LivePerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a strong team and enjoy the work environment of a start-up, with a robust product and the benefits of a leading company in its field.

You will: 

  • Ensure product high uptime and reliability 24x7.
  • Manage Linux servers in a multi-cloud environment
  • Manage high availability Kubernetes resources using Helm charts
  • Assist with deploying upgrades and patches using Puppet/Ansible/Chef/Helm
  • Monitoring and troubleshooting warnings and alerts related to the reporting platform’s performance
  • Develop monitoring resources and alerting systems such as Grafana, Prometheus, Kibana, DataDog and PagerDuty
  • Coordinate with DBA and developers to manage SQL and NOSQL database systems, including MongoDB, ElasticSearch, Postgres, MySQL and others
  • Managing message bus systems such as Kafka and Pulsar

You have:

  • Minimum 3+ years of experience of managing cloud based production environment (AWS, GCP, Azure, etc)
  • Highly experienced working in the Linux environment, good scripting in Bash / Python.
  • Highly experienced working configuration management systems like Puppet, OpsCode Chef, Ansible, etc.
  • Strong experience in Terraform, CloudFormation or other IAC
  • Experienced in SQL, including DDL and complex queries
  • Experienced working in the Kubernetes platform
  • Experience working in a microservices architecture using a message bus
  • Good knowledge of CI/CD pipelines orchestrators like TeamCity, Jenkins, Gitlab.
  • Highly motivated and independent.
  • Team player and excellent interpersonal Skills.
  • Excellent written and verbal communication skills.
  • BS in Computer Science or a related field, or equivalent work experience.
  • A strong background in cloud, network and application security and compliance
  • Experience with GPT or other LLMs a strong advantage

Benefits

  • Health: Medical, Dental, and Vision
  • Time away: Vacation and holidays
  • Development: Generous tuition reimbursement and access to internal professional development resources.
  • Equal opportunity employer

Why You’ll Love Working Here

As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace.

Belonging At LivePerson

We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

Apply for this job

+30d

Network Advanced Services Engineer

AristaLondon, United Kingdom, Remote
SalesDesignansibleopenstacklinuxpython

Arista is hiring a Remote Network Advanced Services Engineer

Job Description

Arista seeks an Advanced Services Engineer to provide advanced post-sales support, guidance, and assistance to account teams to address specific customer needs. In this position, you will be working as a technology expert in the Routing & Switching space to design, implement, and support (troubleshoot) our deployments within a number of customer infrastructures. The ideal candidate will also have a level of comfort communicating across all functions within Arista, as well as with clients and partners.

Essential Functions of the Job:

  • You will provide advanced post-sales engineering support for Arista's Open Networking Data Center and Campus networking deployments for our enterprise and commercial customers.
  • Review customer network designs for an EVPN, VxLAN, leaf-spine architecture and make recommendations for deployment
  • Migrate or interconnect to/from Cisco, Juniper, and other vendors to Arista infrastructure
  • Assist with configuration build-outs including creating network provisioning automation using Python and tools such as Chef or Ansible
  • Assist with implementation and change controls
  • You will assist with proof of concepts (POC) and in-depth testing to validate design scenario
  • Provide bug scrubs and code recommendations
  • Provide interface to TAC and internal development teams and the customer
  • You will provide customer advice regarding architectural questions, product prerequisites, product features, etc.
  • Translate complex business requirements into Leaf-Spine Network solutions
  • Assist Pre-Sales Engineer and Account Executives with designing Network solutions
  • Establish and maintaining strong relationships with key partners
  •  Attend key partner events, training sessions, and provide ongoing training with the customer teams globally
  • Continue training to maintain expertise
  • Ability to understand the client’s business objectives and technical needs
  • Ability to meet Service Level Agreements (SLAs) for sales and clients
  • Regularly exercises discretion and independent judgment
  • Maintain professional relationships with teammates, partners, and clients
  • Some travel may be required within assigned territory

Qualifications

Who Are You?

Required Skills and Experience

  • Bachelor’s degree in Computer Science or equivalent
  • Network Industry Certification preferred ACE (Arista Cloud Engineer or equivalent CCIE (R&S), JNCIE)
  • 5+ years’ working experience with network technologies including network design and deployments of Campus and Data Center networks. Knowledge of leaf-spine arhcitectures highly desired. 
  • 5+ years’ minimum experience with Cisco-based technologies focusing on infrastructure and voice
  • Demonstrated experience in technical post-sales, as either a Network Consulting Engineer or as an Advanced Systems (AS) Engineer preferred
  • Experience with Arista/Juniper/Cisco enterprise routing/switching within large data center enterprise customers (Catalyst, Nexus, ASR)
  • Expert knowledge in the following areas: Ethernet, VLANs, VxLAN, EVPN, IP Routing, TCP/IP, OSPF, BGP, eBGP, Multicast, QoS
  • Expertise in at least one area of Data Center related technologies - Openstack, SDN, NFV, Load Balancers, Virtualization, Linux tools
  • Expert level knowledge of industry-standard CLI
  • Ability to write white papers a plus
  • Background in Perl, Python, Scripting for creating network automation is highly desired
  • Excellent customer service and verbal communication skills
  • Excellent written skills and the ability to do related documentation and ticket tracking of opportunities/meeting follow-up

Apply for this job

+30d

Principal Site Reliability Engineer (SRE/DevOps)

TripadvisorOxford, United Kingdom - Hybrid
DevOPSagileBachelor's degreeDesignmobileansiblejavadockerpostgresqlkuberneteslinuxjenkinspython

Tripadvisor is hiring a Remote Principal Site Reliability Engineer (SRE/DevOps)

We believe that we are better together, and at Tripadvisor we welcome you for who you are. Our workplace is for everyone, as is our people powered platform. At Tripadvisor, we want you to bring your unique perspective and experiences, so we can collectively revolutionize travel and together find the good out there.

 

 

 

Tripadvisor captured the online travel market 20 years ago as a Boston-based startup before an online travel market existed. The fact that we still dominate the industry proves that we know how to operate a fast-moving technology company and hire the right people who allow us to maintain that lead throughout the many advancements in technology. As we enter the era of Large Language Models and mobile-based internet everywhere, we are poised to innovate again. As a Tripadvisor Engineer, you will work with some of the best and brightest minds that technology offers and learn best practices and engineering methodologies that will empower you for the rest of your career.

 

 

The Site Operations team at Tripadvisor maintains and enhances the core systems that power and support the Tripadvisor.com website. This includes systems in private data centers and over a hundred accounts in AWS. Our scope of responsibilities is vast, and listing them here would take an entire page. Suffice it to say that we are the go-to team for questions about the interface boundaries between these two halves of the company and the deep inner workings of our infrastructure.

 

As a Site Operations Engineer on the SiteOps team, you will be a force multiplier for our engineering and operations teams, delivering tooling & infrastructure that not only has a direct impact on day-to-day operations but also helps contribute to the future evolution of infrastructure and engineering here at Tripadvisor. You'll be part of a dynamic team responsible for ensuring our services' high availability, reliability, and scalability. We seek passionate engineers with experience in Python, Java, Ansible, PostgreSQL, CentOS, and Alma Linux to help us optimize and automate our infrastructure and deployment processes. We are currently involved in several types of systems migrations, within both the scope of on-prem to AWS/cloud-native migrations and on-prem data centers to alternate on-prem data center migrations. As a SiteOps Engineer, you will be involved in designing and implementing how we perform those migrations, testing them, and then performing them with a “no surprises in production” mindset.

 

What You'll Do:

  • Infrastructure Automation: Design, implement, and maintain automated infrastructure provisioning and configuration management using tools like Ansible to ensure consistency and scalability.
  • Monitoring and Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability in environments like on-prem Prometheus/Thanos, Grafana Cloud, and Grafana Cloud Loki.
  • Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, and disaster recovery strategies.
  • Collaboration: Work closely with cross-functional teams, including developers and system administrators, to improve the overall development and deployment processes.
  • Troubleshooting and Incident Management: Assist in identifying and resolving operational issues and participate in on-call rotations.

 

Skills and Experience:

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • Proven experience as a DevOps Engineer or similar role, focusing on building and maintaining scalable infrastructures.
  • Strong proficiency in Python for scripting and automation tasks.
  • Expertise in configuration management such as Ansible or Puppet.
  • Solid understanding of PostgreSQL and experience in managing PostgreSQL databases.
  • Hands-on experience with CI/CD tools like Jenkins, GitLab CI, and GitHub Actions.
  • Knowledge of containerization technologies like Docker and container orchestration tools like Kubernetes is a plus.
  • Understanding of networking concepts such as load balancing and DNS.
  • Strong problem-solving skills and the ability to work in a fast-paced, agile environment.

 

 

If you need a reasonable accommodation or support during the application or the recruiting process due to a medical condition or disability, please reach out to your individual recruiter or send an email to AccessibleRecruiting@Tripadvisor.com and let us know the nature of your request . Please include the job requisition number in your message.

 

 

 

 

#LI-AMCVAY

#LI-Remote

#LI-Hybrid

See more jobs at Tripadvisor

Apply for this job

+30d

Senior Observability Engineer (DevOps)

Live PersonHyderabad, Telangana, India (Remote)
DevOPSBachelor's degreeterraformDesignansibleazuredockerkubernetespythonAWS

Live Person is hiring a Remote Senior Observability Engineer (DevOps)

LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.  

At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success, nd reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about.

Overview:

The Observability Platform team is building a state of the art system for logging, motoring, and tracing across cloud and on-prem data centers. We’re looking for an experienced Senior DevOps engineer to lead our Logging and Monitoring, ensuring robust, scalable solutions within our Google Cloud Platform. In this role, you will be helping to bring systems to life that give superpowers to an entire organization of software developers.

You will:

  • Lead the planning, execution, and manage our observability infrastructure, which processes trillions of observability events (logs, traces, metrics) daily.
  • Create and manage monitoring, logging, and alerting systems utilizing various technologies such as GrafanaLab, CaptainHook, Zabbix, fluentd, filebeat, ELK, Kafka, Prometheus, OpenTelemetry, and other related tools.
  • Design and develop parts of a highly scalable software observability platform which manages trillions of observability events (logs, traces, metrics) per day.
  • Develop and maintain Kubernetes Helm charts that deploy hundreds of pods across nodes every day.
  • Collaborate closely with DevOps teams in delivering cloud solutions aligned with our observability platform.
  • Ensure high availability and performance of observability platforms and tools.
  • Design and develop end-to-end Synthetic Tests Monitoring solutions on GCP. with self-service capabilities for engineering teams.
  • Participate in on-call rotations.

You have:

  • Bachelor's degree in Computer Science, Engineering, or related work experience.
  • 5+ years as DevOps Engineer (or equal role) with a passion for technology and strong motivation and responsibility for high reliability and service level
  • Proficient in Kubernetes and containerization technologies (Docker, etc.)
  • Extensive experience with observability tools such as GrafanaLab, CaptainHook, Zabbix, Fluentd, ELK, Kafka, and Prometheus.
  • Familiarity with infrastructure as code (IaC) tools like Terraform, Ansible, or CloudFormation.
  • Experience with cloud platforms (AWS, Azure, GCP) and their services related to computing, storage, and networking - preferred GCP.
  • Strong programming skills in one or more languages (Bash, Python, Go, etc.).
  • The ideal candidate will have experience with OpenTelemetry Collector and Grafana Agent.

Benefits:

  • Health: Medical, Dental and Vision
  • Time away: Vacation and Holidays
  • Development: Generous tuition reimbursement and access to internal professional development resources.
  • Equal opportunity employer
  • #LI-Remote

Why you’ll love working here:

As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace. 

Belonging at LivePerson: 

We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

Apply for this job

+30d

Principal DevOps Engineer

Live PersonBulgariia-Remote
DevOPSBachelor's degreeterraformDesignansibleazuredockerkubernetespythonAWS

Live Person is hiring a Remote Principal DevOps Engineer

 LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.  

At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success and reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about. 

 Overview:

The Observability Platform team is building a state of the art system for logging, motoring, and tracing across cloud and on-prem data centers. We’re looking for an experienced Principal DevOps Lead to head our Logging and Monitoring, ensuring robust, scalable solutions within our Google Cloud Platform. In this role, you will be helping to bring systems to life that give superpowers to an entire organization of software developers. 

You will: 

  • Lead the design, implementation, and maintenance of our observability infrastructure.
    which manages trillions of observability events (logs, traces, metrics) per day.
  • Develop and manage monitoring, logging, and alerting systems using GrafanaLab, CaptainHook, Zabbix, fluentd, filebeat ,ELK, Kafka, Prometheus, OpenTelemetry and related technologies.
  • Provide strategic direction and leadership to the Logging and Monitoring team
  • Design and develop parts of a highly scalable software observability platform which manages trillions of observability events (logs, traces, metrics) per day.
  • Develop and maintain Kubernetes Helm charts that deploy hundreds of pods across nodes every day.
  • Collaborate closely with DevOps teams in delivering cloud solutions aligned with our observability platform.
  • Ensure high availability and performance of observability platforms and tools.
  • Partner with vendors, and stay up to date with emerging technologies around observability, improving your own skills and those of the team around you.
  • Design and develop end-to-end Synthetic Tests Monitoring solutions on GCP. with self-service capabilities for engineering teams.
  • Located in Bulgaria

You have:

  • Bachelor's degree in Computer Science, Engineering, or related work experience.
  • 6+ years as DevOps Engineer (or equal role) with a passion for technology and strong motivation and responsibility for high reliability and service level
  • Proficient in Kubernetes and containerization technologies (Docker, etc.)
  • Extensive experience with observability tools such as GrafanaLab, CaptainHook, Zabbix, Fluentd, ELK, Kafka, and Prometheus.
  • Familiarity with infrastructure as code (IaC) tools like Terraform, Ansible, or CloudFormation.
  • Experience with cloud platforms (AWS, Azure, GCP) and their services related to computing, storage, and networking - preferred GCP.
  • Strong programming skills in one or more languages (Bash, Python, Go, etc.).
  • The ideal candidate will have experience with OpenTelemetry Collector and Grafana Agent.

Benefits: 

  • Health: medical, dental, and vision

Why you’ll love working here: 

As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace. 

Belonging at LivePerson:

We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.



 

Apply for this job

+30d

Ingénieur DevOps H/F

DevoteamLyon, France, Remote
DevOPSansibledockerkubernetes

Devoteam is hiring a Remote Ingénieur DevOps H/F

Description du poste

Nous recherchons un consultant DevOps H/F, prêt à s'investir sur des projets innovants et challengeants. 

Voici un aperçu de tes missions :

  • Participer au déploiement des applications par l'implémentation de solutions d'infrastructures,
  • Mettre en œuvre et participer à la maintenance des outils et solutions d'infrastructures nécessaires à l'intégration continue,
  • Réaliser des tests continus et mettre en place des solutions automatisées,
  • Contribuer à notre communauté technique DevOps.

Pour réussir ce premier challenge, nous te proposerons des actions de formations, des parrainages, des certifications et un dispositif d’évaluation personnel régulier.

Que tu sois administrateur, intégrateur ou développeur, nous saurons t'accompagner et te faire progresser ! Nous t'apporterons notre culture Ops et notre expérience de facilitateur technique. 

Tu possèdes une bonne culture DevOps et tu adhères pleinement à cette philosophie.

Tu pratiques le scripting python. Tu connais Ansible, Openshift, Gitlab, Docker, Kubernetes, Git...

Alors n’hésites plus et rejoins la communauté des Digital Transformakers !

Qualifications

Diplômé(e) d'une école d'ingénieurs ou d'un Master 2 en informatique, tu es reconnu(e) pour ton caractère méthodique et ta capacité à être force de proposition. Tu as un réel sens de la communication and you are fluent in english ! 

Si t'investir dans des projets challengeants et gagner rapidement en responsabilités correspond à tes ambitions, alors contacte-nous !

See more jobs at Devoteam

Apply for this job

+30d

Ingénieur DevOps Kubernetes H/F

DevoteamLyon, France, Remote
DevOPSDesignansiblegitdockerkuberneteslinuxjenkinspython

Devoteam is hiring a Remote Ingénieur DevOps Kubernetes H/F

Description du poste

La tribu DevOps intervient sur des missions variées comme la conception-implémentation des plateformes Cloud et Kubernetes / Openshift, des pipelines et process CI/CD, et des nouveaux modèles opérationnels / pratiques.

Vous rejoignez la communauté Plateforme DevOps en tant que Consultant DevOps & Ingénieur Kubernetes. Vos principales contributions chez nos clients :

  • Participer au design d'architecture des plateformes Kubernetes ;
  • Implémenter, administrer et assurer le bon fonctionnement et la performance des clusters Kubernetes ;
  • Intégrer la plateforme de l'éco-système technique de nos clients (Cloud Native et/ou legacy) ;
  • Participer aux évolutions des processus de livraison et de déploiement continu ;
  • Collaborer au travers des démarches agiles avec les Tech Leads, les équipes opérationnelles, d'architecture, de sécurité et de développement ;
  • Partager et développer vos connaissances (veille, pair programming, rayonnement par des conférences, partage des compétences, etc.).

Vous êtes amené(e) à travailler dans des environnements techniques regroupant les technologies suivantes : Docker, Kubernetes, Openshift, Ansible, Jenkins, Sysdig, HELM, Artifactory, Nexus, ELK, Grafana, Prometheus, GIT, etc.

Qualifications

Issu(e) d'un Bac +5 ou d'une formation d'ingénieur, avec a minima 2 ans d’expérience professionnelle post diplôme :

  • vous implémentez et/ou administrez des clusters Kubernetes ou Docker ;
  • vous maîtrisez Linux ;
  • vous scriptez  régulièrement (python, go...) ;
  • vous provisionnez les composants par Infra as Code.

Vous avez une bonne compréhension :

  • des architectures distribuées,
  • du fonctionnement de pipeline CI/CD,
  • l’observabilité et des outillages associés (ex. ELK, prometheus, etc.),
  • la gestion de configuration.

Vous êtes désireux(se) de vous investir dans des projets challengeants et gagner rapidement en compétences et en responsabilités ?
Alors n'hésitez plus !

See more jobs at Devoteam

Apply for this job

To keep the results as relevant as possible we have omitted results past 200. If you would like to find older jobs, please repeat your search query with additional keywords to reduce the number of matches.