ansible Remote Jobs

216 Results

+30d

Senior Network DevOps Engineer

Live PersonHyderabad, Telangana, India (Remote)
DevOPSagileterraformDesignansibleazureapigitkuberneteslinuxjenkinspythonAWS

Live Person is hiring a Remote Senior Network DevOps Engineer

LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.  

At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success, nd reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about.

Overview:

Our global NetDevOps team is growing rapidly, requiring engineers to collaborate across US, EMEA, and APAC regions to support our datacenter and cloud environments.  This team focuses on the stability and reliability of our global infrastructure leveraging existing standards, processes, and automation solutions.  The NetDevOps Engineer will serve as a domain expert in networking technologies and the supporting both datacenter and cloud infrastructure.  

You will:

  • Design, deploy, and manage Kubernetes clusters on the cloud (e.g., GCP) and on-prem to support containerized applications.
  • Implement best practices for monitoring, logging, and troubleshooting within Kubernetes.
  • Collaborate with the cloud team to provision, configure, and maintain cloud resources on GCP, ensuring optimal performance and cost efficiency.
  • Implement automation for resource provisioning and scaling using tools like Terraform and Helm.

Skills:

  • Strong working knowledge in configuring and troubleshooting routing protocols (BGP, OSPF, and static). 
  • Extensive experience with data center and cloud based networking technologies and infrastructure (LAN, WAN, firewall, SDWAN, BGP, DNS, load balancing, VPN, etc)
  • Experience with Arista and Cisco configurations and maintenance.
  • Deep understanding of network protocols and services. 
  • Extensive experience in linux environments and enterprise distros
  • Experience with software development and strong scripting skills.
  • Experience with Palo Alto firewall configurations and maintenance.
  • Experience with F5 LTM and AFM configurations and maintenance.
  • Experience with networking and securing kubernetes with Calico.
  • Experience with cloud technologies and IaC deployments. 
  • Experience with GCP, AWS, Azure cloud environments.  (Certifications preferred)
  • Experience with virtual and containerized deployments in both data center and cloud. 
  • Experience with Kubernetes and GKE deployments and networking elements. (CNI, Itsio, Calico)
  • Experience with CI/CD pipeline components, support, functionality, and tools.
  • Experience with version control concepts and operations. (Git) 
  • Experience with data formats XML, JSON, YAML and parsing with Python data structures.
  • Experience working within an Agile development environment
  • Experience with webhooks, API styles, HTTP Response codes, and authentication mechanisms.
  • Experience with Ansible deployments and creating ansible playbooks
  • Experience with Jenkins and parameterization. 
  • Use of automation tools and modules (Rundeck/Puppet/Terraform)
  • Experience with Network Automation and Programmability Abstraction Layer with Multivendor (NAPALM) framework
  • Leverage model driven programmability within an Arista networking environment.
  • Experience with cloud infrastructure such as Compute, Network, Storage and Backup
  • Understand the need to organize code into methods, functions, classes, and modules
  • Experience with monitoring performance metrics and KPIs.

Additional requirements:

  • Collect feedback and requirements from design and technical staff
  • Create diagrams, business cases, and architectural designs documents.
  • Support on-call and weekend rotation as needed
  • Collaborate with cross functional teams.
  • Able to handle stressful situations with a level headed approach
  • Excellent verbal and writing skills (English)
  • Oncall and shift rotation (primarily between US and APAC hours)

Benefits:

  • Health: medical, dental, and vision
  • Time away: vacation and holidays
  • Development: Generous tuition reimbursement and access to internal professional development resources.
  • Equal opportunity employer
  • #LI-Remote

Why you’ll love working here:

As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace. 

Belonging at LivePerson: 

We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

Apply for this job

+30d

Systems Reliability Engineer (SRE) - Edge

CloudflareHybrid or Remote
sqlDesignansibledockerpostgresqllinuxpython

Cloudflare is hiring a Remote Systems Reliability Engineer (SRE) - Edge

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

Available Locations:Lisbon or Remote Portugal; London or Remote UK, Munich or Remote Germany

About the Role

We are looking for talented Systems Reliability Engineers to build and operate our Edge platform running in more than 320 cities in over 120 countries. Our SREs come from diverse technical backgrounds and have built up their knowledge working in different environments, but common factors across all of our reliability-focused engineers include a passion for automation, scalability, and operational excellence. We support our services in a “follow the sun” model with offices in East Asia, Europe and North America.

This is a superb opportunity to join a high-performing team and scale our high-growth network as Cloudflare’s business grows. We live at the boundary between systems, network, and software, and love improving the glue that holds them together. Working with us, you will build tools to constantly improve service availability, performance, and operational velocity. You will nurture a passion for an “automate everything” approach that makes systems failure resistant and ready to scale.

SREs focus on the immediate state and functionality of the Cloudflare platform around the world, leveraging an array of monitoring, alerting and diagnostics tools while developing and enhancing the Cloudflare platform and its capabilities. We own a wide portfolio of applications and services, running a tight feedback loop of developer and operator patterns. The ideal SRE candidate has a passionate curiosity about how the Internet fundamentally works and has a strong knowledge of networking, Linux and TLS along with coding ability in Go or Python.

Requisite Skills

  • Aptitude for identifying problems, owning them and working with others to solve them
  • Linux systems experience
  • 3 years experience in an SRE role or a role with similar functions
  • Software development skills in some programming language such as Go or Python
  • Understanding of distributed software systems and large scale system design tradeoffs
  • Intermediate experience of common network protocols like DNS and HTTP
  • Understanding of routing protocols and concepts such as BGP and IP anycast 

Examples of desirable skills, knowledge and experience

  • Experience with the Linux kernel and Linux software packaging
  • Performance analysis and debugging
  • Configuration management systems such as Saltstack, Chef, Puppet or Ansible
  • Load balancing and reverse proxies such as Nginx, Varnish, HAProxy, Squid or Apache
  • SQL databases
  • Time series databases such as OpenTSDB, Graphite, Prometheus or Grafana
  • Key/Value stores

Bonus Points

  • Experience with continuous / rapid release engineering
  • Strong tooling and automation development experience
  • Experience working in a 24/7/365 service environment
  • Experience working with large scale production distributed systems
  • A history of contributing to Open Source Software

Some tools that we use

  • Nginx
  • PostgreSQL
  • Docker
  • Prometheus
  • Grafana
  • Consul
  • Nomad
  • Salt

 

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer.  We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness.  All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

See more jobs at Cloudflare

Apply for this job

+30d

Linux VmWare Operations Engineer

oracleansibleUXlinux

Information International Associates, Inc. is hiring a Remote Linux VmWare Operations Engineer

Job Description

Senior Linux VMWare Operations Engineer

KeyLogic is currently recruiting for a Senior Linux VMWare Operations Engineer and Deputy Team Lead to support our Federal Client in Alexandria, VA with a hybrid telework arrangement.

Description:

The Senior Linux VMWare Operations Engineer will support the client’s Infrastructure Services Division’s (ISD) Operating Systems Operations Section (OSOS) by performing administration and maintenance activities on over 8000 RHEL/HPUX/AIX servers in use.   Additionally, the candidate will have a secondary role as the Deputy Team Lead regarding the day-to-day tasks and operations of the team. The candidate should be a self-starter and one not afraid to undertake and lead a project from beginning to end accompanied with broad technical exposure is the ideal candidate.  The selected candidate will work the daytime shift Monday through Friday.

 

The servers supported are located in both production and lab data centers located at the client’s campus in Alexandria, VA. Additional remote support is provided for systems located at the Federal Client’s Alternate Processing Site (APS) located in Manassas, VA.

 

Duties performed will include, but are not limited to, the following:

 

·        Provide escalation support from subordinates and junior resources, including on-call rotation support.

·        Ensure proper planning and execution of major projects and O&M (operations and maintenance) activities.

·        Mentor and direct junior staff in the course of daily assignments and projects to promote a collaborative learning environment

·        Troubleshoot hardware, Operating System, and software problems with Linux and VMWare servers.

·        Develop and maintain installation and configuration procedures for server builds, configurations, and scheduled maintenance activities.

·        Provide suggestions and best practices for various activities and communicate them to stakeholders at various technical levels

·        Install and configure ESXi hypervisors, vCenter servers, create data centers, clusters, add hosts, and configure their firewall, services and advanced settings.

·        Setup and configure virtual switches, port groups, and VLANs in VMWare

·        Configure HA, DRS and setup affinity rules on each VMWare cluster as needed.

·        Perform storage expansion, migration, and reclamations on block storage from SAN, familiarity with boot-from-SAN on RHEL is preferred.

·        Perform cyber-security remediation and server hardening as needed.

·        Write custom shell scripts to poll server inventory for health and configuration data within the environment.

·        Work on assigned change requests/incident tickets.

·        Investigate and responding to alerts generated by the various monitoring systems in use at USPTO.

·        Evaluate and implementing potential new tools and technologies.

·        Provide vCenter permissions to users as well as creating folders, placing VMs under folders to provide access rights to certain users\groups to be able to manage their VMs.

·        Monitor, maintain, and troubleshoot all issues that might arise in the VMware virtual environment.

·        Install and troubleshoot physical and virtual server’s performance and connectivity issues.

·        Patch and update hypervisor’s baseline using Update Manager.

·        Perform Cisco UCS service profile migration.

·        Maintain documentation for processes and procedures as required.

·        Perform detailed analysis of incidents - utilize log management tools and performance data to author and submit RCA (Root Cause Analysis) reports following service outages

·        Perform hardware repair procedures and activities.

 

Work Experience/Skills Requirements

 

The successful candidate will have experience in the following areas:

·        7+ years of experience with Red Hat Enterprise Linux/CentOS

·        7+ years of experience with VMWare vCenter and ESXi hypervisors

·        5+ years of experience supporting Tomcat, Apache, JBoss, Oracle, and MySQL.

·        Knowledge of Red Hat Virtualization (RHV) or oVirt

·        Knowledge of Cisco UCS and managing systems via UCSCentral

·        Ability to write shell scripts in bash.

·        Ability to write custom Ansible playbooks and run them across the environment 

·        RHCE Certification is highly recommended

·        VCP Certification is highly recommended

·        Experience in supervision of teams and personnel

 

The ideal candidate also has experience in the following areas:

·        Rocky Linux (or other Open Source Enterprise Linux)

·        Red Hat Satellite 6 or Katello

·        Red Hat Identity Management (IDM) or FreeIPA

·        Foreman

·        Puppet

·        Powershell/PowerCLI

·        HP-UX

·        IBM AIX

·        Windows Server

 

A Bachelor’s Degree is strongly preferred. 

 

Clearance Requirements:

Must be a U.S. Citizen and able to hold a security clearance. You do not need a current/active clearance to apply, but must be able to pass a government Public Trust (SF-85) background investigation.

We are proud to be an EEO/AA employer M/F/D/V. We maintain a drug-free workplace and perform pre-employment substance abuse testing.

Qualifications

Work Experience/Skills Requirements

 

The successful candidate will have experience in the following areas:

·        7+ years of experience with Red Hat Enterprise Linux/CentOS

·        7+ years of experience with VMWare vCenter and ESXi hypervisors

·        5+ years of experience supporting Tomcat, Apache, JBoss, Oracle, and MySQL.

·        Knowledge of Red Hat Virtualization (RHV) or oVirt

·        Knowledge of Cisco UCS and managing systems via UCSCentral

·        Ability to write shell scripts in bash.

·        Ability to write custom Ansible playbooks and run them across the environment 

·        RHCE Certification is highly recommended

·        VCP Certification is highly recommended

·        Experience in supervision of teams and personnel

The ideal candidate also has experience in the following areas:

·        Rocky Linux (or other Open Source Enterprise Linux)

·        Red Hat Satellite 6 or Katello

·        Red Hat Identity Management (IDM) or FreeIPA

·        Foreman

·        Puppet

·        Powershell/PowerCLI

·        HP-UX

·        IBM AIX

·        Windows Server

A Bachelor’s Degree is strongly preferred. 

 

See more jobs at Information International Associates, Inc.

Apply for this job

+30d

Windows Support Technician (End User Support Specialist II)

DevOPSagilejiraterraformDesignansibleazureUXdockerkubernetesjenkinsAWS

Information International Associates, Inc. is hiring a Remote Windows Support Technician (End User Support Specialist II)

Job Description

Platform Automation Support Specialist 

Job Description or Summary:

KeyLogic is seeking Platform Services Automation Specialist with strong systems, software, and Agile experience to support our program at the USPTO. 

Job Duties:

As a DevOps Platform Engineer, you will be working closely with our Automation teams to develop USPTO's Platform Services environment. This is a Full-Time position and work location will be at the KeyLogic's office in Alexandria, VA.

Job Requirements or Skills Required:

Engineer and deploy hybrid-cloud solutions for enterprise environments by leveraging Configuration management tools such as Puppet and IaC tools such as Ansible and Terraform

Design and implementation of automated infrastructure in on-prem and cloud environments.

Design and implementation of CI/CD, testing and operations infrastructure on-premise and in cloud

Required Skills:

5+ years of hands-on experience in Linux/Unix, HP-UX, and Windows server administration

3+ years of hands-on experience developing Puppet modules for platform products.

2+ years of hands-on experience with DevOps using Terraform, Ansible, etc.

2+ years of hands-on experience working with containers, and container orchestration technologies such as Kubernetes, Docker, etc.

2+ years of hands-on experience with CI/CD tools, such as GitHub, Jenkins, Jenkins Pipeline, Maven, and Nexus

At least entry-level certification in at least one of the three major CSPs (AWS, Google, or Azure)

Experience with automation/orchestration platforms and tools such as Red Hat OpenShift, Red Hat CloudForms, Puppet, Chef and Ansible

Experience working with defining, configuring, and building CI/CD pipelines using Jenkins, GitHub actions and other automation techniques.

Experience working within an Agile Environment and working with Agile tools such as JIRA and Rally

Excellent written and verbal communication skills

Education Requirements:

Bachelor’s in computer science or related field

Qualifications

    Required Skills:

    5+ years of hands-on experience in Linux/Unix, HP-UX, and Windows server administration

    3+ years of hands-on experience developing Puppet modules for platform products.

    2+ years of hands-on experience with DevOps using Terraform, Ansible, etc.

    2+ years of hands-on experience working with containers, and container orchestration technologies such as Kubernetes, Docker, etc.

    2+ years of hands-on experience with CI/CD tools, such as GitHub, Jenkins, Jenkins Pipeline, Maven, and Nexus

    At least entry-level certification in at least one of the three major CSPs (AWS, Google, or Azure)

    Experience with automation/orchestration platforms and tools such as Red Hat OpenShift, Red Hat CloudForms, Puppet, Chef and Ansible

    Experience working with defining, configuring, and building CI/CD pipelines using Jenkins, GitHub actions and other automation techniques.

    Experience working within an Agile Environment and working with Agile tools such as JIRA and Rally

    Excellent written and verbal communication skills

    Education Requirements:

    Bachelor’s in computer science or related field

     

    See more jobs at Information International Associates, Inc.

    Apply for this job

    +30d

    Site Reliability Engineer (SRE/ DevOps) - Engineering Productivity

    AristaPoland-Remote, Poland, Remote
    DevOPSagileCommercial experienceDesignansiblec++dockerelasticsearchpostgresqlMySQLkuberneteslinuxjenkinspython

    Arista is hiring a Remote Site Reliability Engineer (SRE/ DevOps) - Engineering Productivity

    Job Description

    Who You'll Work With

    Arista Networks is looking for a skilled professional for our Engineering Productivity team to help maintain and support our rapidly expanding infrastructure and internal user base. The ideal candidate is someone who can wear many hats, can be versatile and is enthusiastic about learning new technologies.

    As a part of the software engineering team, you will work with other team members to design, build and administer secure, scalable and fault-tolerant tools and infrastructure in a hybrid cloud environment.

    What You'll Do

    • Building, integrating and maintaining tools and infrastructure facilitating internal development and testing.
    • Improve maintainability of build system
    • Evaluate new tools
    • Improve speed of information back to the development team within the build systems and processes
    • Troubleshoot and resolve systems and network issues.
    • Adherence to infrastructure-as-code principles.
    • Proactively ensure the highest levels of systems and infrastructure availability.
    • Participate in the design and implementation of new systems and infrastructure projects.

    Qualifications

    Essential Skills

    • Minimum 4+ years commercial experience in this space as a DevOps / SRE Engineer
    • Solid experience with Jenkins and GitHub, ideally with a background/understanding of the Atlassian stack of products (Confluence/Jira/Bamboo/Bitbucket)
    • UNIX / Linux systems administration (preferably RedHat/CentOS).
    • Scripting with Python or Bash or experience at least one high level language such as Go, C++, etc.. 
    • Experience with containerization and container orchestration (e.g. Docker, Kubernetes).
    • Experience with (CI/CD) orchestration and software configuration management tools (e.g. Ansible, Puppet, Salt, Chef).
    • Ability to work in a fast paced and agile development environment.
    • Excellent communication and documentation skills.
    • Working knowledge/experience with Makefile/make

    Desired Skills

    • BS/MS degree in Computer Science or a relevant experience subject.
    • Experience with monitoring systems (e.g. Zabbix, Nagios, Prometheus, DataDog).
    • Experience with relational databases (e.g. MySQL, PostgreSQL)
    • Experience with virtualization technologies (e.g. VMware, XenServer, RHEV, QEMU/KVM).
    • Experience with any of the following: Elasticsearch, InfluxDB, Grafana, Artifactory.
    • Exposure to FPGA build projects
    • Exposure or experience with Vivado (Xilinx)

    #LI-SZ1

    Apply for this job

    +30d

    Advanced Services Engineer

    AristaLondon, United Kingdom, Remote
    SalesDesignansibleopenstacklinuxpython

    Arista is hiring a Remote Advanced Services Engineer

    Job Description

    Who You'll Work With

    Arista seeks an Advanced Services Engineer to provide advanced post-sales support, guidance, and assistance to account teams to address specific customer needs. In this position, you will be working as a technology expert in the Routing & Switching space to design, implement, and support (troubleshoot) our deployments within a number of customer infrastructures. The ideal candidate will also have a level of comfort communicating across all functions within Arista, as well as with clients and partners.

    What You'll Do

    • You will provide advanced post-sales engineering support for Arista's Open Networking Data Center and Campus networking deployments for our enterprise and commercial customers.
    • Review customer network designs for an EVPN, VxLAN, leaf-spine architecture and make recommendations for deployment
    • Migrate or interconnect to/from Cisco, Juniper, and other vendors to Arista infrastructure
    • Assist with configuration build-outs including creating network provisioning automation using Python and tools such as Chef or Ansible
    • Assist with implementation and change controls
    • You will assist with proof of concepts (POC) and in-depth testing to validate design scenario
    • Provide bug scrubs and code recommendations
    • Provide interface to TAC and internal development teams and the customer
    • You will provide customer advice regarding architectural questions, product prerequisites, product features, etc.
    • Translate complex business requirements into Leaf-Spine Network solutions
    • Assist Pre-Sales Engineer and Account Executives with designing Network solutions
    • Establish and maintaining strong relationships with key partners
    •  Attend key partner events, training sessions, and provide ongoing training with the customer teams globally
    • Continue training to maintain expertise
    • Ability to understand the client’s business objectives and technical needs
    • Ability to meet Service Level Agreements (SLAs) for sales and clients
    • Regularly exercises discretion and independent judgment
    • Maintain professional relationships with teammates, partners, and clients
    • Some travel may be required within assigned territory

    Qualifications

     

    • Bachelor’s degree in Computer Science or equivalent
    • Network Industry Certification preferred ACE (Arista Cloud Engineer or equivalent CCIE (R&S), JNCIE)
    • 5+ years’ working experience with network technologies including network design and deployments of Campus and Data Center networks. Knowledge of leaf-spine architectures highly desired. 
    • 5+ years’ minimum experience with Cisco-based technologies focusing on infrastructure and voice
    • Demonstrated experience in technical post-sales, as either a Network Consulting Engineer or as an Advanced Systems (AS) Engineer preferred
    • Experience with Arista/Juniper/Cisco enterprise routing/switching within large data center enterprise customers (Catalyst, Nexus, ASR)
    • Expert knowledge in the following areas: Ethernet, VLANs, VxLAN, EVPN, IP Routing, TCP/IP, OSPF, BGP, eBGP, Multicast, QoS
    • Expertise in at least one area of Data Center related technologies - Openstack, SDN, NFV, Load Balancers, Virtualization, Linux tools
    • Expert level knowledge of industry-standard CLI
    • Ability to write white papers a plus
    • Background in Perl, Python, Scripting for creating network automation is highly desired
    • Excellent customer service and verbal communication skills
    • Excellent written skills and the ability to do related documentation and ticket tracking of opportunities/meeting follow-up
    • Fluency in written and spoken English 

    Apply for this job

    +30d

    Cloud Operations Team Lead

    Shift TechnologyCanada - Remote
    PrismaFull TimeDevOPS1 year of experiencejiraterraformansibleazuregitubuntulinuxjenkinspythonAWS

    Shift Technology is hiring a Remote Cloud Operations Team Lead

    The future of insurance starts with AI. To date, Shift Technology's AI-powered products have benefitted more than 300 million policyholders globally by reducing underwriting risk, identifying more fraud, and automating critical tasks throughout the claims process.  Shift harnesses the power of AI to enable the world’s leading insurance organizations to make better decisions. Our products help insurers improve operational efficiency, reduce costs, and deliver superior customer experiences to their policyholders.  Our culture is built on innovation, trust, and a drive to transform the insurance industry by imagining and innovating solutions that impact insurers and their customers - like you! We come from more than 50 different countries and cultures and together we are creating the future of insurance.

    As a member of Shift Technology's Infrastructure team, your role as a Cloud Operations Team Leader:

    Responsibilities:

    • Manage a team (2) of Cloud Operations specialists in US
    • Will be tasked with serving as the primary point of technical escalation contact
    • You will be responsible for being the point of contact for any operational escalations within the organisation.
    • Ensure that the Incident management process is running as expected, and that the operations team is handling incidents in a timely and efficient manner.
    • Operations team will be responsible for the Incident management process, so need to ensure the process is running as expected.
    • Manage support tickets (changes, requests, incidents, etc.) and escalate to the appropriate resolution level.
    • Monitor alerts and follow their evolution, escalate as needed.
    • Manage cloud infrastructure (Azure, AWS, and OVH) and take care of the infrastructure backup and the backup checks.
    • Maintain Linux and Windows systems, network, and security software/equipment.
    • Apply security patches to the entire IT infrastructure.
    • Deploy new client projects and infrastructure based on established requirements.
    • Manage day-to-day infrastructure work and ensure that desktop computers are compliant with security policies.
    • Cultivate great co-worker and client relationships.
    • Available to work during weekends based on the team’s rotation schedule. (1 in 3 weekends)

    Technical Abilities:

    • Knowledge and experience working with cloud computing - e.g. Azure or AWS or GCP (Required)
    • Networking and firewall expertise - VLANs, Zone based firewalling, IPSec VPN, SSL VPN, URL filtering, IDPS (Required)
    • Proficiency in Windows, Office, and Active Directory is required
    • Infrastructure security experience - Patch and vulnerability management
    • Backup knowledge and experience
    • Experience with Infrastructure-as-Code (IaC) tools, such as Terraform or CloudFormation or ARM, for deploying and managing cloud resources. (good to have)
    • Understanding of cloud cost management and optimization techniques, including resource tagging, reserved instances, and usage analytics. (good to have)
    • Familiarity with monitoring and logging solutions, such as Grafana (Required)
    • Experience with Jira ticketing system and Confluence (good to have)
    • Familiarity with DevOps methodologies and tools, such as Git, Jenkins and Ansible, for automating software delivery and infrastructure management. (good to have)
    • Knowledge of compliance standards and regulations, such as GDPR, HIPAA, and SOC 2, and experience implementing controls to meet these requirements. (good to have)

    Soft Skills:

    • At least 1 year of experience as a lead is preferred
    • Autonomous, dynamic, curious, and eager to learn, always looking to expand your fields of expertise.
    • Proactive and take pride and ownership of your work.
    • Ability to work under pressure and still deliver excellent service to our customers.
    • Maintain a high level of confidentiality, professionalism, and a courteous demeanour when working with clients and internal teams.
    • Ability to adapt your work to changing priorities as needed.

    Tools:

    • Microsoft Azure AD, Intune and Autopilot, Office 365, and G Suite.
    • Windows Server, Linux (Centos and Ubuntu), MacOS.
    • Microsoft Azure and AWS cloud native services.
    • VMWare Data Centres.
    • Palo Alto Firewalls, Palo Alto Prisma, Cisco WiFi, Cisco Switches.
    • Automation driven - IaC (Terraform), Ansible, Python, Github. 
    • Thycotic
    • VMWare
    • Veeam backups
    • Atlassian products - Jira, Opsgenie, Confluence

    #LI-REMOTE  #LI-ONSITE  #LI-HYBR

    To support our permanent, full time employees at every stage of their careers and lives, we provide a competitive total rewards and benefits package. Here are the global benefits we’d like to highlight:

    • Flexible remote and hybrid working options
    • Competitive Salary and a variable component tied to personal and company performance
    • Company equity
    • Focus Fridays, a half-day each month to focus on learning and personal growth
    • Generous PTO and paid holidays
    • Mental health benefits 
    • 2 MAD Days per year (Make A Difference Days for paid volunteering)

    Additional benefits may be offered by country - ask your recruiter for more information. Intern and Apprentice position are eligible for some of these benefits - ask your recruiter for more details.

    At Shift we strive to be a diverse and inclusive workforce. We welcome applications from and hire people who will contribute to the diversity of our company, without regard to race, color, religion, marital status, age, national or ethnic origin, physical or mental disability, medical condition, pregnancy, genetic information, gender identity or expression, sexual orientation, or other non-merit criteria.

    Shift Technology is committed to providing reasonable accommodations for qualified individuals with disabilities in our application and employment process. Should you require accommodation, please email accommodation@shift-technology.com and we will work with you to meet your accessibility needs.

    Please be aware of scammers and only trust correspondence that comes from emails ending in shift-technology.com

    Shift Technology does not accept unsolicited CVs from recruiters or employment agencies in response to the Shift Technology Careers page or a Shift Technology social media post. Any unsolicited CVs, including those submitted directly to hiring managers, are deemed to be the property of Shift Technology.

    See more jobs at Shift Technology

    Apply for this job

    +30d

    Principal Software Engineer (SRE/DevOps) - Remote

    InvisibleTechnologiesSan Francisco, CA, Remote
    DevOPSterraformansibleuikubernetesAWS

    InvisibleTechnologies is hiring a Remote Principal Software Engineer (SRE/DevOps) - Remote

    Job Description

    Principal engineers at Invisible are able to follow multiple paths. Some of our Principal engineers are technical leads of teams and are responsible for people management of those teams. They oversee the technical vision for their area and ensure that there is proper mentorship

    Other principal engineers lead through technical initiatives. These engineers oversee broad multi-team technical initiatives and own parts of our software stack (ex. Principal engineers might research and roll out new technical frameworks or might develop a new generation of our UI component library.

    Qualifications

    -We know that if we have a DevOps team we aren’t practicing DevOps ???? both are listed to make it clear that we’re looking for a multi position player who’s comfortable with application engineering AND infrastructure.

    - A good candidate will have a strong understanding of cloud architecture including the major cloud providers (AWS, GCP, etc).

    - Candidates should understand underlying networking and security considerations when developing the architecture of our deployment environments.

    - Candidates should have a strong understanding of authentication and authorization frameworks such as IAM, Security Groups, RBAC, etc.

    - Candidates should have experience with Kubernetes and be able to point to deployments they have architected or managed.

    - Candidates should have a strong understanding of the operating model of Kubernetes and be able to explain the requirements for designing deployments for new applications.

    - Ideal candidates would have experience with infrastructure as code tools such as Terraform, CloudFormation, Ansible or Puppet.

    We’re always eager to learn and grow and try new technologies.

    See more jobs at InvisibleTechnologies

    Apply for this job

    +30d

    Software Engineer (mid-level)

    ThrotleRed Bank, NJ, Remote
    Full Timegolang5 years of experiencesqloracleDesignansiblemongodbhtml5apijavac++postgresqllinuxpythonAWSjavascript

    Throtle is hiring a Remote Software Engineer (mid-level)

    Benefits:
    • 401(k) matching
    • Company parties
    • Competitive salary
    • Dental insurance
    • Flexible schedule
    • Free food & snacks
    • Health insurance
    • Paid time off
    • Parental leave
    • Training & development
    • Vision insurance
    SOFTWARE ENGINEER (Hybrid Position-In Office Tuesday through Thursday)
     
    The Software Engineer will be part of the team responsible for designing, developing, and operating the applications that make Throtle’s data onboarding solution work.  The ideal candidate can work with teammates in troubleshooting problems, designing solutions, and assessing situations in real time.  Our team is empowered to keep our fast-paced, high-volume processing environment operational for our clients and partners.
      
    PRIMARY RESPONSIBILITIES
    • Create tools and solutions to manage and monitor our rapidly growing operations.
    • Be involved in real-time assessment of issues and help develop solutions
    • Build and design solutions that mitigate risk and increase efficiencies
    • Automate processes and sub-processes to enable greater scale and speed
    • Maintaining our existing code.  
    • Take part in performance & capacity monitoring and planning
    KNOWLEDGE AND SKILL REQUIREMENTS
    • At least 5 years of experience 
    • Significant proficiency in one or more of these languages- Java, Python, Golang
    • Experience with databases – PostgreSQL, Oracle, or Microsoft SQL Server.
    • Proficiency in Restful API Development 
    • Experience interacting with AWS CLI and AWS Console
    • Knowledge of software architecture, data structures, modern design patterns and network protocols 
    • Ability to identify problems, and effectively communicate solutions to peers and management
    OTHER VALUABLE SKILLS
    • Experience with data flow and queue management using tools like Kafka and Flume Experience in front end technologies including JavaScript, CSS3 and HTML5 to include libraries such as React Js and Angular. 
    • Experience in Linux SysAdmin
    • Exposure to NoSQL/Big Data: Hadoop, HBase, Cassandra, MongoDB
    • Hands on experience with a CI/CD environment
    • Experience with configuration management and automation tools like Ansible, Chef, or Puppet.
    About Throtle:
     
    Throtle is a leading identity company trusted by the world’s top brands and agencies located in Red Bank, NJ. At Throtle, we empower brands at scale with true individual-based marketing using a data-centric identity and onboarding approach.
     
    Throtle is a company that truly values its employees and their work-life balance. We offer a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being:

    • Competitive compensation.
    • Comprehensive benefits include Medical, Dental, and Vision.
    •  Life insurance.
    • Long-Term Disability
    • A generous PTO program.
    • A 401k plan supported by a company match.
    • Half Day Summer Fridays (close at 1 p.m. Memorial Day to Labor Day).
    • Early Fridays (office closes at 3 p.m.). 
    • Hybrid Schedule (Mondays and Fridays WFH)
    • The office is closed between Christmas and New Year.
    • Company-sponsored lunch at least 1x a month. 
    •  
      And much MORE!



      Throtle is an equal-opportunity employer that is committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.

    Flexible work from home options available.

    Compensation: $95,000.00 - $110,000.00 per year




    See more jobs at Throtle

    Apply for this job

    +30d

    Senior Observability/Monitoring Engineer (Grafana, Prometheus, ELK)

    Live PersonIndia (Remote)
    DevOPSBachelor's degreeterraformDesignansibleazuredockerkubernetespythonAWS

    Live Person is hiring a Remote Senior Observability/Monitoring Engineer (Grafana, Prometheus, ELK)

    LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.  

    At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success, nd reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about.

    Overview:

    The Observability Platform team is building a state of the art system for logging, motoring, and tracing across cloud and on-prem data centers. We’re looking for an experienced Senior DevOps engineer to lead our Logging and Monitoring, ensuring robust, scalable solutions within our Google Cloud Platform. In this role, you will be helping to bring systems to life that give superpowers to an entire organization of software developers.

    You will:

    • Lead the planning, execution, and manage our observability infrastructure, which processes trillions of observability events (logs, traces, metrics) daily.
    • Create and manage monitoring, logging, and alerting systems utilizing various technologies such as GrafanaLab, CaptainHook, Zabbix, fluentd, filebeat, ELK, Kafka, Prometheus, OpenTelemetry, and other related tools.
    • Design and develop parts of a highly scalable software observability platform which manages trillions of observability events (logs, traces, metrics) per day.
    • Develop and maintain Kubernetes Helm charts that deploy hundreds of pods across nodes every day.
    • Collaborate closely with DevOps teams in delivering cloud solutions aligned with our observability platform.
    • Ensure high availability and performance of observability platforms and tools.
    • Design and develop end-to-end Synthetic Tests Monitoring solutions on GCP. with self-service capabilities for engineering teams.
    • Participate in on-call rotations.

    You have:

    • Bachelor's degree in Computer Science, Engineering, or related work experience.
    • 5+ years as DevOps Engineer (or equal role) with a passion for technology and strong motivation and responsibility for high reliability and service level
    • Proficient in Kubernetes and containerization technologies (Docker, etc.)
    • Extensive experience with observability tools such as GrafanaLab, CaptainHook, Zabbix, Fluentd, ELK, Kafka, and Prometheus.
    • Familiarity with infrastructure as code (IaC) tools like Terraform, Ansible, or CloudFormation.
    • Experience with cloud platforms (AWS, Azure, GCP) and their services related to computing, storage, and networking - preferred GCP.
    • Strong programming skills in one or more languages (Bash, Python, Go, etc.).
    • The ideal candidate will have experience with OpenTelemetry Collector and Grafana Agent.

    Benefits:

    • Health: Medical, Dental and Vision
    • Time away: Vacation and Holidays
    • Development: Generous tuition reimbursement and access to internal professional development resources.
    • Equal opportunity employer
    • #LI-Remote

    Why you’ll love working here:

    As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace. 

    Belonging at LivePerson: 

    We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

    We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

    Apply for this job

    +30d

    Ingénieur DevOps

    DevoteamTunis, Tunisia, Remote
    DevOPSterraformansible

    Devoteam is hiring a Remote Ingénieur DevOps

    Description du poste

    Vos principales responsabilités en tant que Devops CI/CD Engineer

    Voici une liste non exhaustive de vos missions au quotidien, nous vous faisons confiance pour les prendre en main et les enrichir à votre façon ????

    • Accompagner nos clients dans la mise en pratique de la méthodologie DevOps : versionning et stratégie de développement, intégration continue, déploiement continu, Infrastructure as Code.

    • Implémenter chez nos clients les outils nécessaires à la mise en place des pratiques et outils DevOps (Terraform, Ansible, Puppet, Chef, Gitlab CI, ... ).

    • Concevoir et mettre en œuvre des solutions techniques éditeurs ou open source dans des environnements Cloud Hybrides et veiller à l’efficacité de ces dernières.

    • Intervenir dans des écosystèmes techniques DevOps et des plateformes de CI/CD complexes pour des milliers d’utilisateurs.

    • Contribuer à des missions intégrées aux équipes Client pour développer des applications adaptées à la méthodologie DevOps.

    Où réaliserez-vous vos missions ? Chez des clients grands comptes de la banque, de l’assurance, de l’industrie, du retail, de la défense, du luxe ou encore de l’énergie, porteurs de projets innovants.

    Qualifications

    ???? Compétences

    Quels atouts pour rejoindre l’équipe ?

    Diplômé.e d’une école d’ingénieurs ou d’un Master 2 en informatique, vous êtes doté.e d’un excellent relationnel, d’un sens prononcé du service et de la qualité.

    Vous avez minimum 4 années d’expérience professionnelle en tant que DevOps/SRE, et êtes issu.e du monde du développement ou de l'administration système.

    Vous êtes passionné.e par l’automatisation et l’amélioration continue et avez développé des compétences en scripting.

    Vous avez déjà expérimenté la mise en place d’outils de l’écosystème DevOps CI/CD, idéalement en production.

    Alors, si vous souhaitez progresser, apprendre et partager, rejoignez-nous !

    See more jobs at Devoteam

    Apply for this job

    +30d

    Azure Platform Engineer

    DevoteamMadrid, Spain, Remote
    DevOPSagilejiraterraformDesignansibleazuregitc++linuxpython

    Devoteam is hiring a Remote Azure Platform Engineer

    Descripción del empleo

    We are seeking an experienced and passionate azure cloud engineer to join our team. In this role, you will be responsible for building scalable solutions in the Azure Platforms for our Data & Analytics Platform. You may be expected to lead a team of junior engineers towards successful delivery of engineering solutions. The ideal candidate will bring a blend of technical skills and leadership ability, with the communication and teamwork skills to collaborate with other members of the IT team.

     

    Typical Duties & Responsibilities:
     Manage and administer the Microsoft Azure cloud environment, including provisioning,
    configuration, performance monitoring, policy governance and security.
     Design, develop, and implement highly available, multi-region solutions within Microsoft Azure
     Analyze existing operational standards, processes, and/or governance to identify and implement
    improvements.
     Engage in conducting Proof of Concepts of various vendor solutions and present final
    recommendations.
     Migrate existing infrastructure services to cloud-based solutions.
     Manage security and access controls of cloud-based solutions.
     Develop infrastructure as code (IaC) leveraging cloud native tooling to ensure automated and
    consistent platform deployments.
     Develop and implement policy driven data protection best practices to ensure cloud solutions
    are protected from data loss.
     Support cloud adoption of applications as they are being transformed and/or modernized.
     Ensure all infrastructure components meet proper performance and capacity standards.
     Participate in an on-call rotation to address and resolve technical escalations.

     

    Requisitos

    Required Skills and Experience:
     4+ years of Microsoft Azure experience involving design, deployment, configuration, and
    optimization.
     6+ years of experience with IaaS and PaaS solutions
     Experience in the design and operation of medium to large scale enterprise infrastructure,
    databases, and application systems
     Experience working within an Agile framework.
     Experience in IaC development with Terraform, Azure CLI, Puppet, and Ansible
     Experience Azure PowerShell.
     Experience with Python, data analysis using Python, working on Python solutions in Azure.

     Experience with C#, frameworks for apps and solutions on Azure.
     Experience with Windows, Linux VMs, networking, routing & firewalls.
     Experience with Azure services such as Azure App Services, ADF, Logic Apps, AKS, ADLS, etc..
     Proficient with GIT, ADO to perform source code management.
     Experience with Terraform Cloud and Azure DevOps
     Experience with Jira and ServiceNow
     Competence in a wide range of IT skills including networking, systems administration, data
    protection, information security and CI/CD tooling.
     Enthusiastic learner with the ability to teach and mentor teammates and cross functional
    partners.
     Excellent communication skills, both written and verbal, with a keen attention to detail
     Azure Certifications:
    o Azure Administrator Associate
    o Azure Developer Associate
    o DevOps Engineer Expert

    Preferred Qualifications
     10+ years’ IT experience
     Experience with administration or development using Azure Databricks & Unity Catalog
     Azure Certifications:
    o Azure Security Engineer Associate
    o Azure Solutions Architect Expert
    o Azure Data Engineer Associate

    See more jobs at Devoteam

    Apply for this job

    +30d

    Senior Engineer - DevOps

    SPLICEChicago, IL, Remote
    DevOPSterraformnosqlDesignansiblejava.netdockerkuberneteslinuxjenkinspythonAWS

    SPLICE is hiring a Remote Senior Engineer - DevOps

    Job Description

    Keys

    • Familiarity with the software development lifecycle and continuous integration/delivery concepts
    • Ability to work in both Windows and Linux environments
    • Understanding of containerization technologies such as Docker, openshift
    • Experience working with deployment automation tools such as Jenkins, Cloudbees
    • Fluency in one or more common scripting languages such as Python, Terraform, ansible , Groovy, Bourne shell, or PowerShell
    • 5+ years working in a DevOps, application support or administration, or configuration engineering role
    • 2+ years working in a hybrid public/private cloud environment including AWS

    Job Description

    • Owns the change request process and may coordinate with other teams as necessary.
    • Provides technical advice and weighs in on technical decisions that impact cross functional teams.
    • Researches and may propose new technologies.
    • Develops and owns list of final enhancements.
    • Develops and defines application scope and objectives and prepares technical and/or functional specifications from with programs will be written.
    • Performs technical design reviews and code reviews.
    • May own technical testing to ensures unit test is completed and meets the test plan requirements, system testing is completed and system is implemented according to plan.
    • Assesses current status and supports data information planning.
    • Coordinates on-call support and ensures effective monitoring of system.
    • Maintains technical development environment.
    • Mentors others and may lead multiple or small to medium sized projects. 
    • Will begin to set direction at the project/service level and influences decision-making.
    • Provides technical guidance, and mentoring.
    • Maintain source repositories and encourage good practices in source control
    • Support .NET and Java build automation processes
    • Engage with test engineers to maintain automated testing processes
    • Support developers in packaging software for distribution
    • Perform application deployments to development, test, and production environments both on-premises and in the cloud
    • Maintain application and environment configuration through automated processes
    • Monitor testing and production environments to ensure stable operation
    • ​​​​​​​Perform initial triage of application issues to ensure rapid resolution

    This is part of a hybrid work schedule, if you are in Atlanta or Chicago and within 50 miles of the office you would come in 1-3x per month.

    Sponsorship is not available for this position.

    Qualifications

    Requires an BA/BS degree in Information Technology, Computer Science or related field of study and a minimum of 5+ years related experience; multi dimensional platform experience; expert level experience with business and technical applications, or any combination of education and experience, which would provide an equivalent background.

    • Collaborative attitude and an ability to build consensus among your technical peers
    • Strong ability to communicate technical information to your non-technical peers
    • Attention to detail & excellent communication skills is must.
    • Experience developing and maintaining plugins and application support processes in IBM Urban Code Deploy
    • Developing and deploying infrastructure as code using Terraform/CloudFormation
    • Working on config management tools like Ansible/Chef/Puppet.
    • Working / mentoring offshore team members.
    • Developing Jenkins Pipeline build automation
    • DockerEE or Kubernetes container orchestration tools
    • Developing and deploying infrastructure as code
    • Developing application monitors and alerts
    • Experience with RDBMS as well as NoSQL data platforms.
    • A strong understanding of IP networking, including load balancing, routing, and firewall concepts

    See more jobs at SPLICE

    Apply for this job

    +30d

    SRE with Infra-as-code and programming skills

    Shift TechnologyFrance - Remote
    Full TimeDevOPSgolangterraformnosqlRabbitMQansiblemongodbazurec++elasticsearchkuberneteslinuxpythonAWS

    Shift Technology is hiring a Remote SRE with Infra-as-code and programming skills

    The future of insurance starts with AI. To date, Shift Technology's AI-powered products have benefitted more than 300 million policyholders globally by reducing underwriting risk, identifying more fraud, and automating critical tasks throughout the claims process.  Shift harnesses the power of AI to enable the world’s leading insurance organizations to make better decisions. Our products help insurers improve operational efficiency, reduce costs, and deliver superior customer experiences to their policyholders.  Our culture is built on innovation, trust, and a drive to transform the insurance industry by imagining and innovating solutions that impact insurers and their customers - like you! We come from more than 50 different countries and cultures and together we are creating the future of insurance.

    Our Engineering Team lies at the core of the value we offer to our customers. We solve complex problems by working not only within our squads, but also by working collaboratively with other teams across the organization. If you are excited by solving complex technical challenges, this is the right place for you!


    YOUR ROLE

    As a member of Shift Technology's SRE and Developer experience team within our Cloud platform department, your role will be to:

    • Build our Infrastructure platforms which enable the deployment of our services and their hosting (CI/CD, Cloud platform, Observability)
    • Own the development of our Internal developer platform which enables our internal users to self-serve (creation of a new service in Kubernetes, day 2 operations, start of new products…)
    • Keep our service reliable, available and fast
    • Debug, troubleshoot, optimize application performance and solve a scaling bottleneck in a critical service, whether they be deep in the OS kernel or in the application code
    • Define the internal operational needs, develop and own appropriate tools.
    • Provide expert support to our level-2 / application support team, to troubleshoot priority incidents, and conduct post-mortems
    • Build a DevOps and SRE culture and enable the transition to modern infrastructure management and deployment practices

     

    YOUR TOOLKIT

    We work with modern technologies and always encourage our team to explore what's new in the market.

    Our main tools are:

    • GitHub, Terraform, Python, C#, Golang, Ansible, ArgoCD
    • Linux, Containers and Kubernetes
    • Grafana, Opentelemetry, Tempo and Loki
    • Microsoft Azure, AWS, and OVH Data Centers
    • Palo Alto Firewalls, F5, Cloudflare and Cilium for our Kubernetes clusters

    What We Are Looking For

    • Technical Abilities:
      • 5+ years experience with data structure and algorithms, handling big data with NoSQL Databases, queuing systems with Kafka, RabbitMQ, Redis... and other like Elasticsearch, MongoDB, Clickhouse at scale
      • 3+ years of previous experience using Kubernetes in production and at least 1 major cloud provider at scale (multi  regions, geo routing, low latency, sharding, high availability and scalability)
      • Ability to write and scale Infrastructure as Code;
      • You have architected, built, and operated distributed systems to solve problems at high scale
      • Understanding of security, logging, monitoring and performance aspects of cloud-native platform and application architectures;
      • Solid understanding of automation principles and programming experience using frameworks such as Python, C# and/or GoLang
    • Soft Skills:
      • You have 5 years experience in SRE/DevOps and Software Engineering
      • You are proactive and take pride and ownership of your work and able to distribute the workload across the SRE team;
      • You are dynamic, curious, and eager to learn; always looking to expand your fields of expertise;
      • You can work under pressure and still deliver excellent service to our customers;
      • You are able to maintain a high level of confidentiality, professionalism and a courteous demeanor when working with clients and internal teams;
      • You can easily adapt your work to changing priorities, as needed.

     

    RECRUITMENT PROCESS

    • HR interview 
    • 1 technical interview with our Head of SRE
    • 1 case study from home + oral tech debrief with the team
    • Interview with the Head of Infrastructure

    #LI-RH1 #LI-REMOTE



    To support our permanent, full time employees at every stage of their careers and lives, we provide a competitive total rewards and benefits package. Here are the global benefits we’d like to highlight:

    • Flexible remote and hybrid working options
    • Competitive Salary and a variable component tied to personal and company performance
    • Company equity
    • Focus Fridays, a half-day each month to focus on learning and personal growth
    • Generous PTO and paid holidays
    • Mental health benefits 
    • 2 MAD Days per year (Make A Difference Days for paid volunteering)

    Additional benefits may be offered by country - ask your recruiter for more information. Intern and Apprentice position are eligible for some of these benefits - ask your recruiter for more details.

    At Shift we strive to be a diverse and inclusive workforce. We welcome applications from and hire people who will contribute to the diversity of our company, without regard to race, color, religion, marital status, age, national or ethnic origin, physical or mental disability, medical condition, pregnancy, genetic information, gender identity or expression, sexual orientation, or other non-merit criteria.

    Shift Technology is committed to providing reasonable accommodations for qualified individuals with disabilities in our application and employment process. Should you require accommodation, please email accommodation@shift-technology.com and we will work with you to meet your accessibility needs.

    Please be aware of scammers and only trust correspondence that comes from emails ending in shift-technology.com

    Shift Technology does not accept unsolicited CVs from recruiters or employment agencies in response to the Shift Technology Careers page or a Shift Technology social media post. Any unsolicited CVs, including those submitted directly to hiring managers, are deemed to be the property of Shift Technology.

    See more jobs at Shift Technology

    Apply for this job

    +30d

    Senior Software Engineer, Infrastructure

    GeminiRemote (USA)
    agileremote-firstscalaDesignansibleazurerubyjavadockerjenkinspythonAWS

    Gemini is hiring a Remote Senior Software Engineer, Infrastructure

    About the Company

    Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

    Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency. 

    At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

    In the United States, we have a flexible hybrid work policy for employees who live within 30 miles of our office headquartered in New York City and our office in Seattle. Employees within the New York and Seattle metropolitan areas are expected to work from the designated office twice a week, unless there is a job-specific requirement to be in the office every workday. Employees outside of these areas are considered part of our remote-first workforce. We believe our hybrid approach for those near our NYC and Seattle offices increases productivity through more in-person collaboration where possible.

    The Department: Crypto Core

    The Role: Senior Software Engineer, Infrastructure

    The infrastructure team at Gemini creates and manages software tools and platforms, automates the creation and support of this infrastructure, helps integrate complex processes, and supports secure data access.

    Security of customers’ digital assets and personal information held with Gemini is our first and foremost priority. The infrastructure team builds and operates environments for the purpose of digital asset access. There are three main pillars of work including building and running network nodes, building and running validators, and supporting our next generation wallet infrastructure. There are constant challenges in providing up to date data in our nodes to providing highly available specialty nodes to our stakeholders.

    In our work, we build and use software to support our cloud-based infrastructure. Given the need to build and integrate more of our software in the cloud, the ideal engineer will have extensive experience in automating and building out cloud-based software (e.g., AWS or GCP), preferably with experience as a software developer that focuses on cloud-based automation approaches. This engineer will also work closely with various teams including various teams such as Product Security, Protocols, On-chain, and Asset Transfer. 

    We are a dynamic group with both entrepreneurial spirit and security engineering experience. We have incredibly high aspirations, and we are looking for like-minded individuals who want to guide the transition to a new more decentralized world where access to digital assets is normalized and ubiquitous.

    Responsibilities:

    • Design, build, and deploy infrastructure in our three areas of focus 1) building and running network nodes, 2) building and running validators, and 3) building and running our next generation wallet infrastructure
    • Develop tools and automation that integrate these systems in a secure way
    • With a focus on our next generation wallet infrastructure, improve the capabilities of the existing infrastructure with a mindset towards infrastructure as code
    • Improve availability and reliability while maintaining acceptable security, especially in monitoring and automation 
    • Integrate the use of cloud-based security mechanisms into the build infrastructure. Example security mechanisms include identity and access management and key management
    • Participate in disaster recovery (DR) scenarios to validate operability of physical and digital material

    Minimum Qualifications:

    • 5+ years implementing cloud software while building “infrastructure as code”
    • Experience in at least one area of software development, operating systems or device driver development, hardware, secure protocols, encryption, authentication, key management, or applied cryptography 
    • Hands-on experience in at least one or more cloud platforms (e.g., AWS, GCP, Azure, or others)
    • Hands-on expertise with one or more of the following including ansible, puppet, docker, KMS, IAM, jenkins
    • Proficiency in a common scripting language including but not limited to Python, Ruby, etc.
    • Able to troubleshoot and debug issues, and demonstrate a methodical approach to root cause analysis
    • Strong written and verbal communication skills; attentive to details

    Preferred Qualifications:

    • 6+ years implementing software 
    • Ability to read and write code written in one or more of Go, Java, Scala, and C/C++
    • 3+ years implementing software in AWS
    • 1+ years using monitoring, alerting, and automation tooling 
    • Previous experience in one of the three focus areas of blockchain node operations, validators as a service, and wallet infrastructure
    • Experience in a code-first environment, developing automated solutions to solve support and operational issues
    • Experience working with engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
    • Demonstrated ability to convert theoretical security concepts into production
    • Solid understanding of Product Management and Product Ownership, Agile practices and methodologies
    •  
    It Pays to Work Here
     
    The compensation & benefits package for this role includes:
    • Competitive starting salary
    • A discretionary annual bonus
    • Long-term incentive in the form of a new hire equity grant
    • Comprehensive health plans
    • 401K with company matching
    • Paid Parental Leave
    • Flexible time off

    Salary Range: The base salary range for this role is between $152,000 - $190,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

    At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

    #LI-AH1

    Apply for this job

    +30d

    Cassandra DBA

    Now1Dallas, TX, Remote
    terraformnosqlsqloracleDesignansibleazurepostgresqlMySQLAWS

    Now1 is hiring a Remote Cassandra DBA

    Job Description

    We are looking for a Cassandra DBA for one of our clients here in Atlanta, GA or Dallas, TX. Please find the position details below and let me know your interest in this.

    • Role: DBA - Cassandra
    • Duration: 6+ Months
    • Location: Atlanta, GA / Dallas, TX (Remote)
    • No of positions: 2

    Required 5-7+ years of working experience in Design, Build, Maintenance, and Administration of NoSQL distributed database systems like Datastax Cassandra and good experience on Legacy Databases like PostgreSQL, MySQL, IBM DB2 UDB, Oracle DB, Graph DBs, etc. Needs to have experience in Google Cloud Platform or similar Cloud (AWS, Azure, etc.)

    Qualifications

    • Experience in Designing and deploying Highly Scalable Cassandra Database clusters on Cloud.
    • Proficient in scripting languages and automation tools like Terraform, Ansible, Chef required Skills
    • Cockroach DB (Distributed SQL Database). So distributed database systems deployment Support and ACID Databases Support knowledge required
    • Strong Experience handling common database procedures, such as upgrades, backups, recovery, migration, maintenance, etc.
    • Experience in designing, implementing, and deployment of large-scale web and database infrastructures that are highly available, performing, cost-effective, and sustainable.
    • Participate in continuous improvement efforts in enhancing performance and providing increased functionality.
    • Work with internal and external customers to develop new value-added programs and data solutions with the existing data structure.
    • Strong knowledge of Cassandra schema, CQL query, Cassandra command-line utility, and DataStax ops center.
    • Experience with GCP Compute Engine, Cloud Storage, and Google managed databases.(Cloud Spanner, Bigtable, CloudSQL, etc..) is big plus
    • Defining and delivering robust monitoring solutions for database tiers that encompass both infrastructure and application-level perspectives.
    • Extensive experience with designing and implementing database infrastructure and processes to support true 24x7x365 operations, by reaching the expectation of no downtime for database maintenance.
    • Analyzing incidents, performance, metrics, and trends to proactively identify and resolve potential site issues before they develop.
    • Proactively analyze performance to identify bottlenecks and handle incidents, bugs, and provide solutions with root cause analysis.
    • Performance monitoring and query fine-tuning, Problem determination and troubleshooting, interacting with UNIX and Application group and playing a diverse role with a large team of DBREs.

    See more jobs at Now1

    Apply for this job

    +30d

    DevOps Engineer

    ViedTechFrederick, MD, Remote
    DevOPSagilejiraDesignansiblegitlinuxjenkinsAWS

    ViedTech is hiring a Remote DevOps Engineer

    Job Description

    • Passionate about the concept of infrastructure as code and leverages modern tools to define, build and manage virtual infrastructure in the cloud.
    • Excellent hands-on experience with AWS.
    • Solid understanding of Windows systems (2012 R2+) and Linux Systems (CentOS, RedHat), hosts, networks, security, applications and proficiency in shell scripting. 
    • Solid understanding and experience with configuration management tools like Ansible and Jenkins, Terraform. 
    • Believes in automation for consistent, scalable and fool-proof delivery of infrastructure and applications.
    • Support production issues/high severity issues on weekends or off hours as required.
    • As part of company’s growth efforts, employee from time to time participates in coding and design challenges to build working prototypes

    Required Skills

    • 3 or more years of experience in working as devOps leader focusing on CI/CD and CM tools and modern frameworks in the eco-system. 
    • 3 or more years of solid hands-on experience with working on AWS  
    • 3 or more years of hands-on experience in using Ansible.
    • 3 or more years of experience with orchestration tools such as terraform.
    • Someone with experience with tools such as Jenkins to enable CI/CD. 
    • 3 or more years of experience working with agile tools like Jira, Git, and Confluence. 
    • Candidates with proven certifications and socially accessible profiles that demonstrate the body of work and participation in modern collaboration hubs. 
    • A great team player and genuinely believes in solving challenges as a team.
    • Willing to learn new technologies and methodologies quickly. 
    • Explores alternatives and quickly prototyping to validate hypothetical architectures or solutions 
    • Good understanding of the core tenets of agile both in letter and spirit  

    NOTE : This role requires the hired candidate to go through public clearance. A minimum of 3 years of stay in the U.S. within the last 5 years is a must to be eligible to qualify for public trust clearance sponsorship.

     

      Qualifications

      See more jobs at ViedTech

      Apply for this job

      +30d

      Security Engineer

      terraformSailPointDesignansibleazurec++kuberneteslinuxpython

      Cloudflare is hiring a Remote Security Engineer

      About Us

      At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

      We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

      Available Locations: Lisbon, Portugal or Austin, Texas 

      About the role 

      As a Security Engineer, you will play a key role in designing, implementing, and managing security technologies and the supporting infrastructure.  You will  be responsible for ensuring systems are secure, highly available, fault tolerant, and scale to meet business needs.  

      Work may include documenting new standard operating procedures, ensuring vendor recommended security baseline configurations are implemented, designing repeatable deployment patterns, performing disaster recovery testing, configuring new integrations, implementing a new technology, patching applications and operating systems, performing upgrades and other maintenance tasks, documenting the as-built architecture, and participate in investigations and service restorations. 

      What You’ll Do

      • Design, implement, and maintain secure infrastructure across various environments (non-production and production).
      • Ensure resilient and secure designs are implemented and maintained.
      • Drive continuous improvement while maintaining multiple environments.
      • Engage in proactive risk management and incident response planning.
      • Develop or utilize automation to streamline repeatable tasks.Contribute to the creation and dissemination of knowledge about the designs within the company.

      Qualifications

      • Experience with deploying and administering Kubernetes in an enterprise environment. 
      • Experience with deploying and administering Linux systems in an enterprise environment. 
      • Experience with deploying and administering Cloudflare products (access, tunnels, waf) Experience implementing, intergrading, and  supporting identity and access management (IAM) technologies. 
      • Experience deploying and administering enterprise solutions in GCP, Azure, and AWS.Experience implementing, integrating, and supporting application security tools within a CICD pipeline environment.
      • Experience with all aspects of network infrastructure. Experience in all aspects of Site Reliability Engineering (SRE).
      • Solid understanding of reliability engineering principles and a commitment to continuous improvement.Experience writing scripts, leveraging automation, and creating infrastructure as code to streamline processes.
      • Strong analytical skills focused on service availability with curiosity and thoroughness in problem-solving.
      • Ability to navigate ambiguity, bring clarity to complex situations, and collaborate effectively with various stakeholders.

      Desired Skills

      • Proficient in managing IAM related technologies like SailPoint, Saviynt, OneLogin, Ping, Okta, Azure Active Directory, Cyberark, Dilenea, or Beyond Trust in diverse environments.
      • Proficient in managing Application Security related technologies like Veracode, Checkmarx, SonarQube, Snyk, Semgrep, Fortify, or Coverity integrated into CI/CD pipelines. 
      • Strong background in deploying and supporting infrastructure and security technologies.
      • Knowledge of scripting and automation tools (e.g., Python, Terraform, Ansible).
      • Excellent communication and collaboration skills.

      What Makes Cloudflare Special?

      We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

      Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

      Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

      1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

      Sound like something you’d like to be a part of? We’d love to hear from you!

      This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

      Cloudflare is proud to be an equal opportunity employer.  We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness.  All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

      Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

      See more jobs at Cloudflare

      Apply for this job

      +30d

      Senior Dev Ops Engineer (UK & Poland Remote)

      Turnitin LLCBirmingham, United Kingdom, Remote
      DevOPSterraformsqlDesignansibledockerpostgresqlkubernetesjenkinsAWS

      Turnitin LLC is hiring a Remote Senior Dev Ops Engineer (UK & Poland Remote)

      Job Description

      We are seeking a Senior DevOps Engineer with hands-on experience building, automating and operating large-scale systems. You will be part of an exceptional team of individuals spread across the globe, working on the most challenging technical problems in EdTech, helping to build systems, tools, and platforms on which Turnitin’s infrastructure and applications are deployed and operated.

      Responsibilities

      • Collaborate cross-functionally with the Engineering, Quality Assurance, and Support teams.
      • Break down large projects and features into independently workable/shippable milestones and stories.
      • Contribute to the architectural design and implementation of the infrastructure Turnitin runs both on-premise and in AWS.
      • Contribute readable, testable, maintainable & documented code when making changes to our infrastructure through Infrastructure as Code (IaC) systems like Terraform, or AWS Cloudformation.
      • Contribute readable, testable, maintainable & documented code when managing configuration for our infrastructure through Configuration as Code (IaC) systems like Ansible, or Puppet..
      • Ensure systems and platforms relied upon by both external and internal customers are fault-tolerant, highly available.

      Qualifications

      • 5+ years experience with containerization technology (Docker) and administration of distributed containerization orchestration like Kubernetes (including EKS/AKS/GKE) or Docker Swarm.
      • 5+ years Comfort in creating and executing configuration management using tools like Ansible and Puppet.
      • 5+ years  well-versed in best practices when writing efficient, understandable & maintainable Infrastructure as Code.
      • 4+ years Demonstrable experience and curiosity when troubleshooting full-stack production systems (including network, storage, compute layers, and service dependencies such as DNS, DB, etc.).
      • 4+ years Experience with continuous integration and delivery platforms such as Jenkins, Github Actions, or Bitbucket Pipelines. 
      • 3+ years  microservices, micro front-ends and distributed architecture.
      • An interest in log management, including AWS OpenSearch, Fluentd, and Kibana (ELK), or Splunk logging.
      • 4+ years AWS CLoud and Databases, (PostgreSQL, SQL, DB2).

      Apply for this job

      +30d

      Staff DevOps Engineer

      BloomreachSlovakia, Czechia, Remote
      gRPCDevOPSgolangredisremote-firstterraformDesignansiblemongodbdockerelasticsearchkubernetespython

      Bloomreach is hiring a Remote Staff DevOps Engineer

      Bloomreach is the world’s #1 Commerce Experience Cloud, empowering brands to deliver customer journeys so personalized, they feel like magic. It offers a suite of products that drive true personalization and digital commerce growth, including:

      • Discovery, offering AI-driven search and merchandising
      • Content, offering a headless CMS
      • Engagement, offering a leading CDP and marketing automation solutions

      Together, these solutions combine the power of unified customer and product data with the speed and scale of AI optimization, enabling revenue-driving digital commerce experiences that convert on any channel and every journey. Bloomreach serves over 850 global brands including Albertsons, Bosch, Puma, FC Bayern München, and Marks & Spencer. Bloomreach recently raised $175 million in a Series F funding round, bringing its total valuation to $2.2 billion. The investment was led by Goldman Sachs Asset Management with participation from Bain Capital Ventures and Sixth Street Growth. For more information, visit Bloomreach.com.

       

      Are you looking for a cutting-edge tech stack to work with on a daily basis? We are currently expanding our Infrastructure team and are looking for a new colleague to join as a SeniorDevOps / Infrastructure Engineer. The salary starts from €3,500 based on your experience level, and you can work from home or one of our Central Europe offices on a full-time basis. Are you ready to grow with us?

      What tech stack do we have for you?

      • Python, Golang
      • Kubernetes, Terraform, Gitlab
      • Google Cloud, GCP Bigtable, GCP BigQuery, GRPC  
      • MongoDB, Redis, Elasticsearch, Influxdb, Etcd, Kafka
      • Victoria Metrics, Grafana, Sentry

      Minimum requirements:

      At least 3 years of production experience with:

      • Kubernetes - we are looking for engineer that not only deployed applications to a cluster, but who also understands what is happening behind the scenes and can operate 24x7 production 
      • GCP (preferred)/AWS/Azure - our solution is built on top of GCP platform. Candidate should be comfortable working with public cloud, understand the risks and benefits associated with running applications in the public cloud, be familiar with infrastructure as a code principle and have ability to make design choices between using cloud managed solutions versus self hosted alternatives
      • Python/Go - you should be a solid programmer capable of developing custom tooling

      If you don’t meet these requirements, don’t worry we are also looking for Junior DevOps engineers.

      How to know if you are good fit:

      The qualifications outlined below serve as a guide to determine if your skills and experience align with the requirements of this position:

      • Continuous Learning: You have a keen interest in Kubernetes and related technologies, demonstrated by your active engagement in reading and staying updated about them.
      • Conference Participation: You have participated in DevOps related conferences, showcasing your commitment to continuous learning and networking in the field.
      • Configuration Proficiency: You have hands-on experience configuring pod/container security context, network policies, roles and role bindings, pod affinity, host path, pod disruption budgets, priority classes, node taints, to name a few.
      • Resource Optimization: You have analyzed resource usage of applications hosted on a cluster and implemented or suggested changes to resource requests/limits, Horizontal Pod Autoscalers (HPAs), or Vertical Pod Autoscalers (VPAs).
      • Cluster Management: You have a deep understanding of the clusters you manage, including the types of machines used in node pools, the reasons for their selection, the enabled or disabled cluster features, the cluster version, and the node autoscaling setup. You have successfully upgraded Kubernetes cluster versions without causing interruptions to live applications hosted on the cluster.
      • Terraform Proficiency: You have written a Terraform module with multiple interconnected resources. 
      • Monitoring and Alerting: You have experience setting up monitoring systems and configuring alerts. On-duty experience is preferred, along with experience with Grafana and Prometheus.
      • DevOps and CI/CD Experience: You have experience with DevOps, Orchestration/Configuration Management, and Continuous Integration technologies such as Terraform, GitLab, Ansible, Docker, etc.
      • Team Onboarding and Training: You have experience with onboarding and training new team members, demonstrating your leadership skills and commitment to team growth.

      About your team:

      The Infrastructure team operates and maintains Bloomreach Engagement core infrastructure built on Google Cloud with security, high availability, costs, and scalability in mind.Our vision is to identify and implement opportunities to achieve a robust, reliable, and efficient infrastructure and development platform. We strongly support DevOps culture: each team is responsible for releasing, operating, and monitoring their own applications.

      The role of the Infrastructure Team is to provide a strong foundation upon which all teams can build, for example, manage big infrastructure components like Kubernetes, databases, and cloud components in Google Cloud. An important role of the team is also providing support for developers, reviewing design proposals, validating the performance and availability of applications, and sometimes even developing new core application components like logging or authorization.

      Tasks and responsibilities:

      In the position of DevOps Engineer, you’d be expected to work with other Engineering teams to design sustainable infrastructure, microservice solutions, and an efficient and robust production environment. Additionally, you’ll be working on a variety of tasks and projects, including automating tools and infrastructure to reduce manual work, monitoring applications and participating in an on-call rotation as required. 

      The ideal candidate will be passionate about learning new things, creative, willing to take the initiative, and able to think outside the box to solve problems strategically.

      #LI-DU1

      More things you'll like about Bloomreach:

      Culture:

      • A great deal of freedom and trust. At Bloomreach we don’t clock in and out, and we have neither corporate rules nor long approval processes. This freedom goes hand in hand with responsibility. We are interested in results from day one. 

      • We have defined our5 valuesand the 10 underlying key behaviors that we strongly believe in. We can only succeed if everyone lives these behaviors day to day. We've embedded them in our processes like recruitment, onboarding, feedback, personal development, performance review and internal communication. 

      • We believe in flexible working hours to accommodate your working style.

      • We work remote-first with several Bloomreach Hubs available across three continents.

      • We organize company events to experience the global spirit of the company and get excited about what's ahead.

      • We encourage and support our employees to engage in volunteering activities - every Bloomreacher can take 5 paid days off to volunteer*.
      • TheBloomreach Glassdoor pageelaborates on our stellar 4.6/5 rating. The Bloomreach Comparably page Culture score is even higher at 4.9/5

      Personal Development:

      • We have a People Development Program -- participating in personal development workshops on various topics run by experts from inside the company. We are continuously developing & updating competency maps for select functions.

      • Our resident communication coachIvo Večeřais available to help navigate work-related communications & decision-making challenges.*
      • Our managers are strongly encouraged to participate in the Leader Development Program to develop in the areas we consider essential for any leader. The program includes regular comprehensive feedback, consultations with a coach and follow-up check-ins.

      • Bloomreachers utilize the $1,500 professional education budget on an annual basis to purchase education products (books, courses, certifications, etc.)*

      Well-being:

      • The Employee Assistance Program -- with counselors -- is available for non-work-related challenges.*

      • Subscription to Calm - sleep and meditation app.*

      • We organize ‘DisConnect’ days where Bloomreachers globally enjoy one additional day off each quarter, allowing us to unwind together and focus on activities away from the screen with our loved ones.

      • We facilitate sports, yoga, and meditation opportunities for each other.

      • Extended parental leave up to 26 calendar weeks for Primary Caregivers.*

      Compensation:

      • Restricted Stock Units or Stock Options are granted depending on a team member’s role, seniority, and location.*

      • Everyone gets to participate in the company's success through the company performance bonus.*

      • We offer an employee referral bonus of up to $3,000 paid out immediately after the new hire starts.

      • We reward & celebrate work anniversaries -- Bloomversaries!*

      (*Subject to employment type. Interns are exempt from marked benefits, usually for the first 6 months.)

      Excited? Join us and transform the future of commerce experiences!

      If this position doesn't suit you, but you know someone who might be a great fit, share it - we will be very grateful!


      Any unsolicited resumes/candidate profiles submitted through our website or to personal email accounts of employees of Bloomreach are considered property of Bloomreach and are not subject to payment of agency fees.

       #LI-Remote

      See more jobs at Bloomreach

      Apply for this job