ansible Remote Jobs

217 Results

3h

Senior Reliability Engineer

AmericorRemote
Full TimeDevOPSS3agilejiraterraformlaravelansibleazuremetalqasymfonydockerkuberneteslinuxAWSPHP

Americor is hiring a Remote Senior Reliability Engineer

Senior Reliability Engineer - Americor - Career PageSee more jobs at Americor

Apply for this job

8h

Network Reliability Engineer

RustairflowDesignansiblemetalc++dockerkuberneteslinuxpython

Cloudflare is hiring a Remote Network Reliability Engineer

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

Hiring Locations: Austin Texas, Atlanta, Denver, New York City, San Francisco, Seattle, or Washington D.C.

About the Role (or What you'll do)

Cloudflare operates a large global network spanning hundreds of cities (data centers). You will join a team of talented network engineers who are building software solutions to improve network resilience and reduce operational toil.
This position will be responsible for the technical operation and engineering of the Cloudflare's core data center network, including the planning, installation and management of the hardware and software as well as the day-to-day operations of the network. The core network supports our critical internal needs such as databases, high volume logging, and internal application clusters. This is an opportunity to be part of the team that is building a high­-performance network that is accessible to any web property online.

You will build tools to automate operational tasks, streamline deployment processes and provide a platform for other engineering teams to build upon. You will nurture a passion for an “automate everything” approach that makes systems failure-resistant and ready-to-scale. Furthermore, you will be required to play a key role in system design and demonstrate the ability to bring an idea from design all the way to production.

 

Examples of desirable skills, knowledge and experience

  • 5+ years of relevant Network/Site Reliability Engineering experience
  • BA/BS in Computer Science or equivalent experience
  • Solid foundation on configuration management frameworks: Saltstack, Ansible, Chef
  • Experience with NX-OS, JUNOS, EOS, Cumulus, or Sonic Network Operating Systems 
  • Solid Linux systems administration experience
  • Linux networking - iproute2, Traffic Control, Devlink, etc. 
  • Strong software development skills in Go and Python

Bonus Points

  • Deep knowledge of BGP and other routing protocols
  • Workflow Management (AirFlow, Temporal)
  • Open Source Routing Daemons (FRR, Bird, GoBGP)
  • Experience with bare metal switching
  • Experience with network programming in C, C++ or rust
  • Experience with the Linux kernel and Linux software packaging
  • Strong tooling and automations development experience
  • Time series databases (Prometheus, Grafana, Thanos, Clickhouse) 
  • Other Tools - Kubernetes, Docker, Prometheus, Consul

Compensation

Compensation may be adjusted depending on work location and level. 

  • For Colorado-based hires: Estimated annual salary of $137,000 - $187,000.
  • For New York City-based and California (excluding Bay Area) and Washington hires: Estimated annual salary of $154,000- $208,000.
  • For Bay Area-based hires: Estimated annual salary of $162,000 - $218,000.

Equity

This role is eligible to participate in Cloudflare’s equity plan.

Benefits

Cloudflare offers a complete package of benefits and programs to support you and your family.  Our benefits programs can help you pay health care expenses, support caregiving, build capital for the future and make life a little easier and fun!  The below is a description of our benefits for employees in the United States, and benefits may vary for employees based outside the U.S.

Health & Welfare Benefits

  • Medical/Rx Insurance
  • Dental Insurance
  • Vision Insurance
  • Flexible Spending Accounts
  • Commuter Spending Accounts
  • Fertility & Family Forming Benefits
  • On-demand mental health support and Employee Assistance Program
  • Global Travel Medical Insurance

Financial Benefits

  • Short and Long Term Disability Insurance
  • Life & Accident Insurance
  • 401(k) Retirement Savings Plan
  • Employee Stock Participation Plan

Time Off

  • Flexible paid time off covering vacation and sick leave
  • Leave programs, including parental, pregnancy health, medical, and bereavement leave

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer.  We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness.  All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

See more jobs at Cloudflare

Apply for this job

1d

Cloud Platform Engineer (Hybrid)

UpstreamGerakas,Attica,Greece, Remote Hybrid
agileDesignmobileansiblejavakuberneteslinuxjenkinspythonAWSPHP

Upstream is hiring a Remote Cloud Platform Engineer (Hybrid)

Who we are

We are a leader in mobile technology, providing innovative solutions to 1.2 billion consumers. In an ever-increasing digital world, we help businesses grow their digital reach & toolkits to optimize user experience, increase engagement, attract new customers, and boost their revenues.

Think of performance marketing but on steroids as our top-notch marketers and engineers build digital journeys through an omnichannel marketing approach like no other. Currently, we work with the biggest names in Telco, Insurance, Education, FMCGs, and Retail in over 45 countries in Latin America, Africa, the Middle East, and South-East Asia.

The role

Our focus centers on streamlining the development, building, testing, integration, packaging, and deployment of our microservice-based products. We are looking for a Cloud Platform Engineerto join our team and help us maintain and scale our infrastructure to provide an even better experience for our users. An ideal candidate would be a passionate engineer who loves Linux, prefers working in the shell over an IDE, is intimately familiar with cloud infrastructure and VMWare virtualization, has a strong networking background, is comfortable in a polyglot environment, who thrives in true-agile, fast paced, production facing environments and loves hardware. A team member who may not have all the answers but knows how to find them.

What you will do...

  • Help us define the future of our artifact and container-based deployment strategies.
  • Evolve and support our immutable deployment platform.
  • Contribute to the design of high-volume, low-latency applications for mission-critical systems, delivering high availability and performance while working closely with development teams.
  • Design and implement on-premise and cloud infrastructure solutions on platforms such as VMWare, AWS, Azure.
  • Maintain and upgrade our on-premise data center infrastructure.
  • Optimize cloud infrastructure for cost, performance, and scalability.
  • Develop and deploy Infrastructure as Code (IaC) using tools like Ansible and Terraform.
  • Troubleshoot and resolve issues related to system performance, network connectivity, and security.
  • Perform system upgrades, patch management, and security hardening.
  • Adapt to and become part of a continuously evolving environment.
  • Design and implement CI/CD pipelines using tools like Jenkins, GitLab CI/CD, or other similar tools to automate the build, test, and deployment processes.
  • Enable developers to quickly build, bake, and deploy images either locally, on Kubernetes and to the cloud.
  • Work directly with Upstream engineers to provide a polyglot-friendly experience and first-class support for platforms built on Java, Python, Go, PHP and others.
  • Monitor the performance, availability, and security of our on-premise and cloud environments.
  • Contribute to the development of tools for automating deployments, unifying platform metrics, reporting, and monitoring.

  • Expertise in VMware vCenter, clustering, and virtualization technologies.
  • Experience managing bare metal servers, storages and SAN networks.
  • Linux experience -- knowledge on Linux system administration and troubleshooting.
  • Exposure to container technology (e.g., Docker), container orchestration systems (e.g., Kubernetes), container-focused Linux distributions, and cloud virtualization.
  • Cloud experience -- designing and building tools and infrastructure for cloud platforms (e.g. AWS, Azure).
  • Knowledge on DevOps tools -- Ansible, Terraform, Helm, Jenkins, Gitlab, Grafana, Prometheus, Loki amongst others.
  • Operational experience -- comfortable providing support to other departments, optimizing deployments for availability and uptime, going deep on troubleshooting and remediation.
  • Programming experience -- Bash, Python, Golang amongst others (both reading and writing).
  • Knowledge of Relational Databases (e.g., PostgreSQL) and NoSQL systems (e.g., Redis, ElasticSearch, Couchbase, Cassandra).
  • Great communication skills, both written and verbal.
  • Eagerness to learn new technologies.

We offer a competitive base salary and benefits, directly dependent on the candidate’s qualifications and skills. The real excitement comes from working closely with a dynamic, smart, agile, and highly motivated team in a competitive and fast-paced environment.

Follow us on LinkedIn and stay updated on our latest news. Upstream is an equal-opportunity employer.
The Company does not discriminate on the basis of race, color, creed, pregnancy, religion, gender, national origin, age, disability, marital, or any other legally protected status. The Company also makes reasonable accommodations for disabled employees.
Finally, the Company prohibits the harassment of any individual based on their protected status. This policy applies to all areas of personnel actions including recruitment, hiring, training, promotion, compensation, benefits, transfer,and social and recreational programs

See more jobs at Upstream

Apply for this job

1d

Staff Information Cloud Security Engineer

ServiceNowAtlanta, Georgia, Remote
terraformDesignansibleazurekuberneteslinuxpythonAWS

ServiceNow is hiring a Remote Staff Information Cloud Security Engineer

Job Description

The ServiceNow Security Organization delivers world-class, innovative security solutions to reduce risk and protect the company and our customers. We enable our customers to migrate their most sensitive data and workloads to the cloud, accelerating our business so that we are the most trusted SaaS provider. We create an environment where our employees are proud to work and can make a positive impact 

The Team 

This role will be a part of the Security Engineering Org that reports to the Senior Manager of Public Cloud Security Engineering. The security engineering team targets building state-of-the-art technology that will help reduce the risk surrounding the sensitive assets of the company with the least impact possible on operations, acts as guidance and facilitator to the security operations teams and helps shifting Security perception from blocker to enabler by building a relationship of trust with the other teams. 

What you get to do in this role: 

  • Build and operationalize security tools. 

  • Build and automate new and existing workflows and trending metrics dashboards 

  • Suggest security improvements by assessing the current situation, evaluating trends, anticipating requirements, and supporting proof-of-concept experimentations. 

  • Create and present functional and technical designs including data analysis to business team, and gather feedback to influence solution design and approach 

  • Direct and influence multi-disciplinary teams in implementing and operating Cyber Security controls 

  • Participate in security incidents and help implement containment and eradication. 

  • Evaluate audit and vulnerability findings and act upon them. 

  • Serve as an expert for the different tools built by the team. 

  • Act as guidance and facilitator to security operations. 

  • Build strong relationships with different stakeholders. 

  • Partake in efforts that shape the organization’s security policies and standards for use in hybrid cloud environments 

  • Interpret security and technical requirements into business requirements and communicate security risks to relevant stakeholders ranging from business leaders to engineers 

  • Ability to influence build vs buy decisions related to security tools 

  • Accountable for the timely delivery of projects as per established roadmaps by working closely with the business teams 

  • Participate in on call roster 

  • Strong alignment with product management and engineering teams on roadmap and product feedback 

  • Provide mentoring and training to peers and other colleagues in the organization 

Qualifications

To be successful in this role you have: 

  • MUST HAVE 8+ years of Security Engineering is required 

  • MUST HAVE 5+ Years working in the Public Cloud (GCP, AWS or Azure)

    • Compute, network, storage, content delivery, administration and security, deployment and management, automation technologies 

    • Preference is GCP, but OK if you have 2 of the 3 

    • Experience in infrastructure automation tools like AWS CloudFormation, Ansible, Terraform and equivalents. 

  • MUST BE PROFICIANT In Python

    • REGO is a nice to have 

  • Experience with containerization on Cloud, AKS or EKS (Dedicated/Managed/Serverless Kubernetes). 

  • Experience implementing cloud native security tools like AWS Guard Duty or Azure Security Center.  

  • Experience implementing Cloud Security Posture Management toolset. 

  • Apply adept understanding and experience with systems automation platforms and technologies 

  • Execute security architectures for cloud/hybrid systems 

  • Automate security controls, data and processes to provide improved metrics and operational support 

  • Build dynamic dashboards to track metrics 

  • Employ cloud-based APIs when suitable to write network/system level tools for safeguarding cloud environments 

  • Spot and execute new security technologies and best practices into the company’s Cloud offerings. 

  • Strong working knowledge of system internals as well as networking. 

  • Strong familiarity with Linux and Windows operating systems and cloud provider ecosystems like AWS/Azure/GCP. 

  • Assist in the integration of Infrastructure pipelines with secure configuration parameters to remove or reduce known threat vectors. 

  • Familiarity with common security vulnerabilities and the ability to judge their severity and impact to the business 

  • A keen analytical mind for problem solving, abstract thought, and offensive security tactics with a goal to make security a strong enabler at ServiceNow. 

  • Proven experience in being an effective team player and driven to automate and continuously improve 

  • Ability to articulate complex issues to executives and customers 

 

 

#SecurityJobs 

See more jobs at ServiceNow

Apply for this job

2d

DevOps Engineer

MobicaRemote, Poland
DevOPSansiblegitdockerkubernetes

Mobica is hiring a Remote DevOps Engineer

Job Description

We are seeking a DevOps Engineer to join the team working for our Customer who is a global financial institution. You will be part of a team focused on enhancing and deploying applications that support credit risk rating and financial insight for Wholesale Banking. In this role, you will drive the expansion of risk rating applications for a global rollout and support the team’s journey to full automation of software delivery. You’ll work within a cross-functional team to ensure reliable, secure, and efficient deployments while applying the latest DevOps and SRE practices.

Qualifications

Required skills:

  • Strong experience with Linux/Unix system administration.
  • Experience with CI/CD tools and automated pipelines.
  • Hands-on experience with observability tools like ELK, Grafana, or Prometheus.
  • Knowledge of container technologies (e.g., Docker, Kubernetes).
  • Scripting skills in Bash and/or PowerShell.
  • Familiarity with firewall management and network security practices.
  • Knowledge of Git and version control systems.
  • Experience with Agile/Scrum and ITIL-based incident/problem management.
  • Solid understanding of networking concepts such as DNS and TCP/IP.
  • Good English language skills, both written and verbal
  • Experience with Ansible for automation and configuration management.
  • Knowledge of RDBMS and database management.
  • Familiarity with SRE concepts, including SLIs, SLAs, and error budgets.
  • Strong problem-solving skills and a proactive approach to system optimization.

See more jobs at Mobica

Apply for this job

2d

GCP Infrastructure Architect

MobicaRemote, Poland
golangCommercial experienceterraformDesignansibleazureapikubernetes

Mobica is hiring a Remote GCP Infrastructure Architect

Job Description

We are seeking a GCP Infrastructure Engineer to design, build, and maintain scalable, secure, and efficient Google Cloud Platform (GCP)-based solutions for our multinational Customer. The role involves setting up infrastructure, developing a microservices-based API framework in Golang, and ensuring robust security and governance practices. The ideal candidate has expertise in cloud infrastructure, API development, and CI/CD processes.

Key Responsibilities:

  • Designing and implementing scalable, reliable, and secure GCP-based solutions to meet clients' business requirements 
  • Advising clients on the best practices and optimal use of GCP services and features 
  • Building and maintaining GCP infrastructure, including virtual machines, storage, and networking components 
  • Automating deployment and management of GCP resources using tools like Terraform and Ansible 
  • Monitoring and troubleshooting GCP infrastructure, applications, and services 
  • Collaborating with cross-functional teams to design and implement end-to-end solutions that leverage GCP services 
  • Staying up to date with GCP offerings and new features, and recommending improvements to existing solutions 
  • Ensuring compliance with security and regulatory requirements 
  • Developing and maintaining documentation related to GCP architecture, configurations, and processes 
  • Client wants to build an API based framework in Golang, this framework should be in microservices architecture. 
  • Experience in integration of Cloud IAM ang Azure AD. Setting up data, network and infra security and right Governance for continuous management. 

Qualifications

Must Have:

  • 8+ years of commercial experience in software development or software architecture
  • Strong experience with Google Cloud Platform (GCP) infrastructure and services.
  • Proficiency in Golang for API development within a microservices architecture.
  • Experience with CI/CD tools and pipelines.
  • Familiarity with integrating Cloud IAM and Azure AD for secure access management.
  • Expertise in infrastructure automation using tools like Terraform and Ansible.
  • In-depth knowledge of data, network, and infrastructure security practices.
  • Strong understanding of scalable and reliable cloud architecture design.
  • Prior experience in documenting technical architectures and processes.
  • Experience in setting up governance for continuous management and compliance.
  • Excellent problem-solving and troubleshooting skills.
  • Good English language skills, both written and verbal

Nice to Have:

  • Experience in Kubernetes for container orchestration.
  • Familiarity with observability tools such as Grafana, Prometheus, or ELK Stack.
  • Knowledge of serverless technologies or event-driven architectures.
  • Hands-on experience in Agile/Scrum environments.
  • Familiarity with regulatory compliance in cloud environments (e.g., GDPR, HIPAA).

See more jobs at Mobica

Apply for this job

4d

Cloud Security Architect (US Remote)

Experian., ., Remote
DevOPSterraformansibleazuregitrubyc++jenkinspythonAWSjavascript

Experian is hiring a Remote Cloud Security Architect (US Remote)

Job Description

As a Cloud Security Architect, you will be an individual contributor who works with Experian's application teams, helping them build and manage a secure cloud infrastructure by following Experian's cloud security policy and industry best practices. This is a growing team with senior leadership support and visibility. You will be involved in projects or high-complexity issues that require you to think quickly and improve efficiency.

The Cloud Security Architect will report to the Director of Cloud Security.

You'll have the opportunity to:

  • Lead Experian's global cloud security and tech transformation strategy by guiding the architecture of critical automated security capabilities into Experian's enterprise CI/CD pipelines
  • Work with Experian's enterprise cloud architects and engineers to build modernized multi-hybrid automated cloud security controls
  • Work with our Cloud Engineers, Solutions Engineers, and DevOps team to operationalize Experian security policies
  • Create and gain stakeholder support of cloud/hybrid security architectures that achieve important business strategies
  • Help governance, compliance, and risk management teams to ensure the system meets the requirements for certification and accreditation
  • Provide regular communication with all project partners at all levels, including presentations to senior management, creating agendas and meeting minutes
  • Create and support KPIs and KRIs that measure risk reduction and progress in cloud security over time

Qualifications

Your background:

  • 10+ years' of related work experience
  • 3+ years' of experience as a DevSecOps Engineer or experience in Cloud Architecture and Engineering
  • 3+ years' of experience with cloud platforms such as Amazon Web Services (AWS), Azure, Google Cloud Platform (GCP)
  • Proficiency in scripting/programming languages such as Python, JavaScript, Ruby, C#, or PowerShell
  • In-depth knowledge of DevSecOps principles and experience with CI/CD tools such as Terraform, Jenkins, Harness, and Open Policy Agent (OPA)
  • Experience working with technology architects and writing security Policy as Code (PaC) or working as a DevSecOps engineer with Infrastructure as Code (IaC) experience
  • Experience with container offerings and capabilities within Amazon Web Services (AWS), GCP, and Microsoft Azure platforms such as EKS, ECS, AKS, or GKE
  • In-depth knowledge of CIS Benchmarks for best practices for cloud and container platform configuration
  • Experience with deployment orchestration, automation, and security configuration management (Jenkins, Puppet, Chef, Git, CloudFormation, Terraform, or Ansible)
  • Experience with assessment, development, optimization, and documentation of a comprehensive and broad set of security technologies and processes (secure software development (Application Security), data protection, cryptography, key management, Identity and Access management (IAM), network security) within SaaS, IaaS, PaaS, and other cloud environments
  • Working knowledge of common and industry standard cloud-native/cloud-friendly authentication mechanisms (OAuth, OpenID)
  • Industry Cloud Certifications (Azure/AWS/GCP); CISSP certification preferred
  • College degree or equivalent

Benefits/Perks:

  • Great compensation package and bonus plan
  • Core benefits including full medical, dental, vision, and matching 401K
  • Flexible work environment, ability to work remotely, hybrid, or in-office
  • Flexible time off, including volunteer time off, vacation, sick, and 12-paid holidays

See more jobs at Experian

Apply for this job

5d

Senior Azure Cloud Infrastructure and Automation Engineer (fully remote opportunity)

Full TimeDevOPSOpenAIterraformsqlB2Bansibleazureapi

Zealogics.com is hiring a Remote Senior Azure Cloud Infrastructure and Automation Engineer (fully remote opportunity)

Senior Azure Cloud Infrastructure and Automation Engineer (fully remote opportunity) - Zealogics.com - Career PageSee more jobs at Zealogics.com

Apply for this job

5d

Senior Kubernetes Admin / Systems Engineer, EngProd

AristaVancouver, Canada, Remote
SalesDesignansiblemetalelasticsearchMySQLkuberneteslinuxjenkinspython

Arista is hiring a Remote Senior Kubernetes Admin / Systems Engineer, EngProd

Job Description

Who You’ll Work With

Arista Networks is looking for world-class Kubernetes-aware engineers passionate about driving systems reliability and scalability to provide the best possible development experience for our 1400+ person engineering team. You will be part of a fast paced, high caliber team building the internal systems and infrastructure used to build the routing and switching products driving the industry's largest data center networks.

Arista’s Software Engineering team runs at a scale rarely found - TBs of source control, 60GB work trees with 1000s of developer branches in flight at any given time, over 400K daily build/test jobs and over 150 homegrown and cloud native services running on a 100 node on-prem bare metal Kubernetes cluster.  Operating these systems takes vigilance, responsiveness to alerts, and a steady stream of updates and bug fixes to keep things running smoothly and efficiently as well as to increase our ability to monitor, understand and visualize them. The role will cover all aspects of our Kubernetes infrastructure, and may include monitoring, responding to, and enhancing alerts, working to unify and standardize our alerts, fine tuning code for scalability and performance, debugging problems, simplifying and securing developer experience with k8s etc. You will own your projects from definition to deployment, developer and vendor interactions, and you will be responsible for the quality of everything you deliver.

What You’ll Do

Working in the Engineering Productivity (EngProd) group, you will collaborate and work with other engineers to design, build, scale, and operate the systems that the rest of Arista’s development teams use.  The EngProd team uses industry-standard systems like Ansible, Jenkins, Kubernetes, Grafana, Spinnaker, MySQL, ElasticSearch, Google Cloud, and Varnish and also internal systems that we’ve built from the ground-up to automate CI/CD, testing, analysis, and visualization.

Responsibilities

  • Work with existing k8s admin team to own different aspects of managing a production k8s cluster (eg: upgrades, monitoring, capacity planning, security, developer experience etc)
  • Proactively monitor, respond to, and enhance alerts and set up automated alert handling where applicable
  • Create and maintain the incident response runbooks working with the service dev teams
  • Debug and resolve issues impacting developer user experience and infrastructure stability around the k8s platform
  • Adopt current best practices in k8s cluster management. Evaluate and adopt OSS projects that simplify k8s cluster management. 
  • Set up guidelines and paved paths for service dev teams improving developer experience around the k8s platform.
  • Work with Arista’s software engineers to identify bottlenecks and limitations in our workflows, tooling, and infrastructure around k8s and provide fixes for those problems.
  • Engage with 3rd party vendor support as part of triage

Qualifications

  • At least BSc Computer Science or Engineering + 8 years’ experience, MS Computer Science or Engineering + 6 years’ experience, or Ph.D. in Computer Science or equivalent work experience.
  • Knowledge of one or more of Go, Python, Javascript. Experience with shell Scripting to be able to implement medium complexity automation workflows.
  • Knowledge of Linux (or UNIX).
  • Experience in operating software systems at scale.
  • Strong understanding of the fundamentals of storage and networking.
  • Comfortable with Ansible and GitOps.
  • Strong expertise with managing on-prem/baremetal Kubernetes clusters.
  • Applied understanding of software engineering principles.
  • Strong problem solving and software troubleshooting skills.
  • Ability to design a solution and implement features independently. Ability to work in small teams.
  • Comfortable with security principles and able to study source code of OSS projects, conduct experiments as necessary to debug issues.
  • Proven expertise with debugging complex issues that span the technology stack.
  • Experience dealing with network proxies and containerized storage.

Compensation Information

The new hire base pay for this role has a pay range of CAD 120,000 to 160,000.

Arista offers different pay ranges based on work location, so that we can offer consistent and competitive pay appropriate to the market. The actual base pay offered will be based on a wide range of factors, including skills, qualifications, relevant experience, and work location.

The pay range provided reflects base pay only and in addition certain roles may also be eligible for discretionary Arista bonuses and equity. Employees in Sales roles are eligible to participate in Arista’s Sales Incentive Plan, which pays commissions calculated as a percentage of eligible sales. Employees are also entitled to benefits including medical, dental, vision, wellbeing, income protection and a Group Retirement Savings Plan. The recruiting team can share more details during the hiring process specific to the role and location. 

#LI-SP1

Apply for this job

5d

Systems Reliability Engineer SRE, Edge Platform

CloudflareHybrid or Remote
sqlDesignansiblec++dockerpostgresqllinuxpython

Cloudflare is hiring a Remote Systems Reliability Engineer SRE, Edge Platform

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

About the Role

We are looking for talented Systems Reliability Engineers to build and operate our Edge platform running in more than 320 cities in over 120 countries. Our SREs come from diverse technical backgrounds and have built up their knowledge working in different environments, but common factors across all of our reliability-focused engineers include a passion for automation, scalability, and operational excellence. We support our services in a “follow the sun” model with offices in East Asia, Europe and North America.

This is a superb opportunity to join a high-performing team and scale our high-growth network as Cloudflare’s business grows. We live at the boundary between systems, network, and software, and love improving the glue that holds them together. Working with us, you will build tools to constantly improve service availability, performance, and operational velocity. You will nurture a passion for an “automate everything” approach that makes systems failure resistant and ready to scale.

SREs focus on the immediate state and functionality of the Cloudflare platform around the world, leveraging an array of monitoring, alerting and diagnostics tools while developing and enhancing the Cloudflare platform and its capabilities. We own a wide portfolio of applications and services, running a tight feedback loop of developer and operator patterns. The ideal SRE candidate has a passionate curiosity about how the Internet fundamentally works and has a strong knowledge of networking, Linux and TLS along with coding ability in Go or Python.

Requisite Skills

  • Aptitude for identifying problems, owning them and working with others to solve them
  • Linux systems experience
  • 3 years experience in an SRE role or a role with similar functions
  • Software development skills in some programming language such as Go or Python
  • Understanding of distributed software systems and large scale system design tradeoffs
  • Intermediate experience of common network protocols like DNS and HTTP

Examples of desirable skills, knowledge and experience

  • Experience with the Linux kernel and Linux software packaging
  • Performance analysis and debugging
  • Configuration management systems such as Saltstack, Chef, Puppet or Ansible
  • Load balancing and reverse proxies such as Nginx, Varnish, HAProxy, Squid or Apache
  • SQL databases
  • Time series databases such as OpenTSDB, Graphite, Prometheus or Grafana
  • Key/Value stores
  • Internetworking and BGP

Bonus Points

  • Experience with continuous / rapid release engineering
  • Strong tooling and automation development experience
  • Experience working in a 24/7/365 service environment
  • Experience working with large scale production distributed systems
  • A history of contributing to Open Source Software

Some tools that we use

  • Nginx
  • PostgreSQL
  • Docker
  • Prometheus
  • Grafana
  • Consul
  • Nomad
  • Temporal
  • Salt

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer.  We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness.  All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

See more jobs at Cloudflare

Apply for this job

6d

MLOps Engineer

NielsenIQChennai, India, Remote
agileBachelor's degreejiraterraformDesignansibleazuregitdockerkuberneteslinuxpythonAWS

NielsenIQ is hiring a Remote MLOps Engineer

Job Description

Job Purpose  

Analyzing, designing, developing and managing the infrastructure to release scalable Data Science models. The MLOps Engineer is expected to deploy, monitor and operate production grade AI systems in a scalable, automated and repeatable way.  

Job Responsibilities 

  • Create and maintain a scalable infrastructure to deliver AI/ML processes, responding to the user requests in near real time. 
  • Design and implement the pipelines for build and deployment. 
  • Write infrastructure as code. 
  • Design dashboards to monitor a system. 
  • Collect metrics and create alerts based on them. 
  • Design and execute performance tests. 
  • Perform feasibility studies/analysis with a critical point of view.   
  • Support and maintain (troubleshoot issues with data and applications). 
  • Develop technical documentation for applications, including diagrams and manuals. 
  • Working on many different software challenges always and ensuring a combination of simplicity and maintainability within the code.  
  • Contribute to architectural designs of large complexity and size, potentially involving several distinct software components.  
  • Working closely with data scientists and a variety of end-users (across diverse cultures) to ensure technical compatibility and user satisfaction. 
  • Work as a member of a team, encouraging team building, motivation and cultivating effective team

Qualifications

Role Requirements  

E=essential
P=preferred 

  • P - bachelor's degree in computer science or related field 
  • P - master's degree in data engineering or related 
  • E - Demonstrated experience and knowledge in Linux and Docker containers 
  • E - Demonstrated experience and knowledge in some of the main cloud providers (Azure, GCP or AWS) 
  • P - Demonstrated experience and knowledge in distributed systems 
  • E - Proficient in programming languages: Python 
  • P - Experience designing and implementing CICD pipelines for automation. 
  • P - Experience designing monitoring dashboards (Grafana or similar) 
  • P - Experience with container orchestrators (Kubernetes, Docker Swarm) 
  • P - Experience with IaC tools (Terraform, Ansible) 
  • E - Experience as software engineer 
  • E - Experience in the use of collaborative developing tools such as Git, Confluence, Jira, etc.   
  • E - Problem-solving capabilities.   
  • E - Strong ability to analyze and synthesize.  (Good analytical and logical thinking capability) 
  • E - Proactive attitude, resolutive, used to work in a team and manage deadlines.   
  • E - Ability to learn quickly 
  • E - Agile methodologies development (SCRUM/KANBAN). 
  • E - Minimal work experience of 2 years with evidence.  
  • E - Ability to keep fluid communication written and oral in English, both written and spoken 

See more jobs at NielsenIQ

Apply for this job

6d

Solution Engineer_Images (OTC) - REF2627T

Deutsche Telekom IT SolutionsBudapest, Debrecen, Szeged, Pécs, Hungary, Remote
DevOPSagilejiraansibleopenstacklinuxpython

Deutsche Telekom IT Solutions is hiring a Remote Solution Engineer_Images (OTC) - REF2627T

Job Description

Company Description

The largest ICT employer in Hungary, Deutsche Telekom IT Solutions (formerly IT-Services Hungary, ITSH) is a subsidiary of the Deutsche Telekom Group. Established in 2006, the company provides a wide portfolio of IT and telecommunications services with more than 5000 employees. ITSH was awarded with the Best in Educational Cooperation prize by HIPA in 2019, acknowledged as one of the most attractive workplaces by PwC Hungary’s independent survey in 2021 and rewarded with the title of the Most Ethical Multinational Company in 2019. The company continuously develops its four sites in Budapest, Debrecen, Pécs and Szeged and is looking for skilled IT professionals to join its team.

 

Job Description

The Public Cloud Portfolio Unit operates on a national and international level, for medium-sized and large companies. We develop, market and operate agile, cloud-native, forward-looking products and services for the digital world. We see ourselves as innovation drivers and make our customers' business fit for the digital future. Our mission: Together with our customer, shaping the safest, easiest and most efficient transformation to a digitized and cloud-native future.

 

Your Department

We run Open Telekom Cloud! Open Telekom Cloud is a public cloud standard product based on open source community software and driven by principles of DevSecOps. Lean structures, agile methods, highly motivated teams and an extremely dynamic business environment determine our actions. With this customer-oriented and agile orientation, we are the anchor point for the Public Cloud business in Deutsche Telekom Group.

We are measured by delivering a secure, stable and innovative platform. We work jointly with our platform partner and other partners out of the OpenStack ecosystem to create a highly innovative public cloud product based on European security and data protection standards.

We are looking for people who are professionals and evangelists with a great deal of enthusiasm for cloud technology and who are up to the challenges created by the development and operation of a hyper-scale public cloud.

We offer a unique insight into how a large public cloud works under the hood, intercultural teamwork, flat hierarchies, and an independent working-style.

Your Tasks

As "Solution Engineer OTC" you understand the latest developments in cloud and container technology. You will operate and enhance our Open Telekom Cloud platform in a customer-oriented manner.

Do you like?

  • Solve complex problems in the daily operation of a hyper-scaler's cloud backend.
  • Consistently automate with common automation frameworks.
  • Work in a team of specialists where everyone helps each other in an open and trusting manner.
  • Work in proactive and agile way
  • Participate / coordinate in process oriented manner of the daily activities, incoming customer requests

Qualifications

Your Profile

  • Completed studies in a technical, engineering or scientific subject or comparable professional training.
  • 3-5 years of professional experience in IT (with focus on modern cloud technologies).
  • In-depth knowledge of Linux (e.g. networking, logging concepts), system tools (sudo, SSH) and network-related services (e.g. LDAP, NTP), Linux/Unix command line.
  • System technologies (Linux, KVM, Linux network and storage, system tools) as well as OpenStack
  • Strong Linux Administrations knowledge (e.g. RHCSA or. LFCS)
  • Experienced with Ansible
  • Shell scripting advanced experience
  • Python beginner level
  • Experience in OpenSource projects and community’s requirements.
  • Agile tools (e.g. GitHub, JIRA, Confluence) and methodologies (DevOps, Gitlab CI/CD)
  • High level of customer focus.
  • Ability to assess technical solutions and come up with creative approaches.
  • Fluency in written and spoken English

 

See more jobs at Deutsche Telekom IT Solutions

Apply for this job

6d

Senior Linux Systems Engineer - REF3519L

Deutsche Telekom IT SolutionsBudapest, Debrecen, Pécs, Hungary, Remote
Bachelor's degreeterraformDesignansibledockerlinuxpython

Deutsche Telekom IT Solutions is hiring a Remote Senior Linux Systems Engineer - REF3519L

Job Description

  • Provide 3rd level OS support for Linux machines (mainly Red Hat)
  • Design and develop tools and solutions for 2nd level OS admins or for other teams
  • Create reports
  • Work with other internal teams
  • Contact the OS vendor if necessary

Qualifications

  • 3+ years of proven experience as a Linux engineer
  • Extensive understanding of Linux operating systems (preferred: Red Hat)
  • Proficiency in programming languages like Bash, PowerCLI, Python
  • Experience with Veritas InfoScale HA solutions
  • Familiarity with configuration management and automation tools (preferred: Ansible, Ansible AWX).
  • Deep knowledge of virtualization technologies (VMware, KVM)
  • Understanding network protocols and services (e.g., TCP/IP, DNS, DHCP)
  • Knowledge of centralized authentication (LDAP)
  • Excellent problem-solving abilities and a detail-oriented approach
  • Capable of working both independently and collaboratively
  • Strong adaptability and flexibility to manage changing priorities and dynamic environments
  • At least intermediate English language knowledge
  • Willingness to participate in on-call

Nice to have skills and experience:

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Relevant certifications (such as RHCE).
  • Knowledge of containerization technologies (Docker, Podman)
  • Experience with various deployment methods (Kickstart ISO, Cobbler, Terraform)
  • German language knowledge is an advantage

See more jobs at Deutsche Telekom IT Solutions

Apply for this job

6d

Senior Solutions Engineer (Remote - Bogota, Colombia)

DynatraceBogotá, Colombia, Remote
SalesDevOPS5 years of experienceterraformansibleazurejavac++.netcsskuberneteslinuxAWSjavascriptNode.jsPHP

Dynatrace is hiring a Remote Senior Solutions Engineer (Remote - Bogota, Colombia)

Job Description

What’s the role?

As a Dynatrace Solution Engineer, you will be a key member of the Dynatrace sales engine and will be responsible for providing excellent technical support to the sales team. You will be the expert on Dynatrace and all facets of Advanced Observability. Within this exciting role, you will be responsible for executing great demos which demonstrate the Dynatrace unique approach in solving the customer’s pain, executing and managing POCs onsite and remote, building key relationships with Dynatrace’s customers and completing RFIs & RFPs. You will also work across teams including Dynatrace’s innovation labs, Dynatrace’s Expert Services consultants, CSMs and marketing.

About you:

To ensure your success as a Solution Engineer at Dynatrace, you need to be an ambitious, confident and self-motivated individual, with previous SE experience or another technical customer facing role. You need to be passionate about innovative technology, technical sales and articulating value to customers and prospects. In addition, we are also looking for:

  • An excellent team player, with the ability to work across all disciplines.
  • Excellent communication and presentation skills, with the ability to communicate technical value into business value
  • Previous 3 – 5 years of experience with observability or application performance management technologies and techniques
  • Ability to troubleshoot technical issues to produce a working outcome and be able to manage this process
  • Ability to manage a number of projects simultaneously, work with a number of different sales people and support other SEs where needed
  • Must have a strong desire to grow professionally, adapt to an ever-changing environment and are coachable
  • Must be able to travel up to 50% of the time

Responsibilities:

  • Evangelize Dynatrace’s product offerings during international trade shows and at key customer account meetings to promote new and expanded business
  • Partner with sales representatives to identify new sales opportunities as well as incremental sales opportunities within existing accounts
  • As part of the solution engineering team participate in proof of concept (PoC) creation and cloud architecture discussions, leading the technical solution evaluation portion in support of sales opportunities either directly or through channel partners for multiple POCs
  • Present Dynatrace’s vision to our customers C-suite executives
  • Provide technical guidance in the Discovery, Solution Evaluation, and Solution Proposal stages of the opportunity sales cycle
  • Present on-stage demonstrations providing insight and context to our customers during key marketing events. Either at Dynatrace sponsored industry events or partner sponsored events, ensure key demonstrations are delivered by you or a team member at demonstration booths
  • Gather, qualify and provide feedback from customers to Product Management to improve Dynatrace’s market share and meet the market needs
  • Build best practices and share knowledge the team to continuously develop and enhance both your personal and team capabilities
  • Work with local Sales and Sales Engineering leadership to identify learning/ development opportunities for you and the local team to maintain Dynatrace’s leadership position in the market
  • Create and modify Dynatrace template presentations, in order to attend the specific demands of each customer
  • Not only work with internal sales team, but also with partners, supporting their team in the customers and being a technical point of contact for them (trusted advisor/technical coach)

Position might be filled at a higher level based on candidate experience.

Qualifications

Minimum Requirements:

  • Bachelor’s degree in Computer Science or equivalent education or experience required
  • 3+  years of experience within the observability space

Preferred Requirements:

  • Experience with web technologies such as HTML, CSS, and JavaScript
  • Experience with programming / scripting side technologies such as Java, .NET, PHP, Go, Node.js and database
  • Advance knowledge of Operating Systems (OS) including Windows and Linux
  • Experience with DevOps or Site Reliability Engineering practices
  • Knowledge with cloud platforms, including AWS, Azure or GCP
  • Experience with modern technologies like containers, Kubernetes / OpenShift, Serverless functions, and CI/CD pipelines
  • Experience with automation like Ansible, Puppet, Terraform, etc.

See more jobs at Dynatrace

Apply for this job

6d

Senior Solutions Engineer

DynatraceDetroit, MI, Remote
DevOPSterraformDesignansibleazuredockerelasticsearchkuberneteslinuxjenkinsAWSbackendfrontend

Dynatrace is hiring a Remote Senior Solutions Engineer

Job Description

Job Description

Our growing Enterprise Solutions Architecture Team has formed out of a high demand in our Enterprise clients needing leadership to deploy the Dynatrace platform at extreme scale, many of the Senior Solutions Engineers design and develop valuable solutions either during our engagements or innovating significant ways to support the delivery of our engagements.

The solutions we build are many times shared and used by many of our growing list of enterprise accounts. As we continue to grow, solutions such as our regularly expanding online Elevate portal or custom solutions that help maximize the use of the Dynatrace platform for our clients often need continued development support and assistance as our user base and engagement offerings expand. The team consists of extremely strong technical talent, yet we are always exposed to new challenges for technical growth, daily communication with peers as well as customers.      

This Job is a full-time role within the ESA team to first assist customers in solutioning complex integrations, second, assisting the Enterprise Solutions Architect team with key delivery tasks, presentations, diagrams, data analytics or hands on integrations, and third to contribute to the adoption of the new Elevate portal that not only serves up the ESA content to our engagements but will also serve up and manage content for the rest of the Dynatrace Services Organization.

As a team member it is expected to be exposed to a vast number of technologies, teams and techniques throughout some of the top organizations in the world. Our team members find this job incredibly engaging, very demanding at times but extremely rewarding. Given the commitment to the role and your personal growth, there are career paths to becoming a Solution Architect and potential of Senior Solution Architect.

While the Senior Solutions Engineers on the team are expected to refine and grow their skills and responsibilities in development, DevOps, Business Intelligence, cloud, security and infrastructure, log analytics, and various other aspects of the Dynatrace Platform.

Qualifications

Minimum Requirements

  • Bachelor’s Degree required or equivalent experience accepted in lieu of degree
  • 3+ years of experience using Dynatrace or managing Observability solutions.

Preferred Requirements:

  • Prior experience with log analytics with products such as Splunk, ElasticSearch, Dynatrace, etc.
  • Ability to Develop automation and repeatable processes/scripts to enable solutions that deploy, manage, configure, scale and monitor Client applications
  • Experience in frontend and / or backend programing languages
  • 1+ industry certification (AWS / Azure / GCP / Kubernetes / Ansible etc..)
  • An Active Dynatrace Certification
  • Understanding of PaaS concepts and implementations such as Cloud Foundry, OpenShift, BlueMix or similar offerings
  • Strong skills in diagraming and articulating designs and solutions
  • Strong analytic, organization, presentation, customer service and facilitation skills.
  • Ability to gather customer requirements and translate those requirements into short- and long-term deliverables while working with Project Managers and Directors
  • Comfortable with Software Development Life Cycles, Test Driven Development, Continuous Integration and Continuous Delivery/Deployment
  • A general understanding of a variety of Cloud technologies and offerings such as AWS, Azure or Google Cloud
  • Solid understanding of Network and Software Security Models, protocols, certificates, etc.
  • A general understanding of network topologies, routing, network security, security protocols, routing, load balancers and capacity planning
  • A general understanding of a Containers and Container orchestration products i.e. Kubernetes, OpenShift, docker
  • Outstanding communicator and writing skills with the ability to consult and lead multi-day meetings to assess technologies and processes
  • An understanding of deploying Application and/or Infrastructure Monitoring & Observability
  • Experience in both Linux and Windows OS
  • A general understanding of solutions using Chef, Puppet, Ansible, Terraform, Jenkins, and similar products

 

 

See more jobs at Dynatrace

Apply for this job

6d

Public Sector Solutions Engineer - SLED (Remote - Central US)

DynatraceDenver, CO, Remote
SalesDevOPSterraformansibleazurejava.netcsslinuxAWSNode.jsPHP

Dynatrace is hiring a Remote Public Sector Solutions Engineer - SLED (Remote - Central US)

Job Description

What’s the role?

Dynatrace is looking for a highly skilled and motivated Public Sector Solutions Engineer to support our rapidly growing customer base within State, Local, and Education (SLED) accounts. This is a unique opportunity to work with cutting-edge technologies in cloud, observability, and AI-driven application performance management within highly secured environments.

As a key member of our Public Sector team, you will collaborate closely with your account teams to deliver mission-focused solutions that enable SLED accounts to achieve superior application performance and observability.

Responsibilities

  • Serve as a technical lead and subject matter expert on Dynatrace’s platform, focusing on observability, cloud infrastructure, and application security within highly secure environments.
  • Own technical engagement with customers during the trial phase. Communicate Dynatrace’s value based on activities and work with customers on any identified issues or concerns to successful conclusion
  • Participate in proof of concept (PoC) creation and cloud architecture discussions, leading the technical solution evaluation in support of sales opportunities directly or through channel partners—especially in environments requiring security clearance.
  • Present Dynatrace’s vision to agency leaders.
  • Lead discovery workshops to determine customers' challenges and give product demonstrations to align our solution with customer needs
  • Act as a trusted advisor to agency customers, delivering technical presentations and demonstrations to highlight Dynatrace’s differentiated capabilities.
  • Technically close complex opportunities through advanced competitive knowledge, technical skill, and credibility
  • Proactively engage and communicate with customers and Dynatrace business/technical teams regarding product feedback and competitive landscape
  • Support marketing events including executive briefings, conferences, user groups, and trade shows

Position might be filled at a higher level based on candidate experience.

Qualifications

Minimum Requirements:

  • Bachelor’s degree in Computer Science or equivalent education/experience.
  • Minimum of 3+ years of experience in the observability or a related field within a public sector environment.

Preferred Requirements:

  • Experience with web technologies such as HTML, CSS, and JavaScript.
  • Experience with programming/scripting languages such as Java, .NET, PHP, Go, Node.js, and databases.
  • Advanced knowledge of Operating Systems (Windows and Linux).
  • Experience with DevOps or Site Reliability Engineering (SRE) practices.
  • Knowledge of cloud platforms including AWS, Azure, or GCP.
  • Experience with modern technologies like containers (Kubernetes/OpenShift), serverless functions, and CI/CD pipelines.
  • Experience with automation tools such as Ansible, Puppet, Terraform, etc.

See more jobs at Dynatrace

Apply for this job

8d

Senior Network Operations Engineer - Federal (Weekends)

ServiceNowSan Diego, California, Remote
SalesDevOPS5 years of experienceterraformDesignansibleazureiosjava.netlinuxpythonAWSNode.jsPHP

ServiceNow is hiring a Remote Senior Network Operations Engineer - Federal (Weekends)

Job Description

Please Note:

“This position requires passing a ServiceNow background screening, USFedPASS (US Federal Personnel Authorization Screening Standards). This includes a credit check, criminal/misdemeanor check and taking a drug test. Any employment is contingent upon passing the screening.  Due to Federal requirements, only US citizens, US naturalized citizens or US Permanent Residents, holding a green card, will be considered.

What you get to do in this role:

As a Federal Cloud Networking Operations Engineer you will help deliver 24x7 support for our Government Cloud infrastructure. 
This is a Weekend position from Thursday to Monday. The working hours are from 7:00 am - 4:00 pm Pacific Time.

What you get to do in this role:

  • Monitor network operations dashboards and respond to alerts and failures to troubleshoot networks to identify and resolve issues quickly.
  • Partner with project and program managers to meet overall timelines and resolution of issues.
  • Operate and troubleshoot networks to identify and resolve issues quickly.
  • Take a lead role in the engagement and mitigation of outage-causing events or issues.
  • Validate problem descriptions and perform detailed problem diagnosis; track and update problems in the trouble-ticketing system.
  • Perform non-critical investigations into functionality that is not working as desired.
  • Engage deeply in the sustainment function to proactively analyze network parameters such as capacity and availability to ensure issues are fixed before they cause an outage.
  • Review, consult and prepare for planned change introduction to production environment.
  • Partner with teams to plan and execute software code upgrades and device maintenance.
  • Partner with the Site Reliability Engineering (SRE) team to provide mentorship and input on operational process improvements.
  • Provide feedback to infrastructure architects on design issues or improvements and input into the design process for new initiatives.

Qualifications

To be successful in this role you have:

  • The candidate should have a solid foundation in networking including routing, switching, security and load balancing. 
  • 4+ years of experience with cloud computing technologies (e.g. Azure, AWS, Google Cloud Platform, etc.) across Windows and/or Linux
  • Azure Core Platform: Compute, Storage, Networking.
  • Azure Web Apps: developing, deploying, debugging and supporting web applications using .NET, Java, PHP, Python, Node.js etc. on Windows or Linux.
  • Continuous Integration/Continuous Deployment (CI/CD): using DevOps, Bit Bucket, GitHub.
  • Experience in one or more automation languages (PowerShell, shell scripts, Perl, Python, Ansible, Terraform) desired.
  • A minimum of 5 years of experience in working on Internet and data center networks.
  • Possess a solid understanding of and have experience with most of the following network technologies: BGP, OSPF, IS-IS, HSRP/VRRP, IPSEC, SNMP.
  • Deep, hands-on experience with TCP/IP protocols including capturing and analyzing traffic with Wireshark and/or other tools.
  • Familiarity with Cisco IOS and JunOS operating systems required.
  • F5 and Cisco ASA knowledge and experience strongly desired.
  • Experience with network monitoring applications such as EMC Watch4Net, Cacti, Splunk is a plus.
  • Ability to partner with peers who are globally distributed is a key part of this role.
  • Passion for customer experiences and focus on delivering high quality support.
  • Strong communication skills and empathy for customers.
  • Ability to learn new technology in a fast-paced environment.
  • Ability to deal with ambiguity.

GCS-23

For positions in California (outside of the Bay Area), we offer a base pay of $109,400 - $185,900, plus equity (when applicable), variable/incentive compensation and benefits. Sales positions generally offer a competitive On Target Earnings (OTE) incentive compensation structure. Please note that the base pay shown is a guideline, and individual total compensation will vary based on factors such as qualifications, skill level, competencies and work location. We also offer health plans, including flexible spending accounts, a 401(k) Plan with company match, ESPP, matching donations, a flexible time away plan and family leave programs (subject to eligibility requirements). Compensation is based on the geographic location in which the role is located, and is subject to change based on work location. For individuals who will be working in the Bay Area, there is a pay enhancement for positions located in that geographical area; please contact your recruiter for additional information.

See more jobs at ServiceNow

Apply for this job

8d

[PEG] Junior DevOps Engineer

Software MindChișinău, Moldova, Remote
DevOPSterraformansibleqagitjavadockerkuberneteslinuxjenkinsAWS

Software Mind is hiring a Remote [PEG] Junior DevOps Engineer

Job Description

Project – the aim you’ll have

We are seeking talented DevOps Engineers (Linux/Cloud) to deliver our platform components in a clean and consolidated build. This position within the DevOps team will be responsible for process workflow (monitoring and documentation), continuous integration with the code repository (Jenkins pipelines), configuration management (Ansible, Pipelines), and vendor management (cloud providers). The ideal candidate will be comfortable in a dynamic environment and possess excellent troubleshooting and organizational skills, along with the ability to deliver complete solutions for multiple product development pipelines.

Position - how you'll contribute

  • Perform daily system, application, or database updates via CI tools like Jenkins, Atlassian, Ansible, and Kubernetes.
  • Conduct Linux troubleshooting and continuous integration scripting.
  • Support the build process and assist in test and QA builds.
  • Manage the code repository and improve practices of branching and code merging.
  • Coordinate with team members prior to releases.
  • Document release steps and processes, and manage the software repository.
  • Assist in system administration and maintenance.

Qualifications

Expectations - the experience you need

  • At least 1 year of experience in a DevOps role.
  • Good knowledge of Linux servers, including Ubuntu.
  • Knowledge of TCP/IP networking, DNS, HTTP, load balancers, high availability architecture, and zero downtime production deployments.
  • Experience with Docker, Kubernetes, and Helm.
  • Experience working with development teams and writing process documentation.
  • Experience with continuous integration build systems (e.g., Bitbucket Pipelines, Jenkins).
  • Experience with code management software (e.g., Git).
  • Proficiency in shell scripting and basic Linux administration.

Additional skills - the edge you have

  • Experience with various Linux distributions.
  • Experience designing and building highly available distributed systems.
  • Understanding of Java/Python/Bash and Linux.
  • Java runtime inspection skills.
  • Familiarity with Grafana, Ansible, and ELK stack.
  • Experience with AWS and Kubernetes.
  • Experience with Ansible, Terraform, and configuration management.
  • Knowledge of Apache Kafka.

See more jobs at Software Mind

Apply for this job

8d

[CSR] DevOps Engineer

Software MindChișinău, Moldova, Remote
DevOPSterraformansibleqajavadockerkubernetesubuntulinuxpythonAWS

Software Mind is hiring a Remote [CSR] DevOps Engineer

Job Description

Project – the aim you’ll have

We are looking for experienced DevOps Engineers with expertise in Linux and Cloud technologies. As part of our DevOps team, you will focus on enhancing and optimizing platform components. Your role will involve managing workflows, including monitoring and documentation, while ensuring seamless continuous integration and delivery using GitLab CI/CD pipelines, Ansible, and Helmfile. You will also be responsible for managing cloud infrastructure on AWS and supporting container orchestration with Kubernetes. A strong background in troubleshooting, organizational skills, and delivering resilient solutions across development pipelines is essential.

Position - how you'll contribute

  • Implement and maintain system, application, and database updates using GitLab CI/CD, Kubernetes, Kafka (Strimzi/MSK), and Helmfile.
  • Troubleshoot Linux systems, develop CI scripts, and automate deployments.
  • Support the build process, collaborating with test and QA teams.
  • Manage code repositories and optimize branching and merging practices with Git.
  • Collaborate with development teams to streamline release processes.
  • Document release steps, processes, and repository management.
  • Provide system administration support and maintain Linux servers (Ubuntu, CentOS).

Qualifications

Expectations - the experience you need

To excel in this role, you should have:

  • Expertise in Linux server administration (Ubuntu, CentOS) and TCP/IP networking.
  • Experience with Docker, Kubernetes, and Helm.
  • Familiarity with AWS cloud services and infrastructure management.
  • Proven experience in continuous integration and delivery using GitLab CI/CD pipelines.
  • Proficiency in shell scripting, Bash, Python, and Linux administration.
  • Experience with code management tools like Git.
  • Strong troubleshooting skills and ability to thrive in fast-paced environments.

Additional skills - the edge you have

  • Experience managing Cassandra and Kafka (Strimzi/MSK) clusters.
  • Knowledge of Terraform for infrastructure automation.
  • Proficiency with Ansible for configuration management.
  • Understanding of highly available distributed systems and Java runtime inspection.

See more jobs at Software Mind

Apply for this job

10d

Azure DevOps

Ingenia AgencyMexico - Remote
DevOPSsqlansibleazuregitc++dockerkubernetes

Ingenia Agency is hiring a Remote Azure DevOps

Requisitos:
Ing. en Sistemas, computación o afín
2+ años de experiencia diseñando y administrando ambientes CI/CD
2+ años de experiencia en Azure cloud services
Experiencia usando frameworks .NET/C#
Conocimientos en SQL server
Conocimientos en ambientes de desarrollo Linux.
Experiencia en shell scripting, YAML, PowerShell.
Experiencia en monitoreo, alertas y ajuste de rendimiento
Experiencia en la implementación de aplicaciones para control de calidad, puesta en escena y producción
Experiencia con herramientas de gestión de configuración como Puppet, Chef y/o Ansible
Experiencia con Docker y Kubernetes
Conocimiento en gestión de código fuente y estrategias de Git

Actividades:
Responsable del diseño, implementación y gestión del pipeline CI/CD.
Responbable del diseño e implementación de la infraestructura de los diferentes proyectos.
Elaborar y actualizar los lineamientos de integración, despliegue continuo y divulgarlos en los equipos involucrados.
Monitorear que los procesos de desarrollo, pruebas y liberación cumplan los lineamientos y utilicen adecuadamente las herramientas para asegurar la automatización del proceso del ciclo de vida de las aplicaciones

See more jobs at Ingenia Agency

Apply for this job