Site Reliability Engineer

Company:  Gradient IT
Location: Ajax
Closing Date: 01-08-2024
Hours: Full Time
Type: Permanent
Job Requirements / Description

We are looking for a passionate Site Reliability Engineer with a deep-rooted foundation in DevSecOps and Open Source Technology. The engineer should be passionate about automation and building highly scalable and available services in the cloud. You will help lead a team of engineers to build tooling, automation, and support Spinnaker on behalf of our customers for our Managed and SaaS offerings; as well as company internal infrastructure and security.Additionally the engineer will be responsible for delivering delightful customer experiences while leveraging their sharp technical edge and background. The successful candidate will be a self-motivated learner who is able to pick up new technologies and skill sets at a rapid pace. This is a customer-facing role in a highly collaborative and fast-paced environment and therefore requires exceptional soft skills with the bandwidth to prioritize and juggle several customer issues at once.

?ompany offers

  • We live in a world today where software is your competitive advantage. Leveraging this advantage results in accelerating your time-to-market, maintaining stability, avoiding outages; always being reliable and available for your customers.
  • We believe deploying software safely and continuously at any scale is at the center of achieving your competitive advantage. It should be easy to understand, achievable, and effortless for all developers of the world. Write code. Package artifacts. Choose targets. Hit deploy — This is every developer’s dream deployment scenario.
  • A company makes this dream a reality by enabling development teams to confidently deploy their software every time; easily, reliably, safely, securely, and continuously.

Responsibilities

  • Develop software, tools, reusable modules, and scripts for deployments, monitoring, diagnostics, and self-healing services
  • Be responsible for maintaining the overall health, performance, and life cycle of multiple instances of Spinnaker
  • Deploy and maintain cloud services in AWS, GCP, and Azure to support Spinnaker
  • Automate operational workflows, cloud services management, configuration and change management
  • Triage and troubleshoot issues that arises from automated alerts, upgrades, or change activities
  • Partner with product, engineering, support, and technical account managers teams to ensure customer success
  • Participate in 24/7 on-call rotation- Maintain a track record of response time SLAs, CSAT, and Update Frequency KPIs- Be a team player above all else

What you should have

  • 5+ years of experience in a software engineering role
  • 2+ years of experience with Docker and Linux- Exceptional customer-facing soft skills and proficiency with documentation and digital communications
  • Experience with enterprise level application and infrastructure support
  • High level understanding of DevOps lifecycle and concepts, microservice architecture
  • Experience with cloud service providers (AWS and/or GCP)
  • 1+ years of experience with Kubernetes
  • Experience with infrastructure automation tools such as Terraform, Cloudformation, etc.

Will be a big plus

  • 3+ years of experience with cloud environment such AWS and/or GCP- Programming experience, ideally Java- Knowledge of or experience with security scanning and monitoring tools
  • Experience architecting and implementing highly available infrastructure and services- Experience with distributed systems and micro services
  • Experience with monitoring and alerting tools such as Prometheus, NewRelic, or Datadog
  • Experience with version control utilizing Git
  • Experience with CI/CD including Jenkins Pipelines, Spinnaker, Argo, CircleCI or similar tooling

Java, AWS, Clean Code and SOLID principles

#J-18808-Ljbffr
Apply Now
Share this job
Gradient IT
  • Similar Jobs

  • Distribution Engineer - Standards & Reliability

    Ajax
    View Job
  • Plant Reliability and Continuous Improvement Manager

    Ajax
    View Job
  • FP&R Site Leader

    Ajax
    View Job
  • Project Manager - OPG Site

    Pickering
    View Job
  • Site Procurement Manager - Ajax, ON

    Ajax
    View Job
An unhandled exception has occurred. See browser dev tools for details. Reload 🗙