Site Reliability Engineer II

Company:  Microsoft
Location: Vancouver
Closing Date: 17-10-2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description
Overview Are you an individual who loves to work on large-scale projects at one of the most exciting and diverse divisions within Microsoft? Are you looking for big, creative challenges that show immediate results since your customers are the product engineers for Office and M365? Do you want to be at the core of it all, acting as a force multiplier enabling groups of engineers to do their best work? If so, we have the perfect job for you! The ES365 (Engineering Systems 365) team owns the tools that make up the end-to-end developer experience in Office and M365 (Substrate) from source control and check-in experience to build, validation, and deployment automation, and we’re making big, bold changes – for the better! We’re making it easy to build and ship apps across platforms and endpoints, and we’re moving away from proprietary, internal-only tools onto “one Microsoft” investments, open source, and industry standard tools. This is an exciting time as we seek to re-invent productivity leveraging the power of AI universally. We are looking for Site Reliability Engineer II (SRE) to join ES365’s Infrastructure teams. The charters of these teams include the following (and more): Azure management & governance Business continuity Infrastructure as Code Network engineering Provisioning & service deployment Security & vulnerability management Systems State Management As one of our new SREs you’ll get to deliver novel solutions using a modern DevOps approach leveraging the full stack of technologies Microsoft has to offer to enable our organization to respond more effectively to evolving customer needs and market demands, all while reducing costs, eliminating duplicated work, and driving efficiencies through automation. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. Qualifications Required Qualifications 4+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Master's Degree in Computer Science, Information Technology, or related field. Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter. Preferred Qualifications 5+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration. Full-stack troubleshooting skills across network, application, hardware, management fabric, and distributed services layers. Experience documenting complex systems accurately to convey technical ideas across teams. Experience in implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for production services. Experience with one or more automation tools or frameworks (e.g., Terraform, ARM, Chef, Bicep) and scripting languages (e.g., Python, Bash, Powershell) or similar. Responsibilities You will participate in onboarding, code/design reviews, and regular meetings with the engineering teams that develop and manage those products. You will independently develop code or scripts that automate the performance of repetitive and easily scalable operations processes. You will design, develop, and maintain telemetry pipelines and monitoring tools that detail operations metrics. You will develop, test, troubleshoot, and implement changes to optimize code and improve products. You will respond to incidents during regular on-call rotations. You will author technical documentation for your tools and services. You will participate in post incident reviews to drive service improvements. Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work. Industry leading healthcare Educational resources Discounts on products and services Savings and investments Maternity and paternity leave Generous time away Giving programs Opportunities to network and connect #J-18808-Ljbffr
Apply Now
Share this job
Microsoft
  • Similar Jobs

  • Site Reliability Engineer

    Vancouver
    View Job
  • Site Reliability Engineer

    Vancouver
    View Job
  • Site Reliability Engineer

    Vancouver
    View Job
  • Site Reliability Engineer

    Vancouver
    View Job
  • Site Reliability Engineer

    Vancouver
    View Job
An unhandled exception has occurred. See browser dev tools for details. Reload 🗙