Shift Assignment: Mid Shift
The Site Reliability Engineer (Systems Engineer) works In the Infrastructure Operations team and closely Interacts with development, cloud platform engineering and architecture teams to run and operate cloud-native, DevOps delivered, highly scalable and fault tolerant systems and services. The SRE Engineer ensures that internally and externally facing systems meet and exceed ambitious SLOs (Service Level Objectives) and are continuously improved. This includes deployments and monitoring of all systems through established tools. The incumbent is an intellectually curious passionate technologist up to date with technology trends in the industry, equally relishing challenges to optimize existing systems and services, and challenges to introduce modern technologies. They will execute, monitor, and assist in the continuous improvement of the operational running of our services. The incumbent will execute operations and engineering projects and provide support for and administration of all DevSecOps tools, cloud automation, critical business applications and platforms, and associated systems to build and run software in the cloud.
- Provides guidance to engineering and operations teams on enabling and managing end to end availability and performance of mission critical services.
- Executes established automation for
- Participates in incident response, root cause/postmortem analysis, and drives production improvements for key issues that result in business opportunity for Arch.
- Works on building configuration automation to eliminate waste and manual/repetitive tasks, prevent problem recurrence, and to respond to various service alerts and condition metrics.
- Supports Kubernetes deployments in Azure, including cluster management, upgrades, security policies, application deployment with Argo CD using git and harness pipelines to deploy in collaboration with engineering and development teams.
- Supports DevSecOps tools, pipeline, automation, and infrastructure-as-code scripts and continuously monitors the industry landscape to improve and expand to further speed to market and delivery team autonomy.
- Knows security is job one and includes security in all plans.
- Implements, executes and manages Configuration as Code automation
- Executes tasks aligned to projects in the operations space (upgrades, rollouts of technology, )
- Promotes a continuous improvement, innovation, and collaboration culture across
- Builds and strengthens relationships and partnerships with corporate infrastructure leads and
- Perform other operational duties as required.