Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _
- SREs in our team take an engineering approach to building and running our Equifax Security production systems - we engineer solutions to operational problems. Our SREs are responsible for overall system operation and we use a breadth of tools and approaches to solve a broad set of problems. _
**What you'll do**:
- Engage in and improve the software development lifecycle - from inception and design, through development, deployment, operation and refinement.
- Influence and design infrastructure, architecture, standards and methods for large-scale systems.
- Support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews.
- Maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health.
- Automate system scalability and continually work to improve system resiliency, performance and efficiency.
- Remediate tasks within the corrective action plan via sustainable, preventative, and automated measures whenever possible.
- Practice sustainable incident response as part of an on-call rotation and through blameless postmortems
- Responsible for vulnerability and penetration testing remediation.
- On call rotational support (1 week a month)
**What experience you need**:
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required.
- 2+ years of experience **developing and/or administering software in public cloud**
- 2+ years experience in languages such as **Python, Bash, Java, Go JavaScript and/or node.js**
- 2+ years experience with cross-functional knowledge with systems, storage, networking, security and databases
- 2+ years experience with system administration, including automation and orchestration of **Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)**
- 2+ years experience with **CI/CD tooling and practices**
**What could set you apart**:
- Experience implementing CI/CD Pipelines with automation and orchestration of builds/deployments
- Experience in Jenkins Pipelines & Kubernetes Deployments
- Experience with Cloud Security Tools such as Twistlock, Qualys, Fortify, SentinelOne
- Experience with system administration, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, Salt Stack and/or containers (Docker, Kubernetes, etc.)