**Job Title:**
Sr. DevOps Engineer
**Job Grading Information:** (For HR use only)
Grade Assigned: 26
Date Approved: 5/12/2021
Job Code: 003709
FLSA Status: E
**Department:**
eCommerce
**Supervisor Job Title:**
DevSecOps Manager, eCommerce
**Direct Report Job Title(s):**
N/A
**Work Location:**
Columbus, OH
**Created By:**
Jon Herbst
**HR Approved By:**
M. Regan
**Job Summary**
**Essential Tasks**
Summarize up to 5 main tasks that this job performs and assign a percentage of time spent on each task.
Percent of time spent must add up to 100%.
**% of Time Spent**
**Essential Tasks**
15%
**Project Leader**
- Provide hands-on leadership of architecture design, working closely with developers and supporting operations functions within an Agile/Scrum environment.
- Gather requirements, create a plan and an estimated timeline, and execute the project.
- Lead the high-quality execution of software products against project plans and delivery commitments.
20%
**Engineering Strategy and Architecture**
- Take responsibility for the multi-channel software development lifecycle, including enhancements/modifications, system configuration, migrations/upgrades, and production support.
- Provide support and troubleshooting for all related systems and technologies.
40%
**Production and Operational Excellence**
- Handle production operations and support.
- Participate in troubleshooting efforts to find the root cause of incidents and recommend measures to prevent recurrence.
- Provide guidance and standards for website optimization techniques such as CDN, cloud, and web caching.
- Influence the business strategy across Engineering and all of EXPRESS by articulating key architecture, design or technology challenges and building understanding among executive decision makers.
- Resolve difficult technical issues, remove obstacles for teams and help all projects to move forward on schedule, budget, and meeting
15%
**Practice and Industry Leadership and Growth**
- Build and operate a high-performance, stable, and resilient eCommerce platform.
- Champion Site Reliability Engineering
- Work closely with our cross-functional IT and technology partners to ensure system interactions are top-notch and meet your standards.
- Align our technology with the business eCommerce strategy, including scope definition, cost estimations, resource allocation, business requirements, process design, technical specifications, data management, compliance, and testing.
**Other essential tasks may occur as directed by your supervisor**
**Job Requirements**
List the essential and preferred requirements for this job, including years of experience, education, certifications, skills, and abilities.
**Essential Technical Requirements**
- 5 years of experience in distributed system development (design and support of systems with scalability and disaster recovery robustness) to support compute use cases for business requirements.
- 3+ years of implementation and operations experience with production systems in public cloud environments (AWS or GCP preferred).
- Hands-on experience automating infrastructure operations and with modern best practices such as infrastructure-as-code, cross-region & multi-provider redundancy, and event correlation solutions.
- Proficient with containerization and cluster management technologies such as Docker and Kubernetes.
- Deep understanding of and hands-on experience with cloud-native deployment and monitoring tools and technologies, with expertise in areas such as Kubernetes, Helm charts, container-based deployment, service mesh, Prometheus, and Grafana.
- Motivated by a DevOps culture and Site Reliability Engineering concepts.
**Key Skills and Experiences**:
- 5 years of experience in operating systems (Windows, RedHat, CentOS, Amazon Linux), networking (Akamai, Nginx, Apache, AWS/GCP VPC), and/or software (Terraform, Bash, Sh) packages.
- 5 years of experience integrating monitoring, alerting and reporting tools (NewRelic, Akamai, Grafana, Elasticsearch, Prometheus) with existing and newly developed systems.
- 4 years of cloud engineering and development experience. Must have experience extending and supporting cloud-based systems using Terraform and AWS or GCP.
- 3 years of experience implementing and supporting microservices architecture using containers, with tools such as Docker, AWS ECS or GCP Compute/GKE & Rancher.
- 3 years of experience working with database systems such as AWS RDS, Oracle SQL, MongoDB & Elasticsearch.
- Experience designing and building CI/CD pipelines for code deployment using Travis, CodeBuild, Jenkins, and Bamboo.
- Must have solid experience with multiple Apache projects, including Web, HTTP Server, Tomcat, and Ant.
- Experience supporting open-source web and application services (Java, Ruby, PHP, Python, Perl).
- Experience with Bash, Perl, or other shell scripting required.
- Experience with Git fundamentals and Stash required.
- Experience with Level 1 support, monitoring of customer-facing systems and participate in 24/7