**Introduction**
At IBM, work is more than a job - it's a calling: To build.
To design.
To code.
To consult.
To think along with clients and sell.
To make markets.
To invent.
To collaborate.
Not just to do something better, but to attempt things you've never thought possible.
Are you ready to lead in this new era of technology and solve some of the world's most challenging problems?
If so, let's talk.
**Your Role and Responsibilities**
Are you passionate about technology?
Do you love building new things?
Do you want to develop the future of IBM's Cloud offerings?
If you answered YES, then we have the right opportunity for you!
The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades.
At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud.
With industry leadership in analytics, security, commerce, and cognitive computing, and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.
We are looking for a dynamic Site Reliability and Automation Engineer, responsive to market needs, to join our Cloud Operations Team and deliver value to our clients in a fast-changing cloud landscape.
The Cloud team is dedicated to ensuring the IBM Cloud is at the forefront of cloud technology, from data center design to network architecture to storage and compute clusters to flexible infrastructure services.
We are building and operating IBM's VMware Solutions cloud platform to deliver performance and predictability for our customers' most demanding workloads, at global scale and with leadership efficiency, resiliency and security.
It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.
In this Site Reliability and Automation Engineer role, you will work closely with the Data Center, the entire Cloud development organization and IBM vendors to support, maintain and operationally improve the cloud infrastructure.
Your focus will be the following key responsibilities:
- Support and Operate Cloud Service delivery
- Automate health monitoring of the production and test systems
- Automate return to service procedures for Cloud Service delivery
- Support the compliance and security integrity of the environment through your work
- Partner with other teams, functional managers and program managers to deliver mission-critical services to the market
- Support development of new and existing capabilities for our compute, storage and network services
- Integrate automation with operational requirements
Work with Engineering and Development to:
- Define operational requirements
- Automate operational requirements
- Provide initial assessments of, and possible workarounds for, production issues
- Troubleshoot and resolve production issues
Work with Support and Infrastructure to:
- Identify and resolve complex issues
- Discuss and plan integration tasks
Qualifications:
- Excellent written and verbal communication skills
- Comfortable operating in a fast-paced environment
**Required Technical and Professional Expertise**
- 2-3 years of experience in data center infrastructure, engineering and support
- Minimum of 2 years' hands-on experience administering large production virtual system environments using VMware vSphere and VMware vCenter
- Experience with VMware NSX, vRealize Operations Manager, and vRealize Network Insight
- Experience in establishing, following, and improving operational procedures within a mission-critical environment
- Experience in IT Change, Incident, Problem, and Asset management
- Must be proficient in writing, debugging and maintaining scripts (Bash, Python, PowerShell)
- Ability to do low-level debugging and problem analysis by examining logs and running Unix commands
- 2-3 years of experience with open-source products
- Hands-on knowledge of vRealize Log Insight or LogDNA
- Excellent written and verbal communication skills
**Preferred Technical and Professional Expertise**
- Experience in maintaining cloud-based solutions with VMware vCloud Director
- Experience with Veeam Backup
- Experience with replication/failover using Zerto Platform, VMware vCloud Availability or Veeam Cloud Connect
- (Extensive) experience with scripting languages, such as Bash, PowerShell and Python
- Working knowledge of SQL (PostgreSQL, MSSQL) and Cloudant
- Working knowledge of networking, subnetting and storage technologies
- Working knowledge of ServiceNow, JIRA, Confluence, and GitHub
**About Business Unit**
Digitization is accelerating the ongoing evolution of business, and clouds - public, private, and hybrid - enable companies to extend their existing infrastructure and integrate across systems.
IBM Cloud provides the security, control, and visibility that our