SITE RELIABILITY ENGINEER
# 8327
Our client is looking for a highly motivated and talented site reliability
engineer to participate in the deployment of the most advanced solutions in
the big data space. The site reliability engineer will actively participate
and collaborate with the agile development team to deploy, validate and
maintain large production clusters.
Responsibilities:
- Automate and document frequent deployment of agile builds to large
clusters
- Monitor, analyze and improve performance and operations of big data
clusters
- Perform second/third level support of production systems
Requirements:
- Three years of experience as an operations engineer
- Strong Linux server understanding and troubleshooting capabilities
(Centos)
- Strong scripting capabilities (Bash, Python)
- Good experience with Git
- Holder of a Degree in Computer Science or Engineering
- Experience in Cloud and non-Cloud based Hadoop ecosystem
- Strong Linux and shell scripting experience
- Strong automation experience using Ansible
- Extensive experience in public cloud infrastructure (Amazon, Azure)
- Good knowledge of SQL
APPLY FOR JOB #8327