Consulpro, Montréal Hi-tech Career Opportunities, Carrières en Haute Technologie

Back to Recent Jobs

New Search

<< previous job | next job >>

Location : Montreal

APPLY FOR JOB #4118

LEAD SITE RELIABILITY ENGINEER


# 4118


Our client is looking for a highly motivated and talented lead site reliability
engineer to participate in the deployment of the most advanced solutions in
the big data space. The lead site reliability engineer will actively participate
and collaborate with the agile development team to deploy, validate and
maintain large production clusters.


Responsibilities:


- Manage day to day tasks and priorities of a team of SREs
- Mentor team to focus on automation and continuous improvement
- Automate and document frequent deployment of agile builds to large
clusters
- Monitor, analyze and improve performance and operations of big data
clusters
- Perform second/third level support of production systems


Requirements:


- Two years of experience managing a technical team
- Six years of experience as an operations engineer
- Strong Linux server understanding and troubleshooting capabilities
(Centos)
- Strong scripting capabilities (Bash, Python)
- Good experience with Git
- Holder of a Degree in Computer Science or Engineering
- Experience in Cloud and non-Cloud based Hadoop ecosystem
- Strong Linux and shell scripting experience
- Strong automation experience using Ansible
- Extensive experience in public cloud infrastructure (Amazon, Azure)
- Good knowledge of SQL


APPLY FOR JOB #4118
8