Senior DevOps
About the job
We are seeking engineers to join our DevOps team. In this role, you will deliver high-quality software products and services using a continuous integration and continuous delivery (CI/CD) approach.
Primary Responsibilities
- Design, build, test, and deploy software changes using various tools and technologies
- Collaborate closely with product developers and other stakeholders to ensure that software solutions meet functional and non-functional requirements, such as availability, performance, security, and scalability
- Drive high reliability and performance into systems by implementing automation, monitoring, and feedback loops
- Build and maintain proprietary or open-source tools to improve software delivery and incident management
- Implement best practices for code management, version control, testing, and documentation
- Ensure that DevOps engineers and software developers follow the incident response process and document root cause analyses and action items
- Ensure that error budgets are maintained and SLOs are met
- Develop and support processes to ensure uptime, SLA compliance, and availability of critical platform components
Required Skills/Experience
- Bachelor’s Degree in Computer Science, Information Science, Engineering, or related/relevant field
- 3+ years working in a DevOps or Site Reliability Engineering role
- Experience with monitoring tools (Datadog, Sumo Logic, Grafana, or New Relic preferred)
- Skilled in identifying and resolving bottlenecks, anomalies, and errors in software systems
- Passion for technology, a strong sense of curiosity, and a desire to seek out new ways to improve processes, automate everything, and learn anything
- Understanding of high availability and business continuity principles
- Strong collaboration skills when working with both internal and external teams
- Ability to work and communicate under intense pressure during outages and other critical project implementations
- Proven experience supporting a cloud production environment (AWS strongly preferred)
- Competency in one or more programming languages for building automation (Python strongly preferred)
- Knowledge of cloud deployment tools and methodologies (ideally Terraform, but will consider AWS CloudFormation, Ansible, etc.)
- Deep understanding of Kubernetes and Docker architecture and associated tools
- Experience with at least one configuration management solution (e.g., Chef, Ansible, AWS CodeDeploy)
- Some proficiency with both repository and pipeline tools (for example, Git and GitLab; Jenkins, Bamboo, Travis CI, and CircleCI)
- Any experience implementing and using application and infrastructure monitoring tools
- Solid troubleshooting skills
- Ability to take ownership and deliver solutions autonomously
Desired Experience
- Experience working on distributed systems, such as databases and distributed file systems, is a plus