Hotline: 0123-456-789

DevOps Manager

About the Role

The DevOps Manager is responsible for leading and managing the DevOps team, driving the adoption of DevOps practices, and ensuring the seamless integration of development and operations processes. This role involves strategic planning, team leadership, and hands-on involvement in implementing and optimizing continuous integration, continuous delivery, and infrastructure as code practices.

Essential Responsibilities

Team Leadership:

  • Lead and manage the DevOps team, providing guidance, mentorship, and professional development opportunities.
  • Foster a culture of collaboration, innovation, and continuous improvement within the team.

DevOps Strategy:

  • Develop and implement the organization’s DevOps strategy, ensuring alignment with overall business objectives.
  • Stay current with industry trends and emerging technologies to enhance the DevOps practice.

Continuous Integration and Continuous Delivery (CI/CD):

  • Design, implement, and optimize CI/CD pipelines to enable efficient and automated software delivery.
  • Ensure the timely and high-quality release of software to production environments.

Infrastructure as Code:

  • Implement and manage Infrastructure as Code practices using tools such as Terraform, Ansible, or similar.
  • Ensure that infrastructure provisioning and configuration are automated, versioned, and scalable.

Cloud Management:

  • Oversee the organization’s cloud strategy, including the management of resources on platforms such as AWS, Azure, or Google Cloud.
  • Optimize cloud infrastructure for performance, cost, and scalability.

Monitoring and Alerting:

  • Implement and manage monitoring and alerting systems to proactively identify and address operational issues.
  • Define key performance indicators and ensure systems meet or exceed defined SLAs.

Security and Compliance:

  • Collaborate with security teams to integrate security practices into the DevOps pipeline.
  • Implement and enforce security policies and compliance standards.

Collaboration and Communication:

  • Facilitate communication and collaboration between development, operations, and other stakeholders.
  • Act as a liaison between technical and non-technical teams to ensure alignment on DevOps goals.

Incident Response and Recovery:

  • Develop and maintain incident response plans and participate in incident response activities.
  • Conduct post-incident reviews and implement improvements to prevent future incidents.

Budget Management:

  • Manage the DevOps budget, including tools, resources, and training.
  • Identify cost-saving opportunities without compromising performance or security.

 

Education

  • BS or MS in Computer Science or equivalent industry experience

 

Experience

  • At least 12 to 15 years in a Site Reliability Engineering, DevOps, SecOps or Infrastructure focused role
  • Experience supporting internet-facing production services and distributed systems
  • Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana or Prometheus
  • Coding experience using a high-level programming languages like: Java, or Python
  • Automation advocate – you truly believe in removing operational load via software
  • Strong leadership and team management skills.
  • Experience managing, scaling, and troubleshooting Java applications
  • Familiarity with cloud infrastructure concepts (zones, regions, VPCs, etc.)
  • An understanding of a variety of software service deployment packaging, strategies, and tooling
  • Working understanding of common authentication schemes, certificates, and securely managing secrets
  • Capable of designing and implementing automated configuration management processes for repeatable and consistent service deployment
  • Knowledge of cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).

 

Knowledge, Ability & Skills

  • Prior experience as an SRE, software engineer, DevOps Engineer, or system administrator
  • Experience in system automation technology, such as Ansible
  • Relevant certifications (e.g., AWS/GCP/Azure Certified DevOps Engineer, Docker Certified) are a plus.
  • Understanding of security practices and compliance standards.

More Information

Leave your thoughts

Share this job

We strive to simplify the job search process, making it more efficient and effective for candidates, while helping companies find the right talent to drive their growth and success.

Contact Us

JobMonster Inc.
54/29 West 21st Street, New York, 10010, USA
[email protected]
http://jobmonster.com