Design, implement, and maintain CI/CD pipelines to automate build, testing, and deployment processes, ensuring reliability and efficiency.
Manage, operate, and monitor cloud infrastructure environments (e.g., AWS, Azure, or GCP) to support high availability, scalability, and performance.
Collaborate closely with development teams to optimize application performance and streamline deployment workflows.
Continuously monitor system performance, proactively troubleshoot issues, and conduct root cause analysis to prevent recurrence.
Develop and maintain Infrastructure-as-Code (IaC) using tools such as Terraform or Ansible to ensure consistency and repeatability.
Ensure infrastructure and systems comply with security best practices, industry standards, and organizational policies.
Support incident response, recovery, and post-incident reviews to minimize system downtime and business impact.
Create and maintain comprehensive documentation for systems, deployment processes, and operational workflows.
Participate in on-call rotation as required to support system availability and operational continuity.
Job Qualification:
Bachelor's degree in Computer Science, Information Technology, or a related field.
35 years of professional experience in DevOps, infrastructure engineering, or system administration.
Hands-on experience with CI/CD tools such as Jenkins, GitLab CI, or GitHub Actions.
Strong proficiency in cloud platforms, including AWS, Azure, or GCP.
Solid understanding of containerization and orchestration technologies, including Docker and Kubernetes.
Strong scripting and automation skills using Bash, Python, or similar languages.
Experience with system monitoring and observability tools such as Prometheus, ELK, or Grafana.
Good understanding of networking concepts, system security principles, and backup and recovery strategies.
Strong communication skills, with the ability to collaborate effectively with both technical and non-technical stakeholders, including developers, operations teams, and business owners.