Design and implement scalable, secure, and reliable cloud infrastructure to support large-scale data platforms and applications on AWS, GCP, or Azure, leveraging Kubernetes, Docker, and auto-scaling clusters.
Develop and maintain CI/CD/CT/CE pipelines for application and data workflows (ETL/ELT, ML pipelines), ensuring automation, observability, and rollback capability.
Integrate security practices (DevSecOps) throughout the development and deployment lifecycle, including vulnerability scanning, dependency control, and secrets management.
Design and configure secure network topologies (VPC, Subnets, Firewalls, Private Link, VPN, etc.) aligned with cloud and organizational security best practices.
Implement Infrastructure as Code (IaC) using Terraform, Helm, and Ansible for reproducible, version-controlled, and auditable environments.
Monitor and optimize system, network, and pipeline performance, ensuring reliability, scalability, and cost efficiency.
Set up observability and alerting systems (e.g., Prometheus, Grafana, CloudWatch, ELK) for infrastructure health, application metrics, and data pipeline monitoring.
Drive internal product packaging and publishing to AWS Marketplace and other global cloud platforms, ensuring compliance with required architecture and security standards.
Collaborate cross-functionally with Data Engineers, Data Scientists, and Application Developers to enable seamless integration and deployment.
Document infrastructure, security controls, and deployment workflows, ensuring smooth operations and knowledge transfer.
Mentor junior engineers and promote best practices in cloud infrastructure, automation, and security.
Qualifications
Required
2+ years of experience in DevOps, Platform Engineering, or Cloud Deployment roles.
Hands-on experience with AWS services: ECS, EKS, EC2, S3, IAM, CloudFormation/Terraform, and Secrets Manager.
Deep understanding of Docker, container security, and image lifecycle management.
Strong knowledge of CI/CD automation (Bitbucket Pipelines, GitHub Actions, GitLab CI, Jenkins, etc.) and artifact management.
Experience designing secure deployment frameworks for both cloud and on-prem environments.
Experience with monitoring tools such as Prometheus, Grafana, the ELK Stack, and Cloud Monitoring.
Scripting proficiency in Python and Bash.
Familiarity with software licensing models and subscription validation mechanisms.