What is the role?
As Cloud Ops Lead, you would take on a wide range of responsibilities that call for both deep technical expertise and team management skills.
Key Responsibilities
- Manage and maintain cloud infrastructure (preferably AWS/Azure) including networking (VPC, VPN, NAT, routing).
- Administer and monitor Kubernetes clusters (EKS/AKS), ensuring high availability and security.
- Implement infrastructure automation using Terraform or equivalent IaC tools.
- Configure and manage system observability (Prometheus, Grafana, Node Exporter, Kafka Exporter).
- Integrate security and quality gates into CI/CD workflows (SAST, DAST, code coverage).
- Ensure robust logging, alerting, and uptime monitoring across zones and services.
- Optimize infrastructure for cost, performance, and scalability.
- Enforce security best practices at infrastructure and network levels (TLS, firewalls, IAM).
Required Skills:
- 4–6 years of experience in cloud infrastructure management (AWS/Azure).
- Strong experience with Kubernetes (deployment, scaling, networking, RBAC).
- Proficiency with monitoring tools (Prometheus, Grafana) and Linux administration (Ubuntu or other distributions).
- Experience managing cloud-native components such as Kafka, MongoDB, ScyllaDB/Cassandra, and MySQL on RDS (8.4).
- Understanding of CI/CD pipelines and secure container management.
- Strong scripting skills in Bash and Python.
In Summary
Overall, you would provide the strategic direction, technical expertise, and leadership needed to keep the organization’s offerings reliable and successful.