Lead the design, implementation, maintenance, and continue evolution of our Kubernetes clusters across multiple environments
Architect and maintain infrastructure as code using Terraform
Implement and maintain CI/CD pipelines for reliable, automated deployments
Deploy monitoring, logging, and alerting solutions for infrastructure and applications
Establish security best practices and compliance across cloud resources
Optimize cloud resource utilization for performance, reliability, and cost-efficiency
Design disaster recovery and high availability solutions for critical services
Collaborate with full-stack engineers and data engineers to support application deployment needs
Automate repetitive operational tasks and infrastructure management
Implement and maintain secrets management and access control systems
Provide technical guidance and mentorship to junior DevOps engineers
Create comprehensive documentation for infrastructure, deployment processes, and operational procedures
Lead incident response for production infrastructure issues
Required Skills & Experience
5+ years of experience in DevOps or Cloud Engineering with at least 2 years in a leadership role, with experience managing others in both project work and career progression
Top 3 non-nego tools: AWS, Terraform, Docker
Understanding of networking concepts and implementation in cloud environments
Strong expertise in Kubernetes administration, deployment, and troubleshooting
Experience implementing and managing CI/CD pipelines
Extensive experience with Terraform for infrastructure as code
Experience designing, implementing, or maintaining systems compliant with ISO 27001 and other security frameworks such as SOC 2, NIST 800-53, or HIPAA requirements
Deep knowledge of AWS services (EKS, IAM, S3, RDS, EC2)
Strong understanding of container technologies (Docker)
Proficiency in scripting languages (Python, Bash) for automation
Knowledge of monitoring, logging, and observability solutions
Experience with security best practices and compliance requirements
Strong problem-solving skills and attention to detail
Excellent communication skills in English (verbal and written) for explaining complex infrastructure concepts, troubleshooting, documentation, and technical specifications
Demonstrated proficiency with preferred development environment (Mac or Linux) for infrastructure management and DevOps workflows
Ability to collaborate synchronously with team members during core hours (10:00 AM - 6:00 PM US Eastern Time) while respecting work-life balance
Preferred Qualifications
Experience with GitOps methodologies and tools
Experience with authentication services like Auth0
Knowledge of database management and optimization in cloud environments
Familiarity with serverless architectures and technologies
Background in advertising technology or marketing analytics infrastructure
Knowledge of cost optimization strategies for cloud resources