Senior SRE Engineer

Moledao

Negotiable
Remote5-10 Yrs ExpBachelorFull-time
Share

Remote Details

Open CountryWorldwide

Language RequirementsEnglish

Job Description

Summary

We are seeking a Senior SRE Engineer (Wallet Operations Focus) to ensure the stability, availability, and performance of our core business infrastructure on AWS. The role involves managing global production environments, building scalable and highly available systems, implementing automation and observability platforms, and maintaining security and compliance standards.


Job Purpose

  • In charge of deployment
  • Ensures systems run reliably, efficiently, and at scale.
  • Builds tools to improve uptime, performance, and incident response.


Responsibilities

  • Ensure global infrastructure stability, availability, and performance on AWS for core business operations, taking ownership of production SLAs.
  • Design, operate, and troubleshoot cloud-native components such as Kubernetes, Envoy, Service Mesh (Istio/Linkerd), and Ingress controllers.
  • Improve operational efficiency through automation and platform tools (IaC, CI/CD), achieving system observability, self-healing, and fast recovery from incidents.
  • Implement and maintain operational security practices, including access control (AWS IAM/K8s RBAC), network security policies, vulnerability management, and incident response.
  • Build and enhance a global operations system, including capacity planning, monitoring and alerting (Prometheus/ELK), CI/CD pipelines (GitLab/Jenkins), disaster recovery, and automated fault recovery.
  • Understand business architecture deeply and participate in designing high-availability and disaster recovery solutions, with continuous cost optimization.


Qualifications

  • 5+ years of Linux operations, SRE, or DevOps experience, with expertise in managing large-scale distributed systems.
  • Proficient in AWS core services (EC2, S3, VPC, IAM, ELB, RDS, etc.) with architecture, operations, and cost optimization experience.
  • In-depth knowledge of Kubernetes architecture, including managing, troubleshooting, and performance tuning large-scale production clusters.
  • Familiarity with Envoy, Istio/Linkerd service mesh, or Nginx/Istio Ingress controllers for L7 traffic management.
  • Strong operational security awareness and practices, including common OS, network, and application security vulnerabilities and mitigation measures.
  • Proficient in at least one programming language (Go/Python/Shell) to implement automation solutions for operational challenges.
  • Strong experience with observability stacks such as Prometheus and ELK, capable of building efficient monitoring platforms.
  • Proven experience in capacity planning and performance testing, with the ability to quantify system bottlenecks and plan accordingly.


Preferred:

  • Experience managing SRE/tooling/platform teams.
  • Familiarity with observability stacks such as Prometheus, Grafana, and ELK.
  • Professional certifications such as AWS (SAA/SAP), Kubernetes (CKA/CKE/CKS) are a plus


Preview

Dorothy Mole

HR OfficerMoledao

Active today

Posted on 23 December 2025

Moledao

<50 Employees

DAOs

View jobs hiring

Report this job

Bossjob Safety Reminder

If the position requires you to work overseas, please be vigilant and beware of fraud.

If you encounter an employer who has the following actions during your job search, please report it immediately

  • withholds your ID,
  • requires you to provide a guarantee or collects property,
  • forces you to invest or raise funds,
  • collects illicit benefits,
  • or other illegal situations.
Tips
×

Some of our features may not work properly on your device.

If you are using a mobile device, please use a desktop browser to access our website.

Or use our app: Download App