• Application deadline: 4 weeks

    Site Reliability Engineer

    O
    Offchain Labs

    FULL_TIMENA

    Job description

    Offchain Labs is seeking a Site Reliability Engineer to support and scale infrastructure powering the Arbitrum ecosystem. The role focuses on maintaining Kubernetes-based deployments, automating infrastructure using declarative tooling such as Terraform, and working within GitOps workflows using tools like ArgoCD. The candidate will diagnose complex reliability issues across distributed systems, contribute to CI/CD improvement, and participate in on-call rotation and post-incident reviews.

    The ideal applicant brings a strong Linux foundation, familiarity with Python/Go, exposure to observability stacks such as Prometheus and Grafana, and experience operating cloud environments. This role fits someone who naturally digs deep into issues, treats infrastructure as code, and continuously drives reliability improvements in production blockchain-related environments.

    2️⃣ Responsibilities

    • Maintain & scale production Kubernetes clusters

    • Deploy & diagnose infrastructure using GitOps practices

    • Build Terraform-based infrastructure & automation

    • Implement CI/CD pipelines & deployment workflows

    • Operate observability stacks & incident investigation

    • Analyze reliability failures & drive root-cause fixes

    • Participate in on-call rotations

    • Apply security-first principles when designing systems

    3️⃣ Requirements

    • Hands-on experience with Kubernetes

    • Terraform or equivalent IaC exposure

    • Linux, shell scripting, and Python or Go familiarity

    • Experience operating on AWS/GCP/Azure

    • Observability tools (Prometheus, Grafana, Loki, etc.)

    • Experience debugging distributed system issues

    • Experience in on-call / incident response

    4️⃣ Nice to Have

    • GitOps experience (ArgoCD, ApplicationSets)

    • Experience with container registries & artifact pipelines

    • Familiarity with networking & storage stack internals

    • Security & threat-modeling fundamentals

    5️⃣ Benefits

    • Fully remote with optional NY office

    • Annual company-wide offsite + team meetups

    • Professional training & certification reimbursement

    • Medical/dental/vision coverage

    • US-only: 401k plan with company match

    • Home-office equipment subsidy

    • Wellness stipend

      🔖 Curated by ArtofBlockchain.club We source credible Web3 roles directly from official company career pages. 👉 More jobs & discussions at ArtofBlockchain.club

Home Channels Search Login Register