Mid-Level DevOps Engineer

Phase Labs

Phase Labs

Other Engineering Β· Full-time
Remote
USD 4k-4k / month
Posted on Aug 1, 2025

πŸ“‹ About Phase Labs

Phase Labs is a premier Solana-native team deeply engaged with the Solana ecosystem. We're known for exceptional validator expertise, ecosystem contributions, and community-first mentality. We are a remote-first team spread across the Americas.

Phase Labs builds critical infrastructure across three flagship products:

  • Netrunner (crypto tax software)
  • Aero (Solana stake pool operations)
  • Sentinel (institutional staking and validator operations platform)

πŸš€ Your Mission

  1. πŸ›  Product Infrastructure Ownership & SRE
    1. Design and build cloud environments for Netrunner, Aero, and Sentinel then operate them as an SRE (incident response, runbooks, post-mortems).
    2. Write scripts in Python, Bash, or Go to automate routine maintenance, scaling, and recovery tasks.
    3. Extend into tooling for validator and stake pool operations (monitoring hooks, data exporters, health checks)
  2. ⚑ CI/CD Optimization & Developer Experience
    1. Continuously audit and refine pipelines to speed up build-test-deploy cycles.
    2. Introduce improvements like parallel builds, cache strategies, quality gates, canary and blue-green deployments.
    3. Partner with engineering leads to identify pain points and deliver tailored DevOps solutions.
  3. πŸ’» Infrastructure as Code & Cloud Automation
    1. Define and maintain Terraform or Pulumi modules and CloudFormation templates.
    2. Enforce Git-driven IaC workflows with code reviews, versioned modules, and drift detection.
  4. 🐳 Containerization & Orchestration
    1. Package microservices in Docker containers and manage them with Kubernetes (EKS, GKE, or AKS) or Docker Swarm.
    2. Develop Helm charts or operators for consistent, scalable deployments.
  5. πŸ” Monitoring, Observability & Data Backbone
    1. Build and operate an observability stack with Grafana for products and the validator fleet.
    2. Define service level objectives (SLOs) and indicators (SLIs), set up alerts, and manage on-call rotations.
    3. Expose metrics, logs, and traces to support future validator data products.
  6. πŸ” Security, Compliance & Cost Management
    1. Integrate security scans into pipelines and apply IAM, network, and secrets management best practices.
    2. Optimize cloud costs via rightsizing, autoscaling policies, and budget alerts.
    3. Support SOC2 and ISO audits for Sentinel.
  7. πŸ“ Documentation & Collaboration
    1. Produce runbooks, architecture diagrams, and onboarding guides for each product and validator use case,
    2. Collaborate with product, engineering, and validator operations teams to roadmap infrastructure improvements.

🎯 Requirements

Experience:

  • 3 to 5 years in DevOps or site reliability roles owning and operating production systems end to end.

Cloud:

  • Advanced AWS experience with multi-account setups, VPC design, IAM policies, routing, NAT gateways, ALBs, and Route 53.
  • Comfortable with networking and security groups for multi-service deployments.
  • Implemented vertical and horizontal scaling of ECS services and databases based on CloudWatch metrics and target tracking policies.
  • Architected high-availability (HA) services with multi‑AZ VPCs, automatic failover, and backup strategies for critical workloads.

Containers:

  • Docker and Kubernetes (EKS, GKE, or AKS) or equivalent orchestration tool.
  • Experienced deploying multi-container applications with ECS Fargate and ECR.
  • Implemented service-to-service communication, load balancing, health checks, and autoscaling.

Databases:

  • Hands-on experience deploying, scaling, and managing PostgreSQL (we use TimescaleDB!) in containerized and cloud environments.
  • Experience with backup strategies, high availability, and connection management.
  • Familiar with ElastiCache (Redis) for caching and session management.

CI/CD:

  • Expertise in GitHub Actions.
  • Implemented secrets management via AWS Secrets Manager and parameter management via SSM.
  • Automated ephemeral branch testing environments with full stack ECS + DB spin-up and teardown.

IaC:

  • CDK, Terraform, Pulumi, or CloudFormation plus strong Git and IaC workflow discipline.

Scripting:

  • Proficient in Python, Bash, or Go for automation tasks.
  • Experienced with AWS CLI and SDKs for operational scripting.

Observability:

  • Experience with Grafana including SLO/SLI definition and on-call practices.
  • On-call familiarity with incident response and operational playbooks.
  • Extensive experience with AWS CloudWatch for logs, metrics, and alarms across ECS tasks, ALB, and RDS/TimescaleDB.

Soft Skills:

  • Excellent remote communication.
  • Proactive collaboration.
  • Focus on developer experience and operational excellence.

Education & Certs:

  • Degree in computer science or related field preferred (equivalent experience accepted).
  • AWS, Kubernetes, or Terraform certifications are a plus.

Language:

  • Fluency in English.

🌟 Nice to Have

  • Background in crypto tax, staking, validators, or other fintech products
  • Familiarity with serverless technologies (AWS Lambda or Google Cloud Functions) and event-driven architectures
Phase Labs is an equal opportunity employer.

Apply for this job

Drag and drop or click to upload.
Tell us why you are a good fit, add a cover letter or anything else you want to share.
To withdraw or update your application, email [email protected]