Site Reliability Engineer / Cloud Architect

100% Remote Lead Site Reliability Engineer / Cloud Architect

  • REMOTE
  • Durham, NC
  • $140,000 - $200,000
Easy Apply Now

A bit about us:

The Lead SRE / Cloud Architect is a critical role that will have a significant impact. Looking for someone who is excited about taking the ownership of improving the existing infrastructure, designing the future of our platform, and building a team to support their vision. Attention to detail and eagerness to learn new technologies and systems is critical to the success of this role.

Why join us?

  • Have you built out a cloud native software product for a software vendor? (Not simply hosted an app in the cloud)
  • Have you written "infrastructure as code" and used ARM templates?
  • Have you built out Kubernetes in production?
  • Do you have experience building out cloud site reliability, scalability, latency, monitoring, and alerting?
  • Do you have a developer background?

Job Details

Duties:
  • Define and help implement infrastructure improvements for our platform
  • Support & contribute improvements to the availability, scalability, latency, and efficiency of our platform
  • Define and measure production availability, navigating known downtime, and service level outages.
  • Debug problems at scale for our mission critical services, and help our development teams implement lasting fixes to recurring issues
  • Execute, debug, and configure CI/CD pipelines.
  • Analyze service requests and take appropriate action meeting defined SLA
  • Define and implement monitoring metrics and alerts to ensure tools and environments are meeting SLA's for uptime and performance
  • Deliver SRE-focused technology roadmaps in collaboration with architecture, application development, security and infrastructure partners
  • Grow and lead a highly skilled team of SREs

  • Experience leading an SRE team as a hands-on player coach and architect
  • 3+ recent years of AWS or Azure cloud infrastructure architecture and Linux OS internals
  • Coding experience in C# or Java and mastery of one or more scripting languages: Bash, Python, Ansible
  • Experience with Kubernetes or AKS orchestration platform both as a programmer, and from an operations perspective.
  • Knowledge of networking, Linux and Windows operating system
  • Database experience (knowledge of SQL Server)
  • Experience with SDLC process and agile development practices (JIRA, git, Azure DevOps)
  • Experience with messaging systems and APIs
  • Working knowledge of networking (e.g., firewall, routing, network topologies and hardware, SDN)
  • Bachelors degree in Computer Science

Easy Apply Now
Easy Apply Now
Job Details
Managed by Jobot Pro
Location
REMOTE
Job Type
Permanent
Compensation
$140,000 - $200,000