Site Reliability Engineer

Overview:

Corporate Tools is looking for a Site Reliability Engineer. You will be a traditional company employee. This is a remote position, but if you’re near one of our local offices, you’re welcome to come hangout with us in-office as well. Our main offices are in Post Falls, ID, and Spokane, WA; we also have satellite offices in Austin, TX, and Salt Lake City, UT. You’ll be working 40 hours a week and, of course, enjoy great company benefits. Our Site Reliability Engineer should help keep our systems steady, secure, and running like a well-oiled machine (except without actual oil). You’ll work closely with our DevOps engineers to build out tools and automation that make things faster, easier, and less painful for everyone.

Your main job? Stop problems before they start. And when something does break (because let’s be real—it will), help us fix it quickly and learn from it so we don’t do the same dumb thing twice. We’re big on taking ownership here. You won’t get blamed for something going wrong—but you will be expected to help make it right.

If you like digging into weird errors, thinking ahead, and making things just work—even when no one notices—this might be your kind of thing.

Wage:

up to $175,000/ Year

Benefits:

100% employer-paid medical, dental and vision for employees
Annual review with raise option
22 days Paid Time Off accrued annually, and 4 holidays
- After 3 years, PTO increases to 29 days. Employees transition to flexible time off after 5 years with the company—not accrued, not capped, take time off when you want
- The 4 holidays are: New Year’s Day, Fourth of July, Thanksgiving, and Christmas Day
Paid Parental Leave
Up to 6% company matching 401(k) with no vesting period
Quarterly allowance
- Use to make your remote work set up more comfortable, for continuing education classes, a plant for your desk, coffee for your coworker, a massage for yourself... really, whatever
Open concept office with friendly coworkers
Creative environment where you can make a difference
No dumb benefits like free dog walking on the weekends that snobby hipster places have to make you feel cool, but mathematically won't cost the company much money because you won't use it
Trail Mix Bar --- oh yeah

Requirements:

Bachelor's degree in Computer Science, Software Engineering, or equivalent practical experience.
5+ years of experience in software engineering.
2+ years of experience in site reliability engineering, DevOps, or infrastructure engineering roles.
Deep experience with cloud platforms (AWS, Azure, or GCP) and infrastructure as code tools such as Terraform, CloudFormation, or Pulumi.
Strong proficiency with Kubernetes, Docker, and container orchestration in production environments.
Hands-on experience with observability and monitoring tools like Prometheus, Grafana, OpenTelemetry, Sentry, or New Relic.
Proven ability to design and implement highly available, fault-tolerant systems and lead proactive incident response efforts.
Experience with performance tuning, database optimization, and caching strategies (e.g., PostgreSQL, Redis, Memcached).
Demonstrated ability to drive reliability improvements, reduce operational toil, and foster a culture of resilience and continuous improvement.
Experience leading reliability-focused initiatives such as post-incident reviews, capacity planning, and root cause analysis.
Experience in site reliability engineering within Ruby on Rails environments.
Familiarity with the Grafana observability stack and related tools (e.g., Alloy, Loki, Tempo, Prometheus).
In-depth experience with AWS services, including ECS, EKS, Route 53, and other related tools.
Proven ability to collaborate across teams to improve service reliability, reduce incident frequency, and drive operational excellence.
Troubleshoot and resolve complex production issues, applying SRE best practices to minimize impact and prevent recurrence.
Continuously drive improvements in operational efficiency and system resilience.

Why you might like this job:

You like when things work—and you’re the kind of person who quietly fixes things while everyone else is still yelling “It’s broken!” You think alerts should be useful, not just annoying background noise, and you enjoy building systems that mostly run themselves (because babysitting servers isn’t your idea of fun).

You probably have a bit of a tinkerer’s soul. Maybe you’ve automated your coffee maker or built a Raspberry Pi just to turn your lights purple. You appreciate clean logs, quiet dashboards, and sleep that isn’t interrupted by 3AM calls.

You want to work somewhere that’s weird in a good way—where you’re trusted to do your job, encouraged to ask “why?”, and no one makes you sit through a meeting about synergy.

If that all sounds oddly satisfying, this might be the job for you.

Apply Today Why Work Here