company logo

Senior Manager, Site Reliability Engineering

Plume Design, Inc.com

Office

United States

Full Time

Life At Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

Senior Manager, Site Reliability Engineering (Sre)

Life At Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

We are seeking a highly experienced and visionary Director of Site Reliability Engineering (SRE) to lead our growing global SRE organization. This critical role requires a strong people leader who can manage managers, set the strategic direction for the SRE function, and ensure the reliability, scalability, and performance of our systems.

What You’Ll Do:

  • People Leadership and Management:
  • Lead, mentor, and develop a team of SRE managers and individual contributors across multiple geographical locations.
  • Foster a culture of continuous learning, collaboration, and operational excellence within the SRE organization.
  • Conduct regular 1:1s, provide constructive feedback, and support career development for all team members.
  • Mediate conflicts and facilitate effective communication within the team and with other departments.
  • Organizational Strategy and Direction:
  • Define and articulate the strategic vision and roadmap for the global SRE organization, aligning with overall business objectives.
  • Establish and enforce best practices for incident response, problem management, change management, and disaster recovery.
  • Drive the adoption of SRE principles and methodologies across engineering teams.
  • Stay abreast of industry trends and emerging technologies to continuously improve our SRE capabilities.
  • Hiring and Team Growth:
  • Lead the recruitment efforts for SRE roles, including defining job requirements, interviewing candidates, and making hiring decisions.
  • Develop and implement onboarding programs to ensure new hires are successfully integrated into the team.
  • Identify skill gaps and implement training programs to enhance the capabilities of the SRE team.
  • Enabling and Empowering the Team:
  • Provide the necessary tools, resources, and support to enable SRE teams to effectively monitor, troubleshoot, and optimize system performance.
  • Empower teams to take ownership of system reliability and drive continuous improvement initiatives.
  • Remove roadblocks and facilitate cross-functional collaboration to ensure the success of SRE projects.

Delivery and Results:

  • Ensure the successful delivery of SRE initiatives, projects, and goals, meeting defined SLAs and KPIs.
  • Drive efforts to reduce operational toil, improve system availability, and enhance overall system stability.
  • Report on key SRE metrics and progress to senior leadership.

What You’Ll Bring:

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 10+ years of experience in Site Reliability Engineering or a similar role.
  • 3-5 years of experience managing and leading engineering teams, including managing managers, in a global environment.
  • Proven track record of building and scaling high-performing SRE organizations.
  • Deep understanding of SRE principles, methodologies, and best practices.
  • Experience with large-scale distributed systems and cloud platforms.
  • Strong communication, interpersonal, and leadership skills.
  • Ability to think strategically and execute tactically.
  • Experience with budgeting and resource allocation.

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for ISPs and their subscribers, Plume partners with over 400 ISP customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM. 

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows ISPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.  

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.

Senior Manager, Site Reliability Engineering

Office

United States

Full Time

October 15, 2025

company logo

Plume Design, Inc