Software Engineering Manager, Reliability Tooling
3 weeks ago
Toast is driven by building the platform that helps restaurants adapt, take control, and focus on what they do best: creating experiences their guests love. Tremendous business growth has spurred a need for significant investment in Toast's platform teams. The Site Reliability Engineering team at Toast is responsible for overseeing Toast production services, with a commitment to quality, reliability, and low latency, without needing heroics. The team accomplishes this goal by:
Building tooling to automate, monitor, and manage deployed services using reliability best practices
Developing and evangelizing patterns and best practices to improve the scalability, observability, and reliability of all Toast systems
Consulting with teams to improve product scalability, observability, security, and reliability
Participating in outage response and root cause analysis for critical systems and infrastructure incidents
As a Manager of the Site Reliability Engineering Tooling team, you will provide technical leadership and hands-on code contributions, incorporating reliability best practices for programming and scripting, observability, production triage, incident resolution, and retrospective/root cause analysis to maintain the world-class reliability and uptime of our platform.
Responsibilities
Enable a geographically distributed team of talented engineers to continue performing at a high level and help increase the impact of their work
Drive day-to-day operations of the team and contribute to the development and prioritization of the SRE roadmap for major initiatives
Create and drive strategic organization-wide scalability, observability, and reliability initiatives in collaboration with technical leadership and Product Management
Influence architecture decisions for your team and for individual services to optimize resilience and scalability
Guide teams to build and maintain systems that are reliable and available for Toast customers
Facilitate professional growth by mentoring engineers on your team
Requirements
Hands-on experience managing or leading an internal tools team, including hiring, mentoring, and cross-functional collaboration
Hands-on coding experience with multiple programming languages: Java/JVM required, plus one or more of Kotlin, Go, Python, etc.
Background in leading complex engineering projects in a Scrum environment
Experience in building and running distributed systems
Exposure to networking, cloud architectures, and patterns
Deep understanding of systems, networking, and scaling issues
Direct exposure to cloud infrastructure and SaaS solutions
AI at Toast
At Toast we’re Hungry to Build and Learn. We believe learning new AI tools empowers us to build for our customers faster, more independently, and with higher quality. We provide these tools across all disciplines, from Engineering and Product to Sales and Support, and are inspired by how our Toasters are already driving real value with them. The people who thrive here are those who embrace changes that let us build more for our customers; it’s a core part of our culture.
Our Spread of Total Rewards
We strive to provide competitive compensation and benefits programs that help to attract, retain, and motivate the best and brightest people in our industry. Our total rewards package goes beyond great earnings potential and provides the means to a healthy lifestyle with the flexibility to meet Toasters’ changing needs. Learn more about our benefits at https://careers.toasttab.com/toast-benefits.
Bread puns encouraged but not required
Diversity, Equity, and Inclusion
At Toast, our employees are our secret ingredient: when they thrive, we thrive. The restaurant industry is one of the most diverse, and we embrace that diversity with authenticity, inclusivity, respect, and humility. By embedding these principles into our culture and design, we create equitable opportunities for all and raise the bar in delivering exceptional experiences.
We Thrive Together
We embrace a hybrid work model that fosters in-person collaboration while valuing individual needs. Our goal is to build a strong culture of connection as we work together to empower the restaurant community. To learn more about how we work globally and regionally, check out: https://careers.toasttab.com/locations-toast.
Apply today
Toast is committed to creating an accessible and inclusive hiring process. As part of this commitment, we strive to provide reasonable accommodations for persons with disabilities to enable them to access the hiring process. If you need an accommodation to access the job application or interview process, please contact candidateaccommodations@toasttab.com.
For roles in the United States, it is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
-
Site Reliability Engineer II
3 weeks ago
Dublin, Ireland Guidewire Software Full time
Summary: Are you passionate about solving interesting technical challenges by defining, designing, deploying and troubleshooting Cloud...
-
Site Reliability Engineer III
2 weeks ago
Dublin, Ireland Guidewire Software Full time
Summary: At Guidewire, we make software that offers Property and Casualty (P&C) Insurance companies the tools to take care of their customers when they need it the most, whether that's a time of crisis, a natural disaster, an accident, or exposure to cyber risks. We build the core applications that insurance companies use to sell and underwrite policies, settle...
-
Software Engineer, Site Reliability
2 weeks ago
Dublin, Ireland eBay Full time
Client: eBay. Location: Dublin, Ireland. Job Category: Other. EU work permit required: Yes. Job Reference: b284d91f****
Job Description: At eBay, we're more than a global ecommerce leader; we're changing the way the...
-
Site Reliability Engineer
4 weeks ago
Dublin Pike, Ireland Apple Inc. Full time
Dublin, County Dublin, Ireland. Software and Services.
Description: The Apple Services Engineering Cloud Services SRE organization are domain experts in fleet management, systems, and software engineering. We build automations, instrument reliability tools, and respond to alerts and incidents which may pose a risk to the reliability of the platform. The...
-
Reliability Engineering Manager
3 weeks ago
Dublin Pike, Ireland Mondelēz International Full time
Location: Coolock, Dublin, Ireland. Time type: Full time. Posted: Yesterday. Job requisition ID: R-144606
Job Description: Are You Ready to Make It Happen at Mondelēz International? Join our Mission to Lead the Future of Snacking. Make It With Pride. Your goal will be...
-
Software Engineer
3 weeks ago
Dublin, Ireland Revolut Full time
Overview: Software Engineer (DevOps) - Database Reliability. Our Technology team is one of the best in the world. We’re looking for a DevOps Engineer with database expertise to join our Site Reliability team and drive automation and tooling to manage, scale, and...
-
Dublin Pike, Ireland Google Inc. Full time
Senior Site Reliability Engineer, Software Engineer. Google, Dublin, Ireland.
About the job: Site Reliability Engineering (SRE) is what you get when you treat operations as if it’s a software problem. Our mission is to progress, protect, and provide for the software and systems behind all of Google's public services - Search, Ads, Gmail, Android, YouTube, and...
-
Senior Site Reliability Engineer
4 weeks ago
Dublin Pike, Ireland Reddit, Inc. Full time
Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. Reddit is one of the internet’s largest sources of information. Reddit SRE is rapidly innovating and our...
-
Site Reliability Engineer
1 week ago
Dublin, Ireland Reperio Human Capital Full time
Site Reliability Engineer 190995. Desired skills: SRE, Azure, SaaS, Dublin
Site Reliability Engineer - Azure | SaaS | Dublin (Hybrid - 2 Days Onsite)
A leading software company is looking for a Site Reliability Engineer (SRE) to join their growing team supporting a complex SaaS platform hosted in Microsoft Azure. This is a traditional SRE role focused on...
-
Security Site Reliability Engineer
3 weeks ago
Dublin Pike, Ireland Apple Inc. Full time
Security Site Reliability Engineer - Apple Service Engineering. Dublin, County Dublin, Ireland. Software and Services.
Description: We are seeking a highly skilled and motivated Security Site Reliability Engineer (SRE) to join our dynamic and growing team. As a Security SRE, you will play a critical role in ensuring the security, reliability, and scalability...