Senior Site Reliability Engineer
1 day ago
Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 97M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit redditinc.com.
Reddit SRE is rapidly innovating and our teams are working to meet the needs of infrastructure and development teams as they evolve our product faster than ever before. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the internet.
As a Senior Site Reliability Engineer on Reddit’s Infrastructure SRE team, you’ll use your knowledge of distributed systems and architecture to improve the reliability and performance of Reddit’s engineering platforms and services. We are looking for someone who thrives at the intersection of infrastructure and software development. This team will work closely with the Compute, Traffic, and Observability infrastructure teams and will own a suite of tools, based primarily on open-source solutions running at scale, that help engineers understand the systems they build. We’re active users of and contributors to Prometheus, Thanos, Grafana, Vector, and more.
In this role, you will also take ownership of risk management, ensuring the reliability and performance of our systems. You will collaborate with cross-functional teams to identify, assess, and mitigate risks, implementing best practices to enhance system resilience. Your expertise will drive proactive measures to maintain uptime and optimize service delivery, making a significant impact on our operational excellence.
Join us and help build the future of Reddit.
Responsibilities:
- Advise: Work closely with engineering teams in designing and developing systems that are resilient and highly performant at a tremendous scale, and maintaining the foundational platform for running Reddit’s infrastructure.
- Amplify: Identify and build capabilities into our foundational Infrastructure and Platform services, which are used by Reddit engineering teams to build, deploy, and operate Reddit.
- Deliver software to improve the availability, scalability, latency, and efficiency of observability components.
- Identify and engineer away risk across Reddit’s systems.
- Automate: Take repetitive, manual, or risky tasks and automate them out of existence. Build tools and integrate systems to support Reddit’s evolution.
- Automate critical aspects of the event-driven development process.
- Diagnose: Draw on your knowledge of distributed systems to identify and fix network, system, and service-level issues. Practice sustainable incident response, and drive structural improvement with blameless postmortems.
- Share on-call responsibilities.
- Optimize: Observe and improve performance, reduce cost, and improve the experience for millions of users.
- Contribute upstream changes to the open-source projects we use.
Qualifications:
- 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
- Proficiency in one or more programming languages. We predominantly write code in Go and Python.
- Experience with Kubernetes and Cloud systems.
- Familiarity with distributed systems development; experience with any of the following tools is a bonus: Prometheus, Thanos, Grafana, Vector, ClickHouse, OTel, Loki.
- Experience with the development and operation of high-traffic backend systems.
- A demonstrated ability to debug, fix, and optimize code.
- Troubleshooting skills that span applications, networking (TCP/IP), and systems.
- Strong working knowledge of Linux and containers.
- Excellent communication and collaborative skills.
Benefits:
- Private Medical, Dental and Vision Benefits.
- Retirement Savings plan with matching contributions.
- Workspace benefits for your home office.
- Family Planning Support.
- Flexible Vacation & Reddit Global Days Off.
Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at ApplicationAssistance@Reddit.com.