Site Reliability Engineer-Cloud Infrastructure
4 weeks ago
About TikTok
TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and its offices include New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.
Why Join Us
Creation is the core of TikTok's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together. That's how we drive impact-for ourselves, our company, and the users we serve. Join us.
About the Team:
Video Infrastructure is a world-leading video platform that provides multi-media storage, delivery, transcoding, and streaming services. We are building the next generation video processing platform and the largest live streaming network, which provides excellent experiences for billions of users around the world. Popular video products of TikTok and its affiliates are all empowered by our cutting-edge cloud technologies. Working in this team, you will have the opportunity to tackle challenges of large-scale networks all over the world, while leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design.
SRE team is responsible for managing the whole video infrastructure and applications. Our mission is to ensure all production systems can support our fast growing world-wide user base as well as keep the entire systems stable, efficient and cost effective. We manage deployments, system capacity, traffic scheduling, fault tolerance, disaster recovery, emergency response, automations, operation platforms development, etc.
Responsibilities:
- Be responsible for the basic engineering construction of byte infrastructure products & components, focusing on infrastructure O&M architecture optimization, automated O&M platform research and development, data and intelligent O&M.
- Reliability: Ensure the stability of the company's core infrastructure (system high availability and reliability), focus on system performance and capacity, establish O&M (Operation & Maintenance) standards and SOP processes.
- Troubleshooting and locating technical issues, collaborate with the technical team to develop and implement system capacity planning, performance testing, anomaly analysis, and fault diagnosis and resolution strategies.
- Research and evaluate large-scale system architectures and technologies, use new tools and technologies to improve existing systems and processes to support business development.
- Design and implement O&M platforms to achieve efficient, automated, and intelligent system maintenance.
- Develop delivery standards for mass production system scales, from budgeting to resource delivery, to online system capacity assessments, to help the company optimize IT costs.
- Design and establish new IDC, design and implement data protection plans to meet standard requirements.
Qualifications
Minimum Qualifications:
- Bachelor's / Master's Degree in Computer Science or related major.
- Solid basic knowledge of computer software, understanding of Linux operating system, storage, network IO and other related principles.
- Familiar with one or more programming languages, such as Python, Go, and Java. Knowledge of design patterns and coding principles is necessary.
- Familiar with Elastic Search.
Preferred Qualifications:
1. Experience with storage, and relevant system experience with the following: KV, Table, Graph, Redis, MySQL, MongoDB, MQ, and Kafka.
2. Experience with computing & big data, and system experience with the following: Kubernetes, Docker/Containers, AIops, Spark, Flink, Function as a service, RPC Framework, and Service Mesh.
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
Seniority level- Associate
- Full-time
- Engineering and Information Technology
- Industries: Software Development
-
Senior Site Reliability Engineer
3 weeks ago
Dublin, Dublin City, Ireland Reperio Human Capital Full timeSenior Site Reliability Engineer 98301 Desired skills: Senior Site Reliability Engineer, SRE, Senior, Ireland, Remote, AWS, Cloud, Infrastructure Senior Site Reliability Engineer Ireland Salary: 75K+ Full-time, Permanent We are currently looking for a Senior Site Reliability Engineer for our client in Dublin. The role would be...
-
Senior Site Reliability Engineer
22 hours ago
Dublin, Ireland Reperio Human Capital Full timeSenior Site Reliability Engineer 98301 Desired skills: Senior Site Reliability Engineer, SRE, Senior, Ireland, Remote, AWS, Cloud, Infrastructure Senior Site Reliability Engineer Ireland Salary: 75K+ Full-time, Permanent We are currently looking for a Senior Site Reliability Engineer for our client in Dublin. The role would be...
-
Site Reliability Engineer
18 hours ago
Dublin, Dublin City, Ireland Scopely Full timeResponsibilitiesProvide site reliability engineering for cloud infrastructure powering the game.Liaise with backend and core technology engineers and ensure their needs are fulfilled while following good practices in terms of IaC coding and infrastructure usage.Uphold the security, observability, and compliance standards necessary to operate the game with...
-
Site Reliability Engineering Leader
3 hours ago
Dublin, Dublin City, Ireland Apple Inc. Full timeJob SummaryWe are seeking an experienced Site Reliability Engineer to join our Cloud Services Infrastructure team.This individual will play a key role in designing, building, and operating scalable and reliable cloud infrastructure services that power Apple's internet services.
-
Cloud Reliability Engineer
2 days ago
Dublin, Dublin City, Ireland Google Inc. Full timeAbout the JobSite Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We ensure that Google Cloud's services have reliability, uptime appropriate to customer needs, and a fast rate of improvement.SREs keep an ever-watchful eye on our systems capacity...
-
Site Reliability Engineer
22 hours ago
Dublin, Dublin City, Ireland Fruition Group Ireland Full timeMy client based in Co Louth is currently recruiting for a Site Reliability Engineer to join a growing team. The role is hybrid working. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of cloud-based ERP solutions. You will work closely with software engineers, DevOps teams, and...
-
Site Reliability Engineer
2 weeks ago
Dublin, Dublin City, Ireland Moofwd Limited Full timeAbout the Role: We are seeking a highly experienced Level 7 Site Reliability Engineer (SRE) to join our team, supporting mission-critical systems in the finance and payments industry. This role involves designing and maintaining secure, highly available, and scalable financial infrastructure while ensuring regulatory compliance, security, and real-time...
-
Cloud Infrastructure Engineer
2 days ago
Dublin, Dublin City, Ireland Amazon Full timeAbout the JobThe Cloud Infrastructure Engineer - Reliability Focus will play a critical role in ensuring the highest level of initial quality and ongoing support from our suppliers. Responsibilities include evaluating product design quality/reliability risks and assessing electronics manufacture process-related quality/reliability issues, as well as...
-
Site Reliability Engineer Specialist
7 days ago
Dublin, Dublin City, Ireland Tn Ireland Full timeTn Ireland Job OpportunityWe are seeking a highly skilled Site Reliability Engineer to join our team in Tn Ireland.Job SummaryThe successful candidate will be responsible for designing, implementing, and maintaining our cloud-based infrastructure to ensure high availability and performance.To be successful in this role, you will need:At least 3 years...
-
Site Reliability Engineer
5 days ago
Dublin, Dublin City, Ireland Lexisnexis Risk Solutions Full timeDo you have cloud infrastructure expertise?Would you like to join our great reliability engineering team?Site Reliability Engineer About the Business: LexisNexis Risk Solutions, part of the RELX Group, is the essential partner in the assessment of risk.Within our Insurance vertical, we provide customers with solutions and decision tools that combine public...
-
Site Reliability Engineer
2 weeks ago
Dublin, Ireland Moofwd Limited Full timeAbout the Role: We are seeking a highly experienced Level 7 Site Reliability Engineer (SRE) to join our team, supporting mission-critical systems in the finance and payments industry. This role involves designing and maintaining secure, highly available, and scalable financial infrastructure while ensuring regulatory compliance, security, and real-time...
-
Staff Site Reliability Engineer
3 weeks ago
Dublin, Dublin City, Ireland Sojern Full timePosition summary:Sojern is looking for a Staff Site Reliability Engineer in Dublin to collaborate with Software Engineering teams located primarily in our Dublin office. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform, and have strong experience running and securing workloads at scale on...
-
Cloud Infrastructure Architect
6 hours ago
Dublin, Dublin City, Ireland Scopely Full timeScopely is a global interactive entertainment company that creates immersive games for players around the world. As a Principal DevOps Engineer, you will be part of our Star Trek Fleet Command team, responsible for designing and implementing reliable and secure cloud infrastructure to power the game.About the RoleProvide site reliability engineering for...
-
Senior Site Reliability Engineer
4 weeks ago
Dublin, Ireland TN Ireland Full timeSenior Site Reliability EngineerLocation: Dublin, IrelandJob Category: OtherEU work permit required: YesJob Reference: dbee0d0caac2Job Views: 74Posted: 21.01.2025Expiry Date: 07.03.2025Salary: 75K+Job Type: Full-time, PermanentWe are currently looking for a Senior Site Reliability Engineer for our client in Dublin. The role would be remote with the potential...
-
Cloud Infrastructure Specialist
3 hours ago
Dublin, Dublin City, Ireland Scopely Full timeAbout Us\At Scopely, we create games for everyone and strive to make our work environments inclusive and welcoming. We aim to inspire play every day through our work environments and deep connections with our communities of players.\As a leading mobile-first video game company, we develop, publish, and innovate in the gaming industry, connecting millions of...
-
Dublin, Dublin City, Ireland Amazon Full timeSr. Infrastructure Reliability Engineer, Infrastructure Reliability & QualityJob ID: 2804990 | Amazon Asia-Pacific Resources Private Limited (Singapore)AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers...
-
Dublin, Dublin City, Ireland Amazon Full timeSr. Infrastructure Reliability Engineer, Infrastructure Reliability & QualityJob ID: 2804990 | Amazon Asia-Pacific Resources Private Limited (Singapore)AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers...
-
Cloud Engineer
2 weeks ago
Dublin, Dublin City, Ireland Damson Cloud Full timeTitle: Cloud EngineerRemote status: Fully Remote (In Ireland)Annual Salary range: €50,000 - €55,000Department: Operations/Technical TeamEmployment Type: Full-TimeCompany OverviewDamson Cloud is a Google Cloud Premier Partner in Ireland. With over 22+ years of experience in the IT industry we focus on helping organisations leverage the value of Google...
-
Staff Site Reliability Engineer
4 weeks ago
Dublin, Ireland Sojern Full timePosition summary:Sojern is looking for a Staff Site Reliability Engineer in Dublin to collaborate with Software Engineering teams located primarily in our Dublin office. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform, and have strong experience running and securing workloads at scale on...
-
Site Reliability Engineer
4 weeks ago
Dublin, Ireland Google Inc. Full timecorporate_fare Google place Dublin, IrelandMidExperience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area.ApplyBachelor’s degree in Computer Science, a related field, or equivalent practical experience.2 years of experience with data structures/algorithms and software...