Associate Site Reliability Engineer

3 weeks ago


Dublin, Dublin City, Ireland TikTok Full time

About TikTok

TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and its offices include New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.

Why Join Us

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible. Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day. To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve. Join us.

About the Team

Video Infrastructure is a world-leading video platform that provides multi-media storage, delivery, transcoding, and streaming services. We are building the next generation video processing platform and the largest live streaming network, which provides excellent experiences for billions of users around the world. Popular video products of TikTok and its affiliates are all empowered by our cutting-edge cloud technologies.

Working in this team, you will have the opportunity to tackle challenges of large-scale networks all over the world, while leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. SRE team is responsible for managing the whole video infrastructure and applications. Our mission is to ensure all production systems can support our fast growing world-wide user base as well as keep the entire systems stable, efficient and cost effective. We manage deployments, system capacity, traffic scheduling, fault tolerance, disaster recovery, emergency response, automations, operation platforms development, etc. Our team is full of diversity. We have team members in Singapore, USA and Australia. Now we are extending our teams to Ireland. We are looking forward to seeing new talents joining our team and together helping TikTok grow.

Responsibilities
  1. Be responsible for the basic engineering construction of byte infrastructure products & components, focusing on infrastructure O&M architecture optimization, automated O&M platform research and development, data and intelligent O&M. Through the methodology of software engineering and digital intelligence, O&M, around the O&M requirements of infrastructure products & components, built a layered and systematic O&M platform to solve the problem of ultra-large-scale cluster O&M management. (Goals) To provide stable, efficient, and low-cost serverless infrastructure facilities for Mid-Platform & Business. We aim to be the leading SRE team across the industry.
  2. Reliability: Ensure the stability of the company's core infrastructure (system high availability and reliability), focus on system performance and capacity, establish O&M (Operation & Maintenance) standards and SOP processes.
  3. Reliability: Troubleshooting and locating technical issues, collaborate with the technical team to develop and implement system capacity planning, performance testing, anomaly analysis, and fault diagnosis and resolution strategies.
  4. Efficiency: Research and evaluate large-scale system architectures and technologies, use new tools and technologies to improve existing systems and processes to support business development.
  5. Efficiency: Design and implement O&M platforms to achieve efficient, automated, and intelligent system maintenance.
  6. Cost: Develop delivery standards for mass production system scales, from budgeting to resource delivery, to online system capacity assessments, to help the company optimize IT costs.
  7. Compliance: Design and establish new IDC, design and implement data protection plans to meet standard requirements.
Requirements
  • Bachelor's / Master's Degree in Computer Science or related major.
  • Solid basic knowledge of computer software, understanding of Linux operating system, storage, network IO and other related principles.
  • Familiar with one or more programming languages, such as Python, Go, and Java. Knowledge of design patterns and coding principles is necessary.

Preference will be given to those who have one of the following:

  1. Experience with storage, and relevant system experience with the following: KV, Table, Graph, Redis, MySQL, MongoDB, MQ, and Kafka.
  2. Experience with computing & big data, and system experience with the following: Kubernetes, Docker/Containers, AIops, Spark, Flink, Function as a service, RPC Framework, and Service Mesh.

#J-18808-Ljbffr

  • Dublin, Dublin City, Ireland realTime Recruitment Full time

    Job OpeningSite Reliability Engineer - SREPermanentDublin RealTime are looking for a Site Reliability Engineer to help with the development and deployment of tooling, monitoring, control, self-service reporting, and analysis approach. You will be design & build, with a focus on monitor & traceability and remediation of security, and network issues using...


  • Dublin, Dublin City, Ireland Daft Media Limited Full time

    What's the OpportunityYou'll be part of an experienced Site Reliability Team where you'll collaborate closely with software and quality engineers. We value everyone's input and lean on our team's collective experience to continuously enhance both our processes and platforms.We're on the lookout for a Site Reliability Engineer. In this role, you'll thrive in...


  • Dublin, Dublin City, Ireland Principle HR Full time

    Are you a tech enthusiast with a passion for ensuring top-notch performance and reliability? We're on the hunt for a Site Reliability Engineer to join our dynamic team in DublinThe Offer:Competitive Annual Salary of up to €82,000 doeLocation: Dublin 24, hybrid working model – 2-3 days onsite6 months contract – PAYE (Paid weekly) until the end of the...


  • Dublin, Dublin City, Ireland realTime Recruitment Full time

    Job Opening: Lead Site Reliability Engineer - SRE Permanent Position in Dublin, starting on RealTime is seeking a Lead Site Reliability Engineer to oversee a site reliability function, designing, implementing, and leading a team responsible for achieving growth and strategic objectives that change the industry. Key responsibilities include monitoring and...


  • Dublin, Dublin City, Ireland Token, Inc. Full time

    We're looking for an experienced Site Reliability Engineer (SRE) to help drive forward the platform at Our SRE team work closely with client facing teams and internal Engineering to make the service highly reliable and scalable. Here's what you get to do Design, develop, implement and own products and solutions to improve the security, reliability, and...


  • Dublin, Dublin City, Ireland Token, Inc. Full time

    We're looking for an experienced Site Reliability Engineer (SRE) to help drive forward the platform at Our SRE team work closely with client facing teams and internal Engineering to make the service highly reliable and scalable. Here's what you get to do Design, develop, implement and own products and solutions to improve the security, reliability, and...


  • Dublin, Dublin City, Ireland Reperio Human Capital Full time

    Site Reliability Engineer 96880Desired skills: AWS, Containerisation, Terraform, Ansible, DevOpsThis is a hybrid position based in Dublin, Ireland.Requirements:Highly experienced across AWS technologies & servicesHands-on experience with containerisation technologies (Docker, Kubernetes)Scripting skills (Java, Python, Go)Exposure to Infrastructure as Code...


  • Dublin, Dublin City, Ireland Reddit Inc Full time

    Reddit SRE is rapidly innovating and leading the company on a mission to meet Redditor's user-experience expectations. Our teams are working to meet the needs of infrastructure and development teams as they evolve our product faster than ever before. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the...


  • Dublin, Dublin City, Ireland Adobe Full time

    Site Reliability Engineer, Adobe Stock page is loaded Site Reliability Engineer, Adobe Stock Apply locations Dublin Remote Northern Ireland Remote Denmark Remote Ireland Maidenhead time type Full time posted on Posted Yesterday job requisition id R145266 Our Company Changing the world through digital experiences is what Adobe's all about. We give...


  • Dublin, Dublin City, Ireland Adobe Full time

    JOB LEVELP40EMPLOYEE ROLEIndividual ContributorThe ChallengeAdobe Stock team is looking for an exceptional Site Reliability Engineer (SRE) to support the innovation we are bringing to the market through microservice and continuous integration/continuous deployment processes. We provide designers and businesses access to hundreds of millions high-quality,...


  • Dublin, Dublin City, Ireland Adobe Full time

    JOB LEVELP40EMPLOYEE ROLEIndividual ContributorThe ChallengeAdobe Stock team is looking for an exceptional Site Reliability Engineer (SRE) to support the innovation we are bringing to the market through microservice and continuous integration/continuous deployment processes. We provide designers and businesses access to hundreds of millions high-quality,...


  • Dublin, Dublin City, Ireland Regeneron Pharmaceuticals, Inc Full time

    Within this role you will be responsible for providing support, direction and subject matter expertise on mechanical reliability issues to Facilities Production, Utilities and HVAC teams. Evaluating current mechanical plant equipment for reliability and assesses alternative options available. Maintaining data of current equipment performance to identify and...

  • Reliability Engineer

    3 weeks ago


    Dublin, Dublin City, Ireland Cpl Healthcare Full time

    SK biotek Ireland are seeking to recruit a Reliability Engineer to join the Maintenance Department based in Swords, Co. Dublin on a temporary 12 month basis through CPL.Key responsibilitiesThe Reliability Engineer act as a key technical person responsible for resolution of repetitive failures and long term issues and to define the reliability strategy...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Staff Software Engineer, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Apply Bachelor's degree in Computer Science, a related field, or equivalent practical experience.Candidates will typically have 5 years of experience with software development in one or more programming languages.Typically 8 years of experience with data...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Software Engineer III, Site Reliability Engineering corporate_fare Google place Dublin, Ireland ; Ireland laptop_windows Remote eligible Apply info_outline info_outline X Info Note: Google's hybrid workplace includes remote and in-office roles. By applying to this position you will have an opportunity to share your preferred working location from the...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Software Engineering Manager II, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Apply Bachelor's degree in Computer Science, a related field, or equivalent practical experience.Candidates will typically have 8 years of experience with data structures or algorithms.Typically 5 years of experience with software development in one or...


  • Dublin, Dublin City, Ireland Amazon Full time

    Site Reliability Engineer, Managed OperationsAWS is set to introduce the inaugural European Sovereign Cloud (ESC), marking a significant development in utility computing (UC). To spearhead this initiative, we are actively seeking experienced System development engineers with a strong background in automation and operations. As part of the AWS Managed...


  • Dublin, Dublin City, Ireland Amazon Full time

    Sr. Infrastructure Reliability Engineer, Infrastructure Reliability & QualityJob ID: | Amazon Data Services Ireland LimitedAs an Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter infrastructure equipment (Example: Air Handling Units, LV Generator, MV Transformers,...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Software Engineering Manager II, Site Reliability Engineering corporate_fare Google place Dublin, Ireland Apply Bachelor's degree in Computer Science, a related field, or equivalent practical experience.Candidates will typically have 8 years of experience with data structures or algorithms.Typically 5 years of experience with software development in one or...


  • Dublin, Dublin City, Ireland Google Inc. Full time

    Principal Engineer, AI, Trust, Security, Site Reliability Engineering link Copy link corporate_fare Google place Dublin, Ireland bar_chart Director+ Apply link Copy link Bachelor's degree in Computer Science, similar technical field, or equivalent practical experience. Experience in technical leadership and setting technical direction for...