Senior MLOps Engineer, vLLM Inference

17 hours ago


Dublin Pike, Ireland Red Hat Full time

At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments.
We are seeking an experienced MLOps engineer to work closely with our product and research teams to scale state-of-the-art (SOTA) deep learning products and software. As an MLOps engineer, you will manage training and deployment pipelines, create DevOps and CI/CD infrastructure, and scale our current technology stack.
In this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the DevOps team, and find opportunities to automate procedures and tasks.
What You Will Do
Collaborate with research and product development teams to scale machine learning products for internal and external applications
Create and manage model training and deployment pipelines
Actively contribute to managing and releasing upstream and midstream product builds
Test to ensure correctness, responsiveness, and efficiency
Troubleshoot, debug, and upgrade Dev & Test pipelines
Identify and deploy cybersecurity measures by continuously performing vulnerability assessment and risk management
Collaborate with a cross-functional team on market requirements and best practices
Keep abreast of the latest technologies and standards in the field
What You Will Bring
2+ years of experience in MLOps, DevOps, Automation, and modern Software Deployment practices
Experience evaluating LLMs for accuracy and for performance on accelerators
Strong experience with Python and pytest
Experience with Git, GitHub Actions, Terraform, Jenkins, Ansible, and common technologies for automation and monitoring
Extensive experience administering Kubernetes/OpenShift
Familiarity with Agile development methodology
Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud
Solid programming skills, especially in Python
Solid troubleshooting skills
Ability to interact comfortably with the other members of a large, geographically dispersed team
Experience maintaining an infrastructure and ensuring stability
The salary range for this position is $133,650.00 - $220,680.00. Actual offer will be based on your qualifications.
Pay Transparency
Red Hat determines compensation based on several factors, including job location, experience, applicable skills and training, external market value, and internal pay equity.
About Red Hat
Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies.
Benefits
Comprehensive medical, dental, and vision coverage
Flexible Spending Account - healthcare and dependent care
Health Savings Account - high deductible medical plan
Retirement 401(k) with employer match
Paid time off and holidays
Paid parental leave plans for all new parents
Leave benefits, including disability, paid family medical leave, and paid military leave
Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone.
Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer.




