Lab Manager Job at MetroSys, Livermore, CA

bVBucDF3VVF0L1lQMG5XMXkyb3JtUjFlUmc9PQ==
  • MetroSys
  • Livermore, CA

Job Description

Position Overview

We are seeking a Lab Manager with expertise in containerization and AI infrastructure to lead the setup, management, and optimization of containerized environments for AI development and testing. This role will focus on building and maintaining scalable, efficient, and high-performance lab environments to support AI-driven applications. The ideal candidate will have experience managing lab operations, designing container-based architectures, and collaborating with stakeholders to ensure seamless integration of AI workloads.

Key Responsibilities

Lab Management & Infrastructure
  • Oversee the setup, configuration, and ongoing management of lab environments, including containerized AI workloads.

  • Maintain hardware and software resources, ensuring high availability and performance for AI research and development.

  • Implement best practices for container orchestration (Docker, Kubernetes) to support AI workloads efficiently.

  • Manage resource allocation for compute-intensive AI tasks, optimizing GPU, CPU, and storage usage.

Containerization & System Architecture
  • Design and implement containerized environments for AI applications, ensuring scalability and security.

  • Utilize Kubernetes, Docker, and other orchestration tools to deploy and manage containerized AI models.

  • Develop automated workflows for building, testing, and deploying AI models within a containerized infrastructure.

  • Ensure integration with cloud and on-prem infrastructure for hybrid AI workloads.

Collaboration & Stakeholder Engagement
  • Work closely with data scientists, AI engineers, and IT teams to understand infrastructure requirements.

  • Provide technical leadership on best practices for AI containerization and lab infrastructure.

  • Conduct knowledge-sharing sessions to educate teams on containerization strategies and deployment methodologies.

Security & Compliance
  • Implement security best practices for containerized environments, including RBAC, encryption, and vulnerability management.

  • Ensure compliance with industry standards and internal policies for AI research and development.

  • Monitor and enforce access controls to protect sensitive AI datasets and computing resources.

Documentation & Support
  • Maintain detailed documentation for lab setups, configurations, and container orchestration strategies.

  • Provide troubleshooting and technical support to AI teams using the lab environment.

  • Continuously optimize container performance, resource utilization, and system reliability.

Qualifications

Education:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (Master’s degree preferred).

Experience:

  • 5+ years of experience in IT infrastructure, containerization, or AI-focused lab management .

  • Hands-on experience with Kubernetes, Docker, OpenShift, or other container orchestration platforms .

  • Strong background in Linux system administration and networking in a lab or enterprise setting.

  • Experience working with AI/ML frameworks (TensorFlow, PyTorch, etc.) in containerized environments.

Skills:

  • Expertise in container orchestration and deployment automation (Helm, Terraform, Ansible).

  • Knowledge of GPU acceleration and resource scheduling for AI workloads .

  • Proficiency in monitoring and logging tools (Prometheus, Grafana, ELK Stack) .

  • Strong scripting skills ( Python, Bash, YAML ) for automation and system configuration.

  • Excellent problem-solving and troubleshooting abilities.

Preferred Certifications:

  • Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD) .

  • Docker Certified Associate (DCA) .

  • Red Hat Certified Engineer (RHCE) or equivalent Linux certification.

Job Tags

Similar Jobs

State of Arkansas

PUBLIC INFORMATION COORDINATOR Job at State of Arkansas

 ...Class Code: P013CGrade: GS07FLSA Status: EXEMPTSalary Range: $40,340.00 - $64,343.00SummaryThe Public Information Coordinator is responsible for overseeing public relations activities and developing and administering educational and informational programs related to... 

Globe Life AO

~REMOTE BENEFITS REPRESENTATIVE - ENTRY LEVEL | CUSTOMER SERVICE & SALES | WEEKLY PAY Job at Globe Life AO

Start a Fulfilling Career with Globe Life AO No Experience Needed! Are you looking for a remote position where you can grow your income, make a difference, and enjoy work-life balance? Globe Life AO is now hiring Remote Benefits Representatives to join our fast-growing...

Symmetry Financial Group –Simple Solutions Financial Service...

Debt Free Life Specialists - Work from home Job at Symmetry Financial Group –Simple Solutions Financial Service...

Are you looking for work life balance while making a substantial income? Our agents participate in meaningful and impactful work while...  ...Qualifications Must currently hold a Life Insurance License in your home state or be willing to obtain one. We are more than happy to... 

Beijing Orange English Limited Company

High Paid Dancing and Sports(Basketball, volleyball, football, tennis) Foreign Teachers Job at Beijing Orange English Limited Company

 ...have a calm and positive personality, wed like to meet you.1. Prepare classroom and course materials. 2. Use a combination of audio-visual & electronic media to engage the students. 3. Create a supportive and positive classroom environment. 4. Resolve crises in... 

HOLCIM Group

Process Operator Job at HOLCIM Group

 ...ABOUT THE ROLE Systech Process Operators are trained in one process area and work with a minimum of supervision, unloading hazardous (i.e., flammable) and/or non-hazardous materials; and processing or blending the material for reuse as a fuel. WHAT YOU'LL ACCOMPLISH...