Senior Machine Learning Engineer - ML Training Infrastructure

Work from home Full-time role Hiring

The Role: We are seeking an experienced, technically oriented, impact-driven expert in ML Training Infrastructure with a strong ability to execute hands-on technical work. In this role, you will be responsible for designing and building scalable, reliable, and high-performance AI/ML platform infrastructure to support advanced AI research and model development initiatives. As a Senior ML Engineer, you will collaborate closely with machine learning engineers, research scientists, and other partners to develop state-of-the-art AI solutions that enable the future of intelligent driving technologies across General Motors vehicles.

What You'll Do:

  • Design and develop a scalable, reliable, high-performance ML framework to support model training at scale.
  • Analyze and optimize model training performance to scale distributed training workflows, maximize resource utilization across heterogeneous hardware environments, and reduce cost.
  • Raise the bar on system observability, debuggability, operational excellence, and user experience.
  • Collaborate with cross-functional teams to integrate new features and technologies into the platform.

Your Skills & Abilities (Required Qualifications)

  • Bachelor's degree or higher in Computer Science or an equivalent major, OR equivalent relevant experience
  • 3+ years professional software engineering experience.
  • 2+ years specialized experience in AI/ML infrastructure, e.g., enabling distributed training for scaling large ML models
  • Strong programming skills in Python, with proficiency in frameworks such as PyTorch (preferred), TensorFlow, or similar
  • Experience with distributed computing, GPU computing, and cloud environments (AWS, GCP, Azure).
  • Willingness to travel to Sunnyvale, CA as needed
  • Comfortable working in highly ambiguous and dynamic environments

What Will Give You a Competitive Edge (Preferred Qualifications)

  • 3+ years of professional software engineering experience.
  • Self-motivated, with strong execution and an impact-oriented mindset
  • Extensive knowledge of and experience with PyTorch 2.x+ and distributed training frameworks
  • Experience designing and developing training frameworks that support FSDP, pipeline parallelism, and other scalable solutions for training large foundation models
  • Experience profiling, analyzing, debugging, and optimizing training and data loading performance
  • Excellent communication skills to resolve disagreements, build consensus, communicate risks, and give constructive feedback

Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of the California Bay Area.

  • The salary range for this role is $170,000 to $240,000. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.
  • Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.

Relocation: This job may be eligible for relocation benefits.

Benefits:

  • GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

#GM-AV-1
