Senior Software Engineer, Distributed AI Infrastructure in San Jose, CA at Volt

Date Posted: 10/24/2021

Job Snapshot

  • Employee Type:
    Direct Hire
  • Location:
    San Jose, CA
  • Job Type:
  • Duration:
    N/A
  • Date Posted:
    10/24/2021
  • Job ID:
    306672
  • Contact Name
    Volt Branch
  • Phone
    714-921-7429

Job Description

Volt is looking for a Senior Software Engineer, Distributed AI Infrastructure to join our client that is at the forefront of innovation in creating intelligent and interactive machines.

  • Our client’s perspective on Artificial Intelligence is to build an ecosystem that is user-centric rather than device-centric.
  • A world where Artificial Intelligence can recommend and perform integrated and seamless functions across all your devices, it gets to know you better and better, continuously learning through interactions with you.
  • Our client works on distributed deep learning at scale across hundreds of GPU nodes to accelerate and automate deep learning workflows in a Kubernetes environment – on premises and in the cloud.
  • Our client specializes in making deep neural networks work best on GPU High Performance Computing clusters; we love to work on scaling, breakthrough performance and record shattering benchmarks.

The Senior Software Engineer is:

  • an expert in writing micro services in Python language who will help us to develop tools and ensure business needs are translated into understandable, ethical, and thus trustworthy AI.

Responsibilities:

  • Design & implement micro services in Python that can be deployed on Kubernetes.
  • Design & implement Kubernetes based software that can be deployed as pods, jobs, deployment, services, ingress, etc.
  • Write REST based Micro-Services Software in Python interacting with Postgres database.
  • Write and debug Cluster Software with the PyTorch Framework.
  • Provide expert level knowledge of Distributed PyTorch Framework to team members and customers.
  • Above all, deliver very high-quality code that can be maintained in production.

 Requirements:

  • BS/MS degree in Computer Science, Computer Vision or related field.
  • 5+ years in enterprise software, 2+ years in Python
  • 3+ years of experience as a software engineer with emphasis on serving data science and machine learning workflows.
  • Experience with ETL and ML pipeline implementation and maintenance.
  • Fluent in Python, and Python data science ecosystem tools.
  • Experience operationalizing data science and machine learning algorithms.

 Volt is an equal opportunity employer.