Staff Software Engineer - Computer Vision Deployment

Remote role Full-time Open position

About the position

We're looking for a Staff Software Engineer - Computer Vision Deployment to build and scale the infrastructure that powers our AI-driven warehouse intelligence platform. You'll own the end-to-end lifecycle of computer vision models, from training pipelines through optimized cloud deployment, ensuring our cutting-edge computer vision and multi-modal AI systems run reliably and efficiently in production. Your work will directly enable the real-time perception and autonomous decision-making capabilities at the core of our platform.

This is a deeply technical role at the intersection of machine learning, distributed systems, and cloud infrastructure. You'll design scalable GPU compute clusters, build robust orchestration pipelines, and optimize model serving for low-latency inference at scale. You'll work closely with our research scientists, computer vision engineers, and product teams to bridge the gap between experimental models and production-ready systems that operate across diverse warehouse environments.

We've found tremendous value in collaborative problem-solving, so our team works from our SF office three days a week.

Responsibilities

  • Develop and maintain distributed cloud GPU infrastructure for large-scale world model training and low-latency inference.
  • Build end-to-end computer vision pipelines — from data ingestion and preprocessing through model training, evaluation, and deployment — and integrate them into core product workflows.
  • Deploy and optimize state-of-the-art machine learning models, including vision-language models (VLMs) and vision-language-action models (VLAs), in the cloud using model serving platforms and inference optimization techniques.
  • Design and operate orchestration systems that enable both engineers and non-engineers to build and manage data and ML pipelines.
  • Establish monitoring, benchmarking, and evaluation frameworks to ensure model performance and reliability in production environments.

Requirements

  • B.S. / M.S. in Computer Science, Robotics, or similar technical field, or equivalent practical experience.
  • 7+ years of professional software engineering experience, with at least 3 years in machine learning infrastructure — developing, scaling, training, deploying, and optimizing large-scale ML systems from data to model.
  • Track record of deploying computer vision models in production environments with real-world constraints.
  • Experience with distributed messaging and compute systems (Kafka, gRPC, ROS2, or similar).
  • Strong programming skills in Python with solid software engineering practices.

Nice-to-haves

  • Experience developing, running, and managing orchestration systems (Flyte, Temporal, Airflow, or similar) for ML and data pipelines.
  • Proficiency with ML frameworks (PyTorch, TensorFlow, DeepSpeed) and model serving platforms (TorchServe, TensorFlow Serving, NVIDIA Triton Inference Server, or similar).
  • Deep understanding of state-of-the-art machine learning models such as auto-regressive transformers and familiarity with inference optimization techniques (TensorRT, quantization, custom kernels).
  • Experience with C++ or CUDA programming for GPU acceleration.
  • Prior experience working at autonomous vehicle or robotics companies.

Benefits

  • Top-tier medical, dental, and vision coverage
  • 401(k) with employer matching
  • Parental leave
  • Unlimited vacation
