Back to positions

Member of Technical Staff, Inference (Bay Area, Remote)

Remote role Full-time Open position

What You’ll Do Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation) Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks What You’ll Bring Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years) Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go) Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness Apply To This Job

Further positions

Member of Technical Staff, Training (Bay Area, Remote)

Remote role Full-time

Marketing Analyst (Attribution Focus) (Promova)

Remote role Full-time

Student and Family Experience Manager (Immediate Opening)

Remote role Full-time

Customer Sales Representative (remote work)

Remote role Full-time

Account Manager Industrial Markets Region: France - Africa

Remote role Full-time

VP of Engineering

Remote role Full-time

Member of Technical Staff, Foundation Models (Bay Area)

Remote role Full-time

Member of Technical Staff, Data Agent (Bay Area, Remote)

Remote role Full-time

Member of Technical Staff, Platform (Bay Area, Remote)

Remote role Full-time

Account Manager Industrial Markets Region: Europe - Middle Eas

Remote role Full-time

Senior Software Engineer

Remote role Full-time

Experienced Customer Support Specialist – Remote Healthcare Industry Position

Remote role Full-time

Accounts Payable And Accounts Receivable Specialist

Remote role Full-time

Experienced Customer Insights Manager – Healthcare Industry Research and Strategy Development

Remote role Full-time

Senior Testability Engineer

Remote role Full-time

Experienced Customer Service Representatives – Remote Customer Support Team at arenaflex

Remote role Full-time

Remote Live Chat Customer Support Specialist – Flexible Hours, Home‑Based, Full‑Time, Customer Experience Champion

Remote role Full-time

Sr Manager of Clinical Compliance (Remote)

Remote role Full-time

Associate Counsel - Queens, NY (Remote)

Remote role Full-time

Experienced Full Stack Data Entry Clerk – Remote Data Entry and Database Management

Remote role Full-time