Skip to content
Anthropic logo

Staff + Senior Software Engineer, Inference

AnthropicGenerative AI, company
San Francisco, United States$320,000 - $485,000 USDLead
Software Engineering

About the role

Design and build distributed systems for AI model inference at scale.

  • The Inference team builds and maintains the critical systems that serve Claude to millions of users worldwide, handling compute-agnostic inference deployments across diverse AI accelerators.
  • The team focuses on maximizing compute efficiency and enabling breakthrough research by providing high-performance inference infrastructure.
  • Key Responsibilities Design, build, and maintain distributed systems for serving Claude.
  • Develop intelligent request routing, load balancing, and traffic management systems.
  • Maximize compute efficiency through autoscaling and orchestration.
  • Build and operate production-grade deployment pipelines for new models.
  • Provide high-performance inference infrastructure for researchers.
  • Requirements Significant software engineering experience, particularly with distributed systems.
  • Results-oriented with a bias towards flexibility and impact.
  • Willingness to pick up slack and enjoy pair programming.
View original posting →

Tech stack

PythonRustKubernetesAWSGoogle CloudAzureDockerCI/CDgRPCREST API

Match insights

Tech:Python, Rust, Kubernetes, AWS, Google Cloud
Level:Lead

More roles at Anthropic

View open roles at Anthropic