AI Infrastructure & Platform Engineer
Build and scale the underlying infrastructure for AI training and deployment
295 Open Positions
Active Positions (50)
SWE - Grids - Fixed Term Contract - 6 Months - London, UK·mid
Google DeepMind·London, UK
Jax·Power Grid Optimization
Senior Technical Solutions Engineer - Platform (Greater China Region)·senior
Databricks·Singapore
Databricks unified analytics platform·Big Data ecosystem
Platform Engineer (x/f/m) - Tech Foundations·mid
Alan·Anywhere in France, Belgium, Spain
Senior Backend Engineer - Commerce Platform·senior
Spotify·London
LLMs for workflow improvement·commerce platform systems·payment flows·invoicing systems·high-scale transaction systems
Software Engineer - Fullstack·mid
Databricks·Amsterdam, Netherlands
Databricks Lakehouse Platform·Apache Spark™·MLflow·Delta Lake
Network Engineer·mid·Remote
Anthropic·Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
software-defined networking·network automation·data center network design·high-performance networks·routing protocols·physical network builds
AI Data Infrastructure Engineer - Helix Team·mid
Figure AI·San Jose, CA
AI data infrastructure for robotics·robot data pipelines·neural network training data workflows·SLURM
Staff Backend Software Engineer - (AI Platform)·staff
Databricks·San Francisco, California
Model Serving·autoscaling·GPU serving workloads
Staff Software Engineer - Code Quality·staff
Deliveroo·London - The River Building HQ
agentic development tools·AI-assisted development·AI-generated code·Go services·quality gates
Senior Backend Engineer - Subscriptions·senior
Spotify·London
Backend Engineer (Python/Go) - Consumer Platform team·mid
Wolt·Berlin, Germany; Helsinki, Finland; Stockholm, Sweden
Site Reliability Engineering (SRE)·API gateway development·Split payments systems·Purchase core scaling·Backend system reliability·DevOps practices
Senior Software Engineer, Data Distribution·senior
Anduril·Seattle, Washington, United States
Distributed Service Bus (DSB)·mesh networking communication·denied degraded intermittent limited (DDIL) networks·multi-path routing·tactical radios integration·satellite links integration
Senior Software Engineer - JVM/Ruby/React (x/f/m)·senior
Doctolib·Berlin, Berlin, Germany
DataDog·observability·GitHub Actions·CI/CD·test-driven development·mob-programming
Backend Software Engineer, ChatGPT Enterprise·mid
OpenAI·San Francisco
ChatGPT Enterprise·Enterprise Controls·Permissions Systems·Policy Enforcement·Auditability Workflows·Compliance Workflows (HIPAA/FINRA)
Software Engineer·mid
Wolt·London, United Kingdom
AI-native development·Cursor·Claude Code·GitHub Copilot·Vercel AI SDK·MCP (Model Context Protocol)
Software Engineer, System Enablement·mid
OpenAI·San Francisco
Terraform·Chef·VMSS·instance pools·golden image provisioning·bare metal bring-up
Senior Software Engineer, Developer Experience (DevEx)·senior
Harvey AI·Bengaluru
Developer Experience (DevEx)·CI/CD Systems·Internal Frameworks·Platform Guardrails·Developer Workflow Automation·AI Feature Shipping Velocity
Software Engineer, Workload Enablement·mid
OpenAI·San Francisco
NCCL·RCCL·RDMA collectives·NVlink·distributed training performance·inference performance
Staff Software Engineer, Core Infrastructure·staff
Harvey AI·Bengaluru
Agent Architecture·steerable AI agents·verifiable AI agents·conversational AI grounding·knowledge base retrieval for AI·AI agent evals
Engineering Manager, Connect·manager
Stripe·Seattle
Event Infrastructure·Real-time ingestion systems·Backpressure control·Graceful degradation systems
Senior Engineering Manager, Core Infrastructure·senior
Harvey AI·San Francisco
prompt token processing at scale·global legal AI platform·infrastructure operational excellence·AI platform resilience·infrastructure innovation for AI·security for AI infrastructure
FullStack Engineer, AdTech·mid
Wolt·London, United Kingdom
Wolt Merchant app·Internal systems for nutrient/allergen management
Member of Technical Staff - Site Infrastructure (US Government)·staff
xAI·Los Angeles, CA; Memphis, TN
PXE·air-gapped classified environments·GPU servers·network fabrics·bare metal provisioning·end-to-end inference pipelines
Engineering Manager, Developer Productivity AI·manager·Remote
Stripe·US-Remote
GPTN Accounts·Transit State·Money as a Service (MaaS)·Global Payouts orchestration·Payments Orchestration
Backend Engineer, Music·mid
Spotify·New York, NY
promotion scores·promotion allocation·personalization systems (PZN)·ML-adjacent systems·data pipelines
Engineering Manager, Financial Data Quality·manager
OpenAI·San Francisco
data quality monitoring·data validation·data lineage·system migrations·vendor dependency reduction·financial data modeling
Senior Director of Engineering, AI Compute·senior
Anduril·Costa Mesa, California, United States
Edge AI compute platforms·Ruggedized compute·Model-Based Systems Engineering (MBSE)·AI compute architectures·Board-level solutions·Integrated systems design
Backend Software Engineer, B2B Connectors·mid
OpenAI·San Francisco
B2B Connectors Platform·connector sync infrastructure·connector indexing pipelines·control plane primitives·rollout controls·kill switches
Software Engineer Mobile, Web + Backend·mid
Wolt·Helsinki, Finland
Wolt+ subscription service·global product scaling
Senior Go Backend Engineer - Merchant Group·senior
Wolt·Helsinki, Finland; Stockholm, Sweden
Kotlin·Kafka·gRPC·PostgreSQL·Terraform
Senior Fullstack Engineer – Generative UI Platform·senior
Spotify·Toronto
generative UI prototyping·AI-native capabilities·generative UI system·prompt-based interfaces·Spotify UI Studio·fullstack generative UI
Senior Software Engineer, Nix·senior
Anduril·Costa Mesa, California, United States
Computer Vision·Machine Learning Engineering·perception systems·autonomous drones·solid rocket motors·Anduril Rocket Motor Systems (RMS)
Data Center Engineer, Resource Efficiency – Compute Supply·mid·Remote
Anthropic·Remote-Friendly, United States
Power & resource efficiency modeling·IT/OT interfaces·Load management systems·Power/thermal-aware placement·Statistical modeling of cluster utilization·Workload profile analysis
Software Engineer, AI Developer Tooling·mid
Scale AI·San Francisco, CA; Seattle, WA; New York, NY
Cursor AI development·Claude Code·OpenAI Codex·MS Copilot for development·AI development tooling frameworks·local development process for AI
Senior Software Engineer·senior·Remote
Algolia·Remote - United States
A/B testing·experimentation platforms·feature flagging·growth engineering·user onboarding·data integrations
Staff Software Engineer, Enterprise Platform·staff
Replit·Foster City, CA (Hybrid) In office M,W,F
single-tenant architectures·VPC peering·private connectivity·bring-your-own-key (BYOK)·customer-managed keys·data residency
Engineering Manager, AI Gateway·manager
Vercel·Hybrid - San Francisco
AI Gateway·unified AI model access·OpenAI·Anthropic·Google·rate limit management
Engineering Manager, Privy·manager
Stripe·NYC-Privy
modern cryptography·developer tooling·open-source work·Infrastructure engineering
Manager I, Engineering - Code Coverage·manager
Datadog·Madrid, Spain
Code Coverage·AI-Powered Developer Tooling·Test Coverage Enforcement·AI-Powered Test Quality·LLMs for Test Improvement·Agentic Engineering
Senior Software Engineer, QualityOS·senior
Anduril·Ashville, Ohio, United States
QualityOS·production quality management system·lean business process·manufacturing data analysis·custom production software·next-generation defense prime
Frontend Platform Engineer, JavaScript Infrastructure·mid
Stripe·Canada
Revenue and Financial Automation (RFA)·Stripe Billing·SaaS subscriptions·multi-stage contracts·Usage-Based Billing·customer churn prevention
Staff Software Engineer – Logs Observability Pipelines·staff
Datadog·New York, New York, USA
Product Analytics·user analytics platform·funnel queries·retention queries·behavioral data exploration·analytics storage patterns
Senior Network Engineer·senior
Together AI·San Francisco
BGP·OSPF·VXLAN·EVPN·QoS·Wireshark
IT Systems Engineer, M&A·mid
Anduril·Costa Mesa, California, United States
VMware hypervisor·Azure Arc·Azure Files·Entra ID·hybrid cloud infrastructure·on-premises infrastructure
Senior Software Engineer - Java, Ruby (x/f/m)·senior
Doctolib·Berlin, Berlin, Germany
LLM evaluation frameworks·shadow testing·staged deployment·human-in-the-loop workflows·LLM-powered clinical agents·clinical coding
Senior Software Engineer - Together Cloud Infrastructure·senior
Together AI·San Francisco
Infiniband partitioning·in-DC parallel storage provisioning·VM provisioning·IaaS software layer·GB200 data center·multi-exabyte high-performance object store
Sr Technical Solutions Engineer, Platform·senior
Databricks·Amsterdam, Netherlands
Databricks platform support·Big Data ecosystem architecture·AWS/Azure/GCP integration·Linux/Unix administration·Python/Java/Scala applications·SQL-based databases
Staff Backend Software Engineer·staff
Databricks·New York
LLM infrastructure·model serving·Vector Search·AI agents·distributed AI workloads·OpenAI integration
Senior Software Engineer - Together Cloud Platform·senior
Together AI·San Francisco
GB200/GB300 hardware virtualization·BlueField DPUs·Slurm clusters·Infiniband partitioning·multi-exabyte object store·decentralized AI workloads
Senior Software Engineer - Backend·senior
Databricks·Amsterdam, Netherlands
Scala for distributed systems·Apache Spark for data pipelines·Databricks platform development·AWS S3 integration·Azure Blob Store integration·Kubernetes for SaaS