Portfolio Founder potential, realized

Across investments in enterprise and consumer at seed and early growth stages, see why portfolio founders consistently say we're the most valuable investors on their cap table.

companies

Jobs

My job alerts

ML Platform Engineer, Backend

Sciforium

Software Engineering, Data Science

San Francisco, CA, USA

Posted on Dec 8, 2025

Location

San Francisco

Employment Type

Full time

Location Type

On-site

Department

Engineering

Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from AMD engineers the team is scaling rapidly to build the full stack powering frontier AI models and real-time applications.

We offer a fast-moving, collaborative environment where engineers have meaningful impact, learn quickly, and tackle deep technical challenges across the AI systems stack.

Role Overview

This role offers a unique opportunity to work on the core systems that power Sciforium’s multimodal AI models. You’ll help build the model serving platform working across C++, Python, runtime execution, and distributed infrastructure to create a fast, reliable engine for real-time AI applications.

You’ll gain hands-on experience with performance engineering, learn how large AI models are optimized and deployed at scale, and collaborate closely with ML researchers and experienced systems engineers. If you enjoy low-level programming, care deeply about performance, and want exposure to the full AI stack, this role provides both high-impact work and strong growth potential.

Key Responsibilities

Build the model serving platform, including API, Control Plane, Billing, Monitoring, and distributed inference features.
Collaborate with ML researchers to integrate new multimodal models into production workflows.
Write reliable, maintainable code with strong testing and documentation practices.
Provide operational support for keeping our production services highly performant, available and reliable
Help troubleshoot complex issues across runtime, service, and GPU layers, working closely with other engineers.

Must-Haves

Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
3+ years of software engineering experience, with a focus on infrastructure or machine learning systems.
Strong proficiency in C++/Python/Go/Rust
Experience in building large scale ML/MLOps infrastructure
Strong collaboration and communication skills, with the ability to work effectively across engineering and ML teams.
Comfortable working from the office and contributing to a fast-moving, high-ownership team culture.

Nice to Have

Experience with ML systems engineering, open source inference engine like vLLM, Sglang, or TRT-LLM
Proficiency in CUDA or ROCm and experience with GPU profiling tools
Contributions to open-source ML or HPC infrastructure

Why Join Us

Opportunity to build frontier-scale AI infrastructure powering next-generation LLMs and multimodal models.
Work with top-tier engineers and researchers across systems, GPUs, and ML frameworks.
Tackle high-impact performance and scalability challenges in training and inference.
Access state-of-the-art GPU clusters, datasets, and tooling.
Opportunity to publish, patent, and push the boundaries of modern AI
Join a culture of innovation, ownership, and fast execution in a rapidly scaling AI organization.

Benefits include

Medical, dental, and vision insurance
401k plan
Daily lunch, snacks, and beverages
Flexible time off
Competitive salary and equity

Equal opportunity

Sciforium is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

See more open positions at Sciforium

Privacy policy Cookie policy