Talent.com
Senior Systems Engineer (AI Cloud Infrastructure)
MULTIVERSE COMPUTING • München, Bavaria, Germany
Posted 17 hours ago
Job description

Multiverse Computing

Multiverse is a well-funded, fast-growing deep-tech company founded in 2019. We are the largest quantum software company in the EU and have been recognized by CB Insights (2023 and 2025) as one of the 100 most promising AI companies in the world.

With 180 employees and growing, our team is fully multicultural and international. We deliver hyper-efficient software for companies seeking a competitive edge through quantum computing and artificial intelligence.

Our flagship products, CompactifAI and Singularity, address critical needs across various industries:

CompactifAI is a groundbreaking compression tool for foundational AI models based on Tensor Networks. It enables the compression of large AI systems, such as language models, to make them significantly more efficient and portable.

Singularity is a quantum and quantum-inspired optimization platform used by blue-chip companies to solve complex problems in finance, energy, manufacturing, and beyond. It integrates seamlessly with existing systems and delivers immediate performance gains on classical and quantum hardware.

You'll be working alongside world-leading experts to develop solutions that tackle real-world challenges. We're looking for passionate individuals eager to grow in an ethics-driven environment that values sustainability and diversity.

We're committed to building a truly inclusive culture. Come and join us.

Role description

We are looking for a Senior Engineer to lead a critical initiative within our Platform Engineering team: building the software layer for the AI Gigafactory. In this role, you will move beyond consuming public cloud resources to architecting and building a private Neo-cloud from the ground up. You will design the control planes that manage high-performance compute clusters, orchestrate thousands of GPUs, and optimize the hardware-software interface for massive AI workloads.

This role sits at the intersection of High-Performance Computing (HPC), Kubernetes Internals, and Bare Metal Engineering.

What you will be doing

Building the Control Plane: Designing and developing the software layer (APIs, Controllers, Agents) that automates the lifecycle of bare-metal AI infrastructure.

Orchestrating High-Scale Compute: Architecting scheduling solutions for large-scale distributed training jobs across massive clusters of GPUs (NVIDIA H200 / B200 / B300), ensuring efficient bin-packing and gang scheduling.

Optimizing the Fabric: Tuning the software-defined networking layer to support low-latency interconnects (InfiniBand / RDMA / RoCEv2) essential for multi-node training.

Developing Kubernetes Extensions: Writing custom Kubernetes Operators and CRDs to abstract complex hardware realities (topology awareness, GPU partitioning) into usable interfaces for our Data Scientists.

Hardware-Level Debugging: Investigating and resolving deep systems issues, ranging from PCIe bus errors and NCCL communication timeouts to kernel panics on bare-metal nodes.

Defining Standards: Creating the Golden Image for AI workloads, managing drivers, firmware, and OS optimizations to squeeze maximum performance out of the hardware.

Requirements

Systems Programming Expertise: 10 years of software engineering experience with strong proficiency in Go (Golang), C, or Rust. You must be comfortable building system agents, APIs, and CLI tools.

Deep Kubernetes Knowledge: You understand K8s internals beyond simple deployment. Experience with Custom Resource Definitions (CRDs), Operators, and the Kubernetes API server architecture.

GPU Ecosystem Experience: Hands-on experience managing NVIDIA GPU clusters. Familiarity with NVIDIA drivers, the CUDA toolkit, and the container runtime (NVIDIA Container Toolkit).

Linux Internals: Deep understanding of the Linux kernel, cgroups, namespaces, and system performance tuning.

Infrastructure as Code: Mastery of declarative infrastructure tools (Terraform, Ansible), but with a focus on provisioning physical hardware rather than just cloud VMs.

Problem Solving: A proven track record of debugging complex distributed systems where the root cause could be code, network, or silicon.

Preferred qualifications

HPC Background: Experience working with traditional supercomputing schedulers (Slurm, PBS) or modern batch schedulers (Volcano, Kueue, Ray).

Bare Metal Provisioning: Experience with tools like Cluster API (CAPI), Metal3, Tinkerbell, Canonical MaaS, or OpenStack Ironic.

High-Speed Networking: Knowledge of RDMA, InfiniBand, GPUDirect, and how to expose these technologies to containerized workloads.

AI / ML Familiarity: Understanding of how distributed training works (e.g. PyTorch Distributed, Megatron-LM, DeepSpeed) and the infrastructure requirements of Large Language Models (LLMs).

Observability: Experience building monitoring for hardware health (DCGM) and distributed tracing for long-running jobs.

Location: Applicants must have legal authorization to work in the country where the position is based.

Perks & Benefits

Indefinite contract.

Equal pay guaranteed.

Variable performance bonus.

Signing bonus.

Relocation package (if applicable).

Private health insurance.

Eligibility for educational budget according to internal policy.

Hybrid opportunity.

Flexible working hours.

Working in a fast-paced environment on cutting-edge technologies.

Career plan. Opportunity to learn and teach.

Progressive company. Happy people culture.

As an equal opportunity employer, Multiverse Computing is committed to building an inclusive workplace. The company welcomes applications from people of all backgrounds, including all ages, citizenships, ethnic and racial origins, gender identities, marital statuses, religions and ideologies, and sexual orientations, as well as individuals with disabilities.

Employment Type: Full Time

Vacancy: 1
