Talent.com
Senior AI Platform Engineer (Multi-tenant SaaS & MLOps) (m/f/d)
Simon-Kucher & Partners • Cologne, DE
Posted 4 days ago
Job description

In Germany - Berlin | Bonn | Cologne | Frankfurt/Main | Hamburg | Munich

We are seeking an experienced AI Platform Engineer to contribute to designing and building scalable SaaS products within our AI Lab. In this role, you’ll combine deep technical expertise with strategic vision to build AI-powered products that will help transform our clients’ business models and enable their growth.

Simon-Kucher is at the forefront of innovation in driving commercial excellence, revamping business models, and developing solutions and methodologies for unlocking better growth for our clients. Within AI Lab, we are developing cutting-edge, large-scale AI products to deliver sustained top-line impact for our clients.

Are you interested in working in a team of AI evangelists with a can-do attitude? Want to experience the dynamics of agile processes in open-minded teams? How about getting creative in a startup atmosphere with a steep development curve and flat hierarchies? And most importantly, do you want to make a difference? Then you've come to the right place.

What makes us special:

  • Advance your career with exciting professional opportunities in our thriving company with a startup feel
  • Innovate by transforming ideas into cutting-edge AI products, championing AI and Generative AI through creative experimentation to push boundaries and deliver transformative solutions.
  • Voice your unique ideas in a culture defined by our entrepreneurial spirit, openness, and integrity
  • Feel at home working with our helpful, enthusiastic colleagues who have great team spirit
  • Broaden your perspective with our extensive training curriculum and learning programs (e.g. LinkedIn Learning)
  • Speak your mind in our holistic feedback and development processes (e.g. 360-degree feedback)
  • Satisfy your need for adventure with our opportunities to live and work abroad in one of our many international offices
  • Enjoy our benefits, such as hybrid working, daycare allowance, corporate discounts, and wellbeing support (e.g. Headspace)
  • Unwind in our break areas where you can help yourself to the healthy snacks and beverages provided
  • See another side of your coworkers at our frequent employee events, World Meetings and Holiday Parties

How you will create an impact:

  • Design and evolve a multi-tenant SaaS architecture, including tenant isolation for data, compute, and observability.
  • Build automated tenant provisioning/onboarding, configuration, and safe rollouts (canary/feature flags) across tenants.
  • Implement noisy-neighbor protection (per-tenant quotas, rate limits, priority scheduling) and per-tenant SLO monitoring.
  • Partner with security/compliance to deliver enterprise controls (audit logs, tenant-aware access control, retention).
  • Develop and maintain robust data architectures that support high-volume, high-throughput SaaS applications, focusing on reliability and scalability.
  • Drive faster and more reliable ML delivery by building robust MLOps foundations, including automated training pipelines, experiment tracking, and scalable model deployment.
  • Accelerate AI product development by operationalizing LLMs end-to-end — from fine-tuning and evaluation to high-performance serving, monitoring, and embeddings workflows.
  • Increase engineering velocity and system reliability by developing and maintaining unified CI/CD pipelines that ship ML and application code seamlessly.
  • Enable scalable and cost-efficient AI workloads through well-architected cloud infrastructure across AWS.
  • Improve performance and resilience of AI systems by managing Kubernetes clusters, optimizing autoscaling, and orchestrating GPU-heavy workloads.
  • Enhance inference speed and portability by delivering highly optimized, secure Docker-based containers tailored for ML and LLM workloads.
  • Strengthen data quality and model performance through well-designed ETL/ELT pipelines, streaming systems, feature store integration, and workflow orchestration.
  • Ensure reliable and trustworthy AI operations by implementing comprehensive observability: logs, metrics, traces, and model/data drift detection.
  • Reduce operational risk by embedding security and compliance best practices — IAM, RBAC, VPC design, secrets management, and encryption — into every layer of the stack.
  • Increase automation, reduce manual toil, and support rapid experimentation by leveraging Python, Bash, and Terraform to script, codify, and automate infrastructure and ML workflows.

About you:

  • You have shipped and operated customer-facing SaaS products used by real users at scale and bring hands-on experience operating multi-tenant SaaS with tenant isolation, per-tenant controls, and enterprise security expectations.
  • You have previously owned end-to-end ML/AI infrastructure, from data ingestion and feature pipelines to training, deployment, and production monitoring.
  • You enable engineers and data scientists to move faster by building self-service platforms, stable environments, and automated workflows that eliminate friction.
  • You have a track record of designing systems that scale globally across regions, workloads, and traffic patterns.
  • You’re comfortable participating in incident response and on-call rotations, and you know how to stabilize and improve critical production systems.
  • You think with a product mindset, focusing on customer value, reliability, and speed-to-market rather than technology for its own sake.
  • You have a strong bias for automation — you eliminate manual operational toil by designing robust tooling and pipelines.
  • You have very strong communication and collaboration skills: supporting other engineers, collaborating asynchronously, explaining technical decisions to non-technical audiences, writing documentation, and showing initiative.

Technical skills required:

  • Proven patterns for tenant isolation (DB-per-tenant vs schema-per-tenant vs row-level security), plus tenant-aware caching and noisy-neighbor protection (rate limits, quotas, scheduling).
  • Experience with OIDC/OAuth2, tenant-aware RBAC/ABAC, SCIM provisioning, and audit logging requirements for B2B SaaS.
  • Deep Kubernetes experience: cluster ops, HPA/VPA, node pools, GPU scheduling, cluster autoscaler/Karpenter, PDBs, network policies, and multi-AZ design.
  • Service mesh (Istio/Linkerd) and ingress patterns (ALB/Nginx), plus secure egress and mTLS (where applicable).
  • Strong requirement for Infrastructure as Code beyond Terraform basics: Terraform modules, Terragrunt, policy-as-code (OPA/Conftest), and secrets automation.
  • GitOps (ArgoCD/Flux) and progressive delivery (Argo Rollouts/Flagger), feature flags, canaries, and blue/green.
  • Model lifecycle tooling: MLflow/W&B, model registry, experiment tracking, reproducible training, and dataset versioning (DVC/lakeFS).
  • Pipeline orchestration (Airflow/Prefect/Dagster) + artifact stores.
  • Model serving patterns: online serving (KServe/Seldon/BentoML/Ray Serve), async/batch inference, autoscaling, and rollback strategies.
  • Experience with prompt/version management, offline + online evaluation harnesses, RAG evaluation (retrieval metrics, groundedness), guardrails, and red-teaming basics.
  • Handling streaming inference (SSE/WebSockets), caching, routing, and fallback models.
  • Vector DB experience (pgvector/Pinecone/Weaviate/Milvus) and embedding lifecycle (backfills, re-embedding, indexing strategies).
  • Explicit requirement for OpenTelemetry, tracing, and SLOs. Tools: Prometheus/Grafana, Loki/ELK, Datadog/New Relic (whatever you standardize on).
  • Incident mgmt: postmortems, runbooks, error budgets.
  • Requirements aligned to enterprise buyers: GDPR, encryption at rest/in transit, secrets mgmt (AWS Secrets Manager/Vault), KMS, key rotation.
  • SOC 2/ISO 27001 familiarity, vulnerability scanning (Trivy/Grype), SBOMs, SAST/DAST, dependency management.

Have we sparked your interest? Simply click the 'Apply now' button to submit your application. Please note that, for data protection reasons, we cannot accept applications via email.

Would you like to learn more about us and our company culture? Click here to watch our recruitment video.

About Simon-Kucher

Simon-Kucher is a global consultancy with more than 2,000 employees in 30+ countries.

Our sole focus is on unlocking better growth that drives measurable revenue and profit for our clients. We achieve this by optimizing every lever of their commercial strategy – product, price, innovation, marketing, and sales – based on deep insights into what customers want and value. With 40 years of experience in monetization topics of all kinds, we are regarded as the world’s leading pricing and growth specialist. simon-kucher.com

We believe in building a culture that embraces diversity, equity, and inclusion, creating an environment in which our people feel valued, are able to be themselves and feel their contribution matters. If we get that right, remarkable things will happen; people will grow faster, innovate, feel valued, and create better outcomes for everyone – our people, our clients and, of course, our business.

Your personal contact:

Maria Weininger

recruitment.germany(at)simon-kucher.com

Please submit your application exclusively via the “Apply now” button!

Better growth starts here. With you.
