Talent.com
Senior Applied Scientist - Systems for ML Inference and Training Optimization, Deep Science for Systems and Services
Senior Applied Scientist - Systems for ML Inference and Training Optimization, Deep Science for Systems and ServicesAmazon Web Services Development Center Germany GmbH • Tübingen, Baden-Wurttemberg, DEU
Es werden keine Bewerbungen mehr angenommen
Senior Applied Scientist - Systems for ML Inference and Training Optimization, Deep Science for Systems and Services

Senior Applied Scientist - Systems for ML Inference and Training Optimization, Deep Science for Systems and Services

Amazon Web Services Development Center Germany GmbH • Tübingen, Baden-Wurttemberg, DEU
Vor 30+ Tagen
Stellenbeschreibung
We are seeking an exceptional Senior Applied Scientist specializing in ML Systems, training, and inference optimization to join DS3. This role requires deep expertise in performance engineering, kernel development, distributed systems optimization, and AI workload optimization across heterogeneous compute platforms. You will invent and implement novel optimization techniques that directly impact the performance and cost-efficiency of ML training and inference for AWS customers worldwide.
As a Senior Applied Scientist in DS3, you will work at the lowest levels of the software stack—writing custom CUDA kernels, optimizing PTX assembly, developing high-performance operators for GPUs and AWS Neuron, designing efficient communication patterns for multi-GPU and multi-node training, and inventing new algorithmic approaches to accelerate transformer models and emerging architectures. Your work will span from single-node inference optimization to large-scale distributed training systems, influencing the design of AWS training and inference services and setting new standards for ML systems performance across the industry.


Deep Science for Systems and Services (DS3) is a part of AWS Utility Computing (UC) which provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.


Key job responsibilities
Systems-Level Scientific Innovation: Design and implement novel kernel-level optimizations for ML inference and training workloads, including custom CUDA kernels, PTX-level optimizations, and cross-platform acceleration for CUDA and AWS Neuron SDK.
Performance Engineering Leadership: Drive 2-10× performance improvements in latency, throughput, and memory efficiency for production ML inference & training systems through systematic profiling, analysis, and optimization.
Cross-Platform Optimization: Develop and port high-performance ML operators across GPUs, AWS Inferentia/Trainium, and emerging AI accelerators, ensuring optimal performance on each platform.
Product-Level Impact: Lead the design, implementation, and delivery of scientifically-complex optimization solutions that directly improve customer experience and reduce AWS operational costs at scale.
Scientific Rigor: Produce technical documentation and internal research reports demonstrating the correctness, efficiency, and scalability of your optimizations. Contribute to external publications when aligned with business needs.
Technical Leadership: Influence your team's technical direction and scientific roadmap. Build consensus across engineering and science teams on optimization strategies and architectural decisions.
Mentorship & Knowledge Sharing: Actively mentor junior scientists and engineers on performance engineering best practices, kernel development, and systems-level optimization techniques.

About the team
Deep Science for Systems and Services (DS3) is a science organization within AWS Compute & ML Services focused on advancing AI/ML technologies at the systems level. Our team works at the intersection of machine learning and high-performance computing, developing optimizations for large model inference across diverse hardware platforms. We push the boundaries of what's possible in ML inference performance, working directly with CUDA, AWS Neuron, and other low-level compute abstractions to deliver industry-leading latency, throughput, and cost-performance for AWS customers deploying AI at scale.
About AWS

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture
AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

BASIC QUALIFICATIONS

- PhD in Computer Science, Computer Engineering, or related technical field, OR Master's degree with 8+ additional years of relevant research/industry experience.
- 5+ years of hands-on experience in performance optimization and systems programming for AI/ML workloads.
- Expert-level proficiency in CUDA programming and GPU architecture, with demonstrated ability to write high-performance custom kernels.
- Proven track record of delivering measurable performance improvements (2× or greater) in production systems.
- Strong C/C++ programming skills with experience in performance profiling tools such as NVIDIA Nsight, Linux Perf, or similar diagnostic frameworks.

PREFERRED QUALIFICATIONS

- Experience optimizing inference and/or training for large language models (LLMs) and transformer-based architectures, including MoE models, at scale.
- Hands-on experience with AWS Neuron SDK, or other non-NVIDIA AI acceleration platforms.
- Track record of optimizing ML workloads across diverse hardware: embedded devices (ARM Cortex, DSPs, NPUs) and data center GPUs (NVIDIA Ampere/Hopper).
- Experience with low-level optimization techniques including assembly-level tuning (NVIDIA PTX, x86/ARM assembly) and cross-platform kernel porting.
- Experience leading performance optimization initiatives that resulted in significant cost savings or multi-million dollar business impact.
- Proven ability to mentor and train engineers in performance engineering and low-level optimization (5+ team members or workshop instruction).
- Entrepreneurial experience or track record of driving technical vision in startup, co-founder, or product development environments.

Jobalert für diese Suche erstellen

Senior Applied Scientist Systems for ML Inference and Training Optimization Deep Science for Systems and Services • Tübingen, Baden-Wurttemberg, DEU

Ähnliche Stellen
Information Systems Security Officer – TS/SCI Clearance | Vaihingen, Germany

Information Systems Security Officer – TS/SCI Clearance | Vaihingen, Germany

Cambridge International Systems Inc • Vaihingen, DE
Quick Apply
Information Systems Security Officer – TS/SCI Clearance | Vaihingen, Germany Cambridge International Systems, Inc.Join a dynamic global team united by shared values: commitment, integrity, and pers...Mehr anzeigen
Zuletzt aktualisiert: vor über 30 Tagen
Ausbildung zum Technischen Systemplaner (m/w/d)

Ausbildung zum Technischen Systemplaner (m/w/d)

Minimax GmbH • Korntal-Münchingen, Baden-Württemberg, DE
Ein Job, der Leben, Werte und die Umwelt schützt? Den findest du bei uns! Minimax gehört zu den Marktführern im Brandschutz und steht in der Branche weltweit für innovative Technologien und exzelle...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
ELEKTRO- UND INFORMATIONSTECHNIK // DUALES STUDIUM // BACHELOR OF ENGINEERING // BEGINN: 10.2026

ELEKTRO- UND INFORMATIONSTECHNIK // DUALES STUDIUM // BACHELOR OF ENGINEERING // BEGINN: 10.2026

BITZER Kühlmaschinenbau GmbH • Rottenburg am Neckar, Baden-Württemberg, DE
UNSERE PRODUKTE KANN MAN NIRGENDWO SEHEN.ABER IHRE LEISTUNG ÜBERALL SPÜREN.Als weltweit führendes unabhängiges Unternehmen in den Bereichen Kälte-, Klima- und Wärmepumpentechnik sowie für Komfortkl...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
PhD Project – Numerical Methods for High-Performance Dynamic Simulation of Power Systems (f/m/d)

PhD Project – Numerical Methods for High-Performance Dynamic Simulation of Power Systems (f/m/d)

DIgSILENT GmbH • Gomaringen, de, DE
PhD Project – Numerical Methods for High-Performance Dynamic Simulation of Power Systems (f/m/d).Institute of Applied Mathematics at TU Delft.Research into new methodologies for high-performance dy...Mehr anzeigen
Zuletzt aktualisiert: vor über 30 Tagen
SAP Data Specialist (m/w/d)

SAP Data Specialist (m/w/d)

Mettler-Toledo (Albstadt) GmbH • Albstadt, DE
METTLER TOLEDO ist ein weltweit führender Anbieter von Präzisionsinstrumenten und Dienstleistungen.Unser Vertriebs-und Servicenetzwerk zählt zu den umfangreichsten der Branche.Unsere Produkte werde...Mehr anzeigen
Zuletzt aktualisiert: vor 2 Tagen • Gesponsert
Sales Engineer (m/w/d) Region Karlsruhe

Sales Engineer (m/w/d) Region Karlsruhe

SMC Deutschland GmbH • Süddeutschland (PLZ: 72), DE
Automation ist unsere Leidenschaft – Ihre auch? Begeistern Sie sich für neue Technologien, ergreifen die Initiative und arbeiten selbstständig? Dann sind Sie bei uns genau richtig.Gestalten Sie die...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
Masterarbeit: Bewertung & Validierung von GenAI-generierten Testspezifikationen

Masterarbeit: Bewertung & Validierung von GenAI-generierten Testspezifikationen

Akkodis • Sindelfingen, de
Masterarbeit: Bewertung & Validierung von GenAI-generierten Testspezifikationen.Der Einsatz Generativer Künstlicher Intelligenz (GenAI) zur automatischen Erstellung von Testspezifikationen und Test...Mehr anzeigen
Zuletzt aktualisiert: vor über 30 Tagen • Gesponsert
Senior Systemingenieur (m/w/d) Space & Tech

Senior Systemingenieur (m/w/d) Space & Tech

Gallmond GmbH • Tübingen, Deutschland
In einem vertraulichen Gespräch haben Sie die Möglichkeit, die spannenden Perspektiven dieser Schlüsselrolle in einem der innovativsten Technologiefelder Europas kennenzulernen.Entscheidend für die...Mehr anzeigen
Zuletzt aktualisiert: vor über 30 Tagen
Implementation Specialist Customer Service (m/w/d) Projektmanagement und Prozessoptimierung

Implementation Specialist Customer Service (m/w/d) Projektmanagement und Prozessoptimierung

Culligan Deutschland GmbH • Bietigheim-Bissingen, Baden-Württemberg, DE
Durchführung und Umsetzung abteilungsübergreifender (Digitalisierungs-) Projekte .Mitwirken bei der Reorganisation im Customer Service.Optimierung / Automatisierung von Prozessen und Arbeitsabläufe...Mehr anzeigen
Zuletzt aktualisiert: vor 1 Tag • Gesponsert
PhD - Optimization Of Multi-Scale Simulation Parameters Using Agentic AI

PhD - Optimization Of Multi-Scale Simulation Parameters Using Agentic AI

Bosch Gruppe • Renningen, DE
Do you want beneficial technologies being shaped by your ideas? Whether in the areas of mobility solutions, consumer goods, industrial technology or energy and building technology - with us, you wi...Mehr anzeigen
Zuletzt aktualisiert: vor 2 Tagen • Gesponsert
Senior Consultant (m/w/d) Machine Learning Engineering

Senior Consultant (m/w/d) Machine Learning Engineering

Deutsche Telekom AG • Leinfelden-Echterdingen, de
Bei T-Systems bieten wir unseren Geschäftskunden die richtigen Systemlösungen für ihr digitales Business.Mit unserem Portfolio stellen wir sicher, dass digitale Transformation Komplexität reduziert...Mehr anzeigen
Zuletzt aktualisiert: vor 6 Tagen • Gesponsert
Mandatory Internship Advanced Analytics starting from April 2026

Mandatory Internship Advanced Analytics starting from April 2026

Daimler Truck AG • Leinfelden-Echterdingen, Baden-Württemberg, DE
Aufgaben:Are you passionate about data and eager to apply your skills in a real-world setting? We're seeking a motivated intern to support our Cash-Flow Analytics project.Within this project, you'l...Mehr anzeigen
Zuletzt aktualisiert: vor 5 Tagen
Wirtschaftsinformatik - Data Science / Bachelor of Science (DH)

Wirtschaftsinformatik - Data Science / Bachelor of Science (DH)

Ensinger GmbH • Nufringen, Baden-Württemberg, DE
Grundlagen der Informationstechnik.Moderne IT-Technologien für die Entwicklung von Software-Systemen kennenlernen und anwenden.Planung und Steuerung von Projekten.Wechsel zwischen Dualer Hochschule...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
Duales Studium Bachelor of Science - Fachrichtung Informatik mit Ausrichtung Künstliche Intelligenz

Duales Studium Bachelor of Science - Fachrichtung Informatik mit Ausrichtung Künstliche Intelligenz

GELITA AG • Hirschhorn (Neckar), Baden-Württemberg, DE
Die GELITA-Unternehmensgruppe ist einer der führenden Hersteller von Kollagenproteinen weltweit und mit mehr als 2.Mitarbeitenden und 20 Standorten auf allen Kontinenten vertreten.Wir sind mit unse...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
Ausbildung zum Fachinformatiker (m/w/d) für Systemintegration

Ausbildung zum Fachinformatiker (m/w/d) für Systemintegration

BENSELER Holding GmbH & Co. KG • Schwieberdingen, Baden-Württemberg, DE
Fachinformatiker für Systemintegration (m/w/d) .Fachinformatiker (m/w/d) der Fachrichtung Systemintegration planen und konfigurieren IT-Systeme.Beim Auftreten von Störungen und Fehlern, die in den ...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
Portfolio and Application Manager Construction, Adhesives and Sealants (m/f/d)

Portfolio and Application Manager Construction, Adhesives and Sealants (m/f/d)

CHT Germany GmbH • Tübingen, de
Portfolio and Application Manager Construction, Adhesives and Sealants (m/f/d).Business Division: Functional Chemicals.Market Segment: Coatings and Construction.Manage and position the global produ...Mehr anzeigen
Zuletzt aktualisiert: vor über 30 Tagen • Gesponsert
Duales Studium Bachelor of Science (m/w/d) - Wein-Technologie-Management 2026

Duales Studium Bachelor of Science (m/w/d) - Wein-Technologie-Management 2026

Württembergische Weingärtner-Zentralgenossenschaft e.G. • Möglingen, Baden-Württemberg, DE
Am Anfang stand eine gute Idee.Du interessierst dich für den Lebensmittel- und Getränkebereich? Du möchtest Teil unseres Teams werden und dein Studium mit einem starken Partner beginnen? Profitiere...Mehr anzeigen
Zuletzt aktualisiert: vor 19 Tagen • Gesponsert
Entwicklungsingenieur Connected Systems (m/w/d)

Entwicklungsingenieur Connected Systems (m/w/d)

Akkodis • Sindelfingen, de
Akkodis - entstanden durch den Zusammenschluss von AKKA & Modis - ist ein weltweit führendes Unternehmen im Bereich Engineering & IT.Als globaler Partner in einer sich ständig verändernden Technolo...Mehr anzeigen
Zuletzt aktualisiert: vor über 30 Tagen • Gesponsert