Talent.com

Reliability engineer Jobs in Köln

Jobalert für diese Suche erstellen

Reliability engineer • koln

Zuletzt aktualisiert: vor 8 Stunden
Site Reliability Engineer (fmx)

Site Reliability Engineer (fmx)

ilert GmbHCologne, North Rhine-Westphalia, Germany
Hybrid Cologne (Rheinauhafen) 3 days in the office 2 remote (Tue Thu).Keep the world awake build reliability at scale.DevOps & IT teams detect fix and communicate incidents faster.Our plat...Mehr anzeigenZuletzt aktualisiert: vor über 30 Tagen
(Senior) Software Engineer / AI Engineer

(Senior) Software Engineer / AI Engineer

DL RemoteKöln, Nordrhein-Westfalen, DE
Quick Apply
DL Remote is a talent network for remote-ready or relocation-supported jobs at outstanding companies.We are currently filling a key role as (Senior) Software Engineer / AI Engineer (m / f / d) at ...Mehr anzeigenZuletzt aktualisiert: vor 25 Tagen
Mid-Senior IT Professional (Multiple Opportunities)

Mid-Senior IT Professional (Multiple Opportunities)

Hire Resolve.comCologne, NRW, DE
Quick Apply
Hire Resolve is assisting IT organisations in hiring experienced IT professionals to support Germany-based operations.This is a multi-role opportunity spanning several functions across Information ...Mehr anzeigenZuletzt aktualisiert: vor über 30 Tagen
Site Reliability Engineer

Site Reliability Engineer

Third RepublicCologne, North Rhine-Westphalia, Germany
Third Republic hat sich mit einem Kölner Startup gepartnert, das innerhalb von nur 3 Jahren nach Gründung den Status eines Marktführers erreicht hat. Das Unternehmen bietet eine einzigartige Kombina...Mehr anzeigenZuletzt aktualisiert: vor über 30 Tagen
QA Engineer

QA Engineer

ArangoCologne, DE
Quick Apply
Quality Assurance Engineer About ArangoDB At Arango , we believe the first generation of enterprise AI missed something essential : context. LLM models are powerful, but they don't understand the con...Mehr anzeigenZuletzt aktualisiert: vor 27 Tagen
Service Engineer

Service Engineer

SolplanetCologne, NRW, DE
Quick Apply
At SOLPLANET, we are driven by a simple idea : solar for everybody.We strive to create the best possible experience for distributors, installers and end users. That´s why our solar inverters, energy ...Mehr anzeigenZuletzt aktualisiert: vor über 30 Tagen
  • Gesponsert
Site Reliability Engineer (m / w / d) - E2E Observability & Dev(Sec)Ops Enablement

Site Reliability Engineer (m / w / d) - E2E Observability & Dev(Sec)Ops Enablement

RheinEnergie AGKöln, Nordrhein-Westfalen, Deutschland
Site Reliability Engineer (m / f / d) – E2E Observability & Dev(Sec)Ops Enablement.People who are passionate about meeting the needs of our customers. Together, we ensure the secure supply of energy...Mehr anzeigenZuletzt aktualisiert: vor 24 Tagen
Site Reliability Engineer (m / f / d)

Site Reliability Engineer (m / f / d)

gridscale GmbHKöln, NW, DE
At our company, it’s all about #OneTeam! Join gridscale and help shape the future of the cloud together with OVH.As a leading tech company, we’ve been working for over two decades to reduce our env...Mehr anzeigenZuletzt aktualisiert: vor 28 Tagen
QA Engineer- Salesforce Specialist

QA Engineer- Salesforce Specialist

TechBiz Global GmbHBergisch Gladbach, NW, DE
At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio.We are currently looking for a. QA Engineer- Salesforce Specialist to join one of our.If you're looking ...Mehr anzeigenZuletzt aktualisiert: vor 17 Tagen
MechanicalCAD Design Engineer (fmx) based in Cologne or Berlin

MechanicalCAD Design Engineer (fmx) based in Cologne or Berlin

L1VE GmbHCologne, North Rhine-Westphalia, Germany
Help us shape the future of immersive media.L1VE GmbH is a dynamically growing media and technology company that is revolutionizing the way people experience sports music and entertainment.With sta...Mehr anzeigenZuletzt aktualisiert: vor 9 Tagen
AI Platform Engineer (Multi-tenant SaaS & MLOps) (m / f / d)

AI Platform Engineer (Multi-tenant SaaS & MLOps) (m / f / d)

Simon-Kucher & PartnersCologne, DE
AI Platform Engineer (Multi-tenant SaaS & MLOps) (m / f / d).Berlin| Bonn | Cologne | Frankfurt / Main | Hamburg | Munich. We are seekingan experienced AI Ops Engineer to contribute to designing and build...Mehr anzeigenZuletzt aktualisiert: vor 12 Tagen
Site Reliability Engineering Manager, Managed Operations (m / w / d)

Site Reliability Engineering Manager, Managed Operations (m / w / d)

AWS European Sovereign Cloud Development Center GmbHCologne, DE
Über uns AWS is set to introduce the inaugural European Sovereign Cloud (ESC), marking a significant development in utility computing (UC). To spearhead this initiative, we are actively seeking expe...Mehr anzeigenZuletzt aktualisiert: vor 19 Tagen
  • Neu!
DevOps Engineer (Azure cloud)

DevOps Engineer (Azure cloud)

AvengaKöln, Nordrhein-Westfalen, .DE
Quick Apply
At Avenga, we believe that human creativity empowers technology that matters.Operating globally, our 6000+ specialists provide a full spectrum of services, including business and tech advisory, ent...Mehr anzeigenZuletzt aktualisiert: vor 8 Stunden
DevOps & Cloud Administration für Einsteiger : innen (m / w / d) - (IHK-Zertifikat)

DevOps & Cloud Administration für Einsteiger : innen (m / w / d) - (IHK-Zertifikat)

Syntex GmbHBergisch Gladbach, Nordrhein-Westfalen, DE
AZAV-zertifizierter Bildungsträger mit Fokus auf digitale Zukunftskompetenzen.Wir sind spezialisiert auf praxisnahe Trainings in App-, Web-m Cloud- und KI-Technologien. In unseren Programmen machen ...Mehr anzeigenZuletzt aktualisiert: vor 4 Tagen
Reliability Engineer (m / w / d)

Reliability Engineer (m / w / d)

MomentiveDE Leverkusen
In dieser Rolle bist du verantwortlich für eine gelebte, proaktive Sicherheitskultur, in der „nobody gets hurt“ Realität ist. Zudem prüfst du den gezielten Einsatz von KI in der Instandhaltung.Du en...Mehr anzeigenZuletzt aktualisiert: vor über 30 Tagen
International Project Manager Data Reliability (m / w / d) (Vollzeit, unbefristet)

International Project Manager Data Reliability (m / w / d) (Vollzeit, unbefristet)

DKMSCologne, North-Rhine-Westphalia, Germany
Ein guter Job kann so viel bewirken.Bei uns sogar zweite Lebenschancen!.Um noch mehr Patient : innen zu helfen, brauchen wir regelmäßig Verstärkung von engagierten und hochqualifizierten Mitarbeitend...Mehr anzeigenZuletzt aktualisiert: vor 3 Tagen
Platform Engineer (m / f / d)

Platform Engineer (m / f / d)

Redcare PharmacyKöln, DE
Redcare Pharmacy is powered by passionate teams and cutting-edge innovation.We strive to create a healthy collaborative work environment where every employee feels valued and inspired to contribute...Mehr anzeigenZuletzt aktualisiert: vor 25 Tagen
AI Engineer

AI Engineer

JUPUS GmbHCologne, North Rhine-Westphalia, Germany
Homeoffice
Quick Apply
With an established suite of AI-powered products already serving customers, we’re now scaling our capabilities to deliver even more value. This role lets you ship generative AI features, help build ...Mehr anzeigenZuletzt aktualisiert: vor 14 Tagen
Backup / Restore Engineer (w / m / d) – Enterprise Backup & Cloud

Backup / Restore Engineer (w / m / d) – Enterprise Backup & Cloud

indivHR | We IT RecruitingKöln, Nordrhein-Westfalen, .DE
Quick Apply
Willkommen bei indivHR, wo deine Karriere und Individualität an erster Stelle stehen.Wir sind nicht nur Experten im IT Recruiting – wir sind deine persönlichen Karriereberater.HR steht für ind...Mehr anzeigenZuletzt aktualisiert: vor über 30 Tagen
Diese Stelle ist in deinem Land nicht verfügbar.
Site Reliability Engineer (fmx)

Site Reliability Engineer (fmx)

ilert GmbHCologne, North Rhine-Westphalia, Germany
Vor 30+ Tagen
Stellenbeschreibung

Location : Hybrid Cologne (Rheinauhafen) 3 days in the office 2 remote (Tue Thu)

Team : Engineering Reports to CTO

Keep the world awake build reliability at scale

ilert helps thousands of DevOps & IT teams detect fix and communicate incidents faster.

Our platform is mission-critical : customers rely on us 24 / 7 to keep their always-on businesses running.

As a Site Reliability Engineer at ilert youll own the reliability performance and scalability of our core platform across AWS Kubernetes Kafka and more.

Tasks

Build & operate a highly available platform

  • Run and evolve our AWS-based infrastructure
  • Operate and optimize self-managed Kafka ClickHouse clusters and our Observability stack
  • Ensure resilience disaster recovery and capacity planning across the stack

Improve reliability & performance

  • Build and maintain SLOs SLIs error budgets and observability dashboards
  • Debug production issues across layers (networking Kubernetes application DB)
  • Improve performance of our ingestion pipeline
  • Automation & tooling

  • Automate operations with Terraform Helm Kubernetes operators and internal tooling
  • Build tooling for safer deploys blue / green rollouts and automated verification
  • Strengthen incident response workflows through deep collaboration with our AI SRE agent team
  • Security & compliance

  • Implement best practices for workload isolation secrets management IAM and auditability
  • Support our ISO27001 posture by automating controls and hardening our infrastructure
  • Cross-functional impact

  • Partner with Backend AI and Product teams to design reliable services
  • Participate in on-call rotation
  • Lead post-incident reviews and drive reliability improvements long-term
  • Requirements

  • 3 years experience as SRE Platform Engineer DevOps Engineer or Infrastructure Engineer
  • Strong hands-on experience with AWS Kubernetes Linux internals networking performance tuning
  • Experience operating self-managed distributed systems ideally Kafka or ClickHouse
  • Strong understanding of observability
  • Experience automating infrastructure with Terraform and CI / CD systems
  • Fluent English (our working language); German optional
  • Benefits

  • Product-centric - 100 % focused on solving a mission-critical pain felt by every always-on business
  • Hybrid freedom - 2 days remote by default; gorgeous Rheinauhafen roof terrace when youre in town
  • Focus >
  • meetings - We time-box syncs favour async docs and protect maker time

  • 28 days off - plus public holidays
  • Commute perks - subsidised public transport
  • Key Skills

    Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting

    Employment Type : Employee

    Experience : years

    Vacancy : 1