Site Reliability Engineer

InterEx Group
Stuttgart, DE

Our client is one of the world’s leading manufacturers of semiconductor chip-making equipment. A majority of the world’s microchips receive their critical lithographic patterning in machines made by this organisation.

In addition, they produce metrology tools and advanced applications to analyze and optimize the performance of the customer production process.

Job Mission

Troubleshoot short-term problems and translate them into structural improvements to our distributed data and compute platform infrastructure.

Be accurate and precise, and help drive up the aggregate availability of the installations of these distributed computing systems in Korea, Taiwan, Israel, China, the US and elsewhere.

Be part of the computing platform that is one of the main pillars supporting the production of next-generation microchips for Apple, Samsung and many others.

Responsibilities:

  • Raise awareness in other teams of the methods and procedures we use, to help them avoid repetitive help requests.
  • Help application developers to understand the infrastructure / cluster / system
  • We are the team that is in charge of understanding & explaining how the system fits into the customer’s ecosystem
  • Share knowledge / mindset with other teams (dev / infra engineers)
  • Work cross-functionally and share knowledge among infra engineers
  • Contribute towards building VCP as a Product which meets our standards of quality
  • Increase stability and reliability of VCP by automated testing and automation
  • Customer satisfaction and product reliability
  • Improve the functionality and reliability of VCP
  • Translate customer ecosystem needs to engineering deliverables
  • Find the broken pieces of the puzzle at system / cluster level
  • Combine individual stories into a complete book
  • Make the VCP reliable by improving system resilience (bug-fixing and beyond)
  • Resolve bugs in a sustainable way (implement regression tests, design structural fixes)
  • Act as an ambassador for predictable component lifecycle management
  • Technical roadmap maintenance (App life cycle management)
  • Support feature and service requests from the field
  • Suggest improvements to our technical solutions and way of working, and implement them in alignment with your team and their stakeholders

Highly valued qualifications & experiences:

  • Experience with DC/OS
  • Experience introducing new technology with zero downtime, including data migration
  • Fan of automated testing and qualification, ideally as part of a CI/CD pipeline
  • Affinity for digging deep into the details of networking issues
  • Available to work (remotely) outside regular office hours when it turns out that our attempt to build a fail-safe system has not yet succeeded.

We really want this to be an exception, not a rule.

Required qualifications & experiences:

  • Knowledge of distributed computing systems, with practical experience (a must!)
  • Experience with build and release infrastructure: Maven, Nexus, Bamboo, GitHub
  • Familiar with at least one scripting language (Python)
  • Experience with Ansible
  • Linux expert