Talent.com
Site Reliability Engineer

Infotree Global Solutions, Berlin, DE
12 days ago
Employment type
  • Full-time
Job description

Join an Industry Leader in Digital Experiences!

Are you ready to help shape the future of digital experiences? Our client, a leading global company, is on the hunt for talented professionals to join their mission of transforming the way people interact with digital content. With a focus on empowering creativity and innovation, this company is at the forefront of the digital revolution, providing cutting-edge tools and platforms to artists, developers, and global brands.

The Opportunity

Be a part of the Developer Services organization, where the mission is to build and maintain highly scalable, resilient, and secure services that power some of the most widely used digital platforms in the world. This role is perfect for those who are passionate about reliability engineering and want to work on the backbone of innovative, customer-facing solutions.

As a member of the Reliability Engineering team, you will:

  • Collaborate with diverse, cross-functional teams to enhance the reliability, security, and scalability of our client’s services.
  • Implement and maintain advanced monitoring and incident response protocols to ensure the highest standards of service delivery.
  • Lead and manage incident response efforts, conducting in-depth analyses to prevent future occurrences.
  • Drive automation and Infrastructure as Code (IaC) initiatives, utilizing technologies like Kubernetes, Helm, and ArgoCD.
  • Focus on service resiliency and performance, employing cutting-edge techniques such as chaos engineering.
  • Play a key role in defining and achieving Service Level Objectives (SLOs) and Service Level Indicators (SLIs).

What We’re Looking For

This role demands a blend of technical expertise and strong communication skills. Ideal candidates will have:

  • Proven experience in building and scaling distributed systems, with a deep understanding of containerization (Docker, Kubernetes) and orchestration technologies.
  • Hands-on experience with monitoring tools like Cortex, Prometheus, and Grafana.
  • Strong programming skills in languages such as Python, TypeScript, Java, or Golang.
  • A solid understanding of web services, networking, and cloud platforms, especially AWS.
  • A collaborative spirit, with a passion for continuous learning and improvement.

Why You Should Apply

This is a unique opportunity to contribute to the core platforms that support innovative digital solutions used by millions worldwide. You’ll work with a globally distributed team of experts, utilizing the latest technologies to deliver better software faster. If you’re driven by the challenge of building reliable, scalable systems and thrive in a fast-paced, collaborative environment, this is the role for you.

Apply Now

If you’re ready to make an impact and work with some of the brightest minds in the industry, we want to hear from you. Join us in our mission to change the world through digital experiences!