Site Reliability Engineer
At Infotree, meeting your career needs is a top priority. Client satisfaction is largely dependent on the resources we can provide, and we take pride in our delivery.
We have a supportive team in place to give quality people a chance to grow and challenge themselves in their roles which has resulted in that we have placed many employees in positions that have grown into lifelong careers.
We have a team of dedicated recruiters and consultant care representatives that are committed to your success and well-being.
Check out our open roles to get started.
Infotree Poland Sp. z o.o. is part of Infotree Global Solutions. Agency number : 15970.
Join an Industry Leader in Digital Experiences!
Are you ready to help shape the future of digital experiences? Our client, a leading global company, is on the hunt for talented professionals to join their mission of transforming the way people interact with digital content.
With a focus on empowering creativity and innovation, this company is at the forefront of the digital revolution, providing cutting-edge tools and platforms to artists, developers, and global brands.
The Opportunity
Be a part of the Developer Services organization, where the mission is to build and maintain highly scalable, resilient, and secure services that power some of the most widely used digital platforms in the world.
This role is perfect for those who are passionate about reliability engineering and want to work on the backbone of innovative, customer-facing solutions.
As a member of the Reliability Engineering team, you will :
- Collaborate with diverse, cross-functional teams to enhance the reliability, security, and scalability of our client’s services.
- Implement and maintain advanced monitoring and incident response protocols to ensure the highest standards of service delivery.
- Lead and manage incident response efforts, conducting in-depth analyses to prevent future occurrences.
- Drive automation and Infrastructure as Code (IaC) initiatives, utilizing technologies like Kubernetes, Helm, and ArgoCD.
- Focus on service resiliency and performance, employing cutting-edge techniques such as chaos engineering.
- Play a key role in defining and achieving Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
What We’re Looking For
This role demands a blend of technical expertise and strong communication skills. Ideal candidates will have :
- Proven experience in building and scaling distributed systems, with a deep understanding of containerization (Docker, Kubernetes) and orchestration technologies.
- Hands-on experience with monitoring tools like Cortex, Prometheus, and Grafana.
- Strong programming skills in languages such as Python, TypeScript, Java, or Golang.
- A solid understanding of web services, networking, and cloud platforms, especially AWS.
- A collaborative spirit, with a passion for continuous learning and improvement.
Why You Should Apply
This is a unique opportunity to contribute to the core platforms that support innovative digital solutions used by millions worldwide.
You’ll work with a globally distributed team of experts, utilizing the latest technologies to deliver better software faster.
If you’re driven by the challenge of building reliable, scalable systems and thrive in a fast-paced, collaborative environment, this is the role for you.
Apply Now
If you're ready to make an impact and work with some of the brightest minds in the industry, we want to hear from you. Join us in our mission to change the world through digital experiences!