As a SRE Team Lead, you will establish and lead a Site Reliability Engineering team embedded within OBI’s product development units. Your mission is to ensure the operational stability, availability, and performance of all OBI digital platforms - including systems running on AWS, GCP, and SaaS-based solutions.

You and your team are accountable for monitoring, alerting, incident response, and post-mortem analysis across these systems. In addition to operational ownership, you act as a coach and reliability advocate, helping development teams design, build, and operate resilient services.

Together with OBI’s platform owner, you will continuously define, evolve, and enforce SRE standards and best practices, ensuring a culture of reliability and shared responsibility across all digital touchpoints.

Key Responsibilities:

  • Build and lead a high-performing SRE team responsible for the operational reliability and performance of OBI’s digital platforms across AWS, GCP, and SaaS environments.
  • Take ownership of monitoring, alerting, incident response, and post-mortem analysis for all related digital platform systems.
  • Collaborate closely with product engineering teams to embed SRE principles into design, development, and deployment lifecycles.
  • Act as a coach and reliability champion, enabling product teams to adopt best practices in observability, automation, scalability, and fault tolerance.
  • Partner with OBI’s platform owner to align on reliability targets (SLOs/SLIs), incident response procedures, and operational maturity goals.
  • Lead continuous improvement initiatives based on operational learnings and post-incident reviews.
  • Contribute to architecture and capacity planning to ensure resilience, efficiency, and scalability.
  • Maintain and evolve shared SRE frameworks, documentation, and reliability tooling.
  • Promote a culture of accountability, transparency, and proactive problem solving.

  • Motivizer Benefits Platform to choose and manage all your benefits in one place. You receive a budget (550 PLN monthly). You can choose medical care package, meal tickets, sports cards (we have Multisport and on preferential terms, we have membership cards to one of the most popular Gyms), cinema tickets, shop vouchers, discounts and many more.
  • Language Courses – you'll have access to a multi-language learning platform enabling you to practice you language skills and learn new ones!
  • Regular and systematic further training opportunities - both internally and from external providers. We support your ongoing learning and development.
  • Cooperation within  an internal community is our everyday reality. We have networking events, coding challenges, and company parties for different occasions.

  • Successfully completed university studies in computer science comparable courses of study.
  • Proven experience building or leading SRE or DevOps teams responsible for production systems in multi-cloud or hybrid environments.
  • Strong technical background in distributed systems, infrastructure automation, and modern deployment architectures (e.g. Kubernetes, microservices).
  • Hands-on experience with CI/CD pipelines, observability stacks (metrics, logging, tracing), and incident management frameworks.
  • Strong communicator able to bridge engineering, operations, and business perspectives.
  • Demonstrated ability to define and maintain reliability objectives and operational KPIs across diverse technology stacks.
  • Mindset of continuous improvement and a passion for mentoring and enabling others. 
  • Fluency in English and Polish.
  • Openness to work in hybrid model and visit Katowice office on regular basis.
  • Willingness to travel occasionally to OBI’s locations in Cologne and Wermelskirchen.

 Develop yourself and the digital future – at Reply!

Reply is made up of a network of highly specialised companies, which support leading industrial groups in defining and developing business models to optimise and integrate processes, applications and devices, using new technology and communication paradigms, such as Artificial Intelligence; Big Data; Cloud Computing; Digital Communication; Internet of Things; Mobile and Social Networking.