IT Operations Engineer (Kafka Platform Support)

💻 Ework Group - founded in 2000, listed on Nasdaq Stockholm, with around 13,000 independent professionals on assignment - we are the total talent solutions provider who partners with clients, in both the private and public sector, and professionals to create sustainable talent supply chains.

With a focus on IT/OT, R&D, Engineering and Business Development, we deliver sustainable value through a holistic and independent approach to total talent management.

By providing comprehensive talent solutions, combined with vast industry experience and excellence in execution, we form successful collaborations. We bridge clients and partners & professionals throughout the talent supply chain, for the benefit of individuals, organizations and society.

🔹 For our Client we are looking for IT Operations Engineer (Kafka Platform Support)🔹

KEY RESPONSIBILITIES

Platform Operations:

  • Operate, monitor, and support the Kafka-based messaging and event platform
  • Monitor system health using logs, metrics, and alerting tools
  • Ensure availability, stability, and performance of the platform

Incident Management & Troubleshooting:

  • Resolve incidents reported via tickets or internal channels
  • Troubleshoot issues across producers, consumers, brokers, and integrations
  • Analyze logs, metrics, and system behavior to identify root causes
  • Escalate complex or unresolved issues to engineering teams when required

Runbook-Driven Execution:

  • Execute operational tasks based on runbooks and standard operating procedures
  • Perform configuration changes (topics, access, settings) following defined processes
  • Maintain and improve runbooks based on recurring issues and learnings

Customer Support & Communication:

  • Act as primary point of contact for internal platform users
  • Support users via community channels (e.g. Slack, Teams)
  • Answer technical questions and guide users on best practices
  • Translate user issues into actionable technical insights

Collaboration & Improvement:

  • Collaborate with platform and engineering teams to resolve incidents
  • Identify recurring issues and propose improvements or automation
  • Contribute to incident post-mortems and operational improvements
  • Provide feedback on platform usability and user experience

REQUIRED SKILLS AND QUALIFICATIONS

Core Technical Skills:

  • Experience with Apache Kafka or similar event streaming platforms
  • Understanding of distributed systems concepts (e.g. partitioning, replication, scaling)
  • Strong troubleshooting and analytical skills in production environments
  • Experience working with logs, metrics, and monitoring systems
  • Knowledge of Git and experience working with version control systems
  • Knowledge of GitLab CI/CD and ability to understand and work with existing pipelines

Operations & Support Experience:

  • Proven experience in IT operations, production support, or platform support roles
  • Experience working with incident management processes and tools
  • Familiarity with runbooks, SOPs, and structured problem resolution

Communication & Collaboration:

  • Strong communication skills with the ability to explain technical topics clearly
  • Experience working with internal customers and cross-functional teams
  • Customer-oriented mindset with a proactive support approach
  • Excellent English skills (written and spoken)

Nice to Have:

  • Experience with AWS or other cloud platforms
  • Familiarity with Kubernetes and containerized environments
  • Experience with monitoring tools such as Grafana, Prometheus

WORKING MODEL:

  • Focus on daily operations and support activities
  • Hands-on, practical, and execution-oriented role
  • May include working within defined support hours or shift models

  • Contact person: karolina.malyska@eworkgroup.com

    Client code: DS00

    Do you know someone who would fit this position? Recommend a candidate by sending her/his CV to: polecenia@eworkgroup.com

  • Whistleblowing Policy, which provides guidelines for reporting misconduct can be found on Ework website: https://www.eworkgroup.com/about-us/our-responsibility

  • Locations: Remote
  • Technologies: Amazon Web Services (AWS), Git, Kafka, Kubernetes
  • Language: English