Senior Java Developer (SRE)
About the Role
We are seeking a Senior Java Developer with an SRE-focused mindset to help design, build, and maintain high-volume, event-driven distributed systems. You will play a key role in driving performance, reliability, and scalability, both at the service level and across the wider architecture.
This role combines strong engineering expertise with operational excellence. You’ll be expected to contribute to system design discussions, capacity planning, and performance tuning, while also ensuring reliability through observability, automation, and incident management best practices.
You will collaborate closely with DevOps, infrastructure, and data teams to deliver resilient, high-performing systems, while also acting as a technical leader who can independently take ownership of critical initiatives.
What will I be doing?
- Design and implement large-scale, event-driven distributed systems from conception to production.
- Optimize performance and reliability at both service and architectural levels.
- Apply strong knowledge of algorithms and data structures to solve complex scalability challenges.
- Monitor, measure, and improve production systems to ensure resilience and high availability.
- Drive adoption of observability, automation, and reliability engineering practices across the stack.
- Participate in incident management and root cause analysis, ensuring issues are resolved and do not recur.
- Mentor and guide engineers, fostering technical excellence and an ownership-driven culture.
- Collaborate with cross-functional teams (QA, Operations, Platform, DevOps) to deliver stable and performant releases.
- Participate in on-call rotations to support production systems.
What are we looking for?
- BSc in Computer Science (or related field) or equivalent hands-on experience.
- 7+ years of Java/JEE development experience, with deep knowledge of concurrency, multithreading, and high-performance systems.
- Proven experience in designing and implementing event-driven distributed systems at scale.
- Strong background in algorithms, data structures, and systems architecture.
- Experience with system performance tuning, capacity planning, and reliability improvements.
- Practical knowledge of Linux environments and production troubleshooting.
- Familiarity with SRE practices such as monitoring, alerting, observability, automation, and chaos testing.
- Strong communicator with the ability to lead discussions, influence design decisions, and mentor peers.
- A systematic problem solver with a high sense of ownership and accountability.