Senior Site Reliability Engineer
Workplace: Stockholm, Sweden
Expires: November 1, 2025
Seeking a Senior Site Reliability Engineer to design, implement, and optimize cloud-native infrastructure for diverse clients, ensuring system reliability, scalability, and performance.
Main requirements:
- 5+ years of experience in SRE, DevOps, or Platform Engineering roles
- Expert-level knowledge of Google Cloud Platform services
- Extensive Kubernetes orchestration experience
- Strong Apache Kafka expertise
- Proficiency with infrastructure as code tools
- Experience building and operating internal developer platforms using Crossplane
- Understanding of Helm chart development and lifecycle management
Responsibilities:
- Design and maintain scalable infrastructure on Google Cloud Platform tailored to client environments
- Manage and optimize Kubernetes clusters across projects
- Architect and maintain Apache Kafka streaming pipelines and event-driven architectures
- Implement infrastructure as code using Terraform, Ansible, or similar tools
- Build internal developer platforms for self-service infrastructure provisioning
- Develop Helm charts and Kubernetes operators to standardize deployments
- Collaborate with developer teams to define platform APIs and CI/CD workflows
- Define SLIs, SLOs, and error budgets, and build observability solutions for client systems
- Conduct capacity planning and performance optimization, and lead incident response
- Implement security best practices and ensure compliance with industry regulations
- Manage secrets, certificates, and access controls across environments
Required hard skills:
- Google Cloud Platform (GKE, Cloud Run, BigQuery, Pub/Sub)
- Kubernetes cluster management and troubleshooting
- Apache Kafka cluster management and stream processing
- Infrastructure as code tools (Terraform, Pulumi, CloudFormation)
- Crossplane for control-plane provisioning
- Helm chart development and templating
- Monitoring and observability tools (Prometheus, Grafana, ELK stack)
- Service mesh technologies (Istio, Linkerd)
- Networking, load balancing, distributed systems
- Database technologies (PostgreSQL, Redis, MongoDB)
Recommended hard skills:
- Ansible
- CloudFormation
- Pulumi
- API gateways
- Advanced database operational knowledge
Soft skills:
- Excellent client communication and presentation skills
- Ability to translate client requirements into technical solutions
- Multitasking and managing multiple clients simultaneously
- Ability to provide strategic infrastructure guidance
Coding languages:
- Go
- Python
- YAML
- HCL (Terraform)
Frameworks:
- Kubernetes
- Helm
- Crossplane
- Istio
- Linkerd
Operating systems:
- Linux
Natural languages:
- English (Proficient)
Cultural skills:
- Client-focused consultancy mindset
- Adaptability to diverse industry clients
- Collaborative teamwork