T3 Platform Operations Specialist/Manager (m/w/d) Compute, JSM & SLA - remote & Berlin/FFM

Startdatum:

05/2026

Enddatum:

12/2026 + Option

Beschäftigungsart:

Freiberuflich

Region:

remote & Berlin/FFM


Beschreibung:

Für unseren Kunden suchen wir ab 05/2026 einen T3 Platform Operations Specialist/Manager (m/w/d) Compute, JSM & SLA für die voraussichtliche Dauer bis 12/2026 mit der Option auf Verlängerung. Der Einsatz ist in Vollzeit geplant. Das Projekt findet größtenteils remote und ca. 3 Tage pro Monat vor Ort in Berlin oder Frankfurt am Main statt.

Aufgaben:
- Provide Tier-3 operational ownership for Compute & Operating System services for Local Production
- Ensure operational readiness for deployments
- Ensure operational stability and responsiveness for the managed Kubernetes platform
- Reduce operational toil and improve service reliability
- Ensure platform operations adhere to security and compliance standards

Must Have:
- 5-10+ years in IT operations / service delivery / platform operations with demonstrated leadership in mission-critical environments
- Proven experience implementing/leading Incident, Problem, Change, Release governance in production
- Expertice with ITSM: Jira Service Management (JSM), Jira, Confluence
- Experience of core operations processes (incident management, change management, problem management, IT Service Management) as well as SRE concepts
- Experience in gathering operational insights from monitoring or observability including SLI/SLA/SLO management and tracking
- Hand-on experience in documenting procedures properly and enforcing clear runbooks or playbooks
- Observability Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, Mimir, Loki)
- Familiarity with enterprise DevOps toolchains is a plus (GitLab, JFrog Artifactory, Backstage, Harness)

Nice to Have:
- Experience operating in regulated / high-availability industries (banking, telco, public sector, healthcare)
- Experience with SRE practices (SLOs/SLIs, error budgets) and reliability management.