Dev Architect (f/m/d) for the Autonomous Operations Platform (AIOps) - BTP Cross Projects Unit
About this role
We help the world run better. At SAP, we keep it simple: you bring your best to us, and we'll bring out the best in you. We're builders touching over 20 industries and 80% of global commerce, and we need your unique talents to help shape what's next. The work is challenging – but it matters. You'll find a place where you can be yourself, prioritize your wellbeing, and truly belong.
What you`ll do
We’re not just offering a job, we invite you to shape the future of cloud-native infrastructure at SAP and in all of Europe. We have committed to open source by donating our projects into the neutral NeoNephos foundation. We are transforming cloud infrastructure operations through AI at SAP. You'll build autonomous operations capabilities within the “Apeiro Reference Architecture” ApeiroRA as part of the EU's IPCEI-CIS initiative to strengthen Europe’s digital sovereignty, solving complex challenges that eliminate manual incident response, and enable predictive detection across distributed cloud-edge environments. Working within the Apeiro ecosystem and the Linux Foundation's NeoNephos community, you'll collaborate with platform engineers, architects, and SRE teams to build production-grade systems that reduce mean time to resolution.
The Role
In your role as Development Architect, you'll conceptualize and detail out the autonomous operations platform. You'll plan and design distributed systems that enable AI-driven incident management, building integrations with AI/ML services, and Kubernetes-native operators that autonomously remediate infrastructure issues based on AI insights. Your technical leadership will influence design and implementation approaches of the autonomous operations platform within the Apeiro Reference Architecture. You'll tackle challenges like telemetry correlation across logs/metrics/traces, automated root cause analysis, and knowledge graph systems that power runbook automation. You’ll design production systems that effectively utilize AI/ML services while ensuring core functionality remains intact during infrastructure failures and network partitions.
What you bring
Must-have Skills
- Technical Leadership: 5+ years of experience in a software architecture role with a proven track record of finding solutions to complex problems
- Expert Programming: Deep expertise in languages such as Python, Java, Go with focus on distributed systems, service integration, and cloud-native architecture at scale
- Cloud & Kubernetes: Expert-level Kubernetes skills including operator development, custom controllers, and production operations across multi-cloud environments
- Networking Expertise: Strong understanding of network technologies including IPv4/IPv6, VLANs, L2/L3 firewalls, routing protocols, and DNS
- Language Skills: Fluency both written and spoken in English and German
Nice-to-have Skills
- AI/ML Integration: Strong understanding of how to effectively consume AI/ML services, interpret model outputs, and integrate intelligent capabilities into operational workflows
- Advanced Systems: Experience building production systems that integrate AI/ML services, telemetry correlation engines, knowledge graphs, or intelligent automation platforms
- Innovation Drive: Demonstrated ability to pioneer solutions for autonomous operations, with contributions to open-source infrastructure or observability projects preferred
- Web Development: Experience with frameworks like Angular, React, or Vue.js and TypeScript would be beneficial
Meet your team
You'll join BTP Cross Projects, a strategic unit that collaborates across different board areas and lines of business at SAP on high-impact initiatives. We work on projects that drive meaningful customer value and shape the future of SAP's technology landscape.