Senior DevOps / Infrastructure Engineer — EU
Location: Remote, EU (Central European Time) · Zagreb preferred
Type: Full-time
About the Role
SuiteFiles builds the intelligent workspace that professional services firms (accountants, lawyers, and advisors) rely on every day: document management, creation, digital signing, client collaboration, email management, search, secure storage, and workflow. It’s built deeply on Microsoft 365 (Microsoft Graph, SharePoint, Outlook, Entra ID) and runs on Azure, serving firms around the world.
We’re growing our engineering presence in Europe, and we’re looking for a Senior DevOps / Infrastructure Engineer to own and improve the foundations our product runs on. You’ll automate the path from commit to production, make the platform measurably more reliable and secure, manage cloud cost and scalability, and improve the developer experience so our product teams ship safely and quickly. It’s a hands-on senior role with real ownership of our cloud infrastructure, delivery pipelines, observability, and operational excellence.
SuiteFiles is a globally distributed team across the US, the UK, Europe, and New Zealand. Working CET hours, you’ll have strong overlap with the UK and Europe and a solid afternoon overlap with the US; meaningful overlap with US-based colleagues matters for this role.
Key Responsibilities
- Design, build, and operate Azure cloud infrastructure for a multi-tenant SaaS platform, managed as Infrastructure as Code (Terraform / Bicep).
- Own and improve CI/CD pipelines (GitHub Actions / Azure DevOps): fast, safe, automated paths from commit to production.
- Strengthen observability and operational excellence: monitoring, logging, alerting, dashboards, SLOs, and incident response, reducing toil through automation.
- Lead on platform reliability and scalability: capacity, performance, resilience, and failure-mode design for high-volume production workloads.
- Drive security by default across infrastructure and pipelines: secrets management, least-privilege IAM, network security, patching, and support for compliance requirements around sensitive client data.
- Manage cloud cost (visibility, accountability, and optimization) without compromising reliability.
- Improve developer experience and release flow so product squads deliver safely and quickly; partner with engineers on supportability, deployment, and operability.
- Participate in and help mature on-call and incident response, including root-cause analysis and prevention.
Skills and Qualifications
Essential
- 5+ years in DevOps / infrastructure / platform / SRE / cloud engineering for production systems.
- Hands-on experience operating cloud-based distributed systems at production scale (Azure preferred; AWS/GCP transferable).
- Demonstrated ownership of CI/CD and Infrastructure as Code (Terraform and/or Bicep/ARM) in production.
- Experience running multi-tenant SaaS infrastructure, with the security and reliability expectations that come with sensitive customer data.
- Solid observability and incident-response experience (metrics, logging, tracing, alerting).
- Containers and orchestration (Docker, Kubernetes or equivalent) and scripting/automation (PowerShell, Bash, or Python).
- Based in the EU in the Central European Time zone, with the right to work in your country of residence, and able to maintain meaningful overlap with US-based colleagues (plus strong overlap with the UK and Europe).
Desirable
- Microsoft 365 platform context: Microsoft Graph, SharePoint, Exchange, Entra ID.
- SRE practices: SLOs / error budgets, resilience testing, progressive delivery (canary / blue-green).
- Security tooling and practices (SAST/DAST, dependency and secret scanning, policy-as-code).
- Cloud-cost / FinOps tooling and optimization experience.
- Azure certification (e.g. AZ-104 / AZ-400 / AZ-305) or equivalent.
- Experience in the accounting, legal, or professional-services domain.
Personal Skills, Attributes & Competencies
- Strong ownership and a reliability- and customer-focused mindset: you care that the platform our customers depend on stays fast, available, and secure.
- Pragmatic about the balance between reliability, delivery speed, and cost.
- A mentor and enabler: you make other engineers more productive through better tooling and practice.
- Clear communicator, including asynchronously across time zones.
- High attention to detail, especially around security and data in a multi-tenant product.