Platform Site Reliability Engineer (SRE)
TD SYNNEX · San Jose
Job description
About the role
The Platform SRE will ensure reliability, operability, and continuous improvement of TD SYNNEX enterprise platforms across hybrid cloud and on‑premise environments. This engineering‑driven role focuses on automation, Infrastructure‑as‑Code, observability, and reducing operational toil while acting as the L3 escalation point for complex incidents.
Key responsibilities
- Own L3 reliability posture for hybrid cloud and on‑prem platforms, define SLOs/KPIs, and maintain runbooks and SOPs.
- Design and build operational automation, including health checks and remediation workflows using Terraform, Ansible, Python, PowerShell, or Bash.
- Lead diagnosis, stabilization, and recovery for major incidents; drive problem management, root‑cause analysis, and preventive actions.
- Define observability standards, create actionable alerts, dashboards, and reduce alert noise.
- Advance AIOps capabilities with predictive analytics, anomaly detection, and capacity forecasting.
- Enable outsourced L1/L2 providers with clear runbooks, training, and performance governance.
- Collaborate with Platform Engineering to embed operability‑by‑design and mentor peers.
Required profile
- 5+ years of experience in platform, SRE, or operations engineering with production ownership in large‑scale environments.
- Hands‑on experience with hybrid cloud (Azure) and on‑prem infrastructure, including compute, networking, storage, and identity.
- Proven track record of L3 incident troubleshooting and major incident leadership.
- Strong fundamentals in networking, virtualization, storage, and Windows/Linux server environments.
- Experience with ITSM processes (incident, problem, change) and ticket‑based operations.
Required skills
- Terraform
- Ansible
- Python (preferred)
- PowerShell
- Bash
- Azure cloud platform
- ITSM tools
- Linux and Windows Server administration
- Networking concepts including DNS and DHCP
- Virtualization and storage technologies
Questions fréquentes
Why are you reporting this job?
Apply in 30 seconds
Enter your email to apply. An account will be created automatically.
By continuing, you accept our terms of use.
Already have an account? Login
Published 3 hours ago
Expires 1 month from now
1 views · 0 interested
Boost your chances
Upload your CV — we will match you with relevant openings.
Analyzing your CV...
TD SYNNEX
San Jose
Related job offers
-
Trilingual Technical Support Technician (Canadian French)
TD SYNNEX San Jose -
Sr. System Engineer (HPE Enterprise/Pre-Sales)
TD SYNNEX San Jose -
Project Manager Odoo
SPC Internacional San Jose -
Remote MLOps Engineer – Flexible Hours
Hire Feed Costa Rica -
Remote LLM Engineer – AI Code Review & Challenge Creation
Hire Feed Costa Rica