SRE Manager, iCloud

Santa Clara Valley (Cupertino), California, United States
Software and Services

Summary

Posted:
Weekly Hours: 40
Role Number: 200527018
People at Apple don’t just build products — they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. iCloud Services SRE teams are responsible for the systems and services that directly support those customers and their experiences. We focus on availability and automation of key services that run iCloud every minute of every day all around the world!

Description

We're looking for a hardworking and motivated person to join this amazing team. You will be an accomplished builder and leader of teams looking to take on your next challenge. You know SRE and you know what it will take to run services at Apple scale with a high degree of operational precision. This role will position you to help craft the future of how we build and run our services on a global scale. You will have the technical skills to go deep and retain the ability to focus on higher-level business and product goals. We hire high-quality leaders and engineers with a diverse set of experiences and abilities for positions at Apple. Our customers count on us to provide extraordinary availability, scalability, and security for services. If you’d like to positively influence millions of customers’ experience of Apple, this is the job for you.

As a Site Reliability Engineering Manager, responsibilities include:

  • Lead SRE teams responsible for the reliability and performance of on-prem and cloud-based services
  • Lead and grow the engineers on your team
  • Manage staging and production environments with the goal of maximizing availability
  • Promote observability of systems for monitoring, alerting, and metrics reporting
  • Advocate best practices of reliability engineering

Key Qualifications

  • 5+ years of experience with large-scale distributed systems, especially ML infrastructure and services including LLMs, Generative AI, and transformers
  • Demonstrable success leading engineering teams, ideally SRE or Production Engineering
  • Knowledge of core operating system principles, networking fundamentals, and systems management
  • Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts
  • Experience with hiring and leading engineers
  • 5+ years of professional experience in an engineering leadership position

Education & Experience

Bachelor’s or Master’s degree in computer science or an equivalent field, with 5+ years of experience

Pay & Benefits

  • Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.