





Locations: Canary Wharf | Boston
Who We Are
Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today, we help clients with total transformation: inspiring complex change, enabling organizations to grow, building competitive advantage, and driving bottom-line impact.
To succeed, organizations must blend digital and human capabilities. Our diverse, global teams bring deep industry and functional expertise and a range of perspectives to spark change. BCG delivers solutions through leading-edge management consulting along with technology and design, corporate and digital ventures, and business purpose. We work in a uniquely collaborative model across the firm and throughout all levels of the client organization, generating results that allow our clients to thrive.
What You'll Do
The Senior Director - Operations and Reliability Engineering is responsible for blending Site Reliability Engineering (SRE), DevOps, and traditional operations models to build a next-generation Reliability Engineering function. This role ensures end-to-end automation at scale, 24x7 operational excellence, and high availability across all of BCG, including BCG Core, BCG X, and Consulting Team (CT) worldwide. The leader will drive strategic planning, execution, and optimization of global IT infrastructure, cloud operations, and service management while ensuring a secure, scalable, and efficient technology environment. This role is accountable for embedding and assuring IT Service Management (ITSM) processes across all teams, ensuring compliance with standardized frameworks and operational excellence.
Key Responsibilities:
Strategic Leadership & Transformation:
- Define and execute a modern Reliability Engineering strategy, integrating SRE, DevOps, and automation-first operational models.
- Drive end-to-end automation to eliminate toil, improve efficiency, and enhance operational resilience.
- Lead the transition from traditional IT operations to a proactive, AI-driven, self-healing infrastructure.
- Establish a global observability, telemetry, and predictive analytics framework for real-time insights.
- Align operational strategies with business goals, ensuring IT supports digital transformation initiatives across BCG Core, BCG X, and CT.
Infrastructure & Cloud Operations:
- Oversee global IT infrastructure, cloud platforms, and hybrid hosting environments across all BCG business units.
- Manage network reliability, compute platforms, and cloud-native services across AWS, Azure, and GCP.
- Scale Infrastructure as Code (IaC), automated provisioning, and cloud workload optimization.
- Drive edge computing, containerized workloads, and high-performance computing strategies.
- Implement AI-driven monitoring, self-healing automation, and full-stack observability.
IT Service Management & Operational Excellence:
- Mandate and assure the adoption of IT Service Management (ITSM) processes across all teams, ensuring standardized, efficient, and effective service delivery.
- Establish SRE-based operational metrics, including SLOs, SLIs, and error budgets.
- Oversee incident response, problem resolution, and root cause analysis with AI-driven remediation.
- Ensure high availability, performance, and security compliance for all enterprise services.
- Develop a follow-the-sun operational support model, ensuring 24x7 resilience and uptime across all of BCG.
- Optimize incident, change, and capacity management, ensuring alignment with ITIL best practices and automated workflows.
- Lead Service Asset and Configuration Management (SACM), ensuring accurate and real-time management of software and IT assets within the CMDB.
- Drive continuous enhancements to the CMDB, improving visibility, compliance, and lifecycle management of IT assets.
Security, Compliance & Risk Management:
- Embed security and compliance into operational workflows with automated security controls.
- Ensure adherence to ISO 27001, NIST, SOC 2, GDPR, and cloud security best practices.
- Collaborate with cybersecurity teams to integrate zero-trust security models.
- Drive resiliency planning, disaster recovery, and business continuity initiatives.
Financial & Vendor Management:
- Optimize IT operational budgets with a cost-effective, cloud-native strategy.
- Negotiate vendor contracts, ensuring alignment with business needs and service reliability.
- Drive cost efficiency in cloud spending, SaaS platforms, and infrastructure investments.
Leadership & Talent Development:
- Build and mentor a high-performing Reliability Engineering team, fostering a culture of automation and innovation.
- Lead a team of SREs, DevOps engineers, and platform reliability experts across global squads.
- Promote a collaborative, data-driven, and proactive mindset, ensuring agility and operational resilience.
- Establish workforce development programs for AI-driven operations, automation, and modern reliability practices.
What You'll Bring
Required Qualifications:
- 15+ years of experience in IT operations, SRE, DevOps, or platform engineering.
- 5+ years in a senior leadership role, managing large-scale IT environments.
- Deep technical expertise in cloud computing (AWS, Azure, GCP), on-prem infrastructure, and hybrid environments.
- Proven track record in end-to-end automation, Infrastructure as Code (IaC), and large-scale observability.
- Experience in AI-driven IT operations, predictive analytics, and automated remediation.
- Strong understanding of zero-trust security, regulatory compliance, and risk management.
- Excellent leadership, communication, and stakeholder management skills.
Preferred Qualifications:
- Certifications: ITIL, AWS/Azure/GCP Solutions Architect, SRE Foundation, CISSP, or equivalent.
- Experience with Kubernetes, Terraform, Ansible, and AI-powered operations tools.
- Strong problem-solving abilities, with a data-driven approach to operational excellence.
The Senior Director - Operations Platform Lead is a pivotal leadership role responsible for shaping the future of IT operations by integrating SRE, DevOps, and automation-first methodologies. If you are a highly technical, innovation-driven leader passionate about scaling operations through automation and AI-driven resilience, we invite you to apply.
Who You'll Work With
Work Environment & Additional Information:
- Hybrid or on-site work model.
- May require occasional travel for business meetings, data center visits, or vendor engagements.
- Ability to work in a fast-paced, high-availability IT environment, with a focus on automation and reliability.
Boston Consulting Group is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, age, religion, sex, sexual orientation, gender identity / expression, national origin, disability, protected veteran status, or any other characteristic protected under national, provincial, or local law, where applicable, and those with criminal histories will be considered in a manner consistent with applicable state and local laws.
BCG is an E-Verify Employer.