Share this job

Sr. Site Reliability Engineer

REQ ID:  8590

Secaucus, New Jersey, US, 07094


At the NBA, we're passionate about growing and celebrating the game of basketball. Through the intensity of the game and the unrivalled athletic skill of our players, we deliver excitement to hundreds of millions of fans around the world. As a global sports and media business, the NBA is so much more. While Basketball Operations runs the league's on-court activities, other departments manage relationships with television and digital media partners, develop marketing partnerships with some of the world's most recognizable companies, oversee the licensing of NBA merchandise, and handle a wide variety of responsibilities that drive the NBA's success. The NBA runs numerous web sites and digital products that generate fan interest around the world. League Pass is one of the NBA's best-in-class products available in over 215+ countries that allows fans live and archive access to games from the preseason through the NBA Finals. Whether it be for the NBA, WNBA, G League or 2K League millions of people visit sites, engage with mobile apps, and view our content across multiple platforms.


The NBA is committed to providing a safe and healthy workplace.  To safeguard our employees and their families, our visitors and the broader community from COVID-19, and in consideration of recommendations from health authorities and the NBA’s own advisors, any individual working onsite in our New York and New Jersey offices must be fully vaccinated against COVID-19.  The NBA will discuss accommodations for individuals who cannot be vaccinated due to a medical reason or sincerely held religious belief, practice, or observance.



Position Summary:


The NBA is seeking an experienced Sr. Site Reliability Engineer with a background in public and private cloud infrastructure technologies. This role will be a key part in designing and implementing scalable, fault tolerant and reliable public/private cloud and core infrastructure resources while abiding by the best industry standards. Other key responsibilities will be assessing the current on-premises infrastructure to modernize, automate and orchestrate workflows and develop a self-service NBA cloud IaaS catalog.


Major Responsibility:


  • Automation of IT Operations - monitoring, alerts, incident management/response, patching, RCAs, documentation
  • Apply automation, orchestration and self-service opportunities to any tasks or parts of the system that would benefit from it or are performed manually
  • Help cultivate a culture of automation and orchestration to the infrastructure operations team
  • Responsible for the availability, integrity, recoverability, performance and general application support of a mix of on-premises and cloud-based systems used throughout the organization
  • Provide primary operational and engineering support for multiple technology suites (storage, OS, compute systems, load balancers, hypervisor, etc.)
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Able to troubleshoot complicated, cross platform issues handling OS, storage, networking, database, compute nodes, and/or hypervisor components (core infrastructure) and handle live production incidents, debug/troubleshoot application and infrastructure issues, follow and implement SRE best practices Document your system knowledge as you acquire it over time, create automated runbooks, and ensure critical system information is readily available to those who need it


Required Skills/Knowledge:


  • Experience in developing private cloud IaaS self-service using orchestration tools such as Service-Now
  • Good communication skills, both verbal and written, in technical and non-technical topics
  • Ability to communicate effectively with individuals at all levels of the organization
  • Proficient with Infrastructure As Code (Terraform, Cloud Formation, Ansible, etc.)
  • Hands on experience in configuration management of server farms (using tools such as Puppet, Chef, Ansible, SCCM, etc.,)
  • Ability to program and script with one or more high level languages, such as Python, Ruby, PowerShell, Bash, PHP,and JavaScript Proficient in datacenter automation and orchestration using the mentioned knowledge/skills; IaC, IaaS, and config management
  • A proactive engineering approach to spotting problems, areas for improvement, and performance bottlenecks
  • Experience in monitoring and analyzing infrastructure performance using standard performance monitoring tools - Nagios, New Relic, SCOM, etc.
  • Proficient in Linux but has experience in Windows Server operating systems
  • Experience with virtualization and VDI technologies (VMware preferred)
  • Experience with SAN/NAS storage & backup
  • Experience with Web server platforms (IIS, Apache, etc)
  • Experience with FTP clustering, SSO and containerization (Docker, K8s)
  • A base networking knowledge of TCP/IP, with a strong understanding of HTTP and DNS Hands-on datacenter work including but not limited to physicaly racking servers, storage components and appliances while also be able to troubleshoot hardware related issues




  • At least 6 years in Infrastructure Operations/Datacenter engineering
  • 4 years as an SRE with cloud (public/private) modernization effort is desired
  • Bachelor's degree in Computer Science, Management Information Services, or a related technical discipline
  • Technology familiarity with Dell/EMC, VMWare, Rubrik, HPE Simplivity, Service-Now


Salary Range:


  • $140,000 to $160,000 per year


We Consider Applicants For All Positions On The Basis Of Merit, Qualifications And Business Needs, And Without Regard To Race, Color, National Origin, Religion, Sex, Gender Identity, Age, Disability, Alienage Or Citizenship Status, Ancestry, Marital Status, Creed, Genetic Predisposition Or Carrier Status, Sexual Orientation, Veteran Status, Familial Status, Status As A Victim Of Domestic Violence Or Any Other Status Or Characteristic Protected By Applicable Federal, State, Or Local Law.


Nearest Major Market: New York City
Nearest Secondary Market: Newark

Job Segment: System Administrator, Data Center, Virtualization, Computer Science, Linux, Technology