Employment type
Full-Time
Organization size
10,000+
Salary
Salary undisclosed
Department
Engineering
On-site
Chandler, Arizona, United States
Los Lunas, New Mexico, United States
Ashburn, Virginia, United States
Atlanta, Georgia, United States
Reston, Virginia, United States
Temple, Texas, United States
Houston, Texas, United States
Garland, Texas, United States
Hillsboro, Oregon, United States
Kuna, Idaho, United States
Social Circle, GA, United States
Jeffersonville, IN, United States
Rosemount, MN, United States
Richmond, VA, United States
Montgomery, AL, United States
Cheyenne, WY, United States
Aiken, SC, United States
Henrico, Virginia, United States
Huntsville, Alabama, United States
Dekalb, Illinois, United States
Fort Worth, Texas, United States
Prineville, Oregon, United States
Gallatin, Tennessee, United States
Forest City, North Carolina, United States
Altoona, Iowa, United States
Fremont, California, United States
New Albany, Ohio, United States
Mesa, Arizona, United States
Santa Clara, California, United States
Sarpy County, Nebraska, United States
Newton County, Georgia, United States
Eagle Mountain, Utah, United States
Kansas City, Missouri, United States
Skills
- Data Center Operations AnalystEngineering
About the role
Description
Meta is seeking a forward thinking, experienced Data Center Systems Engineer to join the Data Center Site Operations team. Our data centers, and the tens of thousands of servers installed in them, are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Meta is at the leading edge of the global data center industry both in terms of how data centers are designed and operated. This person should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success. We seek a forward-thinking IT professional with experience utilizing multiple diverse software tools to identify automation solutions intended to address complex operational issues. This role is cross-functional and considers the technical needs of frontline users to identify and automate diagnostic tooling which enables quality and efficient delivery of production servers. They should have demonstrated experience of performing data analysis to drive decisions on the top priorities to automate repairs on servers in a hyperscale environment. An engineer who can drive solutions with code, through collaboration, clear and timely communication with globally diverse teams.
SiteOps Global Systems Engineer Responsibilities:
- Deliver maximum server fleet up-time and utilization rates, by leveraging data to understand hardware failure conditions and root cause. Identify trends and systemic issues in the fleet and drive resolution.
- Collaborate with stakeholders and subject matter experts to interpret business and operational needs, articulate success criteria in partnership with engineering and field based operations teams.
- Build cross functional relationships and have the capacity to influence policies and procedures to improve global data center operations.
- Mentor team members to evaluate and identify better ways to resolve issues and define updates to tools and processes.
- Write and review code, develop documentation, and debug the hardest problems, live, on some of the largest and most complex systems in the world.
- Participate in defining diagnostic tooling requirements with multiple cross-functional support teams.
- Execute validation and verification activities for the new product integration, including system level testing.
- Through consistent collaboration with cross-functional tooling teams, help determine root cause and provide input into their development process, with an operations central view of how open issues are affecting the fleet.
- Capacity to travel up to 25% required.
Other requirements
Minimum Qualifications:
- Engineering degree or commensurate experience
- 7+ years of experience in systems infrastructure operations or related field
- Experience in configuration and maintenance of applications such as web servers, load balancers, relational databases, storage systems and messaging systems
- Experience coding in higher-level languages (e.g., Python, PHP, C++, or Java)
- Experience learning software, frameworks and APIs
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- Experienced with Linux systems
undefined
Benefits
No benefits have been listed yet.
Technology, Information and Internet