Only AI Jobs


Systems Engineer/Administrator II, Global Operations Support Engineering

ID: 6678

Type: Full-time

Category: Others

Company Name: Amazon Data Services, Inc.

Location: USA, VA, Herndon - Herndon - United States

Salary: 104,500.00 - 160,000.00 USD annually

Job Description

AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we're looking for talented people who want to help.

You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

The AWS Global Operations Support Engineering (GOSE) team is seeking a Systems Engineer to lead the technical implementation of business automation solutions and AI-driven operational intelligence platforms. This role will serve as the technical backbone for transforming critical infrastructure data into automated, intelligent systems that enable the Data Center Community (DCC) organization to prevent customer impact, reduce operational burden, and continuously improve fleet-wide reliability.

As a Systems Engineer, you will design and build production-grade automation infrastructure, lead the productionalization of AI-driven operational tools, and establish engineering best practices that scale across the global AWS data center portfolio. You will work at the intersection of infrastructure operations, automated solutions, and artificial intelligence to create systems that fundamentally change how AWS manages its global infrastructure.

Key job responsibilities
- Influence the team’s technical and business strategy by making insightful contributions to team priorities, and lead the identification and resolution of architecture deficiencies that limit innovation

- Design and implement production infrastructure for AI-driven operational intelligence platforms and agentic systems, including event-driven architectures, Lambda functions, AgentCore deployments, API integrations, MCP server deployments, and AI orchestration systems that enable autonomous near-real-time actions across the global fleet

- Architect scalable automation solutions and agentic AI systems that integrate across multiple AWS services (Lambda, CloudWatch, Bedrock, AgentCore) and internal systems to eliminate manual processes and enable autonomous workflows

- Develop and maintain MCP (Model Context Protocol) servers and tools that expose data center operational data, runbooks, and automation capabilities to agentic systems

- Build and maintain AWS infrastructure for automation programs, including dedicated AWS accounts, IAM roles, security configurations, deployment pipelines, usage logging, and authentication systems

- Establish engineering standards, best practices, and operational excellence patterns for business automation and AI-driven systems, including CI/CD pipelines and infrastructure-as-code solutions using CDK/CloudFormation

- Drive proof-of-concept development for new automation ideas and own the productionalization of validated AI proofs of concept into production-ready systems with >95% uptime, implementing monitoring, alerting, and observability solutions for automation infrastructure

- Collaborate with Business Intelligence Engineers, TPMs, and Data Engineers to translate business requirements into technical solutions while leading design and code reviews

About the team
The Global Operations Support Engineering (GOSE) team is focused on maximizing AWS data center infrastructure availability and operational excellence. We achieve this by optimizing labor utilization, conducting deep-dive event and incident analysis, developing data engineering and business intelligence solutions, deploying business automation, and managing global operational improvement initiatives.

We transform critical infrastructure data into actionable intelligence that enables the Data Center Community (DCC) organization to prevent customer impact, reduce operational burden, focus on highest-impact activities, and continuously improve fleet-wide reliability and productivity. Through our comprehensive monitoring, analysis, reporting, and program/project management, we serve as the analytical backbone that drives continuous improvement in operational excellence across the global data center portfolio.

The team operates at the intersection of infrastructure operations, data engineering, and artificial intelligence—building systems that fundamentally change how AWS manages its global infrastructure at scale.

Basic Qualifications

- 5+ years of systems engineering experience
- Bachelor's degree in Systems Engineering, Computer Science, or a related field, or relevant work experience
- Experience in site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network administration
- Experience in any of the following: Python, Java, Perl, PHP, Ruby, Bash, Shell or equivalent

Preferred Qualifications

- Knowledge of TCP/IP and networking protocols such as HTTP and DNS
- Experience designing and developing scripts to automate operational burdens and reviewing scripting changes to ensure they meet the standards for maintainability, scalability and security
- Experience working in a 24/7 production environment
- Experience with service-oriented architecture and web services

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.



USA, VA, Herndon - 104,500.00 - 160,000.00 USD annually

Company Information

Company Name: Amazon Data Services, Inc.

Company Website: https://aws.amazon.com

Company Address: 410 Terry Ave N, Seattle, WA 98109-5210, United States

Amazon Data Services, Inc. is a corporate subsidiary within the Amazon corporate family that functions as an organizational and operational entity supporting Amazon’s cloud infrastructure and related enterprise technology operations in the United States. The company appears in Amazon’s publicly filed subsidiary lists and corporate filings and is associated with the activities required to own, operate, lease, maintain and administer the physical and network infrastructure that underpins Amazon Web Services (AWS) and other Amazon technology operations. As such, Amazon Data Services, Inc. is best understood not as a separate product-brand consumer-facing company, but as a technology-focused legal and operational arm that facilitates the delivery of large-scale cloud computing, storage, and networking services offered by Amazon’s cloud businesses.

Overview and scope
Amazon Data Services, Inc. operates within Amazon’s broader corporate structure and is engaged primarily in activities that support the provision of cloud computing infrastructure and related services. Public corporate disclosures list the entity among Amazon’s domestic subsidiaries, and business records and filings indicate the company’s role in managing aspects of data center operations and infrastructure assets. While the subsidiary itself does not market a separate set of retail products to end customers under its own brand, it performs essential back-office, property, and operational functions that enable the availability, resilience, and expansion of Amazon’s cloud platforms.

Core business activities
The core activities attributable to Amazon Data Services, Inc. are centered on infrastructure ownership and operations, including matters commonly associated with data center and cloud platform support: acquisition, leasing and management of data center properties; implementation and maintenance of power, cooling and facility systems; coordination of network connectivity and backbone infrastructure; logistics and physical security for technology sites; and compliance with regional regulatory, environmental and safety requirements for infrastructure operations. In addition, the entity plays a role in contractual and administrative arrangements required to support cloud service delivery (such as vendor and supplier agreements for data center equipment, construction and facility services) and in certain cases holds title or leases for physical locations used by Amazon’s cloud and technology businesses.

Relationship to AWS products and services
Although Amazon Data Services, Inc. itself does not market end-user cloud services under a distinct consumer brand, its operations are integral to the delivery of Amazon Web Services (AWS). AWS is the public-facing suite of cloud products provided by Amazon, offering on-demand infrastructure and platform services such as compute (Amazon EC2), object storage (Amazon S3), managed databases (Amazon RDS and Amazon DynamoDB), serverless computing (AWS Lambda), networking services (Amazon VPC, AWS Direct Connect), content delivery (Amazon CloudFront), and a broad portfolio of platform, security and analytics services. The physical infrastructure and site operations managed or supported by entities like Amazon Data Services, Inc. are the foundation on which these AWS offerings are hosted and delivered to customers globally.

Operational and compliance responsibilities
As part of Amazon’s enterprise infrastructure organization, Amazon Data Services, Inc. is involved in ensuring high availability, operational continuity and compliance of critical facilities. This includes meeting industry and regulatory requirements for data center operations, implementing redundancy and disaster recovery planning, and participating in certification processes where applicable. The company’s activities support the technical and operational reliability expected by enterprise and public-sector customers who use cloud services for production workloads. Public communications and regulatory filings from Amazon and AWS describe extensive investments in physical infrastructure, security, and compliance regimes, areas in which data services subsidiaries participate through ownership, management, or contractual arrangements.

Role within Amazon’s corporate and legal structure
Amazon Data Services, Inc. functions as a corporate vehicle used by Amazon to manage specific infrastructure-related assets and obligations. In large multinational technology companies, such subsidiaries are commonly used for administrative clarity, legal and tax structuring, asset management, and focused operational control of real estate and data center portfolios. Filings from Amazon identify numerous related subsidiaries with distinct legal names; Amazon Data Services, Inc. is one such entity that appears across public records and filings associated with Amazon’s technology operations.

Public-facing presence and contact
There is no separate consumer website or distinct public product catalog for Amazon Data Services, Inc.; instead, the public-facing information that relates to the company’s operational domain is made available through Amazon and Amazon Web Services channels. For customers seeking products or services supported by the company’s infrastructure, AWS’s official online resources describe the portfolio of cloud services, technical documentation, operational commitments, and compliance information. For corporate, legal or regulatory inquiries about subsidiary entities, Amazon’s corporate filings and investor relations disclosures provide formal references to the company’s legal entities and their roles within the broader Amazon organization.