Only AI Jobs


Sr Software Development Manager, Generative AI for AWS Neuron

ID: 4808

Type: Full-time

Category: Others

Company Name: Annapurna Labs (U.S.) Inc. - D63

Location: USA, NY, New York - New York - United States

Salary: 242,100.00 - 327,500.00 USD annually

Job Description

You will join a dynamic team working in the GenAI revolution by applying AI to AI. You will build agents, tools, and models to simplify and accelerate customer adoption of Neuron, the software stack supporting Amazon's Machine Learning silicon: Trainium. You will work with external and internal customers to identify the main obstacles and the opportunities to accelerate their adoption of the Neuron technology. You will be a key member of the team driving our vision and strategy in this space, which is critical to AWS's Generative AI business. You will also lead a team building AI agents and tools that simplify AWS Neuron adoption for Machine Learning developers working with Trainium chips, partnering with external and internal customers to identify obstacles and accelerate their migration to AWS's ML silicon.

Key job responsibilities
This is a highly visible role that requires partnering with other Neuron Software teams, Applied Science, AWS AI Services, external partners and customers, with potentially high impact on AWS's top and bottom line. As a member of the team applying Generative AI to accelerate Neuron adoption, you will play a key role in shaping this space with the following technical and leadership responsibilities:

* Collaborate with scientists, engineers, product managers and executive leadership to create the technical vision and roadmap for agents and tools used by internal and external Machine Learning developers.
* Serve as a technical lead on demanding, cross-functional projects critical to AWS's future success in the AI space.
* Deliver on ambitious goals to improve the time and effort it takes to port and optimize Machine Learning workloads on Neuron.
* Contribute intellectual property through patents.
* Hire and assist in the career development of your team members, actively mentoring individuals on advanced technical issues and their career growth.
* Contribute to the definition of best practices for the software development lifecycle used by junior engineers, including its design, implementation, testing, and operational characteristics.

About the team
The Neuroboros team was recently created to pursue the ambitious goal of leveraging and expanding Generative AI technologies to help customers benefit from the scale and price/performance equation offered by Amazon Machine Learning hardware. The creation of the team in NYC is key to Annapurna Labs' location strategy, with the goal of creating an additional hub that attracts top talent with varied backgrounds to work on challenging problems, using and building state-of-the-art tooling.

Basic Qualifications

- 10+ years of engineering experience
- 5+ years of engineering team management experience
- 10+ years of experience planning, designing, developing and delivering consumer software
- Experience partnering with product or program management teams
- Experience managing multiple concurrent programs, projects and development teams in an Agile environment
- Experience in one or more of the following areas: ML compilers, production coding agents, GenAI model architecture, model training, neural network optimization, or alternatively applied math

Preferred Qualifications

- Proven track record building AI agents that automate ML workload optimization
- ML compiler tuning
- Distributed inference and training
- ML kernel authoring and optimization

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.



USA, NY, New York - 242,100.00 - 327,500.00 USD annually

Company Information

Company Name: Annapurna Labs (U.S.) Inc. - D63

Company Website: https://aws.amazon.com/annapurna-labs/

Company Address: USA, TX, Austin

Annapurna Labs is a technology engineering organization acquired by Amazon in January 2015 and integrated into Amazon Web Services (AWS). Originally founded as an independent semiconductor and systems-design startup, Annapurna Labs has since focused on the design and development of custom silicon, system-on-chip (SoC) solutions, and hardware subsystems used to optimize cloud infrastructure. The organization’s engineering work concentrates on creating purpose-built processors and hardware accelerators that improve performance, efficiency, security, and cost for hyperscale cloud services and virtualized computing environments. Annapurna Labs’ outputs are primarily incorporated into AWS compute, storage, and networking offerings rather than sold as products under the Annapurna brand to third-party customers.

Company overview and positioning
Annapurna Labs began as a specialized design team focused on low-power, high-performance SoCs and platform controllers suitable for consumer and data-center uses. After acquisition by Amazon, the group’s charter shifted to delivering hardware and firmware innovations that directly support AWS services and the EC2 compute platform. The team’s responsibilities include architecture, logic design, firmware, hardware engineering, and close integration with software and systems teams across AWS to enable differentiated instance types and infrastructure features. Engineers from Annapurna Labs collaborate with other AWS organizations to translate cloud service requirements (for performance, security, and scalability) into silicon and hardware subsystems integrated into Amazon’s data centers.

Core business activities and capabilities
Annapurna Labs’ core activities center on custom silicon design and hardware subsystem development for cloud infrastructure. Key capabilities include designing ARM-based processors and SoCs optimized for server workloads, developing dedicated virtualization and I/O offload hardware, and engineering secure, high-throughput networking and storage controllers. The organization also develops firmware, platform-level security features, and hardware-software co-design techniques to ensure that accelerators are tightly integrated with hypervisors, host operating systems, and AWS control plane services.

Annapurna Labs’ engineering scope spans multiple layers of the stack: microarchitecture and CPU subsystem design; integration of memory, I/O, and accelerators on SoCs; board- and chassis-level hardware design; firmware and secure boot implementations; and the development of hardware subsystems that offload virtualization, networking, and storage tasks from general-purpose CPUs. This hardware-offload approach reduces overhead on server CPUs, enables higher consolidation and isolation for multi-tenant cloud environments, and allows AWS to offer instance types with improved price/performance profiles.

Main products, technologies, and contributions
While Annapurna Labs does not primarily market stand-alone commercial products under its own brand following the Amazon acquisition, its engineering output is visible across several AWS technologies and product families. Notable contributions linked to Annapurna Labs engineering include the AWS Nitro System and the Graviton family of processors. The AWS Nitro System is a collection of hardware and lightweight hypervisor software components that offload networking, storage, and security functions from host CPUs to dedicated hardware and firmware. Nitro enables improved performance, stronger isolation, and feature-rich instance types. The Graviton processors are AWS’s family of custom ARM-based CPUs designed for cloud workloads; these processors emphasize high throughput and energy efficiency for many server use cases. Engineers from the Annapurna Labs organization have been reported as major contributors to the architecture and delivery of these initiatives.

Beyond processors and Nitro, the group has focused on high-performance network controllers, storage controllers, and platform controllers that help AWS implement features such as enhanced networking, accelerated storage I/O, and hardware-enforced isolation. Annapurna Labs’ designs emphasize a hardware-software co-design approach: firmware and microcontroller subsystems are developed alongside host-level drivers and management software so that new hardware features can be exposed to AWS services and customers reliably and securely. This deep integration reduces virtualization overhead, improves I/O determinism, and enables new instance capabilities that would be difficult to achieve with off-the-shelf server components.

Customers and deployment context
Following the Amazon acquisition, Annapurna Labs’ technologies are principally deployed inside Amazon’s own global cloud footprint. The group’s work directly benefits AWS customers through improved EC2 instance performance, new instance families, and enhanced underlying infrastructure security and isolation. Rather than selling silicon or hardware directly to external customers, Annapurna Labs’ output is realized as AWS features, instance types, and managed services that incorporate their designs. This model allows Amazon to differentiate its cloud offerings by optimizing the underlying hardware for the specific demands of large-scale cloud workloads.

Corporate and historical notes
Annapurna Labs was founded as a private startup focused on SoC and hardware innovation. Amazon acquired the company in 2015, bringing its engineering talent into AWS. Since the acquisition, the team has been cited in AWS announcements and technical disclosures describing custom silicon and hardware systems used to advance AWS compute and virtualization technology. The organization operates as part of Amazon’s broader investment in custom infrastructure, in which in-house hardware design is used to achieve performance, cost, and feature advantages at hyperscale.

In summary, Annapurna Labs is a specialized hardware engineering organization now operating within Amazon Web Services, responsible for designing custom processors, SoCs, and hardware subsystems that power and differentiate AWS compute and infrastructure services. Their work is built into internal AWS products (notably Nitro and the Graviton processor families) and focuses on improving performance, security, and efficiency for cloud customers through hardware-software co-design and purpose-built silicon.