Only AI Jobs


Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

ID: 5791

Type: Full-time

Category: Others

Company Name: Amazon Development Centre Canada ULC

Location: CAN, ON, Toronto - Toronto - Canada

Salary: 150,700.00 - 251,700.00 CAD annually

Visit company vacancy
Job Description

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.

The Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts in delivering optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.

The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance.

As part of the broader Neuron Compiler organization, our team works across multiple technology layers - from frameworks and compilers to runtime and collectives. We not only optimize current performance but also contribute to future architecture designs, working closely with customers to enable their models and ensure optimal performance. This role offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration technology

This is an opportunity to work on cutting-edge products at the intersection of machine-learning, high-performance computing, and distributed architectures. You will architect and implement business-critical features, publish cutting-edge research, and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a very unique learning culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achieve optimal performance on AWS ML accelerators.

Explore the product and our history!
https://awsdocs-neuron.readthedocs-hosted.com/en/latest/neuron-guide/neuron-cc/index.html
https://aws.amazon.com/machine-learning/neuron/
https://github.com/aws/aws-neuron-sdk
https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-success


Key job responsibilities
Our kernel engineers collaborate across compiler, runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will:

* Design and implement high-performance compute kernels for ML operations, leveraging the Neuron architecture and programming models
* Analyze and optimize kernel-level performance across multiple generations of Neuron hardware
* Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks
* Implement compiler optimizations such as fusion, sharding, tiling, and scheduling
* Work directly with customers to enable and optimize their ML models on AWS accelerators
* Collaborate across teams to develop innovative kernel optimization techniques



A day in the life
As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:

Build high-impact solutions to deliver to our large customer base.

Participate in design discussions, code review, and communicate with internal and external stakeholders.

Work cross-functionally to help drive business decisions with your technical input.

Work in a startup-like development environment, where you’re always working on the most important stuff.

About the team
#1. Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

#2. Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

#3. Inclusive Team Culture
Here at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.

#4. Work/Life Balance
Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.

#5. Mentorship & Career Growth
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.

Basic Qualifications

- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team

Preferred Qualifications

- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- * Expertise in accelerator architectures for ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures
- * Experience with GPU kernel optimization and GPGPU computing such as CUDA, NKI, Triton, OpenCL, SYCL, or ROCm
- * Demonstrated experience with NVIDIA PTX and/or AMD GPU ISA
- * Experience developing high performance libraries for HPC applications
- * Proficiency in low-level performance optimization for GPUs
- * Experience with LLVM/MLIR backend development for GPUs
- * Knowledge of ML frameworks (PyTorch, TensorFlow) and their GPU backends
- * Experience with parallel programming and optimization techniques
- * Understanding of GPU memory hierarchies and optimization strategies

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. As a total compensation company, Amazon's package may include other elements such as sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon offers comprehensive benefits including health insurance (medical, dental, vision, prescription, basic life & AD&D insurance), Registered Retirement Savings Plan (RRSP), Deferred Profit Sharing Plan (DPSP), paid time off, and other resources to improve health and well-being. We thank all applicants for their interest, however only those interviewed will be advised as to hiring status.



CAN, ON, Toronto - 150,700.00 - 251,700.00 CAD annually

Company Information

Company Name: Amazon Development Centre Canada ULC

Company Website: https://www.amazon.jobs/en/locations/canada

Company Address: CAN, BC, Vancouver

Amazon Development Centre Canada ULC is a Canadian legal entity and part of the global Amazon organization that houses software development, product engineering, research and related technical operations supporting Amazon’s consumer retail, cloud, devices, media and advertising businesses. As a development centre entity within Amazon’s corporate structure, it functions as an engineering and product delivery organization that employs software engineers, data scientists, machine learning specialists, product managers, designers, quality engineers and operations staff to design, build, test and operate software systems and services used by Amazon customers and internal business teams. The company’s core activities are centered on software and systems development across multiple technology domains. These activities typically include back-end and front-end application development, distributed systems engineering, cloud services work (including integration with Amazon Web Services), data engineering, analytics and machine learning model development, natural language processing work for voice services, computer vision R&D, infrastructure automation, security engineering and developer tooling. Teams operating under the development centre model commonly focus on delivering scalable services for retail commerce (catalog, search, recommendations, pricing, inventory and checkout), digital media (streaming and content delivery), consumer devices (software for Alexa-enabled products and IoT integrations), advertising technology (targeting, measurement and auction systems), and enterprise offerings (AWS features and management tools). The organisation also supports lifecycle activities for Amazon products and services, including product management, technical program management, continuous integration and continuous delivery (CI/CD) pipelines, site reliability engineering (SRE), monitoring and incident response, performance engineering, and operational support. Development centres play a role in prototyping new features, running experiments and A/B tests, and collaborating with global product teams to localize and adapt features for Canadian markets when appropriate. The teams frequently interface with cross-functional stakeholders—user experience researchers, UX designers, business analysts, legal and policy teams, and operations—to deliver end-to-end solutions aligning with Amazon’s product and customer experience goals. In terms of products and services, Amazon Development Centre Canada ULC does not sell consumer-facing products under its own brand; rather, it contributes engineering and product work to Amazon’s broad portfolio. Outputs from its teams feed into Amazon’s retail platforms (amazon.ca and global retail storefronts), AWS services, Alexa and Echo device software, Prime Video and digital content systems, Amazon Logistics and fulfillment technology, Amazon Advertising products, and other Amazon-owned services. These contributions include new features, performance and scale improvements, security and compliance implementations, localization for Canadian customers, and enhancements to data and machine-learning systems used for personalization, fraud detection, supply chain optimization and advertising. As a legal entity in Canada, Amazon Development Centre Canada ULC typically supports hiring and employment, payroll administration, workplace facilities and compliance with Canadian regulatory and labour requirements for Amazon’s engineering workforce in the country. The development centre model enables Amazon to maintain distributed engineering capacity outside its U.S. headquarters, providing both local product focus and integration into Amazon’s global engineering processes and technology stacks. Teams in such centres often collaborate closely with other Amazon engineering groups across North America, Europe and Asia, participating in shared codebases, microservice architectures, global release processes and Amazon-wide technical standards. Amazon’s publicly stated corporate mission—to be Earth’s most customer-centric company—is reflected in the development centre’s emphasis on customer-focused product delivery, data-driven decision-making and rapid iteration. The centre’s efforts are typically aligned to measurable customer outcomes such as lower latency, more relevant search results and recommendations, improved reliability and availability of services, faster feature delivery, and localized enhancements that improve the experience for Canadian customers. The operation also invests in workforce development, supporting internships, university hiring pipelines, co-op programs and community engagement in the technology ecosystem, as part of Amazon’s broader commitments to hiring and local economic participation. Security, privacy and regulatory compliance are also core considerations for engineering work done by Amazon Development Centre Canada ULC. Teams are expected to follow Amazon-wide security practices, data protection standards and compliance programs to meet applicable Canadian and international legal requirements, particularly for customer data, payment processing and cloud services. Research and engineering efforts may include building privacy-preserving machine learning, secure authentication systems, and compliance tooling to support regulated industries and jurisdictional requirements. Overall, Amazon Development Centre Canada ULC operates as a technology-focused subsidiary entity within the Amazon corporate family, delivering software engineering, data science and product development work that underpins many of Amazon’s consumer and enterprise offerings. Its contributions enable Amazon to scale technology development globally while retaining localized capabilities for the Canadian market.
Visit company vacancy