HBM Validation Engineer, Annapurna Labs

Austin

Wednesday, 29 April 2026

Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.

We are seeking an HBM/DDRx validation expert with a heavy focus on validation of AWS next-generation ML chips, cards, and server integration. As a member of our memory team, you will have the opportunity to participate in the execution of HBM across all Trainium platforms, with the goal of improving the characteristics of HBM for our world-leading Trainium AI servers. Our HBM engineers need to independently work with vendors, understand the settings, write/modify tests, debug, and collect data in the fleet.

Key job responsibilities

As a member of the team, you will join a mixed group of hardware and software engineers working to design, integrate, and innovate the next generation of machine learning chips into Trainium servers. In this position it is expected that you will:

- Collaborate with architects, design teams, and software engineers on our next-generation ML chips
- Support ongoing debug and operations of previous ML chips within manufacturing and the data center
- Dive deep into IP integration, packaging, silicon bring-up, characterization, and validation of our HBM subsystems
- Independently develop the scripts you need to execute and collaborate with software engineers as your needs scale

A day in the life

A day in the life of a CHDE focused on HBM on the MLA Technology team focuses on operational excellence, constructively identifying problems, prototyping solutions, and leading data collection at scale to improve our products.
We start each day looking at our fleet, reviewing dashboards for emergent issues impacting our customers, and partnering with other teams to drive complex debugs as they pertain to HBM and associated SoC subsystems. We then look forward to the future technologies being developed and how we can best focus our efforts to help improve them and ensure a high-quality product on behalf of our customers. Our team members touch everything from electrical simulations, to hardware qualification on test benches, to software-driven data center metrics, with a broad range of tasks across multiple skillsets where you can help improve the reliability and performance of our products. You help the team evolve by actively participating in design discussions, code reviews, tickets, and data center capacity initiatives. While you are not a software engineer, you are not blocked by missing code; you leverage your scripting knowledge and AI tooling to move forward. CHDEs on the MLA Technology team are expected to help mentor others on the team in their area of expertise, regardless of level, to help develop the team's baseline skillsets, and to participate in the hiring process for the team.

About the team

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Diverse Experiences

AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply.
If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.

About AWS

Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating; that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture

Here at AWS, it's in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

Work/Life Balance

We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there's nothing we can't achieve in the cloud.

Mentorship & Career Growth

We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge-sharing, mentorship, and other career-advancing resources here to help you develop into a better-rounded professional.

Basic Qualifications

- BS in Electrical Engineering, Computer Engineering, Systems Engineering, Computer Science, or related field
- 5 years of experience in silicon development, with 3 years in SOC/IO/subsystems
- Good understanding of DDR/HBM at the PHY and controller level
- Good knowledge of DDR/HBM training, timing parameters, and/or controller features
- Support the physical design team with IP integration, silicon design, 2.5D packaging, clocking, and timing constraints
- Ability to create scripts (Lua, Bash, Python, etc.) to accomplish functional day-to-day tasks
- Drive cross-functional triage effort on functional and performance issues
- Perform system-level debug and root-cause analysis through bring-up, characterization, validation, and production phases
- Experience working with third-party IP and memory vendors

Preferred Qualifications

- MS in Electrical Engineering, Computer Engineering, Systems Engineering, Computer Science, or related field
- Strong firmware development skills within embedded environments
- Good leadership skills and ability to multitask and thrive in a dynamic environment
- Knowledge of HBM, DDRx, and related protocols
- Good communication and interpersonal skills
