AI Hardware Systems Engineer

At eBay, we’re more than a global ecommerce leader — we’re changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190 markets around the world. We’re committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts.

Our customers are our compass, authenticity thrives, bold ideas are welcome, and everyone can bring their unique selves to work — every day. We’re in this together, sustaining the future of our customers, our company, and our planet.

Join a team of passionate thinkers, innovators, and dreamers — and help us connect people and build communities to create economic opportunity for all.

At eBay, we have started a new chapter in our iconic internet history of being the largest online marketplace in the world. We have more than 800 million listings with 80% of them selling new items, in over 400 markets around the world. The collection of services runs on a significant server and storage infrastructure, and the hardware engineering team is chartered to drive reliability, efficiency and performance of this layer. More and more of this workload is taking advantage of AI accelerators, and we are creating a role to focus on this area.

The Role:

We are looking for a Systems Software Engineer to join our team to qualify and automate testing of new hardware technologies related to AI, as well as support some of our traditional qualifications efforts. This person will interface with internal eBay teams working on AI platforms, other platform teams, key technology and systems integration vendors, AI open source software communities, and with other members of the hardware engineering team.

Key Responsibilities:

  • You will work as part of the Hardware Engineering team to reduce the cost of purchasing and operating eBay’s fleet of servers, saving millions of dollars a year.
  • At eBay we believe that AI will fundamentally change the way we work and do business, as such you primary focus will be working on our AI hardware platforms.
  • You will translate internal customer requests into requirements, and develop benchmarks and test suites to ensure our platforms meet their needs.
  • Evaluate the performance and reliability of new hardware platforms and hardware components using automated tests, with a strong focus on AI accelerators.
  • Expand and maintain our automation that we use daily for testing, and reliability work.
  • Develop performance test plans and experiments with our customer teams to ensure we are able to utilize our hardware to the fullest of its ability.
  • Work with our customers to debug, and address any reliability or performance issues they have with our server products.
  • Identify and suggest the ideal OS and BIOS settings for our systems.
  • Explore and propose new hardware/software technologies that improve performance, or reduce cost of our products, particularly new AI accelerators.
  • You will improve our monitoring and data collection tooling, to ensure we’re recording relevant information.

What you need:

  • You have at least 5-8 years of systems engineering experience using Linux as an operating system.
  • You should understand how to configure servers to expose AI accelerators.
  • Experience with AI frameworks and platforms, ideally with experience benchmarking services or accelerators. Things like pytorch, deepspeed, or MLPerf for benchmarking.
  • You should be able to explain how linux utilizes various hardware components, and what tunables it provides.
  • We primarily use Python and Bash for automating tasks, you must be proficient in one of these languages.
  • You should have used a revision control system like GIT, and be familiar with concepts like branching and merging.
  • You must be able to build and use containers using Docker or another technology.
  • You should understand how to compile and build source code, especially the linux kernel.
  • BS EE or CS with continued formal or informal education

Desired Skills:

  • It would a bonus if you understood the differences between AI accelerators from multiple vendors, and the differences in their architectures.
  • We’d like you to be familiar with extending a monitoring framework like Prometheus, so we can collect additional data from our testing.
  • We’d like for you to be familiar with Kubernetes and cloud computing concepts.
  • It would be a bonus if you’ve used various profiling and performance tools like perf, vtune, or performance co-pilot.
  • It would be great if you have experience analyzing logs, and working with data repositories to help drive technical decisions.
  • It would be a bonus if you’ve deployed and configured systems at scale using standard technology like PXE, Ansible, Salt, and Puppet.
  • Position ideally based in San Jose, CA with minimal travel required.

The pay range for this position at commencement of employment in California, Washington, or New York is expected in the range below.

$149,200 – $234,850

Base pay offered may vary depending on multiple individualized factors, including location, skills, and experience. The total compensation package for this position may also include other elements, including a target bonus and restricted stock units (as applicable) in addition to a full range of medical, financial, and/or other benefits (including 401(k) eligibility and various paid time off benefits, such as PTO and parental leave). Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

If hired, employees will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

Please see the Talent Privacy Notice for information regarding how eBay handles your personal data collected when you use the eBay Careers website or apply for a job with eBay.

eBay is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, gender identity, veteran status, and disability, or other legally protected status.

Jobs posted with location as “Remote – United States (Excludes: HI, NM)” excludes residents of Hawaii and New Mexico.

This website uses cookies to enhance your experience. By continuing to browse the site, you agree to our use of cookies. Visit our Privacy Center for more information.

To apply for this job please visit itjobpro.com.