AI Inference Engineer Job at Signify Technology, Santa Clara, CA

  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
