AI Inference Engineer Job at Signify Technology, Santa Clara, CA

  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
