AI Inference Engineer – Stealth Startup | San Fransisco Onsite
Compensation: $200K–$300K + equity
Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.
This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.
What You’ll Be Doing:
What They’re Looking For:
Nice to Have:
This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
...Business Analyst Banking (W2, $20-30/hr) Location: Hybrid, NYC Job Type: Contract W2 Rate: $ 20-30/hr #1... ...tocorp) Pay Rate: $20-30 per hour Location: Open to fully remote or onsite in select U.S. offices Contract Duration: TBD (with...
...Jubilant Radiopharma, the fastest growing radiopharmaceutical company in the nation, is seeking a Full Time driver for its Plainview, NY location. The hours of this position are 4:30am-1:00 pm. Weekends and call will be required. Address:51 East Bethpage Road...