AI Inference Engineer Job at Signify Technology, Fremont, CA

  • Signify Technology
  • Fremont, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, and attention optimizations like FlashAttention, and designing new parallelism strategies across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables model deployment at production scale with substantially better performance, efficiency, and quality.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks and inference tooling such as PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
