AI Inference Engineer Job at Signify Technology, Fremont, CA

  • Signify Technology
  • Fremont, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
