AI Inference Engineer Job at Signify Technology, Fremont, CA

MzlFMWg3SmVpVy81Uk9wb2tmYU1jakkxM0E9PQ==
  • Signify Technology
  • Fremont, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

SCHOEN LEGAL SEARCH

Partner, Elite NYC Boutique Job at SCHOEN LEGAL SEARCH

Well-known and established elite, full service NYC boutique seeks additional partners looking to grow their practices among fantastic like-minded cohorts. 90 year-old 46 attorney NYC full service elite quality boutique on the approved list of top banks, hedge funds, ...

ContactLink Solutions LLC

Dari Freelance US-Based VRI/OPI Interpreter Job at ContactLink Solutions LLC

 ...Proficiency/Bilingual/Native level of English and target language Work letter from previous employer Resume with 2 professional...  ...information: Remote position, interpreter works from his/her home office Ongoing training and competency opportunities Monthly... 

Fancy Apartments LLC

Apartment Locator (HTX) Job at Fancy Apartments LLC

 ...Fancy Apartments is keen on hiring an effective Real Estate Agent Apartment Locator to play a vital role in the growth of our sales team. Our Apartment Locators focus on growing the Fancy Apartments brand by helping clients find apartments across Houston. Ideal candidates... 

Brock & Company Inc.

Cashier - Food Prep - Education Division Job at Brock & Company Inc.

 ...Description: Cashier- Food Prep - Full-Time - Day Schedule - Monday through Friday - Benefits Wage: $18.00 Per Hour Brock &...  ...Job Responsibilities: Prior experience in a high volume, fast-paced environment preferred. Operate a cash register, calculator... 

Hirtle, Callaghan & Co.

Client Engagement Specialist Job at Hirtle, Callaghan & Co.

 ...equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or orientation, Veteran Status, or...