Job Title: CUDA Machine Learning Engineer – GPU Optimization
Salary: $180,000 – $220,000 + equity
Location: New York City or San Francisco (on-site)
Responsibilities:
- Optimize generative ML models (e.g. video diffusion, 3DVAE) for GPU performance using CUDA
- Implement efficient GPU algorithms and identify performance bottlenecks in training and inference
- Collaborate with research and engineering teams to improve throughput on large-scale compute
- Ensure models run efficiently across various GPU architectures
- Stay up to date with the latest advances in GPU programming and ML optimization
Requirements:
- Bachelor’s degree in CS, EE, or related field with focus on high-performance computing
- Strong skills in CUDA and C++ programming for GPU acceleration
- Experience with PyTorch or TensorFlow on GPU-based systems
- Solid understanding of parallel computing and GPU architectures
- Familiarity with generative ML models and performance debugging
- Startup experience and ability to thrive in a fast-paced environment
Benefits:
- Fully Covered Medical, Dental, and Vision Insurance
- Paid Time Off
- Parental Leave
- Free Lunches
- Team Building Events
- Collaborative Team Environment
If you are interested in learning more about this position or any other roles we may have open, please apply today!
Accessibility Statement:
- Read and apply for this role in the way that works for you by using our Recite Me assistive technology tool. Click the circle at the bottom right side of the screen and select your preferences.
- We make an active choice to be inclusive towards everyone every day.
- Please let us know if you require any accessibility adjustments during the application or interview process.
Our Commitment to Diversity, Equity, and Inclusion:
Signify’s mission is to empower every person, regardless of their background
or circumstances, with an equitable chance to achieve the careers
they deserve. Building a diverse future, one placement at a time.