I'm hiring for the team I lead at Waymo. We're GPU experts who care about latency on the car.
Before coming to Waymo, I led the CPU/GPU parts of XLA, the compiler for TensorFlow that also powers JAX. We're using XLA and other technologies to make ML and non-ML GPU code run fast on the car.
Ideally we're looking for someone who's comfortable hacking on high-ish level TensorFlow and low-level XLA (or similar frameworks, like TVM), to help with e.g. speeding up int8 stuff. But I'm open to considering strong candidates without this specific experience -- all of this stuff is learnable.
Apply here: https://waymo.com/joinus/4764940/ and maybe mention that you saw this on HN! Also, if you have questions, I'll try to check this thread and reply.