Sr. Algorithm Engineer (ML) at Ambarella Corporation, Santa Clara, U.S.A., 2020/06 - Now
Accelerate model inference by optimizing bit width of tensors in post-training quantization and folding operations in a frozen computation graph. Also, implement specialized primitives and arithemtic operations for various target chips with GPU acceleration.