https://arxiv.org/abs/2109.09541
Scaling TensorFlow to 300 million predictions per second
Jan Hartman, Davorin Kopič
“In this work, we describe the process of scaling machine learning models implemented in the TensorFlow machine learning framework to over 300 million predictions per second…”