What is KFServing?
TL;DR: KFServing is a novel cloud-native multi-framework model serving tool for serverless inference. KFServing was born as part of the Kubeflow project, a joint effort between AI/ML industry leaders to standardize machine learning operations on top of Kubernetes. It aims at solving the difficulties of model deployment to production through the “model as data” approach, i.e. providing an API for inference requests.