A unified framework for scalable computing
A Pythonic framework to simplify AI service building
A high-performance ML model serving framework that offers dynamic batching
A library for accelerating Transformer models on NVIDIA GPUs
Bring the notion of Model-as-a-Service to life
Powering Amazon custom machine learning chips
Unified Model Serving Framework
Implementation of "Tree of Thoughts"