Automatic question answering for local knowledge bases based on LLM
PyTorch library of curated Transformer models and their components
Powerful tool that lets you create and run intelligent agents
Reflexion: Language Agents with Verbal Reinforcement Learning
Test-Time Reinforcement Learning
The official implementation of RAPTOR
NeurIPS2025 Spotlight] Quantized Attention
Maimaibot, a (more focused) multi-platform intelligent agent
One-stop solution for creating your digital avatar from chat history
A New Axis of Sparsity for Large Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Query MCP enables end-to-end management of Supabase via chat interface
K8s-mcp-server is a Model Context Protocol (MCP) server
Gorilla: An API store for LLMs
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Framework for building realtime multimodal voice AI agents apps
Contexts Optical Compression
Optimizing inference proxy for LLMs
The Multi-Agent Framework
A unified library of SOTA model optimization techniques
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Oobabooga - The definitive Web UI for local AI, with powerful features
Easiest and laziest way for building multi-agent LLMs applications
TensorRT LLM provides users with an easy-to-use Python API
An on-premises, OCR-free unstructured data extraction