Retrieval-Augmented Generation (RAG) framework
Fault-tolerant, highly scalable GPU orchestration
Genome modeling and design across all domains of life
Low-latency AI inference engine optimized for mobile devices
Decomposable Multiscale Mixing for Time Series Forecasting
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Faster and easier training and deployments
Running large language models on a single GPU
Unleashing 10,000+ Word Generation from Long Context LLMs
Empowering Code Generation with OSS-Instruct
Neural Network architecture based on ideas of the original LSTM
Leaderboard Comparing LLM Performance at Producing Hallucinations
Accessible large language models via k-bit quantization for PyTorch
Accelerate local LLM inference and finetuning
Implement a concise and clear Deep Search Agent from scratch
The best ChatGPT that $100 can buy
4M: Massively Multimodal Masked Modeling
ICLR 2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Code to accompany "A Method for Animating Children's Drawings"
No-code multi-agent framework to build LLM Agents, workflows
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
A Multi-Modal World Model for Reconstructing, Generating, and Simulating
Unifying 3D Mesh Generation with Language Models