Datasets, transforms and models specific to Computer Vision
A Simple and Universal Swarm Intelligence Engine
State-of-the-art 2D and 3D Face Analysis Project
A simple, high-quality voice conversion tool focused on ease of use
Industry leading face manipulation platform
AI agent harness for AI coding agents
Wan2.1: Open and Advanced Large-Scale Video Generative Model
AI video generator optimized for low VRAM and older GPUs use
Run Local LLMs on Any Device. Open-source
Official Python inference and LoRA trainer package
The highest-scoring AI memory system ever benchmarked
Awesome multilingual OCR toolkits based on PaddlePaddle
From Images to High-Fidelity 3D Assets
The most powerful and modular diffusion model GUI, api and backend
3D reconstruction software
1 min voice data can also be used to train a good TTS model
TTS with kokoro and onnx runtime
Open-source, high-performance AI model with advanced reasoning
Wan2.2: Open and Advanced Large-Scale Video Generative Model
The most powerful local music generation model
Improve your Baduk skills by training with KataGo
Claude Code skill for generating production-quality SVG+PNG technical
Agentic, Reasoning, and Coding (ARC) foundation models
DeepMind's software stack for physics-based simulation
Video-based AI memory library. Store millions of text chunks in MP4