A system for agentic LLM-powered data processing and ETL
Synthetic data curation for post-training and data extraction
Quick illustration of how one can easily read books together with LLMs
A modular Agentic RAG built with LangGraph
Gemma open-weight LLM library, from Google DeepMind
An elegent pytorch implement of transformers
Official inference repo for FLUX.1 models
Agents write python code to call tools and orchestrate other agents
Tokenizer-Free TTS for Multilingual Speech Generation
Qwen2.5-VL is the multimodal large language model series
LLM Council works together to answer your hardest questions
950 line, minimal, extensible LLM inference engine built from scratch
Generate short videos with one click using AI LLM
FlashInfer: Kernel Library for LLM Serving
Fast multimodal LLM for real-time voice interaction and AI apps
Multilingual Document Layout Parsing in a Single Vision-Language Model
Extension of Google Research’s PaperBanana
MemoryOS is designed to provide a memory operating system
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
A dataset consists of 15,140 ChatGPT prompts from Reddit
A simple, easy-to-hack GraphRAG implementation
Using AI models to automatically provide commentary and edit videos
Qwen3-ASR is an open-source series of ASR models
A nearly-live implementation of OpenAI's Whisper
Personal AI, On Personal Devices