Official inference repo for FLUX.2 models
High-Resolution Image Synthesis with Latent Diffusion Models
A Customizable Image-to-Video Model based on HunyuanVideo
Tokenizer-Free TTS for Multilingual Speech Generation
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A Family of Open Sourced Music Foundation Models
The official Meta Llama 3 GitHub site
MARS5 speech model (TTS) from CAMB.AI
Instant voice cloning by MIT and MyShell. Audio foundation model
Reference implementations of MLPerf™ training benchmarks
A sound cloning tool with a web interface, using your voice
Generative AI reference workflows
Interface for OuteTTS models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Personal notes from Wu Enda's machine learning course
Implementation of Vision Transformer, a simple way to achieve SOTA
A high-quality rapid TTS voice cloning model
This repository contains code released by Google Research
High-Quality Voice Cloning TTS for 600+ Languages
Taming Stable Diffusion for Lip Sync
Utilities intended for use with Llama models
Z80-μLM is a 2-bit quantized language model
Semantic search and workflows for medical/scientific papers
150+ quantitative finance Python programs