Controllable and fast Text-to-Speech for over 7000 languages
An AI-powered security review GitHub Action using Claude
Renderer for the harmony response format to be used with gpt-oss
Data Lake for Deep Learning. Build, manage, and query datasets
Machine Learning Engineering Open Book
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
State-of-the-art (SoTA) text-to-video pre-trained model
A simple yet powerful agent framework that delivers with models
This repository contains code released by Google Research
Spark-TTS Inference Code
Block Diffusion for Ultra-Fast Speculative Decoding
A long-running autonomous coding agent powered by the Claude Agent
Open source no-code system for text annotation and building of text
RAG Search API
Towards Human-Sounding Speech
Configuration UI for Home Assistant
Python Driver for ArangoDB with built-in validation
AI-powered document analysis and tagging for Paperless-ngx
Run all your local AI together in one package
FAIR Sequence Modeling Toolkit 2
The Standard Webhooks specification
When LLM Meets Domain Experts
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
GEO-first SEO skill for Claude Code
Open source AI model for generating full songs from lyrics prompts