Repo of Qwen2-Audio chat & pretrained large audio language model
Generate Any 3D Scene in Seconds
Making ALL Software Agent-Native
The official PyTorch implementation of Google's Gemma models
Machine Learning automation and tracking
Git-based data version control for machine learning workflows
AutoGluon: AutoML for Image, Text, and Tabular Data
Open source codebase for Scale Agentex
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Open platform for building, deploying, and managing LLM agents
AI tool that converts GitHub repositories into interactive diagrams
Bringing BERT into modernity via both architecture changes and scaling
A curated collection of skills for AI coding agents
Inference script for Oasis 500M
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Set of tools to assess and improve LLM security
Official implementation of DreamCraft3D
A Customizable Image-to-Video Model based on HunyuanVideo
Graph Neural Network Library for PyTorch
Models and examples built with TensorFlow
Large Multimodal Models for Video Understanding and Editing
Powering Amazon custom machine learning chips
A specialized Claude Code workspace for creating long-form
Handwritten Text Recognition (HTR) system implemented with TensorFlow
From nobody to big model (LLM) hero