Open-source deep-learning framework
Reference PyTorch implementation and models for DINOv3
Python inference and LoRA trainer package for the LTX-2 audio–video
A Systematic Framework for Interactive World Modeling
Open Source Speech Language Model
A Unified Framework for Text-to-3D and Image-to-3D Generation
Wan2.2: Open and Advanced Large-Scale Video Generative Model
FAIR Sequence Modeling Toolkit 2
Official repository for LTX-Video
Text and image to video generation: CogVideoX and CogVideo
Project Lyra: Open Generative 3D World Models
High-resolution models for human tasks
Open-source framework for intelligent speech interaction
An Efficient Agentic Model for Computer Use
Global weather forecasting model using graph neural networks and JAX
Generating Immersive, Explorable, and Interactive 3D Worlds
Open-Source Financial Large Language Models
Achieving 3x+ generation speedup on reasoning tasks
LTX-Video Support for ComfyUI
A Powerful Native Multimodal Model for Image Generation
RGBD video generation model conditioned on camera input
Block Diffusion for Ultra-Fast Speculative Decoding
Official implementation of Watermark Anything with Localized Messages
Code for Mesh R-CNN, ICCV 2019
PyTorch code and models for the DINOv2 self-supervised learning