Designed for training LLM/VLM agents via RL
Driving with Graph Visual Question Answering
Recipes to train reward model for RLHF
Constrained Value Alignment via Safe Reinforcement Learning
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development
A simple, performant and scalable Jax LLM
LISA: Reasoning Segmentation via Large Language Model
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Skywork-R1V is an advanced multimodal AI model series
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Code and models for ICML 2024 paper, NExT-GPT
CV, NLP, LLM project applications, and advanced engineering deployment
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
A simple yet powerful agent framework for personal assistants
The common language for platforms, agents and businesses.
ImageBind One Embedding Space to Bind Them All
RGBD video generation model conditioned on camera input
An industrial grade federated learning framework
270+ Claude Code plugins with 739 agent skills
Implementation of Vision Transformer, a simple way to achieve SOTA
LTX-Video Support for ComfyUI
Less Code, Lower Barrier, Faster Deployment
Harness LLMs with Multi-Agent Programming
Agent framework and applications built upon Qwen>=3.0