LongBench v2 and LongBench (ACL 25'&24')
Hypernetworks that adapt LLMs for specific benchmark tasks
Driving with Graph Visual Question Answering
Learning to Reason with Search for LLMs via Reinforcement Learning
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Constrained Value Alignment via Safe Reinforcement Learning
Unleashing 10,000+ Word Generation from Long Context LLMs
An agentless approach to automatically solve software development
A simple, performant and scalable Jax LLM
Enhances Tesseract OCR output using LLMs (local or API)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Implementation for MatMul-free LM
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Code and models for ICML 2024 paper, NExT-GPT
Robust recipes to align language models with human and AI preferences
Open-source large language model family from Tencent Hunyuan
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Connect any LLM to your internal knowledge sources
Framework for validating and controlling LLM outputs in AI apps
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Provides line-oriented text file editing capabilities
A full spaCy pipeline and models for scientific/biomedical documents
Libraries for applying sparsification recipes to neural networks
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
AI framework for automated short video creation and editing tools