Repositories list
97 repositories
- Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
- [ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation
- Create, Evaluate, and Connect AI Skills
- A Large-Scale Knowledge Graph for Automated Scientific Research
- MemBase: A Comprehensive Benchmarking Framework for Long-Term Conversational Memory Layers
- Must-read Papers on LLM Agents.
- SkillX: Automatically Constructing Skill Knowledge Bases for Agents
- InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
- Can We Predict Before Executing Machine Learning Agents?
- Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency
- Data2Behavior: From Data to Behavior: Predicting Unintended Model Behaviors Before Training
- MemP: Exploring Agent Procedural Memory
- [ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents
- WorldMind: Aligning Agentic World Models via Knowledgeable Experience Learning
- project
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
- CaKE: [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
- [TASLP 2025] Spatial Knowledge Graph-Guided Synthesis for Multimodal LLMs
- Executable Knowledge Graphs for Replicating AI Research
- AutoMind
- [TrustNLP@NAACL 2025] BiasEdit: Debiasing Stereotyped Language Models via Model Editing
- unlearn: [ACL 2025] Knowledge Unlearning for Large Language Models
- Deco: [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
- ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking