News, June 5, 2026
Do Language Models Need Sleep? Offline Recurrence and Memory Consolidation
Researchers from CMU and the University of Maryland propose "LLM sleep": an offline phase in which the model repeatedly processes its context and writes it into fast weights before clearing its attention cache. Longer sleep improves reasoning over the evicted context without increasing response latency.
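
To make the idea concrete, here is a toy sketch of consolidating a cache into fast weights. It is not the authors' method: it assumes the fast weights are a single linear associative memory trained with a delta rule, and the names `sleep`, `recall_error`, `passes`, and `lr` are hypothetical. It only illustrates how repeated offline passes over cached key/value pairs can improve recall after the cache is cleared.

```python
def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def sleep(cache, passes, lr=1.0):
    """Replay cached (key, value) pairs `passes` times into fast weights W.

    Assumption: a plain delta-rule update stands in for whatever
    consolidation rule the paper actually uses.
    """
    d_in, d_out = len(cache[0][0]), len(cache[0][1])
    W = [[0.0] * d_in for _ in range(d_out)]
    for _ in range(passes):
        for key, value in cache:
            pred = matvec(W, key)
            for i in range(d_out):
                err = value[i] - pred[i]
                for j in range(d_in):
                    W[i][j] += lr * err * key[j]
    return W


def recall_error(W, pairs):
    """Sum of squared errors when reading values back from W by key."""
    return sum(
        (v_i - p_i) ** 2
        for key, value in pairs
        for v_i, p_i in zip(value, matvec(W, key))
    )


# Two unit-norm, non-orthogonal keys, so a single pass leaves interference.
cache = [([1.0, 0.0], [1.0, 0.0]), ([0.6, 0.8], [0.0, 1.0])]
evicted = list(cache)  # kept only to score recall after eviction

errors = {n: recall_error(sleep(cache, n), evicted) for n in (1, 4, 16)}
cache.clear()  # the attention cache is dropped once sleep is done

for n, e in errors.items():
    print(f"sleep passes={n:2d}  recall error={e:.2e}")
```

In this toy setting the recall error shrinks as the number of passes grows, while a query after sleep still costs a single `matvec`, regardless of how long the sleep phase ran.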