Optimized Test-Time Training by @mit: Shaping AI’s Future in Reasoning.
This brilliant video introduces a novel approach to improving reasoning capabilities in large language models (LLMs) through Test-Time Training (TTT) with a Leave-One-Out (LOO) strategy, specifically applied to the Abstraction and Reasoning Corpus (ARC). ARC tasks require abstract pattern recognition and rule inference, often with only a few input-output examples. TTT addresses this by dynamically fine-tuning lightweight Low-Rank Adapters (LoRA) at inference time.
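To make "fine-tuning LoRA adapters at inference time" concrete, here is a minimal sketch using Hugging Face's transformers and peft libraries. The base model ("gpt2" as a small stand-in), target modules, and hyperparameters are illustrative assumptions, not the authors' configuration:

```python
# Minimal sketch of per-task test-time training with a LoRA adapter.
# "gpt2" stands in for the much larger base model used in practice;
# hyperparameters and target modules are illustrative, not the paper's.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

tokenizer = AutoTokenizer.from_pretrained("gpt2")
base_model = AutoModelForCausalLM.from_pretrained("gpt2")

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["c_attn"],  # GPT-2's fused attention projection
    task_type="CAUSAL_LM",
)
model = get_peft_model(base_model, lora_config)  # only adapter weights train

def test_time_train(model, task_examples, steps=2, lr=1e-4):
    """Fine-tune the LoRA adapter on one task's (prompt, target) pairs
    at inference time; the frozen base weights are shared across tasks."""
    optimizer = torch.optim.AdamW(
        (p for p in model.parameters() if p.requires_grad), lr=lr
    )
    model.train()
    for _ in range(steps):
        for prompt, target in task_examples:
            # Simplification: loss over the whole sequence, not just the target.
            batch = tokenizer(prompt + target, return_tensors="pt")
            loss = model(**batch, labels=batch["input_ids"]).loss
            loss.backward()
            optimizer.step()
            optimizer.zero_grad()
    model.eval()
    return model
```

Because only the low-rank adapter weights are trainable, each per-task update is cheap, and the adapter can simply be discarded or reset before moving to the next task.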
The method decomposes each task into independent subtasks, using LOO to hold out one of the task’s demonstration pairs as a synthetic test case while fine-tuning on the remaining pairs plus augmented data. This fine-tuning adapts the model to the specific logic of each task, enabling the LLM to better generalize abstract transformations while avoiding information leakage from the held-out pair. The augmentation step enriches the limited examples with transformations such as flips, rotations, and rule-based variations, supporting robust task-specific adaptation; a sketch of the construction follows below.
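Here is a short sketch of the LOO construction with geometric augmentation. The grid representation (small NumPy arrays) and the specific set of flips and rotations are assumptions for illustration, not the paper's full augmentation pipeline:

```python
# Sketch of leave-one-out task construction with geometric augmentation.
# Grids are assumed to be small NumPy integer arrays; the flip/rotation
# set shown here is illustrative, not the paper's exact augmentation set.
import numpy as np

def augment(pair):
    """Yield rotated and flipped variants of one (input, output) grid pair.
    The same transform is applied to both grids so the task rule is kept."""
    inp, out = pair
    for k in range(4):  # 0, 90, 180, 270 degree rotations
        yield np.rot90(inp, k), np.rot90(out, k)
        yield np.fliplr(np.rot90(inp, k)), np.fliplr(np.rot90(out, k))

def leave_one_out_tasks(pairs):
    """From K demonstration pairs, build K synthetic tasks: each holds out
    one pair as the test case and trains on augmented copies of the rest."""
    for i, held_out in enumerate(pairs):
        rest = [p for j, p in enumerate(pairs) if j != i]
        train_set = [aug for p in rest for aug in augment(p)]
        yield train_set, held_out
```

With K demonstration pairs this yields K synthetic training tasks per ARC task, which is what makes adapter fine-tuning viable despite the tiny number of examples.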
This dynamic TTT process contrasts with static pre-training and in-context learning by actively updating model parameters during inference. Unlike in-context learning, which passes examples directly in the prompt without any parameter updates, TTT uses the constructed auxiliary dataset to fine-tune a LoRA adapter for each subtask independently. This lets the model handle ARC’s distinctive challenges, such as generalizing from minimal data and adapting to task-specific reasoning rules.
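A toy contrast makes the distinction explicit. The grid strings and prompt format below are invented for illustration, and test_time_train refers to the earlier sketch:

```python
# Toy contrast between ICL and TTT. Grid strings and the prompt format
# are invented for illustration; test_time_train is the sketch above.
demos = [("[[0,1],[1,0]]", "[[1,0],[0,1]]")]   # one toy demonstration pair
test_input = "[[1,1],[0,0]]"

# ICL: demonstrations are serialized into the prompt; weights never change.
icl_prompt = (
    "".join(f"Input: {x}\nOutput: {y}\n\n" for x, y in demos)
    + f"Input: {test_input}\nOutput:"
)

# TTT: the same demonstrations become a small gradient-descent dataset for
# a per-task LoRA adapter; inference then runs on the adapted weights, so
# the task logic lives in the adapter rather than only in the prompt.
# adapted = test_time_train(model, demos)
ttt_prompt = f"Input: {test_input}\nOutput:"
```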
Achieving a state-of-the-art 53% accuracy on the ARC public validation set, the approach demonstrates significant gains over baseline methods and offers a scalable framework for abstract reasoning tasks, especially in few-shot scenarios.
All rights w/ authors:
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
[ Link ]
00:00 Optimization of Test Time Training
01:08 ARC Intelligence test for AI
02:37 3 Insights into TTT
05:17 Test Time Dataset Creation
08:05 This is not ICL
09:47 Pre-train - Finetune - LoRA Adapter
13:00 ARC Dataset Characteristics
15:54 9000% Human AI
17:47 Leave One Out training
20:17 Cheating?
22:37 Limitations on TTT*
25:21 AI Agents and Security
26:30 Combine w Reward Policy MCTS
#reasoning
#ai
#massachusettsinstituteoftechnology
#training
#aieducation
#robot