ICLR 2026 Recursive Self-Improvement
by Sam account 2 · updated 21h ago
Notebooks
Tables
Files
reddit_analysis_results.tsv
59.4 KB · 17h ago
reddit_analysis_results.tsv
59.4 KB · 17h ago
TamperBench_A_Systematic_Framework_to_Stress-Test_LLM_Safety_Under_Fine-Tuning_and_Tampering__smLtz7WID0.pdf
1.1 MB · 2d ago
Depth_vs_Recursion_Outperforming_Transformers_in_Jigsaw_Reconstruction__1zDG1o16xB.pdf
862.2 KB · 2d ago
Test-Time_Adaptation_via_Many-Shot_Prompting_Benefits_Limits_and_Pitfalls__Ne0s3UOzdB.pdf
331.1 KB · 2d ago
Reference-Guided_Machine_Unlearning__0IbjWHRGMC.pdf
301.3 KB · 2d ago
Cross-Family_Speculative_Prefill_Training-Free_Long-Context_Compression_with_Small_Draft_Models__7dNjMllnWe.pdf
271.9 KB · 2d ago
Federation_over_Text__gh7aCT73cq.pdf
304.4 KB · 2d ago
Build_Judge_Optimize_A_Blueprint_for_Continuous_Improvement_of_Multi-Agent_Consumer_Assistants__FySoHBWmt9.pdf
1.2 MB · 2d ago
Real-Time_Procedural_Learning_From_Experience_for_AI_Agents__MKG4BaSieN.pdf
229.2 KB · 2d ago
Orthogonal_Gradient_Projection_for_Continual_LLM_Unlearning__lb6Ce20kl5.pdf
287.1 KB · 2d ago
Test-Time_Meta-Adaptation_with_Self-Synthesis__G0GE1xbR0w.pdf
386.5 KB · 2d ago
Compute_as_Teacher_Turning_Inference_Compute_Into_Reference-Free_Supervision__WA6q2pNQhj.pdf
1.3 MB · 2d ago
Constructive_Distortion_Improving_MLLMs_with_Attention-Guided_Image_Warping__sSwXGq1RRI.pdf
24.5 MB · 2d ago
VerlTool_Towards_Holistic_Agentic_Reinforcement_Learning_with_Tool_Use__l5QPYxMEaH.pdf
6.3 MB · 2d ago
Contrastive_Self-Refinement_for_Low-Cost_Adaptation_in_Real-World_Text-to-SQL__tgnbgt3Ctr.pdf
5.5 MB · 2d ago
AutoHarness_Improving_LLM_Agents_by_Automatically_Synthesizing_a_Code_Harness__g9rEYVNn5T.pdf
518.0 KB · 2d ago
Dynamic_Noise_Preference_Optimization__nT1gTSAf07.pdf
8.7 MB · 2d ago
Aligned_but_Stereotypical_Understanding_and_Mitigating_Social_Bias_in_LLM-Driven_Text-to-Image_Models__RvgoRhan6d.pdf
7.7 MB · 2d ago
Evaluating_AGENTS.md_Are_Repository-Level_Context_Files_Helpful_for_Coding_Agents__0DyJeJ3iia.pdf
2.0 MB · 2d ago
ESDAE_Evaluating_Synthetic_Data_for_Agent_Evaluation__DlhLLVhrZB.pdf
1.4 MB · 2d ago
Beyond_Solving_A_Closer_Look_at_LLMs_as_Solution_Verifiers__Hv5hDfbhuB.pdf
500.0 KB · 2d ago
Actor-Curator_Scalable_Policy-driven_Curriculum_Learning_for_RL_Post-Training__JYJXWnujvR.pdf
1.8 MB · 2d ago
Learning_to_Evolve_Scaling_Open-Ended_Discovery_with_Relative-Progress_RL__WnZHbe1Gu0.pdf
817.4 KB · 2d ago
Inference-Time_Scaling_in_Diffusion_Models_through_Iterative_Partial_Refinement__QopjICzGwr.pdf
4.2 MB · 2d ago
Self-Improving_Clinical_Reasoning_via_Textual_Gradients__VOa0lDJ5Ui.pdf
3.7 MB · 2d ago
Agentic_Context_Engineering_Evolving_Contexts_for_Self-Improving_Language_Models__f1eKYFvMNu.pdf
2.1 MB · 2d ago
Learning_What_to_Learn_Curriculum_Curation_for_Test-Time_Agent_Learning__TRQLuxgxBN.pdf
339.8 KB · 2d ago
Refining_Large_Language_Models_with_Self-Generated_Data_Through_Iterative_Training__81P20Qg44P.pdf
10.2 MB · 2d ago
AlphaApollo_A_System_for_Deep_Agentic_Reasoning__UcIFJPXqrB.pdf
2.7 MB · 2d ago
Teaching_Models_to_Teach_Themselves_Reasoning_at_the_Edge_of_Learnability__HuLoDnvC29.pdf
3.0 MB · 2d ago
TextBO_Bayesian_Optimization_in_Language_Space_for_Eval-Efficient_Self-Improving_AI__h0XYERyeL6.pdf
47.0 MB · 2d ago
SAGE_Self-play_Adversarial_Games_Enhance_Large_Language_Model_Reasoning_Capabilities__b3dPMokQki.pdf
942.1 KB · 2d ago
Self-Improvement_via_Fast_Tree-search__wZMNXHPYcO.pdf
1.7 MB · 2d ago
Federated_Agent_Reinforcement_Learning__h4iPWHGapI.pdf
1.8 MB · 2d ago
Reward_Hacking_in_Self-Improving_Code_Agents__ikrQWGgxYg.pdf
153.7 KB · 2d ago
Adaptive_Decoding_via_Test-Time_Policy_Learning_for_Self-Improving_Generation__7fIqKZ92Fy.pdf
813.3 KB · 2d ago
Leveraging_Suboptimal_and_Noisy_Trajectories_for_Goal-Conditional_Offline_RL__jfi88XbVsn.pdf
1.4 MB · 2d ago
Self-CriTeach_LLM_Self-Teaching_and_Self-Critiquing_for_Improving_Robotic_Planning_via_Automated_Domain_Generation__8I0n20ufAy.pdf
1.2 MB · 2d ago
Just_Enough_Learning_GRPO-Guided_Controllers_for_Hyperparameter_Sweeps__kKWSQsYgpa.pdf
253.3 KB · 2d ago
Discover_the_distinguishing_and_effective_reasoning_patterns_among_LLMs_via_an_LLM__z9MOHU8RjL.pdf
5.0 MB · 2d ago
Generative_Recursive_Reasoning_Models__Vxu6kcIjwV.pdf
4.8 MB · 2d ago
Improved_Iterative_Refinement_for_Chart-to-Code_Generation_via_Structured_Instruction__KgkMSbYoL1.pdf
2.5 MB · 2d ago
Duel-Evolve_Pairwise_Preference_Black-Box_Optimization_of_LLM_Responses__38t39AFZPE.pdf
980.8 KB · 2d ago
Soft_Mellowmax_Monte_Carlo_Planning__SeZCSEGZTq.pdf
293.9 KB · 2d ago
Language-Guided_Expertise_Evolution_for_Protein_Optimization__NeNWeyfUJj.pdf
1.4 MB · 2d ago
Vision-Guided_Iterative_Refinement_for_Frontend_Code_Generation__gF6jmAsbpZ.pdf
549.6 KB · 2d ago
A_Framework_for_Prompt_Optimization_and_Translation_Across_Foundation_Models__gOTCn5uZeE.pdf
268.0 KB · 2d ago
MimicAgent_Learning_Quadruped_Skills_via_Text-to-Trajectory_Generation__1iAEFtFQ9M.pdf
2.9 MB · 2d ago
Simple_Baselines_are_Competitive_with_Code_Evolution__QSWFqDcveB.pdf
2.8 MB · 2d ago
TangramSR_A_Benchmark_for_Recursive_Self-Improvement_In_Continuous_Geometric_Reasoning__5XGfbXlFmA.pdf
1.8 MB · 2d ago
CircuitBuilder_From_Polynomials_to_Circuits_via_Reinforcement_Learning__JNsTWIukjQ.pdf
3.4 MB · 2d ago
Feedback_Descent_Open-Ended_Text_Optimization_via_Pairwise_Comparison__A0dlCcZ4hT.pdf
4.0 MB · 2d ago
Do_Depth-Grown_Models_Overcome_the_Curse_of_Depth__Enqas0eZF6.pdf
1.1 MB · 2d ago
Agent0-VL_Exploring_Self-Evolving_Agent_for_Tool-Integrated_Vision-Language_Reasoning__JKx0zkUBsQ.pdf
2.3 MB · 2d ago
OMEGA_Optimizing_Machine_learning_by_Evaluating_Generated_Algorithms__4TUzVEzVdu.pdf
1.3 MB · 2d ago
Reasoning_as_Gradient_Scaling_MLE_Agents_Beyond_Tree_Search__TnjlvLY30w.pdf
12.2 MB · 2d ago
Intelligent_Robot_Manipulation_Requires_Self-Directed_Learning__hCjgh8jxKl.pdf
196.5 KB · 2d ago
Rethinking_Machine_Unlearning_Models_Designed_to_Forget_via_Key_Deletion__gGH3Xp1lHR.pdf
22.7 MB · 2d ago
Emergent_temporal_abstractions_in_autoregressive_models_enable_hierarchical_reinforcement_learning__5UbKomO77O.pdf
5.4 MB · 2d ago
Log-Augmented_Generation_Scaling_Test-Time_Reasoning_with_Reusable_Computation__32xd2HdB9m.pdf
399.9 KB · 2d ago
Differentiable_Evolutionary_Reinforcement_Learning__uXTXcZ615k.pdf
782.0 KB · 2d ago
Self-Improving_VLM_Judges_Without_Human_Annotations__8hYSvUpJBA.pdf
731.7 KB · 2d ago
In-Context_Adaptation__f58uDOwLaq.pdf
276.6 KB · 2d ago
LLM-FE_Automated_Feature_Engineering_for_Tabular_Data_with_LLMs_as_Evolutionary_Optimizers__KbGkWv7VEk.pdf
761.1 KB · 2d ago
Your_Self-Play_Algorithm_is_Secretly_an_Adversarial_Imitator__rL0GEyoMvE.pdf
416.7 KB · 2d ago
Self-Adapting_Agents_for_Automating_Research_Coding_Workflows__ihcRmUkXHF.pdf
597.1 KB · 2d ago
Self-EvolveRec_Self-Evolving_Recommender_Systems_with_LLM-based_Directional_Feedback__KCcpSuocz0.pdf
2.2 MB · 2d ago
Shape_of_Thought_When_Distribution_Matters_More_than_Correctness_in_Reasoning_Tasks__Oc5R3D2TfE.pdf
1.5 MB · 2d ago
Verifying_the_Verifiers_Failure_Attribution_for_Agentic_Benchmark_Diagnostics_and_Training_Data_Curation__iRhaK8PsuB.pdf
205.8 KB · 2d ago
Correct_Reasoning_Paths_Visit_Shared_Decision_Pivots__OlMSrldTNe.pdf
620.8 KB · 2d ago
CoT-Seg_Rethinking_Segmentation_with_Chain-of-Thought_Reasoning_and_Self-Correction__BtHRvwzJj8.pdf
9.9 MB · 2d ago
POLARIS_A_Godel_Agent_Framework_for_Small_Language_Models_Through_Experience_Abstracted_Policy_Repair__wlwkLVvB0W.pdf
1.1 MB · 2d ago
MAPPA_Scaling_Multiagent_Systems_with_Process_Rewards__s06wgoO65a.pdf
9.5 MB · 2d ago
Unlocking_Intrinsic_Self-Reflection_for_LLM_Preference_Policy_Optimization__0ZGL40jRKE.pdf
2.6 MB · 2d ago
A_Task-Centric_Theory_for_Iterative_Self-Improvement_with_Easy-to-Hard_Curricula__EEc4Pn2GSa.pdf
2.3 MB · 2d ago
Reasoning_Within_the_Mind_Dynamic_Multimodal_Interleaving_in_Latent_Space__fWrhAOQZ5A.pdf
10.4 MB · 2d ago
Theory-Driven_Modeling_and_LLM-Guided_Evolution_for_Power_System_Scheduling__djdSNvWpJ7.pdf
701.9 KB · 2d ago
SkillRL_Evolving_Agents_via_Recursive_Skill-Augmented_Reinforcement_Learning__56D2hjARkn.pdf
4.5 MB · 2d ago
One-Step_Video_Depth_Estimation_via_Self-Distillation__ucRjPd9HVU.pdf
5.8 MB · 2d ago
Reasoning_Cache_Learning_to_Extrapolate_to_Long_Lengths_via_Short-Length_RL__DROMQyqM52.pdf
927.2 KB · 2d ago
Residual_Off-Policy_RL_for_Finetuning_Behavior_Cloning_Policies__9m4kM3bK2b.pdf
5.1 MB · 2d ago
Structure_Enables_Effective_Self-Localization_of_Errors_in_LLMs__QbL99Fyqsl.pdf
926.8 KB · 2d ago
RFTF_Reinforcement_Fine-tuning_for_Vision-language-action_Models_with_Temporal_Feedback__mBgcG43j7a.pdf
1.3 MB · 2d ago
Escaping_Model_Collapse_via_Synthetic_Data_Verification__YzDC5hjGUM.pdf
2.6 MB · 2d ago
SAHOO_Safeguarded_Alignment_for_High-Order_Optimization_Objectives_in_Recursive_Self-Improvement__OAFPpQO0H9.pdf
3.5 MB · 2d ago
Unrolled_Policy_Iteration_for_Tiny_Recursive_Models__rIzREKws05.pdf
377.2 KB · 2d ago
Agent0_Unleashing_Self-Evolving_Agents_from_Zero_Data_via_Tool-Integrated_Reasoning__hYYeOl58xi.pdf
7.2 MB · 2d ago
Learning_to_Continually_Learn_via_Meta-learning_Agentic_Memory_Designs__sOq52KnJmR.pdf
21.0 MB · 2d ago
PostTrainBench_Can_LLM_Agents_Automate_LLM_Post-Training__FJKOIxkUxo.pdf
232.8 KB · 2d ago
Contextual_Drag_How_Errors_in_the_Context_Affect_LLM_Reasoning__zpiYsPVDlV.pdf
1.0 MB · 2d ago
Can_Current_Language_Models_Close_the_Discovery_to_Application_Loop__BAdK20xfqj.pdf
2.7 MB · 2d ago
Interestingness_as_an_Inductive_Heuristic_for_Future_Compression_Progress__6GTlSlWW9C.pdf
1.3 MB · 2d ago
Adaptive_Meta-Curriculum_for_Test-Time_Self-Improvement__GjoUJTfXiW.pdf
574.9 KB · 2d ago
SimpleMem_Efficient_Lifelong_Memory_for_LLM_Agents__oYHelQ3Edd.pdf
755.7 KB · 2d ago
Self-Improving_World_Models_via_Asymmetric_Forward-Inverse_Consistency__ajcjip0yFR.pdf
13.0 MB · 2d ago
Towards_Execution-Grounded_Automated_AI_Research__gpLJamvbsK.pdf
896.6 KB · 2d ago
Lang-PINN_From_Language_to_Physics-Informed_Neural_Networks_via_a_Multi-Agent_Framework__q5qN3oQ4D1.pdf
1.2 MB · 2d ago
VLAW_Iterative_Co-Improvement_of_Vision-Language-Action_Policy_and_World_Model__Ro0eQ0ly3q.pdf
5.4 MB · 2d ago
Presenting_a_Paper_is_an_Art_Self-Improvement_Aesthetic_Agents_for_Academic_Presentations__d40v7Qcpi4.pdf
34.8 MB · 2d ago
GASP_Guided_Asymmetric_Self-Play_For_Coding_LLMs__NYrOkAfDkP.pdf
1.3 MB · 2d ago
Self-Evolving_Rubrics_Interpretable_Instance-Level_Criteria_for_Scalable_RL__aA2PXFH2Cp.pdf
2.1 MB · 2d ago
Language_Self-Play_For_Data-Free_Training__uB8YQHNsh6.pdf
612.4 KB · 2d ago
Self-Improving_Vision-Language-Action_Models_with_Data_Generation_via_Residual_RL__shqo2VPfc7.pdf
24.4 MB · 2d ago
ACE_Self-Evolving_LLM_Coding_Framework_Adversarial_Unit_Test_Generation_and_Preference_Optimization__ecKAmz5vlO.pdf
1.2 MB · 2d ago
CausalEvolve_Towards_Open-Ended_Discovery_with_Causal_Scratchpad__14jSctSh0D.pdf
419.4 KB · 2d ago
Knowledge_is_Not_Enough_Injecting_RL_Skills_for_Continual_Adaptation__D93migx9av.pdf
865.6 KB · 2d ago
Test-Time_Self-Distillation__iEmRSwdzyw.pdf
890.8 KB · 2d ago
Anchored_Self-Play_for_Code_Repair__lTbBFAoPSA.pdf
9.2 MB · 2d ago
From_Growing_to_Looping_A_Unified_View_of_Iterative_Computation_in_LLMs__yIDgVx3OoN.pdf
743.6 KB · 2d ago
Tiny_Autoregressive_Recursive_Models__aY5kmaNrwB.pdf
558.2 KB · 2d ago
Can_Language_Models_Discover_Scaling_Laws__Nj6VGY4dej.pdf
1.5 MB · 2d ago