← Discover

ICLR 2026 Recursive Self-Improvement

by Sam account 2 · updated 21h ago

Sign in to join
0 forks2 members3 notebooks1 tables112 files0 events

Notebooks

Tables

Files

reddit_analysis_results.tsv

59.4 KB · 17h ago

reddit_analysis_results.tsv

59.4 KB · 17h ago

TamperBench_A_Systematic_Framework_to_Stress-Test_LLM_Safety_Under_Fine-Tuning_and_Tampering__smLtz7WID0.pdf

1.1 MB · 2d ago

Depth_vs_Recursion_Outperforming_Transformers_in_Jigsaw_Reconstruction__1zDG1o16xB.pdf

862.2 KB · 2d ago

Test-Time_Adaptation_via_Many-Shot_Prompting_Benefits_Limits_and_Pitfalls__Ne0s3UOzdB.pdf

331.1 KB · 2d ago

Reference-Guided_Machine_Unlearning__0IbjWHRGMC.pdf

301.3 KB · 2d ago

Cross-Family_Speculative_Prefill_Training-Free_Long-Context_Compression_with_Small_Draft_Models__7dNjMllnWe.pdf

271.9 KB · 2d ago

Federation_over_Text__gh7aCT73cq.pdf

304.4 KB · 2d ago

Build_Judge_Optimize_A_Blueprint_for_Continuous_Improvement_of_Multi-Agent_Consumer_Assistants__FySoHBWmt9.pdf

1.2 MB · 2d ago

Real-Time_Procedural_Learning_From_Experience_for_AI_Agents__MKG4BaSieN.pdf

229.2 KB · 2d ago

Orthogonal_Gradient_Projection_for_Continual_LLM_Unlearning__lb6Ce20kl5.pdf

287.1 KB · 2d ago

Test-Time_Meta-Adaptation_with_Self-Synthesis__G0GE1xbR0w.pdf

386.5 KB · 2d ago

Compute_as_Teacher_Turning_Inference_Compute_Into_Reference-Free_Supervision__WA6q2pNQhj.pdf

1.3 MB · 2d ago

Constructive_Distortion_Improving_MLLMs_with_Attention-Guided_Image_Warping__sSwXGq1RRI.pdf

24.5 MB · 2d ago

VerlTool_Towards_Holistic_Agentic_Reinforcement_Learning_with_Tool_Use__l5QPYxMEaH.pdf

6.3 MB · 2d ago

Contrastive_Self-Refinement_for_Low-Cost_Adaptation_in_Real-World_Text-to-SQL__tgnbgt3Ctr.pdf

5.5 MB · 2d ago

AutoHarness_Improving_LLM_Agents_by_Automatically_Synthesizing_a_Code_Harness__g9rEYVNn5T.pdf

518.0 KB · 2d ago

Dynamic_Noise_Preference_Optimization__nT1gTSAf07.pdf

8.7 MB · 2d ago

Aligned_but_Stereotypical_Understanding_and_Mitigating_Social_Bias_in_LLM-Driven_Text-to-Image_Models__RvgoRhan6d.pdf

7.7 MB · 2d ago

Evaluating_AGENTS.md_Are_Repository-Level_Context_Files_Helpful_for_Coding_Agents__0DyJeJ3iia.pdf

2.0 MB · 2d ago

ESDAE_Evaluating_Synthetic_Data_for_Agent_Evaluation__DlhLLVhrZB.pdf

1.4 MB · 2d ago

Beyond_Solving_A_Closer_Look_at_LLMs_as_Solution_Verifiers__Hv5hDfbhuB.pdf

500.0 KB · 2d ago

Actor-Curator_Scalable_Policy-driven_Curriculum_Learning_for_RL_Post-Training__JYJXWnujvR.pdf

1.8 MB · 2d ago

Learning_to_Evolve_Scaling_Open-Ended_Discovery_with_Relative-Progress_RL__WnZHbe1Gu0.pdf

817.4 KB · 2d ago

Inference-Time_Scaling_in_Diffusion_Models_through_Iterative_Partial_Refinement__QopjICzGwr.pdf

4.2 MB · 2d ago

Self-Improving_Clinical_Reasoning_via_Textual_Gradients__VOa0lDJ5Ui.pdf

3.7 MB · 2d ago

Agentic_Context_Engineering_Evolving_Contexts_for_Self-Improving_Language_Models__f1eKYFvMNu.pdf

2.1 MB · 2d ago

Learning_What_to_Learn_Curriculum_Curation_for_Test-Time_Agent_Learning__TRQLuxgxBN.pdf

339.8 KB · 2d ago

Refining_Large_Language_Models_with_Self-Generated_Data_Through_Iterative_Training__81P20Qg44P.pdf

10.2 MB · 2d ago

AlphaApollo_A_System_for_Deep_Agentic_Reasoning__UcIFJPXqrB.pdf

2.7 MB · 2d ago

Teaching_Models_to_Teach_Themselves_Reasoning_at_the_Edge_of_Learnability__HuLoDnvC29.pdf

3.0 MB · 2d ago

TextBO_Bayesian_Optimization_in_Language_Space_for_Eval-Efficient_Self-Improving_AI__h0XYERyeL6.pdf

47.0 MB · 2d ago

SAGE_Self-play_Adversarial_Games_Enhance_Large_Language_Model_Reasoning_Capabilities__b3dPMokQki.pdf

942.1 KB · 2d ago

Self-Improvement_via_Fast_Tree-search__wZMNXHPYcO.pdf

1.7 MB · 2d ago

Federated_Agent_Reinforcement_Learning__h4iPWHGapI.pdf

1.8 MB · 2d ago

Reward_Hacking_in_Self-Improving_Code_Agents__ikrQWGgxYg.pdf

153.7 KB · 2d ago

Adaptive_Decoding_via_Test-Time_Policy_Learning_for_Self-Improving_Generation__7fIqKZ92Fy.pdf

813.3 KB · 2d ago

Leveraging_Suboptimal_and_Noisy_Trajectories_for_Goal-Conditional_Offline_RL__jfi88XbVsn.pdf

1.4 MB · 2d ago

Self-CriTeach_LLM_Self-Teaching_and_Self-Critiquing_for_Improving_Robotic_Planning_via_Automated_Domain_Generation__8I0n20ufAy.pdf

1.2 MB · 2d ago

Just_Enough_Learning_GRPO-Guided_Controllers_for_Hyperparameter_Sweeps__kKWSQsYgpa.pdf

253.3 KB · 2d ago

Discover_the_distinguishing_and_effective_reasoning_patterns_among_LLMs_via_an_LLM__z9MOHU8RjL.pdf

5.0 MB · 2d ago

Generative_Recursive_Reasoning_Models__Vxu6kcIjwV.pdf

4.8 MB · 2d ago

Improved_Iterative_Refinement_for_Chart-to-Code_Generation_via_Structured_Instruction__KgkMSbYoL1.pdf

2.5 MB · 2d ago

Duel-Evolve_Pairwise_Preference_Black-Box_Optimization_of_LLM_Responses__38t39AFZPE.pdf

980.8 KB · 2d ago

Soft_Mellowmax_Monte_Carlo_Planning__SeZCSEGZTq.pdf

293.9 KB · 2d ago

Language-Guided_Expertise_Evolution_for_Protein_Optimization__NeNWeyfUJj.pdf

1.4 MB · 2d ago

Vision-Guided_Iterative_Refinement_for_Frontend_Code_Generation__gF6jmAsbpZ.pdf

549.6 KB · 2d ago

A_Framework_for_Prompt_Optimization_and_Translation_Across_Foundation_Models__gOTCn5uZeE.pdf

268.0 KB · 2d ago

MimicAgent_Learning_Quadruped_Skills_via_Text-to-Trajectory_Generation__1iAEFtFQ9M.pdf

2.9 MB · 2d ago

Simple_Baselines_are_Competitive_with_Code_Evolution__QSWFqDcveB.pdf

2.8 MB · 2d ago

TangramSR_A_Benchmark_for_Recursive_Self-Improvement_In_Continuous_Geometric_Reasoning__5XGfbXlFmA.pdf

1.8 MB · 2d ago

CircuitBuilder_From_Polynomials_to_Circuits_via_Reinforcement_Learning__JNsTWIukjQ.pdf

3.4 MB · 2d ago

Feedback_Descent_Open-Ended_Text_Optimization_via_Pairwise_Comparison__A0dlCcZ4hT.pdf

4.0 MB · 2d ago

Do_Depth-Grown_Models_Overcome_the_Curse_of_Depth__Enqas0eZF6.pdf

1.1 MB · 2d ago

Agent0-VL_Exploring_Self-Evolving_Agent_for_Tool-Integrated_Vision-Language_Reasoning__JKx0zkUBsQ.pdf

2.3 MB · 2d ago

OMEGA_Optimizing_Machine_learning_by_Evaluating_Generated_Algorithms__4TUzVEzVdu.pdf

1.3 MB · 2d ago

Reasoning_as_Gradient_Scaling_MLE_Agents_Beyond_Tree_Search__TnjlvLY30w.pdf

12.2 MB · 2d ago

Intelligent_Robot_Manipulation_Requires_Self-Directed_Learning__hCjgh8jxKl.pdf

196.5 KB · 2d ago

Rethinking_Machine_Unlearning_Models_Designed_to_Forget_via_Key_Deletion__gGH3Xp1lHR.pdf

22.7 MB · 2d ago

Emergent_temporal_abstractions_in_autoregressive_models_enable_hierarchical_reinforcement_learning__5UbKomO77O.pdf

5.4 MB · 2d ago

Log-Augmented_Generation_Scaling_Test-Time_Reasoning_with_Reusable_Computation__32xd2HdB9m.pdf

399.9 KB · 2d ago

Differentiable_Evolutionary_Reinforcement_Learning__uXTXcZ615k.pdf

782.0 KB · 2d ago

Self-Improving_VLM_Judges_Without_Human_Annotations__8hYSvUpJBA.pdf

731.7 KB · 2d ago

In-Context_Adaptation__f58uDOwLaq.pdf

276.6 KB · 2d ago

LLM-FE_Automated_Feature_Engineering_for_Tabular_Data_with_LLMs_as_Evolutionary_Optimizers__KbGkWv7VEk.pdf

761.1 KB · 2d ago

Your_Self-Play_Algorithm_is_Secretly_an_Adversarial_Imitator__rL0GEyoMvE.pdf

416.7 KB · 2d ago

Self-Adapting_Agents_for_Automating_Research_Coding_Workflows__ihcRmUkXHF.pdf

597.1 KB · 2d ago

Self-EvolveRec_Self-Evolving_Recommender_Systems_with_LLM-based_Directional_Feedback__KCcpSuocz0.pdf

2.2 MB · 2d ago

Shape_of_Thought_When_Distribution_Matters_More_than_Correctness_in_Reasoning_Tasks__Oc5R3D2TfE.pdf

1.5 MB · 2d ago

Verifying_the_Verifiers_Failure_Attribution_for_Agentic_Benchmark_Diagnostics_and_Training_Data_Curation__iRhaK8PsuB.pdf

205.8 KB · 2d ago

Correct_Reasoning_Paths_Visit_Shared_Decision_Pivots__OlMSrldTNe.pdf

620.8 KB · 2d ago

CoT-Seg_Rethinking_Segmentation_with_Chain-of-Thought_Reasoning_and_Self-Correction__BtHRvwzJj8.pdf

9.9 MB · 2d ago

POLARIS_A_Godel_Agent_Framework_for_Small_Language_Models_Through_Experience_Abstracted_Policy_Repair__wlwkLVvB0W.pdf

1.1 MB · 2d ago

MAPPA_Scaling_Multiagent_Systems_with_Process_Rewards__s06wgoO65a.pdf

9.5 MB · 2d ago

Unlocking_Intrinsic_Self-Reflection_for_LLM_Preference_Policy_Optimization__0ZGL40jRKE.pdf

2.6 MB · 2d ago

A_Task-Centric_Theory_for_Iterative_Self-Improvement_with_Easy-to-Hard_Curricula__EEc4Pn2GSa.pdf

2.3 MB · 2d ago

Reasoning_Within_the_Mind_Dynamic_Multimodal_Interleaving_in_Latent_Space__fWrhAOQZ5A.pdf

10.4 MB · 2d ago

Theory-Driven_Modeling_and_LLM-Guided_Evolution_for_Power_System_Scheduling__djdSNvWpJ7.pdf

701.9 KB · 2d ago

SkillRL_Evolving_Agents_via_Recursive_Skill-Augmented_Reinforcement_Learning__56D2hjARkn.pdf

4.5 MB · 2d ago

One-Step_Video_Depth_Estimation_via_Self-Distillation__ucRjPd9HVU.pdf

5.8 MB · 2d ago

Reasoning_Cache_Learning_to_Extrapolate_to_Long_Lengths_via_Short-Length_RL__DROMQyqM52.pdf

927.2 KB · 2d ago

Residual_Off-Policy_RL_for_Finetuning_Behavior_Cloning_Policies__9m4kM3bK2b.pdf

5.1 MB · 2d ago

Structure_Enables_Effective_Self-Localization_of_Errors_in_LLMs__QbL99Fyqsl.pdf

926.8 KB · 2d ago

RFTF_Reinforcement_Fine-tuning_for_Vision-language-action_Models_with_Temporal_Feedback__mBgcG43j7a.pdf

1.3 MB · 2d ago

Escaping_Model_Collapse_via_Synthetic_Data_Verification__YzDC5hjGUM.pdf

2.6 MB · 2d ago

SAHOO_Safeguarded_Alignment_for_High-Order_Optimization_Objectives_in_Recursive_Self-Improvement__OAFPpQO0H9.pdf

3.5 MB · 2d ago

Unrolled_Policy_Iteration_for_Tiny_Recursive_Models__rIzREKws05.pdf

377.2 KB · 2d ago

Agent0_Unleashing_Self-Evolving_Agents_from_Zero_Data_via_Tool-Integrated_Reasoning__hYYeOl58xi.pdf

7.2 MB · 2d ago

Learning_to_Continually_Learn_via_Meta-learning_Agentic_Memory_Designs__sOq52KnJmR.pdf

21.0 MB · 2d ago

PostTrainBench_Can_LLM_Agents_Automate_LLM_Post-Training__FJKOIxkUxo.pdf

232.8 KB · 2d ago

Contextual_Drag_How_Errors_in_the_Context_Affect_LLM_Reasoning__zpiYsPVDlV.pdf

1.0 MB · 2d ago

Can_Current_Language_Models_Close_the_Discovery_to_Application_Loop__BAdK20xfqj.pdf

2.7 MB · 2d ago

Interestingness_as_an_Inductive_Heuristic_for_Future_Compression_Progress__6GTlSlWW9C.pdf

1.3 MB · 2d ago

Adaptive_Meta-Curriculum_for_Test-Time_Self-Improvement__GjoUJTfXiW.pdf

574.9 KB · 2d ago

SimpleMem_Efficient_Lifelong_Memory_for_LLM_Agents__oYHelQ3Edd.pdf

755.7 KB · 2d ago

Self-Improving_World_Models_via_Asymmetric_Forward-Inverse_Consistency__ajcjip0yFR.pdf

13.0 MB · 2d ago

Towards_Execution-Grounded_Automated_AI_Research__gpLJamvbsK.pdf

896.6 KB · 2d ago

Lang-PINN_From_Language_to_Physics-Informed_Neural_Networks_via_a_Multi-Agent_Framework__q5qN3oQ4D1.pdf

1.2 MB · 2d ago

VLAW_Iterative_Co-Improvement_of_Vision-Language-Action_Policy_and_World_Model__Ro0eQ0ly3q.pdf

5.4 MB · 2d ago

Presenting_a_Paper_is_an_Art_Self-Improvement_Aesthetic_Agents_for_Academic_Presentations__d40v7Qcpi4.pdf

34.8 MB · 2d ago

GASP_Guided_Asymmetric_Self-Play_For_Coding_LLMs__NYrOkAfDkP.pdf

1.3 MB · 2d ago

Self-Evolving_Rubrics_Interpretable_Instance-Level_Criteria_for_Scalable_RL__aA2PXFH2Cp.pdf

2.1 MB · 2d ago

Language_Self-Play_For_Data-Free_Training__uB8YQHNsh6.pdf

612.4 KB · 2d ago

Self-Improving_Vision-Language-Action_Models_with_Data_Generation_via_Residual_RL__shqo2VPfc7.pdf

24.4 MB · 2d ago

ACE_Self-Evolving_LLM_Coding_Framework_Adversarial_Unit_Test_Generation_and_Preference_Optimization__ecKAmz5vlO.pdf

1.2 MB · 2d ago

CausalEvolve_Towards_Open-Ended_Discovery_with_Causal_Scratchpad__14jSctSh0D.pdf

419.4 KB · 2d ago

Knowledge_is_Not_Enough_Injecting_RL_Skills_for_Continual_Adaptation__D93migx9av.pdf

865.6 KB · 2d ago

Test-Time_Self-Distillation__iEmRSwdzyw.pdf

890.8 KB · 2d ago

Anchored_Self-Play_for_Code_Repair__lTbBFAoPSA.pdf

9.2 MB · 2d ago

From_Growing_to_Looping_A_Unified_View_of_Iterative_Computation_in_LLMs__yIDgVx3OoN.pdf

743.6 KB · 2d ago

Tiny_Autoregressive_Recursive_Models__aY5kmaNrwB.pdf

558.2 KB · 2d ago

Can_Language_Models_Discover_Scaling_Laws__Nj6VGY4dej.pdf

1.5 MB · 2d ago