arxiv:2602.06948

Agentic Uncertainty Reveals Agentic Overconfidence

Published on Feb 6

Abstract

AI agents demonstrate systematic overconfidence in predicting task success, with pre-execution assessments sometimes outperforming post-execution reviews, though adversarial prompting improves calibration.

AI-generated summary

Can AI agents predict whether they will succeed at a task? We study agentic uncertainty by eliciting success probability estimates before, during, and after task execution. All results exhibit agentic overconfidence: some agents that succeed only 22% of the time predict 77% success. Counterintuitively, pre-execution assessment with strictly less information tends to yield better discrimination than standard post-execution review, though differences are not always significant. Adversarial prompting that reframes assessment as bug-finding achieves the best calibration.
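The abstract separates calibration (do the stated probabilities match observed success rates?) from discrimination (do successful tasks get higher estimates than failed ones?). The sketch below illustrates that distinction with three common measures: a mean overconfidence gap, the Brier score, and a rank-based AUROC. This page does not say which metrics the paper uses, so the function names, the choice of metrics, and the toy data are all assumptions made for illustration. By the first measure, the abstract's example of 77% predicted success against 22% actual success would be a gap of 55 percentage points.

```python
from typing import Sequence


def overconfidence_gap(predicted: Sequence[float], succeeded: Sequence[bool]) -> float:
    """Mean predicted success probability minus the observed success rate.

    Positive values mean the agent is overconfident on average.
    (Hypothetical helper; not taken from the paper.)
    """
    mean_predicted = sum(predicted) / len(predicted)
    success_rate = sum(1 for s in succeeded if s) / len(succeeded)
    return mean_predicted - success_rate


def brier_score(predicted: Sequence[float], succeeded: Sequence[bool]) -> float:
    """Mean squared error between predicted probability and the 0/1 outcome.

    Lower is better; it reflects both calibration and discrimination.
    """
    total = sum((p - (1.0 if s else 0.0)) ** 2 for p, s in zip(predicted, succeeded))
    return total / len(predicted)


def auroc(predicted: Sequence[float], succeeded: Sequence[bool]) -> float:
    """Probability that a random success is ranked above a random failure.

    Ties count as half. This measures discrimination only: it is unchanged
    if every estimate is inflated by the same monotonic transformation.
    """
    successes = [p for p, s in zip(predicted, succeeded) if s]
    failures = [p for p, s in zip(predicted, succeeded) if not s]
    if not successes or not failures:
        raise ValueError("AUROC needs at least one success and one failure")
    wins = 0.0
    for p_success in successes:
        for p_failure in failures:
            if p_success > p_failure:
                wins += 1.0
            elif p_success == p_failure:
                wins += 0.5
    return wins / (len(successes) * len(failures))


if __name__ == "__main__":
    # Toy data, invented for illustration: four tasks, one success.
    estimates = [0.9, 0.8, 0.7, 0.6]
    outcomes = [True, False, False, False]
    print("overconfidence gap:", round(overconfidence_gap(estimates, outcomes), 3))
    print("Brier score:", round(brier_score(estimates, outcomes), 3))
    print("AUROC:", round(auroc(estimates, outcomes), 3))
```

In this toy case the estimates rank the single success above every failure, so discrimination is perfect, yet the average estimate far exceeds the success rate. That is how an agent can discriminate well and still be overconfident.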
