Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456 Paper page - Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening
https://arxivexplained.com/papers/spider-sense-intrinsic-risk-sensing-for-efficient-agent-defense-with-hierarchical-adaptive-screening\n","updatedAt":"2026-02-06T21:05:06.239Z","author":{"_id":"65d9fc2a0e6ad24551d87a1e","avatarUrl":"/avatars/3aedb9522cc3cd08349d654f523fd792.svg","fullname":"Grant Singleton","name":"grantsing","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":4,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7031875252723694},"editors":["grantsing"],"editorAvatarUrls":["/avatars/3aedb9522cc3cd08349d654f523fd792.svg"],"reactions":[],"isReport":false}},{"id":"6986579c661b7c0fa18fb234","author":{"_id":"65243980050781c16f234f1f","avatarUrl":"/avatars/743a009681d5d554c27e04300db9f267.svg","fullname":"Avi","name":"avahal","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"createdAt":"2026-02-06T21:05:32.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"arXivLens breakdown of this paper ๐ https://arxivlens.com/PaperView/Details/spider-sense-intrinsic-risk-sensing-for-efficient-agent-defense-with-hierarchical-adaptive-screening-6462-36111177\n- Executive Summary\n- Detailed Breakdown\n- Practical Applications","html":"
\n","updatedAt":"2026-02-06T21:05:32.435Z","author":{"_id":"65243980050781c16f234f1f","avatarUrl":"/avatars/743a009681d5d554c27e04300db9f267.svg","fullname":"Avi","name":"avahal","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6931737661361694},"editors":["avahal"],"editorAvatarUrls":["/avatars/743a009681d5d554c27e04300db9f267.svg"],"reactions":[],"isReport":false}},{"id":"698697c1dfd17b2d3dac9d30","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false},"createdAt":"2026-02-07T01:39:13.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"This is an automated message from the [Librarian Bot](https://huggingface.co/librarian-bots). I found the following papers similar to this paper. \n\nThe following papers were recommended by the Semantic Scholar API \n\n* [VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit](https://huggingface.co/papers/2601.05755) (2026)\n* [FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments](https://huggingface.co/papers/2601.07853) (2026)\n* [SHIELD: An Auto-Healing Agentic Defense Framework for LLM Resource Exhaustion Attacks](https://huggingface.co/papers/2601.19174) (2026)\n* [ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback](https://huggingface.co/papers/2601.10156) (2026)\n* [Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment](https://huggingface.co/papers/2602.03100) (2026)\n* [BackdoorAgent: A Unified Framework for Backdoor Attacks on LLM-based Agents](https://huggingface.co/papers/2601.04566) (2026)\n* [AgenTRIM: Tool Risk Mitigation for Agentic AI](https://huggingface.co/papers/2601.12449) (2026)\n\n\n Please give a thumbs up to this comment if you found it helpful!\n\n If you want recommendations for any Paper on Hugging Face checkout [this](https://huggingface.co/spaces/librarian-bots/recommend_similar_papers) Space\n\n You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: `@librarian-bot recommend`","html":"
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
\n
The following papers were recommended by the Semantic Scholar API
Please give a thumbs up to this comment if you found it helpful!
\n
If you want recommendations for any Paper on Hugging Face checkout this Space
\n
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: \n\n@librarian-bot\n\t recommend
\n","updatedAt":"2026-02-07T01:39:13.935Z","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.719995379447937},"editors":["librarian-bot"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2602.05386","authors":[{"_id":"698599b14ad556f294b7ecdc","name":"Zhenxiong Yu","hidden":false},{"_id":"698599b14ad556f294b7ecdd","user":{"_id":"64aa645404e7b379feccc490","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64aa645404e7b379feccc490/4m8qcdy2OGK8visR5Jjl5.png","isPro":false,"fullname":"Zhi Yang","user":"yangzhi1","type":"user"},"name":"Zhi Yang","status":"claimed_verified","statusLastChangedAt":"2026-02-06T18:50:37.407Z","hidden":false},{"_id":"698599b14ad556f294b7ecde","name":"Zhiheng Jin","hidden":false},{"_id":"698599b14ad556f294b7ecdf","name":"Shuhe Wang","hidden":false},{"_id":"698599b14ad556f294b7ece0","name":"Heng Zhang","hidden":false},{"_id":"698599b14ad556f294b7ece1","name":"Yanlin Fei","hidden":false},{"_id":"698599b14ad556f294b7ece2","name":"Lingfeng Zeng","hidden":false},{"_id":"698599b14ad556f294b7ece3","name":"Fangqi Lou","hidden":false},{"_id":"698599b14ad556f294b7ece4","name":"Shuo Zhang","hidden":false},{"_id":"698599b14ad556f294b7ece5","name":"Tu Hu","hidden":false},{"_id":"698599b14ad556f294b7ece6","name":"Jingping Liu","hidden":false},{"_id":"698599b14ad556f294b7ece7","name":"Rongze Chen","hidden":false},{"_id":"698599b14ad556f294b7ece8","name":"Xingyu Zhu","hidden":false},{"_id":"698599b14ad556f294b7ece9","name":"Kunyi Wang","hidden":false},{"_id":"698599b14ad556f294b7ecea","name":"Chaofa Yuan","hidden":false},{"_id":"698599b14ad556f294b7eceb","name":"Xin Guo","hidden":false},{"_id":"698599b14ad556f294b7ecec","name":"Zhaowei Liu","hidden":false},{"_id":"698599b14ad556f294b7eced","name":"Feipeng Zhang","hidden":false},{"_id":"698599b14ad556f294b7ecee","name":"Jie Huang","hidden":false},{"_id":"698599b14ad556f294b7ecef","name":"Huacan Wang","hidden":false},{"_id":"698599b14ad556f294b7ecf0","name":"Ronghao Chen","hidden":false},{"_id":"698599b14ad556f294b7ecf1","name":"Liwen Zhang","hidden":false}],"publishedAt":"2026-02-05T07:11:05.000Z","submittedOnDailyAt":"2026-02-06T05:15:48.526Z","title":"Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening","submittedOnDailyBy":{"_id":"64aa645404e7b379feccc490","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64aa645404e7b379feccc490/4m8qcdy2OGK8visR5Jjl5.png","isPro":false,"fullname":"Zhi Yang","user":"yangzhi1","type":"user"},"summary":"As large language models (LLMs) evolve into autonomous agents, their real-world applicability has expanded significantly, accompanied by new security challenges. Most existing agent defense mechanisms adopt a mandatory checking paradigm, in which security validation is forcibly triggered at predefined stages of the agent lifecycle. In this work, we argue that effective agent security should be intrinsic and selective rather than architecturally decoupled and mandatory. We propose Spider-Sense framework, an event-driven defense framework based on Intrinsic Risk Sensing (IRS), which allows agents to maintain latent vigilance and trigger defenses only upon risk perception. Once triggered, the Spider-Sense invokes a hierarchical defence mechanism that trades off efficiency and precision: it resolves known patterns via lightweight similarity matching while escalating ambiguous cases to deep internal reasoning, thereby eliminating reliance on external models. To facilitate rigorous evaluation, we introduce S^2Bench, a lifecycle-aware benchmark featuring realistic tool execution and multi-stage attacks. Extensive experiments demonstrate that Spider-Sense achieves competitive or superior defense performance, attaining the lowest Attack Success Rate (ASR) and False Positive Rate (FPR), with only a marginal latency overhead of 8.3\\%.","upvotes":69,"discussionId":"698599b14ad556f294b7ecf2","githubRepo":"https://github.com/aifinlab/Spider-Sense","githubRepoAddedBy":"user","ai_summary":"Spider-Sense framework provides intrinsic and selective agent security through event-driven defense with intrinsic risk sensing, achieving low attack success and false positive rates with minimal latency overhead.","ai_keywords":["large language models","autonomous agents","security challenges","mandatory checking paradigm","event-driven defense","Intrinsic Risk Sensing","hierarchical defense mechanism","lightweight similarity matching","deep internal reasoning","lifecycle-aware benchmark","Attack Success Rate","False Positive Rate"],"githubStars":14,"organization":{"_id":"696875114bc2a5524dd8fcb7","name":"AIFin-Lab","fullname":"AIFin Lab","avatar":"https://cdn-uploads.huggingface.co/production/uploads/69670d031053fc18e0ac011e/58oVnjWlRuiLy6X-GsMl6.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"64aa645404e7b379feccc490","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64aa645404e7b379feccc490/4m8qcdy2OGK8visR5Jjl5.png","isPro":false,"fullname":"Zhi Yang","user":"yangzhi1","type":"user"},{"_id":"662911a202f5ad9a5195932f","avatarUrl":"/avatars/3c7db9bf9c1d95360b62fe4f56ee9c3a.svg","isPro":false,"fullname":"Tu Hu","user":"Blackteaxxx","type":"user"},{"_id":"6961c5e9607103f8046d103d","avatarUrl":"/avatars/296ca1cdfc8d0925eb7450325c1f27de.svg","isPro":false,"fullname":"Jinzh","user":"J-stan-zh","type":"user"},{"_id":"64b78eb76ab5d14ca7faac87","avatarUrl":"/avatars/cc847b8c8bf8cc2537e03ee218628396.svg","isPro":false,"fullname":"CristianoC","user":"CristianoC20","type":"user"},{"_id":"68e5cd2af7b5b87f951fdb13","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Cuf7wio5ENpxWys6fNa3W.png","isPro":false,"fullname":"CHENG ZIMING","user":"HarrytheOrange2","type":"user"},{"_id":"69674549666228b695202137","avatarUrl":"/avatars/47ac3b87d395856b3dd1a9f24af82c25.svg","isPro":false,"fullname":"Han Jun","user":"Junqwef","type":"user"},{"_id":"673c9bc630316b2f3d7c2efd","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/zQX2REs4WVnpbdqi3a1Eq.png","isPro":false,"fullname":"zenglingfeng","user":"uu531","type":"user"},{"_id":"67e787cd05d7355e47634b0c","avatarUrl":"/avatars/cad004380557ba51c88d9a2bb659d938.svg","isPro":false,"fullname":"LE CHANG","user":"BeetleSpike","type":"user"},{"_id":"6985afb31684d458c221eef3","avatarUrl":"/avatars/02620eff84eb4b5d59f71a9bd1367b32.svg","isPro":false,"fullname":"zhangyifei","user":"winy520","type":"user"},{"_id":"64ad555cad6218d51a0eb1ca","avatarUrl":"/avatars/7c93be15117f3d26b7630910fc310e75.svg","isPro":false,"fullname":"lzw-+","user":"lzwLZW","type":"user"},{"_id":"66a0ab4923e426e19db92773","avatarUrl":"/avatars/19517dd085a3e48e644613ca0b2c3753.svg","isPro":false,"fullname":"ronghaochen","user":"cristiano28","type":"user"},{"_id":"687f601b7170fd281b898c0f","avatarUrl":"/avatars/d77ceb1dca5115d50abb1bb300d05209.svg","isPro":false,"fullname":"yifan","user":"dongyifan","type":"user"}],"acceptLanguages":["*"],"dailyPaperRank":2,"organization":{"_id":"696875114bc2a5524dd8fcb7","name":"AIFin-Lab","fullname":"AIFin Lab","avatar":"https://cdn-uploads.huggingface.co/production/uploads/69670d031053fc18e0ac011e/58oVnjWlRuiLy6X-GsMl6.png"}}">
Spider-Sense framework provides intrinsic and selective agent security through event-driven defense with intrinsic risk sensing, achieving low attack success and false positive rates with minimal latency overhead.
AI-generated summary
As large language models (LLMs) evolve into autonomous agents, their real-world applicability has expanded significantly, accompanied by new security challenges. Most existing agent defense mechanisms adopt a mandatory checking paradigm, in which security validation is forcibly triggered at predefined stages of the agent lifecycle. In this work, we argue that effective agent security should be intrinsic and selective rather than architecturally decoupled and mandatory. We propose Spider-Sense framework, an event-driven defense framework based on Intrinsic Risk Sensing (IRS), which allows agents to maintain latent vigilance and trigger defenses only upon risk perception. Once triggered, the Spider-Sense invokes a hierarchical defence mechanism that trades off efficiency and precision: it resolves known patterns via lightweight similarity matching while escalating ambiguous cases to deep internal reasoning, thereby eliminating reliance on external models. To facilitate rigorous evaluation, we introduce S^2Bench, a lifecycle-aware benchmark featuring realistic tool execution and multi-stage attacks. Extensive experiments demonstrate that Spider-Sense achieves competitive or superior defense performance, attaining the lowest Attack Success Rate (ASR) and False Positive Rate (FPR), with only a marginal latency overhead of 8.3\%.