Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
tksii (Takashi Ishida)

Takashi Ishida's picture

4

Takashi Ishida

tksii

·

https://takashiishida.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

upvoted a paper 14 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

authored a paper 15 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

View all activity

Organizations

tksii 's papers 5

arxiv:2606.07379

arxiv:2604.02986

arxiv:2510.00841

arxiv:2506.08762

arxiv:2505.18102