Rationale-aided Efficient 7B size Large Language and Vision Models. Let's enjoy it!
Byung-Kwan Lee
BK-Lee
AI & ML interests
Vision-Language Models
Recent Activity
upvoted
a
paper
about 1 month ago
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
upvoted
a
paper
about 1 month ago
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
upvoted
a
paper
about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization