-
CARLA Environment Server
πControl a Carla driving simulation with custom actions
-
CARLA Environment Server
πControl a CARLA driving simulator with custom actions
-
Carla Grpo Trolley
πVisualize your programβs I/O activity in real time
-
sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B β’ Updated β’ 5
Sergio Paniego PRO
AI & ML interests
Recent Activity
Organizations
- RunningAgents41
comparevlms
π41Compare Vision Language Models
- Running on ZeroAgents67
OCR Time Machine
π67Extract text from images and XML files using OCR models
- RunningAgents26
Compare Docvqa Models
π¦26Compare different visual question answering
- Running on CPU UpgradeAgents23
Compare Clip Siglip
π23Compare strong zero-shot image classification models
-
Qwen/Qwen2.5-Omni-7B
Any-to-Any β’ 11B β’ Updated β’ 778k β’ 1.9k - RunningAgentsFeatured372
Qwen2.5 Omni 7B Demo
π372Chat with text, audio, images, and video, get spoken replies
-
Qwen2.5-Omni Technical Report
Paper β’ 2503.20215 β’ Published β’ 173 -
openbmb/MiniCPM-o-2_6
Any-to-Any β’ 9B β’ Updated β’ 386k β’ 1.29k
- Running3.88k
The Ultra-Scale Playbook
π3.88kThe ultimate guide to training LLM on large GPU Clusters
- Running on CPU UpgradeFeatured3.2k
The Smol Training Playbook
π3.2kThe secrets to building world-class LLMs
- Running325
Evaluation Guidebook
π325Explore LLM benchmark scores over time
- Running225
FineVision: Open Data is All You Need
π225A new open-source dataset for training VLMs
- RunningAgents41
comparevlms
π41Compare Vision Language Models
- Runtime errorAgents4
Gemma3 License Plate Detection
π4Gemma 3 for license plate detection
- Running on ZeroAgentsFeatured143
Gemma 3n E4B It
β‘143Chat with an AI that understands text, images, video, and audio
- Running on ZeroAgentsFeatured42
Moondream3
π’42Image and video tasks with moondream3.
- Runtime errorRL
CARLA Environment Server
πControl a Carla driving simulation with custom actions
- Runtime errorRL
CARLA Environment Server
πControl a CARLA driving simulator with custom actions
- SleepingAgents
Carla Grpo Trolley
πVisualize your programβs I/O activity in real time
-
sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B β’ Updated β’ 5
- Running3.88k
The Ultra-Scale Playbook
π3.88kThe ultimate guide to training LLM on large GPU Clusters
- Running on CPU UpgradeFeatured3.2k
The Smol Training Playbook
π3.2kThe secrets to building world-class LLMs
- Running325
Evaluation Guidebook
π325Explore LLM benchmark scores over time
- Running225
FineVision: Open Data is All You Need
π225A new open-source dataset for training VLMs
- RunningAgents41
comparevlms
π41Compare Vision Language Models
- Running on ZeroAgents67
OCR Time Machine
π67Extract text from images and XML files using OCR models
- RunningAgents26
Compare Docvqa Models
π¦26Compare different visual question answering
- Running on CPU UpgradeAgents23
Compare Clip Siglip
π23Compare strong zero-shot image classification models
- RunningAgents41
comparevlms
π41Compare Vision Language Models
- Runtime errorAgents4
Gemma3 License Plate Detection
π4Gemma 3 for license plate detection
- Running on ZeroAgentsFeatured143
Gemma 3n E4B It
β‘143Chat with an AI that understands text, images, video, and audio
- Running on ZeroAgentsFeatured42
Moondream3
π’42Image and video tasks with moondream3.
-
Qwen/Qwen2.5-Omni-7B
Any-to-Any β’ 11B β’ Updated β’ 778k β’ 1.9k - RunningAgentsFeatured372
Qwen2.5 Omni 7B Demo
π372Chat with text, audio, images, and video, get spoken replies
-
Qwen2.5-Omni Technical Report
Paper β’ 2503.20215 β’ Published β’ 173 -
openbmb/MiniCPM-o-2_6
Any-to-Any β’ 9B β’ Updated β’ 386k β’ 1.29k