I ran the Anthropic Misalignment Framework for a few top models and added it to a dataset: cfahlgren1/anthropic-agentic-misalignment-results. You can read the reasoning traces of the models trying to blackmail the user and perform other actions. It's very interesting!!
Reasoning Datasets
Reasoning datasets that are trending 🔥
Tiiny/QWQ-LONGCOT-500K • Viewer • Updated Dec 26, 2024 • 286k • 268 • 124
O1-OPEN/OpenO1-SFT • Viewer • Updated Apr 22, 2025 • 77.7k • 1.16k • 387
FreedomIntelligence/medical-o1-reasoning-SFT • Viewer • Updated Apr 22, 2025 • 90.1k • 7.07k • 1.12k
amphora/QwQ-LongCoT-130K • Viewer • Updated Dec 22, 2024 • 133k • 137 • 152
MLC WebLLM
Phi-3.5-Mini WebLLM ⚡ • Running • 34 • Chat with a model using text input
Qwen-2.5 WebLLM ⚡ • Running • 18 • Chat with a language model using your browser
WebLLM Playground 🏎 • Running • 131 • Build a web app fast with Vite, React, and TypeScript
Qwen 2.5 Code Interpreter 🐍 • Running • 146 • Run code and get instant results with Qwen Code Interpreter
Pinned: Qwen 2.5 Code Interpreter 🐍 • Running • 146 • Run code and get instant results with Qwen Code Interpreter