cool datasets - a Tonic Collection
cool datasets updated 7 days ago
Viewer
• Updated Apr 18, 2024 • 10k • 8.33k
• 539
rahulchakwate/squad-QG-dataset-original Viewer
• Updated Apr 27, 2023 • 87.6k • 14
• 2
Viewer
• Updated May 29, 2023 • 1k • 302
• 10
Viewer
• Updated Mar 4, 2024 • 98.2k • 140k
• 360
rahulchakwate/squad-QG-dataset-shuffled Viewer
• Updated Apr 27, 2023 • 87.6k • 6
Gautam9595/Squad_Translated Viewer
• Updated Apr 8, 2022 • 115k • 15
nreimers/reddit_question_best_answers Viewer
• Updated Jul 13, 2022 • 1.83M • 190
• 17
Preview
• Updated Nov 9, 2023 • 37
• 35
glaiveai/glaive-code-assistant Viewer
• Updated Sep 27, 2023 • 136k • 494
• 100
open-web-math/open-web-math Viewer
• Updated Oct 17, 2023 • 6.32M • 21.5k
• 333
Viewer
• Updated May 15, 2024 • 262k • 8.56k
• 303
Viewer
• Updated Mar 20, 2024 • 183k • 1.1k
• 295
Nexusflow/NexusRaven_API_evaluation Viewer
• Updated Sep 29, 2023 • 1.07k • 651
• 17
alielfilali01/MAD-Main-Test Viewer
• Updated Sep 18, 2023 • 67.1k • 3
• 1
Viewer
• Updated Apr 15, 2024 • 211k • 220
• 138
migtissera/Tess-Coder-v1.0 Viewer
• Updated Dec 16, 2023 • 117k • 37
• 25
Preview
• Updated Apr 14, 2024 • 1.03k
• 5
Viewer
• Updated Jan 11, 2024 • 135k • 9.29k
• 288
QuixiAI/Code-74k-ShareGPT-Vicuna Viewer
• Updated Dec 24, 2023 • 73.9k • 49
• 12
Viewer
• Updated Dec 7, 2023 • 109k • 768
• 61
Viewer
• Updated Mar 26, 2024 • 2.75M • 32.5k
• 391
Preview
• Updated Dec 26, 2023 • 117
• 52
Preview
• Updated Apr 3, 2025 • 235
• 194
wyzelabs/RuleRecommendation Preview
• Updated Nov 2, 2023 • 21
• 18
Updated Feb 12, 2024 • 499
• 7
Viewer
• Updated Feb 7, 2024 • 1.31M • 32
• 24
Updated Jan 18, 2024 • 303
• 6
Locutusque/UltraTextbooks Viewer
• Updated Feb 2, 2024 • 5.52M • 357
• 198
Updated Apr 17, 2024 • 3.03k
• 1.02k
Viewer
• Updated Apr 15, 2025 • 206k • 15k
• 346
Preview
• Updated Mar 3, 2024 • 68
• 49
Locutusque/function-calling-chatml Viewer
• Updated Jul 16, 2024 • 113k • 504
• 175
lilacai/glaive-function-calling-v2-sharegpt Viewer
• Updated Jan 29, 2024 • 113k • 105
• 29
Viewer
• Updated Feb 14, 2024 • 45.4k • 58
• 13
unalignment/comedy-snippets-v0.1 Viewer
• Updated Jan 9, 2024 • 44 • 21
• 10
Viewer
• Updated Feb 27, 2024 • 186M • 6.62k
• 39
Viewer
• Updated Aug 12, 2024 • 31.1M • 17.9k
• 684
Viewer
• Updated Apr 23, 2024 • 5.45B • 14.6k
• 540
Updated Jun 10, 2024 • 71.9k
• 138
Viewer
• Updated Mar 4, 2024 • 7.02k • 115
• 135
FreedomIntelligence/ALLaVA-4V Viewer
• Updated Jun 8, 2025 • 143k • 1.23k
• 96
Viewer
• Updated Mar 11, 2024 • 2M • 37
• 5
Viewer
• Updated Mar 13, 2024 • 1.07k • 37
• 26
CohereLabs/wikipedia-2023-11-embed-multilingual-v3 Viewer
• Updated about 1 month ago • 247M • 62.4k
• 246
Weyaxi/huggingface-spaces-codes Viewer
• Updated Nov 14, 2023 • 19.9k • 4.32k
• 11
Updated Sep 10, 2024 • 7.69k
• 68
Viewer
• Updated Aug 12, 2025 • 16.3k • 5.13k
• 101
Updated Mar 23, 2024 • 1.31k
• 1
Viewer
• Updated Mar 23, 2024 • 1.87k • 42
• 1
Viewer
• Updated May 15, 2024 • 629 • 34
• 10
NousResearch/json-mode-eval Viewer
• Updated Feb 21, 2024 • 100 • 379
• 43
NousResearch/func-calling-eval Viewer
• Updated Jan 30, 2024 • 100 • 35
• 16
Updated Oct 20, 2022 • 32.5k
• 356
Viewer
• Updated Mar 29, 2024 • 3.41M • 9.77k
• 193
Viewer
• Updated Jul 16, 2024 • 101k • 167
• 65
Viewer
• Updated Mar 29, 2024 • 7.1k • 5.96k
• 159
Viewer
• Updated Apr 20, 2023 • 3.35M • 2.59k
• 22
HuggingFaceM4/the_cauldron Viewer
• Updated May 6, 2024 • 1.88M • 33.4k
• 526
Viewer
• Updated Jul 11, 2025 • 52.5B • 649k
• 2.77k
gate369/alpaca-star-ascii Viewer
• Updated Mar 27, 2024 • 387 • 10
• 5
Viewer
• Updated Apr 18, 2024 • 765 • 2.08k
• 122
Viewer
• Updated Nov 11, 2024 • 2.49k • 237
• 10
motherduckdb/duckdb-text2sql-25k Viewer
• Updated Apr 7, 2024 • 25k • 74
• 42
asgaardlab/CommonGameCorruptions Viewer
• Updated Apr 23, 2024 • 7.19k • 24
• 2
Viewer
• Updated Dec 26, 2025 • 8.01M • 73k
• 504
chansung/merged_ds_coding Viewer
• Updated Apr 23, 2024 • 60.6k • 51
• 18
PleIAs/Post-OCR-Correction Viewer
• Updated Jul 7, 2025 • 50.4k • 655
• 135
MemGPT/MemGPT-DPO-Dataset Viewer
• Updated Apr 18, 2024 • 42.3k • 32
• 11
nthakur/swim-ir-monolingual Viewer
• Updated Apr 28, 2024 • 3.17M • 195
• 10
nthakur/swim-ir-cross-lingual Viewer
• Updated Apr 28, 2024 • 15.4M • 239
• 9
Updated Dec 11, 2023 • 262
• 14
Viewer
• Updated Dec 3, 2025 • 31.1k • 1.46k
• 13
AILab-CVC/SEED-Bench-2-plus Viewer
• Updated Apr 27, 2024 • 555 • 138
• 5
bigcode/self-oss-instruct-sc2-exec-filter-50k Viewer
• Updated Nov 4, 2024 • 50.7k • 3.12k
• 106
Viewer
• Updated Sep 11, 2023 • 143k • 288
• 15
masakhane/afriqa-gold-passages Updated Sep 27, 2024 • 45
• 5
masakhane/african-ultrachat Viewer
• Updated Apr 4, 2024 • 55k • 64
• 5
Viewer
• Updated Sep 11, 2023 • 153k • 2.32k
• 12
Viewer
• Updated Nov 1, 2024 • 1.28B • 735
• 58
Updated Jun 26, 2024 • 2.38k
• 381
NousResearch/CharacterCodex Viewer
• Updated Jun 17, 2024 • 15.9k • 133
• 229
Viewer
• Updated Jun 13, 2024 • 433k • 171
• 48
allenai/SciRIFF-train-mix Viewer
• Updated Jun 13, 2024 • 70.7k • 48
• 10
PromptSystematicReview/ThePromptReport Viewer
• Updated Jun 14, 2024 • 83 • 2.32k
• 46
louisbrulenaudet/legalkit Viewer
• Updated Jun 26, 2024 • 53k • 115
• 32
microsoft/MeetingBank-LLMCompressed Viewer
• Updated May 16, 2024 • 5.17k • 165
• 16
Viewer
• Updated Aug 21, 2024 • 17.3k • 1.12k
• 35
microsoft/MeetingBank-QA-Summary Viewer
• Updated May 16, 2024 • 862 • 94
• 15
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1 Viewer
• Updated Jul 3, 2024 • 1M • 179
• 14
Viewer
• Updated Aug 26, 2024 • 2.55M • 8.6k
• 301
Viewer
• Updated Jul 22, 2024 • 486k • 278
• 64
Viewer
• Updated Aug 15, 2024 • 1.75M • 267
• 105
Viewer
• Updated Aug 14, 2024 • 6k • 591
• 201
Viewer
• Updated Jun 20, 2025 • 119k • 6.15k
• 91
CATMuS/medieval-segmentation Viewer
• Updated Jul 22, 2024 • 1.68k • 207
• 7
antoinejeannot/jurisprudence Viewer
• Updated Mar 20, 2025 • 2.12M • 315
• 26
Viewer
• Updated Dec 16, 2024 • 39.5k • 21.8k
• 361
HuggingFaceFW/fineweb-edu Viewer
• Updated Jul 11, 2025 • 3.5B • 355k
• 1.04k
Viewer
• Updated Sep 18, 2024 • 6.91k • 376
• 22
argilla/FinePersonas-v0.1 Viewer
• Updated Dec 11, 2024 • 42.1M • 8.93k
• 409
lmms-lab/LLaVA-Video-178K Viewer
• Updated Oct 11, 2024 • 1.63M • 42.8k
• 192
Updated May 26, 2025 • 580k
• 249
recursal/SuperWikiImage-7M Updated Oct 7, 2024 • 146
• 19
Preview
• Updated Aug 6, 2025 • 7.78k
• 90
Viewer
• Updated Dec 17, 2024 • 826M • 8.9k
• 66
Updated Feb 2, 2025 • 541
• 86
louisbrulenaudet/lemone-docs-embedded Viewer
• Updated Oct 27, 2024 • 16.1k • 39
• 3
naijavoices/naijavoices-dataset Viewer
• Updated Aug 7, 2025 • 1.92M • 1.44k
• 20
Viewer
• Updated Jan 9, 2025 • 12.4M • 1.9k
• 172
Viewer
• Updated Oct 15, 2024 • 824 • 11.6k
• 253
ClovenDoug/150k_keyphrases_labelled Viewer
• Updated Nov 3, 2024 • 2.26M • 68
• 2
Cour-de-cassation/alpaca_ccass_motivations_sommaires_titres Viewer
• Updated Sep 28, 2025 • 19.1k • 22
• 3
microsoft/orca-agentinstruct-1M-v1 Viewer
• Updated Nov 1, 2024 • 1.05M • 1.31k
• 461
alpindale/two-million-bluesky-posts Viewer
• Updated Nov 28, 2024 • 2.11M • 672
• 202
Viewer
• Updated 10 days ago • 115M • 4.4k
• 107
Viewer
• Updated Dec 26, 2024 • 286k • 83
• 124
agibot-world/AgiBotWorld-Alpha Viewer
• Updated Sep 29, 2025 • 49.8M • 10.8k
• 219
DAMO-NLP-SG/multimodal_textbook Updated Mar 17, 2025 • 1.6k
• 164
Viewer
• Updated Jan 9, 2025 • 926k • 746
• 21
bytedance-research/ToolHop Updated 13 days ago • 1.06k
• 21
Benchmark
• Updated Jan 20 • 2.5k • 49.5k
• 784
Viewer
• Updated Mar 17, 2025 • 182k • 563
• 123
ServiceNow-AI/R1-Distill-SFT Viewer
• Updated Feb 8, 2025 • 1.85M • 1.03k
• 318
open-thoughts/OpenThoughts-114k Viewer
• Updated Aug 31, 2025 • 228k • 151k
• 833
tomg-group-umd/alpaca_cleaned_dataset_short Viewer
• Updated Jan 25, 2025 • 32 • 9
• 1
Viewer
• Updated May 31, 2025 • 5.42M • 1.42k
• 4
MaziyarPanahi/M2Lingual-sharegpt Viewer
• Updated Nov 20, 2024 • 174k • 22
• 2
Viewer
• Updated Feb 27 • 4.59k • 391
• 10
Viewer
• Updated Feb 16, 2025 • 307k • 263
• 2
OpenLLM-France/Lucie-Training-Dataset Viewer
• Updated May 27, 2025 • 10.9B • 2.3k
• 35
Viewer
• Updated Dec 21, 2023 • 2.19k • 745
• 26
Josephgflowers/Finance-Instruct-500k Viewer
• Updated Feb 24 • 518k • 1.94k
• 223
facebook/natural_reasoning Viewer
• Updated Feb 21, 2025 • 1.15M • 1.52k
• 562
Updated Jan 18, 2024 • 4.56k
• 60
Viewer
• Updated Jul 17, 2024 • 3.08M • 330
• 7
VanWang/Bespoke_dpo_filter Viewer
• Updated Feb 18, 2025 • 10.1k • 4
• 1
VanWang/Bespoke_dpo_filter_len_long Viewer
• Updated Feb 21, 2025 • 1k • 7
• 1
TheFinAI/Fino1_Reasoning_Path_FinQA Viewer
• Updated Feb 26, 2025 • 5.5k • 5.55k
• 40
Preview
• Updated Nov 2, 2025 • 81
• 4
declare-lab/AlgoPuzzleVQA Viewer
• Updated Feb 26, 2025 • 1.8k • 151
• 9
Viewer
• Updated Mar 17, 2025 • 487k • 2.41k
• 106
Viewer
• Updated Oct 1, 2025 • 205 • 2.14k
• 32
Viewer
• Updated Mar 6, 2025 • 160 • 40
• 4
Viewer
• Updated Jul 7, 2025 • 79.5M • 1.41k
• 23
Preview
• Updated Feb 16 • 571
• 13
Locutusque/Platinum-CoT-v0.1-ShareGPT Viewer
• Updated Feb 15, 2025 • 2.42k • 9
• 1
gretelai/gretel-safety-alignment-en-v1 Viewer
• Updated Dec 17, 2025 • 16.7k • 261
• 22
Locutusque/deeplm-training-data Viewer
• Updated Apr 11, 2025 • 2.17M • 46
• 3
Viewer
• Updated Mar 7, 2025 • 1B • 3.86k
• 31
winglian/codeforces-cot-16k-context Viewer
• Updated Mar 13, 2025 • 24.3k • 44
• 1
glaiveai/reasoning-v1-20m Viewer
• Updated Mar 19, 2025 • 22.2M • 2.53k
• 233
nvidia/Llama-Nemotron-Post-Training-Dataset Viewer
• Updated May 8, 2025 • 3.91M • 2.96k
• 655
nomic-ai/cornstack-python-v1 Viewer
• Updated Mar 27, 2025 • 23.6M • 1.46k
• 21
Viewer
• Updated Mar 28, 2025 • 254k • 4.86k
• 214
Viewer
• Updated May 4, 2025 • 753k • 5.65k
• 536
Viewer
• Updated Mar 5 • 1.15k • 1.16k
• 115
Anthropic/values-in-the-wild Viewer
• Updated Apr 28, 2025 • 6.91k • 888
• 149
Viewer
• Updated May 27, 2025 • 1.98k • 115
• 33
Viewer
• Updated Nov 20, 2024 • 7.5k • 295
• 19
ZennyKenny/tactical-military-reasoning-v.1.0 Viewer
• Updated Apr 25, 2025 • 150 • 294
• 20
nvidia/Nemotron-CrossThink Preview
• Updated May 1, 2025 • 472
• 113
Preview
• Updated Aug 24, 2025 • 3.71k
• 31
a-m-team/AM-DeepSeek-Distilled-40M Viewer
• Updated May 10, 2025 • 11.5M • 2.4k
• 56
open-r1/Mixture-of-Thoughts Viewer
• Updated May 26, 2025 • 699k • 4.22k
• 312
Viewer
• Updated Jun 11, 2025 • 5.82M • 830
• 64
Viewer
• Updated Jun 10, 2025 • 157M • 650
• 56
facebook/seamless-interaction Updated Jul 14, 2025 • 67.8k
• 179
MaziyarPanahi/smoltalk2-sft-no-think Viewer
• Updated Jul 11, 2025 • 1.9M • 52
• 6
facebook/community-alignment-dataset Viewer
• Updated Feb 19 • 90.3k • 140
• 41
interstellarninja/hermes_reasoning_tool_use Viewer
• Updated Dec 26, 2025 • 51k • 534
• 163
Viewer
• Updated Jul 24, 2025 • 1.25M • 9.84k
• 128
MegaScience/TextbookReasoning Viewer
• Updated Jul 24, 2025 • 652k • 781
• 32
HuggingFaceH4/Multilingual-Thinking Viewer
• Updated Aug 7, 2025 • 1k • 13.1k
• 114
motionlabs/fineweb-ultra-mini Viewer
• Updated Aug 19, 2025 • 131k • 18
• 4
Viewer
• Updated Apr 24, 2024 • 168k • 1.39k
• 5
Updated Dec 4, 2024 • 3.91k
• 44
Viewer
• Updated Apr 16, 2024 • 71.4k • 205
• 9
OS-Copilot/OS-Genesis-web-data Updated Mar 17, 2025 • 36
• 8
Updated Sep 27, 2024 • 917
• 30
Preview
• Updated Jan 9 • 3.08k
• 77
nvidia/Nemotron-Post-Training-Dataset-v2 Viewer
• Updated Aug 21, 2025 • 6.34M • 9.67k
• 128
Text Generation
• 8B • Updated Sep 5, 2025 • 685
• 69
continuedev/instinct-data Viewer
• Updated Sep 4, 2025 • 9.04k • 102
• 31
Viewer
• Updated 21 days ago • 476M • 19.9k
• 851
Viewer
• Updated Feb 2 • 5.89M • 4.02k
• 91
Preview
• Updated Oct 10, 2025 • 1.35k
• 50
Viewer
• Updated 7 days ago • 7.09B • 203k
• 90
smolagents/aguvis-stage-2 Viewer
• Updated Sep 5, 2025 • 784k • 16.9k
• 28
nvidia/esm2_uniref_pretraining_data Viewer
• Updated Sep 28, 2025 • 188M • 931
• 7
biglam/doab-metadata-extraction Viewer
• Updated Oct 16, 2025 • 8.09k • 80
• 12
rl-research/dr-tulu-rl-data Viewer
• Updated Nov 25, 2025 • 4.88k • 906
• 12
RUC-DataLab/DataScience-Instruct-500K Viewer
• Updated Oct 21, 2025 • 26.2k • 1.14k
• 72
openbmb/InfLLM-V2-data-5B Viewer
• Updated Oct 25, 2025 • 7.19M • 138
• 33
OpenMed/Medical-Reasoning-SFT-GPT-OSS-120B Viewer
• Updated Dec 12, 2025 • 200k • 385
• 250
allenai/Dolci-Think-RL-7B-Completions-SFT Viewer
• Updated Jan 5 • 636k • 127
• 8
mahdi-ranjbar/math_search_strategy Viewer
• Updated Jan 24, 2025 • 40 • 7
• 1
genrobot2025/10Kh-RealOmin-OpenData Updated about 6 hours ago • 72.4k
• 198
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b Viewer
• Updated Jan 31 • 306k • 2.91k
• 348
Viewer
• Updated Jul 20, 2025 • 1.86M • 6.12k
• 240
Viewer
• Updated 7 days ago • 2.56M • 10.7k
• 221
nvidia/Nemotron-Math-Proofs-v1 Viewer
• Updated Jan 5 • 925k • 1.38k
• 117
Viewer
• Updated Feb 11 • 7.09M • 3.85k
• 176
Viewer
• Updated Apr 23, 2025 • 140 • 95
• 5
BigData-KSU/RS-instructions-dataset Viewer
• Updated Apr 23, 2024 • 73.3k • 52
• 1
henry-07/sentinel-imagery-captions Viewer
• Updated Mar 11, 2025 • 500 • 35
• 1
henry-07/sentinel-image-captions Viewer
• Updated Apr 6, 2025 • 6.01k • 38
• 1