Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
hmteams (hmTEAMS)
[go: Go Back, main page]

our GitHub repository.

\n

Leaderboard

\n

We test our pretrained language models on various datasets from HIPE-2020, HIPE-2022 and Europeana. The following table\nshows an overview of used datasets.

\n
\n\t\n\t\t\n\n\n\n\n\t\t\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\t
LanguageDatasets
EnglishAjMC - TopRes19th
GermanAjMC - NewsEye - HIPE-2020
FrenchAjMC - ICDAR-Europeana - LeTemps - NewsEye - HIPE-2020
FinnishNewsEye
SwedishNewsEye
DutchICDAR-Europeana
\n
\n

All results can be found in the hmLeaderboard.

\n

Acknowledgements

\n

We thank Luisa März, Katharina Schmid and\nErion Çano for their fruitful discussions about Historical Language Models.

\n

Research supported with Cloud TPUs from Google's TPU Research Cloud (TRC).\nMany Thanks for providing access to the TPUs ❤️

\n","classNames":"hf-sanitized hf-sanitized-MEc_eCreA1zpzi2RZRRQJ"},"users":[{"_id":"5e6a3d4ea9afd5125d9ec064","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1584020801691-noauth.jpeg","isPro":true,"fullname":"Stefan Schweter","user":"stefan-it","type":"user"}],"userCount":1,"collections":[],"datasets":[{"author":"hmteams","downloads":1,"gated":"manual","id":"hmteams/vocab-corpus","lastModified":"2023-08-01T12:05:47.000Z","datasetsServerInfo":{"viewer":"viewer-partial","numRows":1431387010,"libraries":["datasets","mlcroissant"],"formats":["text"],"modalities":["text"]},"private":false,"repoType":"dataset","likes":1,"isLikedByUser":false,"isBenchmark":false}],"models":[{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":8,"gated":false,"id":"hmteams/teams-base-historic-multilingual-generator","availableInferenceProviders":[],"lastModified":"2025-02-03T12:21:28.000Z","likes":1,"pipeline_tag":"fill-mask","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[],"numParameters":68123648},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":5,"gated":false,"id":"hmteams/teams-base-historic-multilingual-discriminator","availableInferenceProviders":[],"lastModified":"2025-02-03T06:56:46.000Z","likes":0,"private":false,"repoType":"model","isLikedByUser":false,"numParameters":110618113},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":0,"gated":false,"id":"hmteams/flair-hipe-2022-newseye-de","availableInferenceProviders":[],"lastModified":"2023-10-18T12:14:49.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":3,"gated":false,"id":"hmteams/flair-hipe-2022-hipe2020-fr","availableInferenceProviders":[],"lastModified":"2023-10-18T12:06:34.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":2,"gated":false,"id":"hmteams/flair-hipe-2022-hipe2020-de","availableInferenceProviders":[],"lastModified":"2023-10-17T22:29:41.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":1,"gated":false,"id":"hmteams/flair-hipe-2022-newseye-sv","availableInferenceProviders":[],"lastModified":"2023-10-17T22:29:36.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":0,"gated":false,"id":"hmteams/flair-hipe-2022-newseye-fi","availableInferenceProviders":[],"lastModified":"2023-10-17T22:29:20.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":0,"gated":false,"id":"hmteams/flair-hipe-2022-newseye-fr","availableInferenceProviders":[],"lastModified":"2023-10-17T22:28:59.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":3,"gated":false,"id":"hmteams/flair-hipe-2022-topres19th-en","availableInferenceProviders":[],"lastModified":"2023-10-17T21:31:11.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]},{"author":"hmteams","authorData":{"_id":"64c8c42db8685df8003bb9c2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5e6a3d4ea9afd5125d9ec064/04uZdGgDkZzjZJKOGB_rt.jpeg","fullname":"hmTEAMS","name":"hmteams","type":"org","isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"downloads":3,"gated":false,"id":"hmteams/flair-hipe-2022-letemps-fr","availableInferenceProviders":[],"lastModified":"2023-10-17T21:30:46.000Z","likes":0,"pipeline_tag":"token-classification","private":false,"repoType":"model","isLikedByUser":false,"widgetOutputUrls":[]}],"paperPreviews":[],"spaces":[],"buckets":[],"numBuckets":0,"numDatasets":1,"numModels":18,"numSpaces":1,"lastOrgActivities":[{"time":"2026-01-30T10:45:21.105Z","user":"stefan-it","userAvatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1584020801691-noauth.jpeg","type":"paper-daily","paper":{"id":"2601.22146","title":"FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale","thumbnailUrl":"https://cdn-thumbnails.huggingface.co/social-thumbnails/papers/2601.22146.png","upvotes":9,"publishedAt":"2026-01-29T18:58:47.000Z","isUpvotedByUser":true}},{"time":"2025-10-27T10:24:47.744Z","user":"stefan-it","userAvatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1584020801691-noauth.jpeg","type":"paper","paper":{"id":"2510.21364","title":"SindBERT, the Sailor: Charting the Seas of Turkish NLP","publishedAt":"2025-10-24T11:48:49.000Z","upvotes":1,"isUpvotedByUser":true}},{"time":"2025-10-17T05:56:03.619Z","user":"stefan-it","userAvatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1584020801691-noauth.jpeg","type":"paper","paper":{"id":"2510.13996","title":"The German Commons - 154 Billion Tokens of Openly Licensed Text for\n German Language Models","publishedAt":"2025-10-15T18:24:26.000Z","upvotes":9,"isUpvotedByUser":true}}],"acceptLanguages":["*"],"canReadRepos":false,"canReadSpaces":false,"blogPosts":[],"currentRepoPage":0,"filters":{},"paperView":false}">

AI & ML interests

Pretraining Historical Multilingual Language Models

Recent Activity

hmTEAMS

Historical Multilingual TEAMS Models. Following languages are currently covered:

  • English (British Library Corpus - Books)
  • German (Europeana Newspaper)
  • French (Europeana Newspaper)
  • Finnish (Europeana Newspaper, Digilib)
  • Swedish (Europeana Newspaper, Digilib)
  • Dutch (Delpher Corpus)
  • Norwegian (NCC Corpus)

More details can be found in our GitHub repository.

Leaderboard

We test our pretrained language models on various datasets from HIPE-2020, HIPE-2022 and Europeana. The following table shows an overview of used datasets.

Language Datasets
English AjMC - TopRes19th
German AjMC - NewsEye - HIPE-2020
French AjMC - ICDAR-Europeana - LeTemps - NewsEye - HIPE-2020
Finnish NewsEye
Swedish NewsEye
Dutch ICDAR-Europeana

All results can be found in the hmLeaderboard.

Acknowledgements

We thank Luisa März, Katharina Schmid and Erion Çano for their fruitful discussions about Historical Language Models.

Research supported with Cloud TPUs from Google's TPU Research Cloud (TRC). Many Thanks for providing access to the TPUs ❤️