[B! ascend910] rawwell's Bookmarks

rawwell id:rawwell

rawwell's bookmarks about ascend910 (2)

PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
rawwell 2025/05/14
Tags: PanGu, LLM, ascend910
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
The scaling of large language models has greatly improved natural language understanding, generation, and reasoning. In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors using the MindSpore framework, and present the resulting language model, named PanGu-Σ, with 1.085T parameters. With parameters inherited from PanGu-α, we extend the dense Transfo
rawwell 2025/05/14
Tags: PanGu, LLM, ascend910
