- Attention Is All You Need
  Paper • 1706.03762 • Published • 125
- FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
  Paper • 2307.08691 • Published • 9
- Mixtral of Experts
  Paper • 2401.04088 • Published • 161
- Mistral 7B
  Paper • 2310.06825 • Published • 60
Snehasish Barman
sbarman25
AI & ML interests
Machine Learning for Health, AI, Distributed Systems
Recent Activity
upvoted an article about 2 months ago
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use
updated a collection 3 months ago
Audio Stuff
liked a model 3 months ago
RoyalCities/Foundation-1
Organizations
None yet