Instructions to use ethicalabs/Echo-DSRN-v0.1.3-Embed-Exp with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use ethicalabs/Echo-DSRN-v0.1.3-Embed-Exp with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("ethicalabs/Echo-DSRN-v0.1.3-Embed-Exp", trust_remote_code=True) sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Transformers
How to use ethicalabs/Echo-DSRN-v0.1.3-Embed-Exp with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("ethicalabs/Echo-DSRN-v0.1.3-Embed-Exp", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Echo-DSRN Embedding Model (Echo-DSRN-v0.1.3-Embed-Exp)
This is a high-performance experimental sentence embedding model based on the recurrent-hybrid Echo-DSRN architecture.
It scales linearly ($O(N)$) with sequence length, offering extreme efficiency and sub-millisecond latency on both CPU and GPU.
π Model Details
- Developer: ethicalabs
- Base Model: ethicalabs/Echo-DSRN-114M-v0.1.2
- Architecture: Echo-DSRN Recurrent-Hybrid
- Parameters: 98.26M (98,264,064)
- Embedding Dimension: 2048
π Evaluation Results (MTEB STS)
Spearman Rank Correlation scores on Semantic Textual Similarity (STS) benchmark tasks:
| Benchmark Task | Echo-DSRN (Ours) |
|---|---|
| STS12 | 0.6667 |
| STS13 | 0.7692 |
| STS14 | 0.7683 |
| STS15 | 0.8227 |
| STS16 | 0.7460 |
| STSBenchmark | 0.7293 |
| SICK-R | 0.7876 |
| Average STS | 0.7557 |
β‘ Efficiency and Systems Scaling Profile
Inference performance (latency and peak VRAM allocation) on GPU and CPU configurations across different sequence lengths:
GPU Latency & VRAM Benchmark
| Sequence Length | Echo-DSRN Latency (GPU) | Echo-DSRN VRAM (GPU) |
|---|---|---|
| 128 | 15.93 ms |
516.69 MB |
| 256 | 17.56 ms |
548.44 MB |
| 512 | 32.14 ms |
604.95 MB |
| 1024 | 71.30 ms |
710.96 MB |
| 2048 | 155.26 ms |
932.99 MB |
| 4096 | N/A (OOR) | N/A (OOR) |
CPU Latency Benchmark
| Sequence Length | Echo-DSRN Latency (CPU) |
|---|---|
| 128 | 48.50 ms |
| 256 | 84.94 ms |
| 512 | 160.93 ms |
| 1024 | 328.93 ms |
| 2048 | 727.57 ms |
| 4096 | N/A (OOR) |
Note: 'N/A (OOR)' indicates sequence length exceeds model's maximum position embedding range.
ποΈ Architecture Details
| Property | Value |
|---|---|
| Layers | 8 |
| Hidden Dim | 512 |
| Vocab Size | 32017 |
| Attention Heads | 4 |
π Parameter Breakdown
| Component | Parameters | % of Total |
|---|---|---|
| Total | 98.26M (98,264,064) | 100% |
| Embeddings | 16.39M | 16.68% |
| DSRN Recurrent Blocks | 81.87M | 83.32% |
| Norms & Biases | 512 | 0.00% |
π» Usage
You can load and use this model directly via sentence-transformers:
from sentence_transformers import SentenceTransformer
# Load model with auto-mapping enabled
model = SentenceTransformer("ethicalabs/Echo-DSRN-v0.1.3-Embed-Exp", trust_remote_code=True)
# Encode text to get 2048-dimensional embeddings
sentences = ["The recurrent slow state contains the aligned sequence representations.", "Echo-DSRN has linear complexity."]
embeddings = model.encode(sentences)
print(embeddings.shape) # (2, 2048)
π οΈ Training Procedure
The model was trained in three sequential phases:
- Contrastive Pre-training: Representation space alignment using natural language inference datasets.
- Fine-grained Similarity Tuning: Fine-tuning using semantic textual similarity benchmarks to calibrate similarity scores.
- Multi-Task Generalization Tuning: Training on NLI retrieval and STS semantic similarity.
This model card was automatically generated by scripts/generate_model_card.py.
- Downloads last month
- 1