SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation.
Snowflake
company
Verified
AI & ML interests
None defined yet.
Recent Activity
Collections
3
A collection of text embedding models optimized for retrieval accuracy and efficiency
-
Snowflake/snowflake-arctic-embed-m
Sentence Similarity • Updated • 449k • 147 -
Snowflake/snowflake-arctic-embed-l
Sentence Similarity • Updated • 14.6k • 89 -
Snowflake/snowflake-arctic-embed-m-long
Sentence Similarity • Updated • 47.3k • 33 -
Snowflake/snowflake-arctic-embed-xs
Sentence Similarity • Updated • 103k • 31
models
14
Snowflake/snowflake-arctic-embed-l
Sentence Similarity
•
Updated
•
14.6k
•
89
Snowflake/snowflake-arctic-embed-m-v2.0
Sentence Similarity
•
Updated
•
8.15k
•
47
Snowflake/snowflake-arctic-embed-l-v2.0
Sentence Similarity
•
Updated
•
31k
•
91
Snowflake/snowflake-arctic-embed-m-v1.5
Sentence Similarity
•
Updated
•
23k
•
51
Snowflake/snowflake-arctic-embed-xs
Sentence Similarity
•
Updated
•
103k
•
31
Snowflake/snowflake-arctic-embed-m-long
Sentence Similarity
•
Updated
•
47.3k
•
33
Snowflake/snowflake-arctic-embed-m
Sentence Similarity
•
Updated
•
449k
•
147
Snowflake/Llama-3.1-SwiftKV-405B-Instruct-FP8
Updated
•
97
Snowflake/Llama-3.1-SwiftKV-8B-Instruct-FP8
Updated
•
12
Snowflake/Llama-3.1-SwiftKV-8B-Instruct
Updated
•
115
•
5