-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy path.zenodo.json
More file actions
27 lines (27 loc) · 909 Bytes
/
.zenodo.json
File metadata and controls
27 lines (27 loc) · 909 Bytes
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
{
"title": "ElasticKV: Exploiting Attention Haze for Structure-Preserving KV-Cache Compression",
"description": "ElasticKV is a training-free KV-cache sparsification method for llama.cpp that removes low-magnitude, low-selectivity attention haze via per-head adaptive thresholds with sink token protection. This archival release bundles the CUDA hook, paper source, raw artifacts, and analysis scripts used in the accompanying manuscript.",
"creators": [
{
"name": "Camargo, Guilherme de",
"orcid": "0009-0004-8913-9419",
"affiliation": "Independent Researcher",
"type": "ProjectMember"
}
],
"keywords": [
"kv-cache",
"llama.cpp",
"llm",
"cuda",
"long-context",
"sparsity",
"attention",
"reproducibility"
],
"license": "apache-2.0",
"upload_type": "software",
"access_right": "open",
"language": "eng",
"version": "0.1.1"
}