You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
👉 With Descript Audio Codec, you can compress **44.1 KHz audio** into discrete codes at a **low 8 kbps bitrate**. <br>
13
+
🤌 That's approximately **90x compression** while maintaining exceptional fidelity and minimizing artifacts. <br>
14
+
💪 Our universal model works on all domains (speech, environment, music, etc.), making it widely applicable to generative modeling of all audio. <br>
15
+
👌 It can be used as a drop-in replacement for EnCodec for all audio language modeling applications (such as AudioLMs, MusicLMs, MusicGen, etc.) <br>
16
+
17
+
<palign="center">
18
+
<imgsrc="./assets/comparsion_stats.png"alt="Comparison of compressions approaches. Our model achieves a higher compression factor compared to all baseline methods. Our model has a ~90x compression factor compared to 32x compression factor of EnCodec and 64x of SoundStream. Note that we operate at a target bitrate of 8 kbps, whereas EnCodec operates at 24 kbps and SoundStream at 6 kbps. We also operate at 44.1 kHz, whereas EnCodec operates at 48 kHz and SoundStream operates at 24 kHz."width=35%></p>
9
19
10
20
11
21
## Usage
@@ -17,6 +27,16 @@ cd descript-audio-codec
17
27
pip install .
18
28
```
19
29
30
+
### Weights
31
+
Weights are released as part of this repo under MIT license.
32
+
They are automatically downloaded when you first run `encode` or `decode` command. They can be cached locally with
33
+
```
34
+
python3 -m dac download
35
+
```
36
+
We provide a Dockerfile that installs all required dependencies for encoding and decoding. The build process caches model weights inside the image. This allows the image to be used without an internet connection. [Please refer to instructions below.](#docker-image)
0 commit comments