EffNet heads from 2022-08-25 declare input 1200; other heads and the EffNet embedding are 1280 #1490

@xiaden

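All EffNet classifier sidecars dated 2022-08-25 declare an input shape of 1200, while every other head (nsynth*, approachability/engagement) declares 1280 or batch_size,1280. The 1280 group includes heads both older (2022-06-16) and newer (2023) than the 1200 group.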

The EffNet embedding on my end is 1280-D. Please confirm whether the 1200 in the 2022-08-25 sidecars is a typo. If so, could the sidecars be republished with input 1280?

Models showing 1200 (all dated 2022-08-25): mood_acoustic, mood_aggressive, mood_electronic, mood_happy, mood_party, mood_relaxed, mood_sad, genre_electronic, danceability_classifier, gender, tonal_atonal, voice_instrumental.
Models showing 1280: nsynth_reverb, nsynth_instrument, nsynth_acoustic_electronic (2023-02-05), bright/dark_timbre (2023-11-04), approachability_regression (2022-06-16), engagement_regression (2022-06-16).

Why it matters:
Custom runners that validate shapes either fail or force users to truncate embeddings; if the heads were trained on 1280-D input, truncating to 1200 drops information. Specifically, I'm developing a Docker image that uses the sidecars to automatically tag songs via an API.
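
For illustration, the kind of shape check that trips on these sidecars could look like the sketch below. The function names `declared_feature_dim` and `check_input_shape` are hypothetical and not part of Essentia; the only assumption about the sidecar is that the declared shape is a list such as `[1200]` or `["batch_size", 1280]`, as seen in the output further down.

```python
def declared_feature_dim(shape):
    """Return the last numeric dimension of a declared shape.

    Symbolic entries such as "batch_size" are skipped, so both
    [1280] and ["batch_size", 1280] yield 1280.
    """
    dims = [d for d in shape if isinstance(d, int)]
    if not dims:
        raise ValueError(f"no numeric dimension in declared shape {shape!r}")
    return dims[-1]


def check_input_shape(declared_shape, embedding_shape):
    """Raise ValueError if the embedding width differs from the declared width."""
    expected = declared_feature_dim(declared_shape)
    actual = embedding_shape[-1]
    if actual != expected:
        raise ValueError(
            f"head declares input width {expected}, embedding has width {actual}"
        )


# Hypothetical usage: a 1280-D embedding checked against the three
# declared shapes that appear in the sidecars.
embedding_shape = (1, 1280)
for declared in ([1200], [1280], ["batch_size", 1280]):
    try:
        check_input_shape(declared, embedding_shape)
        print(declared, "accepted")
    except ValueError as err:
        print(declared, "rejected:", err)
```

A strict validator of this kind rejects every 2022-08-25 head even though the embedding itself is unchanged, which is why the declared value matters.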

Raw output from scanning the JSON sidecars for declared shape and release date:

# Walk every sidecar JSON and report type, name, release date, and declared shape.
Get-ChildItem -Recurse **redacted (folder structure)** | ForEach-Object {
    $content = Get-Content $_.FullName -Raw | ConvertFrom-Json
    # Heads reference an embedding model; anything without one is treated as the embedding model.
    $embedding = if ($content.inference.embedding_model) { $content.inference.embedding_model.model_name } else { 'EMBEDDING' }
    # Prefer the first declared input shape; fall back to the output tagged 'embeddings'.
    $shape = if ($content.schema.inputs) {
        $content.schema.inputs[0].shape -join ','
    } else {
        ($content.schema.outputs | Where-Object { $_.output_purpose -eq 'embeddings' }).shape -join ','
    }
    [PSCustomObject]@{
        Type        = if ($embedding -eq 'EMBEDDING') { 'EMB' } else { 'HEAD' }
        Name        = $content.name
        ReleaseDate = $content.release_date
        Shape       = $shape
    }
} | Sort-Object Type,ReleaseDate | Format-Table -AutoSize

Type Name                           ReleaseDate Shape
---- ----                           ----------- -----
EMB  EffnetDiscogs                  2022-02-17  64,128,96
HEAD nsynth reverb                  02/05/2023  batch_size,1280
HEAD nsynth instrument              02/05/2023  batch_size,1280
HEAD nsynth acoustic electronic     02/05/2023  batch_size,1280
HEAD nsynth bright/dark timre       11/04/2023  batch_size,1280
HEAD approachability_regression     2022-06-16  1280
HEAD engagement_regression          2022-06-16  1280
HEAD genre electronic               2022-08-25  1200
HEAD mood sad                       2022-08-25  1200
HEAD mood relaxed                   2022-08-25  1200
HEAD mood party                     2022-08-25  1200
HEAD mood happy                     2022-08-25  1200
HEAD mood electronic                2022-08-25  1200
HEAD mood aggressive                2022-08-25  1200
HEAD mood acoustic                  2022-08-25  1200
HEAD gender                         2022-08-25  1200
HEAD danceability classifier        2022-08-25  1200
HEAD tonal/atonal                   2022-08-25  1200
HEAD voice/instrumental classifiers 2022-08-25  1200
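
For anyone reproducing this without PowerShell, a rough Python equivalent of the scan is sketched below. It relies only on the JSON fields the command above reads (`inference.embedding_model`, `schema.inputs[0].shape`, `name`, `release_date`); the function name `group_heads_by_input_shape` and the directory argument are hypothetical.

```python
import json
from collections import defaultdict
from pathlib import Path


def group_heads_by_input_shape(root):
    """Map each declared input shape to the heads that declare it.

    Sidecars without inference.embedding_model are treated as embedding
    models and skipped, mirroring the EMB/HEAD split in the table above.
    """
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob("*.json")):
        meta = json.loads(path.read_text(encoding="utf-8"))
        if not meta.get("inference", {}).get("embedding_model"):
            continue  # not a classifier head
        inputs = meta.get("schema", {}).get("inputs") or []
        if not inputs:
            continue  # no declared input to compare
        shape = ",".join(str(d) for d in inputs[0].get("shape", []))
        groups[shape].append((meta.get("name"), meta.get("release_date")))
    return dict(groups)
```

Calling `group_heads_by_input_shape` on the sidecar directory returns a dictionary keyed by shape string (for example `"1200"` or `"batch_size,1280"`), which makes it easy to see which heads fall into each group.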
