added modified MFCC features based on DNN-c and fDNN-c features; it i… #2908

pegahgh · 2018-12-12T06:21:37Z

…s activated using --modified option.

…fied-mel-kaldi

RuABraun · 2018-12-30T02:56:13Z

Any preprint available of the paper mentioned?

danpovey

The documentation seems to be out of date w.r.t. the code here.
Can you please let me know if this configuration is the one that you are currently recommending, or did you change it somehow since this?

danpovey · 2018-12-30T23:54:19Z

src/feat/mel-computations.h

@@ -48,14 +48,16 @@ struct MelBanksOptions {
  BaseFloat vtln_low;  // vtln lower cutoff of warping function.
  BaseFloat vtln_high;  // vtln upper cutoff of warping function: if negative, added
                        // to the Nyquist frequency to get the cutoff.
+  bool modified;       // If true, use 'modified' MFCC, which uses a breakpoint of
+                       // 900 instead of 700.


changes to documentation needed here

danpovey · 2018-12-30T23:54:59Z

src/feat/mel-computations.h

@@ -69,6 +71,13 @@ struct MelBanksOptions {
    opts->Register("vtln-high", &vtln_high,
                   "High inflection point in piecewise linear VTLN warping function"
                   " (if negative, offset from high-mel-freq");
+    opts->Register("modified", &modified,
+                   "Modified MFCCs, based on paper 'An alternative to MFCCs for ASR' "


Please update this documentation to be accurate and fix typos. (the stuff about 1nt and 2nd formant isn't accurate any more, I believe).

danpovey · 2018-12-30T23:55:09Z

src/feat/mel-computations.cc

+  a lot of bins, their diamter is defined by a formula and it's a function of
+  the center frequency f of the bin:
+     diameter = 30 + 60 f / (f + 500).
+  so it increases from 30Hz to 90Hz with a knee around 500Hz.


This documentation seems a bit out of date.

danpovey · 2018-12-31T00:02:46Z

src/feat/mel-computations.h

+  // breakpoint_ is 700 for normal mel, or 900 for modified.
+  inline BaseFloat InverseMelScale(BaseFloat mel_freq) {
+    if (sec_breakpoint_ > 0.0)
+      return 3500.0 * (expf((expf(mel_freq) - breakpoint_) / 3500.0) - 1.0);


I imagine this should be sec_breakpoint_ instead of 3500.

danpovey · 2018-12-31T00:02:54Z

src/feat/mel-computations.h

+  // and for other purposes.
+  BaseFloat breakpoint_;  // The breakpoint in the mel scale: 700 normally;
+                          // 500 if opts.modified is true.
+  BaseFloat sec_breakpoint_; // The second breakpoint used in the modified


please call this either second_breakpoint_ or breakpoint2_.

danpovey · 2018-12-31T00:03:20Z

src/feat/mel-computations.cc

+    BaseFloat diameter_floor = (next_center - center_freq) * 1.1,
+        diameter = 30.0 + 60.0 * (center_freq / (center_freq + breakpoint_));
+
+    diameter = pow(diameter * diameter + diameter_floor * diameter_floor, 0.5);


I think sqrt would be easier than pow(.., 0.5).

kkm000 · 2019-03-25T00:59:38Z

Tangential thought unrelated to the contents of this MR. It pleases me that someone at last had a look at the feature engineering part of our overall business. MFCC were invented to drop as much "irrelevant" information as possible, when ASR was tiny and puny. With the DNN renaissance, our general approach has changed: just give the network all information you have, and let it figure out what is really correlated. I am not at all sure that the currently "standard" features discard mostly useless information.

The field mostly got rid of HMMs (hooray!) which make no sense in modeling speech signals: they decay exponentially, which speech obviously do not, ye-e-e-e-eah. My general feeling is our features are another dinosaur that has outlived its time.

stale · 2020-06-19T08:36:01Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale · 2020-07-19T05:23:44Z

This issue has been automatically closed by a bot strictly because of inactivity. This does not mean that we think that this issue is not important! If you believe it has been closed hastily, add a comment to the issue and mention @kkm000, and I'll gladly reopen it.

stale · 2020-09-17T08:30:10Z

This issue has been automatically marked as stale by a bot solely because it has not had recent activity. Please add any comment (simply 'ping' is enough) to prevent the issue from being closed for 60 more days if you believe it should be kept open.

pegahgh added 4 commits December 12, 2018 01:07

added modified MFCC features based on DNN-c and fDNN-c features; it i…

232df9f

…s activated using --modified option.

pushed to trigger the build (travis issue)

4eb4862

Merge branch 'master' of https://github.com/kaldi-asr/kaldi into modi…

799969e

…fied-mel-kaldi

modified test set w.r.t new VtlnWarpMelFreq function.

126c89a

danpovey reviewed Dec 30, 2018

View reviewed changes

danpovey reviewed Dec 31, 2018

View reviewed changes

fixed typos.

e272089

stale bot added the stale Stale bot on the loose label Jun 19, 2020

stale bot closed this Jul 19, 2020

kkm000 reopened this Jul 19, 2020

stale bot removed the stale Stale bot on the loose label Jul 19, 2020

stale bot added the stale Stale bot on the loose label Sep 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added modified MFCC features based on DNN-c and fDNN-c features; it i… #2908

added modified MFCC features based on DNN-c and fDNN-c features; it i… #2908

pegahgh commented Dec 12, 2018

RuABraun commented Dec 30, 2018 •

edited

Loading

danpovey left a comment

danpovey Dec 30, 2018

danpovey Dec 30, 2018

danpovey Dec 30, 2018

danpovey Dec 31, 2018

danpovey Dec 31, 2018

danpovey Dec 31, 2018

kkm000 commented Mar 25, 2019

stale bot commented Jun 19, 2020

stale bot commented Jul 19, 2020

stale bot commented Sep 17, 2020

added modified MFCC features based on DNN-c and fDNN-c features; it i… #2908

Are you sure you want to change the base?

added modified MFCC features based on DNN-c and fDNN-c features; it i… #2908

Conversation

pegahgh commented Dec 12, 2018

RuABraun commented Dec 30, 2018 • edited Loading

danpovey left a comment

Choose a reason for hiding this comment

danpovey Dec 30, 2018

Choose a reason for hiding this comment

danpovey Dec 30, 2018

Choose a reason for hiding this comment

danpovey Dec 30, 2018

Choose a reason for hiding this comment

danpovey Dec 31, 2018

Choose a reason for hiding this comment

danpovey Dec 31, 2018

Choose a reason for hiding this comment

danpovey Dec 31, 2018

Choose a reason for hiding this comment

kkm000 commented Mar 25, 2019

stale bot commented Jun 19, 2020

stale bot commented Jul 19, 2020

stale bot commented Sep 17, 2020

RuABraun commented Dec 30, 2018 •

edited

Loading