* [Sound Classification](#Application Specific Audio Models)
* [Pitch Estimation](#PitchEstimation)
* [Speech to Text](https://github.com/matlab-deep-learning/MATLAB-Deep-Learning-Model-Hub#speech-to-text-)
@@ -41,27 +41,29 @@ These networks have been trained on more than a million images and can classify
**Models available in MATLAB:**
+
**Note 1: Since R2024a, please use the [imagePretrainedNetwork](https://www.mathworks.com/help/deeplearning/ref/imagepretrainednetwork.html) function instead and specify the pretrained model**.
|[BERT](https://github.com/matlab-deep-learning/transformer-models#bert-and-finbert)| Feature Extraction (Sentence and Word embedding), Text Classification, Token Classification, Masked Language Modeling, Question Answering |390 |[GitHub](https://github.com/matlab-deep-learning/transformer-models#bert-and-finbert)|
+
|[BERT](https://www.mathworks.com/help/textanalytics/ref/bert.html)| Feature Extraction (Sentence and Word embedding), Text Classification, Token Classification, Masked Language Modeling, Question Answering |390 |[GitHub](https://github.com/matlab-deep-learning/transformer-models#bert-and-finbert) <br /> [Doc](https://www.mathworks.com/help/textanalytics/ref/bert.html)|
+
|[all-MiniLM-L6-v2](https://www.mathworks.com/matlabcentral/fileexchange/156399-text-analytics-toolbox-model-for-all-minilm-l6-v2-network)| Document Embedding, Clustering, Information Retrieval |80 |[Doc](https://www.mathworks.com/matlabcentral/fileexchange/156399-text-analytics-toolbox-model-for-all-minilm-l6-v2-network)|
+
|[all-MiniLM-L12-v2](https://www.mathworks.com/matlabcentral/fileexchange/156394-text-analytics-toolbox-model-for-all-minilm-l12-v2-network)| Document Embedding, Clustering, Information Retrieval |120 |[Doc](https://www.mathworks.com/matlabcentral/fileexchange/156394-text-analytics-toolbox-model-for-all-minilm-l12-v2-network)|
**Application Specific Transformers**
@@ -248,30 +252,31 @@ Inputs are sequences of text, outputs are text feature embeddings.
## Audio Embeddings <a name="AudioEmbeddings"/>
-
Audio Embedding pretrained models have already learned to extract powerful and informative features from audio signals. Use them as a starting point to learn a new task using transfer learning.
+
Audio embedding pretrained models have already learned to extract powerful and informative features from audio signals. Use them as a starting point to learn a new task using transfer learning.
-
Inputs are Audio signals, outputs are audio feature embeddings.
+
Inputs are audio signals, outputs are audio feature embeddings.

+
**Note 2: Since R2024a, please use the [audioPretrainedNetwork](https://www.mathworks.com/help/audio/ref/audiopretrainednetwork.html) function instead and specify the pretrained model**.
Speech-to-text pretrained models take an audio input and transcribe it into a text output. They are useful for digitizing audio files for downstream text processing tasks such as text summarization and sentiment analysis.
+
Speech-to-text models provide a fast, efficient way to convert spoken language into written text. This improves accessibility for individuals with disabilities, enables downstream tasks such as text summarization and sentiment analysis, and streamlines documentation processes. As a key element of human-machine interfaces, including personal assistants, these models let machines understand and execute spoken commands, which makes interactions more natural and intuitive and broadens inclusivity across applications.