added speech recognition tutorial

x4nth055 · x4nth055 · commit b6526cd5682f · 2019-10-15T12:34:10.000+02:00
diff --git a/README.md b/README.md
@@ -29,6 +29,7 @@ This is a repository of all the tutorials of [The Python Code](https://www.thepy
     - [How to Make an Image Classifier in Python using Keras](https://www.thepythoncode.com/article/image-classification-keras-python). ([code](machine-learning/image-classifier))
     - [How to Use Transfer Learning for Image Classification using Keras in Python](https://www.thepythoncode.com/article/use-transfer-learning-for-image-flower-classification-keras-python). ([code](machine-learning/image-classifier-using-transfer-learning))
     - [How to Perform Edge Detection in Python using OpenCV](https://www.thepythoncode.com/article/canny-edge-detection-opencv-python). ([code](machine-learning/edge-detection))
+    - [How to Convert Speech to Text in Python](https://www.thepythoncode.com/article/using-speech-recognition-to-convert-speech-to-text-python). ([code](machine-learning/speech-recognition))
     - [Top 8 Python Libraries For Data Scientists and Machine Learning Engineers](https://www.thepythoncode.com/article/top-python-libraries-for-data-scientists).
     
 
diff --git a/machine-learning/speech-recognition/16-122828-0002.wav b/machine-learning/speech-recognition/16-122828-0002.wav
diff --git a/machine-learning/speech-recognition/README.md b/machine-learning/speech-recognition/README.md
@@ -0,0 +1,16 @@
+# [How to Convert Speech to Text in Python](https://www.thepythoncode.com/article/using-speech-recognition-to-convert-speech-to-text-python)
+To run this:
+- `pip3 install -r requirements.txt`
+- To recognize the text of an audio file named `16-122828-0002.wav`:
+    ```
+    python recognizer;py 16-122828-0002.wav
+    ```
+    **Output**:
+    ```
+    I believe you're just talking nonsense
+    ```
+- To recognize the text from your microphone after talking 5 seconds:
+    ```
+    python live_recognizer.py 5
+    ```
+    This will record your talking in 5 seconds and then uploads the audio data to Google to get the desired output.
diff --git a/machine-learning/speech-recognition/live_recognizer.py b/machine-learning/speech-recognition/live_recognizer.py
@@ -0,0 +1,15 @@
+import speech_recognition as sr
+import sys
+
+duration = int(sys.argv[1])
+
+# initialize the recognizer
+r = sr.Recognizer()
+print("Please talk")
+with sr.Microphone() as source:
+    # read the audio data from the default microphone
+    audio_data = r.record(source, duration=duration)
+    print("Recognizing...")
+    # convert speech to text
+    text = r.recognize_google(audio_data)
+    print(text)
diff --git a/machine-learning/speech-recognition/recognizer.py b/machine-learning/speech-recognition/recognizer.py
@@ -0,0 +1,15 @@
+import speech_recognition as sr
+import sys
+
+filename = sys.argv[1]
+
+# initialize the recognizer
+r = sr.Recognizer()
+
+# open the file
+with sr.AudioFile(filename) as source:
+    # listen for the data (load audio to memory)
+    audio_data = r.record(source)
+    # recognize (convert from speech to text)
+    text = r.recognize_google(audio_data)
+    print(text)
diff --git a/machine-learning/speech-recognition/requirements.txt b/machine-learning/speech-recognition/requirements.txt
@@ -0,0 +1 @@
+speech_recognition