Speech to Text with DeepSpeech
Speech to text (STT) is a useful building block so I took a look at setting up DeepSpeech 0.6.1 for a test drive. My recipe for installing DeepSpeech on a Pi 4 running Raspbian Lite follows. If the Pi 4 is running the GUI desktop some packages may already be installed.
sudo apt install git python3-pip python3-scipy python3-numpy python3-pyaudio libatlas3-base
pip3 install deepspeech==0.6.1
mkdir ~/dspeech
cd ~/dspeech
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/deepspeech-0.6.1-models.tar.gz
tar xvf deepspeech-0.6.1-models.tar.gz
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/audio-0.6.1.tar.gz
tar xvf audio-0.6.1.tar.gz
source ~/.profile
deepspeech --model deepspeech-0.6.1-models/output_graph.tflite --lm deepspeech-0.6.1-models/lm.binary --trie deepspeech-0.6.1-models/trie --audio audio/2830-3980-0043.wav
deepspeech --model deepspeech-0.6.1-models/output_graph.tflite --lm deepspeech-0.6.1-models/lm.binary --trie deepspeech-0.6.1-models/trie --audio audio/4507-16021-0012.wav
deepspeech --model deepspeech-0.6.1-models/output_graph.tflite --lm deepspeech-0.6.1-models/lm.binary --trie deepspeech-0.6.1-models/trie --audio audio/8455-210777-0068.wav
At this point DeepSpeech should have transcribed the three test audio files.
The next step is to plug in a USB microphone to test live STT. Change the default ALSA device from 0 to 1. ALSA device 0 is the Raspberry Pi internal audio hardware. ALSA device 1 is the external USB microphone.Change alsa.conf file so the microphone is the default ALSA device.
sudo nano /usr/share/alsa/alsa.conf
OLD:defaults.ctl.card 0
NEW:defaults.ctl.card 1
OLD:defaults.pcm.card 0
NEW:defaults.pcm.card 1
Install examples including the microphone example.
git clone https://github.com/mozilla/DeepSpeech-examples
pip3 install halo webrtcvad
cd deepspeech-0.6.1-models/
python3 ../DeepSpeech-examples/mic_vad_streaming/mic_vad_streaming.py -m ./output_graph.tflite -l lm.binary -t trie -v 3
The results are very good. Once DeepSpeech is installed it does not depend on cloud servers or the Internet. All the work is done on one core of the Pi 4. With some additional hardware such as a Trinket M0, STT could be added to systems without STT but allow USB keyboard input.
https://github.com/mozilla/DeepSpeech/wiki
https://discourse.mozilla.org/c/deep-speech
sudo apt install git python3-pip python3-scipy python3-numpy python3-pyaudio libatlas3-base
pip3 install deepspeech==0.6.1
mkdir ~/dspeech
cd ~/dspeech
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/deepspeech-0.6.1-models.tar.gz
tar xvf deepspeech-0.6.1-models.tar.gz
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/audio-0.6.1.tar.gz
tar xvf audio-0.6.1.tar.gz
source ~/.profile
deepspeech --model deepspeech-0.6.1-models/output_graph.tflite --lm deepspeech-0.6.1-models/lm.binary --trie deepspeech-0.6.1-models/trie --audio audio/2830-3980-0043.wav
deepspeech --model deepspeech-0.6.1-models/output_graph.tflite --lm deepspeech-0.6.1-models/lm.binary --trie deepspeech-0.6.1-models/trie --audio audio/4507-16021-0012.wav
deepspeech --model deepspeech-0.6.1-models/output_graph.tflite --lm deepspeech-0.6.1-models/lm.binary --trie deepspeech-0.6.1-models/trie --audio audio/8455-210777-0068.wav
At this point DeepSpeech should have transcribed the three test audio files.
The next step is to plug in a USB microphone to test live STT. Change the default ALSA device from 0 to 1. ALSA device 0 is the Raspberry Pi internal audio hardware. ALSA device 1 is the external USB microphone.Change alsa.conf file so the microphone is the default ALSA device.
sudo nano /usr/share/alsa/alsa.conf
OLD:defaults.ctl.card 0
NEW:defaults.ctl.card 1
OLD:defaults.pcm.card 0
NEW:defaults.pcm.card 1
Install examples including the microphone example.
git clone https://github.com/mozilla/DeepSpeech-examples
pip3 install halo webrtcvad
cd deepspeech-0.6.1-models/
python3 ../DeepSpeech-examples/mic_vad_streaming/mic_vad_streaming.py -m ./output_graph.tflite -l lm.binary -t trie -v 3
The results are very good. Once DeepSpeech is installed it does not depend on cloud servers or the Internet. All the work is done on one core of the Pi 4. With some additional hardware such as a Trinket M0, STT could be added to systems without STT but allow USB keyboard input.
References
https://github.com/mozilla/DeepSpeechhttps://github.com/mozilla/DeepSpeech/wiki
https://discourse.mozilla.org/c/deep-speech
Fullz with CC Info (USA, UK, CAN)
ReplyDeleteUSA FULLZ (SSN DOB DL) & HIGH CREDIT SCORES FULLZ
Track 101 & 202 Dumps With Pin Codes
EIN Business Fullz
PayPal/Office365 Logins
USA/Canada Photo I.D's/Passports
Bank Logins
Here you can DM me Guy's
IcQ - 752822040
TG/icq - @killhacks
Wickr/Skype - peeterhacks
Contact with full potential
Many More stuff will be provided on demand
Will Serve you with Loyalty
Here we are for you 24by7
(Te.legram) (@leadsupplier)
(I/C/Q) (752.822.040)
::GADGETS LIST::
SMTP's/Web-Mailers/PHP-mailers/Alexus-Mailer
C-panel/RDP's
S-hells/Bru-tes/Dorks
Spa-mming To-ols with Guides/Tutorials (package available also)
PC/Mob/Web/Server Ha-cking Stuff on demand
Credit/Debit Cards Tuts for Transfers, Cashouts, CC To BTC
Car-ding (Amazon, PayPal, E-bay, BTC, etc) Updated Tuts
Flasher/Cracker for Bit.Co*n
2021 OR 2022 Fr**D B**le
I.p's & Pro.xies
Bulk (Netflix/Amazon/Spotify/Coinbase) Acount Logs
W.eb L-inks D*R* DE*p Updated
Web Complete Course for D**P & D**K with Video Tuts
Scri-pting for Sca-m Pages
Master Class Complete Kal-i Li-nux
Vi-ruses for Attac-king & Sp-ying
Senders (Email, SMS, Codes)