|
||
---|---|---|
assets | ||
frames | ||
.env.example | ||
.gitignore | ||
README.md | ||
capture.py | ||
narrator.py | ||
requirements.txt | ||
setup.sh |
README.md
David Attenborough narrates your life
https://twitter.com/charliebholtz/status/1724815159590293764
Want to make your own AI app?
Check out Replicate. We make it easy to run machine learning models with an API.
Setup
Clone this repo, and setup and activate a virtualenv:
python3 -m pip install virtualenv
python3 -m virtualenv venv
source venv/bin/activate
Then, install the dependencies:
pip install -r requirements.txt
Make a Replicate, OpenAI, and ElevenLabs account and set your tokens:
export OPENAI_API_KEY=<token>
export ELEVENLABS_API_KEY=<eleven-token>
Make a new voice in Eleven and get the voice id of that voice using their get voices API, or by clicking the flask icon next to the voice in the VoiceLab tab.
export ELEVENLABS_VOICE_ID=<voice-id>
Run it!
In on terminal, run the webcam capture:
python capture.py
In another terminal, run the narrator:
python narrator.py
Options
Setup
Script
Alternative to running the Setup commands above individually, one can use the setup.sh
script to facilitate getting the two required shell envs ready to rock.
Note: will have to run source source venv/bin/activate
afterwards to activate the virtual env.
Dotenv
One can set the environment variables via the .env
file, which is read every time the process starts. It is recommended to copy the .env.example
file and rename to .env
.
Streaming
If you would like the speech to start quicker via a streaming manner set the environment variable to enable. The concession is that the audio and corresponding image is not saved in the /narration
directory.
export ELEVENLABS_STREAMING=true
PhotoBooth
The default behavior of this app will continually analyze images. If you would like to use in a mode more similar to a photo booth, set the environment variable. In this mode, the image will only be analyzed when the spacebar key is pressed.
export PHOTOBOOTH_MODE=true