Skip to content

๐ŸŽ™๏ธ Transform documents and personas into engaging podcasts with AI-driven transcripts and high-quality audio!

License

Notifications You must be signed in to change notification settings

anujdutt9/Podcastify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

27 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Logo

๐ŸŽ™๏ธ Podcastify - A Document and Story Podcast Creator

Transform documents into engaging podcasts or create random story podcasts using personas with just a few clicks!


๐ŸŒŸ Features

  • Upload Documents: Generate podcasts based on your document's content.
  • Storytelling Personas: Choose from pre-defined personas for unique narration styles.
  • Audio Generation: High-quality podcast audio created using ElevenLabs.
  • Dynamic Transcript: Easily readable podcast-style transcripts.

๐ŸŽฅ UI in Action

Upload a Document

UI Upload

Generated Podcast Transcript (with Document)

UI Transcript

Generated Podcast Transcript (without Document)

UI Transcript

Podcast Audio Output

UI Audio


๐ŸŽง Sample Audio Outputs

Sample 1: Generated Podcast for a Technical Document

Detective_podcast_audio.mp4

Sample 2: Story Podcast - Magical Storyteller Persona

MagicalStoryteller_podcast_audio.mp4

Sample 3: Comedy Podcast - Stand-Up Comedian Persona

Stand-UpComedian_podcast_audio.mp4

Sample 4: Inspirational Story - Fitness Coach Persona

FitnessCoach_podcast_audio.mp4

๐Ÿš€ Getting Started

1๏ธโƒฃ Installation

  1. Clone this repository:

    git clone [email protected]:anujdutt9/Podcastify.git
    cd Podcastify
  2. Create a virtual environment and activate it:

    python3 -m venv venv
    source venv/bin/activate
  3. Install the dependencies:

    pip install -r requirements.txt

2๏ธโƒฃ Setup Environment Variables

Create a .env file in the root directory with the following content:

 # OpenAI API Key
OPENAI_API_KEY=<your_openai_api_key>

# ElevenLabs API Key
ELEVENLABS_API_KEY=<your_elevenlabs_api_key>

# OpenAI Model Settings
OPENAI_MODEL=gpt-4o-mini
OPENAI_MAX_TOKENS=1500
OPENAI_TEMPERATURE=0.7

3๏ธโƒฃ Run the Application

  1. Launch the application:
 python app.py
  1. Open your browser and navigate to the provided local URL (e.g., http://127.0.0.1:7860/).
  2. Upload a document or leave it blank, choose a persona, and click Generate Podcast.

๐Ÿค Contribution

Please fork this repository, make your changes, and submit a pull request. For major changes, open an issue first to discuss what youโ€™d like to change.


๐Ÿ› ๏ธ Troubleshooting

โ€ข Missing API Keys: Ensure OPENAI_API_KEY and ELEVENLABS_API_KEY are correctly set in the .env file.

โ€ข Audio Generation Issues: Verify that ElevenLabs API access is active.

โ€ข UI Not Loading: Ensure all dependencies are installed and the correct Python environment is activated.


๐Ÿ“œ License

This project is licensed under the MIT License. See the LICENSE file for more information.

About

๐ŸŽ™๏ธ Transform documents and personas into engaging podcasts with AI-driven transcripts and high-quality audio!

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages