ChatGPT

StoryTeller Is an Open-source Free Multimodal AI Story Teller, built with Stable Diffusion, ChatGPT, and neural text-to-speech (TTS).

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

Hamza Musa

30 Aug 2023 — 2 min read

Introducing Story Teller, a multimodal AI storyteller created with Stable Diffusion, GPT, and neural text-to-speech (TTS). With just a prompt as the opening line, GPT generates the plot, while Stable Diffusion creates an image for each sentence. Then, a TTS model narrates each line, resulting in a fully animated video of a short story complete with audio and visuals.

To start developing locally, install dev dependencies and pre-commit hooks. This will ensure that linting and code quality checks are completed before each commit. The final video will be saved as /out/out.mp4, with other intermediate images, audio files, and subtitles.

Given a prompt as an opening line of a story, GPT writes the rest of the plot; Stable Diffusion draws an image for each sentence; a TTS model narrates each line, resulting in a fully animated video of a short story, replete with audio and visuals.

Story Teller is available on PyPI, and the quickest way to run a demo is through the CLI. Simply type the command, and your video will be ready. Additionally, you can adjust the defaults with custom parameters by toggling the CLI flags as needed. For more advanced use cases, you can interface directly with Story Teller using Python code and configure the model with custom settings.

Features

Available on PyPI
Quick demo through CLI
Intermediate images, audio files, and subtitles generated
Customizable parameters using CLI flags
Advanced use cases supported with Python code interface
Model can be configured with custom settings

License

MIT License

Resources

StoryTeller

Download StoryTeller for free. Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc. A multimodal AI story teller, built with Stable Diffusion, GPT, and neural text-to-speech (TTS). Given a prompt as an opening line of a story, GPT writes the rest of the plot;

SourceForge

How to Use AI for Medical Legal and Compensation Claims: A Patient’s Guide

Navigating a medical-legal or compensation claim can feel like trying to translate a foreign language while buried under an avalanche of paperwork. Medical records regularly stretch across thousands of pages, loaded with clinical jargon, dense coding, and fragmented timelines. When you're dealing with an injury, a misdiagnosis, or

AI For Students: The Sandbox for the Future, How to use Google AI Studio?

If you’ve been following the AI gold rush, you know that things move fast. One day we’re talking about basic chatbots, and the next, we’re looking at platforms that can practically build entire applications from a single "vibe." As a tech writer, I’ve seen

Unlock the Full Power of Claude: 5 Hidden Settings You Need to Enable

Most users only scratch the surface of what Claude can do. While it is a formidable conversationalist by default, there are several "buried" features in the settings menu that can make the AI significantly more powerful and personalized. By adjusting these five hidden settings, you can transform Claude

BotBrowser: Free Professional Cross-Platform Browser with Unified Fingerprint Technology!

What is BotBrowser? Meet BotBrowser, a next-generation privacy browser core built to defeat browser fingerprinting. Fingerprinting is when websites track you by collecting tiny, unique details about your device, operating system, and settings. BotBrowser stops this dead in its tracks by making your digital footprint completely uniform. Why is this