Image & Video Editor

A comprehensive AI-powered creative platform that combines advanced photo editing with professional video generation. Create, edit, and enhance images using Gemini 2.5 Flash's multi-image capabilities, then generate stunning videos with Veo 3 - all in one seamless interface.

Note

This application features three distinct creative modes with a modern Material 3 Expressive design system, providing professional-grade tools for content creators, designers, and storytellers.

🎨 Three Creative Modes

1. Photo Editor (Primary Mode)

Multi-Image Chat Interface: Upload up to 50 reference images for context
Iterative Editing: Conversational workflow for step-by-step image refinement
Character Consistency: Maintain subjects and styles across generations
Advanced Prompting: Leverage Gemini 2.5 Flash's full 3,600 image API capability
Download Management: Save all generated variations with timestamps

2. Single Video Generation

Text-to-Video: Create videos from detailed text descriptions
Image-to-Video: Transform static images into dynamic video content
Custom Aspect Ratios: Support for 16:9, 9:16, 1:1, 4:5, and more
Timeline Editor: Trim videos with precision using browser-based tools

3. Storyboard Mode

Multi-Scene Projects: Create complex video narratives with multiple scenes
Drag & Drop Organization: Reorder scenes with intuitive interface
Batch Generation: Process multiple scenes with progress tracking
Scene Management: Individual prompts and settings per scene

🚀 Quick Start Guide

Prerequisites

Node.js (version 18 or higher) and npm
Gemini API Key (Paid tier required) from AI Studio

Warning

Paid Tier Required: Veo 3 video generation and Gemini 2.5 Flash image editing require the Gemini API Paid tier.

Installation

Clone the repository

git clone https://github.com/rdfitted/storycomposer.git
cd storycomposer

Install dependencies
```
npm install
```
Set up environment variables Create a .env file in the project root:
```
GEMINI_API_KEY="your-gemini-api-key-here"
```
Start the development server
```
npm run dev
```
Open your browser Navigate to http://localhost:3000

🎯 Getting Started with Each Mode

Photo Editor Mode

Upload Reference Images: Click "Images" button to add up to 50 context images
Start Chatting: Describe what you want to create or edit
Iterate & Refine: Continue the conversation to refine your images
Download Results: Click download buttons on generated images

Single Video Mode

Add Image (Optional): Click "Image" to upload a reference image
Write Prompt: Describe your video in the text area
Configure Settings: Choose aspect ratio and model
Generate: Click the arrow button to start video generation
Edit & Download: Use timeline controls to trim and download

Storyboard Mode

Create Scenes: Click "Add Scene" to start building your storyboard
Upload Images: Add reference images for each scene
Write Prompts: Describe each scene's action
Generate All: Click "Generate All Scenes" for batch processing
Organize: Drag and drop to reorder your final storyboard

🛠️ Technical Overview

Architecture

Built with Next.js 15 and React 19, featuring:

Server-Side API Routes for secure AI model integration
Real-time Polling for operation status updates
Client-Side State Management for complex UI interactions
Material 3 Design System with custom Tailwind CSS implementation

API Routes

app/api/
├── photo-editor/generate/     # Multi-image chat generation
├── veo/generate/             # Video generation initiation  
├── veo/operation/            # Operation status polling
├── veo/download/             # Secure video download
└── imagen/generate/          # Single image generation

Key Components

components/ui/
├── PhotoEditor.tsx           # Main photo editing interface
├── PhotoEditorComposer.tsx   # Multi-image upload & chat input
├── ChatMessage.tsx           # Conversation message display
├── VideoPlayer.tsx           # Custom video player with timeline
├── ModeSelector.tsx          # Tab navigation component
└── StoryboardComposer.tsx    # Multi-scene project manager

Data Flow

User Input → FormData with images/prompts
API Processing → Gemini/Veo model generation
Status Polling → Real-time progress updates
Content Delivery → Base64/blob URL responses
User Download → Direct file save capabilities

🔧 Technologies & Dependencies

Core Stack

Next.js 15 - Full-stack React framework
React 19 - Modern UI library with latest features
TypeScript - Type-safe development
Tailwind CSS - Utility-first styling

AI Integration

Google Gemini API - Advanced AI model access
Veo 3 - State-of-the-art video generation
Gemini 2.5 Flash - Multi-modal image processing

UI Components & Interactions

Lucide React - Beautiful icon library
React Player - Video playbook components
RC Slider - Timeline controls
React Dropzone - File upload handling

⚡ Performance Features

Lazy Loading - Components and images load on demand
Memory Management - Automatic cleanup of blob URLs and file references
Progressive Enhancement - Works across all modern browsers
Responsive Design - Mobile-first approach with desktop optimization
Real-time Updates - Live status polling with optimized intervals

🚀 Development Commands

# Development
npm run dev      # Start development server (http://localhost:3000)
npm run build    # Create production build
npm run start    # Start production server
npm run lint     # Run ESLint for code quality

# Environment Setup
echo 'GEMINI_API_KEY="your-api-key-here"' > .env

🛡️ Security & Best Practices

Input Validation - Comprehensive server-side validation
File Type Restrictions - Only allows safe image formats
Size Limits - 10MB per file, 50 files maximum
Environment Variables - Secure API key management
Error Handling - Graceful error responses without data exposure

📝 Attribution

This project builds upon the Google Gemini Veo 3 API Quickstart and includes substantial enhancements for photo editing capabilities.

For more information, visit fitted-automation.com.

📄 License

This project is licensed under the Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.allstar		.allstar
.gemini/commands/plan		.gemini/commands/plan
aidocs		aidocs
app		app
components/ui		components/ui
lib		lib
plans		plans
public		public
services		services
stores		stores
.gitignore		.gitignore
BUILD_YOUR_OWN_AI_CREATIVE_PLATFORM.md		BUILD_YOUR_OWN_AI_CREATIVE_PLATFORM.md
HOW_TO_BUILD_GUIDE.txt		HOW_TO_BUILD_GUIDE.txt
LICENSE		LICENSE
README.md		README.md
components.json		components.json
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image & Video Editor

🎨 Three Creative Modes

1. Photo Editor (Primary Mode)

2. Single Video Generation

3. Storyboard Mode

🚀 Quick Start Guide

Prerequisites

Installation

🎯 Getting Started with Each Mode

Photo Editor Mode

Single Video Mode

Storyboard Mode

🛠️ Technical Overview

Architecture

API Routes

Key Components

Data Flow

🔧 Technologies & Dependencies

Core Stack

AI Integration

UI Components & Interactions

⚡ Performance Features

🚀 Development Commands

🛡️ Security & Best Practices

📝 Attribution

📄 License

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

License

rdfitted/storycomposer

Folders and files

Latest commit

History

Repository files navigation

Image & Video Editor

🎨 Three Creative Modes

1. Photo Editor (Primary Mode)

2. Single Video Generation

3. Storyboard Mode

🚀 Quick Start Guide

Prerequisites

Installation

🎯 Getting Started with Each Mode

Photo Editor Mode

Single Video Mode

Storyboard Mode

🛠️ Technical Overview

Architecture

API Routes

Key Components

Data Flow

🔧 Technologies & Dependencies

Core Stack

AI Integration

UI Components & Interactions

⚡ Performance Features

🚀 Development Commands

🛡️ Security & Best Practices

📝 Attribution

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages