feat: modernize PPT styles and add deep-research skill

Update presentation generation with contemporary design styles
(glassmorphism, dark-premium, neo-brutalist, etc.) and add a new
deep-research skill to guide thorough web research before content
generation tasks.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
hetaoBackend
2026-01-29 01:54:57 +08:00
parent d4bfed271b
commit 248ffe61bc
2 changed files with 361 additions and 80 deletions

View File

@@ -0,0 +1,169 @@
---
name: deep-research
description: Use this skill BEFORE any content generation task (PPT, design, articles, images, videos, reports). Provides a systematic methodology for conducting thorough, multi-angle web research to gather comprehensive information.
---
# Deep Research Skill
## Overview
This skill provides a systematic methodology for conducting thorough web research. **Load this skill BEFORE starting any content generation task** to ensure you gather sufficient information from multiple angles, depths, and sources.
## When to Use This Skill
**Always load this skill first when the task involves creating:**
- Presentations (PPT/slides)
- Frontend designs or UI mockups
- Articles, reports, or documentation
- Videos or multimedia content
- Any content that requires real-world information, examples, or current data
## Core Principle
**Never generate content based solely on general knowledge.** The quality of your output directly depends on the quality and quantity of research conducted beforehand. A single search query is NEVER enough.
## Research Methodology
### Phase 1: Broad Exploration
Start with broad searches to understand the landscape:
1. **Initial Survey**: Search for the main topic to understand the overall context
2. **Identify Dimensions**: From initial results, identify key subtopics, themes, angles, or aspects that need deeper exploration
3. **Map the Territory**: Note different perspectives, stakeholders, or viewpoints that exist
Example:
```
Topic: "AI in healthcare"
Initial searches:
- "AI healthcare applications 2024"
- "artificial intelligence medical diagnosis"
- "healthcare AI market trends"
Identified dimensions:
- Diagnostic AI (radiology, pathology)
- Treatment recommendation systems
- Administrative automation
- Patient monitoring
- Regulatory landscape
- Ethical considerations
```
### Phase 2: Deep Dive
For each important dimension identified, conduct targeted research:
1. **Specific Queries**: Search with precise keywords for each subtopic
2. **Multiple Phrasings**: Try different keyword combinations and phrasings
3. **Fetch Full Content**: Use `web_fetch` to read important sources in full, not just snippets
4. **Follow References**: When sources mention other important resources, search for those too
Example:
```
Dimension: "Diagnostic AI in radiology"
Targeted searches:
- "AI radiology FDA approved systems"
- "chest X-ray AI detection accuracy"
- "radiology AI clinical trials results"
Then fetch and read:
- Key research papers or summaries
- Industry reports
- Real-world case studies
```
### Phase 3: Diversity & Validation
Ensure comprehensive coverage by seeking diverse information types:
| Information Type | Purpose | Example Searches |
|-----------------|---------|------------------|
| **Facts & Data** | Concrete evidence | "statistics", "data", "numbers", "market size" |
| **Examples & Cases** | Real-world applications | "case study", "example", "implementation" |
| **Expert Opinions** | Authority perspectives | "expert analysis", "interview", "commentary" |
| **Trends & Predictions** | Future direction | "trends 2024", "forecast", "future of" |
| **Comparisons** | Context and alternatives | "vs", "comparison", "alternatives" |
| **Challenges & Criticisms** | Balanced view | "challenges", "limitations", "criticism" |
### Phase 4: Synthesis Check
Before proceeding to content generation, verify:
- [ ] Have I searched from at least 3-5 different angles?
- [ ] Have I fetched and read the most important sources in full?
- [ ] Do I have concrete data, examples, and expert perspectives?
- [ ] Have I explored both positive aspects and challenges/limitations?
- [ ] Is my information current and from authoritative sources?
**If any answer is NO, continue researching before generating content.**
## Search Strategy Tips
### Effective Query Patterns
```
# Be specific with context
❌ "AI trends"
✅ "enterprise AI adoption trends 2024"
# Include authoritative source hints
"[topic] research paper"
"[topic] McKinsey report"
"[topic] industry analysis"
# Search for specific content types
"[topic] case study"
"[topic] statistics"
"[topic] expert interview"
# Use temporal qualifiers
"[topic] 2024"
"[topic] latest"
"[topic] recent developments"
```
### When to Use web_fetch
Use `web_fetch` to read full content when:
- A search result looks highly relevant and authoritative
- You need detailed information beyond the snippet
- The source contains data, case studies, or expert analysis
- You want to understand the full context of a finding
### Iterative Refinement
Research is iterative. After initial searches:
1. Review what you've learned
2. Identify gaps in your understanding
3. Formulate new, more targeted queries
4. Repeat until you have comprehensive coverage
## Quality Bar
Your research is sufficient when you can confidently answer:
- What are the key facts and data points?
- What are 2-3 concrete real-world examples?
- What do experts say about this topic?
- What are the current trends and future directions?
- What are the challenges or limitations?
- What makes this topic relevant or important now?
## Common Mistakes to Avoid
- ❌ Stopping after 1-2 searches
- ❌ Relying on search snippets without reading full sources
- ❌ Searching only one aspect of a multi-faceted topic
- ❌ Ignoring contradicting viewpoints or challenges
- ❌ Using outdated information when current data exists
- ❌ Starting content generation before research is complete
## Output
After completing research, you should have:
1. A comprehensive understanding of the topic from multiple angles
2. Specific facts, data points, and statistics
3. Real-world examples and case studies
4. Expert perspectives and authoritative sources
5. Current trends and relevant context
**Only then proceed to content generation**, using the gathered information to create high-quality, well-informed content.

View File

@@ -23,11 +23,14 @@ Choose one of the following styles when creating the presentation plan:
| Style | Description | Best For |
|-------|-------------|----------|
| **business** | Professional corporate look with clean lines, navy/blue tones, structured layouts, subtle gradients | Corporate reports, business proposals, quarterly reviews |
| **academic** | Scholarly and formal, serif fonts aesthetic, muted colors, data-focused layouts, whitespace emphasis | Research presentations, lectures, thesis defense |
| **minimal** | Ultra-clean with maximum whitespace, single accent color, simple geometric shapes, focus on content | Product launches, tech demos, modern startups |
| **glassmorphism** | Frosted glass panels with blur effects, floating translucent cards, vibrant gradient backgrounds, depth through layering | Tech products, AI/SaaS demos, futuristic pitches |
| **dark-premium** | Rich black backgrounds (#0a0a0a), luminous accent colors, subtle glow effects, luxury brand aesthetic | Premium products, executive presentations, high-end brands |
| **gradient-modern** | Bold mesh gradients, fluid color transitions, contemporary typography, vibrant yet sophisticated | Startups, creative agencies, brand launches |
| **neo-brutalist** | Raw bold typography, high contrast, intentional "ugly" aesthetic, anti-design as design, Memphis-inspired | Edgy brands, Gen-Z targeting, disruptive startups |
| **3d-isometric** | Clean isometric illustrations, floating 3D elements, soft shadows, tech-forward aesthetic | Tech explainers, product features, SaaS presentations |
| **editorial** | Magazine-quality layouts, sophisticated typography hierarchy, dramatic photography, Vogue/Bloomberg aesthetic | Annual reports, luxury brands, thought leadership |
| **minimal-swiss** | Grid-based precision, Helvetica-inspired typography, bold use of negative space, timeless modernism | Architecture, design firms, premium consulting |
| **keynote** | Apple-inspired aesthetic with bold typography, dramatic imagery, high contrast, cinematic feel | Keynotes, product reveals, inspirational talks |
| **creative** | Bold colors, artistic layouts, unique compositions, expressive visuals, unconventional designs | Creative pitches, design portfolios, artistic presentations |
## Workflow
@@ -159,59 +162,61 @@ Parameters:
[!NOTE]
Do NOT read the python file, just call it with the parameters.
## Complete Example: Apple Keynote Style
## Complete Example: Glassmorphism Style (最现代前卫)
User request: "Create a keynote-style presentation about the future of AI in healthcare"
User request: "Create a presentation about AI product launch"
### Step 1: Create presentation plan
Create `/mnt/user-data/workspace/ai-healthcare-plan.json`:
Create `/mnt/user-data/workspace/ai-product-plan.json`:
```json
{
"title": "The Future of AI in Healthcare",
"style": "keynote",
"title": "Introducing Nova AI",
"style": "glassmorphism",
"style_guidelines": {
"color_palette": "Deep black or dark gray backgrounds, crisp white text, electric blue accent color for highlights",
"typography": "San Francisco or Helvetica Neue inspired, bold headlines 72pt+, light body text, extreme size contrast",
"imagery": "Cinematic photography, dramatic lighting, shallow depth of field, human-centered tech imagery",
"layout": "Single focal point per slide, asymmetric balance, 60%+ negative space, no bullet points visible"
"color_palette": "Vibrant purple-to-cyan gradient background (#667eea→#00d4ff), frosted glass panels with 15-20% white opacity, electric accents",
"typography": "SF Pro Display style, bold 700 weight white titles with subtle text-shadow, clean 400 weight body text, excellent contrast on glass",
"imagery": "Abstract 3D glass spheres, floating translucent geometric shapes, soft luminous orbs, depth through layered transparency",
"layout": "Centered frosted glass cards with 32px rounded corners, 48-64px padding, floating above gradient, layered depth with soft shadows",
"effects": "Backdrop blur 20-40px on glass panels, subtle white border glow, soft colored shadows matching gradient, light refraction effects",
"visual_language": "Apple Vision Pro / visionOS aesthetic, premium depth through transparency, futuristic yet approachable, 2024 design trends"
},
"aspect_ratio": "16:9",
"slides": [
{
"slide_number": 1,
"type": "title",
"title": "The Future of AI in Healthcare",
"subtitle": "Transforming Patient Care",
"visual_description": "Dark cinematic background with subtle blue light rays. Large bold white title centered. Subtle medical imagery (heartbeat line, DNA helix) as abstract light trails. Apple keynote aesthetic with dramatic lighting."
"title": "Introducing Nova AI",
"subtitle": "Intelligence, Reimagined",
"visual_description": "Stunning gradient background flowing from deep purple (#667eea) through magenta to cyan (#00d4ff). Center: large frosted glass panel with strong backdrop blur, containing bold white title 'Introducing Nova AI' and lighter subtitle. Floating 3D glass spheres and abstract shapes around the card creating depth. Soft glow emanating from behind the glass panel. Premium visionOS aesthetic. The glass card has subtle white border (1px rgba 255,255,255,0.3) and soft purple-tinted shadow."
},
{
"slide_number": 2,
"type": "content",
"title": "Diagnosis Revolution",
"key_points": ["AI detects diseases earlier", "98% accuracy in imaging", "Saves countless lives"],
"visual_description": "Split composition: left side shows a doctor viewing a holographic medical scan, right side has the title in large white text. Dark background with blue accent lighting on the hologram. Cinematic, dramatic."
"title": "Why Nova?",
"key_points": ["10x faster processing", "Human-like understanding", "Enterprise-grade security"],
"visual_description": "Same purple-cyan gradient background. Left side: floating frosted glass card with title 'Why Nova?' in bold white, three key points below with subtle glass pill badges. Right side: abstract 3D visualization of neural network as interconnected glass nodes with soft glow. Floating translucent geometric shapes (icosahedrons, tori) adding depth. Consistent glassmorphism aesthetic with previous slide."
},
{
"slide_number": 3,
"type": "content",
"title": "Personalized Medicine",
"key_points": ["Treatment tailored to your DNA", "Predictive health insights", "AI-powered drug discovery"],
"visual_description": "Abstract DNA double helix rendered in glowing blue light against deep black. Title overlaid in bold white. Futuristic yet human. Shallow depth of field effect."
"title": "How It Works",
"key_points": ["Natural language input", "Multi-modal processing", "Instant insights"],
"visual_description": "Gradient background consistent with previous slides. Central composition: three stacked frosted glass cards at slight angles showing the workflow steps, connected by soft glowing lines. Each card has an abstract icon. Floating glass orbs and light particles around the composition. Title 'How It Works' in bold white at top. Depth created through card layering and transparency."
},
{
"slide_number": 4,
"type": "content",
"title": "Always There For You",
"key_points": ["24/7 AI health monitoring", "Early warning systems", "Peace of mind"],
"visual_description": "Warm scene of a smartwatch on a wrist displaying health metrics, soft bokeh background of a family moment. Emotional, human-centered. Title in white, positioned to not overlap the main image."
"title": "Built for Scale",
"key_points": ["1M+ concurrent users", "99.99% uptime", "Global infrastructure"],
"visual_description": "Same gradient background. Asymmetric layout: right side features large frosted glass panel with metrics displayed in bold typography. Left side: abstract 3D globe made of glass panels and connection lines, representing global scale. Floating data visualization elements as small glass cards with numbers. Soft ambient glow throughout. Premium tech aesthetic."
},
{
"slide_number": 5,
"type": "conclusion",
"title": "The Future is Now",
"subtitle": "",
"visual_description": "Powerful closing image: silhouette of a healthcare worker against a sunrise/dawn sky with subtle tech elements. Hopeful, inspiring. Large bold title. Minimal, impactful."
"title": "The Future Starts Now",
"subtitle": "Join the waitlist",
"visual_description": "Dramatic finale slide. Gradient background with slightly increased vibrancy. Central frosted glass card with bold title 'The Future Starts Now' and call-to-action subtitle. Behind the card: burst of soft light rays and floating glass particles creating celebration effect. Multiple layered glass shapes creating depth. The most visually impactful slide while maintaining style consistency."
}
]
}
@@ -223,108 +228,183 @@ Read `/mnt/skills/public/image-generation/SKILL.md` to understand how to generat
### Step 3: Generate slide images sequentially with reference chaining
**Slide 1 - Title (establishes the style):**
**Slide 1 - Title (establishes the visual language):**
Create `/mnt/user-data/workspace/ai-healthcare-slide-01.json`:
Create `/mnt/user-data/workspace/nova-slide-01.json`:
```json
{
"prompt": "Professional presentation title slide in Apple Keynote style. Deep black background with subtle blue light rays emanating from center. Large bold white sans-serif title 'The Future of AI in Healthcare' centered, subtitle 'Transforming Patient Care' below in lighter weight. Abstract medical elements (heartbeat line, DNA helix) as subtle glowing blue light trails. Cinematic dramatic lighting. Premium tech aesthetic. 16:9 aspect ratio presentation slide.",
"style": "Apple Keynote presentation, premium tech aesthetic, cinematic dramatic lighting, WWDC style",
"composition": "Centered title, 60% negative space, subtle background imagery, clear text hierarchy",
"lighting": "Dramatic rim lighting, subtle blue accent glow, dark moody atmosphere",
"color_palette": "Deep black background (#0a0a0a), pure white text (#ffffff), electric blue accent (#0071e3)"
"prompt": "Ultra-premium presentation title slide with glassmorphism design. Background: smooth flowing gradient from deep purple (#667eea) through magenta (#f093fb) to cyan (#00d4ff), soft and vibrant. Center: large frosted glass panel with strong backdrop blur effect, rounded corners 32px, containing bold white sans-serif title 'Introducing Nova AI' (72pt, SF Pro Display style, font-weight 700) with subtle text shadow, subtitle 'Intelligence, Reimagined' below in lighter weight. The glass panel has subtle white border (1px rgba 255,255,255,0.25) and soft purple-tinted drop shadow. Floating around the card: 3D glass spheres with refraction, translucent geometric shapes (icosahedrons, abstract blobs), creating depth and dimension. Soft luminous glow emanating from behind the glass panel. Small floating particles of light. Apple Vision Pro / visionOS UI aesthetic. Professional presentation slide, 16:9 aspect ratio. Hyper-modern, premium tech product launch feel.",
"style": "Glassmorphism, visionOS aesthetic, Apple Vision Pro UI style, premium tech, 2024 design trends",
"composition": "Centered glass card as focal point, floating 3D elements creating depth at edges, 40% negative space, clear visual hierarchy",
"lighting": "Soft ambient glow from gradient, light refraction through glass elements, subtle rim lighting on 3D shapes",
"color_palette": "Purple gradient #667eea, magenta #f093fb, cyan #00d4ff, frosted white rgba(255,255,255,0.15), pure white text #ffffff",
"effects": "Backdrop blur on glass panels, soft drop shadows with color tint, light refraction, subtle noise texture on glass, floating particles"
}
```
```bash
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/ai-healthcare-slide-01.json \
--output-file /mnt/user-data/outputs/ai-healthcare-slide-01.jpg \
--prompt-file /mnt/user-data/workspace/nova-slide-01.json \
--output-file /mnt/user-data/outputs/nova-slide-01.jpg \
--aspect-ratio 16:9
```
**Slide 2 - Content (references slide 1):**
**Slide 2 - Content (MUST reference slide 1 for consistency):**
Create `/mnt/user-data/workspace/ai-healthcare-slide-02.json`:
Create `/mnt/user-data/workspace/nova-slide-02.json`:
```json
{
"prompt": "Presentation slide continuing the exact visual style from the reference image. Same dark background, same typography style, same blue accent color. Title 'Diagnosis Revolution' in bold white on the right. Left side shows a doctor viewing a holographic medical brain scan with blue glow. Maintain the cinematic Apple Keynote aesthetic from reference. Dark premium tech look.",
"style": "Match reference image style exactly - Apple Keynote, dark cinematic, premium",
"composition": "Asymmetric split layout, imagery left, text right, consistent with reference aesthetic",
"color_palette": "Exactly match the reference image colors - deep black, white text, blue accents",
"consistency_note": "Must appear as part of the same presentation as the reference image"
"prompt": "Presentation slide continuing EXACT visual style from reference image. SAME purple-to-cyan gradient background, SAME glassmorphism aesthetic, SAME typography style. Left side: frosted glass card with backdrop blur containing title 'Why Nova?' in bold white (matching reference font style), three feature points as subtle glass pill badges below. Right side: abstract 3D neural network visualization made of interconnected glass nodes with soft cyan glow, floating in space. Floating translucent geometric shapes (matching style from reference) adding depth. The frosted glass has identical treatment: white border, purple-tinted shadow, same blur intensity. CRITICAL: This slide must look like it belongs in the exact same presentation as the reference image - same colors, same glass treatment, same overall aesthetic.",
"style": "MATCH REFERENCE EXACTLY - Glassmorphism, visionOS aesthetic, same visual language",
"composition": "Asymmetric split: glass card left (40%), 3D visualization right (40%), breathing room between elements",
"color_palette": "EXACTLY match reference: purple #667eea, cyan #00d4ff gradient, same frosted white treatment, same text white",
"consistency_note": "CRITICAL: Must be visually identical in style to reference image. Same gradient colors, same glass blur intensity, same shadow treatment, same typography weight and style. Viewer should immediately recognize this as the same presentation."
}
```
```bash
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/ai-healthcare-slide-02.json \
--reference-images /mnt/user-data/outputs/ai-healthcare-slide-01.jpg \
--output-file /mnt/user-data/outputs/ai-healthcare-slide-02.jpg \
--prompt-file /mnt/user-data/workspace/nova-slide-02.json \
--reference-images /mnt/user-data/outputs/nova-slide-01.jpg \
--output-file /mnt/user-data/outputs/nova-slide-02.jpg \
--aspect-ratio 16:9
```
**Continue for slides 3-5, each referencing the previous slide...**
**Slides 3-5: Continue the same pattern, each referencing the previous slide**
Key consistency rules for subsequent slides:
- Always include "continuing EXACT visual style from reference image" in prompt
- Specify "SAME gradient background", "SAME glass treatment", "SAME typography"
- Include `consistency_note` emphasizing style matching
- Reference the immediately previous slide image
### Step 4: Compose final PPT
```bash
python /mnt/skills/public/ppt-generation/scripts/generate.py \
--plan-file /mnt/user-data/workspace/ai-healthcare-plan.json \
--slide-images /mnt/user-data/outputs/ai-healthcare-slide-01.jpg /mnt/user-data/outputs/ai-healthcare-slide-02.jpg /mnt/user-data/outputs/ai-healthcare-slide-03.jpg /mnt/user-data/outputs/ai-healthcare-slide-04.jpg /mnt/user-data/outputs/ai-healthcare-slide-05.jpg \
--output-file /mnt/user-data/outputs/ai-healthcare-presentation.pptx
--plan-file /mnt/user-data/workspace/nova-plan.json \
--slide-images /mnt/user-data/outputs/nova-slide-01.jpg /mnt/user-data/outputs/nova-slide-02.jpg /mnt/user-data/outputs/nova-slide-03.jpg /mnt/user-data/outputs/nova-slide-04.jpg /mnt/user-data/outputs/nova-slide-05.jpg \
--output-file /mnt/user-data/outputs/nova-presentation.pptx
```
## Style-Specific Guidelines
### Business Style
### Glassmorphism Style (推荐 - 最现代前卫)
```json
{
"style": "business",
"style": "glassmorphism",
"style_guidelines": {
"color_palette": "Navy blue (#1a365d), white, light gray backgrounds, subtle gold accents",
"typography": "Clean sans-serif (Arial, Calibri style), professional hierarchy, 44pt titles",
"imagery": "Professional photography, office environments, handshakes, charts, clean iconography",
"layout": "Grid-based, structured, clear sections, subtle divider lines, company branding space"
"color_palette": "Vibrant gradient backgrounds (purple #667eea to pink #f093fb, or cyan #4facfe to blue #00f2fe), frosted white panels with 20% opacity, accent colors that pop against the gradient",
"typography": "SF Pro Display or Inter font style, bold 600-700 weight titles, clean 400 weight body, white text with subtle drop shadow for readability on glass",
"imagery": "Abstract 3D shapes floating in space, soft blurred orbs, geometric primitives with glass material, depth through overlapping translucent layers",
"layout": "Floating card panels with backdrop-blur effect, generous padding (48-64px), rounded corners (24-32px radius), layered depth with subtle shadows",
"effects": "Frosted glass blur (backdrop-filter: blur 20px), subtle white border (1px rgba 255,255,255,0.2), soft glow behind panels, floating elements with drop shadows",
"visual_language": "Premium tech aesthetic like Apple Vision Pro UI, depth through transparency, light refracting through glass surfaces"
}
}
```
### Academic Style
### Dark Premium Style
```json
{
"style": "academic",
"style": "dark-premium",
"style_guidelines": {
"color_palette": "White/cream backgrounds, dark navy text, burgundy or forest green accents",
"typography": "Serif fonts (Times, Georgia style) for scholarly feel, clear hierarchy for citations",
"imagery": "Diagrams, charts, scholarly imagery, books, research settings, data visualizations",
"layout": "Traditional layouts, room for references, structured content areas, institution logo space"
"color_palette": "Deep black base (#0a0a0a to #121212), luminous accent color (electric blue #00d4ff, neon purple #bf5af2, or gold #ffd700), subtle gray gradients for depth (#1a1a1a to #0a0a0a)",
"typography": "Elegant sans-serif (Neue Haas Grotesk or Suisse Int'l style), dramatic size contrast (72pt+ headlines, 18pt body), letter-spacing -0.02em for headlines, pure white (#ffffff) text",
"imagery": "Dramatic studio lighting, rim lights and edge glow, cinematic product shots, abstract light trails, premium material textures (brushed metal, matte surfaces)",
"layout": "Generous negative space (60%+), asymmetric balance, content anchored to grid but with breathing room, single focal point per slide",
"effects": "Subtle ambient glow behind key elements, light bloom effects, grain texture overlay (2-3% opacity), vignette on edges",
"visual_language": "Luxury tech brand aesthetic (Bang & Olufsen, Porsche Design), sophistication through restraint, every element intentional"
}
}
```
### Minimal Style
### Gradient Modern Style
```json
{
"style": "minimal",
"style": "gradient-modern",
"style_guidelines": {
"color_palette": "Pure white background, black text, single accent color (e.g., coral, teal)",
"typography": "Thin modern sans-serif, generous letter-spacing, one font weight for body",
"imagery": "Simple line illustrations, geometric shapes, isolated objects, vast whitespace",
"layout": "Maximum whitespace (70%+), single element focus, extreme simplicity"
"color_palette": "Bold mesh gradients (Stripe/Linear style: purple-pink-orange #7c3aed→#ec4899→#f97316, or cool tones: cyan-blue-purple #06b6d4→#3b82f6→#8b5cf6), white or dark text depending on background intensity",
"typography": "Modern geometric sans-serif (Satoshi, General Sans, or Clash Display style), variable font weights, oversized bold headlines (80pt+), comfortable body text (20pt)",
"imagery": "Abstract fluid shapes, morphing gradients, 3D rendered abstract objects, soft organic forms, floating geometric primitives",
"layout": "Dynamic asymmetric compositions, overlapping elements with blend modes, text integrated with gradient flows, full-bleed backgrounds",
"effects": "Smooth gradient transitions, subtle noise texture (3-5% for depth), soft shadows with color tint matching gradient, motion blur suggesting movement",
"visual_language": "Contemporary SaaS aesthetic (Stripe, Linear, Vercel), energetic yet professional, forward-thinking tech vibes"
}
}
```
### Creative Style
### Neo-Brutalist Style
```json
{
"style": "creative",
"style": "neo-brutalist",
"style_guidelines": {
"color_palette": "Bold vibrant colors, gradients, unexpected color combinations, high saturation",
"typography": "Mix of display and body fonts, creative text arrangements, variable sizes",
"imagery": "Artistic photography, illustrations, collage elements, textures, hand-drawn elements",
"layout": "Breaking the grid, overlapping elements, dynamic compositions, visual storytelling"
"color_palette": "High contrast primaries: stark black, pure white, with bold accent (hot pink #ff0080, electric yellow #ffff00, or raw red #ff0000), optional: Memphis-inspired pastels as secondary",
"typography": "Ultra-bold condensed type (Impact, Druk, or Bebas Neue style), UPPERCASE headlines, extreme size contrast, intentionally tight or overlapping letter-spacing",
"imagery": "Raw unfiltered photography, intentional visual noise, halftone patterns, cut-out collage aesthetic, hand-drawn elements, stickers and stamps",
"layout": "Broken grid, overlapping elements, thick black borders (4-8px), visible structure, anti-whitespace (dense but organized chaos)",
"effects": "Hard shadows (no blur, offset 8-12px), pixelation accents, scan lines, CRT screen effects, intentional 'mistakes'",
"visual_language": "Anti-corporate rebellion, DIY zine aesthetic meets digital, raw authenticity, memorable through boldness"
}
}
```
### 3D Isometric Style
```json
{
"style": "3d-isometric",
"style_guidelines": {
"color_palette": "Soft contemporary palette: muted purples (#8b5cf6), teals (#14b8a6), warm corals (#fb7185), with cream or light gray backgrounds (#fafafa), consistent saturation across elements",
"typography": "Friendly geometric sans-serif (Circular, Gilroy, or Quicksand style), medium weight headlines, excellent readability, comfortable 24pt body text",
"imagery": "Clean isometric 3D illustrations, consistent 30° isometric angle, soft clay-render aesthetic, floating platforms and devices, cute simplified objects",
"layout": "Central isometric scene as hero, text balanced around 3D elements, clear visual hierarchy, comfortable margins (64px+)",
"effects": "Soft drop shadows (20px blur, 30% opacity), ambient occlusion on 3D objects, subtle gradients on surfaces, consistent light source (top-left)",
"visual_language": "Friendly tech illustration (Slack, Notion, Asana style), approachable complexity, clarity through simplification"
}
}
```
### Editorial Style
```json
{
"style": "editorial",
"style_guidelines": {
"color_palette": "Sophisticated neutrals: off-white (#f5f5f0), charcoal (#2d2d2d), with single accent color (burgundy #7c2d12, forest #14532d, or navy #1e3a5f), occasional full-color photography",
"typography": "Refined serif for headlines (Playfair Display, Freight, or Editorial New style), clean sans-serif for body (Söhne, Graphik), dramatic size hierarchy (96pt headlines, 16pt body), generous line-height 1.6",
"imagery": "Magazine-quality photography, dramatic crops, full-bleed images, portraits with intentional negative space, editorial lighting (Vogue, Bloomberg Businessweek style)",
"layout": "Sophisticated grid system (12-column), intentional asymmetry, pull quotes as design elements, text wrapping around images, elegant margins",
"effects": "Minimal effects - let photography and typography shine, subtle image treatments (slight desaturation, film grain), elegant borders and rules",
"visual_language": "High-end magazine aesthetic, intellectual sophistication, content elevated through design restraint"
}
}
```
### Minimal Swiss Style
```json
{
"style": "minimal-swiss",
"style_guidelines": {
"color_palette": "Pure white (#ffffff) or off-white (#fafaf9) backgrounds, true black (#000000) text, single bold accent (Swiss red #ff0000, Klein blue #002fa7, or signal yellow #ffcc00)",
"typography": "Helvetica Neue or Aktiv Grotesk, strict type scale (12/16/24/48/96), medium weight for body, bold for emphasis only, flush-left ragged-right alignment",
"imagery": "Objective photography, geometric shapes, clean iconography, mathematical precision, intentional empty space as compositional element",
"layout": "Strict grid adherence (baseline grid visible in spirit), modular compositions, generous whitespace (40%+ of slide), content aligned to invisible grid lines",
"effects": "None - purity of form, no shadows, no gradients, no decorative elements, occasional single hairline rules",
"visual_language": "International Typographic Style, form follows function, timeless modernism, Dieter Rams-inspired restraint"
}
}
```
### Keynote Style (Apple风格)
```json
{
"style": "keynote",
"style_guidelines": {
"color_palette": "Deep blacks (#000000 to #1d1d1f), pure white text, signature blue (#0071e3) or gradient accents (purple-pink for creative, blue-teal for tech)",
"typography": "San Francisco Pro Display, extreme weight contrast (bold 80pt+ titles, light 24pt body), negative letter-spacing on headlines (-0.03em), optical alignment",
"imagery": "Cinematic photography, shallow depth of field, dramatic lighting (rim lights, spot lighting), product hero shots with reflections, full-bleed imagery",
"layout": "Maximum negative space, single powerful image or statement per slide, content centered or dramatically offset, no clutter",
"effects": "Subtle gradient overlays, light bloom and glow on key elements, reflection on surfaces, smooth gradient backgrounds",
"visual_language": "Apple WWDC keynote aesthetic, confidence through simplicity, every pixel considered, theatrical presentation"
}
}
```
@@ -341,10 +421,42 @@ After generation:
## Notes
### Critical Quality Guidelines
**Prompt Engineering for Professional Results:**
- Always use English for image prompts regardless of user's language
- **Generate slides sequentially** - each slide must reference the previous one for consistency
- Be EXTREMELY specific about visual details - vague prompts produce generic results
- Include exact hex color codes (e.g., #667eea not "purple")
- Specify typography details: font weight (400/700), size hierarchy, letter-spacing
- Describe effects precisely: "backdrop blur 20px", "drop shadow 8px blur 30% opacity"
- Reference real design systems: "visionOS aesthetic", "Stripe website style", "Bloomberg Businessweek layout"
**Visual Consistency (Most Important):**
- **Generate slides sequentially** - each slide MUST reference the previous one
- The first slide is critical - it establishes the visual language for the entire presentation
- Include style_guidelines in the prompt for every slide to maintain consistency
- The image-generation skill is a dependency - ensure it's available
- If a slide looks inconsistent, regenerate it with stronger reference to the previous slide
- For best results, be very specific about matching colors, typography, and mood in prompts
- In every subsequent slide prompt, explicitly state: "continuing EXACT visual style from reference image"
- Use SAME, EXACT, MATCH keywords emphatically in prompts to enforce consistency
- Include a `consistency_note` field in every JSON prompt after slide 1
- If a slide looks inconsistent, regenerate it with STRONGER reference emphasis
**Design Principles for Modern Aesthetics:**
- Embrace negative space - 40-60% empty space creates premium feel
- Limit elements per slide - one focal point, one message
- Use depth through layering (shadows, transparency, z-depth)
- Typography hierarchy: massive headlines (72pt+), comfortable body (18-24pt)
- Color restraint: one primary palette, 1-2 accent colors maximum
**Common Mistakes to Avoid:**
- ❌ Generic prompts like "professional slide" - be specific
- ❌ Too many elements/text per slide - cluttered = unprofessional
- ❌ Inconsistent colors between slides - always reference previous slide
- ❌ Skipping the reference image parameter - this breaks visual consistency
- ❌ Using different design styles within one presentation
**Recommended Styles for Different Contexts:**
- Tech product launch → `glassmorphism` or `gradient-modern`
- Luxury/premium brand → `dark-premium` or `editorial`
- Startup pitch → `gradient-modern` or `minimal-swiss`
- Executive presentation → `dark-premium` or `keynote`
- Creative agency → `neo-brutalist` or `gradient-modern`
- Data/analytics → `minimal-swiss` or `3d-isometric`