共计 15138 个字符,预计需要花费 38 分钟才能阅读完成。

据VidQ数据,这个频道广告收入在3000-9000美元。这个频道做的是简单且转化率高的视频,教观众打造提升专注力的高效时段,内容能帮学生和年轻人集中精力、更好学习和完成作业。视频全是AI制作,每个制作时间不到15分钟。
下面我们拆解它的做法,从频道搭建、选题到快速制作,甚至缩略图设计。文章最后有提供文档,包括了所有的提示词、工具和流程,你可以照着做。
首先得有个频道名,最快的方法是用ChatGPT。去提示词文件里复制频道名提示词,粘贴到ChatGPT,一秒就能得到一堆名字,选个喜欢的,简单好记的就行。

有了名字,接着做频道标志和横幅。回到提示词文件,复制标志图片提示词,粘贴到ChatGPT,加上频道名回车。

再复制横幅提示词,同样操作:

输出结果范例:

准备好两个提示词后,去Google Whisk,粘贴logo提示词,改下长宽比,点击生成,下载喜欢的logo。

横幅banner也这么操作,粘贴横幅提示词,设置长宽比16:9,点击生成并下载。这样,频道名、标志和横幅就有了。
接下来制作视频。很多新手创作者会随机选题,这就错了。复制视频创意提示词,粘贴到ChatGPT,找个热门话题。
去看看竞争对手,把那些有几十万播放量的标题复制5到10个,再回到ChatGPT,把标题和视频创意提示词一起粘贴进去,它会根据热门内容给你新角度,选个话题就行。

将复制的标题等替换下面的[PASTED_TOPICS]部分:

得到回复:

有了话题下一步写脚本。回到提示词文件,复制脚本提示词,粘贴到ChatGPT,输入视频标题,生成完整脚本,也可以改写几句,让它有你自己的风格。

输入到ChatGPT对话得到脚本:

下一步我们需要每个场景的图像提示词。复制文档中的图片生成提示词,粘贴到ChatGPT

再把整个脚本粘贴到输入框,ChatGPT会输出每个场景的图像提示词,复制保存。

接着生成图像。去Whisk,粘贴第一个图像提示词,点击生成,会得到两张图,选一张作为基础风格。

点击这张图,粘贴第二个图像提示词,Whisk会自动匹配风格,保证视觉效果从头到尾一致。下载所有图像,解压压缩包,视觉素材就准备好了。

然后是配音。一般用 ElevenLabs就可以了,这里选了Adam这个声音,因为和竞争对手的语气相符,你也可以选其他匹配风格的声音。把脚本粘贴到ElevenLabs,整理好格式,生成语音,听一遍,修改不合适的地方,下载最终音频。

最后一步剪辑整合素材。打开剪映(capcut),新建项目,导入所有图像和配音文件。先把配音拖到时间轴,剪掉空白部分和长停顿,让节奏紧凑、更有吸引力。接着添加图像,让每个场景和配音对应,保证画面和内容相符。

视频做好了,来做缩略图。回到提示词文件,复制缩略图概念提示词,粘贴到ChatGPT,把占位符换成视频标题,它会生成和视频匹配的缩略图提示词。
复制生成的提示词,去Whisk生成缩略图。一个符合热门风格的缩略图就做好了。
这种视频形式并不复杂,都是静态的图片,可能第一个视频做出来需要花费多点时间,熟悉之后单个视频制作需要的时间很快,使用智能体流程化批量完成,可以节省更多的时间。
以上提示词都是经过不断优化的,输出的结果具有很好的可控性和一致性。除了案例中的主题,也可以稍作修改,用于类似的主题风格的视频,赶快去测试一下效果吧!
YouTube Channel Name (Optimized)
Name
You are an expert in branding YouTube channels for the productivity, focus, and self-improvement niche, aimed at students and young adults who want to upgrade their habits, study smarter, and perform better in life.
Task:
Generate 10 original, memorable, and algorithm-friendly YouTube channel name ideas that:
Sound modern, minimal, and motivational (avoid clichés like “Study Hub” or “Focus Zone”).
Reflect a mix of discipline + inspiration — think Atomic Habits meets Ali Abdaal.
Are short (1–3 words) and easy to remember or brand.
Can naturally pair with future subtopics (study, mindset, morning routines, etc).
Have emotional or aspirational weight — evoke clarity, mastery, or purpose.
Extra context for better naming:
Analyze Simply, Easy, Time For Growth and generate names that feel in the same world but stronger, cleaner, and more timeless.
Provide a short 1-sentence reasoning under each suggestion explaining the vibe and angle behind the name.
Channel Logo Picture Prompt
You are an AI image prompt creator, not a designer.
Your goal is to generate a detailed, high-quality image prompt for a text-to-image generator, describing how to create a modern, minimalist YouTube profile picture for a productivity and studying channel.
Input:
Channel Name: [CHANNEL_NAME]
Competitor Reference (for visual tone & structure):
{
"scene_description": "The image is a circular logo featuring a design that combines a clock and a brain, divided vertically down the center. The left side depicts a stylized analog clock, while the right side illustrates the outline of a human brain.",
"characters": [],
"environment": {
"setting": "Abstract digital design",
"background_elements": ["black circular background with subtle diagonal line texture"],
"lighting": {
"type": "flat digital shading",
"sources": ["uniform digital lighting"]
},
"atmosphere": "modern, conceptual, minimalist"
},
"objects": [
{"name": "Clock","type": "symbolic graphic element","position": "left half of the circle"},
{"name": "Brain","type": "symbolic graphic element","position": "right half of the circle"}
],
"mood": "intellectual, analytical, time-focused",
"dominant_colors": ["#FFFFFF", "#000000"],
"camera": {
"perspective": "flat vector frontal view",
"composition": "symmetrical vertical split"
}
}
Instructions:
Do not generate the image — write a vivid and complete image prompt suitable for an AI art model.
The prompt should clearly describe:
The layout and composition (e.g., circular logo, centered minimal design).
Symbolism representing productivity, focus, or growth — like a brain, book, spark, or clock.
Style and color tone — flat vector or soft gradient, using 1–2 colors that convey clarity and energy (navy, mint, off-white, or deep orange).
Lighting and background — simple, clean, no busy detail; suitable for mobile icons.
Maintain a professional yet personal brand energy, similar to real modern YouTuber logos (not corporate or over-designed).
Reference the JSON block above for structure, mood, and style alignment.
Output the final text-to-image prompt in one clear paragraph ready for generation.
Channel BannerPrompt
You are an AI image prompt creator, not an artist.
Your task is to generate a descriptive image prompt for a text-to-image generator that will create a YouTube banner for a productivity, focus, and self-improvement channel.
Input:
Channel Name: [CHANNEL_NAME]
Competitor Reference (for structure and aesthetic):
{
"scene_description": "A minimalist rectangular banner with a white background displaying motivational text in bold black capital letters that reads 'TIME FOR GROWTH' and a smaller subtitle beneath stating 'DISCIPLINE + MINDSET = SUCCESS.'.",
"environment": {
"setting": "digital banner design",
"background_elements": ["plain white background"],
"lighting": {"type": "flat digital color fill"},
"atmosphere": "motivational, professional, clean"
},
"logos_or_text": [
{"content": "TIME FOR GROWTH","font_family": "Sans-serif (bold, block-style)"},
{"content": "DISCIPLINE + MINDSET = SUCCESS.","font_family": "Sans-serif (thin, modern)"}
],
"mood": "motivational, empowering, growth-oriented",
"dominant_colors": ["#FFFFFF", "#000000"],
"camera": {"perspective": "flat frontal view","composition": "centered alignment"}
}
Instructions:
Do not design or generate the banner yourself — instead, write a polished and vivid image prompt that describes exactly what a text-to-image model should create.
Include the following in your prompt:
Composition and layout: A 2560×1440 minimalist banner (safe zone: 1546×423 center).
Style: Bright, clean, professional, with possible gradient or soft light texture — not cluttered or overly complex.
Visual themes: abstract time or growth elements (sunrise, progress bars, clocks, open book, path lines).
Text ideas:
“Focus Harder. Study Smarter.”
“Upgrade Your Mind. One Video at a Time.”
Include in your prompt how the text should be placed and styled (modern sans-serif, uppercase, bold).
Color tone: match the channel’s profile picture design for brand cohesion (e.g., navy, white, or mint palette).
Mood: motivational, growth-focused, calming yet empowering.
Reference the JSON above for style guidance and text layout.
Output the final image prompt as a detailed paragraph, ready for AI generation.
Video Idea Prompt
You are a YouTube strategist specializing in productivity and self-improvement content for students and young adults.
Task:
Analyze the list of topics I pasted below → [PASTED_TOPICS].
Generate 10 fresh video ideas that:
Fit naturally into this niche and target similar audiences.
Follow proven YouTube algorithm frameworks: “relatable problem → solution,” “shock → transformation,” “myth → truth,” etc.
Include a title idea and a short one-line video concept explaining the emotional or practical angle.
Balance between educational value and entertainment / relatability.
Prioritize titles that feel like something you’d click at 2 AM while procrastinating.
Video Script Prompt
You are a YouTube scriptwriter skilled in creating relatable, high-retention productivity videos for students and young adults (ages 16–25).
Task:
Write a YouTube video script that is around anywhere from 6 mins - 9 mins in playback speed about [TOPIC].
Tone:
Friendly, confident, and real — like a young creator giving advice from experience. Mix motivation with vulnerability.
Avoid robotic or overly formal phrasing. Write like you’re talking to a friend who needs to hear this.
Requirements:
Start with a hook that instantly connects (story, question, or shocking fact).
Use short, natural sentences for retention.
Add relatable examples (school stress, procrastination, burnout, etc).
Break down tips in clear, actionable steps.
Use mini cliffhangers or emotional pauses to keep attention every 30–40 seconds.
End with a tight summary + motivational call to action (“Try this tonight”, “Let’s level up together”, etc).
Video Image Prompts
You are an AI visual prompt engineer creating stick-figure storyboard-style image prompts for a YouTube video about productivity and focus.
Input: Video Script → [PASTED_SCRIPT]
Style Reference JSON:
{
"scene_description": "An illustrated scene of a person lying on a bed in a softly lit bedroom, reaching to turn off an alarm clock on a bedside table. The style is minimalist and uses limited color — primarily grayscale with selective green highlights.",
"characters": [
{
"name": "Unknown",
"age": "young adult",
"gender": "unknown",
"ethnicity": "unspecified",
"skin_tone": "white (simplified cartoon style)",
"hair": { "style": "short curly", "color": "black" },
"clothing": {
"head": "none",
"torso": "green long-sleeve jacket over a white shirt",
"legs": "black pants",
"feet": "white shoes",
"materials": ["cloth (illustrated)"]
},
"pose": "lying diagonally on the bed with one arm extended toward the nightstand to turn off the alarm clock",
"facial_expression": "sleepy, eyes closed",
"accessories": [],
"held_objects": [],
"position_in_scene": "midground center-right, on the bed",
"emotions": ["tired", "groggy"]
}
],
"environment": {
"setting": "bedroom interior",
"background_elements": ["window with light-colored curtains", "potted plant on window sill", "soft sunlight casting shadows on wall"],
"architectural_features": ["wooden floor", "bed frame", "nightstand", "wall socket"],
"weather": "clear sunny morning (implied by sunlight)",
"lighting": {
"type": "natural and artificial mix",
"sources": ["sunlight through window", "table lamp on nightstand"],
"shadows": "soft elongated morning shadows across the wall and floor",
"reflections": "minimal, matte surfaces"
},
"atmosphere": "calm, early morning, quiet household setting"
},
"objects": [
{
"name": "Alarm clock",
"type": "analog clock",
"position": "on the nightstand next to the bed",
"appearance": "white circular clock with black hands and numbers",
"materials": ["plastic", "metal"],
"interaction": "being turned off by the person"
},
{
"name": "Lamp",
"type": "table lamp",
"position": "on top of nightstand",
"appearance": "gray lampshade with yellow glow cast on wall",
"materials": ["metal", "fabric"],
"interaction": "emitting light"
},
{
"name": "Books",
"type": "stack of books",
"position": "one on the floor beside the nightstand, two on the lower shelf of the nightstand",
"appearance": "simple covers, one in green tone",
"materials": ["paper", "cardboard"],
"interaction": "stationary"
},
{
"name": "Bed",
"type": "furniture",
"position": "center-left of image",
"appearance": "white bedsheet, gray blanket, white pillow",
"materials": ["fabric", "wood"],
"interaction": "supports the person lying on it"
}
],
"logos_or_text": [],
"mood": "peaceful, sleepy, relatable, early-morning tone",
"dominant_colors": ["#FFFFFF", "#000000", "#D3D3D3", "#5A9E6B"],
"camera": {
"perspective": "isometric-like side view",
"angle": "slightly elevated diagonal angle from front-left",
"position": "mid-distance capturing the full bed and side furniture",
"focal_length": "moderate (keeps all elements clear)",
"depth_of_field": "flat (illustrated 2D style)",
"composition": "balanced layout with subject centered horizontally and background providing depth through light and shadow"
}
}
Instructions:
Generate one stick-figure image prompt for every ~8 seconds of narration.
Each prompt should describe a simple, readable composition that visually matches the emotion or message of that moment (e.g., “a tired student surrounded by distractions,” “a stick figure organizing their desk as sunlight hits the window”).
Keep consistent character design (same stick figure throughout).
Apply the JSON code’s simplified, minimalist drawing style (clean lines, limited color palette).
Each prompt should be one sentence, clearly describing what should appear in the frame — no unnecessary details.
Maintain a cohesive emotional arc from start → problem → improvement → result.
Thumbnail (Optimized)
You are an AI image prompt creator, not a designer.
Your task is to generate a single, detailed text-to-image prompt that describes how to create a high-converting YouTube thumbnail for the given video.
The thumbnail must be click-optimized for CVR (Click-Through Rate) and consistent with the channel’s minimalist productivity / self-improvement aesthetic.
Input:
Video Title: [VIDEO_TITLE]
Competitor Thumbnail JSON Breakdown:
{
"scene_description": "An illustrated scene of a person lying on a bed in a softly lit bedroom, reaching to turn off an alarm clock on a bedside table. The style is minimalist and uses limited color — primarily grayscale with selective green highlights.",
"characters": [
{
"name": "Unknown",
"age": "young adult",
"gender": "unknown",
"ethnicity": "unspecified",
"skin_tone": "white (simplified cartoon style)",
"hair": { "style": "short curly", "color": "black" },
"clothing": {
"head": "none",
"torso": "green long-sleeve jacket over a white shirt",
"legs": "black pants",
"feet": "white shoes",
"materials": ["cloth (illustrated)"]
},
"pose": "lying diagonally on the bed with one arm extended toward the nightstand to turn off the alarm clock",
"facial_expression": "sleepy, eyes closed",
"accessories": [],
"held_objects": [],
"position_in_scene": "midground center-right, on the bed",
"emotions": ["tired", "groggy"]
}
],
"environment": {
"setting": "bedroom interior",
"background_elements": ["window with light-colored curtains", "potted plant on window sill", "soft sunlight casting shadows on wall"],
"architectural_features": ["wooden floor", "bed frame", "nightstand", "wall socket"],
"weather": "clear sunny morning (implied by sunlight)",
"lighting": {
"type": "natural and artificial mix",
"sources": ["sunlight through window", "table lamp on nightstand"],
"shadows": "soft elongated morning shadows across the wall and floor",
"reflections": "minimal, matte surfaces"
},
"atmosphere": "calm, early morning, quiet household setting"
},
"objects": [
{
"name": "Alarm clock",
"type": "analog clock",
"position": "on the nightstand next to the bed",
"appearance": "white circular clock with black hands and numbers",
"materials": ["plastic", "metal"],
"interaction": "being turned off by the person"
},
{
"name": "Lamp",
"type": "table lamp",
"position": "on top of nightstand",
"appearance": "gray lampshade with yellow glow cast on wall",
"materials": ["metal", "fabric"],
"interaction": "emitting light"
},
{
"name": "Books",
"type": "stack of books",
"position": "one on the floor beside the nightstand, two on the lower shelf of the nightstand",
"appearance": "simple covers, one in green tone",
"materials": ["paper", "cardboard"],
"interaction": "stationary"
},
{
"name": "Bed",
"type": "furniture",
"position": "center-left of image",
"appearance": "white bedsheet, gray blanket, white pillow",
"materials": ["fabric", "wood"],
"interaction": "supports the person lying on it"
}
],
"logos_or_text": [],
"mood": "peaceful, sleepy, relatable, early-morning tone",
"dominant_colors": ["#FFFFFF", "#000000", "#D3D3D3", "#5A9E6B"],
"camera": {
"perspective": "isometric-like side view",
"angle": "slightly elevated diagonal angle from front-left",
"position": "mid-distance capturing the full bed and side furniture",
"focal_length": "moderate (keeps all elements clear)",
"depth_of_field": "flat (illustrated 2D style)",
"composition": "balanced layout with subject centered horizontally and background providing depth through light and shadow"
}
}
Instructions:
Do not generate or render the image.
Instead, write a cinematic, clear, text-to-image prompt that describes the perfect thumbnail scene based on:
The title’s message (emotion, action, transformation, or mystery).
The video concept (core story or advice).
The competitor’s JSON reference (style, mood, composition).
Your output must describe:
Primary subject: who or what is shown (e.g., a student, thinker, entrepreneur).
Emotion / Expression: visible feeling that mirrors the video’s problem or solution (frustration, focus, relief, motivation).
Action or Contrast: before/after visual or symbolic action (mess vs order, dark vs light, chaos vs calm).
Environment: minimal but expressive — desk, books, screen glow, sunlight, etc., based on the video’s tone.
Lighting: clear, directional lighting to guide the eye and improve CTR (bright subject, darker background, cinematic shadows).
Text placement and style: specify where and how bold text appears (upper-left, white sans-serif, short phrase max 4 words).
Color psychology: use strong contrast — blues for calm/focus, oranges for energy/action.
Framing: clean composition that is easily readable in small mobile thumbnails — one main subject, few details, instant clarity.
Mood: consistent with the brand — modern, focused, motivational, never spammy or over-stylized.
Output the final image prompt as one polished paragraph — ready for AI generation.
Ensure it captures emotion + clarity + instant comprehension that makes the viewer feel compelled to click