Googleは12月17日(火)、動画生成AI「Veo 2を一般公開した。現実世界の物理学や人間の動き、表情のニュアンスを理解・表現でき、最大4K解像度、長さ数分まで対応する。Google Labs内「VideoFX」で利用できるが、現在は順番待ちリスト(Early Access Waitlist)への登録が必要となる。

Veo 2では動画生成AIの問題として知られるハルシネーション(多指の描画や予期しないオブジェクトの登場など)の軽減、映画撮影に関する技術用語の理解(ジャンル、レンズ、アングルなど)などにも対応。Googleによると、OpenAISora Turbo」、「Kiling AI  v1.5」、「Meta Movie Gen」とVeo 2での生成結果比較において、人間による投票評価でVeo 2がパフォーマンスおよびプロンプトへの忠実度について最高の評価を得たという。

Googleが提供する他の生成AIと同様、Veo 2で生成した動画にはSynthIDウォーターマークが含まれ、AIによる生成物であることが識別できるようになっている。

■Veo 2公式ページ(Google DeepMind内、英語)
https://deepmind.google/technologies/veo/veo-2/

■State-of-the-art video and image generation with Veo 2 and Imagen 3(Google Labs公式ブログ、英語)
https://blog.google/technology/google-labs/video-image-generation-update-december-2024/

■VideoFX(Google Labs内)
https://labs.google/fx/ja/tools/video-fx

Veo 2による生成サンプルとプロンプト

Prompt: Extreme close-up on the car's speedometer, seen through the steering wheel. The needle climbs rapidly as the driver turns the wheel sharply, the view outside the window blurring into streaks of light. The camera focuses on the speedometer of an olive green muscle car, the needle climbing rapidly. The view is framed by the steering wheel as the driver aggressively turns, the cityscape outside – yellow cabs, flashing neon signs, and crowds of pedestrians – blurring into streaks of vibrant color and light. The sound of the powerful engine and screeching tires underscores the car's speed and the sharp turn.
Prompt: This medium shot, with a shallow depth of field, portrays a cute cartoon girl with wavy brown hair, sitting upright in a 1980s kitchen. Her hair is medium length and wavy. She has a small, slightly upturned nose, and small, rounded ears. She is very animated and excited as she talks to the camera.
Prompt: A perfect cube rotates in the center of a soft, foggy void. The surface shifts between different hyper-real textures—smooth marble, velvety suede, hammered brass, and raw concrete. Each material reveals subtle details: marble veins slowly spreading, suede fibers brushing with wind, brass tarnishing in slow motion, and concrete crumbling to reveal polished stone inside. Ends with a soft glow surrounding the cube as it transitions to a smooth mirrored surface, reflecting infinity.
Prompt: The sun rises slowly behind a perfectly plated breakfast scene. Thick, golden maple syrup pours in slow motion over a stack of fluffy pancakes, each one releasing a soft, warm steam cloud. A close-up of crispy bacon sizzles, sending tiny embers of golden grease into the air. Coffee pours in smooth, swirling motion into a crystal-clear cup, filling it with deep brown layers of crema. Scene ends with a camera swoop into a fresh-cut orange, revealing its bright, juicy segments in stunning macro detail.
Prompt: The camera floats gently through rows of pastel-painted wooden beehives, buzzing honeybees gliding in and out of frame. The motion settles on the refined farmer standing at the center, his pristine white beekeeping suit gleaming in the golden afternoon light. He lifts a jar of honey, tilting it slightly to catch the light. Behind him, tall sunflowers sway rhythmically in the breeze, their petals glowing in the warm sunlight. The camera tilts upward to reveal a retro farmhouse with mint-green shutters, its walls dappled with shadows from swaying trees. Shot with a 35mm lens on Kodak Portra 400 film, the golden light creates rich textures on the farmer’s gloves, marmalade jar, and weathered wood of the beehives.
Prompt: A cinematic, high-action tracking shot follows an incredibly cute dachshund wearing swimming goggles as it leaps into a crystal-clear pool. The camera plunges underwater with the dog, capturing the joyful moment of submersion and the ensuing flurry of paddling with adorable little paws. Sunlight filters through the water, illuminating the dachshund's sleek, wet fur and highlighting the determined expression on its face. The shot is filled with the vibrant blues and greens of the pool water, creating a dynamic and visually stunning sequence that captures the pure joy and energy of the swimming dachshund.
Prompt: The camera spirals down through an infinite network of glowing threads, pulsating with multicolored light. The setting feels alive, each thread thrumming with faint whispers and bursts of imagery—fractals, mythological beasts, and celestial maps. The courier darts through the maze, their silhouette painted with the kaleidoscopic glow of the fibers. As they weave between strands, their every touch triggers animations—one a glowing phoenix, another a blooming lotus—until they stumble upon a massive, golden thread. It flares, and a holographic figure emerges: a younger version of themselves, surrounded by fiery glyphs. The scene shifts between soft, glowing pastels and brilliant, fiery tones, blending hand-drawn 2D animation with dynamic light effects, captured in fluid, sweeping motion.
Prompt: An upsampled wide-angle shot, 35mm lens. A celestial figure sits cross-legged on a floating platform high above a swirling, watercolor-painted sky. Her hands move gracefully, pulling shimmering threads of light from the air and weaving them into constellations that spark to life with bursts of radiant energy. The camera slowly tracks around her, capturing the ethereal glow of her movements, while the vast, starlit cosmos stretches endlessly in the background. The constellations morph into mythical creatures that leap and soar through the air before dissolving into star dust. The color palette transitions fluidly, from deep indigo blues to vibrant golds, evoking the awe of a living night sky. The scene ends with a slow push-in-out, revealing the figure as a tiny speck within the infinite cosmos. Soft, painterly animation with glowing, hand-drawn elements and dreamlike textures. Clean shapes and hands.
Prompt: Low-angle tracking shot, 18mm lens. The car drifts, leaving trails of light and tire smoke, creating a visually striking and abstract composition. The camera tracks low, capturing the sleek, olive green muscle car as it approaches a corner. As the car executes a dramatic drift, the shot becomes more stylized. The spinning wheels and billowing tire smoke, illuminated by the surrounding city lights and lens flare, create streaks of light and color against the dark asphalt. The cityscape – yellow cabs, neon signs, and pedestrians – becomes a blurred, abstract backdrop. Volumetric lighting adds depth and atmosphere, transforming the scene into a visually striking composition of motion, light, and urban energy.
Prompt: A cinematic shot captures a fluffy Cockapoo, perched atop a vibrant pink flamingo float, in a sun-drenched Los Angeles swimming pool. The crystal-clear water sparkles under the bright California sun, reflecting the playful scene. The Cockapoo's fur, a soft blend of white and apricot, is highlighted by the golden sunlight, its floppy ears gently swaying in the breeze. Its happy expression and wagging tail convey pure joy and summer bliss. The vibrant pink flamingo adds a whimsical touch, creating a picture-perfect image of carefree fun in the LA sunshine.
Prompt: A low-angle shot captures a flock of pink flamingos gracefully wading in a lush, tranquil lagoon. The vibrant pink of their plumage contrasts beautifully with the verdant green of the surrounding vegetation and the crystal-clear turquoise water. Sunlight glints off the water's surface, creating shimmering reflections that dance on the flamingos' feathers. The birds' elegant, curved necks are submerged as they walk through the shallow water, their movements creating gentle ripples that spread across the lagoon. The composition emphasizes the serenity and natural beauty of the scene, highlighting the delicate balance of the ecosystem and the inherent grace of these magnificent birds. The soft, diffused light of early morning bathes the entire scene in a warm, ethereal glow.
Prompt: Cinematic shot of a female doctor in a dark yellow hazmat suit, illuminated by the harsh fluorescent light of a laboratory. The camera slowly zooms in on her face, panning gently to emphasize the worry and anxiety etched across her brow. She is hunched over a lab table, peering intently into a microscope, her gloved hands carefully adjusting the focus. The muted color palette of the scene, dominated by the sickly yellow of the suit and the sterile steel of the lab, underscores the gravity of the situation and the weight of the unknown she is facing. The shallow depth of field focuses on the fear in her eyes, reflecting the immense pressure and responsibility she bears.
Prompt: A close-up medium shot captures two Peruvian indigenous women walking along a winding mountain trail, an alpaca companionably walking between them. Their vibrant traditional clothing in vibrant color and intricate patterns, pops against the backdrop of the rugged Andean landscape. The women's faces, etched with the wisdom of generations, convey a sense of quiet strength and deep connection to their heritage. The alpaca, with its soft, fluffy fleece, adds a touch of gentle charm to the scene. The trail, dusty or rocky. The natural light bathes the scene in a warm glow. The image evokes a sense of timeless tradition, resilience, and the enduring harmony between humanity and nature in the heart of the Peruvian Andes. Cinematic vibrant colors, high contrast.
Prompt: An extreme close-up shot focuses on the face of a female DJ, her beautiful, voluminous black curly hair framing her features as she becomes completely absorbed in the music. Her eyes are closed, lost in the rhythm, and a slight smile plays on her lips. The camera captures the subtle movements of her head as she nods and sways to the beat, her body instinctively responding to the music pulsating through her headphones and out into the crowd. The shallow depth of field blurs the background. She’s surrounded by vibrant neon colors. The close-up emphasizes her captivating presence and the power of music to transport and transcend.
Prompt: A close-up shot captures a small, fluffy dog dressed in a pink ballerina costume. The tutu's layers of tulle are perfectly arranged, and the satin bodice sparkles under the studio lights. The dog's head is tilted, its tongue lolling out in a happy grin. Its big, brown eyes are filled with joy and excitement, reflecting the anticipation of the performance. The background is a blur of soft colors, ensuring all focus remains on the adorable canine ballerina.

画像生成AI「Imagen 3」の改良版も「ImageFX」で一般公開

Veo 2の一般公開と同時に、画像生成AI「Imagen 3」の改良版もGoogle Labs内「ImageFX」で一般公開され、誰でも利用可能となっている。改良されたImagen 3では、より明るく、より良い構図の画像を生成できるとされ、従来よりも多様なスタイルの生成に対応し、プロンプトへの忠実度やディテール・テクスチャの豊かさが向上しているという。

■Imagen 3公式ページ(Google DeepMind内、英語)
https://deepmind.google/technologies/imagen-3/

■ImageFX(Google Labs内)
https://labs.google/fx/ja/tools/image-fx

CGWORLD関連情報

●動画生成AIは1年半でここまで来た! Alex Patrascu氏が昨年制作のAIショートフィルムをOpenAI「Sora」のRemix機能でリマスター

AIを活用したコンテンツ制作を行うクリエイティブスタジオ、MASSIVE STUDIOの創業者のひとり、Alex Patrascu氏が、2023年7月末にRunway社・Gen-2を使用して制作した作品を、先日一般公開されたOpenAI社・SoraのRemix機能を用いてリマスターしたAIショートフィルム『Nexus: Hive Mind』を投稿した。
https://cgworld.jp/flashnews/202412-Patrascu-Sora-Film.html

●OpenAIの動画生成AI「Sora」一般公開! 有料プランで最大1080p・20秒の動画を月500本まで生成可能

OpenAI社が動画生成AI「Sora(ソラ)」をリリース。動画生成にあたってはプロンプトの編集が行えるだけでなく、生成した動画に対してRe-Cut、Remix、Blend、Loopという4種の編集ツールが利用できるほか、台本の作成と編集により生成動画の調節が可能な「Storyboard」機能も用意されている。
https://cgworld.jp/flashnews/202412-OpenAI-Sora.html

●動画生成AI「Pika 2.0」リリース! アップロードした複数の画像とテキストプロンプトを材料にシーンを生成する「Scene Ingredients」を搭載

Mellis社が動画生成AI「Pika 2.0」をリリース。アップロードした複数の画像とテキストプロンプトを材料(Ingredient)にしてシーンを生成できる「Scene Ingredients」機能を搭載し、従来よりも生成動画のコントロール性が向上した。なお、12月22日まではPika 2.0の機能を無料で利用できるキャンペーンを実施している。
https://cgworld.jp/flashnews/202412-Pika20.html