{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreic4dptpuka4fkopv4ar2zebolii7zepi4xtplnjq2y32et4jap3au",
"uri": "at://did:plc:25rdn5elo5izoxrmtis34zuk/app.bsky.feed.post/3mppnvmtyt3z2"
},
"coverImage": {
"$type": "blob",
"ref": {
"$link": "bafkreicbuatbt7nhz5rryv4eybn7l2ra6abxbw4jupfvopwplume4zorqm"
},
"mimeType": "image/webp",
"size": 302558
},
"path": "/kamal_llm_manuplator/nano-banana-2-lite-and-gemini-omni-flash-whats-actually-new-in-googles-gemini-api-3hng",
"publishedAt": "2026-07-03T02:58:18.000Z",
"site": "https://dev.to",
"tags": [
"aivideos",
"nanobanana",
"googleomniflash",
"googlegemini"
],
"textContent": "Google added two new models to the Gemini API today: Nano Banana 2 Lite (image generation) and Gemini Omni Flash (video generation + editing). Neither is the Gemini 3.5 Pro release people have been waiting for, so it's easy to miss. Here's what's actually in them.\n\n**TL;DR**\n\n * Nano Banana 2 Lite: `gemini-3.1-flash-lite-image` = text-to-image in ~4s, $0.034/1K images\n * Gemini Omni Flash: `gemini-omni-flash-preview` = video gen + conversational editing, $0.10/sec\n * Both are built to be chained: generate an image fast, then animate it into video\n * Neither model is positioned as a quality upgrade = both are cost/speed plays\n\n\n\n**Nano Banana 2 Lite**\n\n**Model ID:`gemini-3.1-flash-lite-image`**\n\n * Text-to-image output in about 4 seconds\n * $0.034 per 1K-resolution image\n * Positioned as the direct replacement for the original Nano Banana (`gemini-2.5-flash-image`) - if you're on that model, this is a drop-in upgrade\n * Available in Google AI Studio, Gemini API, Gemini Enterprise Agent Platform, and consumer surfaces (Search AI Mode, Gemini app, Photos, NotebookLM, Flow, Google Ads)\n\n\n\n**Gemini Omni Flash**\n\n**Model ID:`gemini-omni-flash-preview`**\n\n * Public preview in Google AI Studio and the Gemini API\n * Conversational editing - refine a generated video using plain-language instructions instead of re-prompting from zero\n * Multimodal referencing - combine text, image, and video inputs to keep a scene consistent\n * $0.10 per second of video output (same rate as Veo 3.1 Fast)\n\n\n\n**Known limitations right now**\n\n * Generations capped at 10 seconds\n * No audio reference uploads yet\n * No scene extension yet\n * Video references under 3 seconds are accepted by the API schema but not correctly processed yet\n * Character consistency across scene changes/pans still has rough edges\n\n\n\nGoogle says longer durations are coming. The part worth paying attention to: chaining them\n\n 1. Generate an image with Nano Banana 2 Lite (fast, cheap)\n 2. Pass that image as a reference into Omni Flash\n 3. Omni Flash animates it into a video\n\n\n\nBoth models are optimized for throughput and cost, not for topping a quality benchmark. If you're running high-volume image or video generation and speed/price matter more than peak output quality, these are worth testing. If you need top-tier quality, Nano Banana Pro is still the model for that. Has anyone here built the chained image-to-video workflow yet? Curious how the multi-turn editing holds up in practice.",
"title": "Nano Banana 2 Lite and Gemini Omni Flash: What's Actually New in Google's Gemini API"
}