Google Bard can generate images now: Here's how Imagen 2 compares to DALL·E 3

Google Bard and Microsoft Copilot logos over AI generated image gallery

(Image credit: Google / Microsoft)

Google's AI chatbot Bard is finally getting the image generation capabilities it needs to stay competitive with rival Large Language Models (LLMs) like Copilot/Bing and ChatGPT 4. The feature is currently live in the US, with other regions to follow.

Bard has already been on the end of a recent upgrade, now running on Google's powerful Gemini Pro LLM, but will now also include the Imagen 2 text-to-image model to generate images for users.

Google has been playing catch-up to the competition ever since ChatGPT exploded onto the scene last year, and the addition of image generation is a big stride in the right direction.

While Google Bard is unlikely to dethrone ChatGPT as the world's most popular chatbot, it does stand a compelling chance against search engine rival Microsoft, who's Copilot AI has successfully managed to sway users away from Chrome and to the Edge Browser since its initial launch as Bing Chat.

With both AI chatbots offering similar search capabilities and image generation, we thought we would compare to two to see just how far Google Bard has come along. So without further ado, let's see how Bard's Imagen 2 software matches up against Copilot's DALL·E 3-powered alternative.

Imagen 2 vs. DALL·E 3: Generating hands

There's often one key giveaway that a picture or photo has been generated by AI: hands. Much like most human artists, AI seems to crumble to pieces the moment it's tasked with drawing the human hand — instead resorting to weird hot dog-like appendages that defy all conventional wisdom of human biology.

So how do the two compare when it comes to generating images from simple or complex prompts involving the human hand? Let's find out.

AI generated image comparison showing simple hand recreation. Left image created by Imagen 2 shows a group of hands rising to high-5 one another but the image is distorted and fingers blur and bend awkwardly into one another, the right image generated by Dall.E 3 shows two identical men wearing sunglasses and almost exact clothing in the same pose performing a high-5 with nonsensical writing on their shirts and a sunset in the background. — AI generated images made by Google Bard (left) and Microsoft Copilot (right). Prompt using the prompt "Create an image of a high-5" to generate both images. (Image credit: Images generated with Google Bard and Microsoft Copilot, powered by Imagen 2 and Dall.E 3)

AI generated image comparison showing complex hand recreation. Left image created by Imagen 2 shows crisp photorealistic image of hands holding one another with jewelry, right image made by Dall.E 3 shows a man's hand holding an Asian woman's hand but the image is overly smoothed and softened with heavy bokeh. — AI generated images made by Google Bard (left) and Microsoft Copilot (right). Prompt using the prompt "Create an image of two people holding hands" to generate both images. (Image credit: Images generated with Google Bard and Microsoft Copilot, powered by Imagen 2 and Dall.E 3)

Imagen 2 vs. DALL·E 3: Generating text

Another keen giveaway that an image has been AI-generated is the fact that the primary language of almost all image generators is Simlish mixed with the melted English found in Captcha checks.

AI image generators are getting better at this, especially when prompted to use actual words or phrases, but their ability to naturally embed context-fitting language into images is still very much hit-or-miss. Let's see how Bard's image generation holds up against Copilot on the text front.

AI generated image comparison showing simple word recreation. Left image created by Imagen 2 shows a photorealistic image of the words 'Hello World!' carved into a wooden sign. The right image, generated by Dall.E 3, has a more rendered feel, more vivid and colorful, showcasing a sign that says 'Hello World!' that is planted in a pot next to sprouting plants. — AI generated images made by Google Bard (left) and Microsoft Copilot (right). Prompt using the prompt "create an image of a sign saying 'Hello World!'" to generate both images. (Image credit: Images generated with Google Bard and Microsoft Copilot, powered by Imagen 2 and Dall.E 3)

AI generated image comparison showing complex word recreation. Left image created by Imagen 2 shows a photorealistic scene of a newspaper at a breakfast table. The newspaper has the title of 'NSW News' though the remaining text and images are muddled and incomprehensible. The image on the right, generated by Dall.E 3, is more stylized, inky outlines, and has a cool blue tone. It's words are incomprehensible, but the lede image on the page seems to showcase Donald Trump, also at a breakfast table, surrounded by politicians. — AI generated images made by Google Bard (left) and Microsoft Copilot (right). Prompt using the prompt "Create an image of the front page of a newspaper on a breakfast table" to generate both images. (Image credit: Images generated with Google Bard and Microsoft Copilot, powered by Imagen 2 and Dall.E 3)

Imagen 2 vs. DALL·E 3: Generating tools

LLMs know what tools are. They can even tell you how to use them. Ask one to make an image of said tool in operation, however, and you're likely to see scenes so ridiculous you would presume it to be a still from a late-night infomercial designed to make you think the hammer is a tool too complex for the average person to grasp the operation of.

That's how it's been in the past, at least. Though, while image generators have improved their grasp on the concept of tools and handiwork in action, it's not quite perfected the art of digitally depicting DIY. Or has it? Let's find out.

AI generated image comparison showing simple tools laid out on a bench. Left image created by Imagen 2 shows a photorealistic image of tools neatly laid out on a workbench though some of these tools are not real and amalgamations of real life tools, the image on the right is generated by Dall.E 3 and is similarly photorealistic though littered with unrealistic tools also. — AI generated images made by Google Bard (left) and Microsoft Copilot (right). Prompt using the prompt "Create an image of a workbench with tools on it" to generate both images. (Image credit: Images generated with Google Bard and Microsoft Copilot, powered by Imagen 2 and Dall.E 3)

AI generated image comparison showing complex tool use recreation. Left image created by Imagen 2 shows a photorealistic image of a man using a circular saw and wearing ear protection but sawing the wrong side of the wood he is holding, the right image generated by Dall.E 3 shows a man in winter gloves with six fingers on one hand drilling into his own tools instead of the wood he is holding. — AI generated images made by Google Bard (left) and Microsoft Copilot (right). Prompt using the prompt "Create an image of a person using tools to cut wood" to generate both images. (Image credit: Images generated with Google Bard and Microsoft Copilot, powered by Imagen 2 and Dall.E 3)

Conclusion

It's plain to see from the above images that Google Bard's Imagen 2 generation abilities offer some impressively photorealistic results. While its results can still offer muddied hands and lack the logic of a genuine scenario, the images typically 'feel' more real.

In contrast, Copilot's DALL·E 3-powered image generation offers overly softened and smooth images that feel slightly dream-like or aspirational. The colors are more vivid, the lighting more dramatic, and everything has a very 'rendered' feel to it.

Google Bard also has the leg up on resolution, with its generated images sitting at a resolution of 1532 x 1532 compared to DALL·E 3's 1024 x 1024 limitation.

Both Imagen 2 and DALL·E 3 have some impressive qualities to the images they produce, with each being able to replicate various styles. Both can also be used to generate non-photo image results such as line drawings, info-graphics, comic strips, and more.

It's hard to definitively say which image generator is the best, as this will mostly come down to how you plan to use each piece of software. However, for photorealistic results, I'm very impressed with what Bard has to offer. Plus, Bard is noticeably faster at generating images compared to Copilot's DALL·E 3, being able to churn out higher-resolution images much more promptly.

For that, I'd have to conclude that Google has the edge on this one, at least for now.

Back to Apple MacBook Pro

Acer

Apple

Asus

Dell

Lenovo

AMD Ryzen 7

Intel Core i5

Intel Core i7

Intel Core i9

Intel Core M3

8GB RAM

16GB RAM

32GB RAM

64GB RAM

256GB

512GB

1TB

2TB

4TB

13.6-inch

14-inch

15.6-inch

Black

Blue

Silver

New

Refurbished

Showing 10 of 146 deals

Filters☰

Apple MacBook Pro 14-inch M3 (2023)

(1TB Silver)

Our Review

☆☆☆☆☆

$1,199

View

Asus ZenBook 14 OLED Q409Z

(Blue)

$649.99

View

Asus Zenbook S 13 OLED

(OLED)

(Silver)

Apple MacBook Pro 14-inch M3 (2023)

(1TB Black)

Our Review

☆☆☆☆☆

Apple MacBook Air M2 2022

$1,499

View

Lenovo ThinkPad X1 Carbon (Gen 11)

Our Review

☆☆☆☆☆

$2,042.14

View

Lenovo ThinkPad X1 Carbon (Gen 11)

(14-inch 1TB)

Our Review

☆☆☆☆☆

$2,546.32

View

Rael Hornby, potentially influenced by far too many LucasArts titles at an early age, once thought he’d grow up to be a mighty pirate. However, after several interventions with close friends and family members, you’re now much more likely to see his name attached to the bylines of tech articles. While not maintaining a double life as an aspiring writer by day and indie game dev by night, you’ll find him sat in a corner somewhere muttering to himself about microtransactions or hunting down promising indie games on Twitter.

Imagen 2 vs. DALL·E 3: Generating hands

Stay in the know with Laptop Mag

Imagen 2 vs. DALL·E 3: Generating text

Imagen 2 vs. DALL·E 3: Generating tools

Conclusion