• Digestible AI
  • Posts
  • OpenAI's Latest and Greatest Image Generation Model

OpenAI's Latest and Greatest Image Generation Model

+ Apple Intelligence in the Vision Pro šŸ‘€

In partnership with

Has OpenAI Reclaimed The Niche Of Image Generation?

In this edition we’ll be covering…

  • The latest news on OpenAI’s most recent image model launch

  • A tutorial on how to use OpenAI’s new image model

  • A look at Apple’s announcement on Apple Intelligence on the Vision Pro

  • 5 AI trends

  • 3 AI tools to keep you productive

  • And much more…

The Latest in AI

OpenAI Launches Image Generation on GPT-4o

Image generation is now a really popular capability that’s offered by many different vendors. They all offer different kinds of performance and stylistic flavors on their outputs, and open source models such as Mistral’s have really been impressing as of late.

OpenAI’s DALL-E fell a bit out of the spotlight for a while, with even others making custom GPTs on the GPT Store overshadowing it.

However, OpenAI has now released image generation on 4o, which brings some exciting new features that could bring ChatGPT back into the spotlight (for image generation, as it’s already quite popular for everything else).

What’s the big deal?

  • The new capability demonstrates a really impressive amount of precision for symbols in image generation, with the ability to effectively render things like language on signs.

  • 4o introduces multi-turn generation, which means that it is more capable of accurate image refinement (remember trying to edit an image by selecting and suggesting to DALL-E? This is wayyy better).

  • It also has in-context learning, which allows it to effectively analyze user uploaded photos and use it as context to generate new photos!

  • There are other features as well, so we suggest you check out the official blog post from OpenAI for showcases of the features above and way more!

So What?

Image generation continues to be one of the most popular and sought after AI use cases. It has a wide sphere of applications and can be applied for endless use cases. No matter who you are, you will be able to benefit from the creativity gains of better image generation.

But what does ā€œbetterā€ really mean?

Well, OpenAI is redefining the standard for image generation with this model, as its quality, precision, refinement, and other capabilities are truly a great stride toward more responsive and intuitive image generation. Put simply, you can have more trust and faith that you’ll get the creative output you want from the creative input you give.

Together with Stack Influence

Want to scale up your new product launches into listings that make $100K+ yearly revenue in less than 2 months?

Use the platform Stack Influence to automate Micro-Influencer collaborations at scale (get thousands of collabs per month) and boost your Amazon launches, generate UGC, and scale up your brand awareness like never before.

Top Amazon brands like Magic Spoon, Unilever, and MaryRuth Organics have turned their new Amazon product launches into listings with monthly revenue on pace to break $100K over the year.

  • Pay influencers only with products (stop negotiating fees)

  • Increase external traffic Amazon sales (get to top page rankings)

  • Get full rights image/video UGC (build your brand with authentic content

  • 100% automated management (don’t lift a finger to get influencer collabs at scale)

Get Your Hands Dirty!

How to Use Image Generation with 4o

We won’t go over all the features OpenAI has launched, but we’ll go over the basic use case. If you’re curious for more, you can always check out the official blog!

Here’s how you can use the multi-turn generation feature:

  1. Navigate to the main ChatGPT interface and select the 4o model in the top left.

  2. No need to toggle anything, just input your prompt, asking it to produce an image of whatever you want. Start simple here! I asked it to just generate an image of a cat because I like cats (in fact I have 2)!

  3. Ask it to add something to the photo! In my example, I told it to put a hat on my cat!

Seriously, this is some really impressive stuff, as you can see the second image retains all the previous features of the original image. The hat is added exactly according to my (albeit) simple description, without any weird distortions like a sixth toe bean. Try it for yourself! šŸ‘‡

Industry Intel

Apple Intelligence On Vision Pro

Image from Apple

Apple dropped a whole slew of features on their mixed-reality headset, the Vision Pro. This includes:

  • Writing Tools - rewrite, proofread, refine, summarize, or use compose to integrate with ChatGPT to generate content.

  • Image Playground - generate images supporting native integrations with Messages, your photo library, and FreeForm.

  • Genmoji - generate original Genmoji by typing/speaking.

  • Create a Memory Movie - create a ā€œmovieā€ using your photo library

So What?

Apple Intelligence has been scoffed at here and there as just Siri 2.0, but this new release showcases some serious innovation on Apple’s part.

Not only that, but also it’s in a very interesting niche of AI that not many have ventured in it. Augmented reality and mixed reality are all spaces that could really benefit from the creativity and productivity gains of AI, and that’s exactly what Apple is showing off with these new features!

All these features are pretty useful for Vision Pro users in the sense of fun and productivity, but also in another sense, it demonstrates an increasing investment in AI for augmented/mixed reality experiences. Ultimately, this could lead to improvements in other fields as well that could really benefit people’s lives (like a deaf person could be able to see AI-generated and transcribed text on sounds and speech happening around them)!

Quick Bites

Stay updated with our favorite highlights, dive in for a full flavor of the coverage!

Image from Huffington Post

Bill Gates says healthcare and education will be obsolete and easily replaced by AI in his latest interview with Jimmy Fallon.

Though OpenAI is killing it with 4o image generation, they are getting hit pretty hard with the amount of traffic, so they had to disable video gen for certain Sora users.

Anthropic provides a deeper look at the ā€œbiologyā€ of their extremely successful model, Claude.

Elon Musk is under fire with a trademark dispute over the naming of xAI’s Grok.

šŸ‘š Outfit.fm - Create professional grade photos for your clothing brand.

šŸ“‹ Ping - AI-powered todo-list.

šŸ“— Qodo - Become a better developer with Qodo AI.

Until we Type Again…

Thank you for reading yet another edition of Digestible AI!

How did we do?

This helps us create better newsletters!

Login or Subscribe to participate in polls.

If you have any suggestions or specific feedback, simply reply to this email. Additionally, if you found this insightful, don't hesitate to engage with us on our socials and forward this over to your friends!