Was NOT Expecting This! Midjourney V6 Competes with DALL-E 3 | Comparison & Review
Ғылым және технология
Midjourney v6 is a new AI model offering enhanced features such as longer prompts, better color and shading control, improved realism, and the ability to create convincing in-image text. It supports prompts for multiple subjects and excels in photorealism. While the default version might be v5.2, users can switch to v6 by typing "/imagine" with their prompt and adding “-v 6” at the end.
▼ Link(s) From Today’s Video:
✩ Midjourney: www.midjourney.com/home?callb...
✩ DALL-E 3 Bing: www.bing.com/images/create
✩ DALL-E 3 Designer: designer.microsoft.com/image-...
Chase Lean's Thread: / 1737816505507795060
Nick's Thread: / 1737728299332460681
Cig Test: / 1737892253543039440
► MattVidPro Discord: / discord
► Follow Me on Twitter: / mattvidpro
-------------------------------------------------
▼ Extra Links of Interest:
✩ AI LINKS MASTER LIST: www.futurepedia.io/
✩ General AI Playlist: • General MattVidPro AI ...
✩ AI I use to edit videos: www.descript.com/?lmref=nA4fDg
✩ Instagram: mattvidpro
✩ Tiktok: tiktok.com/@mattvidpro
✩ Second Channel: / @matt_pie
-------------------------------------------------
Thanks for watching Matt Video Productions! I make all sorts of videos here on KZread! Technology, Tutorials, and Reviews! Enjoy Your stay here, and subscribe!
All Suggestions, Thoughts And Comments Are Greatly Appreciated… Because I Actually Read Them.
-------------------------------------------------
► Business Contact: MattVidProSecond@gmail.com
Пікірлер: 335
*What do you guys think? Is Midjourney BACK?* Either way, I am pleasantly surprised by this early Christmas gift from Midjourney! Great work! Share Image Results Here!!!: ► MattVidPro Discord: discord.gg/bQgcbjs2Sg ► Follow Me on Twitter: twitter.com/MattVidPro
@LouisGedo
6 ай бұрын
7:11 Yes........Discord sucks! A functional Midjourney API is looooooong overdue.
@davehugstrees
6 ай бұрын
I don't see that MidJourney v6 has in-painting yet? Maybe I'm wrong but I don't see how to do it like with v5.
@Yipper64
6 ай бұрын
it has definitely been IMPROVED but really I think we are sleeping on google's image generator. It was able to do things that I havnt been able to do with other image generators, mainly an itchthys fish, and the pokemon Zoroark. Most AI image generators cant seem to get the small details right, either specific symbols or unusual body types for fictional creatures, so the fact that google's image generator can tells me it has a lot of potential, it just doesnt have quite the same quality and ability to handle a long prompt that these other generators do.
@soscilogical1904
6 ай бұрын
What about logic and prompt complexity? Could do with describing 10 things in a scene and see which engine wins. Also for complex things like a fish on a horse jumping over a car near in a waterpark.
@undergroundo
6 ай бұрын
Video idea: Small recap video for the end of 2023 with the evolution of all the Lemon images, to see the amazing progress AI has in just one year.
Midjourney since V4 has been amazing, and nothing has beat it in terms of aesthetic quality. Dall-E3 was better in understanding/prompt coherency, but failed in a lot of instances. V6 is looking incredible and it follows prompts more closely from the early feedback we have.
@BobbyMasteria
6 ай бұрын
lol, get real dude ! midjourney does NOT understand context at all
@88heiling
6 ай бұрын
Slightly better? More like lightyears better. MidJourney is still MID when it comes to prompt understanding.
@Kavriel
6 ай бұрын
@@88heiling I've used dallE-3 extensively and it's prompt understanding is not that good.
@Thatguynotgay
6 ай бұрын
Dalle3 has the best comprehension I could make absolutely anything in my mind if it's unfiltered
@Kavriel
6 ай бұрын
@@Thatguynotgay Try to make it generate a Torus-shaped planet or an O'neil cylinder. And if those weren't in your mind, well, they were in mine, and Dall-E-3 failed spectacularly.
The power of DALLE3 is in its capability to combine far or even contradictory concepts. I still didn't see Midjourney could do this.
Midjourney 6 is looking beautiful. Midjourney always has looked beautiful. But for me, if it mashes characters with each other, and struggles to mix various different elements into one image, ignore key elements of the prompt requests I make, I’m looking forward to Midjourney 7, because Dalle-3 still wins in that sense for me. But wonderful to know that Midjourney is stepping up their game!
@jaredf6205
6 ай бұрын
Dalle will always be at least slightly ahead on understanding since they have access to the best language model.
@Prodigy396
6 ай бұрын
I don’t think you will have to wait for v7, given that this is an alpha.
Great video as always Matt!
MJ website is pretty good, anyone with 10k+ images has access to the alpha. Personally I prefer almost everything about MJ to Dalle 3. The one thing I do like about Dalle 3 is its ability to get the scene set up exactly as described. Still learning how to prompt V6 and it is Alpha but Dalle will take some beating in that area.
Combine DALL-E 3 for initial generation with Stable Diffusion for image-to-image generation and Adobe Firefly for post-production; this is a solid combination for now.
@Author_SoftwareDesigner
6 ай бұрын
What are the benefits of this combination?
@EdwardAustin
6 ай бұрын
Also curious about this @@Author_SoftwareDesigner
@MrShepardDog
4 ай бұрын
I agree. Combining two or three agencies gives some great results...
All of these options have major limitations: Midjourney V6 is censored, still requires discord, as well as an expensive subscription to use professionally. DALL-E 3 is heavily censored and does not produce realistic photographs of humans (try to generate a realistic street photo of a fashion model and gaze at the laughable plastic Barbie doil skin). SDXL can't do text and lacks just a touch of that realistic sparkle you can get with Midjourney. I'm sticking with SDXL for now and playing with prompts and Loras to try for a Midjourney-quality realistic result (text isn't something I need). At the velocity this space is evolving and improving I expect within 24 months or so all of these options (and others) will have made professional AI image generation kind of a solved problem. Good video, thanks!
You are right Matt!...Thank you... another great review. ✨✨🍷✨✨
i really like your winter setup
Excellent breakdown!
I still won’t use Midjourney unless they solve the problem of character consistency, and also learn to accurately depict multiple characters in a scene without mashing them up together.
@naturallydope247
6 ай бұрын
Have you found character consistency in DallE?
It's so good in coherence to the prompt now, I'm so happy with it.
Hello, I am blind and relying solely on your verbal descriptions. When you mention that it's better in text, are you referring to enhancements in various aspects such as characters, font style, position, color and other designs? It's worth noting that this kind of AI holds tremendous utility for individuals with visual impairments, as it opens up possibilities for us to engage in photography and design. We verify the accuracy of generated images by using the amazing app for the blind called (Be My AI) which uses GPT-4 Vission. One issue we encounter in AI is its inaccuracy when reading and generating text. I didn't know it included the style and artistic design as well.
Considering that Midjouney is better at handling the photo realism aspect and DALL-E 3 is often the clear winner at interpreting certain compositional details, perhaps the most beneficial action would be to merge them together somehow to ultimately get even more impressive results! I even came up with the perfect name for it… 😌 "Mid-DALL-journ-E"
@Joshua_Froschauer
6 ай бұрын
Middle Journey, for sure! It just might work, you brave bastard, it just might fucking work!!!
Doesn’t exist until an uncensored equivalent is on my laptop.
@MattVidPro
6 ай бұрын
Totally understandable
Sadly, Microsoft reduced quality of DALL-E images generated in Image Creator some time ago. They reduced number of steps due to load or something like this. So little details, and especially backgrounds, took a massive hit in quality. I don't know how well OpenAI API version works, but it does support "hd" mode.
@spacekitt.n
6 ай бұрын
the stuff they are doing behind the hood makes images look very boring and literal. that and the extreme over the top censorship and its a disaster if youre an artist wanting to leverage ai.
@vomm
6 ай бұрын
Can't confirm this. I use Dalle-3 with Bing Creator and with OpenAI-Subscription AND with API and I think they're all more or less the same (except for the "natural" flag over the API which you don't have in the OpenAI or Bing Interface available).
@Athari-P
6 ай бұрын
@@vomm It depends on complexity of your prompts (number of concepts, interactions, characters, patterns etc.). For simpler prompts, there's little to no difference. If you're pushing the limits of complexity to the absolute maximum while juggling jailbreaks to bypass 5 levels of censorship, the difference between before and after is obvious.
@BionicAnimations
6 ай бұрын
@@spacekitt.n I have GPT Plus. Mid destroys DALLE when it comes to people and realism. Plus, DALLE has too many errors. I have not been able to get it to generate anything over the past 24 hours; just lots of errors all over the place; the same thing happened last week. Plus to violations when you ask it to generate something. It's really annoying. I am going back to Midjourney, at least until OpenAI improves DALLE.
@Matt Does Bing use Dalle3 HD? I would assume its the non-HD version, which is less intensive. If you really want to compare the best dalle3, use Dalle3 HD. My results with the Dalle3 HD API are ridiculously cool.
Yeah, I get what you mean even though it's not as feature complete yet I only prompted on the website and not Discord it just feels smoother.
Upgrades are comin in hot!
Looks cool. Personally Im only going back to midjourney once they introduce consistent characters. This was one of the biggest weaknesses I found with actually using it a lot.
@lamsmiley1944
6 ай бұрын
I accidentally signed up for a one year subscription to MidJourney in August. I still use Dalle more as it’s better at following prompts.
@chariots8x230
6 ай бұрын
I agree. I’m waiting for consistent characters, but also the ability to pose multiple of my custom characters together in scenes. I need to create scenes with multiple custom characters in them, and each character has to be accurate and consistent in every scene where they appear.
Considering the ridiculously fast evoloution, I'm hoping for some decent animation, that allows more then just simple movements. Let's get some cinematic action. :-)
@MattVidPro
6 ай бұрын
Excited for that! Check out Pika labs 1.0 for more on that.
@chariots8x230
6 ай бұрын
I hope we get some character consistency first, and also improvement in posing multiple characters together in a scene without Midjourney mashing them up. With consistent characters in our images, we can then use AI to animate those images and be able to create a story with them, instead of just creating random results.
It's nice you made comparison of V6 and Dalle3 but I want also to see V5 vs V6 (besides text generatuon which is obviously better)
I would argue the only thing dalle had over MJ5 is the ability to write.
@Infinity269
6 ай бұрын
For the average person (i.e. someone not highly skilled in prompt engineering) the ease of prompting with DALL-E is a big selling point - as is going back and forth with it in ChatGPT.
@MattVidPro
6 ай бұрын
DALL-E 3 is free thanks to Microsoft, and it has a better ability to incorporate more of the prompt in. Still, V6 is a huge leap towards competing on those fronts!
@cesar4729
6 ай бұрын
¿Maybe that people can actually USE IT FOR FREE? 🙄
@Enu_Vibe
6 ай бұрын
@@Infinity269 you are right about prompt engineering. You have to know how to in DALLE to get amazing results.
@southcoastinventors6583
6 ай бұрын
This sounds like it came from the xbox vs PS debate even though Nintendo is way better. Each have their own use cases but considering both are closed system in a few years people just run free version on their machines same as they do with word processors.
Did you notice the resemblance between you and the lemon?🍋. Uncanny!😁👏
I ran most of the same prompt with Foocus 2.1.48 and got similar to if not better (more accurate stand up pouch with hanger cutaway and tear notches ) and correct spelling 5 out of 6 images. Foocus is a downloadable Stable Diffusion xl
@user-nl7fw3yp8p
6 ай бұрын
it's not SDXL tho, but yeah I get better images with it
hi matt, love your vids. have you heard of the two science applications of ai. one was googles sth proposing great advances in material science and the other was a chinese projekt about an fully autonome working robot, analyzing material for production of oxygen. maybe that’s also your kind of stuff, just picked it up in shorts. peace 🖖
I don't consider myself THAT old, but I found it funny that out of 147 other comments, with only a handful of people commenting on the supposed Tom Hanks image, that nobody could tell that it was the likeness of a young John Wayne! A few people observed that it wasn't, in fact, Tom Hanks but nobody picked up on who it actually looked like. If you think I'm right and that the image at 16:35 looks like John Wayne (and not Tom Hanks, or Kevin Spacey, or...), please like this comment! 🙂
@ 8:06 the lemon character on the right top looks exactly like Matt hahaha
So its caught up with SDXL with text loras. Neat.
Have you tried the new OpenDalle1.1 model for SDXL? It does text, too. And runs offline
👏👍💕 Learning something everyday!
*(**8:08**)* When you mentioned that the Disney logo had been spelled correctly, it made me do a double take before noticing that it was actually missing the "e" at the end. 😅
Is inpainting possible for MJv6 or D3?
2:09 - what UI is he using for Dalle3 here ??
What AI does Google use for their Image Generator? I love using that one but theres hardly any information on it.
@IceMetalPunk
6 ай бұрын
Imagen, I believe. It's either that or Parti, but I think it's Imagen.
Hows the unlit candles test?
Remember the "lost footage of the sea monster" you should try it and see if you can get it to look like actual lost cctv
The Tom Hanks image looked like Kevin Spacey
5:18 This has got to me some sort of subtle Deltarune reference or maybe i'm losing my mind lmao
I think the next step for image generation is video. Consistency from frame to frame. In a not so distant future we'll be generating our own personalized Netflix shows. Democratization of technology is taking a whole new meaning.
@CoolhandLukeSkywalkr
6 ай бұрын
Text to video ai already exists. It's not a replacement for text to image software. They coexist.
@Elwaves2925
6 ай бұрын
As well as what the above person says, video is not the next step for image generation. Consistency might be the next step for video generation but image generation will go it's own route. For all the crossover, they are two different avenues.
@user-ge5et2lw1f
6 ай бұрын
Absolutely, 2 distinct technologies. I’m pointing to the idea that the more image generation tends toward perfection the more focus is likely to shift toward video generation. Lot of real world applications/value will be unlocked then📈
I think to be a fair comparison between Dall-E3 and Midjourney V6 you need a side by side comparison using specific variables. For example for cinematic scenes, motion and photorealism V6 beats DallE hands down. Especially with studio style, fashion, street style human photography. DallE is only better with text. DallE is also heavily censored compared to Midjourney! Try describing the facial characteristics and clothing of each character in the scene to avoid what I coined “The twin effect”.
@naturallydope247
6 ай бұрын
I agree 100%. DallE3 is not good yet IMHO.
As always... a great video. Got my MJ subscription and never dropped it. v6 is super good.
KREA AI is currently my favourite. The live-creation function lets you tailor your results so specifically, as well as letting you add pictures or drawings to your side of the screen to influence the results. That makes up for any other shortcomings, in my view. Though, I imagine it will only get better as it continues to evolve, as well.
@MattVidPro
6 ай бұрын
Check out my tutorial for doing this locally!
How you zoom out on V6 from a mobile phone ?
The Floral Symphony picture at 5:03 looks amazing except the label and the cap look a bit odd. Looking forward to what AI can do in 2024.
When I put "A logo for KZreadr Anti-HyperLink with a red and black character" into DALL-E 3 or Midjourney, sometimes it produces a character similar to one I had on that channel for a while, and it was created by Midjourney. That just excites me because that means it found that logo on that channel. I don't think it means anything. Both give cool logos and Midjourney does that way better than my trio, but DALL-E can rarely get the text right. I had to roll the prompt many times for all my channels on DALL-E 3 to get something usable with text. I have a lot of cool images in my creations now, though.
Where the video of you trying out the new MJ Alpha web interface currently available Matticus?
@MattVidPro
6 ай бұрын
I haven’t generated enough midjourney images 😳
@matthewoates
6 ай бұрын
I think it'll move from Alpha soon, I spent too much money on MJ but for concept art inspiration it's amazing. I don't like that you have to pay for private mode on it though. @@MattVidPro
When MJ will depicts 'quantum squeezing' correctly it will be my day.
some of these examples are the exact things i been doing over and over on v5 basically training it. like making fictional title cards to an in gamd dnd version of netflix
I’ve been experimenting with Midjourney using a bit of python commands and it seems to help a bit with the words
Finally something as good as dall e 3 not as censored
@MattVidPro
6 ай бұрын
The lack of censorship is very refreshing
DallE 4 has entered the chat.
@MattVidPro
6 ай бұрын
Oh lawd. If we see DALL E 4 next year ill loose it
I've been using Midjourney from V3 and DALL-E 3 after it came into ChatGPT but after making nearly 10k images in Midjourney, Dall-e can only say it understands the language better to get closer to your idea and it more reliably makes proper text. Though I have made Midjourney make correct text as far back as v 4. It just takes giving short simple words that are the image, v 5 and v 5.1 I was making Coca-Cola cans that said Coca-Cola. So improvements in fidelity of the fine details and text of v6 was what I hoped for. Dalle on the other hand is always so CG looking. Like 2000s and 2010s CGI, not bad not great, I feel it competes better with SDXL than Midjourney
I think the bing Dalle-3 version is now running a better model than the bing image generation model. Edit: Also I think Openai is doing something to make the models less likely to make realistic. images.
@MattVidPro
6 ай бұрын
I've heard a lot about this, and from my personal testing I think you might be correct.
@Athari-P
6 ай бұрын
Last time I tried, Bing Chat just initiated a generation in Image Creator as normal. Did they change this interaction? Also, Bing Chat is an extra LLM layer of censorship on top of 5 layers in Image Creator, so I'd rather avoid that.
@countofst.germain6417
6 ай бұрын
@@Athari-P the outputs are definitely different idk if it just saves your image there, but I'm getting wildly different results from Bing and Bing image creator from the same prompts also I read somewhere they updated the Bing model specifically.
I've never had copyrighted characters blocked with DALL-E 3. The only issue I've had is not knowing some things like Digimon or Trailer Park Boys. Digimon produces Digimon-esque creatures with a lot of Pikachus thrown in there, Trailer Park Boys produces fucking hilarious results. If you specify the characters from TPB, it merges them mostly. It definitely understands Bubbles, but cannot get Ricky or Julian perfectly.
At 4:28 there wasn't even an attempt to include text in the image because the prompt hadn't "Coca-Cola" inside quotation marks
trying out some comparisons with bing and midjourney has me floored. this is like bing with prompt adherence, but with actual style. whatever dall e is doing behind the hood makes things much 'plainer' looking. midjourney crushes them. good. was sick of dall e being the only prompt-adherent game in town, they deserve to be crushed for how hard they censor things
They are already working on and will release a web interface very soon great mention!
@southcoastinventors6583
6 ай бұрын
Slow and steady I guess
@yoagcur
6 ай бұрын
It's already available for those that have created loads of images already (was 20,000 but may be lower now). It's pretty good and I tend to use it over Discord
When are they gonna make it so that we can access it from a browser for goodness sake! I cannot use Disccord
00:03 Mid Journey V6 competes with DALL-E 3 02:15 Mid Journey V6 produces more cinematic and realistic images compared to DALL-E 3 and SDXL. 04:20 Midjourney V6 competes with DALL-E 3 in accuracy and aesthetics. 06:24 Midjourney V6 shows better prompt accuracy and creativity. 08:27 Mid Journey outperforms DALL-E 3 in prompt accuracy 10:20 DALL-E 3 and Midjourney V6 can go head-to-head with each other. 12:18 Mid Journey V6 provides realistic dog images but struggles with multiple characters in a scene 14:20 DALL-E 3 and Midjourney V6 have strengths and weaknesses in image production 16:19 Comparison of Mid Journey V6 with DALL-E 3 18:12 Midjourney V6 competes with DALL-E 3
We’re finally getting out of discord with an alpha website you can access now if you’ve generated enough, not quite there yet but soon.
@MattVidPro
6 ай бұрын
Ik it’s taking foreverrr
Midjourney got professional costumes where Dall-e got consumer grade costumes.
Has anybody tried generating comics with text bubbles?
To focus on text is easier to evaluate. All though it also is less important because text can and will be added manually
I don't really understand the need to generate text with the image when it would give you way more freedom and control to rather get the image right and then add the text you want manually. You might get a terrific image but the text is all wrong or perfect text but the image is all wrong. It would be like one in a million to get both right at the same time.
We deserve a lemon animation😂
5:37 but it adds cuteness.
I love it on discord I have it in app on PC always logged in I can do the prompt on PC and see the prompt going on my phone when I get coffee
You should mention that Dalle3 also has an “hd” mode that you can access through API only, which costs 2x than a normal generation, but improves quality quite a bit in small details.
@maxington26
6 ай бұрын
How to access this "hd" mode in Dalle3?
@KlimovArtem1
6 ай бұрын
@@maxington26 a new field - “style”: “hd”, in the API request.
16:00 Tom Hanks? More like young Kevin Spacey
The cow eating the cheeseburger is actually quite accurate, as burger usually made from cow 😂😂😂😂 midjourney just pointing out the cow in the room
i wonder if md6 uses gpt4 turbo api to understand user request
at 3:12 the top banana looks weird and the second from the top is missing something
Midjourney coca can is better for a simple reason: coca cans are unlikely to be in multiple colors - midjourney correctly makes it all single color, dalle makes it in full color
Does it understand font styles if requested?
@MattVidPro
6 ай бұрын
Yes
@Wangavision
6 ай бұрын
@@MattVidPro - Thanks. I wonder how thatwill work with fonts that are paid / licensed only? Excellent videos from you BTW - they have been my go-to for AI and have helped me a great deal with my job.
BUT we lost the zoom-out function 😞
I can say that V6 so far is not nearly as big of a jump as V5 was from V4. It almost feels like a lateral move, tuning the model to do some things that people wanted it to do better, at the expense of other things it did well. Feels like hidden model parameters and default parameters were tweaked, but it doesn't feel like a major jump in capability or semantic understanding to me. Perhaps the biggest thing I saw it show promise was when I asked it to generate an image that had a gradient of textures, continuously transitioning through a series of textures seamlessly and convincingly. Version 5.2 could be hit or miss with that, even using the same prompt.
11:10 I'm not sure which is best at animation-style images but you now need to be *very* specific in your prompts to get the best out of Midjouney v6. There is now a 350-word limit so go to town,.:-)
8:55 HOW BA-A-A-AD CAN I BE IM JUST DOIN WHAT COMES NATURALLY
@MattVidPro
6 ай бұрын
Can someone AI me into that song pls
Complaining that you have to type /imagine is epic 😂😂
That Tom Hanks pic only looks like Tom Hanks if Kevin Spacey changed his name to Tom Hanks
It's a fascinating breakdown of Mid Journey V6 and Dolly 3! It's incredible to see how these AI tools are evolving, especially with Mid Journey stepping up its game. The text rendering capabilities in Mid Journey V6 seem impressive, and the photo realism aspect is mind-blowing. Still, Dolly 3's diverse outputs should be noticed. It's like watching a tight race where each has its unique strengths. I can't wait to see how they continue to evolve and what this means for AI art creation. It's an exciting time for digital artists and tech enthusiasts alike! I'm really looking forward to more deep dives like this.
@Octamed
6 ай бұрын
That was written by AI I presume?
@I-Dophler
6 ай бұрын
@@Octamed You guessed it! The impressive advancements in AI tech have made it possible to generate content that's increasingly difficult to distinguish from human-created work. These tools are becoming adept at understanding and replicating our styles and nuances. It's a fascinating time in tech, but it also raises questions about authenticity and creativity in the digital age. The line is blurring, and it's both exciting and a bit unsettling to think about where this could lead us in the future.
@atorik1076
6 ай бұрын
@@I-DophlerDolly 3
@I-Dophler
6 ай бұрын
@@atorik1076 Just watched your deep dive into Mid Journey V6 and Dolly 3 - fascinating stuff! The battle of the AI image generators is like watching two smart artists in a paint-off, but instead of paint, they use pixels and algorithms. Here's a joke to lighten the mood: Why did the AI go to art school? Because it wanted to learn how to "draw" conclusions! 😂 Keep up the great work, love your in-depth analyses!
@vanguardianofficial6048
6 ай бұрын
@@I-Dophlerdude at least be human when you’re commenting LOL it’s still very obviously AI
How do we get access midjourney 6 in discord? U answered lol in settings yay done
@MattVidPro
6 ай бұрын
/settings in the Midjourney Bot, and change version to V6 Alpha
Nothing beats Dalle-3 via Bing at moment. Because 50 tries á 4 images of which 1 turns out great completely for free is still better than 10 tries of which 1 turns out great but reaching the usage limit even if you have a paid subscription ... . The main issue with Dalle-3 is that all the creations doesn't look really photorealistic. They have a natural filter via the API which turns out to create very realistic creations, but it is not as coherent imho. But yeah, you also get the default Dalle-3 to create quite realistic prompts if you tweak your prompts. Another downfall is Dalle-3 seems to be trained hugely on models footage, almost all people look like from a catalogue, it's quite hard to get it to create normal looking persons, like if you add things like "mild acne" to your prompt. Also it tends to give all persons the same hair style, it has very less of variation as long as if you don't explicitely add variations to your prompt. But yeah ... if you put a lot of efforts into your prompts I think Dalle-3 still beats Midjourney generally spoken. And Dalle-3 is really good at understanding prompts, compared to MJ and others.
2:25 "Just for gits and shiggles" 😂🤣🙃😉
The blending of characters in the same image is a problem I struggled a lit since I started with V4. That's a shame.
I can totally see why the cow is morphing into the burger
With text usually you get better result when the the word is more common English. "Video" will give better result then "Vid"
The McDonalds worker looks more like John Wayne.
They should pick a niche within the text-to-image language and focus on improving their language within that language model (if possible).
Well, I couldn't spell when I was 1-1/2 years old either.
@MattVidPro
6 ай бұрын
So true we are harsh on these lil AI!
Damn! I really love the crazy words we’ve been getting with AI image generation. True!
theres no content warning when using ip like breaking bad and superman???
Something like the finetuned Realities Edge XL model on Civitai will come extremely close to the photorealism of MJ V6, I gotta say, not too impressed with it - although it did follow the prompt OKish, something SDXL can still struggle with sometimes!
Walter White and Jessie Pinkman can easily be made with SDXL and extensions, so the flexibility of SD still beats both Dall-E and MJ. It's easier to get exactly what you want with SD, but with some work of course!
WOW :O
"Shiggles" is short for shits and giggles.
Tom Hanks? That looked more like Kevin Spacey. Anywhos, MJ 6.0 is way better in terms of prompt understanding and a little up in quality over 5.2. But no Remix (inpainting) or the zoom and pan features, which I hope will be back, make me still use 5.2. Text is nice, though, and hopefully, it all improves and, as mentioned, the Alpha website goes public like it was supposed to last month.
Im surprised it didn't try to spell "mattvidro" like Dalle does sometimes 😅