
How does your avatar look when enhanced by AI?


Danielle Atheria

Started playing with the SDXL model found on Replicate.

Original Picture:

RachelSam16October2023_003.thumb.png.92e95b95ebcfc8d11ac1729f0b8aeb1a.png

 

Two attempts: (Prompt: "full body picture of a woman with very long hair")

replicate-prediction-jm553gdbxxcmdqzem4tpz65owq.thumb.png.a5426f89d1356aff51d28e3cb049ca93.png

replicate-prediction-ngyrzmlbtdiybsqlnp3rc4hyna.thumb.png.f763ab311b7fb8c20e4a57c99beabcfa.png

 

I'm not totally satisfied with the results but I will continue playing around with the tool.

Edited by Gopi Passiflora
Added prompt for img2img

I've been playing a lot with Easy Diffusion today, goofing around with settings. I initially loaded the image at a lower resolution, believing it would speed up the process and put less strain on my video card, but that assumption was completely incorrect. At 768x400 for the initial image and ControlNet, the AI-rendered images were of far lower quality; the faces were really horrible and I couldn't get them right at all.

Initial image, and controlnet (system avatar)

Snapshot_198.png.def97bd94ba42529ba42d7723e417bd4.png

Dimensions: 512x512, Sampler: dpmpp_2m_sde, Inference Steps: 25, Guidance Scale: 5, Model: cyberrealistic_v33, VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, Preserve Color Profile: false, ControlNet Model: control_v11p_sd15_canny

highres__ultra_detailed__Ultra_precise_depiction__Ultra_detailed_depic_S2456674043_St25_G5.thumb.jpeg.06a54ad2f47e83a12b34a4376aff4b82.jpeg

I did not like the background of the one above, so I added a new prompt for the next one, which was supposed to be a line drawing, but it did not quite follow through as expected.

Dimensions: 512x512, Sampler: dpmpp_2m_sde, Inference Steps: 25, Guidance Scale: 5, Model: cyberrealistic_v33, VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, Preserve Color Profile: false, ControlNet Model: control_v11p_sd15_canny

highres__ultra_detailed__Ultra_precise_depiction__Ultra_detailed_depic_S1787663982_St25_G5.thumb.jpeg.64e582c136a288ed0c175a659138fae2.jpeg

Second shot, this time using a higher-resolution image for the initial image and ControlNet.

Snapshot_176.thumb.png.e18addf156d0395abfe10ec8352a4f0c.png

The picture below uses a LoRA

Dimensions: 768x400, Sampler: dpmpp_2m_sde, Inference Steps: 25, Guidance Scale: 1.1, Model: cyberrealistic_v33, VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, Lora Model: detailSliderALT2, Lora Strength: 0.24, Preserve Color Profile: false, ControlNet Model: control_v11p_sd15_canny

highres__ultra_detailed__Ultra_precise_depiction__Ultra_detailed_depic_S1645859762_St25_G1.1.thumb.jpeg.5716dd8dff8b18f6bbda589d6eab4e66.jpeg

The picture below does not use a LoRA

Dimensions: 768x400, Sampler: dpmpp_2m_sde, Inference Steps: 25, Guidance Scale: 1.1, Model: cyberrealistic_v33, VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, Preserve Color Profile: false, ControlNet Model: control_v11p_sd15_canny

highres__ultra_detailed__Ultra_precise_depiction__Ultra_detailed_depic_S3203680988_St25_G1.1.thumb.jpeg.078a940b97bb838a13d74fe1c37e35c0.jpeg
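For anyone comparing runs like the two above, here is a minimal sketch that parses the settings lines and lists what differs between them. The helper names `parse_settings` and `diff_settings` are made up for illustration; the only assumption is that the settings are pasted as comma-separated `Key: value` pairs, as in this post.

```python
def parse_settings(line: str) -> dict[str, str]:
    """Split a 'Key: value, Key: value' settings line into a dict."""
    settings = {}
    for field in line.split(", "):
        key, _, value = field.partition(": ")
        settings[key.strip()] = value.strip()
    return settings


def diff_settings(a: dict[str, str], b: dict[str, str]) -> dict[str, tuple]:
    """Return {key: (value_in_a, value_in_b)} for every key that differs."""
    keys = sorted(set(a) | set(b))
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}


# Settings copied from the two 768x400 runs in the post.
with_lora = parse_settings(
    "Dimensions: 768x400, Sampler: dpmpp_2m_sde, Inference Steps: 25, "
    "Guidance Scale: 1.1, Model: cyberrealistic_v33, "
    "VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, "
    "Lora Model: detailSliderALT2, Lora Strength: 0.24, "
    "Preserve Color Profile: false, "
    "ControlNet Model: control_v11p_sd15_canny"
)
without_lora = parse_settings(
    "Dimensions: 768x400, Sampler: dpmpp_2m_sde, Inference Steps: 25, "
    "Guidance Scale: 1.1, Model: cyberrealistic_v33, "
    "VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, "
    "Preserve Color Profile: false, "
    "ControlNet Model: control_v11p_sd15_canny"
)

changed = diff_settings(with_lora, without_lora)
for key, (first, second) in changed.items():
    print(f"{key}: {first} -> {second}")
```

Only the two LoRA fields show up as different, so any visual difference between those two pictures comes from the LoRA and the seed.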

Edited by Istelathis

21 hours ago, Janet Voxel said:

I think the confusion comes from progress that has been made with AI. It has been with us for a while. Before you’d use AI to “Enhance” your picture. Really what it did was run your picture through a series of filters and voilà….you had an AI enhanced photo.

Over the last couple years the advancement is really, that it’s doing its own thing. The better ones allow you to put how much it’ll actually alter your photo. The real magic is you can just type something, a few sentences and you’ll get a fully rendered drawing, painting or photo.

I’m not a techie person, so I may not have explained it properly. I think the misunderstanding is that it’s supposed to enhance what you put in there as a reference. It’s really taking what you put in there and making something new.

Personally, I enjoy playing around with it.

I'm the resident techie-adjacent gal and I give this my seal of approval. Good explanation!

 

27 minutes ago, Istelathis said:

I've been playing a lot with Easy Diffusion today, goofing around with settings. I initially loaded the image at a lower resolution, believing it would speed up the process and put less strain on my video card, but that assumption was completely incorrect. At 768x400 for the initial image and ControlNet, the AI-rendered images were of far lower quality; the faces were really horrible and I couldn't get them right at all.


Woah - you've made leaps and bounds with these already! Yah, having the correct resolution is very important. SD 1.5 works best in the range of 512-768 pixels per side. Deviate too far from that and you're going to lose a considerable amount of quality and cohesion. The crux is that you really want a higher resolution for things like faces and hands.
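As a rough sketch of that resolution advice, the helper below scales a source image size so its longer side lands at 768 while keeping the aspect ratio. The function name `fit_to_range` is hypothetical, and rounding down to a multiple of 8 is an assumption based on how SD front ends usually constrain dimensions (some ask for multiples of 64 instead).

```python
def fit_to_range(width: int, height: int, max_side: int = 768,
                 multiple: int = 8) -> tuple[int, int]:
    """Scale (width, height) so the longer side equals max_side.

    Keeps the aspect ratio and rounds each side down to a multiple of
    `multiple`. Integer arithmetic avoids float rounding surprises.
    """
    longest = max(width, height)
    new_w = width * max_side // longest
    new_h = height * max_side // longest
    # Round down to the nearest allowed multiple, but never below one step.
    new_w = max(multiple, new_w // multiple * multiple)
    new_h = max(multiple, new_h // multiple * multiple)
    return new_w, new_h


print(fit_to_range(1920, 1080))  # a typical widescreen snapshot
print(fit_to_range(1024, 1024))  # a square snapshot
```

Note that for wide images the shorter side can still end up below 512, so cropping closer to square before resizing may be worth trying.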

There's a little trick you can do: head on over to inpaint, set a low denoise of around 0.4, switch to "only masked", mask out the face and then run the same prompt at the highest resolution you can. This gives that small area more resolution to work with (and for those who would like to automate it: the adetailer extension does this for you under a1111).
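To show why "only masked" helps, here is a small arithmetic sketch, not a1111's actual code. As far as I understand it, the masked area is cropped with some padding, rendered at the full working resolution, and pasted back. The function name `masked_crop_plan`, the 32-pixel padding and the example face box are all assumptions for illustration.

```python
def masked_crop_plan(mask_box, image_size, padding=32, target=512):
    """Work out the crop region and magnification for a masked area.

    mask_box   -- (left, top, right, bottom) of the mask in pixels
    image_size -- (width, height) of the full image
    padding    -- extra context kept around the mask (assumed value)
    target     -- resolution the cropped area is rendered at
    """
    left, top, right, bottom = mask_box
    img_w, img_h = image_size
    # Grow the box by the padding, clamped to the image borders.
    left = max(0, left - padding)
    top = max(0, top - padding)
    right = min(img_w, right + padding)
    bottom = min(img_h, bottom + padding)
    crop_w, crop_h = right - left, bottom - top
    # How much larger the crop is rendered than it appears in the image.
    scale = target / max(crop_w, crop_h)
    return (left, top, right, bottom), scale


# Hypothetical 96x96 face in a 512x512 picture.
crop, scale = masked_crop_plan((200, 60, 296, 156), (512, 512))
print(crop, scale)
```

In this made-up case the face region is rendered at roughly three times its on-image size per side, which is where the extra detail comes from.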


20 hours ago, Orwar said:

The day we can prompt an AI to 'mesh, rig, and texture some fancy clothes for SL', I'll be opening a store selling that sort of stuff!

That has already happened... ChatGPT has been used to create mesh in SL and it's really good... you do need to know how to prompt it correctly, though.


How do you guys do that? All I can do is describe what I want to have as a picture but I can't find something where I can put an image of my Avatar in and say "Make it look real" (for free if possible ^^).

I made pics but that's not what I'm looking for, she's looking gorgeous tho:

neytiri-darkblue-skin-with-tigerstripes-long-black-braided-hair-huge-yellow-catlike-eyes-standi-2650846.png

 

Edit: I think I found a way. The results are disturbing but I'll get there one day haha

Edited by Sabrina Nebula

2 hours ago, ValKalAstra said:

There's a little trick you can do and that is head on over to inpaint, set it at a low denoise of give or take 0.4, switch to "only masked", mask out the face and then run the same prompt at the highest resolution you can. This will give that small area more resolution to work with (and for those that would like to automate that: adetailer extension does this for you under a1111).

Thank you for the kind words and advice. My poor graphics card throws error messages even when only inpaint is used to produce a higher resolution; the card in this laptop only has 4 GB of memory. I think it might also be due to using Easy Diffusion, since its settings are not as configurable as stable diffusion or a1111. I actually downloaded JuggernautXL as a model because it looked really impressive for making more realistic pictures, and it just about killed my laptop 🤣 Trying to load a 6 GB model was probably not the best idea I had today.

I did manage to get it to produce a slightly higher resolution using inpaint, and I was impressed with some of the changes. I'll have to play around with it a bit more to see if I can get better results. The face was inpainted, but there are no advanced settings for how much I can change it. There are only preset icons I can click for opacity and sharpness.

latest.thumb.jpeg.8700b90b2a1d64b01b556e16bb66e50c.jpeg

When zoomed in, it looks like the nose is broken though, lol.  It was less noticeable before I upscaled it.

Hopefully I will get a newer laptop in a few months. If I do, I'll try using plain Stable Diffusion, as I hope to have become more familiar with generating AI art by then. I think Easy Diffusion is great for getting started and becoming familiar with everything, and it is capable, but from what I have seen it is not nearly as configurable. Regardless, it is a lot of fun learning how this all works and trying to get this old laptop to produce all that it can.

Edited by Istelathis

I can't even figure out what changed on artbreeder.. I can't even find where to upload my image to get started so I can do any of this fun stuff.. hehehe

 

ETA: Always two seconds after I'm ready to give up, things land right in my lap.. Found it!! \o/

My advice in this world is, don't look for things, just stumble around until you trip over them.. lol

Edited by Ceka Cianci

On 11/16/2023 at 10:29 PM, Orwar said:

   The day we can prompt an AI to 'mesh, rig, and texture some fancy clothes for SL', I'll be opening a store selling that sort of stuff!

Yep. It's undoubtedly coming too.

I'm sure it already exists in some form, I'd genuinely love to see what this type of technology is capable of when it comes to generating 3D art.

edit: I saw someone up there mention ChatGPT as having done this; I would be curious to see it. If you look at a Collada file in a text editor you will see how possible this is: it's laid out in a surprisingly meatbag-readable way. There's a lot of it for anything truly complex, but for a language AI like ChatGPT that can already write code (kinda), there's no reason it couldn't string together a file like this.
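To illustrate how readable the format is, here is a hedged sketch that reads a single triangle out of a heavily trimmed Collada-style fragment using Python's standard XML parser. The fragment is hand-written for this example and leaves out parts a real export would carry (the asset block, accessors, scene), so it is not something you could upload to SL as-is.

```python
import xml.etree.ElementTree as ET

# Namespace used by COLLADA 1.4 documents.
NS = {"c": "http://www.collada.org/2005/11/COLLADASchema"}

# Hand-written, trimmed fragment: one mesh containing one triangle.
COLLADA_FRAGMENT = """\
<COLLADA xmlns="http://www.collada.org/2005/11/COLLADASchema" version="1.4.1">
  <library_geometries>
    <geometry id="triangle" name="triangle">
      <mesh>
        <source id="triangle-positions">
          <float_array id="triangle-positions-array" count="9">
            0 0 0  1 0 0  0 1 0
          </float_array>
        </source>
        <vertices id="triangle-vertices">
          <input semantic="POSITION" source="#triangle-positions"/>
        </vertices>
        <triangles count="1">
          <input semantic="VERTEX" source="#triangle-vertices" offset="0"/>
          <p>0 1 2</p>
        </triangles>
      </mesh>
    </geometry>
  </library_geometries>
</COLLADA>
"""

root = ET.fromstring(COLLADA_FRAGMENT)

# Vertex positions are just a flat, whitespace-separated list of floats.
floats = [float(x) for x in root.find(".//c:float_array", NS).text.split()]
vertices = [tuple(floats[i:i + 3]) for i in range(0, len(floats), 3)]

# The <p> element lists which vertices make up each triangle.
indices = [int(i) for i in root.find(".//c:triangles/c:p", NS).text.split()]

print(vertices)
print(indices)
```

The whole mesh is plain numbers in plain text, which is why generating such a file from a language model is at least plausible; rigging and texturing data add a lot more of the same.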

 

 

Edited by AmeliaJ08

 

image.png.6eb423134d7cfa6be585fbfefdec729b.png

Initial Image/Controlnet

 

photograph__beautiful_woman_S2560686154_St25_G5.jpeg.af585f8abd191ea4850cde48086fdab9.jpeg

Photograph

 

watercolor__golden_hour__beautiful_woman_S1781881195_St25_G5.jpeg.4dbfea3d307ef5849ef5a7aca56deb69.jpeg

Watercolor

 

anime__beautiful_woman_S2246413695_St25_G5.jpeg.f1031aff954a4b6c018fdae317424a40.jpeg

Anime

beautiful_woman__Line_Art_S969103821_St25_G5.jpeg.e47fd616a721a19b015fc5c8e70ba399.jpeg

Line Art (eye goofed on this one; prompt strength raised to 0.99)

 

beautiful_woman__Crosshatch_S3719215096_St25_G5.jpeg.a0d14d57b35a10dba6138bf5fc1f91e8.jpeg

Crosshatch (added "closed eyes" to the negative prompt; prompt strength 0.99)

Apart from the prompts and seeds, and unless otherwise indicated under each picture, the settings were as follows:

Dimensions: 512x512, Sampler: dpmpp_2m_sde, Inference Steps: 25, Guidance Scale: 5, Model: cyberrealistic_v33, VAE: vae-ft-mse-840000-ema-pruned, Prompt Strength: 0.6, Preserve Color Profile: false, ControlNet Model: control_v11p_sd15_canny
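A rough sketch of what Prompt Strength does to the step count when an initial image is used. This assumes Easy Diffusion's Prompt Strength behaves like the img2img "strength" in Hugging Face diffusers, where only about steps × strength of the denoising steps are actually run; the function name `effective_steps` is made up and I have not checked Easy Diffusion's own code.

```python
def effective_steps(inference_steps: int, prompt_strength: float) -> int:
    """Approximate number of denoising steps run on the initial image.

    Mirrors the min(int(steps * strength), steps) rule used by the
    diffusers img2img pipelines; treated here as an assumption for
    Easy Diffusion.
    """
    return min(int(inference_steps * prompt_strength), inference_steps)


for strength in (0.6, 0.99):
    steps = effective_steps(25, strength)
    print(f"Prompt Strength {strength}: about {steps} of 25 steps")
```

Under that assumption, 0.6 keeps a fair amount of the initial image, while 0.99 redraws nearly everything and leaves the ControlNet edges as the main link to the original.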

Upscaling really messes with my pictures. I downloaded another upscaler, but I run into memory limitations on my video card.

-------------------------EDIT--------------------------

Out of curiosity,  I went to https://stable-diffusion.site/image-to-prompt/ to convert the initial image to a text prompt, which was:


"a beautiful young woman sitting on top of a bed, inspired by Allan Linder, cg society contest winner, at the park, firefly lights, pretty face!!, not cropped"

I changed it to:

"a beautiful woman sitting on top of a bed, inspired by Allan Linder, cg society contest winner, at the park, firefly lights, pretty face!!, not cropped, symmetrical eyes"

 

Using this prompt, in easy diffusion without an initial image or ControlNet, this is what it produced:

a_beautiful_woman_sitting_on_top_of_a_bed__inspired_by_Allan_Linder__c_S4110314991_St25_G5.jpeg.389474eda38a4108cbb309bb4c608cdd.jpeg

Edited by Istelathis

22 hours ago, Istelathis said:

 

image.png.6eb423134d7cfa6be585fbfefdec729b.png

Initial Image/Controlnet

 


So interesting but it also looks quite painful; the AI seems to have fused your two hands into one in at least two of those images.  🙀


1 minute ago, Leora Greenwood said:

So interesting but it also looks quite painful; the AI seems to have fused your two hands into one in at least two of those images.  🙀

One of those cases where if the AI was "smart", it would think "hey, lucky me - she's hiding her hands so I don't have to draw them in any generated pictures!" Instead of, "hey, lucky me - a chance to generate more Nightmare Fuel".


On 11/16/2023 at 4:38 PM, Bagnu said:

I did a bit of fiddling. Here's how AI interpreted Caitlin. The original is first. I didn't make any alterations to the AI generated images. The original was the reference image. I was able to crop the original a bit in the Gencraft online AI I used, so it would pick up her features better, but I can't download that from the website.

Cat and me in livingroom.png

Cat1.jpg

Cat2.jpg

Cat3.jpg

I love the 1950s vibe, but in pic #2 I am not sure what that lump is between the trapezium and radius of her left hand.

I love the pics though!


On 11/16/2023 at 10:29 PM, Orwar said:

   The day we can prompt an AI to 'mesh, rig, and texture some fancy clothes for SL', I'll be opening a store selling that sort of stuff!

Me too, but I guess by the time this is available, everyone will have figured out how to do it and opened a store. My grandpa used to say "If I become a hatter, men will be born headless". LOL


48 minutes ago, Krystina Ferraris said:

I love the 1950s vibe, but in pic #2 I am not sure what that lump is between the trapezium and radius of her left hand.

I love the pics though!

Thanks!!! AI seems to have a problem with hands. I didn't fix any of the pics, so the AI errors can be seen. 

Caitlin liked the pics too.

