Helpful Resources

I’ll add more here as I remember them. Feel free to add more in the comments.

Models

Models are basically the brains of Stable Diffusion. They are the data SD uses to learn what your prompts mean.
The built-in models that come with Stable Diffusion are really bad for porn. Don’t use them. In fact don’t use them at all unless you’re training your own models, there are better SFW models.

Here are some of my personal favourites:

Anime

  • MeinaHentai is a great model to start with. Compared to other models it’s really easy to prompt
  • AOM3 also does really well, though it might be a little more difficult to guide

For all of those, I recommend installing https://github.com/DominikDoom/a1111-sd-webui-tagcomplete, as they heavily rely on danbooru tags.

  • Berry Mix (Pre-mixed version here) can also work pretty well, depending on what you want to do. AFAIK it uses rule34 tags instead of danbooru, so it probably won’t work all too well with prompts used for the above ones

Realistic

  • Uber Realistic Porn Merge is the only realistic model I know of that does hardcore stuff. It’s unfortunate problem is that it’s REALLY DAMN HARD TO USE

VAEs

VAEs are mostly used for finetuning colors, sharpness, what have you. Some models come with a VAE builtin, but for ones that don’t, it’s recommended to have one on hand.

  • “Anything VAE”, “Orangemix VAE”, and “NAI Leak VAE” are the same exact thing under different names. If you already have one on hand, don’t bother with the others. Most VAEs are renamed versions or modifications of this one.
  • Waifu Diffusion’s kl-f8-anime2 is also a pretty good one. It doesn’t require Waifu Diffusion.
  • The one that comes with Stable Diffusion is the only one that seems to work for realistic stuff.

LoRAs

LoRAs teach models about concepts (characters, clothing, environments, style, …) they might not know about. There are a LOT of them, so feel free to browse Civitai to find ones you might want.

LoRAs tend to be specific for families of models, or at the very least styles (using anime LoRAs on realistic models tend to be a bad idea), but there are a fair few that will work across the board.

Locon and LyCORIS are newer formats of LoRAs. Not sure on the technical differences between them, but they will not work out of the box and need an extension such as https://github.com/KohakuBlueleaf/a1111-sd-webui-lycoris to get working

Textual Inversions / Embeddings and Hypernetworks

These are mostly obsoleted by LoRAs. There are a few embeddings such as Deep Negative and EasyNegative that are still quite useful, but in most cases you’ll want to use LoRAs instead.

  • RestfulD34m0N@lemmynsfw.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Hmm ill probably have mroe questions but for now im curious:

    • how much space was the download(s)?
    • how confusing is the software to use?
    • what kind of limitations does the software have? can i do multiple people? monsters? futa? etc.

    Thanks for your help! :)

    • echoct@lemmynsfw.com
      link
      fedilink
      English
      arrow-up
      3
      ·
      1 year ago

      Sure,

      1: The initial download was pretty small ~10GB. But with the models, lora(s), extensions, ect. I’m up to ~60GB.

      2: The guide at the top was pretty easy to follow. Install the dependencies, then install the UI. Launch and run. There is a bit of a learning curve with all of the options but so far it hasn’t been too confusing.

      3: That’s where the extra models/lora(s) come in. Various models are trained in different styles, poses, actions, ect. The lora files are smaller things, like poses. IE: Cowgirl is it’s own lora file that tells the model how to use the prompts you give.

    • Triple Underscore@lemmynsfw.comOPM
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      1 year ago

      how much space was the download(s)?

      On my end, it’s sitting at ~64GB (with btrfs compression shenanigans), though 60 of those are from all the models I have installed. The download would probably be ~2GB, even less if you disable downloading the “default” models with --no-download-sd-model and instead pick models off of Civit or wherever manually.

      Edit: Should have mentioned. Most full models are between 2-4 GBs each. Some can be 5+ but they tend to be “full” versions intended for merging & such. LoRAs are generally smaller. Depending on how much they’re pruned they’ll be anywhere between 10-100 MBs each.

      how confusing is the software to use?

      There’s definitely a learning curve, yes. But there’s plenty of resources (and more importantly, examples) out there.

      what kind of limitations does the software have? can i do multiple people? monsters? futa? etc.

      As long as you have the correct models set up it can generate basically anything. At least with anime models, monsters and futa are a given. Your main issue will probably be multiple people, although there are solutions to that. (See the multidiffusion upscaler GitHub repo on the main post)