There’s a work-in-progress checkpoint model based on Z-Image Turbo that promises better photographic-quality NSFW results than the existing ones, and at least one of my terminally-online 1girl-maker acquaintances gave it a thumbs-up, so I took it for a spin.
First impression: equal parts Teen Vogue and Barely Legal, with a dash of Girls’ Life to bring the ages down. In some cases way down, leading to quick deletion of images where prompts requesting adult female humans produced lolis. It also often produced elf ears, but that’s not something that would help defend you in court.
I’ve also been running the generated prompts through an LLM ordered to diversify the output by only adding flattering details to descriptions of faces, bodies, hair, clothing, and makeup, but LLMs do whatever is statistically likely, and will randomly remove keywords or change things they’re told not to. Using explicit numbers for ages seems to limit that sort of damage, although there was a surprisingly youthful “127-year-old” in one batch. Must have been some elf blood in there, even though it didn’t give her the ears.
I didn’t require full frontal nudity in every pic, so a few of these are outside the NSFW tag; most, however, are topless, bottomless, or both. The training in v5.0 of the model is unstable, leading to a higher rate of anatomy fails than the base ZIT model, especially for genitals, so I rejected a lot of images. v6.0 will be available in a few hours, so hopefully it’s less disaster-prone.
I threw in a bunch of random art styles, but the strong training bias towards photorealism meant that the subject was often a photo in front of an artsy background, sometimes literally casting a shadow on a painting.
(note: my Mac Mini with an M4 Pro takes about 3x as long to do text-generation as my Windows box with an RTX 4090, using the same model (gemma-3-12b-it-heretic-x-i1) and software (LM Studio); what I’ve seen of early benchmarks on the M5 MacBook Pros suggests that they’re still not great at running text or image models. All they really offer is the ability to slowly run models that don’t fit into consumer-graphics-card VRAM)
Markdown formatting and simple HTML accepted.
Sometimes you have to double-click to enter text in the form (interaction between Isso and Bootstrap?). Tab is more reliable.