Hacker News new | past | comments | ask | show | jobs | submit login
Show HN: I made a site that easily lets you generate AI images using templates (stavros.io)
86 points by stavros on Sept 21, 2022 | hide | past | favorite | 48 comments



This is really neat, and the templates idea is really great for packaging it up to help people get better results without needing to know how to engineer prompts. Behind the scenes is it just a Mad Libs style fill-in-the-blank or is there more going on?

Some questions/feedback that come to mind:

- How do I know how many credits my generation is going to cost? I ran one D&D template and the cost was 2.4 credits. Also a question relevant on the normal "Generate" tab where I have options to adjust - presumably higher quality costs more credits since it takes more GPU time to run more iterations?

- When generating from a template, there's no option to make the image private until it's been generated. If I were paying for a tool like this I'd expect a way to default to private. Perhaps making images private and defaulting to private can be features for paying users, so that the "public" images for free users are also free advertising for your frontpage?

- Rights issues around the images are unclear. Are these 100% mine to do with as I please, just as if I had generated them running SD myself? Clarification around this is probably important for anybody wanting to use a tool like this in commercial creative endeavors.

- Maybe there should be a username filter? I was able to register as "admin" but if you need that changed I'll happily take "root" or "system" instead ;)


Thanks for the interest!

1. That's a bit of an omission right now, you're right. I need to add a bit of JS to the generation page, thanks for the feedback.

2. You can click the "Private" button while it's generating, and it will be hidden. Though, you're right, anyone who sees it before will be able to see it after too. I'll add a checkbox to the form right now.

3. You're right, I need to add a license page. It's the same license as SD, so IIRC it's a perpetual license to use them as you see fit.

4. Yeah I should probably have a username blacklist, you're right :P

Thanks, this is very useful!


EDIT: You beat me to the punch explaining "Private" on the generation page. Thanks! Original comment follows

-----

> anyone who sees it before will be able to see it after too

I can see where opinions might differ, but to me "Private" means nobody could see it even with a link, while "Unlisted" means I could send the link to anybody but it won't appear in my profile/the front page.

Maybe the middle ground if you don't want to bikeshed on naming things (relatable) would be including a small explanation alongside the checkbox during image generation, and also in the FAQ?


I've added the "Private" button and licensing info in the FAQ, thanks!


How does someone change a template? Or is it just one fixed template per category?

I didn’t create an account because it seemed to be saying I need to put in money first before I could try anything. Other users might have similar interpretations.

If you haven’t seen it, take a look at artbreeder which by the way has a free level.


This has a free tier as well (8 credits), you're right, I should clarify!

Right now you can't change templates, you can only add info to generate the image you want, but in the future I want to add the ability for everyone to generate templates.

EDIT: I've added the free tier to the pricing page and a bit of an explanation to the templates page, thanks for the feedback!


Nice. That ArtBreeder website looks like another great site I can use for inspiration for Blender 3D project ideas… Thanks for that information. I'd never heard of it before today.


Great simple tool for dabbling in the AI generated content! Going to pass it to my writer friend. That's the target demography for you - non geeky content creators.

One bug though: After I wrote my prompt I discovered that there are some 'Advance options', clicked on the button, more advanced options unraveled, but I lost my prompt.


Yes, unfortunately that button will do that. I'll fix it, thank you!


I agree, by the way, you're spot on: Non-geeky content creators will have a lot of use for this, I hope.


Great idea dude, but I really think it's too simple to make a whole service out of, it should really be a feature on one of the OS UIs.


This space always needs people who can streamline the task for professionals.

In another comment you point out that this service expands into a much larger prompt and then here you wonder why someone would pay for that. Meanwhile I think it's obvious money on the table when I have to write a paragraph when I really just want permutations of ork portraits for the next Shadowrun game or whatever.

Having to become an experienced prompt engineer to use these tools productively is serious friction.

I'd like to see tools/services made for professionals and I think this is the direction it would need to go.


Agreed, this is the direction I want to take it. Basically, I want to make a service for "I don't know anything about AI and I want a pleasing image for my X".


Can you clarify how's this different from visiting https://huggingface.co/spaces/stabilityai/stable-diffusion and entering the corresponding prompt (like "fantasy landscape")?

Or even better running stable diffusion locally?


Incredibly simple.

Huggingface can't effectively build a consumer product. They're building for B2B to be a github for models.

Huggingface explicitly rejected building B2C tools.

That huggingface UI will forever remain unfriendly and unable to scale.


I can't edit after three hours, but I figured I'd add a new thought.

B2C / creator tools are going to start disrupting businesses and B2B. I'm actually extremely bullish on B2C/tools now more than ever.


I'm guessing it applies human cognition to the task of understanding what people want when they say "fantasy landscape" using human heuristics and pre-alters your prompt to get you a closer result to your imagination than that simple keyword.


Phantasmagoria is a consumer product, geared towards people who don't want to know anything about what a prompt or a sampler is. It's the difference between Dropbox and using SVN or CVS on a curlftpfs-mounted FTP account.


This one too, without signing up being required: https://replicate.com/tommoore515/material_stable_diffusion


I like the concept and approach, but is there a way to quickly see how it works without signing up?


Unfortunately no, these images are too costly to generate :/ You can see what others have generated on the front page, that's the closest.


We need more tools like this. AI image generation is such a powerful technology but in order to utilise most image generators to their potential, you need to be fluent in 'AI prompts'.

I like this approach, it's far more intuitive.


Thanks, I hope people will like it!


Well done!

I love generating generic cartoons from input. Very useful when creating icons / avatars!


Thanks! I really love some of the prompts, I generated a ton of monoliths:

https://phantasmagoria.stavros.io/images/i3VCADPjZ9RY6K7q/

Some of these images I want to hang on my wall:

https://phantasmagoria.stavros.io/images/6cJAbYyeDXfA6oHZ/


Is there an AI equivalent of image macros yet? Like use the familiar joke setup image but insert "Microsoft" or other random thing you are making a joke about into it in an amusingly crappy way?


Yes, the templates this post links to :P

However, it's AI, so you don't always get what you expect.


Possibly I'm just missing it in the UI but I'm thinking something like the two Spiderman pointing at each other.

Can you generate something that is recognisably that, but inserts a random person, say batman or Jesus into that scene in place of spiderman so that people will recognize the reference?


Ah, you mean the actual memes? Hmm, I'm not sure actually, that's an interesting idea.


Yes, by adding inpainting and/or face swap model after the image generation model.


Neat. You might want to either filter words or blur NSFW pictures that were generated :P


Yeah, it's a bit of a free-for-all now


Congrats, I'm going to sign up and try. Where are you renting the GPUs from?


Thank you! I'm renting from banana.dev, which poses an interesting technical challenge:

Image generation takes 5 sec, but loading the model takes 120 sec. There's some time you can keep the model "alive" for, to wait for another image generation request, but if I have too few users, it means everyone gets image generation times of 120 sec and I pay for 120 sec per image, which is a lot.

With enough users, though, I can always keep the model up, and everyone gets 5 sec generation times and the economics for this make sense. Hopefully if this picks up steam I'll be able to rent a dedicated server to generate things on, but it's just a sideproject for now.

At least I hope it will be useful/fun to some people.


How'd you compare banana.dev to inferrd and paperspace which seem more affordable at a glance?


I don't really have much experience with the ML/Torch side of things, so Banana was simpler to get started with. I'd like to switch to something cheaper, but I couldn't get Paperspace set up.

Also, Inferrd doesn't really look cheaper, a 16GB GPU is $600/mo, which is about the same as Banana, except Banana doesn't bill for the time the model isn't working.

For my needs, with the few images I need to generate, the difference is between paying $50/mo (admittedly, with worse UX for my customers) and paying $600/mo for a card that's mostly sitting idle.


Would it make sense for you to query whether the model is up and idling and if so show a demo widget on landing, both making it possible for potential customers to try (if they are lucky to have landed at the right time) and keeping the model alive?


Yes but it's bad UX if people can't see their prompts right away, basically :/ So they're just likely to bounce.

A few days ago I was generating images using my home desktop, and had a Web push notification notifying people when it was on. That was much cheaper, but very bad UX, as it could be hours before you got your image (and you were long gone by then).


With a marketing hat on I can see a flow where you (still only if the model is already up and not busy) ask for an email in the "try" prompt, both using it to send the results subsequently and offering an option to subscribe to a newsletter (thus generating new leads).

However, I can see why you might choose not to bother with that and find another way to keep the model alive without the try prompt.

Edit: didn't you mention it takes 5 sec if model is alive?


Ah yeah, that's a good idea for when the model is very async, thank you! I'll keep that in mind.


Thanks for the explanation! If I may offer a suggestion, the GTA 5 Poster prompt that was on /r/stablediffusion recently was really good.


Would you happen to have a link? I'd love to add it.

EDIT: Added, good shout, thanks!


Great, just tried!


Uh oh this seems way too expensive ? It seems to cost like $22 per day if the GPU runs all the time!


Yeah, GPUs are quite expensive, unfortunately.


This is fantastic, thank you!


Thank you!


Hell yeah dude




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: