This is really cool! My #1 feature request would be to try to preserve the floorplan and not change any of the walls, windows, or doors. That sounds like a very difficult challenge though. I've seen some new AI tools recently that can render 3d objects. It would also be awesome if I could upload a 2D floorplan to get some ideas from a top-down view.
I actually tried this myself with DALLE-2 recently, but it had very poor results. I'm atrocious at interior design and constantly make mistakes and bad choices, so I regularly look at subreddits like /r/designmyroom and /r/malelivingspace. I'm looking forward to seeing some good tools in this space. I can imagine a polished product that is actually of the room layout. Imagine if it could even scrape second-hand furniture photos from ebay and TradeMe (in New Zealand) and automatically figure out the size and generate 3D models that you could place in your room (e.g. using SweetHome3D.) That would be extremely useful.
That's a really cool idea. Maybe it's possible to use this tool to build a two-step process:
1. Remove all furniture from the room.
2. Add furniture to the empty room.
It sounds like img2img might make it possible to preserve the existing floorplan so that it doesn't change any walls or doors.
I don't have a good enough GPU to play around with this stuff (2019 Macbook Pro), but I should probably spin up an instance on AWS or find an API I can use. This new generation of AI tools is really exciting.
It's a very clever idea for an awesome, interactive app but in practice the results are extremely terrible. The output has almost nothing to do with the input, the outputs are all really just slight variations on what was clearly a very small training set, and the results are hardly even "possible" or "sane" suggestions with extremely obvious AI artifacts like impossible shapes or walls that are also chairs that are also windows that are also some weird thing no one can name.
Have a chat with IKEA - if you could integrate this with their inventory and then link to the specific items shown in generated images (for a revenue share or some other arrangement) you could have a slam dunk on your hands.
They're probably the only retailer with a big enough range and significant enough global presence for it to work well for users and scale.
I'm building a PDF generation service (DocSpring), so it's probably time to look into using AI to detect fields on uploaded PDFs so that it's easy to create PDF templates. That's been in my backlog for years. It was probably easy to build this many years ago, but maybe it's even easier now with the latest AI libraries and services.
I should really start to get familiar with AI and learn how to use things like PyTorch. I'm a bit nervous though, because it feels very different to all the programming that I've been doing for over a decade. I'm using to building CRUD web apps and mobile apps, writing SQL queries, setting up servers on AWS, etc. I'm not very good at math and I don't really understand how neural networks work or which kind of algorithm I should choose, so it feels like learning a whole set of new skills from scratch. But I can probably figure out how to use some off-the-shelf libraries and frameworks and string something together. Maybe this will be my next weekend project.
This is awesome, but I'm a bit concerned about the "latest renders" feature potentially doxxing people. Are you checking for identifiable information somehow?
Images would still contain EXIF data from your phone. So you could take a picture, then take a screenshot of that picture before uploading the screenshot.
The "latest renders" wouldn't contain EXIF from the original user because it's been run through Stable Diffusion (I assume). However, it's possible things like street signs and addresses from the original photo would still exist in the transformation, and there doesn't seem to be a warning when you submit the photo that you're submitting something that will be viewable by other people.
I am still not sure whether it was a typo in his math. He said "100,000 renders * 1/2 cent per render"[0] when someone asked him how come he was paying that much. And when someone asked a similar question to how the math worked out, there was no reply.
Still not sure how to interpret this, no matter how I look at it. 100k renders at 1/2 cent each comes out to only $500 total. Someone guessed that he could've meant $0.5 when he said "1/2 cent each", but that comes out to $50k total. Whichever math mistake I was trying to intentionally make, I failed to arrive at $1k. Could it be that his math was correct, but the total number was the one that contained the typo?
Not trying to accuse him of lying, I believe it was a genuine mistake or a typo, but as of now, I am still left not knowing whether the stated number was the one he actually meant or whether his math was wrong.
Sidenote: the actual product is great, and I absolutely dig the idea of it too.
Sorry I'm not US, with 1/2 cent I meant 1 to 2 cent. Actually it's more like 1-4 cents depending on image dimensions. A square image is 1 cent, a wide image can be 4 cents. But yes it's not cheap
Ah, makes sense, thanks for clarifying, I really appreciate it. Until this moment, I was genuinely trying every possible interpretation I could think of, but this one never crossed my mind.
This is really great. My only suggestion for improvement is to enable the web UI to allow pasting directly from clipboard.
Can I ask what's going on under the hood? My guess is you're using Stable Diffusion with the uploaded picture and a phrase like "in a modern style" for a prompt?
I actually tried this myself with DALLE-2 recently, but it had very poor results. I'm atrocious at interior design and constantly make mistakes and bad choices, so I regularly look at subreddits like /r/designmyroom and /r/malelivingspace. I'm looking forward to seeing some good tools in this space. I can imagine a polished product that is actually of the room layout. Imagine if it could even scrape second-hand furniture photos from ebay and TradeMe (in New Zealand) and automatically figure out the size and generate 3D models that you could place in your room (e.g. using SweetHome3D.) That would be extremely useful.