Try it with internet photos of people. What it does is just morph the face a little: larger eyes, shorter forehead, elephant ears, narrow and taller mouth. The only thing it tries to generate is hair, and it fails horribly at it if you try with completely bald heads.
I for one welcome the singularity and all it entails.
That surprised me, I didn't expect an image hosting website to require JS.
The celebrity examples might be a different case in terms of our recognition sentiment, because of their familiarity, and the juxtaposition of a number of phenotypically diverse celebrities in one set, which gives a correspondence in the diversity of the input set and the output set.
What modifications would be required to the algorithm to score consistently higher in facial recognition of the avatar by people who view the avatar cold without access to the original photo?
I'm asking rhetorically, of course. Such research might take a long time. Toonify looks like a good foundation for what might be an open ended project.
Specifically, there's this snippet:
"These StyleGAN face models can produce a huge diversity of faces and it’s actually possible to find basically any face inside the model. It’s actually a straight forward process to search for any image of a face in the model. So given an example image you want to find you can find a “code” (aka latent vector) which, when given as an input to the model, will produce an output which looks almost exactly like the face you’re looking for."
So it seems like they first generate an image that is similar to yours with one model, and then translate that too its more cartoony counterpart.
1. Clicks on the browse button to find a good selfie.
2. Uploads the pic to Toonify.
3. Clicks on 'Toonify' button:
> Failed: Didn't find any faces in the supplied image.
Ack what? Ok. The machine is still learning.
(P.S: I like the project though. Hope it will improve over time. But toonification of any pic without deep learning/machine learning thrown into the mix is already a solved problem, no?)
The logic is basically: ask dlib for face landmarks; if no landmarks, resize image to 1024x1024 and proceed; otherwise align face.
If you happen to like lisp: https://github.com/shawwn/stylegan-server/blob/10c396fa2ef07...
Also, congratulations on launching this! Super exciting.
Readers, if you use twitter, be sure to follow the creators:
Been following them for like twenty years now. They tweet a bunch of awesome stuff about ML, and stylegan in particular.
Rotate your phone 90° clockwise then try again.
By default the portrait image will be rotated, causing the face detection to fail. I know this because I‘ve run in to the same problem before, had to correct it server side :)
Answer: Please check that your genetics is compatible with ML?
Sure, then don't hide an otherwise perfectly normal computer program under a random web interface.
Y'know, the same reason that the "perfectly normal computer programs" that generate the data for scientific journal papers aren't usually published.
And yeah, many journal papers suck specifically for the reason that their results are not reproducible.
I know that for a fact since I wrote paint.net plugin that does exactly that. Said plugin works perfectly fine with images of the size they use (512x512) and generates output (CPU-only, mind you) in less than 5 seconds on a dual-core laptop...
Currently I believe that is what is causing your facial detection step to fail.
If you want to try this on iOS, rotate your phone clockwise 90 degrees then take a photo.
OTOH I could immediately guess if you've shown me one of these photos: https://duckduckgo.com/?q=leonardo+di+caprio+toon&iax=images...
I look like a drunk Simpson... Barney.
What gets shipped to production: [Toonified Idris Elba]
1) Whether the generated character is readily identifiable by an observer on its own.
2) Whether the generated character is a reasonable manifestation of the chosen art style.
I think it does a reasonable job of #2 and a mediocre job on average of #1. I think greater eyebrow variability would help some so that it doesn't look like all the women get "woman's pencil eyebrow" and all the men get "man's extremely bushy eyebrow".
Didn't work, it can't find the face.
Otherwise it worked ok on the fascist leaders I tried. Black and white got color added. And one photo had an artefact put on top of both the original and the Toonify, a mirror of the top of the image to fill space perhaps?
Also Startup idea, facial recognition for pets and sell the data to the Russians for addition tracking ability on people through proxies.
I look forward to seeing if she recognises herself and what she thinks. I don't believe any adult likes a cartoon version of themselves, rarely are critiques truly flattering, a true cartoon hurts!
I tried cartoonifying my niece's favourite cartoon characters. The program failed. So you can't toonify a toon.
I imagine this is all in good fun, but this disclaimer doesn't rule out data being derived from your face and stored or sold later.
Because right now you say you're not doing anything with the images, that doesn't rule out collecting face data and storing that instead.
Oddly enough, I bet if the output was designed to always generate more attractive features (ala insta filters), this will take off.
Maybe I don’t high cheekbones or a chiseled jaw, but lie to me dammit!
Just a tip.
Or rather, the answer is "no" right now, but I will bet you $500 that it'll be "yes" within ... hm ... 2 years max. And if I lower my bet to $100, I'll do 1 year max.
Let's just say it's a very lucrative, very active area of research.
One observation: facial detection gives you landmarks, and those landmarks are essentially UVs for a texture. So if you do that process repeatedly, and rotate the face slightly using latent directions, then you can back-project the result onto a 2D texture.
EDIT: Some leads: https://twitter.com/Zergfriend/status/1283954193946906625
A video from July: https://www.youtube.com/watch?v=mvk3dtdeXBE&ab_channel=Compu...
And I saw someone use Unity to learn normal+bump+diffuse+ambient occlusion maps using ML, but sadly haven't been able to find the link yet.
This search gives lots of useful results: https://twitter.com/search?q=stylegan%20texture&src=typed_qu...
If I was a better coder I'd being trying to get an Epic Grant for this. I can't imagine getting a full blown model would take more than a few hundred thousand