>But have you ever wondered if they made photos that look like you? Let's check!
>We collected the huge dataset of 428526 fake generated photos and extracted their facial parameters with https://github.com/ageitgey/face_recognition. Now you can match your image against fake faces and compare with the closest matches. Enjoy!
Maybe it is just me, but how would I go if I wanted to harvest large numbers of "real" photos of "real" persons?
Having curious people uploading them to my website could be an easy way.
Harvesting even more photos is not really necessary at this point, and in any case, scraping them off the web would be faster than creating a novelty website.
But scraping them from the web says nothing about the source, even if you manage to remove all stock photos.
This way it is IMHO more likely that it is a "real" photo, most probably uploaded by a "real" user and the site has also the IP of the sender.
Morover, most photos you can find on the web have had their EXIF information removed by the host, maybe it is not the case for a casual user.
As I see it scraping them off the web is good for quantity but not so much for quality, this (completely hypothetical) approach would give less quantity but IMHO better quality data.
I guess you could just steal them from Flickr?
Use APIs or scraping to collect profile photos from facebook, twitter, google, gravatar, etc. You'll get a lot of non-person photos, but havetheyfaked.me probably does too.
Then you will need some AI (or whatever) to remove non-human photos or non-suitable photos (position, lighting, etc.) whilst this method would almost guarantee only "portraits" or "upper torso" pictures of humans.
What I tried to say wasn't that this is the "only" method, but that it is one of the "easy" ones with a high probability of getting "reliable" data.
Also I got told I matched with this https://havetheyfaked.me/d/c4662bf7-eb98-4c7f-a59d-6c96c9c1f...
> when some random small website
It could be some random website. It could also be a 3 letters agency website training airport security face detection, a social engineering website, a random website storing images in a public s3 bucket.
> The internet has reality distorted everyone's perspective.
You give all your personal / biometric info when you get a new passport. If some rando guy were asking for your biometric data in a street would you give him ?
3 letter agencies will kindly ask/force Facebook to share their face-detection data and algorithms, no need to do this themselves.
In fact I think the biggest co-conspirator of NSA and other agencies is Apple. First they put fingerprint sensor inside Home button, so you can't avoid using it, and have to share your fingerprints whether you like it or not. Now that they captured enough fingerprints they move to FaceID, which sensors are so conveniently placed that you cant avoid having your face scanned, even if you don't ever set up FaceID for your own use.
All the pics and their vectors stay on this small server.
if you want to be deleted please drop me a letter email@example.com
This should be mentioned on the website.
> if you want to be deleted please drop me a letter firstname.lastname@example.org
There should be an easier mechanism to request a deletion of our photo. Better still, request permission from the user to store the photo in your servers before actually storing them.
I think this is the bare minimum of transparency that should required before letting people upload personal data, especially in this day and age.
I know everyone's used to the wild west but I'm glad that's changing, because of comments like yours - this transparency should NOT be something done out of the website owner's good heart (because as we've seen, most will just give us the finger), but enforced by law.
Edit: For the record, wastemaster's actually quite nice, and this is not directed at them, just websites in general.
Does the GDPR apply to non-commercial, non-business, non-organizations?
You provide this information (a picture of your face) to every convenience store, casino, bank, airport, and office building you walk into, many hundreds of times per day, for permanent storage. What is the threat model here from someone with a webserver having a single picture of your face with no other associated identifying information about it?
I can't think of any reason not to immediately delete the data, other than that you intend to use it for something else in the future.
That said, I appreciate your honesty. If you had actually nefarious intentions, you would presumably just claim that you deleted the data when you don't
Not to take a jab at you specifically, but the fact that someone that can make websites like these is ignorant enough about privacy (law) to casually drop this line marks a worrying development in the accessibility of AI tech.
For impactful technologies, we probably want the required domain knowledge to come with some structural social disciplining so that we can collectively steer that impact in the right direction (whatever that is). Clearly AI libraries have become so easy to use professional ethics are not part of the curriculum.
And - with all due respect - an alternative would be providing an open-source program to create offline this "128-dimentional vector" and upload only this latter and NOT the photo.
For example if I extract color map from your photo is this your personal data still?
I didn't trim my beard or mustache from 2009 - 2012, when I shaved it off entirely, and I didn't cut my hair from 1996 - 2017, so there's some good variation in these pictures.
Joke aside, I really enjoy both of these projects. It's fascinating that you can put a story behind every "fake" face.
1 - https://havetheyfaked.me/u/f/7b8ec517-c5a6-4e43-8bae-5be69ee...
Now photos and data are only stored for 3 minuites, then deleted (app artitecture requires server side processing of uploaded file and to compare results with your photo I have to store it - to be able to show)
Also reflected information above on the website itself.
Thank you for your interest to this project!
edit: working again. So far I haven't been faked ;)
Disclaimer: This the result of a "find my twin" search. I know nothing about the credibility of these sites.