The real measure of its effectiveness is how its accuracy compares with an average human observer's guesses. I doubt a human would do much better at estimating someone's age.

Most people think I'm around 30, but I'm 42. The software said I was 40 the first time and 44 on the second try.

