Silhouettes don't carry enough information for a neural network to really learn the structural relations. You can do "generative zoology" in full color and get very convincing results. Here's a GAN trained on beetle illustrations: https://www.cunicode.com/works/confusing-coleopterists