So is this 98% with stained images from the dataset? With my limited knowledge, I thought CNNs were data hungry. 98% on ~400 images seems fairly impressive, but I wonder how well it will perform on unseen images.
Tangentially, there seems to be little news on new open-source datasets for anything. I saw Google release a few some time back. Do research companies only care about advancing techniques so they can apply all that public research to their private datasets? Or is there genuinely no way to create datasets? For example, is it financially/technologically impossible to make an "ImageNet" of cells? (Or maybe a lot of datasets are coming out and I am just unaware of them.)
It is very expensive and time-consuming to create a large, properly labeled cell-image dataset. In general you need >2 pathologists to confirm the cell types (they sometimes disagree, and you usually take the majority vote); this almost never happens with cat images ;) There are also many acquisition modalities for image capture in microscopy, different stains for the same cell types, etc. Simple RGB cameras are actually considered fairly low-tech for this kind of work.
ps. I am no deep learning expert (I use more 'traditional' ML), but as you pointed out, ~400 images with these techniques can be a recipe for overfitting disaster.
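A standard first mitigation for such a small dataset is label-preserving augmentation. Here's a minimal NumPy sketch (the array shapes and the 400-image count are just illustrative placeholders): flips and 90-degree rotations expand each image into 8 variants without changing what cell type it shows.

```python
import numpy as np

def augment(images):
    """Expand a small image set with label-preserving transforms:
    four 90-degree rotations, each also mirrored (8x total)."""
    out = []
    for img in images:
        for k in range(4):                 # 0, 90, 180, 270 degrees
            rot = np.rot90(img, k)
            out.append(rot)
            out.append(np.fliplr(rot))     # mirrored copy of each rotation
    return np.stack(out)

# Stand-in for ~400 cell images of 32x32 RGB (random data for the sketch)
rng = np.random.default_rng(0)
cells = rng.random((400, 32, 32, 3))
aug = augment(cells)                        # 400 * 8 = 3200 images
```

This only helps so much, though: the augmented images are highly correlated with the originals, so it reduces overfitting rather than substituting for more labeled data.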
Would something like training on images of rat cells and then transfer learning to human cells be worthwhile? The author of the article tried it with ImageNet and it didn't work out, but I wonder about the viability of that technique with non-human cells.
Well, the same principle as with VGGNet applies here too. If the rat images differ 'significantly' (whatever that means; I am a research engineer, not a pathologist), then you will have nothing to transfer. Maybe it would be more fruitful to transfer via a huge amount of artificially generated cell images (there are toolboxes for that, and it's not just linear transformations like rotation etc.) blended with some subset of VGGNet (or similar) trained only on 'circular' objects.
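The transfer idea above boils down to: freeze a pretrained feature extractor and train only a small classifier head on the ~400 real images. Here's a self-contained NumPy sketch of that split; `W_frozen` is a hypothetical stand-in for a pretrained trunk (real transfer would use e.g. VGG weights or a net trained on synthetic cells), and the labels are random placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for a pretrained trunk: a fixed random projection.
# In real transfer learning these weights would come from a network
# pretrained on a large source domain (synthetic cells, VGGNet, etc.).
W_frozen = rng.standard_normal((32 * 32 * 3, 64))

def frozen_features(x):
    # Frozen trunk: W_frozen is never updated during training.
    f = np.maximum(x.reshape(len(x), -1) @ W_frozen, 0.0)  # ReLU features
    return (f - f.mean(0)) / (f.std(0) + 1e-8)             # standardize

def train_head(X, y, lr=0.1, epochs=200):
    # Only this small logistic-regression head sees gradient updates,
    # so ~400 labeled images have far fewer parameters to overfit.
    F = frozen_features(X)
    w = np.zeros(F.shape[1])
    for _ in range(epochs):
        z = np.clip(F @ w, -30.0, 30.0)     # avoid exp overflow
        p = 1.0 / (1.0 + np.exp(-z))        # sigmoid predictions
        w -= lr * F.T @ (p - y) / len(y)    # gradient step on head only
    return w

X = rng.random((400, 32, 32, 3))                 # stand-in cell images
y = rng.integers(0, 2, size=400).astype(float)   # placeholder labels
w = train_head(X, y)
```

The catch the comment points at: if the source domain (rat cells, or 'circular' ImageNet objects) produces features that don't discriminate the target classes, the frozen trunk transfers nothing useful, and no amount of head training fixes that.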