This feels like a hardware play to me. Start off getting developers to use your inference library, determine the common cases and killer apps, then build an ASIC that does it all using far less power.
Perhaps the front facing camera might get a bit more interesting in the future for more than just selfies.
It's worth noting that BNNS (the CPU library Apple released) defines just 3 layer types: convolution, pooling, and fully connected. It's clear they are for now targeting just convolutional networks (the kind you'd use for computer vision), and even then just the simplest possible kind (for example AlexNet, now ~4 years old).
There needs to be much more functionality to build a state-of-the-art vision network such as Google's Inception v3, at least if you want to avoid writing your own layer definitions. Hopefully they'll expand this later. In the keynote they did mention using LSTMs for predictive text, so hopefully more advanced things are in the pipeline and just haven't been API-ified yet.
For reference, the deep learning library I'm working on will contain 17 layer types in its first version, with later versions probably asymptoting to double that. There are a LOT of things out there that deserve their own layer types.
EDIT: to be fair to Apple, since convolution, pooling, and fully connected layers are among the most costly layers, they are the most valuable to have efficient platform implementations for.
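To give a sense of what those three primitives buy you, here's a rough NumPy sketch (not Apple's API; every shape, weight, and name is made up for illustration) of what a convolution, a max-pool, and a fully connected layer compute at inference time:

    import numpy as np

    def conv2d(x, w, b, stride=1, pad=0):
        # x: (C_in, H, W), w: (C_out, C_in, kH, kW), b: (C_out,)
        c_in, h, wd = x.shape
        c_out, _, kh, kw = w.shape
        xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
        oh = (h + 2 * pad - kh) // stride + 1
        ow = (wd + 2 * pad - kw) // stride + 1
        out = np.empty((c_out, oh, ow))
        for i in range(oh):
            for j in range(ow):
                patch = xp[:, i*stride:i*stride+kh, j*stride:j*stride+kw]
                out[:, i, j] = np.tensordot(w, patch, axes=3) + b
        return np.maximum(out, 0)  # ReLU folded into the layer

    def max_pool(x, k=2):
        c, h, w = x.shape
        return x[:, :h - h % k, :w - w % k].reshape(c, h // k, k, w // k, k).max(axis=(2, 4))

    def fully_connected(x, w, b):
        return w @ x.reshape(-1) + b

    # Tiny made-up network (conv -> pool -> fc) with random weights, just to show the shapes.
    rng = np.random.default_rng(0)
    img = rng.standard_normal((3, 32, 32))          # 3-channel 32x32 input
    conv_w, conv_b = rng.standard_normal((8, 3, 3, 3)), np.zeros(8)
    fc_w, fc_b = rng.standard_normal((10, 8 * 15 * 15)), np.zeros(10)
    scores = fully_connected(max_pool(conv2d(img, conv_w, conv_b)), fc_w, fc_b)
    print(scores.shape)  # (10,)

A real BNNS filter involves descriptors, data types, and activation settings rather than raw arrays, but the arithmetic above is the part that dominates runtime cost.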
I really hope that BNNS is pronounced "bananas." More substantively, I'm not totally convinced neural nets have a well-defined set of required subroutines yet. Because of backprop, it's not just about what the blocks are, but also about how you combine them. Adding recursion changes the problem, or a paper like the one on stochastic computation graphs comes along and does the same. I like the idea, but I expect it's still too early for something analogous to BLAS to stick.
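To illustrate the "how you combine them" point with a toy example (plain NumPy, not any particular library's API): the forward kernels are BLAS-ish one-liners, but the backward pass has to mirror the exact composition of the blocks, which a fixed menu of forward subroutines doesn't capture on its own:

    import numpy as np

    # Forward kernels alone (the BLAS-ish part)...
    def linear(x, w):            # a plain matrix multiply
        return x @ w

    def relu(x):
        return np.maximum(x, 0)

    # ...but training also needs the reverse pass through the composition,
    # i.e. the gradients depend on how the blocks were wired together.
    def forward_backward(x, w1, w2, target):
        h = relu(linear(x, w1))          # composition: linear -> relu -> linear
        y = linear(h, w2)
        loss = 0.5 * np.sum((y - target) ** 2)

        dy = y - target                  # dL/dy
        dw2 = h.T @ dy                   # gradient for the second linear block
        dh = dy @ w2.T                   # pushed back through the second linear
        dh[h <= 0] = 0                   # pushed back through the relu
        dw1 = x.T @ dh                   # gradient for the first linear block
        return loss, dw1, dw2

    rng = np.random.default_rng(0)
    x, target = rng.standard_normal((4, 3)), rng.standard_normal((4, 2))
    w1, w2 = rng.standard_normal((3, 5)), rng.standard_normal((5, 2))
    print(forward_backward(x, w1, w2, target)[0])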
I think it's a great decision for Apple to make sure developers use this functionality. Apple is still very much a hardware company. By pushing the hardware to its limits, Apple makes sure you'll want to upgrade to the latest and greatest. Also, by doing the heavy lifting locally instead of in the cloud, Apple saves money on servers and infrastructure. And to top it off, they can make some privacy claims.
Thanks, Apple, but no thanks. I'd rather stick with the completely open-source Google TensorFlow than lock myself into any more of Apple's proprietary shit.
You can stick to TensorFlow for training your networks, but if you want to deploy a trained network to iOS or macOS devices (and your network is expressible in terms of Apple's primitives), you'd be doing your users a disservice not to use the fastest and most energy efficient backend to do the actual inference.
I'll add that at this point, there isn't much 'lock-in' between different frameworks. Once you've trained, and if your primitives are available in the target framework, porting is just a matter of getting your weights and topology into the right format. Not too hard compared to the nitty-gritty of gathering data, designing a network, and doing training and hyperparameter optimization.
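Concretely, "getting your weights and topology into the right format" can be as mundane as the sketch below (the layer names and the trained_layers dict are hypothetical stand-ins for whatever your training framework exposes); the real work is matching layer ordering and weight layouts, e.g. NHWC vs. NCHW, on the other side:

    import numpy as np

    # Export side (whatever framework you trained in): collect the learned
    # tensors into a plain dict keyed by layer name, then write one .npz file.
    def export_weights(trained_layers, path="model_weights.npz"):
        arrays = {}
        for name, (weights, bias) in trained_layers.items():
            arrays[name + "_W"] = np.asarray(weights)
            arrays[name + "_b"] = np.asarray(bias)
        np.savez(path, **arrays)

    # Import side (the target runtime): read the arrays back and hand them to
    # whatever layer constructors that framework exposes, in the same topology.
    def load_weights(path="model_weights.npz"):
        with np.load(path) as data:
            return {key: data[key] for key in data.files}

    # Example round trip with fake weights.
    fake_model = {"conv1": (np.random.rand(8, 3, 3, 3), np.zeros(8)),
                  "fc1":   (np.random.rand(10, 1800),   np.zeros(10))}
    export_weights(fake_model)
    restored = load_weights()
    print(sorted(restored))  # ['conv1_W', 'conv1_b', 'fc1_W', 'fc1_b']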
For what it's worth, we're hoping to integrate these APIs into our iOS version of the TensorFlow runtime, so you can maintain graph portability but still get the benefits of the optimized implementation on the platform.
That's great to hear. Can't wait for TensorFlow to fully support Windows; that's the main thing stopping us from using TF instead of MXNet as a backend. Any news on that?
Windows support is definitely being worked on and lots of progress has been made, so it will eventually arrive -- just lots of little details to work out, but we're optimistic it'll come soon.
Thank you for your thoughtful comment, you've convinced me to take a moment and think more carefully about this.
The energy efficiency is an excellent point.
Ultimately I am still extremely leery of the Apple lock-in factor in general and their arbitrary rulings on what is and is not okay within the garden.
I am still pretty upset and maybe even somewhat traumatized about all the previous times they've fucked me over in scenarios like this. It starts out great and then gets ruined.
Edit: It looks like I've hit the HN rate limiter, so I'm merging my reply:
Unfortunately I can't go into detail about these scenarios because I don't want to get into trouble with my employer. Suffice it to say that I no longer place trust in Apple keeping anything of value "open".
> I am still pretty upset and maybe even somewhat traumatized about all the previous times they've fucked me over in scenarios like this. It starts out great and then gets ruined.
I haven't done much development using Apple frameworks. I'm curious, where has this happened to you before?
The threading model is poorly designed and very hard to set up. Keeping legacy model data and writing transformers for said data is painful (a person upgrades the app, the schema has changed). There are lots of crashes we've seen at scale. It's also often really poorly performant, and the I/O is completely synchronous. Most apps do not need this wrapper around SQLite (or SQLite at all) and in fact should just use simple file writes: easier to debug, maintain, and scale, with fewer bugs.
Didn't see the original comment, but why wouldn't Apple do this? It could be as simple as allowing more complex visual effects (blurring, stereoscopic views), or more features in general, causing low-RAM devices to suffer.
It doesn't necessarily have to be evil or crazy that they do this. In fact it would be strange if they worried excessively about preserving the performance of all legacy devices.
I think there's malicious, and then there's new expectations. On the desktop side, Apple's recent legacy retention is great: you have iMacs from 2007, I think, running El Capitan (albeit a mildly neutered version of it) and treated like any other update. I've worked with a lot of now-legacy machines that spec-wise are fit for purpose (write a paper, read some stuff on Facebook), but due to software restrictions (individual browsers dropping support for OS X 10.7 and lower) they formerly couldn't even do that. Now these older machines have a new life.
iOS is slightly different: each time Apple works to retain legacy devices and pushes out a major iteration of iOS, performance on older devices does suffer a little. But even with the most recent iOS release they started focusing on slicing down unnecessary parts of apps to save space.
Apple has really been doing decent work on the preservation leg of their lineup.
I'm not disputing that newer OS's run slower on older devices.
I'm disputing that this is done intentionally to degrade performance and make it more attractive to upgrade - which is what the original comment stated.
They actually disable most of the new effects on older hardware which indicates that they want the software to perform acceptably.
And more to the point, supporting older hardware at all extends the useful life of that hardware by allowing it to run modern applications and have access to new features.
I don't think Apple is intentionally trying to create lock-in, but I do think there is a valid fragmentation concern in the field of deep learning right now. For example, look at CUDA vs OpenCL: CUDA has clearly become the winner there. Anyone building a system for deep learning would be crazy not to buy NVIDIA hardware. And while some projects support both CUDA and OpenCL (e.g. OpenCV), you can usually count on the CUDA implementation being more tested and performant. Metal is going to just throw one more wrench into the mix :)
Why does it being open source matter that much? Are you building a business or making a political statement? Aren't you better off using what is provided on the system and hardware so that your app gets better?