You can realistically skip 1 and 6, control it with your smartphone instead. Or, you can simplify the problem by using more constrained verbal commands. I was told that speech recognition works very well when you have a simple grammar instead of an open-ended language.