> a randomly-initialized LSTM [12] with a learned linear output layer can predict time series where traditional RNNs fail