So maybe it's meta. Instead of being a compression algorithm, how about a machine learning algorithm that takes a corpus of data and generates a compression algorithm specialized to compress that type of data. You train it against music and get an algorithm suited for compressing music, you train it against spreadsheets and you get an algorithm suited for compressing spreadsheets, etc.