Either you're going to need tons of cash to pay for tons of servers, or you're going to need to rethink how the data gets used. Is it really completely random access that you need? Or is there some way to pre-process or pre-order it so you can at least get the indexes into memory on one machine?