Splitting things is the relatively easy part -- it's building the consistency model for multi-node systems that's tricky.
For read-write partitioning it's pretty simple -- each item is largely independent; it has its columns of data and is handled by an index, so once you're just reading / writing / updating items, it's no problem to do the hashing from key to index then using that index to locate the appropriate node.
The devil is of course in the details. If we see that barrier approaching we'll plan ahead for scaling out this way.
However, just doing some quick calculations, looks like the English wikipedia gets 5.4 billion page views per month, which translates to about 2100 per second. On my MacBook I get an average query time on our profile dataset for wikipedia's graph of 2.5 ms per query -- meaning 400 requests per second, extrapolating from there, scaling that up to 6x that on a hefty server doesn't seem unreasonable, and that ignores the fact that we could go further in caching the results (since most of them would be duplicate requests) to push that number up even higher.
So, yeah, it's an issue that's in the back of our heads, but not one we're currently dreading.
Is the dataset changing (being written to) while you make those queries?
Still an untested assumption, but the system is architectured to hold up well in those situations and I think that it will reasonably scale there.