With its built in thread-thread FIFO support, something like RaftLib (http://www.raftlib.io) would likely rock on this. Almost have it working on Parallella, initial results are pretty decent. Seems far better than TLB/DRAM/potential page fault for every thread-thread FIFO access.