Hey, can you point me to this Google Group? This is a topic I've been very interested in for many years now. I actually wrote this library (https://github.com/zencephalon/Tactful_Tokenizer) for sentence tokenization so that I could get Git to work across of sentences instead of lines.