Inuktitut is very cool. I'm not a speaker so I'm relying heavily on research, but I've always loved making tools for syllabics and working with text generally.
Then congrats all the more on this work, I had assumed a motivation from attachment also. As for the tech stacks and langs yeah they're fun, but I meant in how abstract Inuktitut looks, I guess the human language in the loop is the real meat of the problem and of the "working with".
If you're curious, I don't have any direct connection with the Inuit but I got into syllabics because I have some Ojibwe speakers in my family, and Ojibwe also uses syllabics. None of the current speakers I know can write in Ojibwe (in any writing system) due to their residential schooling, but we have letters from the generations that have passed on.
I made a few syllabics converters for Ojibwe when I was younger, but Ojibwe and Cree are harder to build converters from than Inuktitut because they have a lenis-fortis distinction in the latin script but not syllabics, so the word "anishinaabe" can be converted unambiguously to ᐊᓂᔑᓈᐯᒃ, but trying to convert the other way could render any one of anishinaabe, ani*zh*inaabe, anishinaa*pe*, or ani*zh*inaa*pe*. I don't think it could be done accurately without some sort of machine learning. There's also a lot more complexities surrounding how different dialects are encoded. In the end, Inuktitut was simpler to work with and has a lot more everyday speakers and text to work with, especially because the Canadian territory of Nunavut publishes its government documents and websites in Inuktitut too.
There's a lot to cover but I think the technical challenges might be interesting to the Hacker News crowd, I may make a series of videos about it someday!
Yes it is right up my hacking alley. I'm working too with two-way asymmetric conversions between languages/charsets as noted in this thread. It's interesting to have such a niche tech challenge associated with your ethnic group of languages. In Manitoba are any of those spoken?
Ojibwe and Cree are both spoken in Manitoba, but it's too far south for Inuktitut.
I read through your other comment on this post and read about the work you're doing. I didn't quite understand –I think it's hard to explain a lot of these linguistics tools briefly without uploading a lot of context– but it seemed up my alley as well.
> It's interesting to have such a niche tech challenge associated with your ethnic group of languages.
That's true, I think it can be a lot of fun and rewarding for technologists. Kevin King is a font designer who was tasked with making a new font for syllabics, then found that some syllabics had never been adopted into Unicode and set about getting them approved[1]. He worked with the communities and the Unicode committee and got them into Unicode, and that must have been really rewarding.
There's difficulty in approaching from the outside, though. I'm trying to be careful to provide value and not overstep. The Inuit especially can be defensive, because you see a lot of people online who sort of fetishize the culture, or people (certainly linguists) whose work can seem extractive to the tight-knit Northern communities.
I've always wanted to build tools working with text that people use on a day-to-day, but I also worry that this will be just another project I spend months on that will not see any use. I'm trying to make connections so that I can build value without seeming extractive. It's hard to know where to start, however.
I wouldn't consider your months spent on it in any way wasted, even if you have native critics, it's a kind of ad-hoc anthropological resource that you add to the pool of tools available in the web, and on the long run I think it will have a good impact.
Thanks for your review also and yes it's still early days, tons to do but of the fun type. I'll look into Manitoba as I have relatives which may be related to the cultures you mentioned, thank you!
When you say syllabics it reminds me of syllabaries (?) of mesopotamian languages and hieroglyphs, but I guess it's just the terms sounding alike.