I wish this was written with more care. None of the symbols are defined. Worst o...

pico_creator · on May 23, 2023

If you have more specific feedback, like a specific diagram or page and how it can be made better, I will gladly forward that info to improve the paper draft.

Because channel mixing is a core component of this architecture and the keyword "channel" is all over the place, I have no idea what it is you are critiquing specifically (I could not find any mention of "channel dimension" in the paper).

sorz · on May 23, 2023

They said the paper is still a work in progress and that they will improve it.

https://twitter.com/AiEleuther/status/1660811180901019648