Hacker News
on Gopher (unofficial)
COMMENT PAGE FOR:
URI Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation
mxkopy wrote 13 hours 45 min ago:
Skimming it, I get this incredible sci-fi feeling of AI being the thing
that solves P vs. NP (the diagrams are reminiscent of
boolean/arithmetic circuits, which have produced some results in the
computational-complexity space)
pama wrote 15 hours 36 min ago:
At least the authors acknowledge it for what it is: a tiny model on a
tiny corpus and worse than the comparable transformers in terms of
accuracy. I like the experimentation with new designs, and one doesn't
always need to show near-SOTA results. From a brief inspection,
however, I think it will be hard for the work to become a high-profile
conference acceptance without significant additional work.
jeffjeffbear wrote 14 hours 55 min ago:
I would really like to see more testing with a deeper hierarchy and
alpha and beta nonzero.