_______               __                   _______
       |   |   |.---.-..----.|  |--..-----..----. |    |  |.-----..--.--.--..-----.
       |       ||  _  ||  __||    < |  -__||   _| |       ||  -__||  |  |  ||__ --|
       |___|___||___._||____||__|__||_____||__|   |__|____||_____||________||_____|
                                                             on Gopher (inofficial)
   URI Visit Hacker News on the Web
       
       
       COMMENT PAGE FOR:
   URI   Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation
       
       
        mxkopy wrote 13 hours 45 min ago:
        Skimming it I get this incredible sci-fi feeling of AI being the thing
        that solves P vs. NP (the diagrams are reminiscent of
        boolean/arithmetic circuits which have produced some results in the
        compcomp space)
       
        pama wrote 15 hours 36 min ago:
        At least the authors acknowledge it for what it is: a tiny model on a
        tiny corpus and worse than the comparable transformers in terms of
        accuracy.  I like the experimentation with new designs and one doesnt
        always need to show near SOTA results. From a brief inspection,
        however, I think it will be hard for the work to become a high profile
        conference acceptance without significan additional work.
       
          jeffjeffbear wrote 14 hours 55 min ago:
          I would really like to see more testing with a deeper hierarchy and
          alpha and beta nonzero.
       
       
   DIR <- back to front page