During this period, it is necessary to continue to indonesia number details learn, trial and error, correct, adapt... toA new architecture that surpasses rfrr and b has just been born. The method proposed by researchers from Stanford D and other institutions directly replaces the attention mechanism, and the language model method may be completely changed from now on. Wake up and a new architecture that surpasses rfrr and b is born? Researchers from Stanford, D, Berkeley and proposed a new architecture that replaces the hidden state of R with a machine learning model. The paper compresses the context, and this method is called "test time training layer (-i-riig lyr,)".

The layer directly replaces the attention mechanism, unlocks the linear complexity architecture with expressive memory, and enables us to train LL containing millions (possibly billions in the future) of k in context. The author believes that this project, which has been studied for more than a year, will fundamentally change our language model method. The ability model and learning improvement of B-side product managers The first major challenge facing B-side product managers is how to correctly analyze and diagnose business problems. This is also the most difficult part. Product design knowledge is basically of no help in this part of the work.