Why flash attention helps the inference of Large Language Models.
Notes on the criticality stage of Recurrent Neural Networks.
Notes on grid cell coding, in preparation for my further project on topology transfer coding in the hippocampus or entorhinal cortex (EC).
A compact reflection on how biological circuit ideas can guide robust AI model design.