Toggle light / dark theme

Paper page — LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

Posted in robotics/AI

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters.

https://huggingface.co/papers/2405.

A good initialization of deep learning models is essential since it can help them converge better and faster.


Join the discussion on this paper page.