It even grasps common reasoning.
Nvidia and Microsoft revealed their largest and most powerful monolithic transformer language model trained to date: Megatron-Turing Natural Language Generation (MT-NLG), complete with a staggering 530 billion parameters built together, according to a press release.
MT-NLG outperforms prior transformer-based systems by both companies. MT-NLG is substantially larger and more complex than Microsoft’s Turing-NLG model and Nvidia’s Megatron-LM, with three times as many parameters spread across 105 layers.
Comments are closed.