Toggle light / dark theme

Tutel is a high-performance MoE library developed by Microsoft researchers to aid in the development of large-scale DNN (Deep Neural Network) models; Tutel is highly optimized for the new Azure NDm A100 v4 series, and Tutel’s diverse and flexible MoE algorithmic support allows developers across AI domains to execute MoE more easily and efficiently. Tutel achieves an 8.49x speedup on an NDm A100 v4 node with 8 GPUs and a 2.75x speedup on 64 NDm A100 v4 nodes with 512 A100 GPUs compared to state-of-the-art MoE implementations like Meta’s Facebook AI Research Sequence-to-Sequence Toolkit (fairseq) in PyTorch for a single MoE layer.

Tutel delivers a more than 40% speedup for Meta’s 1.1 trillion–parameter MoE language model with 64 NDm A100 v4 nodes for end-to-end performance, thanks to optimization for all-to-all communication. When working on the Azure NDm A100 v4 cluster, Tutel delivers exceptional compatibility and comprehensive capabilities to assure outstanding performance. Tutel is free and open-source software that has been integrated into fairseq.

Tutel is a high-level MoE solution that complements existing high-level MoE solutions like fairseq and FastMoE by focusing on the optimizations of MoE-specific computation and all-to-all communication and other diverse and flexible algorithmic MoE supports. Tutel features a straightforward user interface that makes it simple to combine with other MoE systems. Developers can also use the Tutel interface to include independent MoE layers into their own DNN models from the ground up, taking advantage of the highly optimized state-of-the-art MoE features right away.

I have a small YouTube channel which I create videos on clean energy and the environment. I have under 600 subs and many videos have not even hit 100 views but I am being increasingly targeted by fossil fuel activists and supporters, with personal attacks and misinformation.
I do respond to misinformation, and remove the worst comments but if anyone would like to help support me, nipping over to my channel, watching some videos and subscribing to the channel would be most appreciated.
We can show them that they are the minority, not us, and the wider the information spreads the quicker the change will be and the better life will be for everyone.
Thanks in advance and have an awesome day.


It is very likely that treatments to address the issues that cause aging & its related conditions & diseases will be within our reach in 15 to 20 years.

It is highly likely that a general realisation that these treatments are not only scientifically possible but within our reach will start to become increasingly apparent to the wider population in as little as maybe 5 years.

“The Singularity” is a term coined by John von Neumann, a major figure in the history of computer science. The concept refers to a hypothetical time when computers become more intelligent than humans and can improve themselves without our input. Imagine a run-away reaction where artificial intelligence is able to improve itself. This improved self is able to further improve itself. With each improvement the rate at which…

NVIDIA CMP 170HX cryptomining card, Source: Linus Tech Tips.

Due to the very limited availability and the high price of this card, there are not actually that many pictures of CMP 170HX on the Internet. Fortunately, Linus was brave enough to take a look under the card’s hood. As it turns out, the GPU has a very large heat spreader completetly covering the whole interposer area.