
Microsoft’s ZeRO-2 Speeds up AI Training 10x

July 7, 2020

Via: InfoQ

Microsoft open-sourced Zero Redundancy Optimizer version 2 (ZeRO-2), a distributed deep-learning optimization algorithm that scales super-linearly with cluster size. Using ZeRO-2, Microsoft trained a 100-billion-parameter natural-language processing (NLP) model 10x faster than with previous distributed learning techniques.

Writing in a blog post, program manager Rangan Majumder and distinguished engineer Junhua Wang described the algorithm and their experiments. ZeRO-2 is part of Microsoft’s open-source DeepSpeed library for deep-learning training optimization. ZeRO-2 optimizes memory consumption during training, allowing for distributed training of models as large as 170 billion parameters.
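For readers unfamiliar with DeepSpeed, the sketch below shows roughly how ZeRO stage 2 is enabled through a DeepSpeed configuration dictionary. The hyperparameter values and the exact `deepspeed.initialize` keyword arguments are illustrative assumptions (they vary across DeepSpeed versions), not details taken from Microsoft's post.

```python
# Minimal sketch: enabling ZeRO stage 2 in DeepSpeed (illustrative values).
import torch
import deepspeed

ds_config = {
    "train_micro_batch_size_per_gpu": 4,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 2,             # partition optimizer states and gradients across GPUs
        "overlap_comm": True,   # overlap gradient communication with backward compute
        "reduce_scatter": True, # average gradients with reduce-scatter instead of all-reduce
    },
}

# Stand-in model; a real run would use a large NLP model instead.
model = torch.nn.Linear(1024, 1024)

# deepspeed.initialize wraps the model in an engine that manages the
# partitioned optimizer state, gradient reduction, and fp16 loss scaling.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```

Training then proceeds through the returned engine (`model_engine.backward(loss)` and `model_engine.step()`), which is what lets ZeRO-2 keep per-GPU memory low while the model itself stays unchanged.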

Read More on InfoQ