Researchers from Microsoft’s Natural Language Computing (NLC) group announced the latest version of Bidirectional Encoder representation from Image Transformers: BEiT-3, a 1.9B-parameter vision-language AI model. BEiT-3 models images as another language and achieves state-of-the-art performance on a wide range of vision and vision-language downstream tasks.
The model and experiments were described in a paper published on arXiv. The key idea in BEiT-3 is to model images as another language (which the authors call “Imglish”); this allows the model to be pretrained using only the masked language modeling (MLM) objective, making the training process easier to scale up.
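To make the unified-objective idea concrete, here is a minimal sketch (not the authors' implementation) of how a single masked-token objective can treat text tokens and discretized image-patch tokens identically; the token ids, `MASK_ID`, and the `mask_tokens` helper are all hypothetical illustrations:

```python
import random

MASK_ID = 0  # hypothetical id reserved for the [MASK] token

def mask_tokens(tokens, mask_prob=0.15, rng=None):
    """Randomly replace tokens with MASK_ID.

    Returns the corrupted sequence and a dict mapping each masked
    position to its original token (the model's prediction targets).
    """
    rng = rng or random.Random(42)
    corrupted, targets = list(tokens), {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            corrupted[i] = MASK_ID
            targets[i] = tok
    return corrupted, targets

# Text tokens and image tokens ("Imglish") go through the same routine:
text_tokens = [101, 7592, 2088, 102]     # hypothetical word-piece ids
image_tokens = [8193, 3021, 7745, 1650]  # hypothetical visual-token ids

for seq in (text_tokens, image_tokens):
    corrupted, targets = mask_tokens(seq, mask_prob=0.5)
    # a single model is then trained to recover `targets` from `corrupted`
```

Because both modalities reduce to sequences of discrete tokens, one masking-and-reconstruction loss suffices, which is what lets the pretraining recipe scale without juggling modality-specific objectives.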