Research teams from Google and Microsoft have recently developed natural language processing (NLP) AI models which have scored higher than the human baseline score on the SuperGLUE benchmark. SuperGLUE measures a model’s score on several natural language understanding (NLU) tasks, including question answering and reading comprehension.
Both teams submitted their models to the SuperGLUE Leaderboard on January 5. Microsoft Research’s model Decoding-enhanced BERT with disentangled attention (DeBERTa) scored a 90.3 on the benchmark, slightly beating Google Brain’s model, based on the Text-to-Text Transfer Transformer (T5) and the Meena chatbot, which scored 90.2.