I actively follow the state-of-the-art pre-trained models on paperswithcode.com and NLP-progress.
The state of the art (often outperforming BERT by far) is XLNet, and sadly it dates from 2019.
2020 has been stagnating (except for the special case of generative tasks with GPT-3).
I have observed that no researchers have tried to improve on top of XLNet, while BERT has had roughly 20 alternative implementations that improve upon it.
Researchers are often unaware of the current state of the art, which induces a lag in research progress.