A BART model built from scratch in @TensorFlow Keras (@fchollet), in under 100 lines.
BART uses a standard seq2seq/machine translation architecture with a bidirectional encoder (like BERT) and a left-to-right decoder (like GPT).
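The encoder/decoder split above can be sketched in Keras. This is a minimal single-layer illustration, not the full implementation: the hyperparameters, layer counts, and variable names here are illustrative, and positional embeddings and dropout are omitted for brevity.

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

# Illustrative sizes, not the paper's.
VOCAB, D_MODEL, HEADS, FF = 1000, 64, 4, 128

# Encoder: self-attention with no causal mask (bidirectional, like BERT).
enc_in = keras.Input(shape=(None,), dtype="int32")
x = layers.Embedding(VOCAB, D_MODEL)(enc_in)
attn = layers.MultiHeadAttention(HEADS, D_MODEL // HEADS)(x, x)
x = layers.LayerNormalization()(x + attn)
ff = layers.Dense(D_MODEL)(layers.Dense(FF, activation="gelu")(x))
enc_out = layers.LayerNormalization()(x + ff)

# Decoder: causal self-attention (left-to-right, like GPT),
# then cross-attention over the encoder output.
dec_in = keras.Input(shape=(None,), dtype="int32")
y = layers.Embedding(VOCAB, D_MODEL)(dec_in)
self_attn = layers.MultiHeadAttention(HEADS, D_MODEL // HEADS)(
    y, y, use_causal_mask=True)
y = layers.LayerNormalization()(y + self_attn)
cross = layers.MultiHeadAttention(HEADS, D_MODEL // HEADS)(y, enc_out)
y = layers.LayerNormalization()(y + cross)
logits = layers.Dense(VOCAB)(y)  # per-position vocabulary logits

bart = keras.Model([enc_in, dec_in], logits)
```

Stacking several such encoder and decoder blocks gives the full model; the only structural difference from the single block is repetition.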
Pretraining corrupts the input text in two ways: randomly shuffling the order of the original sentences, and a novel text-infilling scheme in which spans of text are replaced with a single mask token.
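Those two corruptions can be sketched as a small noising function. This is a simplified, hypothetical helper: BART actually samples span lengths from a Poisson(3) distribution and may mask multiple spans, whereas here a single fixed-length span is masked for clarity.

```python
import random

MASK = "<mask>"

def noise(sentences, rng, span_len=3):
    """Sentence permutation + text infilling (simplified sketch).

    Shuffles the order of the sentences, then replaces one token
    span with the single MASK token. The real scheme samples span
    lengths from Poisson(3); here span_len is fixed.
    """
    sents = sentences[:]
    rng.shuffle(sents)                       # sentence permutation
    tokens = [t for s in sents for t in s.split()]
    start = rng.randrange(max(1, len(tokens) - span_len))
    # Text infilling: the whole span collapses to one mask token,
    # so the model must also predict how many tokens are missing.
    return tokens[:start] + [MASK] + tokens[start + span_len:]

rng = random.Random(0)
noised = noise(["the cat sat", "on the mat", "it purred"], rng)
```

The model is then trained to reconstruct the original uncorrupted text from this noised input, using the seq2seq architecture described above.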
NLP benchmark tasks: BART performs comparably to RoBERTa and XLNet on discriminative benchmarks such as GLUE and SQuAD.