Pretraining BART language model #18030

### Feature request

Hi,

I'm looking into the BART [docs](https://huggingface.co/docs/transformers/model_doc/bart). It seems the provided examples cover fine-tuning BART on seq2seq summarization tasks. Is there an example of pretraining BART's language model itself, using the pretraining objectives mentioned in the original paper (text infilling, token masking, etc.)? I looked into this a couple of months ago and found this thread: https://github.com/huggingface/transformers/issues/4151, along with a related fairseq issue: https://github.com/facebookresearch/fairseq/issues/1899. I'm asking directly here to see whether there has been any update since.

Thanks, @patrickvonplaten @patil-suraj.


### Motivation

This would make it possible to continue pretraining BART's language model.
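
To be concrete about the objectives I mean, here is a rough, pure-Python sketch of token masking and text infilling as I understand them from the paper: token masking replaces individual tokens with a mask, while text infilling replaces whole spans (lengths drawn from a Poisson distribution, λ = 3) with a single mask each. The function names, the `start_prob` heuristic, and the default ratios are my own assumptions for illustration; this is not code from fairseq or transformers.

```python
import math
import random

MASK = "<mask>"  # placeholder string; a real setup would use tokenizer.mask_token


def token_masking(tokens, mask_prob=0.15, rng=None):
    """Replace each token independently with MASK with probability mask_prob."""
    rng = rng or random.Random()
    return [MASK if rng.random() < mask_prob else tok for tok in tokens]


def sample_poisson(lam, rng):
    """Draw one sample from Poisson(lam) using Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def text_infilling(tokens, mask_ratio=0.3, poisson_lambda=3.0, rng=None):
    """Replace spans of tokens with a single MASK each.

    Span lengths are drawn from Poisson(poisson_lambda). A zero-length span
    inserts a MASK without removing any token. At most about
    mask_ratio * len(tokens) tokens are removed in total.
    """
    rng = rng or random.Random()
    budget = int(round(len(tokens) * mask_ratio))  # tokens we may still remove
    start_prob = mask_ratio / poisson_lambda       # assumed heuristic, not from the paper
    out, i = [], 0
    while i < len(tokens):
        if budget > 0 and rng.random() < start_prob:
            span = min(sample_poisson(poisson_lambda, rng), budget, len(tokens) - i)
            out.append(MASK)       # the whole span collapses into one mask
            budget -= span
            i += span
            if span == 0:          # pure insertion: keep the current token
                out.append(tokens[i])
                i += 1
        else:
            out.append(tokens[i])
            i += 1
    return out


if __name__ == "__main__":
    words = "the quick brown fox jumps over the lazy dog".split()
    print(token_masking(words, rng=random.Random(0)))
    print(text_infilling(words, rng=random.Random(0)))
```

In both cases the training target would be the original, uncorrupted sequence, so the decoder learns to reconstruct it from the noised encoder input. What I'm missing is an example that wires this kind of noising into a data collator and training script for BART.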


