If the "proper paper" you need refers to the of a downloadable text file found on GitHub or similar repositories, it is typically used for:
: Developers often download .txt files containing ~215,000 French words (like those found in french.txt repositories) to build " Le Pendu " (Hangman) games or search algorithms. Download 215K French txt
: For formal linguistic tagging, the Universal Dependencies project provides treebanks; while counts vary, their releases are the standard for "proper" citation in French NLP papers. If the "proper paper" you need refers to