Corpus full texts

jraimbault · 2018-10-06 10:18:07 UTC

A gold mine : https://www.reddit.com/r/DataHoarder/comments/9ifnda/600_gb_corpus_of_all_paywalled_scholarly_sources/
How about integrating that into gargantext, for example in collaboration with multivac which can handle such a massive dataset ? (and for example provide a new search option into full texts of this corpus, do text mining on full texts, etc.)