HPLT/hplt-3.0-ukr_Cyrl-llama-2b-100bt
Web as a corpus, Large Language Models, Machine Translation, Language Technologies, Natural Language Processing, Internet Archive, CommonCrawl
OpenLID-v3: Improving the Precision of Closely Related Language Identification -- An Experience Report
DHPLT: large-scale multilingual diachronic corpora and word representations for semantic change modelling