A large-scale text collection for the Distributed Ledger Technology (DLT) domain, containing 2.98 billion tokens across 22.12 million documents.