Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Soughing
/
MLRA
like
1
Text Generation
arxiv:
2603.02188
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
main
MLRA
/
ablation_initialization
/
gqa_normal_initialization
11.5 GB
2 contributors
History:
1 commit
Soughing
Upload folder using huggingface_hub
b91c018
verified
21 days ago
config.json
Safe
325 Bytes
Upload folder using huggingface_hub
21 days ago
pytorch_model-00001-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
4.91 GB
xet
Upload folder using huggingface_hub
21 days ago
pytorch_model-00002-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
4.98 GB
xet
Upload folder using huggingface_hub
21 days ago
pytorch_model-00003-of-00003.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.6 GB
xet
Upload folder using huggingface_hub
21 days ago
pytorch_model.bin.index.json
Safe
17.3 kB
Upload folder using huggingface_hub
21 days ago