acwkim
/

ppo-harmless

Reinforcement Learning

Model card Files Files and versions

135 MB

Ctrl+K

Ctrl+K

1 contributor

History: 6 commits

acwkim's picture

Update adapter_config.json

0e1ad83 verified 3 months ago

.gitattributes

1.52 kB
initial commit 3 months ago
README.md

1.28 kB
Upload folder using huggingface_hub 3 months ago
adapter_config.json

659 Bytes
Update adapter_config.json 3 months ago
adapter_model.safetensors

134 MB
xet

Upload folder using huggingface_hub 3 months ago
added_tokens.json

21 Bytes
Upload folder using huggingface_hub 3 months ago
config.json

1.3 kB
Upload folder using huggingface_hub 3 months ago
pytorch_model.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage"
What is a pickle import?
17.9 kB
xet

Upload folder using huggingface_hub 3 months ago
special_tokens_map.json

552 Bytes
Upload folder using huggingface_hub 3 months ago
tokenizer.model

500 kB
xet

Upload folder using huggingface_hub 3 months ago
tokenizer_config.json

1.16 kB
Upload folder using huggingface_hub 3 months ago