juiceb0xc0de/bella-bartender-v2-8b
I would have guessed that reintroducing tensors originating from a non-abliterated variant would have a negative impact on refusals. However, it makes sense that replacing a targeted, well-managed set of repaired vectors works: the refusal mechanisms don't regain persistence when repaired vectors are reintroduced, because the refusal weights lack the support they need to activate. Without similar neighbouring weights, a refusal weight is just an alarm without a power source to complete its circuit.
When you are repairing a model from excessive abliteration damage, which vector-replacement methods do you use? SLERP at a low ratio would make sense, but could TIES be effective as well? Have you found a replacement ratio at which the refusal circuit regains its dominance and the baseline refusals revert?
This idea definitely has my mind racing with new possibilities for my research projects; I plan on trying post-abliteration repair out this evening. It also has me thinking about hybridizing the training pipeline: run SFT with fewer epochs than I would apply to a fully trained model, follow with abliteration and vector correction, and finish the model with one more epoch of SFT. You could also swap out the optimizer and see whether AdamW vs. Muon has any effect on refusals.
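The low-ratio SLERP reintroduction discussed above can be sketched roughly like this. This is a minimal sketch assuming plain NumPy weight tensors; the layer name, the random weights, and the 0.1 ratio are illustrative placeholders, not anyone's confirmed repair recipe:

```python
import numpy as np

def slerp(a: np.ndarray, b: np.ndarray, t: float) -> np.ndarray:
    """Spherical linear interpolation between two weight tensors.

    t=0 returns a (the abliterated weights), t=1 returns b (the
    non-abliterated originals). Falls back to linear interpolation
    when the flattened vectors are nearly parallel.
    """
    a_flat, b_flat = a.ravel(), b.ravel()
    a_unit = a_flat / np.linalg.norm(a_flat)
    b_unit = b_flat / np.linalg.norm(b_flat)
    dot = np.clip(np.dot(a_unit, b_unit), -1.0, 1.0)
    theta = np.arccos(dot)
    if theta < 1e-6:  # near-parallel: SLERP is numerically unstable here
        return ((1 - t) * a_flat + t * b_flat).reshape(a.shape)
    s = np.sin(theta)
    out = (np.sin((1 - t) * theta) / s) * a_flat + (np.sin(t * theta) / s) * b_flat
    return out.reshape(a.shape)

# Reintroduce a small fraction of the original weights into the targeted
# layers only (layer selection and ratio are hypothetical examples).
ratio = 0.1
abliterated = {"layer.12.mlp.down_proj": np.random.randn(4, 4)}
original = {"layer.12.mlp.down_proj": np.random.randn(4, 4)}
repaired = {
    name: slerp(abliterated[name], original[name], ratio)
    for name in abliterated
}
```

At `ratio = 0.1` the repaired tensor stays close to the abliterated one, which matches the intuition above: the refusal direction is nudged back in without enough neighbouring support to re-complete its circuit.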
We just launched Paper Banana, a tool that lets you generate clean academic illustrations simply by describing them in natural language.
Try it here: https://trybibby.com/paper-banana
Whether you need diagrams for papers, presentations, or teaching materials, Paper Banana helps you turn ideas into visuals in seconds.
We'd love your feedback:
What did you like?
What features should we add next?
Give it a spin and let us know what you think!
Dear Hugging Face, show this post to all my fellow researchers!
Hugging Face Papers for AI Agents
Call me crazy, but I always thought about how much more efficient it made me in token usage and how much of my work I was actively retaining between sessions. Was it annoying? Absolutely. Did the benefits outweigh the tokens lost? After a while, maybe?
I noticed you train role-play models. Have you noticed a drop in quality when you train on a model that has already been abliterated, versus training on a base that hasn't undergone abliteration and then running it through the process with heretic once SFT is finished? I've theorized and tested quite a bit on the best possible training outcomes depending on when the abliteration happens, and on specific base models for adapting to a voice style.
Post inspired by @ZennyKenny
You can take a weekend with a Raspberry Pi 5, a Pi Camera, a 3D printer, and a smidgen of custom fine-tuning (a wake-word model, Whisper, TinyBERT, and Piper TTS), and you have a physical device that works as a talking personal assistant.
What a time to be alive.
Edge AI, physical AI, AI-augmented animatronics… tiny models. Tiny agents.
Going to be a wild year.
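The glue logic for a build like that can be sketched as a chain of swappable callables. This is a hedged sketch: the `AssistantPipeline` name and the stub lambdas are illustrative placeholders, and real wake-word, Whisper, TinyBERT, and Piper TTS integrations would replace them with their own APIs:

```python
from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class AssistantPipeline:
    """Each stage is a plain callable so real components can be dropped in."""
    wake: Callable[[bytes], bool]       # wake-word detector over raw audio
    transcribe: Callable[[bytes], str]  # speech-to-text (e.g. Whisper)
    classify: Callable[[str], str]      # intent model (e.g. TinyBERT)
    respond: Callable[[str], str]       # intent -> reply text
    speak: Callable[[str], bytes]       # text-to-speech (e.g. Piper TTS)

    def handle(self, audio: bytes) -> Optional[bytes]:
        if not self.wake(audio):
            return None  # stay idle until the wake word fires
        text = self.transcribe(audio)
        intent = self.classify(text)
        return self.speak(self.respond(intent))

# Wiring it up with trivial stand-ins (not real model calls):
pipeline = AssistantPipeline(
    wake=lambda audio: audio.startswith(b"WAKE"),
    transcribe=lambda audio: "what time is it",
    classify=lambda text: "ask_time",
    respond=lambda intent: "It is noon." if intent == "ask_time" else "Sorry?",
    speak=lambda text: text.encode(),
)
```

Keeping each stage behind a plain callable means you can fine-tune or swap any one model (say, the intent classifier) without touching the rest of the loop.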
We're looking for passionate engineers who love to build and tinker and want to have an impact on the world. Specifically, we're hiring:
• ML engineers (Australia).
• Data engineers (Australia).
• Full-stack engineers (Australia).
• DevRel engineers (Australia, San Francisco, and London).
• DevOps engineers (Australia, San Francisco, and London).
If you'd like to be a founding employee at one of the few VC-backed LLM research labs in the world, receive generous equity compensation, and work alongside other highly motivated, highly skilled engineers, get in touch: https://isaacus.com/careers